信息检索 — Scifaro

Rec-Distill: An Industrial Distillation Pipeline for Large-Scale Recommendation Models

Large recommendation models have demonstrated substantial potential gains under scaling laws, yet these gains are difficult to realize in industrial recommendation systems because real-world deployment requires lightweight models with…

信息检索 · 计算机科学 2026-05-29 Haoran Ding , Wenlin Zhao , Yuchen Jiang , Juren Li , Jie Zhu , Xinchun Li , Yishujie Zhao , Yi Zhang , Ao Qiao , Jianhui Dong , Cheng Chen , Ziyan Gong , Deping Xie , Peng Xu , Zikai Wang , Yuwei Wang , Huizhi Yang , Zhe Chen , Yuchao Zheng

FLASH-MAXSIM: IO-Aware Fused Kernels for Late-Interaction Scoring

Late-interaction retrieval (ColBERT, ColPali) scores a query against a document with the MaxSim operator: for every query token, the maximum similarity over the document tokens, summed over query tokens. The standard implementation…

信息检索 · 计算机科学 2026-05-29 Roi Pony , Adi Raz Goldfarb , Idan Friedman , Daniel Ezer , Udi Barzelay

Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies

We propose Latent Terms, a method revealing that models trained for dense retrieval, whether single- or multi-vector, learn representations that can trivially be decomposed into retrieval-ready sparse features. When trained on frozen…

信息检索 · 计算机科学 2026-05-29 Benjamin Clavié , Sean Lee , Aamir Shakir , Makoto P. Kato

ACE: Anisotropy-Controllable Embedding for LLM-enhanced Sequential Recommendation

Recent advances in the LLM-as-Extractor paradigm leverage large language models (LLMs) to transfer semantically rich item embeddings into sequential recommendation (SR) backbones. However, LLM-generated embeddings often suffer from strong…

信息检索 · 计算机科学 2026-05-29 Dongcheol Lee , Hye-young Kim , Jongwuk Lee

UniNote: A Unified Embedding Model for Multimodal Representation and Ranking

Item-to-Item (I2I) retrieval is a fundamental part of modern content platforms, supporting critical industrial workflows from recommendation engines to content auditing. While multimodal embedding methods have advanced general retrieval,…

信息检索 · 计算机科学 2026-05-29 Jinghan Zhao , Wenwei Jin , Anqi Li , Jintao Tong , Luya Mo , Jiawei Li , Bin Li , Yao Hu

CrossAlpha: An Annual-Report Benchmark for Cross-Market Factor Research

Cross-market factor research studies whether firm-level signals from one or more markets can predict returns in a target market, but existing public benchmarks do not support cross-market disclosure-to-return evaluation. Building such a…

信息检索 · 计算机科学 2026-05-29 Qian Wang , Zhongyi Tong , Nuo Chen , Zhaomin Wu , Bingsheng He

On the Practice of Scaling Search Conversion Rate Prediction

Scaling a Search Conversion Rate (CVR) prediction model, especially in high-traffic environments, presents a challenge: superior model quality needs to be balanced with strict constraints on training cost and serving latency. This paper…

信息检索 · 计算机科学 2026-05-29 James Pak , Jyun-Yu Jiang , Fan Zhang , Sen Wang , Taekmin Kim , Henry Tsai , Vijay Rajaram , Juexin Lin , Mohitdeep Singh , Alessandro Magnani , Johnny Chen , Qian Zhao , Rao Fu , Zhirong Liang , Jordan Gilliland , Winter Jiao

Toward User Preference Alignment in LLM Recommendation via Explicit Context Feedback

Traditional recommender systems (RecSys) primarily infer user preferences from implicit signals (such as clicks, watches, and purchases), often neglecting the rich explicit contextual feedback users provide through verbal text, like…

信息检索 · 计算机科学 2026-05-29 Weizhi Zhang , Wooseong Yang , Yuxin Cui , Zhaohui Guo , Hins Hu , Liangwei Yang , Henry Peng Zou , Qifei Wang , Hanqing Zeng , Jiayi Liu , Yinglong Xia , Philip S. Yu

Generative Spatiotemporal Intent Sequence Recommendation via Implicit Reasoning in Amap

Real-world user behavior rarely consists of isolated actions; instead, it often forms intent flows governed by spatiotemporal dependencies. To provide integrated service recommendations, we focus on the task of Generative Spatiotemporal…

信息检索 · 计算机科学 2026-05-29 Sicong Wang , Ruiting Dong , Yue Liu , Bowen Zheng , Jun Meng , Jie Li , Shuaijun Guo , Yu Gu , Fanyi Di , Xin Li

Echoes in Filter Bubble: Diagnosing and Curing Popularity Bias in Generative Recommenders

Recently, Generative Recommenders (GRs), characterized by a unified end-to-end framework, have exhibited astonishing potential in transforming the recommendation paradigm. Despite their effectiveness, we recognize that GRs are still…

信息检索 · 计算机科学 2026-05-29 Jun Yin , Bangguo Zhu , Peng Huo , Ruochen Liu , Hao Chen , Senzhang Wang , Shirui Pan , Chengqi Zhang

DiffRetriever: Parallel Representative Tokens for Retrieval with Diffusion Language Models

This paper shows how diffusion language models (DLMs) can be used as effective and efficient retrievers. Existing DLM-based retrievers (e.g., DiffEmbed) follow BERT-style encoding, representing each query or passage as a single mean-pooled…

信息检索 · 计算机科学 2026-05-29 Shuai Wang , Yu Yin , Shengyao Zhuang , Bevan Koopman , Guido Zuccon

Dynamic Ranked List Truncation for Reranking Pipelines via LLM-generated Reference-Documents

Large Language Models (LLM) have been widely used in reranking. Computational overhead and large context lengths remain a challenging issue for LLM rerankers. Efficient reranking usually involves selecting a subset of the ranked list from…

信息检索 · 计算机科学 2026-05-29 Nilanjan Sinhababu , Soumedhik Bharati , Debasis Ganguly , Pabitra Mitra

Bridge-RAG: An Abstract Bridge Tree Based Retrieval Augmented Generation Algorithm

As an important paradigm for enhancing the generation quality of Large Language Models (LLMs), retrieval-augmented generation (RAG) faces the two challenges regarding retrieval accuracy and computational efficiency. This paper presents a…

信息检索 · 计算机科学 2026-05-29 Zihang Li , Wenjun Liu , Yikun Zong , Jiawen Tao , Siying Dai , Songcheng Ren , Zirui Liu , Yuhang Wang , Yanbing Jiang , Tong Yang

APAO: Adaptive Prefix-Aware Optimization for Generative Recommendation

Generative recommendation has recently emerged as a promising paradigm for sequential recommendation. It formulates the task as an autoregressive generation process, predicting tokens of the next item conditioned on user interaction…

信息检索 · 计算机科学 2026-05-29 Yuanqing Yu , Yifan Wang , Weizhi Ma , Zhiqiang Guo , Min Zhang

The Best of the Two Worlds: Harmonizing Semantic and Hash IDs for Sequential Recommendation

Conventional Sequential Recommender Systems (SRS) typically assign unique hash IDs (HID) to construct item embeddings, which mainly capture collaborative signals from historical user-item interactions. However, such embeddings are…

信息检索 · 计算机科学 2026-05-29 Ziwei Liu , Yejing Wang , Wanyu Wang , Wang Zejian , Qidong Liu , Zijian Zhang , Chong Chen , Wei Huang , Xiangyu Zhao

VOGUE: A Multimodal Dataset for Conversational Recommendation in Fashion

Multimodal conversational recommendation has recently emerged as a promising paradigm for delivering personalized experiences through natural dialogue enriched by visual and contextual grounding. Yet currently available multimodal…

信息检索 · 计算机科学 2026-05-29 David Guo , Minqi Sun , Yilun Jiang , Jiazhou Liang , Scott Sanner

FORGE: Forming Semantic Identifiers for Generative Retrieval in Industrial Datasets

Semantic identifiers (SIDs) have gained increasing attention in generative retrieval (GR) for recommendation due to their meaningful semantic discriminability. However, current studies in this field primarily (1) offer limited investigation…

信息检索 · 计算机科学 2026-05-29 Kairui Fu , Tao Zhang , Shuwen Xiao , Ziyang Wang , Xinming Zhang , Chenchi Zhang , Yuliang Yan , Junjun Zheng , Xiangheng Kong , Shengyu Zhang , Kun Kuang , Yuning Jiang

Page image classification for content-specific data processing

Digitization projects in humanities often generate vast quantities of page images from historical documents, presenting significant challenges for manual sorting and analysis. These archives contain diverse content, including various text…

信息检索 · 计算机科学 2026-05-29 Kateryna Lutsai

Learning User-Aware Recall: Personalized Retrieval in Long-Term Conversational Memory

Long-term conversational agents are expected to remember past interactions, but memory is useful only when the right evidence is recalled for the right user. Existing memory-augmented LLM agents have made progress in building compact memory…

信息检索 · 计算机科学 2026-05-28 ZhiShu Jiang , Haibo Liu , Xin Shen , Guanqiang QI , Chenxi Miao , Weikang Li , Liwei Qian , Xin Pei , Jizhou Huang

Do Agents Need Semantic Metadata? A Comparative Study in Agentic Data Retrieval

In the era of autonomous agents, machine-actionable data is critical for data-driven workflows. For more than a decade, semantic metadata like schema.org has anchored the FAIR principles (Findable, Accessible, Interoperable, and Reusable)…

信息检索 · 计算机科学 2026-05-28 Shiyu Chen , Tarfah Alrashed , Alon Halevy , Natasha Noy