Information Retrieval - Scifaro

SABER-Math: Automated Benchmark for Information Retrieval Evaluation in Mathematics

As agentic AI systems tackle more complex mathematical tasks, they increasingly rely on information retrieval (IR) to search problem databases, theorem libraries, and educational resources. However, choosing the right retriever remains…

Information Retrieval · Computer Science 2026-06-29 Nikolay Georgiev, Maria Drencheva, Kseniia Ibragimova, Ivo Petrov, Dimitar I. Dimitrov, Martin Vechev

Do Recommendation Algorithms Work When Users Are LLM Agents? A Case Study on Moltbook

Large language model (LLM) agents are increasingly populating web platforms, raising a fundamental question for recommender systems: do algorithms designed for human users still work when users are LLM agents that may not have well-defined…

Information Retrieval · Computer Science 2026-06-29 Daming Li, Simeng Han, Jialu Zhang

Diagnosing and Mitigating Context Rot in Long-horizon Search

Extensive context has become the norm as Large Language Models (LLMs) are increasingly deployed in long-horizon tasks. The concern that increasing context length degrades model capabilities, known as context rot, has become a central issue…

Information Retrieval · Computer Science 2026-06-29 Shijie Xia, Yikun Wang, Zhen Huang, Pengfei Liu

ARMOR: Adaptive Retriever Optimization for Low-Resource Telecom Question Answering

Telecom question answering (QA) is a challenging setting for retrieval-augmented generation (RAG): evidence is fragmented across standards, papers, encyclopedic resources, and web documents, and answers often hinge on technical tables,…

Information Retrieval · Computer Science 2026-06-29 Heshan Fernando, Quan Xiao, Yan Xin, Tianyi Chen

As We May Search

The sensitive information in personal documents, legal files, and medical records is among the most valuable things to search, yet current retrieval-augmented generation systems still require sending content to remote servers. We propose…

Information Retrieval · Computer Science 2026-06-28 Saber Zerhoudi, Adam Roegiest, Jelena Mitrovic, Michael Granitzer

Metadata, Structure, or Strategy? A Decomposition of RAG Context Enrichment

Retrieval-augmented generation (RAG) systems increasingly enrich retrieved passages by attaching quality metadata, structuring them into explicit records, and adopting multi-hop retrieval strategies that accumulate evidence across steps.…

Information Retrieval · Computer Science 2026-06-28 Saber Zerhoudi, Michael Granitzer, Jelena Mitrovic

Monosemanticity in Recommender Systems

Latent factor models such as matrix factorization are widely used in recommender systems, yet the learned embedding dimensions typically lack explicit semantic interpretation. This opacity limits transparency, explainability, and principled…

Information Retrieval · Computer Science 2026-06-28 Yagel Alfasi, Eden Rzezak, Eadan Schechter

Covering the Unseen: Information Demand Coverage Optimization for Retrieval-Augmented Generation

Retrieval-augmented generation (RAG) typically treats context selection as ranking chunks against a single query embedding. This assumption breaks down for complex queries, such as multi-hop or ambiguous questions, where top-k selection…

Information Retrieval · Computer Science 2026-06-28 Bingxue Zhang, Jianying Jia, Feida Zhu

Fairness Attacks on Recommender Systems

The unfairness of recommender systems has become a topic of concern due to its significant social and ethical implications. Although existing works have shown the effectiveness of attacks on the performance of recommender systems (e.g.,…

Information Retrieval · Computer Science 2026-06-27 Yanan Wang, Yong Ge

Human-in-the-Loop Nugget Annotation for Accountable LLM-as-a-Judge Evaluations

Evaluating AI/Agentic system outputs reliably requires human judgment, but how one incorporates the human determines whether one gets a real quality signal or expensive theater. The common approaches either accidentally anchor human experts…

Information Retrieval · Computer Science 2026-06-27 Laura Dietz

Multimodal Graph RAG for Long-range Visually Rich Document Understanding

Multimodal large language models (MLLMs) are widely applied to visual document understanding. However, comprehending long documents remains difficult because of the limited context window. Though recent multimodal retrieval-augmented generation…

Information Retrieval · Computer Science 2026-06-27 Yi-Cheng Wang, Chu-Song Chen

Reproducing FACTER: Fairness via Conformal Thresholding and Prompt Repair

Fayyazi et al. (2025) recently proposed FACTER, a model-agnostic framework designed to jointly enforce fairness and statistical coverage in LLM-based recommendation through conformal thresholding and iterative prompt repair. In this work,…

Information Retrieval · Computer Science 2026-06-26 Oscar Miró López-Feliu, Daimy van Loo, Xanthos Kekkos, Mikel Blom, Clara Rus

R$^2$-Searcher: Calibrating Retrieval and Reasoning Boundaries for Agentic Search

Recent search agents for multi-hop reasoning often fail by either retrieving incomplete evidence or reasoning over irrelevant portions of the retrieved content, leading to a retrieval-reasoning boundary shift. We propose R$^2$-Searcher, a…

Information Retrieval · Computer Science 2026-06-26 Sheng Zhang, Junyi Li, Wenlin Zhang, Xiaowei Qian, Yichao Wang, Yingyi Zhang, Maolin Wang, Yong Liu, Xiangyu Zhao

CMSL: Constructive Multi-Sequence Learning for Recommendation Systems

Sequence learning has emerged as a promising paradigm in recommendation systems, surpassing traditional Deep Learning Recommendation Models (DLRM) by capturing the temporal nuances of user behavior. However, current state-of-the-art…

Information Retrieval · Computer Science 2026-06-26 Zikun Cui, Renzhi Wu, Junjie Yang, Li Sheng, Jijie Wei, Linfeng Liu, Tai Guo, Tao Jia, Xiaodong Wang, Hong Li, Li Yu, Sri Reddy, Hong Yan

SemFlowRAG: Directed Semantic Flow from Abstraction to Evidence for Complex Reasoning

Retrieval-Augmented Generation (RAG) enhanced by Knowledge Graphs has shown promise in complex multi-hop reasoning tasks. However, existing graph-based retrieval methods typically rely on flat, undirected topologies. During the retrieval…

Information Retrieval · Computer Science 2026-06-26 Houyuan Qin, Rong Wu, Qinyuan Qin, Botian Shi, Jingjing Qu, Yang Sun, Pinlong Cai

Fast and Feasible: Permutation-based Constrained Reranking for Revenue Maximization

Search and recommender systems have produced highly relevant search results. A natural next step in the development of such systems in e-commerce is to rerank these results to increase the platform's revenue from paid promotion products.…

Information Retrieval · Computer Science 2026-06-26 Svetlana Shirokovskikh, Anastasiia Soboleva, Ekaterina Solodneva, Aleksandr Katrutsa, Roman Loginov, Egor Samosvat

An LLM-Powered Semantic Alignment Framework for Journal Recommendation

Journal recommendation is an important task in scholarly information systems. Existing approaches typically rely on supervised learning models, manually engineered features, or historical interaction data, which may limit their…

Information Retrieval · Computer Science 2026-06-26 Yanglin Yan, Zicheng Xie, Tianchen Gao, Rui Pan, Hansheng Wang

NOVA: A Verification-Aware Agent Harness for Architecture Evolution in Industrial Recommender Systems

Industrial advertising recommender models are continuously improved through architecture evolution. Upgrades such as RankMixer, TokenMixer-Large, and MixFormer show that better structures remain a key source of quality and business gains.…

Information Retrieval · Computer Science 2026-06-25 Shaohua Liu, Liang Fang, Yilong Sun, Shudong Huang, Qingsong Luo, Xiaoyang Chen, Dongqiang Liu, Chuangang Ma, Zhenzhen Chai, Henghuan Wang, Shijie Quan, Changyuan Cui, Zhangbin Zhu, Peng Chen, Wei Xu, Lei Xiao, Haijie Gu, Jie Jiang

TRUST: Item-Calibrated Interval Evidence for Temporal Session-Based Recommendation

Temporal signals have been widely used in session-based recommendation to infer user interest. Existing temporal session-based recommenders primarily rely on absolute interval values, implicitly assuming that the same interval carries…

Information Retrieval · Computer Science 2026-06-25 Linjiang Guo, Nitin Bisht, Shiqing Wu, Yifan Yin, Guandong Xu

UniFormer: Efficient and Unified Model-Centric Scaling for Industrial Recommendation

Recently, substantial progress has been made in industrial recommendation through component-centric model scaling, where individual components such as behavior modeling, feature interaction, or task modeling are independently scaled to…

Information Retrieval · Computer Science 2026-06-25 Bo Chen, Jinlong Jiao, Tijian Hu, Ruihao Zhang, Yanzhi Liu, Chenghou Jin, Qinglin Jia, Baixuan He, Hechang Pan, Yiwu Liu, Jian Liang, Chaoyi Ma, Ruiming Tang, Han Li, Kun Gai