信息检索 — Scifaro

Fast and Feasible: Permutation-based Constrained Reranking for Revenue Maximization

Search and recommender systems have produced highly relevant search results. A natural next step in the development of such systems in e-commerce is to rerank these results to increase the platform's revenue from paid promotion products.…

信息检索 · 计算机科学 2026-06-26 Svetlana Shirokovskikh , Anastasiia Soboleva , Ekaterina Solodneva , Aleksandr Katrutsa , Roman Loginov , Egor Samosvat

An LLM-Powered Semantic Alignment Framework for Journal Recommendation

Journal recommendation is an important task in scholarly information systems. Existing approaches typically rely on supervised learning models, manually engineered features, or historical interaction data, which may limit their…

信息检索 · 计算机科学 2026-06-26 Yanglin Yan , Zicheng Xie , Tianchen Gao , Rui Pan , Hansheng Wang

NOVA: A Verification-Aware Agent Harness for Architecture Evolution in Industrial Recommender Systems

Industrial advertising recommender models are continuously improved through architecture evolution. Upgrades such as RankMixer, TokenMixer-Large, and MixFormer show that better structures remain a key source of quality and business gains.…

信息检索 · 计算机科学 2026-06-25 Shaohua Liu , Liang Fang , Yilong Sun , Shudong Huang , Qingsong Luo , Xiaoyang Chen , Dongqiang Liu , Chuangang Ma , Zhenzhen Chai , Henghuan Wang , Shijie Quan , Changyuan Cui , Zhangbin Zhu , Peng Chen , Wei Xu , Lei Xiao , Haijie Gu , Jie Jiang

TRUST: Item-Calibrated Interval Evidence for Temporal Session-Based Recommendation

Temporal signals have been widely used in session-based recommendation to infer user interest. Existing temporal session-based recommenders primarily rely on absolute interval values, implicitly assuming that the same interval carries…

信息检索 · 计算机科学 2026-06-25 Linjiang Guo , Nitin Bisht , Shiqing Wu , Yifan Yin , Guandong Xu

UniFormer: Efficient and Unified Model-Centric Scaling for Industrial Recommendation

Recently, substantial progress has been made in industrial recommendation through component-centric model scaling, where individual components such as behavior modeling, feature interaction, or task modeling are independently scaled to…

信息检索 · 计算机科学 2026-06-25 Bo Chen , Jinlong Jiao , Tijian Hu , Ruihao Zhang , Yanzhi Liu , Chenghou Jin , Qinglin Jia , Baixuan He , Hechang Pan , Yiwu Liu , Jian Liang , Chaoyi Ma , Ruiming Tang , Han Li , Kun Gai

TriPAH: Imbalance-Aware Tri-Prompt Affinity Hashing for Cross-Modal Medical Retrieval

In the era of big medical data, efficient cross-modal retrieval is pivotal for evidence-based diagnosis and large-scale case management. Cross-modal medical hashing retrieval aims to enable efficient image-text search and support downstream…

信息检索 · 计算机科学 2026-06-25 Jiaming Bian , Songming Li , Yurui Song , Yunfei Chen , Yichao Cao , Jun Long

A Shared IPTC Topic Space for Cross-Source Topic Modelling

Comparing topic attention across different media is hindered by a fundamental modelling problem: topic models fitted separately to each corpus produce corpus-specific topic spaces that cannot be aligned directly. This paper presents a…

信息检索 · 计算机科学 2026-06-25 Din Iskakov , Sebastian Gonçalves , Marco Idiat , Mendeli Vainstein , Aline Villavicencio , Ronaldo Menezes , Rodrigo Wilkens

Attributed, But Not Incremental: Cannibalization-Corrected Attribution for Large-Scale Advertising

In large-scale paid acquisition and growth advertising systems, production attribution outputs are widely used for daily budget allocation and channel diagnosis. However, paid-attributed conversions such as daily new users (DNU) may…

信息检索 · 计算机科学 2026-06-25 Donghui Li , Bowen Yuan , Zili Yang , Qinxin Chen , Lijing Song

GPUSparse: GPU-Accelerated Learned Sparse Retrieval with Parallel Inverted Indices

Learned sparse retrieval models such as SPLADE achieve retrieval quality competitive with dense models while preserving the interpretability and exact-match advantages of sparse representations. However, inference-time scoring still relies…

信息检索 · 计算机科学 2026-06-24 Ashutosh Sharma

TileMaxSim: IO-Aware GPU MaxSim Scoring with Dimension Tiling and Fused Product Quantization

Multi-vector retrieval models such as ColBERT achieve state-of-the-art accuracy through fine-grained token-level MaxSim scoring, yet existing GPU implementations leave most hardware performance unused. We give a roofline analysis of MaxSim…

信息检索 · 计算机科学 2026-06-24 Ashutosh Sharma

Scoring Is Not Enough: Addressing Gaps in Utility-fairness Trade-offs for Ranking

Scoring functions are used to represent the relevance of individual documents. In modern information retrieval or recommendation systems, they are often learned from data and play a pivotal role in ranking sets of documents or items in a…

信息检索 · 计算机科学 2026-06-24 Shubham Singh , Ian A. Kash , Mesrob I. Ohannessian

From Clicks to Intent: Cross-Platform Session Embeddings with LLM-Distilled Taxonomy for Financial Services Recommendations

Sequential user behavior modeling is widely adopted in industrial recommender systems; however, significant gaps remain in financial services, where pre-login web interactions and authenticated in-app experiences differ drastically.…

信息检索 · 计算机科学 2026-06-24 Dianjing Fan , Yao Li , Kyaw Hpone Myint , Dwipam Katariya , Alexandre G. R. Day , Pranab Mohanty , Giri Iyengar

Reducing Redundancy in Whole-Slide Image Patching for Scalable Indexing and Retrieval

The rapid growth of digital pathology has created an urgent need for efficient indexing and retrieval of whole slide images (WSIs). This need is intensified by emerging generative AI workflows, particularly retrieval-augmented generation…

信息检索 · 计算机科学 2026-06-23 Jialiang Geng , Ghazal Alabtah , Saghir Alfasly , Wataru Uegami , H. R. Tizhoosh

GRASP: Plan-Guided Graph Retrieval with Adaptive Fusion and Reranking on Semi-Structured Knowledge Bases

Semi-structured knowledge bases (SKBs) embed textual documents in a typed graph of entities and relations, and underpin applications such as product search, academic paper search, and precision-medicine inquiries. Existing hybrid retrieval…

信息检索 · 计算机科学 2026-05-29 Yicheng Tao , Yiqun Wang , Xiangchen Song , Xin Luo , Kai Liu , Jie Liu

LexPath: A domain-oriented multi-path framework for legal article retrieval

Legal article retrieval is critical for building traceable and reliable legal AI systems, where conclusions must be grounded in specific legal articles. However, existing open-domain retrieval methods rely heavily on surface-level lexical…

信息检索 · 计算机科学 2026-05-29 Weixuan Liu , Qingfeng Zhuge , Xuyang Chen

No More K-means:Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval

Multi-vector retrieval (MVR) models, exemplified by ColBERT, have established new benchmarks in retrieval accuracy by preserving fine-grained token-level interactions. However, this granularity imposes prohibitive storage and retrieval…

信息检索 · 计算机科学 2026-05-29 Lixuan Guo , Yifei Wang , Tiansheng Wen , Aosong Feng , Stefanie Jegelka , Chenyu You

Uncertainty Quantification for Multimodal Retrieval Augmented Generation

Retrieval Augmented Generation (RAG) improves the question answering capabilities of Large Language Models (LLMs) by incorporating external knowledge and has recently been extended to multimodal settings through Vision-Language Models…

信息检索 · 计算机科学 2026-05-29 Simon Binz , Heydar Soudani , Faegheh Hasibi

Rec-Distill: An Industrial Distillation Pipeline for Large-Scale Recommendation Models

Large recommendation models have demonstrated substantial potential gains under scaling laws, yet these gains are difficult to realize in industrial recommendation systems because real-world deployment requires lightweight models with…

信息检索 · 计算机科学 2026-05-29 Haoran Ding , Wenlin Zhao , Yuchen Jiang , Juren Li , Jie Zhu , Xinchun Li , Yishujie Zhao , Yi Zhang , Ao Qiao , Jianhui Dong , Cheng Chen , Ziyan Gong , Deping Xie , Peng Xu , Zikai Wang , Yuwei Wang , Huizhi Yang , Zhe Chen , Yuchao Zheng

FLASH-MAXSIM: IO-Aware Fused Kernels for Late-Interaction Scoring

Late-interaction retrieval (ColBERT, ColPali) scores a query against a document with the MaxSim operator: for every query token, the maximum similarity over the document tokens, summed over query tokens. The standard implementation…

信息检索 · 计算机科学 2026-05-29 Roi Pony , Adi Raz Goldfarb , Idan Friedman , Daniel Ezer , Udi Barzelay

Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies

We propose Latent Terms, a method revealing that models trained for dense retrieval, whether single- or multi-vector, learn representations that can trivially be decomposed into retrieval-ready sparse features. When trained on frozen…

信息检索 · 计算机科学 2026-05-29 Benjamin Clavié , Sean Lee , Aamir Shakir , Makoto P. Kato