Computer Science

SpecBench: Evaluating Specification-Level Reasoning for Software Engineering LLM Agents

Software engineering (SWE) agents are transitioning from code generation to full software development lifecycle automation. A critical phase in this lifecycle is specification design: transforming initial proposals into carefully considered…

Multiagent Systems · Computer Science 2026-05-29 Grant Hamblin , Kevin Song , Zhanda Zhu , Anand Jayarajan , Sihang Liu , Nandita Vijaykumar , Gennady Pekhimenko

EASE Configuration Facilitates A Reproducible Science of LLM Social Simulations

LLMs are increasingly deployed to simulate social interactions, yet many of the existing simulators remain ad hoc and monolithic. This lack of architectural standardization prevents reproducible research and complicates downstream…

Multiagent Systems · Computer Science 2026-05-29 Sneheel Sarangi , Maximilian Puelma Touzel , Aurélien Bück-Kaeffer , Zachary Yang , Jean-François Godbout , Reihaneh Rabbany

GRASP: Plan-Guided Graph Retrieval with Adaptive Fusion and Reranking on Semi-Structured Knowledge Bases

Semi-structured knowledge bases (SKBs) embed textual documents in a typed graph of entities and relations, and underpin applications such as product search, academic paper search, and precision-medicine inquiries. Existing hybrid retrieval…

Information Retrieval · Computer Science 2026-05-29 Yicheng Tao , Yiqun Wang , Xiangchen Song , Xin Luo , Kai Liu , Jie Liu

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of…

Multiagent Systems · Computer Science 2026-05-29 Wenwu Li , Yuran Song , Mingze Zhao , Bo Jin , Wenhao Li

LexPath: A domain-oriented multi-path framework for legal article retrieval

Legal article retrieval is critical for building traceable and reliable legal AI systems, where conclusions must be grounded in specific legal articles. However, existing open-domain retrieval methods rely heavily on surface-level lexical…

Information Retrieval · Computer Science 2026-05-29 Weixuan Liu , Qingfeng Zhuge , Xuyang Chen

No More K-means:Single-Stage Sparse Coding for Efficient Multi-Vector Retrieval

Multi-vector retrieval (MVR) models, exemplified by ColBERT, have established new benchmarks in retrieval accuracy by preserving fine-grained token-level interactions. However, this granularity imposes prohibitive storage and retrieval…

Information Retrieval · Computer Science 2026-05-29 Lixuan Guo , Yifei Wang , Tiansheng Wen , Aosong Feng , Stefanie Jegelka , Chenyu You

When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems

The design space of agentic AI inference spans two extremes: frontier large language models (LLMs), typically hosted in the cloud and offering strong performance across a wide range of tasks at substantially high cost, and more…

Multiagent Systems · Computer Science 2026-05-29 Corrado Rainone , Davide Belli , Bence Major , Arash Behboodi

Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas

We study two-level autoresearch for cooperation: an outer-loop AI agent autonomously redesigns the inner-loop pipeline of an LLM policy-synthesis system for multi-agent Sequential Social Dilemmas (SSDs). A researcher agent $\mathcal{R}$…

Multiagent Systems · Computer Science 2026-05-29 Víctor Gallego

Uncertainty Quantification for Multimodal Retrieval Augmented Generation

Retrieval Augmented Generation (RAG) improves the question answering capabilities of Large Language Models (LLMs) by incorporating external knowledge and has recently been extended to multimodal settings through Vision-Language Models…

Information Retrieval · Computer Science 2026-05-29 Simon Binz , Heydar Soudani , Faegheh Hasibi

Evolutionary Dynamics of Cooperation in Next-Generation LLM Agent Systems: A Cross-Provider Empirical Extension

Do next-generation LLM agents inherit the cooperative biases documented in their predecessors, or does scale and provider diversity reshape equilibrium behaviour in competitive multi-agent settings? Willis et al. established a benchmark for…

Multiagent Systems · Computer Science 2026-05-29 Francisco León Zúñiga Bolívar

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

LLM-based multi-agent systems (MAS) have emerged as an effective paradigm for complex and long-horizon tasks. However, in real-world tasks, MAS often exhibit various failures during execution and such failures are difficult to eliminate…

Multiagent Systems · Computer Science 2026-05-29 Zhezheng Hao , Tianfu Wang , Huanshuo Dong , Ziyan Liu , Hong Wang , Xiankun Lin , Qiang Lin , Can Wang , Hande Dong , Jiawei Chen

Rec-Distill: An Industrial Distillation Pipeline for Large-Scale Recommendation Models

Large recommendation models have demonstrated substantial potential gains under scaling laws, yet these gains are difficult to realize in industrial recommendation systems because real-world deployment requires lightweight models with…

Information Retrieval · Computer Science 2026-05-29 Haoran Ding , Wenlin Zhao , Yuchen Jiang , Juren Li , Jie Zhu , Xinchun Li , Yishujie Zhao , Yi Zhang , Ao Qiao , Jianhui Dong , Cheng Chen , Ziyan Gong , Deping Xie , Peng Xu , Zikai Wang , Yuwei Wang , Huizhi Yang , Zhe Chen , Yuchao Zheng

CONCAT: Consensus- and Confidence-Driven Ad Hoc Teaming for Efficient LLM-Based Multi-Agent Systems

Although large language model (LLM) based multi-agent systems (MAS) show their capability to solve complex tasks and achieve higher performance over single agent systems, they lead to huge computational overheads because of heavy…

Multiagent Systems · Computer Science 2026-05-29 Ziyang Ma , Dingyi Zhang , Sichu Liang , Jiajia Chu , Pengfei Xia , Hui Zang , Deyu Zhou

FLASH-MAXSIM: IO-Aware Fused Kernels for Late-Interaction Scoring

Late-interaction retrieval (ColBERT, ColPali) scores a query against a document with the MaxSim operator: for every query token, the maximum similarity over the document tokens, summed over query tokens. The standard implementation…

Information Retrieval · Computer Science 2026-05-29 Roi Pony , Adi Raz Goldfarb , Idan Friedman , Daniel Ezer , Udi Barzelay

DynaGraph: Lightweight Multi-Model Interaction Framework via Dynamic Topological Reconfiguration

Tackling complex reasoning tasks typically relies on massive monolithic LLMs, which suffer from severe computational redundancy. While task decomposition through structured pipelines or multi-agent collaborations offers an alternative,…

Multiagent Systems · Computer Science 2026-05-29 Yanxing Guo , Zihao Zheng , Fangzhou Wu , Ling Liang , Lin Bao , Zongwei Wang , Yimao Cai

Latent Terms: Dense Retrievers Contain Trivially Extractable BM25-ready Zipfian Vocabularies

We propose Latent Terms, a method revealing that models trained for dense retrieval, whether single- or multi-vector, learn representations that can trivially be decomposed into retrieval-ready sparse features. When trained on frozen…

Information Retrieval · Computer Science 2026-05-29 Benjamin Clavié , Sean Lee , Aamir Shakir , Makoto P. Kato

ACE: Anisotropy-Controllable Embedding for LLM-enhanced Sequential Recommendation

Recent advances in the LLM-as-Extractor paradigm leverage large language models (LLMs) to transfer semantically rich item embeddings into sequential recommendation (SR) backbones. However, LLM-generated embeddings often suffer from strong…

Information Retrieval · Computer Science 2026-05-29 Dongcheol Lee , Hye-young Kim , Jongwuk Lee

LLM-ALSO: LLM-Driven Adaptive Learning-Signal Optimization for Multi-Agent Reinforcement Learning

Effective training-time guidance is central to multi-agent reinforcement learning (MARL), yet remains difficult in sparse-reward settings where weak supervision limits coordination and policy improvement, and existing methods often require…

Multiagent Systems · Computer Science 2026-05-29 Xiaoguang Wu , Zhi Zheng , Hui Xiong

UniNote: A Unified Embedding Model for Multimodal Representation and Ranking

Item-to-Item (I2I) retrieval is a fundamental part of modern content platforms, supporting critical industrial workflows from recommendation engines to content auditing. While multimodal embedding methods have advanced general retrieval,…

Information Retrieval · Computer Science 2026-05-29 Jinghan Zhao , Wenwei Jin , Anqi Li , Jintao Tong , Luya Mo , Jiawei Li , Bin Li , Yao Hu

CrossAlpha: An Annual-Report Benchmark for Cross-Market Factor Research

Cross-market factor research studies whether firm-level signals from one or more markets can predict returns in a target market, but existing public benchmarks do not support cross-market disclosure-to-return evaluation. Building such a…

Information Retrieval · Computer Science 2026-05-29 Qian Wang , Zhongyi Tong , Nuo Chen , Zhaomin Wu , Bingsheng He