Computer Science

SpecBench: Evaluating Specification-Level Reasoning for Software Engineering LLM Agents

Software engineering (SWE) agents are transitioning from code generation to full software development lifecycle automation. A critical phase in this lifecycle is specification design: transforming initial proposals into carefully considered…

Multiagent Systems · Computer Science 2026-05-29 Grant Hamblin , Kevin Song , Zhanda Zhu , Anand Jayarajan , Sihang Liu , Nandita Vijaykumar , Gennady Pekhimenko

EASE Configuration Facilitates A Reproducible Science of LLM Social Simulations

LLMs are increasingly deployed to simulate social interactions, yet many of the existing simulators remain ad hoc and monolithic. This lack of architectural standardization prevents reproducible research and complicates downstream…

Multiagent Systems · Computer Science 2026-05-29 Sneheel Sarangi , Maximilian Puelma Touzel , Aurélien Bück-Kaeffer , Zachary Yang , Jean-François Godbout , Reihaneh Rabbany

Unifying Temporal and Structural Credit Assignment in LLM-Based Multi-Agent Prompt Optimization

While Multi-Agent Systems (MAS) empower Large Language Models to tackle complex reasoning tasks through collaborative interaction, optimizing their dynamics remains a formidable challenge due to the discrete, non-differentiable nature of…

Multiagent Systems · Computer Science 2026-05-29 Wenwu Li , Yuran Song , Mingze Zhao , Bo Jin , Wenhao Li

Deep Binarized Photonic Reservoir Computing for Ultrafast Multimedia Signal Processing

We present a deep photonic neural network architecture based on ultrafast binary optical modulation from a digital micro-mirror device (DMD), optical scattering in random medium, high-speed photodetection with a CMOS sensor, and…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Muhammad Waqar Iqbal , Mohamad Alassir , Nicolas Marsal , Damien Rontani

When Cloud Agents Meet Device Agents: Lessons from Hybrid Multi-Agent Systems

The design space of agentic AI inference spans two extremes: frontier large language models (LLMs), typically hosted in the cloud and offering strong performance across a wide range of tasks at substantially high cost, and more…

Multiagent Systems · Computer Science 2026-05-29 Corrado Rainone , Davide Belli , Bence Major , Arash Behboodi

Discovering Cooperative Pipelines: Autoresearch for Sequential Social Dilemmas

We study two-level autoresearch for cooperation: an outer-loop AI agent autonomously redesigns the inner-loop pipeline of an LLM policy-synthesis system for multi-agent Sequential Social Dilemmas (SSDs). A researcher agent $\mathcal{R}$…

Multiagent Systems · Computer Science 2026-05-29 Víctor Gallego

Selection Hyper-heuristics Can Automatically Adjust the Learning Period to Optimally Solve Pseudo-Boolean Problems

The Random Gradient hyper-heuristic was recently shown to be able to learn the optimal neighbourhood size when optimizing the LeadingOnes benchmark via the Randomised Local Search (RLS) meta-heuristic. However, for this to happen, a…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Benjamin Doerr , Pietro S. Oliveto , John Alasdair Warwicker

Evolutionary Dynamics of Cooperation in Next-Generation LLM Agent Systems: A Cross-Provider Empirical Extension

Do next-generation LLM agents inherit the cooperative biases documented in their predecessors, or does scale and provider diversity reshape equilibrium behaviour in competitive multi-agent settings? Willis et al. established a benchmark for…

Multiagent Systems · Computer Science 2026-05-29 Francisco León Zúñiga Bolívar

Evolve as a Team: Collaborative Self-Evolution for LLM-based Multi-Agent Systems

LLM-based multi-agent systems (MAS) have emerged as an effective paradigm for complex and long-horizon tasks. However, in real-world tasks, MAS often exhibit various failures during execution and such failures are difficult to eliminate…

Multiagent Systems · Computer Science 2026-05-29 Zhezheng Hao , Tianfu Wang , Huanshuo Dong , Ziyan Liu , Hong Wang , Xiankun Lin , Qiang Lin , Can Wang , Hande Dong , Jiawei Chen

RTP-LLM: High-Performance Alibaba LLM Inference Engine

Large Language Models (LLMs) have revolutionized AI applications, but deploying them at scale presents significant challenges. We present RTP-LLM, a high-performance inference engine for industrial-scale LLM deployment, successfully…

Operating Systems · Computer Science 2026-05-29 Boyu Tan , Jiarui Guo , Zongwei Lv , Hanbo Sun , Tong Yang , Kan Liu , Xinfei Shi , Zetao Hu , Yaxin Yu , Chi Zhang , Jianning Zhang , Xi Yang , Wei Zhang , Bo Cai , Silu Zhou , Xiyu Wang , Na He , Yinghao Yu , Wending Bao , Guiyang Huang , Yuxing Yuan , Juncheng Yin , Nan Wang , Lin Yang , Zechao Zhang , Lu Chen , Guoding Li , Tao Lan , Lin Qu

CONCAT: Consensus- and Confidence-Driven Ad Hoc Teaming for Efficient LLM-Based Multi-Agent Systems

Although large language model (LLM) based multi-agent systems (MAS) show their capability to solve complex tasks and achieve higher performance over single agent systems, they lead to huge computational overheads because of heavy…

Multiagent Systems · Computer Science 2026-05-29 Ziyang Ma , Dingyi Zhang , Sichu Liang , Jiajia Chu , Pengfei Xia , Hui Zang , Deyu Zhou

DynaGraph: Lightweight Multi-Model Interaction Framework via Dynamic Topological Reconfiguration

Tackling complex reasoning tasks typically relies on massive monolithic LLMs, which suffer from severe computational redundancy. While task decomposition through structured pipelines or multi-agent collaborations offers an alternative,…

Multiagent Systems · Computer Science 2026-05-29 Yanxing Guo , Zihao Zheng , Fangzhou Wu , Ling Liang , Lin Bao , Zongwei Wang , Yimao Cai

Evolutionary Rule Extraction from Corporate Default Prediction Models

Small and medium-sized enterprises (SMEs) represent the majority of firms in most economies and often face financial constraints and higher vulnerability to financial distress. Predicting SME default is therefore crucial for financial…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Desirè Fabbretti , Matteo Pasquino , Elia Pacioni , Caterina Lucarelli , Davide Calvaresi

Runtime Analysis of a Compact Genetic Algorithm on a Truly Multi-valued OneMax Function

Recently, the runtime analysis of multi-valued estimation-of-distribution algorithms in the framework of Ben Jedidia et al. (TCS 2024) has made significant advancements. However, almost all existing analyses are limited to multi-valued…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Martin S. Krejca , Carsten Witt

EvoGM: Learning to Merge LLMs via Evolutionary Generative Optimization

Evolutionary model merging provides a powerful framework for the automated, training-free composition of LLMs through parameter-space search. However, existing methods predominantly rely on stochastic, hand-crafted operators that overlook…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Tao Jiang , Xinmeng Yu , Chenhao Yi , Yiling Wu , Yan Li , Ran Cheng , Dongmei Jiang , Jianguo Zhang

LLM-ALSO: LLM-Driven Adaptive Learning-Signal Optimization for Multi-Agent Reinforcement Learning

Effective training-time guidance is central to multi-agent reinforcement learning (MARL), yet remains difficult in sparse-reward settings where weak supervision limits coordination and policy improvement, and existing methods often require…

Multiagent Systems · Computer Science 2026-05-29 Xiaoguang Wu , Zhi Zheng , Hui Xiong

The incremental voter model: mean-field analysis and convergence to equilibrium

We introduce the incremental voter model (IVM), a discrete-opinion multi-agent system where agents undergo step-wise transitions biased by the opinion of a randomly selected persuader. Our incremental voter model comprises a large…

Multiagent Systems · Computer Science 2026-05-29 Fei Cao , Xiaoqian Gong

WASHH: An Anchor-Aware Whale-Guided Selection Hyper-Heuristic for Continuous Optimization and SVC Configuration

Learning-assisted algorithm design often has to make reliable search decisions under small evaluation budgets, where committing to a single metaheuristic can be unreliable. We propose WASHH, a Whale-guided Adaptive Selection Hyper-Heuristic…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Yifu Zhao , Xiaofan Zou , Junhao Wei , Yanxiao Li , Baili Lu , Zhenhong Peng , Dexing Yao , Haochen Li , Qinbin He , Sio-Kei Im , Xu Yang , Yapeng Wang

CalBench: Evaluating Coordination-Privacy Trade-offs in Multi-Agent LLMs

Personal AI assistants are beginning to act as delegates with access to calendars, inboxes, and user preferences. Calendar scheduling makes the trust problem concrete: an assistant must coordinate with other assistants while deciding what…

Multiagent Systems · Computer Science 2026-05-29 Chelsea Zou , Yiheng Yao , Selena She , Noah Goodman , Robert D. Hawkins

Architecture-Induced Recoverability Bias in Differentiable Symbolic Regression

Symbolic regression aims to recover closed-form expressions from numerical data, but in differentiable symbolic regression the recovered expression depends not only on the grammar but also on the fixed architecture through which variables…

Neural and Evolutionary Computing · Computer Science 2026-05-29 Chakshu Gupta , Theodore J. LaGrow