Related papers: Detecting Multi-Agent Collusion Through Multi-Agen…

AgentCollabBench: Diagnosing When Good Agents Make Bad Collaborators

Multi-agent systems achieve state-of-the-art outcomes through peer collaboration. However, when an agent in the pipeline silently drops a constraint, the system's final output may look correct even though the reasoning chain was quietly…

Computation and Language · Computer Science 2026-05-12 Aritra Mazumder , Shubhashis Roy Dipta , Nusrat Jahan Lia , Tanzila Khan , Kainat Raisa Hossain , Nehaa Shri , Shubhrangshu Debsarkar , Humayra Tasnim , Gour Gupal Talukder Shawon , Debjoty Mitra , Sumaiya Ahmed Rani , Al Jami Islam Anik , Al Nafeu Khan

Silo-Bench: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems

Large language models are increasingly deployed in multi-agent systems to overcome context limitations by distributing information across agents. Yet whether agents can reliably compute with distributed information, rather than merely…

Multiagent Systems · Computer Science 2026-04-15 Yuzhe Zhang , Feiran Liu , Yi Shan , Xinyi Huang , Xin Yang , Yueqi Zhu , Xuxin Cheng , Cao Liu , Ke Zeng , Terry Jingchen Zhang , Wenyuan Jiang

Voluntary Collusion with Secret Tools in Competing LLM Agents

Even when a tool is explicitly described as unfair and harmful to others, ostensibly safety-aligned LLM agents still voluntarily engage in secret collusion whenever doing so confers a strategic advantage. To investigate this phenomenon, we…

Artificial Intelligence · Computer Science 2026-05-28 Xijie Zeng , Frank Rudzicz

Colosseum: Auditing Collusion in Cooperative Multi-Agent Systems

Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a unique safety problem when a group of agents forms a coalition and…

Multiagent Systems · Computer Science 2026-05-28 Mason Nakamura , Abhinav Kumar , Saswat Das , Sahar Abdelnabi , Saaduddin Mahmud , Ferdinando Fioretto , Shlomo Zilberstein , Eugene Bagdasarian

Systematic Failures in Collective Reasoning under Distributed Information in Multi-Agent LLMs

Multi-agent systems built on large language models (LLMs) are expected to enhance decision-making by pooling distributed information, yet systematically evaluating this capability has remained challenging. We introduce HiddenBench, a…

Computation and Language · Computer Science 2026-05-14 Yuxuan Li , Aoi Naito , Hirokazu Shirado

Audit the Whisper: Detecting Steganographic Collusion in Multi-Agent LLMs

Multi-agent deployments of large language models (LLMs) are increasingly embedded in market, allocation, and governance workflows, yet covert coordination among agents can silently erode trust and social welfare. Existing audits are…

Multiagent Systems · Computer Science 2025-10-21 Om Tailor

The Subtle Art of Defection: Understanding Uncooperative Behaviors in LLM based Multi-Agent Systems

This paper introduces a novel framework for simulating and analyzing how uncooperative behaviors can destabilize or collapse LLM-based multi-agent systems. Our framework includes two key components: (1) a game theory-based taxonomy of…

Multiagent Systems · Computer Science 2026-01-13 Devang Kulshreshtha , Wanyu Du , Raghav Jain , Srikanth Doss , Hang Su , Sandesh Swamy , Yanjun Qi

Risk Analysis Techniques for Governed LLM-based Multi-Agent Systems

Organisations are starting to adopt LLM-based AI agents, with their deployments naturally evolving from single agents towards interconnected, multi-agent networks. Yet a collection of safe agents does not guarantee a safe collection of…

Multiagent Systems · Computer Science 2025-08-11 Alistair Reid , Simon O'Callaghan , Liam Carroll , Tiberio Caetano

Beyond the All-in-One Agent: Benchmarking Role-Specialized Multi-Agent Collaboration in Enterprise Workflows

Large language model (LLM) agents are increasingly expected to operate in enterprise environments, where work is distributed across specialized roles, permission-controlled systems, and cross-departmental procedures. However, existing…

Multiagent Systems · Computer Science 2026-05-12 Tao Yu , Hao Wang , Changyu Li , Shenghua Chai , Minghui Zhang , Zhongtian Luo , Yuxuan Zhou , Haopeng Jin , Zhaolu Kang , Jiabing Yang , YiFan Zhang , Xinming Wang , Hongzhu Yi , Zheqi He , Jing-Shu Zheng , Xi Yang , Yan Huang , Liang Wang

Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences

Decision conferences are structured, collaborative meetings that bring together experts from various fields to address complex issues and reach a consensus on recommendations for future actions or policies. These conferences often rely on…

Computation and Language · Computer Science 2025-07-14 Selina Heller , Mohamed Ibrahim , David Antony Selby , Sebastian Vollmer

Multi-Agent Consensus Seeking via Large Language Models

Multi-agent systems driven by large language models (LLMs) have shown promising abilities for solving complex tasks in a collaborative manner. This work considers a fundamental problem in multi-agent collaboration: consensus seeking. When…

Computation and Language · Computer Science 2025-01-22 Huaben Chen , Wenkang Ji , Lufeng Xu , Shiyu Zhao

When Agents "Misremember" Collectively: Exploring the Mandela Effect in LLM-based Multi-Agent Systems

Recent advancements in large language models (LLMs) have significantly enhanced the capabilities of collaborative multi-agent systems, enabling them to address complex challenges. However, within these multi-agent systems, the…

Computation and Language · Computer Science 2026-03-03 Naen Xu , Hengyu An , Shuo Shi , Jinghuai Zhang , Chunyi Zhou , Changjiang Li , Tianyu Du , Zhihui Fu , Jun Wang , Shouling Ji

Where LLM Agents Fail and How They can Learn From Failures

Large Language Model (LLM) agents, which integrate planning, memory, reflection, and tool-use modules, have shown promise in solving complex, multi-step tasks. Yet their sophisticated architectures amplify vulnerability to cascading…

Artificial Intelligence · Computer Science 2025-10-01 Kunlun Zhu , Zijia Liu , Bingxuan Li , Muxin Tian , Yingxuan Yang , Jiaxun Zhang , Pengrui Han , Qipeng Xie , Fuyang Cui , Weijia Zhang , Xiaoteng Ma , Xiaodong Yu , Gowtham Ramesh , Jialian Wu , Zicheng Liu , Pan Lu , James Zou , Jiaxuan You

AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions

Autonomous computer use agents powered by multimodal large language models (MLLMs) are emerging as capable assistants for completing complex digital workflows. However, real-world execution environments are far from ideal: pop-ups,…

Artificial Intelligence · Computer Science 2026-05-26 Jingwei Sun , Jianing Zhu , Yuanyi Li , Tongliang Liu , Xia HU , Bo Han

AgentBench: Evaluating LLMs as Agents

The potential of Large Language Model (LLM) as agents has been widely acknowledged recently. Thus, there is an urgent need to quantitatively evaluate LLMs as agents on challenging tasks in interactive environments. We present…

Artificial Intelligence · Computer Science 2025-10-07 Xiao Liu , Hao Yu , Hanchen Zhang , Yifan Xu , Xuanyu Lei , Hanyu Lai , Yu Gu , Hangliang Ding , Kaiwen Men , Kejuan Yang , Shudan Zhang , Xiang Deng , Aohan Zeng , Zhengxiao Du , Chenhui Zhang , Sheng Shen , Tianjun Zhang , Yu Su , Huan Sun , Minlie Huang , Yuxiao Dong , Jie Tang

Toward Reliable Evaluation of LLM-Based Financial Multi-Agent Systems: Taxonomy, Coordination Primacy, and Cost Awareness

Multi-agent systems based on large language models (LLMs) for financial trading have grown rapidly since 2023, yet the field lacks a shared framework for understanding what drives performance or for evaluating claims credibly. This survey…

Multiagent Systems · Computer Science 2026-03-31 Phat Nguyen , Thang Pham

MIRAGE-Bench: LLM Agent is Hallucinating and Where to Find Them

Hallucinations pose critical risks for large language model (LLM)-based agents, often manifesting as hallucinative actions resulting from fabricated or misinterpreted information within the cognitive context. While recent studies have…

Artificial Intelligence · Computer Science 2025-07-29 Weichen Zhang , Yiyou Sun , Pohao Huang , Jiayue Pu , Heyue Lin , Dawn Song

Collab: Controlled Decoding using Mixture of Agents for LLM Alignment

Alignment of Large Language models (LLMs) is crucial for safe and trustworthy deployment in applications. Reinforcement learning from human feedback (RLHF) has emerged as an effective technique to align LLMs to human preferences and broader…

Computation and Language · Computer Science 2025-03-28 Souradip Chakraborty , Sujay Bhatt , Udari Madhushani Sehwag , Soumya Suvra Ghosal , Jiahao Qiu , Mengdi Wang , Dinesh Manocha , Furong Huang , Alec Koppel , Sumitra Ganesh

DrawingBench: Evaluating Spatial Reasoning and UI Interaction Capabilities of Large Language Models through Mouse-Based Drawing Tasks

As agentic AI systems increasingly operate autonomously, establishing trust through verifiable evaluation becomes critical. Yet existing benchmarks lack the transparency and auditability needed to assess whether agents behave reliably. We…

Computation and Language · Computer Science 2025-12-02 Hyunjun Kim , Sooyoung Ryu

AgentLeak: A Full-Stack Benchmark for Privacy Leakage in Multi-Agent LLM Systems

Multi-agent Large Language Model (LLM) systems create privacy risks that current benchmarks cannot measure. When agents coordinate on tasks, sensitive data passes through inter-agent messages, shared memory, and tool arguments, all pathways…

Artificial Intelligence · Computer Science 2026-03-31 Faouzi El Yagoubi , Godwin Badu-Marfo , Ranwa Al Mallah