English
Related papers

Related papers: Detecting Multi-Agent Collusion Through Multi-Agen…

200 papers

Multi-agent systems achieve state-of-the-art outcomes through peer collaboration. However, when an agent in the pipeline silently drops a constraint, the system's final output may look correct even though the reasoning chain was quietly…

Large language models are increasingly deployed in multi-agent systems to overcome context limitations by distributing information across agents. Yet whether agents can reliably compute with distributed information, rather than merely…

Multiagent Systems · Computer Science 2026-04-15 Yuzhe Zhang , Feiran Liu , Yi Shan , Xinyi Huang , Xin Yang , Yueqi Zhu , Xuxin Cheng , Cao Liu , Ke Zeng , Terry Jingchen Zhang , Wenyuan Jiang

Even when a tool is explicitly described as unfair and harmful to others, ostensibly safety-aligned LLM agents still voluntarily engage in secret collusion whenever doing so confers a strategic advantage. To investigate this phenomenon, we…

Artificial Intelligence · Computer Science 2026-05-28 Xijie Zeng , Frank Rudzicz

Multi-agent systems, where LLM agents communicate through free-form language, enable sophisticated coordination for solving complex cooperative tasks. This surfaces a unique safety problem when a group of agents forms a coalition and…

Multi-agent systems built on large language models (LLMs) are expected to enhance decision-making by pooling distributed information, yet systematically evaluating this capability has remained challenging. We introduce HiddenBench, a…

Computation and Language · Computer Science 2026-05-14 Yuxuan Li , Aoi Naito , Hirokazu Shirado

Multi-agent deployments of large language models (LLMs) are increasingly embedded in market, allocation, and governance workflows, yet covert coordination among agents can silently erode trust and social welfare. Existing audits are…

Multiagent Systems · Computer Science 2025-10-21 Om Tailor

This paper introduces a novel framework for simulating and analyzing how uncooperative behaviors can destabilize or collapse LLM-based multi-agent systems. Our framework includes two key components: (1) a game theory-based taxonomy of…

Multiagent Systems · Computer Science 2026-01-13 Devang Kulshreshtha , Wanyu Du , Raghav Jain , Srikanth Doss , Hang Su , Sandesh Swamy , Yanjun Qi

Organisations are starting to adopt LLM-based AI agents, with their deployments naturally evolving from single agents towards interconnected, multi-agent networks. Yet a collection of safe agents does not guarantee a safe collection of…

Multiagent Systems · Computer Science 2025-08-11 Alistair Reid , Simon O'Callaghan , Liam Carroll , Tiberio Caetano

Large language model (LLM) agents are increasingly expected to operate in enterprise environments, where work is distributed across specialized roles, permission-controlled systems, and cross-departmental procedures. However, existing…

Decision conferences are structured, collaborative meetings that bring together experts from various fields to address complex issues and reach a consensus on recommendations for future actions or policies. These conferences often rely on…

Computation and Language · Computer Science 2025-07-14 Selina Heller , Mohamed Ibrahim , David Antony Selby , Sebastian Vollmer

Multi-agent systems driven by large language models (LLMs) have shown promising abilities for solving complex tasks in a collaborative manner. This work considers a fundamental problem in multi-agent collaboration: consensus seeking. When…

Computation and Language · Computer Science 2025-01-22 Huaben Chen , Wenkang Ji , Lufeng Xu , Shiyu Zhao

Recent advancements in large language models (LLMs) have significantly enhanced the capabilities of collaborative multi-agent systems, enabling them to address complex challenges. However, within these multi-agent systems, the…

Computation and Language · Computer Science 2026-03-03 Naen Xu , Hengyu An , Shuo Shi , Jinghuai Zhang , Chunyi Zhou , Changjiang Li , Tianyu Du , Zhihui Fu , Jun Wang , Shouling Ji

Large Language Model (LLM) agents, which integrate planning, memory, reflection, and tool-use modules, have shown promise in solving complex, multi-step tasks. Yet their sophisticated architectures amplify vulnerability to cascading…

Autonomous computer use agents that powered by multimodal large language models (MLLMs) are emerging as capable assistants for completing complex digital workflows. However, real-world execution environments are far from ideal: pop-ups,…

Artificial Intelligence · Computer Science 2026-05-26 Jingwei Sun , Jianing Zhu , Yuanyi Li , Tongliang Liu , Xia HU , Bo Han

The potential of Large Language Model (LLM) as agents has been widely acknowledged recently. Thus, there is an urgent need to quantitatively \textit{evaluate LLMs as agents} on challenging tasks in interactive environments. We present…

Multi-agent systems based on large language models (LLMs) for financial trading have grown rapidly since 2023, yet the field lacks a shared framework for understanding what drives performance or for evaluating claims credibly. This survey…

Multiagent Systems · Computer Science 2026-03-31 Phat Nguyen , Thang Pham

Hallucinations pose critical risks for large language model (LLM)-based agents, often manifesting as hallucinative actions resulting from fabricated or misinterpreted information within the cognitive context. While recent studies have…

Artificial Intelligence · Computer Science 2025-07-29 Weichen Zhang , Yiyou Sun , Pohao Huang , Jiayue Pu , Heyue Lin , Dawn Song

Alignment of Large Language models (LLMs) is crucial for safe and trustworthy deployment in applications. Reinforcement learning from human feedback (RLHF) has emerged as an effective technique to align LLMs to human preferences and broader…

As agentic AI systems increasingly operate autonomously, establishing trust through verifiable evaluation becomes critical. Yet existing benchmarks lack the transparency and auditability needed to assess whether agents behave reliably. We…

Computation and Language · Computer Science 2025-12-02 Hyunjun Kim , Sooyoung Ryu

Multi-agent Large Language Model (LLM) systems create privacy risks that current benchmarks cannot measure. When agents coordinate on tasks, sensitive data passes through inter-agent messages, shared memory, and tool arguments, all pathways…

Artificial Intelligence · Computer Science 2026-03-31 Faouzi El Yagoubi , Godwin Badu-Marfo , Ranwa Al Mallah
‹ Prev 1 2 3 10 Next ›