Related papers: Analyzing Code Injection Attacks on LLM-based Mult…

Multi-Agent Systems Execute Arbitrary Malicious Code

Multi-agent systems coordinate LLM-based agents to perform tasks on users' behalf. In real-world applications, multi-agent systems will inevitably interact with untrusted inputs, such as malicious Web content, files, email attachments, and…

Cryptography and Security · Computer Science 2025-09-16 Harold Triedman , Rishi Jha , Vitaly Shmatikov

Prompt Injection Attacks on Agentic Coding Assistants: A Systematic Analysis of Vulnerabilities in Skills, Tools, and Protocol Ecosystems

The proliferation of agentic AI coding assistants, including Claude Code, GitHub Copilot, Cursor, and emerging skill-based architectures, has fundamentally transformed software development workflows. These systems leverage Large Language…

Cryptography and Security · Computer Science 2026-01-27 Narek Maloyan , Dmitry Namiot

Lessons from Penetration Tests on Large-Scale Agent Systems

As AI systems gain increasing autonomy and execution capability, the number of discovered security vulnerabilities continues to rise. However, many of these vulnerabilities are not fundamentally novel, but instead reflect recurring classes…

Cryptography and Security · Computer Science 2026-05-27 Kevin Eykholt , Dhilung Kirat , Xiaokui Shu , Jiyong Jang , Frederico Araujo , Ian Molloy

Prompt Infection: LLM-to-LLM Prompt Injection within Multi-Agent Systems

As Large Language Models (LLMs) grow increasingly powerful, multi-agent systems are becoming more prevalent in modern AI applications. Most safety research, however, has focused on vulnerabilities in single-agent LLMs. These include prompt…

Multiagent Systems · Computer Science 2024-10-11 Donghyun Lee , Mo Tiwari

Architecting Secure AI Agents: Perspectives on System-Level Defenses Against Indirect Prompt Injection Attacks

AI agents, predominantly powered by large language models (LLMs), are vulnerable to indirect prompt injection, in which malicious instructions embedded in untrusted data can trigger dangerous agent actions. This position paper discusses our…

Cryptography and Security · Computer Science 2026-04-01 Chong Xiang , Drew Zagieboylo , Shaona Ghosh , Sanjay Kariyappa , Kai Greshake , Hanshen Xiao , Chaowei Xiao , G. Edward Suh

Shadows in the Code: Exploring the Risks and Defenses of LLM-based Multi-Agent Software Development Systems

The rapid advancement of Large Language Model (LLM)-driven multi-agent systems has significantly streamlined software developing tasks, enabling users with little technical expertise to develop executable applications. While these systems…

Cryptography and Security · Computer Science 2025-11-25 Xiaoqing Wang , Keman Huang , Bin Liang , Hongyu Li , Xiaoyong Du

Design Patterns for Securing LLM Agents against Prompt Injections

As AI agents powered by Large Language Models (LLMs) become increasingly versatile and capable of addressing a broad spectrum of tasks, ensuring their security has become a critical challenge. Among the most pressing threats are prompt…

Machine Learning · Computer Science 2025-06-30 Luca Beurer-Kellner , Beat Buesser , Ana-Maria Creţu , Edoardo Debenedetti , Daniel Dobos , Daniel Fabian , Marc Fischer , David Froelicher , Kathrin Grosse , Daniel Naeff , Ezinwanne Ozoani , Andrew Paverd , Florian Tramèr , Václav Volhejn

Towards Unifying Quantitative Security Benchmarking for Multi Agent Systems

Evolving AI systems increasingly deploy multi-agent architectures where autonomous agents collaborate, share information, and delegate tasks through developing protocols. This connectivity, while powerful, introduces novel security risks.…

Cryptography and Security · Computer Science 2025-07-30 Gauri Sharma , Vidhi Kulkarni , Miles King , Ken Huang

Architecture Matters for Multi-Agent Security

Multi-agent systems (MAS), composed of networks of two or more autonomous AI agents, have become increasingly popular in production deployments, yet introduce security risks that do not arise in single-agent settings. Even if individual…

Multiagent Systems · Computer Science 2026-04-28 Ben Hagag , William L. Anderson , Christian Schroeder de Witt , Sarah Scheffler

PeerGuard: Defending Multi-Agent Systems Against Backdoor Attacks Through Mutual Reasoning

Multi-agent systems leverage advanced AI models as autonomous agents that interact, cooperate, or compete to complete complex tasks across applications such as robotics and traffic management. Despite their growing importance, safety in…

Multiagent Systems · Computer Science 2025-05-28 Falong Fan , Xi Li

A Multi-Agent LLM Defense Pipeline Against Prompt Injection Attacks

Prompt injection attacks represent a major vulnerability in Large Language Model (LLM) deployments, where malicious instructions embedded in user inputs can override system prompts and induce unintended behaviors. This paper presents a…

Cryptography and Security · Computer Science 2025-12-18 S M Asif Hossain , Ruksat Khan Shayoni , Mohd Ruhul Ameen , Akif Islam , M. F. Mridha , Jungpil Shin

Exposing Weak Links in Multi-Agent Systems under Adversarial Prompting

LLM-based agents are increasingly deployed in multi-agent systems (MAS). As these systems move toward real-world applications, their security becomes paramount. Existing research largely evaluates single-agent security, leaving a critical…

Multiagent Systems · Computer Science 2025-11-17 Nirmit Arora , Sathvik Joel , Ishan Kavathekar , Palak , Rohan Gandhi , Yash Pandya , Tanuja Ganu , Aditya Kanade , Akshay Nambi

Risk Analysis Techniques for Governed LLM-based Multi-Agent Systems

Organisations are starting to adopt LLM-based AI agents, with their deployments naturally evolving from single agents towards interconnected, multi-agent networks. Yet a collection of safe agents does not guarantee a safe collection of…

Multiagent Systems · Computer Science 2025-08-11 Alistair Reid , Simon O'Callaghan , Liam Carroll , Tiberio Caetano

On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents

Large language model-based multi-agent systems have shown great abilities across various tasks due to the collaboration of expert agents, each focusing on a specific domain. However, the impact of clumsy or even malicious agents--those who…

Artificial Intelligence · Computer Science 2025-05-30 Jen-tse Huang , Jiaxu Zhou , Tailin Jin , Xuhui Zhou , Zixi Chen , Wenxuan Wang , Youliang Yuan , Michael R. Lyu , Maarten Sap

When Bots Take the Bait: Exposing and Mitigating the Emerging Social Engineering Attack in Web Automation Agent

Web agents, powered by large language models (LLMs), are increasingly deployed to automate complex web interactions. The rise of open-source frameworks (e.g., Browser Use, Skyvern-AI) has accelerated adoption, but also broadened the attack…

Cryptography and Security · Computer Science 2026-01-13 Xinyi Wu , Geng Hong , Yueyue Chen , MingXuan Liu , Feier Jin , Xudong Pan , Jiarun Dai , Baojun Liu

Benchmarking the Robustness of Agentic Systems to Adversarially-Induced Harms

Ensuring the safe use of agentic systems requires a thorough understanding of the range of malicious behaviors these systems may exhibit when under attack. In this paper, we evaluate the robustness of LLM-based agentic systems against…

Machine Learning · Computer Science 2025-10-08 Jonathan Nöther , Adish Singla , Goran Radanovic

SoK: The Attack Surface of Agentic AI -- Tools, and Autonomy

Recent AI systems combine large language models with tools, external knowledge via retrieval-augmented generation (RAG), and even autonomous multi-agent decision loops. This agentic AI paradigm greatly expands capabilities - but also vastly…

Cryptography and Security · Computer Science 2026-03-25 Ali Dehghantanha , Sajad Homayoun

Agentic AI for Autonomous Defense in Software Supply Chain Security: Beyond Provenance to Vulnerability Mitigation

The software supply chain attacks are becoming more and more focused on trusted development and delivery procedures, so the conventional post-build integrity mechanisms cannot be used anymore. The available frameworks like SLSA, SBOM and in…

Cryptography and Security · Computer Science 2025-12-30 Toqeer Ali Syed , Mohammad Riyaz Belgaum , Salman Jan , Asadullah Abdullah Khan , Saad Said Alqahtani

Takedown: How It's Done in Modern Coding Agent Exploits

Coding agents, which are LLM-driven agents specialized in software development, have become increasingly prevalent in modern programming environments. Unlike traditional AI coding assistants, which offer simple code completion and…

Cryptography and Security · Computer Science 2025-09-30 Eunkyu Lee , Donghyeon Kim , Wonyoung Kim , Insu Yun

Bridging AI and Software Security: A Comparative Vulnerability Assessment of LLM Agent Deployment Paradigms

Large Language Model (LLM) agents face security vulnerabilities spanning AI-specific and traditional software domains, yet current research addresses these separately. This study bridges this gap through comparative evaluation of Function…

Cryptography and Security · Computer Science 2025-07-10 Tarek Gasmi , Ramzi Guesmi , Ines Belhadj , Jihene Bennaceur