Related papers: Multi-Agent Planning Using Visual Language Models

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

Large Language Models (LLMs) have achieved remarkable success across a wide array of tasks. Due to the impressive planning and reasoning abilities of LLMs, they have been used as autonomous agents to do many tasks automatically. Recently,…

Computation and Language · Computer Science 2024-04-22 Taicheng Guo, Xiuying Chen, Yaqi Wang, Ruidi Chang, Shichao Pei, Nitesh V. Chawla, Olaf Wiest, Xiangliang Zhang

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

This study focuses on using large language models (LLMs) as a planner for embodied agents that can follow natural language instructions to complete complex tasks in a visually-perceived environment. The high data cost and poor sample…

Artificial Intelligence · Computer Science 2023-09-08 Chan Hee Song, Jiaman Wu, Clayton Washington, Brian M. Sadler, Wei-Lun Chao, Yu Su

Adaptive Domain Modeling with Language Models: A Multi-Agent Approach to Task Planning

We introduce TAPAS (Task-based Adaptation and Planning using AgentS), a multi-agent framework that integrates Large Language Models (LLMs) with symbolic planning to solve complex tasks without the need for manually defined environment…

Artificial Intelligence · Computer Science 2025-07-01 Harisankar Babu, Philipp Schillinger, Tamim Asfour

Voice2Action: Language Models as Agent for Efficient Real-Time Interaction in Virtual Reality

Large Language Models (LLMs) are trained and aligned to follow natural language instructions with only a handful of examples, and they are prompted as task-driven autonomous agents to adapt to various sources of execution environments.…

Computation and Language · Computer Science 2023-10-03 Yang Su

Multi-Agent Systems for Robotic Autonomy with LLMs

Since the advent of Large Language Models (LLMs), various research based on such models have maintained significant academic attention and impact, especially in AI and robotics. In this paper, we propose a multi-agent framework with LLMs to…

Robotics · Computer Science 2025-05-12 Junhong Chen, Ziqi Yang, Haoyuan G Xu, Dandan Zhang, George Mylonas

Towards Embodied Agentic AI: Review and Classification of LLM- and VLM-Driven Robot Autonomy and Interaction

Foundation models, including large language models (LLMs) and vision-language models (VLMs), have recently enabled novel approaches to robot autonomy and human-robot interfaces. In parallel, vision-language-action models (VLAs) or large…

Robotics · Computer Science 2025-11-14 Sahar Salimpour, Lei Fu, Kajetan Rachwał, Pascal Bertrand, Kevin O'Sullivan, Robert Jakob, Farhad Keramat, Leonardo Militano, Giovanni Toffetti, Harry Edelman, Jorge Peña Queralta

ALMAS: an Autonomous LLM-based Multi-Agent Software Engineering Framework

Multi-agent Large Language Model (LLM) systems have been leading the way in applied LLM research across a number of fields. One notable area is software development, where researchers have advanced the automation of code implementation,…

Software Engineering · Computer Science 2025-11-25 Vali Tawosi, Keshav Ramani, Salwa Alamir, Xiaomo Liu

Brain-Inspired Graph Multi-Agent Systems for LLM Reasoning

Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide range of language tasks, yet complex multi-step reasoning remains a fundamental challenge. While Large Reasoning Models (LRMs) equipped with extended…

Artificial Intelligence · Computer Science 2026-03-17 Guangfu Hao, Yuming Dai, Xianzhe Qin, Shan Yu

LLM-Based Multi-Agent Systems for Software Engineering: Literature Review, Vision and the Road Ahead

Integrating Large Language Models (LLMs) into autonomous agents marks a significant shift in the research landscape by offering cognitive abilities that are competitive with human planning and reasoning. This paper explores the…

Software Engineering · Computer Science 2025-07-21 Junda He, Christoph Treude, David Lo

A Lightweight Large Language Model-Based Multi-Agent System for 2D Frame Structural Analysis

Large language models (LLMs) have recently been used to empower autonomous agents in engineering, significantly improving automation and efficiency in labor-intensive workflows. However, their potential remains underexplored in structural…

Computation and Language · Computer Science 2025-10-08 Ziheng Geng, Jiachen Liu, Ran Cao, Lu Cheng, Haifeng Wang, Minghui Cheng

RoboAgent: Chaining Basic Capabilities for Embodied Task Planning

This paper focuses on embodied task planning, where an agent acquires visual observations from the environment and executes atomic actions to accomplish a given task. Although recent Vision-Language Models (VLMs) have achieved impressive…

Robotics · Computer Science 2026-04-10 Peiran Xu, Jiaqi Zheng, Yadong Mu

Modeling Layered Consciousness with Multi-Agent Large Language Models

We propose a multi-agent framework for modeling artificial consciousness in large language models (LLMs), grounded in psychoanalytic theory. Our Psychodynamic Model simulates self-awareness, preconsciousness, and unconsciousness…

Computation and Language · Computer Science 2025-10-22 Sang Hun Kim, Jongmin Lee, Dongkyu Park, So Young Lee, Yosep Chong

Query Optimization Beyond Data Systems: The Case for Multi-Agent Systems

The proliferation of large language models (LLMs) has accelerated the adoption of agent-based workflows, where multiple autonomous agents reason, invoke functions, and collaborate to compose complex data pipelines. However, current…

Databases · Computer Science 2025-12-15 Zoi Kaoudi, Ioana Giurgiu

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Large Language Models (LLMs) have shown remarkable capabilities as autonomous agents, yet existing benchmarks either focus on single-agent tasks or are confined to narrow domains, failing to capture the dynamics of multi-agent coordination…

Multiagent Systems · Computer Science 2025-03-05 Kunlun Zhu, Hongyi Du, Zhaochen Hong, Xiaocheng Yang, Shuyi Guo, Zhe Wang, Zhenhailong Wang, Cheng Qian, Xiangru Tang, Heng Ji, Jiaxuan You

Intrinsic Memory Agents: Heterogeneous Multi-Agent LLM Systems through Structured Contextual Memory

Multi-agent systems built on Large Language Models (LLMs) show exceptional promise for complex collaborative problem-solving, yet they face fundamental challenges stemming from context window limitations that impair memory consistency, role…

Artificial Intelligence · Computer Science 2026-01-13 Sizhe Yuen, Francisco Gomez Medina, Ting Su, Yali Du, Adam J. Sobey

Scaling Autonomous Agents via Automatic Reward Modeling And Planning

Large language models (LLMs) have demonstrated remarkable capabilities across a range of text-generation tasks. However, LLMs still struggle with problems requiring multi-step decision-making and environmental feedback, such as online…

Artificial Intelligence · Computer Science 2025-02-18 Zhenfang Chen, Delin Chen, Rui Sun, Wenjun Liu, Chuang Gan

VLM Q-Learning: Aligning Vision-Language Models for Interactive Decision-Making

Recent research looks to harness the general knowledge and reasoning of large language models (LLMs) into agents that accomplish user-specified goals in interactive environments. Vision-language models (VLMs) extend LLMs to multi-modal data…

Machine Learning · Computer Science 2025-05-07 Jake Grigsby, Yuke Zhu, Michael Ryoo, Juan Carlos Niebles

Multi-agent Architecture Search via Agentic Supernet

Large Language Model (LLM)-empowered multi-agent systems extend the cognitive boundaries of individual agents through disciplined collaboration and interaction, while constructing these systems often requires labor-intensive manual designs.…

Machine Learning · Computer Science 2025-06-10 Guibin Zhang, Luyang Niu, Junfeng Fang, Kun Wang, Lei Bai, Xiang Wang

A Novel Multi-Agent Architecture to Reduce Hallucinations of Large Language Models in Multi-Step Structural Modeling

Large language models (LLMs) such as GPT and Gemini have demonstrated remarkable capabilities in contextual understanding and reasoning. The strong performance of LLMs has sparked growing interest in leveraging them to automate tasks…

Artificial Intelligence · Computer Science 2026-03-10 Ziheng Geng, Jiachen Liu, Ran Cao, Lu Cheng, Dan M. Frangopol, Minghui Cheng

Multi-Agents Based on Large Language Models for Knowledge-based Visual Question Answering

Large Language Models (LLMs) have achieved impressive results in knowledge-based Visual Question Answering (VQA). However, existing methods still have challenges: the inability to use external tools autonomously, and the inability to work in…

Computation and Language · Computer Science 2025-08-08 Zhongjian Hu, Peng Yang, Bing Li, Zhenqi Wang