English
Related papers

Related papers: Multi-Agent Planning Using Visual Language Models

200 papers

Large Language Models (LLMs) have achieved remarkable success across a wide array of tasks. Due to the impressive planning and reasoning abilities of LLMs, they have been used as autonomous agents to do many tasks automatically. Recently,…

Computation and Language · Computer Science 2024-04-22 Taicheng Guo , Xiuying Chen , Yaqi Wang , Ruidi Chang , Shichao Pei , Nitesh V. Chawla , Olaf Wiest , Xiangliang Zhang

This study focuses on using large language models (LLMs) as a planner for embodied agents that can follow natural language instructions to complete complex tasks in a visually-perceived environment. The high data cost and poor sample…

Artificial Intelligence · Computer Science 2023-09-08 Chan Hee Song , Jiaman Wu , Clayton Washington , Brian M. Sadler , Wei-Lun Chao , Yu Su

We introduce TAPAS (Task-based Adaptation and Planning using AgentS), a multi-agent framework that integrates Large Language Models (LLMs) with symbolic planning to solve complex tasks without the need for manually defined environment…

Artificial Intelligence · Computer Science 2025-07-01 Harisankar Babu , Philipp Schillinger , Tamim Asfour

Large Language Models (LLMs) are trained and aligned to follow natural language instructions with only a handful of examples, and they are prompted as task-driven autonomous agents to adapt to various sources of execution environments.…

Computation and Language · Computer Science 2023-10-03 Yang Su

Since the advent of Large Language Models (LLMs), various research based on such models have maintained significant academic attention and impact, especially in AI and robotics. In this paper, we propose a multi-agent framework with LLMs to…

Robotics · Computer Science 2025-05-12 Junhong Chen , Ziqi Yang , Haoyuan G Xu , Dandan Zhang , George Mylonas

Foundation models, including large language models (LLMs) and vision-language models (VLMs), have recently enabled novel approaches to robot autonomy and human-robot interfaces. In parallel, vision-language-action models (VLAs) or large…

Multi-agent Large Language Model (LLM) systems have been leading the way in applied LLM research across a number of fields. One notable area is software development, where researchers have advanced the automation of code implementation,…

Software Engineering · Computer Science 2025-11-25 Vali Tawosi , Keshav Ramani , Salwa Alamir , Xiaomo Liu

Large Language Models (LLMs) have demonstrated remarkable capabilities across a wide range of language tasks, yet complex multi-step reasoning remains a fundamental challenge. While Large Reasoning Models (LRMs) equipped with extended…

Artificial Intelligence · Computer Science 2026-03-17 Guangfu Hao , Yuming Dai , Xianzhe Qin , Shan Yu

Integrating Large Language Models (LLMs) into autonomous agents marks a significant shift in the research landscape by offering cognitive abilities that are competitive with human planning and reasoning. This paper explores the…

Software Engineering · Computer Science 2025-07-21 Junda He , Christoph Treude , David Lo

Large language models (LLMs) have recently been used to empower autonomous agents in engineering, significantly improving automation and efficiency in labor-intensive workflows. However, their potential remains underexplored in structural…

Computation and Language · Computer Science 2025-10-08 Ziheng Geng , Jiachen Liu , Ran Cao , Lu Cheng , Haifeng Wang , Minghui Cheng

This paper focuses on embodied task planning, where an agent acquires visual observations from the environment and executes atomic actions to accomplish a given task. Although recent Vision-Language Models (VLMs) have achieved impressive…

Robotics · Computer Science 2026-04-10 Peiran Xu , Jiaqi Zheng , Yadong Mu

We propose a multi-agent framework for modeling artificial consciousness in large language models (LLMs), grounded in psychoanalytic theory. Our \textbf{Psychodynamic Model} simulates self-awareness, preconsciousness, and unconsciousness…

Computation and Language · Computer Science 2025-10-22 Sang Hun Kim , Jongmin Lee , Dongkyu Park , So Young Lee , Yosep Chong

The proliferation of large language models (LLMs) has accelerated the adoption of agent-based workflows, where multiple autonomous agents reason, invoke functions, and collaborate to compose complex data pipelines. However, current…

Databases · Computer Science 2025-12-15 Zoi Kaoudi , Ioana Giurgiu

Large Language Models (LLMs) have shown remarkable capabilities as autonomous agents, yet existing benchmarks either focus on single-agent tasks or are confined to narrow domains, failing to capture the dynamics of multi-agent coordination…

Multiagent Systems · Computer Science 2025-03-05 Kunlun Zhu , Hongyi Du , Zhaochen Hong , Xiaocheng Yang , Shuyi Guo , Zhe Wang , Zhenhailong Wang , Cheng Qian , Xiangru Tang , Heng Ji , Jiaxuan You

Multi-agent systems built on Large Language Models (LLMs) show exceptional promise for complex collaborative problem-solving, yet they face fundamental challenges stemming from context window limitations that impair memory consistency, role…

Artificial Intelligence · Computer Science 2026-01-13 Sizhe Yuen , Francisco Gomez Medina , Ting Su , Yali Du , Adam J. Sobey

Large language models (LLMs) have demonstrated remarkable capabilities across a range of text-generation tasks. However, LLMs still struggle with problems requiring multi-step decision-making and environmental feedback, such as online…

Artificial Intelligence · Computer Science 2025-02-18 Zhenfang Chen , Delin Chen , Rui Sun , Wenjun Liu , Chuang Gan

Recent research looks to harness the general knowledge and reasoning of large language models (LLMs) into agents that accomplish user-specified goals in interactive environments. Vision-language models (VLMs) extend LLMs to multi-modal data…

Machine Learning · Computer Science 2025-05-07 Jake Grigsby , Yuke Zhu , Michael Ryoo , Juan Carlos Niebles

Large Language Model (LLM)-empowered multi-agent systems extend the cognitive boundaries of individual agents through disciplined collaboration and interaction, while constructing these systems often requires labor-intensive manual designs.…

Machine Learning · Computer Science 2025-06-10 Guibin Zhang , Luyang Niu , Junfeng Fang , Kun Wang , Lei Bai , Xiang Wang

Large language models (LLMs) such as GPT and Gemini have demonstrated remarkable capabilities in contextual understanding and reasoning. The strong performance of LLMs has sparked growing interest in leveraging them to automate tasks…

Artificial Intelligence · Computer Science 2026-03-10 Ziheng Geng , Jiachen Liu , Ran Cao , Lu Cheng , Dan M. Frangopol , Minghui Cheng

Large Language Models (LLMs) have achieved impressive results in knowledge-based Visual Question Answering (VQA). However existing methods still have challenges: the inability to use external tools autonomously, and the inability to work in…

Computation and Language · Computer Science 2025-08-08 Zhongjian Hu , Peng Yang , Bing Li , Zhenqi Wang
‹ Prev 1 2 3 10 Next ›