English
Related papers

Related papers: Adaptive Multimodal Agents-Based Framework for Aut…

200 papers

Multi-agent frameworks powered by large language models (LLMs) have demonstrated great success in automated planning and task execution. However, the effective adjustment of agentic workflows during execution has not been well studied. An…

Artificial Intelligence · Computer Science 2025-02-25 Boye Niu , Yiliao Song , Kai Lian , Yifan Shen , Yu Yao , Kun Zhang , Tongliang Liu

With the advancement of Multimodal Large Language Models (MLLM), LLM-driven visual agents are increasingly impacting software interfaces, particularly those with graphical user interfaces. This work introduces a novel LLM-based multimodal…

Human-Computer Interaction · Computer Science 2025-09-18 Yanda Li , Chi Zhang , Wenjia Jiang , Wanqi Yang , Bin Fu , Pei Cheng , Xin Chen , Ling Chen , Yunchao Wei

Large language model (LLM) agents have shown increasing promise for collaborative task completion. However, existing multi-agent frameworks often rely on static workflows, fixed roles, and limited inter-agent communication, reducing their…

Multiagent Systems · Computer Science 2026-02-13 Chengxuan Xia , Qianye Wu , Sixuan Tian , Yilun Hao

Autonomous agents powered by large language models (LLMs) have shown impressive capabilities in tool manipulation for complex task-solving. However, existing paradigms such as ReAct rely on sequential reasoning and execution, failing to…

Artificial Intelligence · Computer Science 2025-10-30 Jiaqi Wu , Qinlao Zhao , Zefeng Chen , Kai Qin , Yifei Zhao , Xueqian Wang , Yuhang Yao

Recent progress in large language model (LLM)-based multi-agent collaboration highlights the power of structured communication in enabling collective intelligence. However, existing methods largely rely on static or graph-based inter-agent…

Artificial Intelligence · Computer Science 2025-11-04 Song Wang , Zhen Tan , Zihan Chen , Shuang Zhou , Tianlong Chen , Jundong Li

This paper proposes a highly robust autonomous agent framework based on the ReAct paradigm, designed to solve complex tasks through adaptive decision making and multi-agent collaboration. Unlike traditional frameworks that rely on fixed…

Multiagent Systems · Computer Science 2025-04-09 Zihao Wu

Large language model (LLM)-based systems are becoming increasingly popular for solving tasks by constructing executable workflows that interleave LLM calls, information retrieval, tool use, code execution, memory updates, and verification.…

Artificial Intelligence · Computer Science 2026-03-25 Ling Yue , Kushal Raj Bhandari , Ching-Yun Ko , Dhaval Patel , Shuxin Lin , Nianjun Zhou , Jianxi Gao , Pin-Yu Chen , Shaowu Pan

The efficiency of multi-agent systems driven by large language models (LLMs) largely hinges on their communication topology. However, designing an optimal topology is a non-trivial challenge, as it requires balancing competing objectives…

Computation and Language · Computer Science 2026-05-19 Eric Hanchen Jiang , Mengting Li , Guancheng Wan , Sophia Yin , Yuchen Wu , Xiao Liang , Xinfeng Li , Yizhou Sun , Wei Wang , Kai-Wei Chang , Ying Nian Wu

AI agents are increasingly used to solve complex, multi-step tasks, but existing multi-agent frameworks remain brittle as workflows grow in scale and depth. Small errors at intermediate stages can propagate through agent interactions, while…

Artificial Intelligence · Computer Science 2026-05-26 Andy Xu , Yu-Wing Tai

Contemporary multi-agent systems encounter persistent challenges in cross-platform interoperability, dynamic task scheduling, and efficient resource sharing. Agents with heterogeneous implementations often lack standardized interfaces;…

Artificial Intelligence · Computer Science 2025-07-08 Yuyang Cheng , Yumiao Xu , Chaojia Yu , Yong Zhao

As multimodal LLM-driven agents advance in autonomy and generalization, traditional static datasets face inherent scalability limitations and are insufficient for fully assessing their capabilities in increasingly complex and diverse tasks.…

Computation and Language · Computer Science 2026-03-06 Yurun Chen , Xavier Hu , Yuhan Liu , Ziqi Wang , Zeyi Liao , Lin Chen , Feng Wei , Yuxi Qian , Bo Zheng , Keting Yin , Shengyu Zhang

Robotic platforms have become essential for marine operations by providing regular and continuous access to offshore assets, such as underwater infrastructure inspection, environmental monitoring, and resource exploration. However, the…

Retrieval-Augmented Generation (RAG) enhances Large Language Models (LLMs) by incorporating external, domain-specific data into the generative process. While LLMs are highly capable, they often rely on static, pre-trained datasets, limiting…

Artificial Intelligence · Computer Science 2024-12-10 Aniruddha Salve , Saba Attar , Mahesh Deshmukh , Sayali Shivpuje , Arnab Mitra Utsab

Autonomous agents driven by Large Language Models (LLMs) offer enormous potential for automation. Early proof of this technology can be found in various demonstrations of agents solving complex tasks, interacting with external systems to…

Large Language Model (LLM)-based agents demonstrate strong reasoning and execution capabilities on complex tasks when guided by structured instructions, commonly referred to as workflows. However, existing workflow-assisted agent serving…

Machine Learning · Computer Science 2026-05-22 Ao Li , Shangpeng Yang , Fahao Chen , Tianheng Xu , Peng Li , Zhou Su

Traditional data processing pipelines are typically static and handcrafted for specific tasks, limiting their adaptability to evolving requirements. While general-purpose agents and coding assistants can generate code for well-understood…

Artificial Intelligence · Computer Science 2026-02-20 Udayan Khurana

Multi-agent systems provide a powerful way to extend large language models (LLMs) by decomposing a complex task into specialized subtasks handled by different agents. However, their performance is often hindered by error propagation,…

Machine Learning · Computer Science 2026-05-14 Zheng Wang , Yuang Liu , Yangkai Ding

Graphical User Interface (GUI) agents possess significant commercial and social value, and GUI agents powered by advanced multimodal large language models (MLLMs) have demonstrated remarkable potential. Currently, existing GUI agents…

Artificial Intelligence · Computer Science 2025-09-05 Weizhi Chen , Ziwei Wang , Leyang Yang , Sheng Zhou , Xiaoxuan Tang , Jiajun Bu , Yong Li , Wei Jiang

Although large language models (LLMs) have revolutionized natural language processing capabilities, their practical implementation as autonomous multi-agent systems (MAS) for industrial problem-solving encounters persistent barriers.…

Computation and Language · Computer Science 2025-10-30 Hui Yi Leong , Yuheng Li , Yuqing Wu , Wenwen Ouyang , Wei Zhu , Jiechao Gao , Wei Han

As large language models from diverse providers converge toward comparable benchmark performance, the traditional paradigm of selecting a single best model per task yields diminishing returns. We argue that orchestration topology -- the…

Multiagent Systems · Computer Science 2026-02-20 Geunbin Yu
‹ Prev 1 2 3 10 Next ›