Related papers: AXIS: Efficient Human-Agent-Computer Interaction w…

AppAgentX: Evolving GUI Agents as Proficient Smartphone Users

Recent advancements in Large Language Models (LLMs) have led to the development of intelligent LLM-based agents capable of interacting with graphical user interfaces (GUIs). These agents demonstrate strong reasoning and adaptability,…

Artificial Intelligence · Computer Science 2025-04-16 Wenjia Jiang, Yangyang Zhuang, Chenxi Song, Xu Yang, Joey Tianyi Zhou, Chi Zhang

LEXI: Large Language Models Experimentation Interface

The recent developments in Large Language Models (LLM) mark a significant moment in the research and development of social interactions with artificial agents. These agents are widely deployed in a variety of settings, with potential…

Human-Computer Interaction · Computer Science 2024-07-03 Guy Laban, Tomer Laban, Hatice Gunes

API Agents vs. GUI Agents: Divergence and Convergence

Large language models (LLMs) have evolved beyond simple text generation to power software agents that directly translate natural language commands into tangible actions. While API-based LLM agents initially rose to prominence for their…

Artificial Intelligence · Computer Science 2025-06-24 Chaoyun Zhang, Shilin He, Liqun Li, Si Qin, Yu Kang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang

Efficient LLM Serving for Agentic Workflows: A Data Systems Perspective

Agentic workflows are composed of sequences of interdependent Large Language Model (LLM) calls, and they have become a dominant workload in modern AI systems. These workflows exhibit extensive redundancy from overlapping prompts and…

Multiagent Systems · Computer Science 2026-03-18 Noppanat Wadlom, Junyi Shen, Yao Lu

LLM-Based Human-Agent Collaboration and Interaction Systems: A Survey

Recent advances in large language models (LLMs) have sparked growing interest in building fully autonomous agents. However, fully autonomous LLM-based agents still face significant challenges, including limited reliability due to…

Computation and Language · Computer Science 2026-05-07 Henry Peng Zou, Wei-Chieh Huang, Yaozu Wu, Jizhou Guo, Yankai Chen, Chunyu Miao, Hoang Nguyen, Yue Zhou, Weizhi Zhang, Liancheng Fang, Hanrong Zhang, Fangxin Wang, Pengfei Zhang, Huacan Wang, Langzhou He, Yangning Li, Dongyuan Li, Renhe Jiang, Xue Liu, Philip S. Yu

AppAgent v2: Advanced Agent for Flexible Mobile Interactions

With the advancement of Multimodal Large Language Models (MLLM), LLM-driven visual agents are increasingly impacting software interfaces, particularly those with graphical user interfaces. This work introduces a novel LLM-based multimodal…

Human-Computer Interaction · Computer Science 2025-09-18 Yanda Li, Chi Zhang, Wenjia Jiang, Wanqi Yang, Bin Fu, Pei Cheng, Xin Chen, Ling Chen, Yunchao Wei

Build the web for agents, not agents for the web

Recent advancements in Large Language Models (LLMs) and multimodal counterparts have spurred significant interest in developing web agents -- AI systems capable of autonomously navigating and completing tasks within web environments. While…

Machine Learning · Computer Science 2025-06-13 Xing Han Lù, Gaurav Kamath, Marius Mosbach, Siva Reddy

Aegis: Taxonomy and Optimizations for Overcoming Agent-Environment Failures in LLM Agents

Large Language Model (LLM) agents augmented with domain tools promise to autonomously execute complex tasks requiring human-level intelligence, such as customer service and digital assistance. However, their practical deployment is often…

Multiagent Systems · Computer Science 2025-08-28 Kevin Song, Anand Jayarajan, Yaoyao Ding, Qidong Su, Zhanda Zhu, Sihang Liu, Gennady Pekhimenko

Multi-Agent Autonomous Driving Systems with Large Language Models: A Survey of Recent Advances

Autonomous Driving Systems (ADSs) are revolutionizing transportation by reducing human intervention, improving operational efficiency, and enhancing safety. Large Language Models (LLMs) have been integrated into ADSs to support high-level…

Multiagent Systems · Computer Science 2025-10-15 Yaozu Wu, Dongyuan Li, Yankai Chen, Renhe Jiang, Henry Peng Zou, Wei-Chieh Huang, Yangning Li, Liancheng Fang, Zhen Wang, Philip S. Yu

ASSISTGUI: Task-Oriented Desktop Graphical User Interface Automation

Graphical User Interface (GUI) automation holds significant promise for assisting users with complex tasks, thereby boosting human productivity. Existing works leveraging Large Language Model (LLM) or LLM-based AI agents have shown…

Computer Vision and Pattern Recognition · Computer Science 2024-01-02 Difei Gao, Lei Ji, Zechen Bai, Mingyu Ouyang, Peiran Li, Dongxing Mao, Qinchen Wu, Weichen Zhang, Peiyi Wang, Xiangwu Guo, Hengxu Wang, Luowei Zhou, Mike Zheng Shou

LLM-Powered AI Agent Systems and Their Applications in Industry

The emergence of Large Language Models (LLMs) has reshaped agent systems. Unlike traditional rule-based agents with limited task scope, LLM-powered agents offer greater flexibility, cross-domain reasoning, and natural language interaction.…

Artificial Intelligence · Computer Science 2026-05-05 Guannan Liang, Qianqian Tong

AppAgent: Multimodal Agents as Smartphone Users

Recent advancements in large language models (LLMs) have led to the creation of intelligent agents capable of performing complex tasks. This paper introduces a novel LLM-based multimodal agent framework designed to operate smartphone…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Chi Zhang, Zhao Yang, Jiaxuan Liu, Yucheng Han, Xin Chen, Zebiao Huang, Bin Fu, Gang Yu

Human-Centered LLM-Agent User Interface: A Position Paper

Large Language Model (LLM)-in-the-loop applications have been shown to effectively interpret the human user's commands, make plans, and operate external tools/systems accordingly. Still, the operation scope of the LLM agent is limited to…

Human-Computer Interaction · Computer Science 2024-09-24 Daniel Chin, Yuxuan Wang, Gus Xia

An LLM-based multi-agent framework for agile effort estimation

Effort estimation is a crucial activity in agile software development, where teams collaboratively review, discuss, and estimate the effort required to complete user stories in a product backlog. Current practices in agile effort estimation…

Software Engineering · Computer Science 2025-09-19 Thanh-Long Bui, Hoa Khanh Dam, Rashina Hoda

TheAgentCompany: Benchmarking LLM Agents on Consequential Real World Tasks

We interact with computers on an everyday basis, be it in everyday life or work, and many aspects of work can be done entirely with access to a computer and the Internet. At the same time, thanks to improvements in large language models…

Computation and Language · Computer Science 2025-09-11 Frank F. Xu, Yufan Song, Boxuan Li, Yuxuan Tang, Kritanjali Jain, Mengxue Bao, Zora Z. Wang, Xuhui Zhou, Zhitong Guo, Murong Cao, Mingyang Yang, Hao Yang Lu, Amaad Martin, Zhe Su, Leander Maben, Raj Mehta, Wayne Chi, Lawrence Jang, Yiqing Xie, Shuyan Zhou, Graham Neubig

Multi-Agent Collaboration Mechanisms: A Survey of LLMs

With recent advances in Large Language Models (LLMs), Agentic AI has become phenomenal in real-world applications, moving toward multiple LLM-based agents to perceive, learn, reason, and act collaboratively. These LLM-based Multi-Agent…

Artificial Intelligence · Computer Science 2025-01-14 Khanh-Tung Tran, Dung Dao, Minh-Duong Nguyen, Quoc-Viet Pham, Barry O'Sullivan, Hoang D. Nguyen

BMW Agents -- A Framework For Task Automation Through Multi-Agent Collaboration

Autonomous agents driven by Large Language Models (LLMs) offer enormous potential for automation. Early proof of this technology can be found in various demonstrations of agents solving complex tasks, interacting with external systems to…

Multiagent Systems · Computer Science 2024-07-03 Noel Crawford, Edward B. Duffy, Iman Evazzade, Torsten Foehr, Gregory Robbins, Debbrata Kumar Saha, Jiya Varma, Marcin Ziolkowski

From Human-Human Collaboration to Human-Agent Collaboration: A Vision, Design Philosophy, and an Empirical Framework for Achieving Successful Partnerships Between Humans and LLM Agents

The emergence of Large Language Model (LLM) agents enables us to build agent-based intelligent systems that move beyond the role of a "tool" to become genuine collaborators with humans, thereby realizing a novel human-agent collaboration…

Human-Computer Interaction · Computer Science 2026-02-06 Bingsheng Yao, Chaoran Chen, April Yi Wang, Sherry Tongshuang Wu, Toby Jia-jun Li, Dakuo Wang

Creating an LLM-based AI-agent: A high-level methodology towards enhancing LLMs with APIs

Large Language Models (LLMs) have revolutionized various aspects of engineering and science. Their utility is often bottlenecked by the lack of interaction with the external digital environment. To overcome this limitation and achieve…

Software Engineering · Computer Science 2024-12-24 Ioannis Tzachristas

Efficient Agents: Building Effective Agents While Reducing Cost

The remarkable capabilities of Large Language Model (LLM)-driven agents have enabled sophisticated systems to tackle complex, multi-step tasks, but their escalating costs threaten scalability and accessibility. This work presents the first…

Artificial Intelligence · Computer Science 2025-08-06 Ningning Wang, Xavier Hu, Pai Liu, He Zhu, Yue Hou, Heyuan Huang, Shengyu Zhang, Jian Yang, Jiaheng Liu, Ge Zhang, Changwang Zhang, Jun Wang, Yuchen Eleanor Jiang, Wangchunshu Zhou