English
Related papers

Related papers: Executable Code Actions Elicit Better LLM Agents

200 papers

Large Language Model (LLM) Agents have recently garnered increasing interest yet they are limited in their ability to learn from trial and error, a key element of intelligent behavior. In this work, we argue that the capacity to learn new…

Artificial Intelligence · Computer Science 2024-08-09 Haiteng Zhao , Chang Ma , Guoyin Wang , Jing Su , Lingpeng Kong , Jingjing Xu , Zhi-Hong Deng , Hongxia Yang

Based on their superior comprehension and reasoning capabilities, Large Language Model (LLM) driven agent frameworks have achieved significant success in numerous complex reasoning tasks. ReAct-like agents can solve various intricate…

Artificial Intelligence · Computer Science 2025-01-14 Guozhi Yuan , Youfeng Liu , Jingli Yang , Wei Jia , Kai Lin , Yansong Gao , Shan He , Zilin Ding , Haitao Li

Tool learning empowers large language models (LLMs) as agents to use external tools and extend their utility. Existing methods employ one single LLM-based agent to iteratively select and execute tools, thereafter incorporating execution…

Computation and Language · Computer Science 2024-06-25 Zhengliang Shi , Shen Gao , Xiuyi Chen , Yue Feng , Lingyong Yan , Haibo Shi , Dawei Yin , Pengjie Ren , Suzan Verberne , Zhaochun Ren

Intelligent agent systems based on Large Language Models (LLMs) have shown great potential in real-world applications. However, existing agent frameworks still face critical limitations in task planning and execution, restricting their…

Information Retrieval · Computer Science 2025-04-30 Junjie Chen , Haitao Li , Jingli Yang , Yiqun Liu , Qingyao Ai

Large Language Models (LLMs) have shown promise in automated code generation but typically excel only in simpler tasks such as generating standalone code units. Real-world software development, however, often involves complex code…

Software Engineering · Computer Science 2024-08-12 Kechi Zhang , Jia Li , Ge Li , Xianjie Shi , Zhi Jin

Existing LLMs exhibit remarkable performance on various NLP tasks, but still struggle with complex real-world tasks, even equipped with advanced strategies like CoT and ReAct. In this work, we propose the CoAct framework, which transfers…

Computation and Language · Computer Science 2024-06-21 Xinming Hou , Mingming Yang , Wenxiang Jiao , Xing Wang , Zhaopeng Tu , Wayne Xin Zhao

Code translation transforms code between programming languages while preserving functionality, which is critical in software development and maintenance. While traditional learning-based code translation methods have limited effectiveness…

Software Engineering · Computer Science 2026-04-08 Zhiqiang Yuan , Weitong Chen , Hanlin Wang , Xin Peng , Zhenpeng Chen , Yiling Lou

Large Language Models (LLMs) excel in traditional natural language processing tasks but struggle with problems that require complex domain-specific calculations or simulations. While equipping LLMs with external tools to build LLM-based…

Software Engineering · Computer Science 2025-06-11 Bohan Lyu , Xin Cong , Heyang Yu , Pan Yang , Yujia Qin , Yining Ye , Yaxi Lu , Zhong Zhang , Yukun Yan , Yankai Lin , Zhiyuan Liu , Maosong Sun

Code generation models based on large language models (LLMs) have gained wide adoption, but challenges remain in ensuring safety, accuracy, and controllability, especially for complex tasks. Existing methods often lack dynamic integration…

Software Engineering · Computer Science 2025-10-13 Aofan Liu , Haoxuan Li , Bin Wang , Ao Yang , Hui Li

Recent advancements have enabled Large Language Models (LLMs) to function as agents that can perform actions using external tools. This requires registering, i.e., integrating tool information into the LLM context prior to taking actions.…

Humans solve problems by executing targeted plans, yet large language models (LLMs) remain unreliable for structured workflow execution. We propose RunAgent, a multi-agent plan execution platform that interprets natural-language plans while…

Machine Learning · Computer Science 2026-05-04 Arunabh Srivastava , Mohammad A. , Khojastepour , Srimat Chakradhar , Sennur Ulukus

Tool use has turned large language models (LLMs) into powerful agents that can perform complex multi-step tasks by dynamically utilising external software components. However, these tools must be implemented in advance by human developers,…

Computation and Language · Computer Science 2025-06-02 Georg Wölflein , Dyke Ferber , Daniel Truhn , Ognjen Arandjelović , Jakob Nikolas Kather

In this paper we introduce ResearchCodeAgent, a novel multi-agent system leveraging large language models (LLMs) agents to automate the codification of research methodologies described in machine learning literature. The system bridges the…

Software Engineering · Computer Science 2025-05-06 Shubham Gandhi , Dhruv Shah , Manasi Patwardhan , Lovekesh Vig , Gautam Shroff

Code generation aims to produce code that fulfills requirements written in natural languages automatically. Large language Models (LLMs) like ChatGPT have demonstrated promising effectiveness in this area. Nonetheless, these LLMs often fail…

Software Engineering · Computer Science 2025-01-15 Ruwei Pan , Hongyu Zhang , Chao Liu

The exceptional capabilities of large language models (LLMs) have substantially accelerated the rapid rise and widespread adoption of agents. Recent studies have demonstrated that generating Python code to consolidate LLM-based agents'…

Software Engineering · Computer Science 2024-12-20 Ziyi Ni , Yifan Li , Daxiang Dong

Large Language Models (LLMs) have demonstrated considerable potential in improving coding education by providing support for code writing, explanation, and debugging. However, existing LLM-based approaches generally fail to assess students'…

Multiagent Systems · Computer Science 2025-07-21 Jianing Zhao , Peng Gao , Jiannong Cao , Zhiyuan Wen , Chen Chen , Jianing Yin , Ruosong Yang , Bo Yuan

Agents are LLM-driven components that can mutate environments in powerful, arbitrary ways. Extracting guarantees for the execution of agents in production environments can be challenging due to asynchrony and failures. In this paper, we…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-04-10 Mahesh Balakrishnan , Ashwin Bharambe , Davide Testuggine , David Geraghty , David Mao , Vidhya Venkat , Ilya Mironov , Rithesh Baradi , Gayathri Aiyer , Victoria Dudin

Autonomous agents that operate computers via Graphical User Interfaces (GUIs) often struggle with efficiency and reliability on complex, long-horizon tasks. While augmenting these agents with planners can improve task decomposition, they…

Computation and Language · Computer Science 2026-02-23 Linxin Song , Yutong Dai , Viraj Prabhu , Jieyu Zhang , Taiwei Shi , Li Li , Junnan Li , Silvio Savarese , Zeyuan Chen , Jieyu Zhao , Ran Xu , Caiming Xiong

LLMs excel at code generation from English prompts, but this progress has not extended to low-resource languages. We address Bangla-to-Python code generation by introducing BanglaCodeAct, an agent-based framework that leverages multi-agent…

Computation and Language · Computer Science 2026-01-01 Jahidul Islam , Md Ataullha , Saiful Azad

Effective prompt design is essential for improving the planning capabilities of large language model (LLM)-driven agents. However, existing structured prompting strategies are typically limited to single-agent, plan-only settings, and often…

Artificial Intelligence · Computer Science 2025-07-08 Bruce Yang , Xinfeng He , Huan Gao , Yifan Cao , Xiaofan Li , David Hsu
‹ Prev 1 2 3 10 Next ›