Related papers: Agent-based code generation for the Gammapy framew…

Guided Code Generation with LLMs: A Multi-Agent Framework for Complex Code Tasks

Large Language Models (LLMs) have shown remarkable capabilities in code generation tasks, yet they face significant limitations in handling complex, long-context programming challenges and demonstrating complex compositional reasoning…

Artificial Intelligence · Computer Science 2025-01-14 Amr Almorsi , Mohanned Ahmed , Walid Gomaa

An Agent-Based Framework for the Automatic Validation of Mathematical Optimization Models

Recently, using Large Language Models (LLMs) to generate optimization models from natural language descriptions has became increasingly popular. However, a major open question is how to validate that the generated models are correct and…

Artificial Intelligence · Computer Science 2026-04-07 Alexander Zadorojniy , Segev Wasserkrug , Eitan Farchi

Agents: An Open-source Framework for Autonomous Language Agents

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language…

Computation and Language · Computer Science 2023-12-13 Wangchunshu Zhou , Yuchen Eleanor Jiang , Long Li , Jialong Wu , Tiannan Wang , Shi Qiu , Jintian Zhang , Jing Chen , Ruipu Wu , Shuai Wang , Shiding Zhu , Jiyu Chen , Wentao Zhang , Xiangru Tang , Ningyu Zhang , Huajun Chen , Peng Cui , Mrinmaya Sachan

Formal-LLM: Integrating Formal Language and Natural Language for Controllable LLM-based Agents

Recent advancements on Large Language Models (LLMs) enable AI Agents to automatically generate and execute multi-step plans to solve complex tasks. However, since LLM's content generation process is hardly controllable, current LLM-based…

Machine Learning · Computer Science 2024-08-13 Zelong Li , Wenyue Hua , Hao Wang , He Zhu , Yongfeng Zhang

CodeVisionary: An Agent-based Framework for Evaluating Large Language Models in Code Generation

Large language models (LLMs) have demonstrated strong capabilities in code generation, underscoring the critical need for rigorous and comprehensive evaluation. Existing evaluation approaches fall into three categories, including…

Software Engineering · Computer Science 2025-10-21 Xinchen Wang , Pengfei Gao , Chao Peng , Ruida Hu , Cuiyun Gao

Towards Advancing Code Generation with Large Language Models: A Research Roadmap

Recently, we have witnessed the rapid development of large language models, which have demonstrated excellent capabilities in the downstream task of code generation. However, despite their potential, LLM-based code generation still faces…

Software Engineering · Computer Science 2025-01-22 Haolin Jin , Huaming Chen , Qinghua Lu , Liming Zhu

AgentCoder: Multi-Agent-based Code Generation with Iterative Testing and Optimisation

The advancement of natural language processing (NLP) has been significantly boosted by the development of transformer-based large language models (LLMs). These models have revolutionized NLP tasks, particularly in code generation, aiding…

Computation and Language · Computer Science 2024-05-27 Dong Huang , Jie M. Zhang , Michael Luck , Qingwen Bu , Yuhao Qing , Heming Cui

SWT-Bench: Testing and Validating Real-World Bug-Fixes with Code Agents

Rigorous software testing is crucial for developing and maintaining high-quality code, making automated test generation a promising avenue for both improving software quality and boosting the effectiveness of code generation methods.…

Software Engineering · Computer Science 2025-02-10 Niels Mündler , Mark Niklas Müller , Jingxuan He , Martin Vechev

ModelScope-Agent: Building Your Customizable Agent System with Open-source Large Language Models

Large language models (LLMs) have recently demonstrated remarkable capabilities to comprehend human intentions, engage in reasoning, and design planning-like behavior. To further unleash the power of LLMs to accomplish complex tasks, there…

Computation and Language · Computer Science 2023-09-06 Chenliang Li , Hehong Chen , Ming Yan , Weizhou Shen , Haiyang Xu , Zhikai Wu , Zhicheng Zhang , Wenmeng Zhou , Yingda Chen , Chen Cheng , Hongzhu Shi , Ji Zhang , Fei Huang , Jingren Zhou

An Agentic Framework for Autonomous Materials Computation

Large Language Models (LLMs) have emerged as powerful tools for accelerating scientific discovery, yet their static knowledge and hallucination issues hinder autonomous research applications. Recent advances integrate LLMs into agentic…

Artificial Intelligence · Computer Science 2025-12-23 Zeyu Xia , Jinzhe Ma , Congjie Zheng , Shufei Zhang , Yuqiang Li , Hang Su , P. Hu , Changshui Zhang , Xingao Gong , Wanli Ouyang , Lei Bai , Dongzhan Zhou , Mao Su

Requirements Development and Formalization for Reliable Code Generation: A Multi-Agent Vision

Automated code generation has long been considered the holy grail of software engineering. The emergence of Large Language Models (LLMs) has catalyzed a revolutionary breakthrough in this area. However, existing methods that only rely on…

Software Engineering · Computer Science 2025-08-27 Xu Lu , Weisong Sun , Yiran Zhang , Ming Hu , Cong Tian , Zhi Jin , Yang Liu

A Survey on Code Generation with LLM-based Agents

Code generation agents powered by large language models (LLMs) are revolutionizing the software development paradigm. Distinct from previous code generation techniques, code generation agents are characterized by three core features. 1)…

Software Engineering · Computer Science 2025-10-01 Yihong Dong , Xue Jiang , Jiaru Qian , Tian Wang , Kechi Zhang , Zhi Jin , Ge Li

A Framework for Testing and Adapting REST APIs as LLM Tools

Large Language Models (LLMs) are increasingly used to build autonomous agents that perform complex tasks with external tools, often exposed through APIs in enterprise systems. Direct use of these APIs is difficult due to the complex input…

Software Engineering · Computer Science 2025-09-15 Jayachandu Bandlamudi , Ritwik Chaudhuri , Neelamadhav Gantayat , Sambit Ghosh , Kushal Mukherjee , Prerna Agarwal , Renuka Sindhgatta , Sameep Mehta

LLM Agents Making Agent Tools

Tool use has turned large language models (LLMs) into powerful agents that can perform complex multi-step tasks by dynamically utilising external software components. However, these tools must be implemented in advance by human developers,…

Computation and Language · Computer Science 2025-06-02 Georg Wölflein , Dyke Ferber , Daniel Truhn , Ognjen Arandjelović , Jakob Nikolas Kather

From Code Foundation Models to Agents and Applications: A Comprehensive Survey and Practical Guide to Code Intelligence

Large language models (LLMs) have fundamentally transformed automated software development by enabling direct translation of natural language descriptions into functional code, driving commercial adoption through tools like Github Copilot…

Software Engineering · Computer Science 2025-12-09 Jian Yang , Xianglong Liu , Weifeng Lv , Ken Deng , Shawn Guo , Lin Jing , Yizhi Li , Shark Liu , Xianzhen Luo , Yuyu Luo , Changzai Pan , Ensheng Shi , Yingshui Tan , Renshuai Tao , Jiajun Wu , Xianjie Wu , Zhenhe Wu , Daoguang Zan , Chenchen Zhang , Wei Zhang , He Zhu , Terry Yue Zhuo , Kerui Cao , Xianfu Cheng , Jun Dong , Shengjie Fang , Zhiwei Fei , Xiangyuan Guan , Qipeng Guo , Zhiguang Han , Joseph James , Tianqi Luo , Renyuan Li , Yuhang Li , Yiming Liang , Congnan Liu , Jiaheng Liu , Qian Liu , Ruitong Liu , Tyler Loakman , Xiangxin Meng , Chuang Peng , Tianhao Peng , Jiajun Shi , Mingjie Tang , Boyang Wang , Haowen Wang , Yunli Wang , Fanglin Xu , Zihan Xu , Fei Yuan , Ge Zhang , Jiayi Zhang , Xinhao Zhang , Wangchunshu Zhou , Hualei Zhu , King Zhu , Bryan Dai , Aishan Liu , Zhoujun Li , Chenghua Lin , Tianyu Liu , Chao Peng , Kai Shen , Libo Qin , Shuangyong Song , Zizheng Zhan , Jiajun Zhang , Jie Zhang , Zhaoxiang Zhang , Bo Zheng

Every Software as an Agent: Blueprint and Case Study

The rise of (multimodal) large language models (LLMs) has shed light on software agent -- where software can understand and follow user instructions in natural language. However, existing approaches such as API-based and GUI-based agents…

Software Engineering · Computer Science 2025-02-10 Mengwei Xu

Simulation Agent: A Framework for Integrating Simulation and Large Language Models for Enhanced Decision-Making

Simulations, although powerful in accurately replicating real-world systems, often remain inaccessible to non-technical users due to their complexity. Conversely, large language models (LLMs) provide intuitive, language-based interactions…

Computation and Language · Computer Science 2025-05-22 Jacob Kleiman , Kevin Frank , Joseph Voyles , Sindy Campagna

AI Agentic Programming: A Survey of Techniques, Challenges, and Opportunities

AI agentic programming is an emerging paradigm where large language model (LLM)-based coding agents autonomously plan, execute, and interact with tools such as compilers, debuggers, and version control systems. Unlike conventional code…

Software Engineering · Computer Science 2025-09-16 Huanting Wang , Jingzhi Gong , Huawei Zhang , Jie Xu , Zheng Wang

AgentSims: An Open-Source Sandbox for Large Language Model Evaluation

With ChatGPT-like large language models (LLM) prevailing in the community, how to evaluate the ability of LLMs is an open question. Existing evaluation methods suffer from following shortcomings: (1) constrained evaluation abilities, (2)…

Artificial Intelligence · Computer Science 2023-08-09 Jiaju Lin , Haoran Zhao , Aochi Zhang , Yiting Wu , Huqiuyue Ping , Qin Chen

A Tool for Test Case Scenarios Generation Using Large Language Models

Large Language Models (LLMs) are widely used in Software Engineering (SE) for various tasks, including generating code, designing and documenting software, adding code comments, reviewing code, and writing test scripts. However, creating…

Software Engineering · Computer Science 2024-06-12 Abdul Malik Sami , Zeeshan Rasheed , Muhammad Waseem , Zheying Zhang , Herda Tomas , Pekka Abrahamsson