Related papers: CoRe-Code: Collaborative Reinforcement Learning fo…

CollabCoder: Plan-Code Co-Evolution via Collaborative Decision-Making for Efficient Code Generation

Automated code generation remains a persistent challenge in software engineering, as conventional multi-agent frameworks are often constrained by static planning, isolated execution, high computational overhead, and limited adaptability to…

Software Engineering · Computer Science 2026-04-21 Duy Tung Doan, Quang Huy Phung, Dzung Nguyen, Khac-Hoai Nam Bui

CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation

Code generation aims to automatically produce code that fulfills requirements written in natural language. Large Language Models (LLMs) like ChatGPT have demonstrated promising effectiveness in this area. Nonetheless, these LLMs often fail…

Software Engineering · Computer Science 2025-01-15 Ruwei Pan, Hongyu Zhang, Chao Liu

Co-Learning: Code Learning for Multi-Agent Reinforcement Collaborative Framework with Conversational Natural Language Interfaces

Online question-and-answer (Q&A) systems based on Large Language Models (LLMs) have progressively shifted from recreational to professional use. This paper proposes a multi-agent framework with environmentally reinforcement learning…

Software Engineering · Computer Science 2024-09-05 Jiapeng Yu, Yuqian Wu, Yajing Zhan, Wenhao Guo, Zhou Xu, Raymond Lee

Reinforcement Learning-Augmented LLM Agents for Collaborative Decision Making and Performance Optimization

Large Language Models (LLMs) perform well in language tasks but often lack collaborative awareness and struggle to optimize global performance in multi-agent settings. We present a reinforcement learning-augmented LLM agent framework that…

Artificial Intelligence · Computer Science 2026-01-01 Dong Qiu, Duo Xu, Limengxi Yue

CORD: Generalizable Cooperation via Role Diversity

Cooperative multi-agent reinforcement learning (MARL) aims to develop agents that can collaborate effectively. However, most cooperative MARL methods overfit to their training agents, so learned policies do not generalize well to unseen…

Artificial Intelligence · Computer Science 2025-01-13 Kanefumi Matsuyama, Kefan Su, Jiangxing Wang, Deheng Ye, Zongqing Lu

LLM Collaboration With Multi-Agent Reinforcement Learning

A large amount of work has been done in Multi-Agent Systems (MAS) for modeling and solving problems with multiple interacting agents. However, most LLMs are pretrained independently and not specifically optimized for coordination. Existing…

Artificial Intelligence · Computer Science 2025-12-10 Shuo Liu, Tianle Chen, Zeyu Liang, Xueguang Lyu, Christopher Amato

Solving Math Word Problems via Cooperative Reasoning induced Language Models

Large-scale pre-trained language models (PLMs) bring new opportunities to challenging problems, especially those that need high-level intelligence, such as math word problems (MWPs). However, directly applying existing PLMs to MWPs can…

Computation and Language · Computer Science 2024-01-01 Xinyu Zhu, Junjie Wang, Lin Zhang, Yuxiang Zhang, Ruyi Gan, Jiaxing Zhang, Yujiu Yang

Self-collaboration Code Generation via ChatGPT

Although Large Language Models (LLMs) have demonstrated remarkable code-generation ability, they still struggle with complex tasks. In real-world software development, humans usually tackle complex tasks through collaborative teamwork, a…

Software Engineering · Computer Science 2024-05-14 Yihong Dong, Xue Jiang, Zhi Jin, Ge Li

What to Retrieve for Effective Retrieval-Augmented Code Generation? An Empirical Study and Beyond

Repository-level code generation remains challenging due to complex code dependencies and the limitations of large language models (LLMs) in processing long contexts. While retrieval-augmented generation (RAG) frameworks are widely adopted,…

Software Engineering · Computer Science 2025-03-27 Wenchao Gu, Juntao Chen, Yanlin Wang, Tianyue Jiang, Xingzhe Li, Mingwei Liu, Xilin Liu, Yuchi Ma, Zibin Zheng

Enhancing LLM-based Quantum Code Generation with Multi-Agent Optimization and Quantum Error Correction

Multi-agent frameworks with Large Language Models (LLMs) have become promising tools for generating general-purpose programming languages using test-driven development, allowing developers to create more accurate and robust code. However,…

Quantum Physics · Physics 2025-07-04 Charlie Campbell, Hao Mark Chen, Wayne Luk, Hongxiang Fan

Efficient LLM Collaboration via Planning

Recently, large language models (LLMs) have demonstrated strong performance, ranging from simple to complex tasks. However, while large models achieve remarkable results across diverse tasks, they often incur substantial monetary inference…

Artificial Intelligence · Computer Science 2026-05-12 Byeongchan Lee, Jonghoon Lee, Dongyoung Kim, Jaehyung Kim, Kyungjoon Park, Dongjun Lee, Jinwoo Shin

Agentic Code Optimization via Compiler-LLM Cooperation

Generating performant executables from high level languages is critical to software performance across a wide range of domains. Modern compilers perform this task by passing code through a series of well-studied optimizations at…

Programming Languages · Computer Science 2026-04-07 Benjamin Mikek, Danylo Vashchilenko, Bryan Lu, Panpan Xu

R1-Code-Interpreter: LLMs Reason with Code via Supervised and Multi-stage Reinforcement Learning

Practical guidance on training Large Language Models (LLMs) to leverage Code Interpreter across diverse tasks remains lacking. We present R1-Code-Interpreter, an extension of a text-only LLM trained via multi-turn supervised fine-tuning…

Artificial Intelligence · Computer Science 2026-03-05 Yongchao Chen, Yueying Liu, Junwei Zhou, Yilun Hao, Jingquan Wang, Yang Zhang, Na Li, Chuchu Fan

ReCode: Unify Plan and Action for Universal Granularity Control

Real-world tasks require decisions at varying granularities, and humans excel at this by leveraging a unified cognitive representation where planning is fundamentally understood as a high-level form of action. However, current Large…

Artificial Intelligence · Computer Science 2026-01-07 Zhaoyang Yu, Jiayi Zhang, Huixue Su, Yufan Zhao, Yifan Wu, Mingyi Deng, Jinyu Xiang, Yizhang Lin, Lingxiao Tang, Yuyu Luo, Bang Liu, Chenglin Wu

CodeTree: Agent-guided Tree Search for Code Generation with Large Language Models

Pre-trained on massive amounts of code and text data, large language models (LLMs) have demonstrated remarkable achievements in performing code generation tasks. With additional execution-based feedback, these models can act as agents with…

Computation and Language · Computer Science 2024-11-14 Jierui Li, Hung Le, Yingbo Zhou, Caiming Xiong, Silvio Savarese, Doyen Sahoo

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning

Reinforcement learning (RL) has emerged as a pivotal technique for fine-tuning large language models (LLMs) on specific tasks. However, prevailing RL fine-tuning methods predominantly rely on PPO and its variants. Though these algorithms…

Artificial Intelligence · Computer Science 2025-02-25 Hao Ma, Tianyi Hu, Zhiqiang Pu, Boyin Liu, Xiaolin Ai, Yanyan Liang, Min Chen

Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve

Recent studies show that LLMs possess different skills and specialize in different tasks. In fact, we observe that their performance varies at several levels of granularity. For example, in the code optimization task, code LLMs excel…

Artificial Intelligence · Computer Science 2025-10-24 Yuanzhe Liu, Ryan Deng, Tim Kaler, Xuhao Chen, Charles E. Leiserson, Yao Ma, Jie Chen

Cooperative Strategic Planning Enhances Reasoning Capabilities in Large Language Models

Enhancing the reasoning capabilities of large language models (LLMs) is crucial for enabling them to tackle complex, multi-step problems. Multi-agent frameworks have shown great potential in enhancing LLMs' reasoning capabilities. However,…

Artificial Intelligence · Computer Science 2024-10-29 Danqing Wang, Zhuorui Ye, Fei Fang, Lei Li

Enhancing LLM Code Generation: A Systematic Evaluation of Multi-Agent Collaboration and Runtime Debugging for Improved Accuracy, Reliability, and Latency

The use of large language models (LLMs) for automated code generation has emerged as a significant focus within AI research. As these pretrained models continue to evolve, their ability to understand and generate complex code structures has…

Software Engineering · Computer Science 2025-05-06 Nazmus Ashrafi, Salah Bouktif, Mohammed Mediani

RA-Gen: A Controllable Code Generation Framework Using ReAct for Multi-Agent Task Execution

Code generation models based on large language models (LLMs) have gained wide adoption, but challenges remain in ensuring safety, accuracy, and controllability, especially for complex tasks. Existing methods often lack dynamic integration…

Software Engineering · Computer Science 2025-10-13 Aofan Liu, Haoxuan Li, Bin Wang, Ao Yang, Hui Li