Related papers: Co-Learning: Code Learning for Multi-Agent Reinfor…

Mutual Enhancement of Large Language and Reinforcement Learning Models through Bi-Directional Feedback Mechanisms: A Planning Case Study

Large Language Models (LLMs) have demonstrated remarkable capabilities for reinforcement learning (RL) models, such as planning and reasoning capabilities. However, the problems of LLMs and RL model collaboration still need to be solved. In…

Computation and Language · Computer Science 2025-03-04 Shangding Gu

CodeCoR: An LLM-Based Self-Reflective Multi-Agent Framework for Code Generation

Code generation aims to produce code that fulfills requirements written in natural languages automatically. Large Language Models (LLMs) like ChatGPT have demonstrated promising effectiveness in this area. Nonetheless, these LLMs often fail…

Software Engineering · Computer Science 2025-01-15 Ruwei Pan, Hongyu Zhang, Chao Liu

CoRe-Code: Collaborative Reinforcement Learning for Code Generation

Large language models (LLMs) have achieved strong performance in code generation, but most methods rely on autoregressive decoding without global planning, often leading to locally coherent yet globally suboptimal solutions (e.g., failing…

Artificial Intelligence · Computer Science 2026-05-26 Zhihao Dou, Qinjian Zhao, Zhongwei Wan, Xiaoyu Xia, Sumon Biswas

Large Language Model Integration with Reinforcement Learning to Augment Decision-Making in Autonomous Cyber Operations

Reinforcement Learning (RL) has shown great potential for autonomous decision-making in the cybersecurity domain, enabling agents to learn through direct environment interaction. However, RL agents in Autonomous Cyber Operations (ACO)…

Cryptography and Security · Computer Science 2026-02-17 Konur Tholl, François Rivest, Mariam El Mezouar, Adrian Taylor, Ranwa Al Mallah

Reinforcement Learning for Aligning Large Language Models Agents with Interactive Environments: Quantifying and Mitigating Prompt Overfitting

Reinforcement learning (RL) is a promising approach for aligning large language models' (LLMs) knowledge with sequential decision-making tasks. However, few studies have thoroughly investigated the impact on LLM agents' capabilities of…

Machine Learning · Computer Science 2025-09-08 Mohamed Salim Aissi, Clement Romac, Thomas Carta, Sylvain Lamprier, Pierre-Yves Oudeyer, Olivier Sigaud, Laure Soulier, Nicolas Thome

CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

In recent years, instruction fine-tuning (IFT) on large language models (LLMs) has garnered considerable attention to enhance model performance on unseen tasks. Attempts have been made on automatic construction and effective selection for…

Computation and Language · Computer Science 2024-10-25 Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang

Experiential Co-Learning of Software-Developing Agents

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents. A representative scenario is in software development, where LLM agents demonstrate…

Computation and Language · Computer Science 2024-06-06 Chen Qian, Yufan Dang, Jiahao Li, Wei Liu, Zihao Xie, Yifei Wang, Weize Chen, Cheng Yang, Xin Cong, Xiaoyin Che, Zhiyuan Liu, Maosong Sun

Large Language Model as a Policy Teacher for Training Reinforcement Learning Agents

Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling…

Artificial Intelligence · Computer Science 2024-05-28 Zihao Zhou, Bin Hu, Chenyang Zhao, Pu Zhang, Bin Liu

Lessons Learned: A Multi-Agent Framework for Code LLMs to Learn and Improve

Recent studies show that LLMs possess different skills and specialize in different tasks. In fact, we observe that their varied performance occurs at several levels of granularity. For example, in the code optimization task, code LLMs excel…

Artificial Intelligence · Computer Science 2025-10-24 Yuanzhe Liu, Ryan Deng, Tim Kaler, Xuhao Chen, Charles E. Leiserson, Yao Ma, Jie Chen

Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning

Large Language Models (LLMs) are increasingly being explored for building Agents capable of active environmental interaction (e.g., via tool use) to solve complex problems. Reinforcement Learning (RL) is considered a key technology with…

Computation and Language · Computer Science 2025-11-19 Mingyue Cheng, Jie Ouyang, Shuo Yu, Ruiran Yan, Yucong Luo, Zirui Liu, Daoyu Wang, Qi Liu, Enhong Chen

LLM Collaboration With Multi-Agent Reinforcement Learning

A large amount of work has been done in Multi-Agent Systems (MAS) for modeling and solving problems with multiple interacting agents. However, most LLMs are pretrained independently and not specifically optimized for coordination. Existing…

Artificial Intelligence · Computer Science 2025-12-10 Shuo Liu, Tianle Chen, Zeyu Liang, Xueguang Lyu, Christopher Amato

CodeEdu: A Multi-Agent Collaborative Platform for Personalized Coding Education

Large Language Models (LLMs) have demonstrated considerable potential in improving coding education by providing support for code writing, explanation, and debugging. However, existing LLM-based approaches generally fail to assess students'…

Multiagent Systems · Computer Science 2025-07-21 Jianing Zhao, Peng Gao, Jiannong Cao, Zhiyuan Wen, Chen Chen, Jianing Yin, Ruosong Yang, Bo Yuan

Coevolving with the Other You: Fine-Tuning LLM with Sequential Cooperative Multi-Agent Reinforcement Learning

Reinforcement learning (RL) has emerged as a pivotal technique for fine-tuning large language models (LLMs) on specific tasks. However, prevailing RL fine-tuning methods predominantly rely on PPO and its variants. Though these algorithms…

Artificial Intelligence · Computer Science 2025-02-25 Hao Ma, Tianyi Hu, Zhiqiang Pu, Boyin Liu, Xiaolin Ai, Yanyan Liang, Min Chen

Multi-Agent Evolve: LLM Self-Improve through Co-evolution

Reinforcement Learning (RL) has demonstrated significant potential in enhancing the reasoning capabilities of large language models (LLMs). However, the success of RL for LLMs heavily relies on human-curated datasets and verifiable rewards,…

Artificial Intelligence · Computer Science 2025-10-31 Yixing Chen, Yiding Wang, Siqi Zhu, Haofei Yu, Tao Feng, Muhan Zhang, Mostofa Patwary, Jiaxuan You

LLM-based Multi-Agent Reinforcement Learning: Current and Future Directions

In recent years, Large Language Models (LLMs) have shown great abilities in various tasks, including question answering, arithmetic problem solving, and poem writing, among others. Although research on LLM-as-an-agent has shown that LLM can…

Multiagent Systems · Computer Science 2024-05-21 Chuanneng Sun, Songjun Huang, Dario Pompili

Controlling Performance and Budget of a Centralized Multi-agent LLM System with Reinforcement Learning

Large language models (LLMs) exhibit complementary strengths across domains and come with varying inference costs, motivating the design of multi-agent LLM systems where specialized models collaborate efficiently. Existing approaches…

Computation and Language · Computer Science 2025-11-05 Bowen Jin, TJ Collins, Donghan Yu, Mert Cemri, Shenao Zhang, Mengyu Li, Jay Tang, Tian Qin, Zhiyang Xu, Jiarui Lu, Guoli Yin, Jiawei Han, Zirui Wang

MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning

Leveraging multiple large language models (LLMs) to build collaborative multi-agentic workflows has demonstrated significant potential. However, most previous studies focus on prompting the out-of-the-box LLMs, relying on their innate…

Artificial Intelligence · Computer Science 2025-07-15 Chanwoo Park, Seungju Han, Xingzhi Guo, Asuman Ozdaglar, Kaiqing Zhang, Joo-Kyung Kim

Enhancing LLM-based Quantum Code Generation with Multi-Agent Optimization and Quantum Error Correction

Multi-agent frameworks with Large Language Models (LLMs) have become promising tools for generating general-purpose programming languages using test-driven development, allowing developers to create more accurate and robust code. However,…

Quantum Physics · Physics 2025-07-04 Charlie Campbell, Hao Mark Chen, Wayne Luk, Hongxiang Fan

CodeAgent: Autonomous Communicative Agents for Code Review

Code review, which aims at ensuring the overall quality and reliability of software, is a cornerstone of software development. Unfortunately, while crucial, code review is a labor-intensive process that the research community is looking to…

Software Engineering · Computer Science 2024-09-26 Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawende F. Bissyande

Harnessing the Power of Reinforcement Learning for Language-Model-Based Information Retriever via Query-Document Co-Augmentation

Recent studies have proposed leveraging Large Language Models (LLMs) as information retrievers through query rewriting. However, for challenging corpora, we argue that enhancing queries alone is insufficient for robust semantic matching;…

Information Retrieval · Computer Science 2025-06-24 Jingming Liu, Yumeng Li, Wei Shi, Yao-Xiang Ding, Hui Su, Kun Zhou