English
Related papers

Related papers: Code2Math: Can Your Code Agent Effectively Evolve …

200 papers

Large Language Model (LLM) Agents, often trained with Reinforcement Learning (RL), are constrained by a dependency on human-curated data, limiting scalability and tethering AI to human knowledge. Existing self-evolution frameworks offer an…

Machine Learning · Computer Science 2025-11-21 Peng Xia , Kaide Zeng , Jiaqi Liu , Can Qin , Fang Wu , Yiyang Zhou , Caiming Xiong , Huaxiu Yao

Computational experiments have emerged as a valuable method for studying complex systems, involving the algorithmization of counterfactuals. However, accurately representing real social systems in Agent-based Modeling (ABM) is challenging…

Artificial Intelligence · Computer Science 2024-02-02 Qun Ma , Xiao Xue , Deyu Zhou , Xiangning Yu , Donghua Liu , Xuwen Zhang , Zihan Zhao , Yifan Shen , Peilin Ji , Juanjuan Li , Gang Wang , Wanpeng Ma

The remarkable growth in large language model (LLM) capabilities has spurred exploration into multi-agent systems, with debate frameworks emerging as a promising avenue for enhanced problem-solving. These multi-agent debate (MAD)…

Artificial Intelligence · Computer Science 2025-06-23 Yongjin Yang , Euiin Yi , Jongwoo Ko , Kimin Lee , Zhijing Jin , Se-Young Yun

Large language models are quickly becoming the foundation for intelligent agents that are capable of using tools. However, training such agents is challenging because it requires human creation and annotation of a diverse set of tasks,…

Artificial Intelligence · Computer Science 2025-06-03 Yifei Zhou , Sergey Levine , Jason Weston , Xian Li , Sainbayar Sukhbaatar

Large Language Models (LLMs) have shown outstanding breakthroughs in code generation. Recent work improves code LLMs by training on synthetic data generated by some powerful LLMs, which can be challenging to scale due to the dependence on a…

Computation and Language · Computer Science 2025-02-11 Yunfan Shao , Linyang Li , Yichuan Ma , Peiji Li , Demin Song , Qinyuan Cheng , Shimin Li , Xiaonan Li , Pengyu Wang , Qipeng Guo , Hang Yan , Xipeng Qiu , Xuanjing Huang , Dahua Lin

Deep research has revolutionized data analysis, yet data scientists still devote substantial time to manually crafting visualizations, highlighting the need for robust automation from natural language queries. However, current systems…

Artificial Intelligence · Computer Science 2025-10-06 Zichen Chen , Jiefeng Chen , Sercan Ö. Arik , Misha Sra , Tomas Pfister , Jinsung Yoon

Coding agents powered by large language models (LLMs) have gained traction for automating code generation through iterative problem-solving with minimal human involvement. Despite the emergence of various frameworks, e.g., LangChain,…

Machine Learning · Computer Science 2025-08-19 Junpeng Wang , Yuzhong Chen , Menghai Pan , Chin-Chia Michael Yeh , Mahashweta Das

Large Language Models (LLMs) have demonstrated considerable potential in improving coding education by providing support for code writing, explanation, and debugging. However, existing LLM-based approaches generally fail to assess students'…

Multiagent Systems · Computer Science 2025-07-21 Jianing Zhao , Peng Gao , Jiannong Cao , Zhiyuan Wen , Chen Chen , Jianing Yin , Ruosong Yang , Bo Yuan

Code synthesis, which requires a deep understanding of complex natural language problem descriptions, generation of code instructions for complex algorithms and data structures, and the successful execution of comprehensive unit tests,…

Computation and Language · Computer Science 2024-05-21 Md. Ashraful Islam , Mohammed Eunus Ali , Md Rizwan Parvez

Students benefit from math problems contextualized to their interests. Large language models (LLMs) offer promise for efficient personalization at scale. However, LLM-generated personalized problems may often have problems such as…

Computers and Society · Computer Science 2026-04-08 Fareya Ikram , Nischal Ashok Kumar , Junyang Lu , Hunter McNichols , Candace Walkington , Neil Heffernan , Andrew S. Lan

Autonomous agents powered by large language models (LLMs) have the potential to significantly enhance human productivity by reasoning, using tools, and executing complex tasks in diverse environments. However, current approaches to…

While the complex reasoning capability of Large Language Models (LLMs) has attracted significant attention, single-agent systems often encounter inherent performance ceilings in complex tasks such as code generation. Multi-agent…

Employing Large Language Models (LLMs) to address mathematical problems is an intriguing research endeavor, considering the abundance of math problems expressed in natural language across numerous science and engineering fields. LLMs, with…

Computation and Language · Computer Science 2024-07-01 Yiran Wu , Feiran Jia , Shaokun Zhang , Hangyu Li , Erkang Zhu , Yue Wang , Yin Tat Lee , Richard Peng , Qingyun Wu , Chi Wang

Recent advances on large language models (LLMs) enable researchers and developers to build autonomous language agents that can automatically solve various tasks and interact with environments, humans, and other agents using natural language…

Recent large language models (LLMs) have demonstrated strong capabilities in understanding and generating code, from competitive programming to repository-level software engineering. In emerging agentic systems, code is no longer only a…

Large Language Models (LLMs) have emerged as powerful tools for accelerating scientific discovery, yet their static knowledge and hallucination issues hinder autonomous research applications. Recent advances integrate LLMs into agentic…

Artificial Intelligence · Computer Science 2025-12-23 Zeyu Xia , Jinzhe Ma , Congjie Zheng , Shufei Zhang , Yuqiang Li , Hang Su , P. Hu , Changshui Zhang , Xingao Gong , Wanli Ouyang , Lei Bai , Dongzhan Zhou , Mao Su

Tool use has turned large language models (LLMs) into powerful agents that can perform complex multi-step tasks by dynamically utilising external software components. However, these tools must be implemented in advance by human developers,…

Computation and Language · Computer Science 2025-06-02 Georg Wölflein , Dyke Ferber , Daniel Truhn , Ognjen Arandjelović , Jakob Nikolas Kather

Recent advancements in large language models (LLMs) have brought significant changes to various domains, especially through LLM-driven autonomous agents. A representative scenario is in software development, where LLM agents demonstrate…

Computation and Language · Computer Science 2024-06-06 Chen Qian , Yufan Dang , Jiahao Li , Wei Liu , Zihao Xie , Yifei Wang , Weize Chen , Cheng Yang , Xin Cong , Xiaoyin Che , Zhiyuan Liu , Maosong Sun

With software maintenance accounting for 50% of the cost of developing software, enhancing code quality and reliability has become more critical than ever. In response to this challenge, this doctoral research proposal aims to explore…

Software Engineering · Computer Science 2024-06-25 Fernando Vallecillos Ruiz

Using multiple agents was found to improve the debugging capabilities of Large Language Models. However, increasing the number of LLM-agents has several drawbacks such as increasing the running costs and rising the risk for the agents to…

Software Engineering · Computer Science 2025-04-28 Yacine Majdoub , Eya Ben Charrada , Haifa Touati
‹ Prev 1 2 3 10 Next ›