Related papers: Molly: Making Large Language Model Agents Solve Py…

200 papers

While large language models (LLMs) have demonstrated remarkable versatility across a wide range of general tasks, their effectiveness often diminishes in domain-specific applications due to inherent knowledge gaps. Moreover, their…

Artificial Intelligence · Computer Science 2025-11-21 Hanzhi Yan, Qin Lu, Xianqiao Wang, Xiaoming Zhai, Tianming Liu, He Li

With the development of artificial intelligence (AI), large language models (LLMs) are widely used in many fields. However, the ability of LLMs is still very limited when it comes to mathematical reasoning. Mathematics plays an…

Computation and Language · Computer Science 2024-08-06 Wenbei Xie, Donglin Liu, Haoran Yan, Wenjie Wu, Zongyang Liu

The landscape of education is changing rapidly, shaped by emerging pedagogical approaches, technological innovations such as artificial intelligence (AI), and evolving societal expectations, all of which demand thorough evaluation of new…

Computers and Society · Computer Science 2026-03-19 Fiammetta Caccavale, Carina L. Gargalo, Julian Kager, Magdalena Skowyra, Steen Larsen, Krist V. Gernaey, Ulrich Krühne

Large language models (LLMs) are revolutionizing education, with LLM-based agents playing a key role in simulating student behavior. A major challenge in student simulation is modeling the diverse learning patterns of students at various…

Machine Learning · Computer Science 2025-08-12 Tao Wu, Jingyuan Chen, Wang Lin, Mengze Li, Yumeng Zhu, Ang Li, Kun Kuang, Fei Wu

Large Language Models (LLMs) generate responses to questions; however, their effectiveness is often hindered by sub-optimal quality of answers and occasional failures to provide accurate responses to questions. To address these challenges,…

Computation and Language · Computer Science 2024-02-06 Liang Zhang, Katherine Jijo, Spurthi Setty, Eden Chung, Fatima Javid, Natan Vidra, Tommy Clifford

In this study, we explore the fine-tuning of Large Language Models (LLMs) to better support policymakers in their crucial work of understanding, analyzing, and crafting legal regulations. To equip the model with a deep understanding of…

Computation and Language · Computer Science 2025-11-06 One Octadion, Bondan Sapta Prakoso, Nanang Yudi Setiawan, Novanto Yudistira

Large Language Models (LLMs) encapsulate an extensive amount of world knowledge, and this has enabled their application in various domains to improve the performance of a variety of Natural Language Processing (NLP) tasks. This has also…

Artificial Intelligence · Computer Science 2024-04-30 Sina Gholamian, Domingo Huh

Large Language Models (LLMs) hold significant promise for mathematics education, yet they often struggle with complex mathematical reasoning. While Retrieval-Augmented Generation (RAG) mitigates these issues by grounding LLMs in external…

Computation and Language · Computer Science 2025-12-02 Shiting Chen, Zijian Zhao, Jinsong Chen

High-quality, error-free datasets are a key ingredient in building reliable, accurate, and unbiased machine learning (ML) models. However, real world datasets often suffer from errors due to sensor malfunctions, data entry mistakes, or…

Machine Learning · Computer Science 2025-03-11 Tommaso Bendinelli, Artur Dox, Christian Holz

Multi-agent collaboration among models has shown promise in reasoning tasks but is underexplored in long-form generation tasks like summarization and question-answering. We extend multi-agent multi-model reasoning to generation,…

Computation and Language · Computer Science 2025-03-20 David Wan, Justin Chih-Yao Chen, Elias Stengel-Eskin, Mohit Bansal

Large language model (LLM) agents deployed for multi-step tasks frequently fail in predictable ways: attempting actions with unmet preconditions, issuing redundant commands, or mishandling environment constraints. While retrieval-augmented…

Artificial Intelligence · Computer Science 2025-10-03 Humaid Ibrahim, Nikolai Rozanov, Marek Rei

Large Language Models (LLMs) often produce answers with a single chain-of-thought, which restricts their ability to explore reasoning paths or self-correct flawed outputs in complex tasks. In this paper, we introduce MALT (Multi-Agent LLM…

The applications of Large Language Models (LLMs) in political science are rapidly expanding. This paper demonstrates how LLMs, when augmented with predefined functions and specialized tools, can serve as dynamic agents capable of…

Computation and Language · Computer Science 2025-10-08 Joseph R. Loffredo, Suyeol Yun

The emergence of Large Language Models (LLMs) has significantly advanced natural language processing, but these models often generate factually incorrect information, known as "hallucination". Initial retrieval-augmented generation (RAG)…

Computation and Language · Computer Science 2024-11-12 Yujia Zhou, Zheng Liu, Zhicheng Dou

Background: Over the past few decades, the process and methodology of automated question generation (AQG) have undergone significant transformations. Recent progress in generative natural language models has opened up new potential in the…

Artificial Intelligence · Computer Science 2024-12-06 Dominic Lohr, Marc Berges, Abhishek Chugh, Michael Kohlhase, Dennis Müller

Large Language Models (LLMs) are tools that have become indispensable in development and programming. However, they suffer from hallucinations, especially when dealing with unknown knowledge. This is particularly the case when LLMs are to…

Computation and Language · Computer Science 2026-05-05 Andreas Baumann, Peter Eberhard

Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling…

Artificial Intelligence · Computer Science 2024-05-28 Zihao Zhou, Bin Hu, Chenyang Zhao, Pu Zhang, Bin Liu

Knowing that the generative capabilities of large language models (LLMs) are sometimes hampered by tendencies to hallucinate or create non-factual responses, researchers have increasingly focused on methods to ground generated outputs in…

Information Retrieval · Computer Science 2024-11-20 Sonal Prabhune, Donald J. Berndt

Large Language Models (LLMs) have advanced artificial intelligence by enabling human-like text generation and natural language understanding. However, their reliance on static training data limits their ability to respond to dynamic,…

Artificial Intelligence · Computer Science 2026-04-02 Aditi Singh, Abul Ehtesham, Saket Kumar, Tala Talaei Khoei, Athanasios V. Vasilakos

Recent advances in Large Language Models (LLMs) demonstrate that chain-of-thought prompting and deep reasoning substantially enhance performance on complex tasks, and multi-agent systems can further improve accuracy by enabling model…

Artificial Intelligence · Computer Science 2025-10-16 Zehui Ling, Deshu Chen, Yichi Zhang, Yuchen Liu, Xigui Li, Xin Guo, Yuan Cheng