Related papers: Towards Practical Tool Usage for Continually Learn…

Continual Learning for Large Language Models: A Survey

Large language models (LLMs) are not amenable to frequent re-training, due to high training costs arising from their massive scale. However, updates are necessary to endow LLMs with new skills and keep them up-to-date with rapidly evolving…

Computation and Language · Computer Science 2024-02-08 Tongtong Wu, Linhao Luo, Yuan-Fang Li, Shirui Pan, Thuy-Trang Vu, Gholamreza Haffari

Continual Learning in Large Language Models: Methods, Challenges, and Opportunities

Continual learning (CL) has emerged as a pivotal paradigm to enable large language models (LLMs) to dynamically adapt to evolving knowledge and sequential tasks while mitigating catastrophic forgetting, a critical limitation of the static…

Computation and Language · Computer Science 2026-03-16 Hongyang Chen, Zhongwu Sun, Hongfei Ye, Kunchi Li, Xuemin Lin

What Affects the Stability of Tool Learning? An Empirical Study on the Robustness of Tool Learning Frameworks

Tool learning methods have enhanced the ability of large language models (LLMs) to interact with real-world applications. Many existing works fine-tune LLMs or design prompts to enable LLMs to select appropriate tools and correctly invoke…

Computation and Language · Computer Science 2024-07-04 Chengrui Huang, Zhengliang Shi, Yuntao Wen, Xiuying Chen, Peng Han, Shen Gao, Shuo Shang

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Continual learning (CL) in large language models (LLMs) is an evolving domain that focuses on developing efficient and sustainable training strategies to adapt models to emerging knowledge and achieve robustness in dynamic environments. Our…

Computation and Language · Computer Science 2025-02-13 Çağatay Yıldız, Nishaanth Kanna Ravichandran, Nitin Sharma, Matthias Bethge, Beyza Ermis

Towards Lifelong Learning of Large Language Models: A Survey

As the applications of large language models (LLMs) expand across diverse fields, the ability of these models to adapt to ongoing changes in data, tasks, and user preferences becomes crucial. Traditional training methods, relying on static…

Machine Learning · Computer Science 2024-06-11 Junhao Zheng, Shengjie Qiu, Chengming Shi, Qianli Ma

LLM With Tools: A Survey

The integration of tools in augmenting large language models presents a novel approach toward enhancing the efficiency and accuracy of these models in handling specific, complex tasks. This paper delves into the methodology, challenges, and…

Artificial Intelligence · Computer Science 2024-09-30 Zhuocheng Shen

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool…

Artificial Intelligence · Computer Science 2024-02-19 Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang

COLT: Enhancing Video Large Language Models with Continual Tool Usage

The success of Large Language Models (LLMs) has significantly propelled the research of video understanding. To harvest the benefits of well-trained expert models (i.e., tools), video LLMs prioritize the exploration of tool usage…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Yuyang Liu, Meng Cao, Xinyuan Shi, Xiaodan Liang

Self-Training Large Language Models for Tool-Use Without Demonstrations

Large language models (LLMs) remain prone to factual inaccuracies and computational errors, including hallucinations and mistakes in mathematical reasoning. Recent work augmented LLMs with tools to mitigate these shortcomings, but often…

Computation and Language · Computer Science 2025-02-11 Ne Luo, Aryo Pradipta Gema, Xuanli He, Emile van Krieken, Pietro Lesci, Pasquale Minervini

Continual Learning of Large Language Models: A Comprehensive Survey

The recent success of large language models (LLMs) trained on static, pre-collected, general datasets has sparked numerous research directions and applications. One such direction addresses the non-trivial challenge of integrating…

Machine Learning · Computer Science 2024-11-26 Haizhou Shi, Zihao Xu, Hengyi Wang, Weiyi Qin, Wenyuan Wang, Yibin Wang, Zifeng Wang, Sayna Ebrahimi, Hao Wang

Tool Learning with Large Language Models: A Survey

Recently, tool learning with large language models (LLMs) has emerged as a promising paradigm for augmenting the capabilities of LLMs to tackle highly complex problems. Despite growing attention and rapid advancements in this field, the…

Computation and Language · Computer Science 2024-11-05 Changle Qu, Sunhao Dai, Xiaochi Wei, Hengyi Cai, Shuaiqiang Wang, Dawei Yin, Jun Xu, Ji-Rong Wen

Programmers Aren't Obsolete Yet: A Syllabus for Teaching CS Students to Responsibly Use Large Language Models for Code Generation

Large Language Models (LLMs) have emerged as powerful tools for automating code generation, offering immense potential to enhance programmer productivity. However, their non-deterministic nature and reliance on user input necessitate a…

Software Engineering · Computer Science 2025-02-24 Bruno Pereira Cipriano, Lúcio Studer Ferreira

Are Large Language Models Temporally Grounded?

Are large language models (LLMs) temporally grounded? Since LLMs cannot perceive and interact with the environment, it is impossible to answer this question directly. Instead, we provide LLMs with textual narratives and probe them with…

Computation and Language · Computer Science 2023-11-17 Yifu Qiu, Zheng Zhao, Yftah Ziser, Anna Korhonen, Edoardo M. Ponti, Shay B. Cohen

Test-Time Learning for Large Language Models

While Large Language Models (LLMs) have exhibited remarkable emergent capabilities through extensive pre-training, they still face critical limitations in generalizing to specialized domains and handling diverse linguistic variations, known…

Computation and Language · Computer Science 2025-05-28 Jinwu Hu, Zhitian Zhang, Guohao Chen, Xutao Wen, Chao Shuai, Wei Luo, Bin Xiao, Yuanqing Li, Mingkui Tan

Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale

In recent years, Large Language Models (LLMs) have made significant strides towards Artificial General Intelligence. However, training these models from scratch requires substantial computational resources and vast amounts of text data. In…

Computation and Language · Computer Science 2024-10-03 Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou

Patterns of Student Help-Seeking When Using a Large Language Model-Powered Programming Assistant

Providing personalized assistance at scale is a long-standing challenge for computing educators, but a new generation of tools powered by large language models (LLMs) offers immense promise. Such tools can, in theory, provide on-demand help…

Computers and Society · Computer Science 2024-01-09 Brad Sheese, Mark Liffiton, Jaromir Savelka, Paul Denny

Towards Modeling Learner Performance with Large Language Models

Recent work exploring the capabilities of pre-trained large language models (LLMs) has demonstrated their ability to act as general pattern machines by completing complex token sequences representing a wide array of tasks, including…

Computers and Society · Computer Science 2024-03-25 Seyed Parsa Neshaei, Richard Lee Davis, Adam Hazimeh, Bojan Lazarevski, Pierre Dillenbourg, Tanja Käser

Enhancing Tool Retrieval with Iterative Feedback from Large Language Models

Tool learning aims to enhance and expand large language models' (LLMs) capabilities with external tools, which has gained significant attention recently. Current methods have shown that LLMs can effectively handle a certain amount of tools…

Computation and Language · Computer Science 2024-10-01 Qiancheng Xu, Yongqi Li, Heming Xia, Wenjie Li

Adapting While Learning: Grounding LLMs for Scientific Problems with Intelligent Tool Usage Adaptation

Large Language Models (LLMs) demonstrate promising capabilities in solving scientific problems but often suffer from the issue of hallucination. While integrating LLMs with tools can mitigate this issue, models fine-tuned on tool usage…

Machine Learning · Computer Science 2025-06-23 Bohan Lyu, Yadi Cao, Duncan Watson-Parris, Leon Bergen, Taylor Berg-Kirkpatrick, Rose Yu

TL-Training: A Task-Feature-Based Framework for Training Large Language Models in Tool Use

Large language models (LLMs) achieve remarkable advancements by leveraging tools to interact with environments, a critical step toward generalized AI. However, the standard supervised fine-tuning (SFT) approach, which relies on large-scale…

Computation and Language · Computer Science 2025-08-27 Junjie Ye, Yilong Wu, Sixian Li, Yuming Yang, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang, Peng Wang, Zhongchao Shi, Jianping Fan, Zhengyin Du