Related papers: Characterizing Datapoints via Second-Split Forgett…

Mapping Post-Training Forgetting in Language Models at Scale

Scaled post-training now drives many of the largest capability gains in language models (LMs), yet its effect on pretrained knowledge remains poorly understood. Not all forgetting is equal: Forgetting one fact (e.g., a U.S. president or an…

Machine Learning · Computer Science 2025-10-21 Jackson Harmon , Andreas Hochlehnert , Matthias Bethge , Ameya Prabhu

Anatomy of Unlearning: The Dual Impact of Fact Salience and Model Fine-Tuning

Machine Unlearning (MU) enables Large Language Models (LLMs) to remove unsafe or outdated information. However, existing work assumes that all facts are equally forgettable and largely ignores whether the forgotten knowledge originates from…

Computation and Language · Computer Science 2026-05-15 Borisiuk Anna , Andrey Savchenko , Alexander Panchenko , Elena Tutubalina

Data Repetition Beats Data Scaling in Long-CoT Supervised Fine-Tuning

Supervised fine-tuning (SFT) on chain-of-thought data is an essential post-training step for reasoning language models. Standard machine learning intuition suggests that training with more unique training samples yields better…

Computation and Language · Computer Science 2026-02-12 Dawid J. Kopiczko , Sagar Vaze , Tijmen Blankevoort , Yuki M. Asano

(How) Learning Rates Regulate Catastrophic Overtraining

Supervised fine-tuning (SFT) is a common first stage of LLM post-training, teaching the model to follow instructions and shaping its behavior as a helpful assistant. At the same time, SFT may harm the fundamental capabilities of an LLM,…

Machine Learning · Computer Science 2026-04-16 Mark Rofin , Aditya Varre , Nicolas Flammarion

Forgetting: A New Mechanism Towards Better Large Language Model Fine-tuning

Supervised fine-tuning (SFT) plays a critical role for pretrained large language models (LLMs), notably enhancing their capacity to acquire domain-specific knowledge while preserving or potentially augmenting their general-purpose…

Machine Learning · Computer Science 2026-03-31 Ali Taheri , Alireza Taban , Qizhou Wang , Shanshan Ye , Abdolreza Mirzaei , Tongliang Liu , Bo Han

Improved Supervised Fine-Tuning for Large Language Models to Mitigate Catastrophic Forgetting

Supervised Fine-Tuning (SFT) is a critical step for enhancing the instruction-following capabilities of Large Language Models (LLMs) and adapting them to specialized domains. However, SFT often leads to a degradation of the model's general…

Computation and Language · Computer Science 2025-07-01 Fei Ding , Baiqiao Wang

Mind the Gap: Data Rewriting for Stable Off-Policy Supervised Fine-Tuning

Supervised fine-tuning (SFT) of large language models can be viewed as an off-policy learning problem, where expert demonstrations come from a fixed behavior policy while training aims to optimize a target policy. Importance sampling is the…

Machine Learning · Computer Science 2025-09-22 Shiwan Zhao , Xuyang Zhao , Jiaming Zhou , Aobo Kong , Qicheng Li , Yong Qin

An Empirical Study of Example Forgetting during Deep Neural Network Learning

Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not…

Machine Learning · Computer Science 2019-11-18 Mariya Toneva , Alessandro Sordoni , Remi Tachet des Combes , Adam Trischler , Yoshua Bengio , Geoffrey J. Gordon

Why Reinforcement Fine-Tuning Enables MLLMs Preserve Prior Knowledge Better: A Data Perspective

Post-training algorithms such as Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT) are widely used to adapt (multimodal) large language models to downstream tasks. While effective at task adaptation, their impact on retaining…

Computation and Language · Computer Science 2026-03-06 Zhihao Zhang , Qiaole Dong , Qi Zhang , Jun Zhao , Enyu Zhou , Zhiheng Xi , Senjie Jin , Xiaoran Fan , Yuhao Zhou , Mingqi Wu , Yanwei Fu , Tao Ji , Tao Gui , Xuanjing Huang , Kai Chen

Self-Filtering: A Noise-Aware Sample Selection for Label Noise with Confidence Penalization

Sample selection is an effective strategy to mitigate the effect of label noise in robust learning. Typical strategies commonly apply the small-loss criterion to identify clean samples. However, those samples lying around the decision…

Computer Vision and Pattern Recognition · Computer Science 2022-08-25 Qi Wei , Haoliang Sun , Xiankai Lu , Yilong Yin

Memorization Through the Lens of Curvature of Loss Function Around Samples

Deep neural networks are over-parameterized and easily overfit the datasets they train on. In the extreme case, it has been shown that these networks can memorize a training set with fully randomized labels. We propose using the curvature…

Machine Learning · Computer Science 2023-10-03 Isha Garg , Deepak Ravikumar , Kaushik Roy

Debunk the Myth of SFT Generalization

A prevailing view holds that supervised fine-tuning (SFT) memorizes training data and fails to generalize, whereas reinforcement learning (RL) attains broader robustness. We revisit this claim through a systematic evaluation on two…

Machine Learning · Computer Science 2025-10-02 Xiaofeng Lin , Hejian Sang , Zhipeng Wang , Xuezhou Zhang

Example Forgetting: A Novel Approach to Explain and Interpret Deep Neural Networks in Seismic Interpretation

In recent years, deep neural networks have significantly impacted the seismic interpretation process. Due to the simple implementation and low interpretation costs, deep neural networks are an attractive component for the common…

Machine Learning · Computer Science 2023-03-01 Ryan Benkert , Oluwaseun Joseph Aribido , Ghassan AlRegib

Are Time Series Foundation Models Susceptible to Catastrophic Forgetting?

Time Series Foundation Models (TSFMs) have shown promising zero-shot generalization across diverse forecasting tasks. However, their robustness to continual adaptation remains underexplored. In this work, we investigate the extent to which…

Machine Learning · Computer Science 2025-10-03 Nouha Karaouli , Denis Coquenet , Elisa Fromont , Martial Mermillod , Marina Reyboz

Overcoming Catastrophic Forgetting by Soft Parameter Pruning

Catastrophic forgetting is a challenge issue in continual learning when a deep neural network forgets the knowledge acquired from the former task after learning on subsequent tasks. However, existing methods try to find the joint…

Machine Learning · Computer Science 2018-12-06 Jian Peng , Jiang Hao , Zhuo Li , Enqiang Guo , Xiaohong Wan , Deng Min , Qing Zhu , Haifeng Li

Retaining by Doing: The Role of On-Policy Data in Mitigating Forgetting

Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities -- a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating this…

Machine Learning · Computer Science 2025-12-04 Howard Chen , Noam Razin , Karthik Narasimhan , Danqi Chen

FisherSFT: Data-Efficient Supervised Fine-Tuning of Language Models Using Information Gain

Supervised fine-tuning (SFT) is a standard approach to adapting large language models (LLMs) to new domains. In this work, we improve the statistical efficiency of SFT by selecting an informative subset of training examples. Specifically,…

Machine Learning · Computer Science 2025-05-22 Rohan Deb , Kiran Thekumparampil , Kousha Kalantari , Gaurush Hiranandani , Shoham Sabach , Branislav Kveton

Predicting the Susceptibility of Examples to Catastrophic Forgetting

Catastrophic forgetting - the tendency of neural networks to forget previously learned data when learning new information - remains a central challenge in continual learning. In this work, we adopt a behavioral approach, observing a…

Machine Learning · Computer Science 2025-07-08 Guy Hacohen , Tinne Tuytelaars

Reinforcement Fine-Tuning Naturally Mitigates Forgetting in Continual Post-Training

Continual post-training (CPT) is a popular and effective technique for adapting foundation models like multimodal large language models to specific and ever-evolving downstream tasks. While existing research has primarily concentrated on…

Machine Learning · Computer Science 2026-01-22 Song Lai , Haohan Zhao , Rong Feng , Changyi Ma , Wenzhuo Liu , Hongbo Zhao , Xi Lin , Dong Yi , Qingfu Zhang , Hongbin Liu , Gaofeng Meng , Fei Zhu

Rethinking Semi-supervised Learning with Language Models

Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks. Currently, there are two popular approaches to make use of…

Computation and Language · Computer Science 2023-05-23 Zhengxiang Shi , Francesco Tonolini , Nikolaos Aletras , Emine Yilmaz , Gabriella Kazai , Yunlong Jiao