Related papers: Characterizing Datapoints via Second-Split Forgett…

200 papers

Scaled post-training now drives many of the largest capability gains in language models (LMs), yet its effect on pretrained knowledge remains poorly understood. Not all forgetting is equal: Forgetting one fact (e.g., a U.S. president or an…

Machine Learning · Computer Science · 2025-10-21 · Jackson Harmon, Andreas Hochlehnert, Matthias Bethge, Ameya Prabhu

Machine Unlearning (MU) enables Large Language Models (LLMs) to remove unsafe or outdated information. However, existing work assumes that all facts are equally forgettable and largely ignores whether the forgotten knowledge originates from…

Computation and Language · Computer Science · 2026-05-15 · Borisiuk Anna, Andrey Savchenko, Alexander Panchenko, Elena Tutubalina

Supervised fine-tuning (SFT) on chain-of-thought data is an essential post-training step for reasoning language models. Standard machine learning intuition suggests that training with more unique training samples yields better…

Computation and Language · Computer Science · 2026-02-12 · Dawid J. Kopiczko, Sagar Vaze, Tijmen Blankevoort, Yuki M. Asano

Supervised fine-tuning (SFT) is a common first stage of LLM post-training, teaching the model to follow instructions and shaping its behavior as a helpful assistant. At the same time, SFT may harm the fundamental capabilities of an LLM,…

Machine Learning · Computer Science · 2026-04-16 · Mark Rofin, Aditya Varre, Nicolas Flammarion

Supervised fine-tuning (SFT) plays a critical role for pretrained large language models (LLMs), notably enhancing their capacity to acquire domain-specific knowledge while preserving or potentially augmenting their general-purpose…

Machine Learning · Computer Science · 2026-03-31 · Ali Taheri, Alireza Taban, Qizhou Wang, Shanshan Ye, Abdolreza Mirzaei, Tongliang Liu, Bo Han

Supervised Fine-Tuning (SFT) is a critical step for enhancing the instruction-following capabilities of Large Language Models (LLMs) and adapting them to specialized domains. However, SFT often leads to a degradation of the model's general…

Computation and Language · Computer Science · 2025-07-01 · Fei Ding, Baiqiao Wang

Supervised fine-tuning (SFT) of large language models can be viewed as an off-policy learning problem, where expert demonstrations come from a fixed behavior policy while training aims to optimize a target policy. Importance sampling is the…

Machine Learning · Computer Science · 2025-09-22 · Shiwan Zhao, Xuyang Zhao, Jiaming Zhou, Aobo Kong, Qicheng Li, Yong Qin

Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not…

Post-training algorithms such as Supervised Fine-Tuning (SFT) and Reinforcement Fine-Tuning (RFT) are widely used to adapt (multimodal) large language models to downstream tasks. While effective at task adaptation, their impact on retaining…

Computation and Language · Computer Science · 2026-03-06 · Zhihao Zhang, Qiaole Dong, Qi Zhang, Jun Zhao, Enyu Zhou, Zhiheng Xi, Senjie Jin, Xiaoran Fan, Yuhao Zhou, Mingqi Wu, Yanwei Fu, Tao Ji, Tao Gui, Xuanjing Huang, Kai Chen

Sample selection is an effective strategy to mitigate the effect of label noise in robust learning. Typical strategies commonly apply the small-loss criterion to identify clean samples. However, those samples lying around the decision…

Computer Vision and Pattern Recognition · Computer Science · 2022-08-25 · Qi Wei, Haoliang Sun, Xiankai Lu, Yilong Yin

Deep neural networks are over-parameterized and easily overfit the datasets they train on. In the extreme case, it has been shown that these networks can memorize a training set with fully randomized labels. We propose using the curvature…

Machine Learning · Computer Science · 2023-10-03 · Isha Garg, Deepak Ravikumar, Kaushik Roy

A prevailing view holds that supervised fine-tuning (SFT) memorizes training data and fails to generalize, whereas reinforcement learning (RL) attains broader robustness. We revisit this claim through a systematic evaluation on two…

Machine Learning · Computer Science · 2025-10-02 · Xiaofeng Lin, Hejian Sang, Zhipeng Wang, Xuezhou Zhang

In recent years, deep neural networks have significantly impacted the seismic interpretation process. Due to the simple implementation and low interpretation costs, deep neural networks are an attractive component for the common…

Machine Learning · Computer Science · 2023-03-01 · Ryan Benkert, Oluwaseun Joseph Aribido, Ghassan AlRegib

Time Series Foundation Models (TSFMs) have shown promising zero-shot generalization across diverse forecasting tasks. However, their robustness to continual adaptation remains underexplored. In this work, we investigate the extent to which…

Machine Learning · Computer Science · 2025-10-03 · Nouha Karaouli, Denis Coquenet, Elisa Fromont, Martial Mermillod, Marina Reyboz

Catastrophic forgetting is a challenging issue in continual learning, in which a deep neural network forgets the knowledge acquired from a former task after learning subsequent tasks. However, existing methods try to find the joint…

Machine Learning · Computer Science · 2018-12-06 · Jian Peng, Jiang Hao, Zhuo Li, Enqiang Guo, Xiaohong Wan, Deng Min, Qing Zhu, Haifeng Li

Adapting language models (LMs) to new tasks via post-training carries the risk of degrading existing capabilities -- a phenomenon classically known as catastrophic forgetting. In this paper, toward identifying guidelines for mitigating this…

Machine Learning · Computer Science · 2025-12-04 · Howard Chen, Noam Razin, Karthik Narasimhan, Danqi Chen

Supervised fine-tuning (SFT) is a standard approach to adapting large language models (LLMs) to new domains. In this work, we improve the statistical efficiency of SFT by selecting an informative subset of training examples. Specifically,…

Machine Learning · Computer Science · 2025-05-22 · Rohan Deb, Kiran Thekumparampil, Kousha Kalantari, Gaurush Hiranandani, Shoham Sabach, Branislav Kveton

Catastrophic forgetting - the tendency of neural networks to forget previously learned data when learning new information - remains a central challenge in continual learning. In this work, we adopt a behavioral approach, observing a…

Machine Learning · Computer Science · 2025-07-08 · Guy Hacohen, Tinne Tuytelaars

Continual post-training (CPT) is a popular and effective technique for adapting foundation models like multimodal large language models to specific and ever-evolving downstream tasks. While existing research has primarily concentrated on…

Machine Learning · Computer Science · 2026-01-22 · Song Lai, Haohan Zhao, Rong Feng, Changyi Ma, Wenzhuo Liu, Hongbo Zhao, Xi Lin, Dong Yi, Qingfu Zhang, Hongbin Liu, Gaofeng Meng, Fei Zhu

Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks. Currently, there are two popular approaches to make use of…

Computation and Language · Computer Science · 2023-05-23 · Zhengxiang Shi, Francesco Tonolini, Nikolaos Aletras, Emine Yilmaz, Gabriella Kazai, Yunlong Jiao