Related papers: Subtract the Corruption: Training-Data-Free Correc…

Corrective Machine Unlearning

Machine Learning models increasingly face data integrity challenges due to the use of large-scale training datasets drawn from the Internet. We study what model developers can do if they detect that some data was manipulated or incorrect.…

Machine Learning · Computer Science 2024-10-18 Shashwat Goel , Ameya Prabhu , Philip Torr , Ponnurangam Kumaraguru , Amartya Sanyal

A Non-Intrusive Correction Algorithm for Classification Problems with Corrupted Data

A novel correction algorithm is proposed for multi-class classification problems with corrupted training data. The algorithm is non-intrusive, in the sense that it post-processes a trained classification model by adding a correction…

Machine Learning · Computer Science 2020-02-13 Jun Hou , Tong Qin , Kailiang Wu , Dongbin Xiu

Navigating Data Corruption in Machine Learning: Balancing Quality, Quantity, and Imputation Strategies

Data corruption, including missing and noisy data, poses significant challenges in real-world machine learning. This study investigates the effects of data corruption on model performance and explores strategies to mitigate these effects…

Machine Learning · Computer Science 2025-05-22 Qi Liu , Wanjing Ma

Efficient Unlearning through Maximizing Relearning Convergence Delay

Machine unlearning poses challenges in removing mislabeled, contaminated, or problematic data from a pretrained model. Current unlearning approaches and evaluation metrics are solely focused on model predictions, which limits insight into…

Machine Learning · Computer Science 2026-04-13 Khoa Tran , Simon S. Woo

LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty

We present LoTUS, a novel Machine Unlearning (MU) method that eliminates the influence of training samples from pre-trained models, avoiding retraining from scratch. LoTUS smooths the prediction probabilities of the model up to an…

Machine Learning · Computer Science 2025-08-29 Christoforos N. Spartalis , Theodoros Semertzidis , Efstratios Gavves , Petros Daras

Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning

The trustworthy machine learning (ML) community is increasingly recognizing the crucial need for models capable of selectively 'unlearning' data points after training. This leads to the problem of machine unlearning (MU), aiming to…

Machine Learning · Computer Science 2024-07-10 Chongyu Fan , Jiancheng Liu , Alfred Hero , Sijia Liu

Too Correct to Learn: Reinforcement Learning on Saturated Reasoning Data

Reinforcement Learning (RL) enhances LLM reasoning, yet a paradox emerges as models scale: strong base models saturate standard benchmarks (e.g., MATH), yielding correct but homogeneous solutions. In such environments, the lack of failure…

Machine Learning · Computer Science 2026-04-21 Zhenwen Liang , Yujun Zhou , Sidi Lu , Xiangliang Zhang , Haitao Mi , Dong Yu

Remaining-data-free Machine Unlearning by Suppressing Sample Contribution

Machine unlearning (MU) aims to remove the influence of specific training samples from a well-trained model, a task of growing importance due to the ``right to be forgotten.'' The unlearned model should approach the retrained model, where…

Machine Learning · Computer Science 2026-03-10 Xinwen Cheng , Zhehao Huang , Wenxin Zhou , Zhengbao He , Ruikai Yang , Yingwen Wu , Xiaolin Huang

Tackling Fake Forgetting through Uncertainty Quantification

Machine unlearning seeks to remove the influence of specified data from a trained model. While the unlearning accuracy provides a widely used metric for assessing unlearning performance, it falls short in assessing the reliability of…

Machine Learning · Computer Science 2026-05-13 Yingdan Shi , Sijia Liu , Kaize Ding , Ren Wang

A hybrid framework for effective and efficient machine unlearning

Recently machine unlearning (MU) is proposed to remove the imprints of revoked samples from the already trained model parameters, to solve users' privacy concern. Different from the runtime expensive retraining from scratch, there exist two…

Machine Learning · Computer Science 2024-12-20 Mingxin Li , Yizhen Yu , Ning Wang , Zhigang Wang , Xiaodong Wang , Haipeng Qu , Jia Xu , Shen Su , Zhichao Yin

Rotation Control Unlearning: Quantifying and Controlling Continuous Unlearning for LLM with The Cognitive Rotation Space

As Large Language Models (LLMs) become increasingly prevalent, their security vulnerabilities have already drawn attention. Machine unlearning is introduced to seek to mitigate these risks by removing the influence of undesirable data.…

Machine Learning · Computer Science 2026-01-06 Xiang Zhang , Kun Wei , Xu Yang , Jiahua Li , Su Yan , Cheng Deng

Neural Corrective Machine Unranking

Machine unlearning in neural information retrieval (IR) systems requires removing specific data whilst maintaining model performance. Applying existing machine unlearning methods to IR may compromise retrieval effectiveness or inadvertently…

Information Retrieval · Computer Science 2025-07-25 Jingrui Hou , Axel Finke , Georgina Cosma

FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning

Machine unlearning is an emerging field that selectively removes specific data samples from a trained model. This capability is crucial for addressing privacy concerns, complying with data protection regulations, and correcting errors or…

Machine Learning · Computer Science 2025-01-29 Zitong Li , Qingqing Ye , Haibo Hu

Markov Chain Monte Carlo-Based Machine Unlearning: Unlearning What Needs to be Forgotten

As the use of machine learning (ML) models is becoming increasingly popular in many real-world applications, there are practical challenges that need to be addressed for model maintenance. One such challenge is to 'undo' the effect of a…

Machine Learning · Computer Science 2022-03-01 Quoc Phong Nguyen , Ryutaro Oikawa , Dinil Mon Divakaran , Mun Choon Chan , Bryan Kian Hsiang Low

Unlearning Comparator: A Visual Analytics System for Comparative Evaluation of Machine Unlearning Methods

Machine Unlearning (MU) aims to remove target training data from a trained model so that the removed data no longer influences the model's behavior, fulfilling "right to be forgotten" obligations under data privacy laws. Yet, we observe…

Cryptography and Security · Computer Science 2026-01-27 Jaeung Lee , Suhyeon Yu , Yurim Jang , Simon S. Woo , Jaemin Jo

Redirection for Erasing Memory (REM): Towards a universal unlearning method for corrupted data

Machine unlearning is studied for a multitude of tasks, but specialization of unlearning methods to particular tasks has made their systematic comparison challenging. To address this issue, we propose a conceptual space to characterize…

Machine Learning · Computer Science 2026-02-04 Stefan Schoepf , Michael Curtis Mozer , Nicole Elyse Mitchell , Alexandra Brintrup , Georgios Kaissis , Peter Kairouz , Eleni Triantafillou

Model Sparsity Can Simplify Machine Unlearning

In response to recent data regulation requirements, machine unlearning (MU) has emerged as a critical process to remove the influence of specific examples from a given model. Although exact unlearning can be achieved through complete model…

Machine Learning · Computer Science 2024-01-30 Jinghan Jia , Jiancheng Liu , Parikshit Ram , Yuguang Yao , Gaowen Liu , Yang Liu , Pranay Sharma , Sijia Liu

A Certified Unlearning Approach without Access to Source Data

With the growing adoption of data privacy regulations, the ability to erase private or copyrighted information from trained models has become a crucial requirement. Traditional unlearning methods often assume access to the complete training…

Machine Learning · Computer Science 2025-12-22 Umit Yigit Basaran , Sk Miraj Ahmed , Amit Roy-Chowdhury , Basak Guler

Fast Yet Effective Machine Unlearning

Unlearning the data observed during the training of a machine learning (ML) model is an important task that can play a pivotal role in fortifying the privacy and security of ML-based applications. This paper raises the following questions:…

Machine Learning · Computer Science 2023-06-01 Ayush K Tarun , Vikram S Chundawat , Murari Mandal , Mohan Kankanhalli

Corrupted but Not Broken: Understanding and Mitigating the Negative Impacts of Corrupted Data in Visual Instruction Tuning

Visual Instruction Tuning (VIT) aims to enhance Multimodal Large Language Models (MLLMs), yet its effectiveness is often compromised by corrupted datasets with issues such as hallucinated content, incorrect responses, and poor OCR quality.…

Computer Vision and Pattern Recognition · Computer Science 2025-05-28 Yunhao Gou , Hansi Yang , Zhili Liu , Kai Chen , Yihan Zeng , Lanqing Hong , Zhenguo Li , Qun Liu , Bo Han , James T. Kwok , Yu Zhang