Related papers: Approximate Data Deletion from Machine Learning Mo…

Approximate Data Deletion in Generative Models

Users have the right to have their data deleted by third-party learned systems, as codified by recent legislation such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA). Such data deletion can…

Machine Learning · Computer Science 2022-06-30 Zhifeng Kong , Scott Alfeld

Certifiable Machine Unlearning for Linear Models

Machine unlearning is the task of updating machine learning (ML) models after a subset of the training data they were trained on is deleted. Methods for the task are desired to combine effectiveness and efficiency, i.e., they should…

Machine Learning · Computer Science 2021-08-17 Ananth Mahadevan , Michael Mathioudakis

Making AI Forget You: Data Deletion in Machine Learning

Intense recent discussions have focused on how to provide individuals with control over when their data can and cannot be used --- the EU's Right To Be Forgotten regulation is an example of this effort. In this paper we initiate a framework…

Machine Learning · Computer Science 2019-11-06 Antonio Ginart , Melody Y. Guan , Gregory Valiant , James Zou

Machine unlearning via GAN

Machine learning models, especially deep models, may unintentionally remember information about their training data. Malicious attackers can thus pilfer some property about training data by attacking the model via membership inference…

Machine Learning · Computer Science 2021-11-24 Kongyang Chen , Yao Huang , Yiwen Wang

Certified Data Removal from Machine Learning Models

Good data stewardship requires removal of data at the request of the data's owner. This raises the question if and how a trained machine-learning model, which implicitly stores information about its training data, should be affected by such…

Machine Learning · Computer Science 2023-11-09 Chuan Guo , Tom Goldstein , Awni Hannun , Laurens van der Maaten

Certain and Approximately Certain Models for Statistical Learning

Real-world data is often incomplete and contains missing values. To train accurate models over real-world datasets, users need to spend a substantial amount of time and resources imputing and finding proper values for missing data items. In…

Machine Learning · Statistics 2024-03-05 Cheng Zhen , Nischal Aryal , Arash Termehchy , Alireza Aghasi , Amandeep Singh Chabada

Attribute-to-Delete: Machine Unlearning via Datamodel Matching

Machine unlearning -- efficiently removing the effect of a small "forget set" of training data on a pre-trained machine learning model -- has recently attracted significant research interest. Despite this interest, however, recent work…

Machine Learning · Computer Science 2024-11-13 Kristian Georgiev , Roy Rinberg , Sung Min Park , Shivam Garg , Andrew Ilyas , Aleksander Madry , Seth Neel

Revisiting Machine Unlearning with Dimensional Alignment

Machine unlearning, an emerging research topic focusing on compliance with data privacy regulations, enables trained models to remove the information learned from specific data. While many existing methods indirectly address this issue by…

Machine Learning · Computer Science 2024-12-24 Seonguk Seo , Dongwan Kim , Bohyung Han

Algorithms that Approximate Data Removal: New Results and Limitations

We study the problem of deleting user data from machine learning models trained using empirical risk minimization. Our focus is on learning algorithms which return the empirical risk minimizer and approximate unlearning algorithms that…

Machine Learning · Statistics 2022-09-27 Vinith M. Suriyakumar , Ashia C. Wilson

Learning Over Dirty Data with Minimal Repairs

Missing data often exists in real-world datasets, requiring significant time and effort for data repair to learn accurate models. In this paper, we show that imputing all missing values is not always necessary to achieve an accurate ML…

Machine Learning · Computer Science 2026-03-19 Cheng Zhen , Prayoga , Nischal Aryal , Arash Termehchy , Garrett Biwer , Lubna Alzamil

Machine Unlearning: Linear Filtration for Logit-based Classifiers

Recently enacted legislation grants individuals certain rights to decide in what fashion their personal data may be used, and in particular a "right to be forgotten". This poses a challenge to machine learning: how to proceed when an…

Machine Learning · Computer Science 2020-07-09 Thomas Baumhauer , Pascal Schöttle , Matthias Zeppelzauer

MultiDelete for Multimodal Machine Unlearning

Machine Unlearning removes specific knowledge about training data samples from an already trained model. It has significant practical benefits, such as purging private, inaccurate, or outdated information from trained models without the…

Artificial Intelligence · Computer Science 2025-04-10 Jiali Cheng , Hadi Amiri

Learn to Unlearn: A Survey on Machine Unlearning

Machine Learning (ML) models have been shown to potentially leak sensitive information, thus raising privacy concerns in ML-driven applications. This inspired recent research on removing the influence of specific data samples from a trained…

Machine Learning · Computer Science 2023-10-30 Youyang Qu , Xin Yuan , Ming Ding , Wei Ni , Thierry Rakotoarivelo , David Smith

Zero-Shot Machine Unlearning

Modern privacy regulations grant citizens the right to be forgotten by products, services and companies. In case of machine learning (ML) applications, this necessitates deletion of data not only from storage archives but also from ML…

Machine Learning · Computer Science 2023-06-01 Vikram S Chundawat , Ayush K Tarun , Murari Mandal , Mohan Kankanhalli

Distribution-Level Feature Distancing for Machine Unlearning: Towards a Better Trade-off Between Model Utility and Forgetting

With the explosive growth of deep learning applications and increasing privacy concerns, the right to be forgotten has become a critical requirement in various AI industries. For example, given a facial recognition system, some individuals…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Dasol Choi , Dongbin Na

Machine Unlearning of Features and Labels

Removing information from a machine learning model is a non-trivial task that requires to partially revert the training process. This task is unavoidable when sensitive data, such as credit card numbers or passwords, accidentally enter the…

Machine Learning · Computer Science 2023-08-08 Alexander Warnecke , Lukas Pirch , Christian Wressnegger , Konrad Rieck

FUNU: Boosting Machine Unlearning Efficiency by Filtering Unnecessary Unlearning

Machine unlearning is an emerging field that selectively removes specific data samples from a trained model. This capability is crucial for addressing privacy concerns, complying with data protection regulations, and correcting errors or…

Machine Learning · Computer Science 2025-01-29 Zitong Li , Qingqing Ye , Haibo Hu

Descent-to-Delete: Gradient-Based Methods for Machine Unlearning

We study the data deletion problem for convex models. By leveraging techniques from convex optimization and reservoir sampling, we give the first data deletion algorithms that are able to handle an arbitrarily long sequence of adversarial…

Machine Learning · Statistics 2020-07-07 Seth Neel , Aaron Roth , Saeed Sharifi-Malvajerdi

Data Deletion for Linear Regression with Noisy SGD

In the current era of big data and machine learning, it's essential to find ways to shrink the size of training dataset while preserving the training performance to improve efficiency. However, the challenge behind it includes providing…

Machine Learning · Statistics 2024-10-15 Zhangjie Xia , Chi-Hua Wang , Guang Cheng

Machine Unlearning Method Based On Projection Residual

Machine learning models (mainly neural networks) are used more and more in real life. Users feed their data to the model for training. But these processes are often one-way. Once trained, the model remembers the data. Even when data is…

Machine Learning · Computer Science 2022-10-03 Zihao Cao , Jianzong Wang , Shijing Si , Zhangcheng Huang , Jing Xiao