English

Data Augmentation Improves Machine Unlearning

Machine Learning 2025-08-27 v1 Artificial Intelligence

Abstract

Machine Unlearning (MU) aims to remove the influence of specific data from a trained model while preserving its performance on the remaining data. Although a few works suggest connections between memorisation and augmentation, the role of systematic augmentation design in MU remains under-investigated. In this work, we investigate the impact of different data augmentation strategies on the performance of unlearning methods, including SalUn, Random Label, and Fine-Tuning. Experiments conducted on CIFAR-10 and CIFAR-100, under varying forget rates, show that proper augmentation design can significantly improve unlearning effectiveness, reducing the performance gap to retrained models. Results showed a reduction of up to 40.12% of the Average Gap unlearning Metric, when using TrivialAug augmentation. Our results suggest that augmentation not only helps reduce memorization but also plays a crucial role in achieving privacy-preserving and efficient unlearning.

Keywords

Cite

@article{arxiv.2508.18502,
  title  = {Data Augmentation Improves Machine Unlearning},
  author = {Andreza M. C. Falcao and Filipe R. Cordeiro},
  journal= {arXiv preprint arXiv:2508.18502},
  year   = {2025}
}

Comments

Paper accepted at SIBGRAPI'25

R2 v1 2026-07-01T05:05:30.294Z