English

Generating Artificial Data for Private Deep Learning

Machine Learning 2019-04-30 v3 Cryptography and Security Machine Learning

Abstract

In this paper, we propose generating artificial data that retain statistical properties of real data as the means of providing privacy with respect to the original dataset. We use generative adversarial network to draw privacy-preserving artificial data samples and derive an empirical method to assess the risk of information disclosure in a differential-privacy-like way. Our experiments show that we are able to generate artificial data of high quality and successfully train and validate machine learning models on this data while limiting potential privacy loss.

Keywords

Cite

@article{arxiv.1803.03148,
  title  = {Generating Artificial Data for Private Deep Learning},
  author = {Aleksei Triastcyn and Boi Faltings},
  journal= {arXiv preprint arXiv:1803.03148},
  year   = {2019}
}

Comments

Privacy-Enhancing Artificial Intelligence and Language Technologies, AAAI Spring Symposium Series, 2019

R2 v1 2026-06-23T00:46:41.307Z