Related papers: Generating private data with user customization

Distributed generation of privacy preserving data with user customization

Distributed devices such as mobile phones can produce and store large amounts of data that can enhance machine learning models; however, this data may contain private information specific to the data owner that prevents the release of the…

Machine Learning · Computer Science 2019-04-23 Xiao Chen, Thomas Navidi, Stefano Ermon, Ram Rajagopal

Differentially Private Synthetic Data Generation via Lipschitz-Regularised Variational Autoencoders

Synthetic data has been hailed as the silver bullet for privacy preserving data analysis. If a record is not real, then how could it violate a person's privacy? In addition, deep-learning based generative models are employed successfully to…

Machine Learning · Computer Science 2023-07-14 Benedikt Groß, Gerhard Wunder

Privacy-preserving Generative Framework Against Membership Inference Attacks

Artificial intelligence and machine learning have been integrated into all aspects of our lives and the privacy of personal data has attracted more and more attention. Since the generation of the model needs to extract the effective…

Cryptography and Security · Computer Science 2022-02-14 Ruikang Yang, Jianfeng Ma, Yinbin Miao, Xindi Ma

A Generative Framework for Personalized Learning and Estimation: Theory, Algorithms, and Privacy

A distinguishing characteristic of federated learning is that the (local) client data could have statistical heterogeneity. This heterogeneity has motivated the design of personalized learning, where individual (personalized) models are…

Machine Learning · Computer Science 2022-07-06 Kaan Ozkara, Antonious M. Girgis, Deepesh Data, Suhas Diggavi

Personalized and Private Peer-to-Peer Machine Learning

The rise of connected personal devices together with privacy concerns call for machine learning algorithms capable of leveraging the data of a large number of agents to learn personalized models under strong privacy requirements. In this…

Machine Learning · Computer Science 2018-02-20 Aurélien Bellet, Rachid Guerraoui, Mahsa Taziki, Marc Tommasi

Private Set Generation with Discriminative Information

Differentially private data generation techniques have become a promising solution to the data privacy challenge -- it enables sharing of data while complying with rigorous privacy guarantees, which is essential for scientific progress in…

Cryptography and Security · Computer Science 2022-11-09 Dingfan Chen, Raouf Kerkouche, Mario Fritz

DPD-fVAE: Synthetic Data Generation Using Federated Variational Autoencoders With Differentially-Private Decoder

Federated learning (FL) is getting increased attention for processing sensitive, distributed datasets common to domains such as healthcare. Instead of directly training classification models on these datasets, recent works have considered…

Machine Learning · Computer Science 2022-11-22 Bjarne Pfitzner, Bert Arnrich

Causally Constrained Data Synthesis for Private Data Release

Making evidence based decisions requires data. However for real-world applications, the privacy of data is critical. Using synthetic data which reflects certain statistical properties of the original data preserves the privacy of the…

Machine Learning · Computer Science 2021-05-28 Varun Chandrasekaran, Darren Edge, Somesh Jha, Amit Sharma, Cheng Zhang, Shruti Tople

Generating Artificial Data for Private Deep Learning

In this paper, we propose generating artificial data that retain statistical properties of real data as the means of providing privacy with respect to the original dataset. We use generative adversarial network to draw privacy-preserving…

Machine Learning · Computer Science 2019-04-30 Aleksei Triastcyn, Boi Faltings

Private prediction for large-scale synthetic text generation

We present an approach for generating differentially private synthetic text using large language models (LLMs), via private prediction. In the private prediction framework, we only require the output synthetic data to satisfy differential…

Machine Learning · Computer Science 2024-10-10 Kareem Amin, Alex Bie, Weiwei Kong, Alexey Kurakin, Natalia Ponomareva, Umar Syed, Andreas Terzis, Sergei Vassilvitskii

imdpGAN: Generating Private and Specific Data with Generative Adversarial Networks

Generative Adversarial Network (GAN) and its variants have shown promising results in generating synthetic data. However, the issues with GANs are: (i) the learning happens around the training samples and the model often ends up remembering…

Computer Vision and Pattern Recognition · Computer Science 2020-09-30 Saurabh Gupta, Arun Balaji Buduru, Ponnurangam Kumaraguru

Anonymizing Sensor Data on the Edge: A Representation Learning and Transformation Approach

The abundance of data collected by sensors in Internet of Things (IoT) devices, and the success of deep neural networks in uncovering hidden patterns in time series data have led to mounting privacy concerns. This is because private and…

Machine Learning · Computer Science 2022-06-02 Omid Hajihassani, Omid Ardakanian, Hamzeh Khazaei

Differentially Private Mixture of Generative Neural Networks

Generative models are used in a wide range of applications building on large amounts of contextually rich information. Due to possible privacy violations of the individuals whose data is used to train these models, however, publishing or…

Machine Learning · Computer Science 2018-07-16 Gergely Acs, Luca Melis, Claude Castelluccia, Emiliano De Cristofaro

Learning Privacy-Preserving Student Networks via Discriminative-Generative Distillation

While deep models have proved successful in learning rich knowledge from massive well-annotated data, they may pose a privacy leakage risk in practical deployment. It is necessary to find an effective trade-off between high utility and…

Machine Learning · Computer Science 2024-09-05 Shiming Ge, Bochao Liu, Pengju Wang, Yong Li, Dan Zeng

Privacy Enhancing Machine Learning via Removal of Unwanted Dependencies

The rapid rise of IoT and Big Data has facilitated copious data driven applications to enhance our quality of life. However, the omnipresent and all-encompassing nature of the data collection can generate privacy concerns. Hence, there is a…

Machine Learning · Computer Science 2021-09-09 Mert Al, Semih Yagli, Sun-Yuan Kung

Generation Meets Recommendation: Proposing Novel Items for Groups of Users

Consider a movie studio aiming to produce a set of new movies for summer release: What types of movies it should produce? Who would the movies appeal to? How many movies should it make? Similar issues are encountered by a variety of…

Information Retrieval · Computer Science 2018-08-06 Vinh Vo Thanh, Harold Soh

PrE-Text: Training Language Models on Private Federated Data in the Age of LLMs

On-device training is currently the most common approach for training machine learning (ML) models on private, distributed user data. Despite this, on-device training has several drawbacks: (1) most user devices are too small to train large…

Machine Learning · Computer Science 2024-10-21 Charlie Hou, Akshat Shrivastava, Hongyuan Zhan, Rylan Conway, Trang Le, Adithya Sagar, Giulia Fanti, Daniel Lazar

Differentially Private Synthetic Medical Data Generation using Convolutional GANs

Deep learning models have demonstrated superior performance in several application problems, such as image classification and speech processing. However, creating a deep learning model using health record data requires addressing certain…

Machine Learning · Computer Science 2021-12-14 Amirsina Torfi, Edward A. Fox, Chandan K. Reddy

Privacy-Preserving Student Learning with Differentially Private Data-Free Distillation

Deep learning models can achieve high inference accuracy by extracting rich knowledge from massive well-annotated data, but may pose the risk of data privacy leakage in practical deployment. In this paper, we present an effective…

Machine Learning · Computer Science 2024-09-20 Bochao Liu, Jianghu Lu, Pengju Wang, Junjie Zhang, Dan Zeng, Zhenxing Qian, Shiming Ge

Federated Split GANs

Mobile devices and the immense amount and variety of data they generate are key enablers of machine learning (ML)-based applications. Traditional ML techniques have shifted toward new paradigms such as federated (FL) and split learning (SL)…

Machine Learning · Computer Science 2022-07-06 Pranvera Kortoçi, Yilei Liang, Pengyuan Zhou, Lik-Hang Lee, Abbas Mehrabi, Pan Hui, Sasu Tarkoma, Jon Crowcroft