Related papers: Is Private Learning Possible with Instance Encodin…

InstaHide's Sample Complexity When Mixing Two Private Images

Training neural networks usually require large numbers of sensitive training data, and how to protect the privacy of training data has thus become a critical topic in deep learning research. InstaHide is a state-of-the-art scheme to protect…

Machine Learning · Computer Science 2024-02-07 Baihe Huang , Zhao Song , Runzhou Tao , Junze Yin , Ruizhe Zhang , Danyang Zhuo

Personalized and Private Peer-to-Peer Machine Learning

The rise of connected personal devices together with privacy concerns call for machine learning algorithms capable of leveraging the data of a large number of agents to learn personalized models under strong privacy requirements. In this…

Machine Learning · Computer Science 2018-02-20 Aurélien Bellet , Rachid Guerraoui , Mahsa Taziki , Marc Tommasi

Private Learning with Public Features

We study a class of private learning problems in which the data is a join of private and public features. This is often the case in private personalization tasks such as recommendation or ad prediction, in which features related to…

Machine Learning · Computer Science 2023-10-25 Walid Krichene , Nicolas Mayoraz , Steffen Rendle , Shuang Song , Abhradeep Thakurta , Li Zhang

InstaHide: Instance-hiding Schemes for Private Distributed Learning

How can multiple distributed entities collaboratively train a shared deep net on their private data while preserving privacy? This paper introduces InstaHide, a simple encryption of training images, which can be plugged into existing…

Cryptography and Security · Computer Science 2021-02-25 Yangsibo Huang , Zhao Song , Kai Li , Sanjeev Arora

Learning Privacy Preserving Encodings through Adversarial Training

We present a framework to learn privacy-preserving encodings of images that inhibit inference of chosen private attributes, while allowing recovery of other desirable information. Rather than simply inhibiting a given fixed pre-trained…

Machine Learning · Computer Science 2018-12-06 Francesco Pittaluga , Sanjeev J. Koppal , Ayan Chakrabarti

How Private is Your Attention? Bridging Privacy with In-Context Learning

In-context learning (ICL)-the ability of transformer-based models to perform new tasks from examples provided at inference time-has emerged as a hallmark of modern language models. While recent works have investigated the mechanisms…

Machine Learning · Statistics 2025-04-23 Soham Bonnerjee , Zhen Wei , Yeon , Anna Asch , Sagnik Nandy , Promit Ghosal

Privacy-preserving Machine Learning through Data Obfuscation

As machine learning becomes a practice and commodity, numerous cloud-based services and frameworks are provided to help customers develop and deploy machine learning applications. While it is prevalent to outsource model training and…

Cryptography and Security · Computer Science 2018-07-16 Tianwei Zhang , Zecheng He , Ruby B. Lee

Quantum Privacy-Preserving Perceptron

With the extensive applications of machine learning, the issue of private or sensitive data in the training examples becomes more and more serious: during the training process, personal information or habits may be disclosed to unexpected…

Quantum Physics · Physics 2017-08-01 Shenggang Ying , Mingsheng Ying , Yuan Feng

Efficient Private Storage of Sparse Machine Learning Data

We consider the problem of maintaining sparsity in private distributed storage of confidential machine learning data. In many applications, e.g., face recognition, the data used in machine learning algorithms is represented by sparse…

Information Theory · Computer Science 2022-06-15 Marvin Xhemrishi , Maximilian Egger , Rawad Bitar

On InstaHide, Phase Retrieval, and Sparse Matrix Factorization

In this work, we examine the security of InstaHide, a scheme recently proposed by [Huang, Song, Li and Arora, ICML'20] for preserving the security of private datasets in the context of distributed learning. To generate a synthetic training…

Machine Learning · Computer Science 2021-03-26 Sitan Chen , Xiaoxiao Li , Zhao Song , Danyang Zhuo

Pre-trained Encoders in Self-Supervised Learning Improve Secure and Privacy-preserving Supervised Learning

Classifiers in supervised learning have various security and privacy issues, e.g., 1) data poisoning attacks, backdoor attacks, and adversarial examples on the security side as well as 2) inference attacks and the right to be forgotten for…

Cryptography and Security · Computer Science 2022-12-08 Hongbin Liu , Wenjie Qu , Jinyuan Jia , Neil Zhenqiang Gong

Privacy-Preserving Machine Learning for Collaborative Data Sharing via Auto-encoder Latent Space Embeddings

Privacy-preserving machine learning in data-sharing processes is an ever-critical task that enables collaborative training of Machine Learning (ML) models without the need to share the original data sources. It is especially relevant when…

Machine Learning · Computer Science 2022-11-14 Ana María Quintero-Ossa , Jesús Solano , Hernán Jarcía , David Zarruk , Alejandro Correa Bahnsen , Carlos Valencia

What Can We Learn Privately?

Learning problems form an important category of computational tasks that generalizes many of the computations researchers apply to large real-life data sets. We ask: what concept classes can be learned privately, namely, by an algorithm…

Machine Learning · Computer Science 2012-10-10 Shiva Prasad Kasiviswanathan , Homin K. Lee , Kobbi Nissim , Sofya Raskhodnikova , Adam Smith

Private Inference in Quantized Models

A typical setup in many machine learning scenarios involves a server that holds a model and a user that possesses data, and the challenge is to perform inference while safeguarding the privacy of both parties. Private Inference has been…

Information Theory · Computer Science 2023-11-27 Zirui Deng , Vinayak Ramkumar , Rawad Bitar , Netanel Raviv

Prive-HD: Privacy-Preserved Hyperdimensional Computing

The privacy of data is a major challenge in machine learning as a trained model may expose sensitive information of the enclosed dataset. Besides, the limited computation capability and capacity of edge devices have made cloud-hosted…

Machine Learning · Computer Science 2020-05-15 Behnam Khaleghi , Mohsen Imani , Tajana Rosing

CodedPrivateML: A Fast and Privacy-Preserving Framework for Distributed Machine Learning

How to train a machine learning model while keeping the data private and secure? We present CodedPrivateML, a fast and scalable approach to this critical problem. CodedPrivateML keeps both the data and the model information-theoretically…

Machine Learning · Computer Science 2021-02-23 Jinhyun So , Basak Guler , A. Salman Avestimehr

Public Data Assisted Differentially Private In-Context Learning

In-context learning (ICL) in Large Language Models (LLMs) has shown remarkable performance across various tasks without requiring fine-tuning. However, recent studies have highlighted the risk of private data leakage through the prompt in…

Artificial Intelligence · Computer Science 2025-09-16 Seongho Joo , Hyukhun Koh , Kyomin Jung

Trade-offs between membership privacy & adversarially robust learning

Historically, machine learning methods have not been designed with security in mind. In turn, this has given rise to adversarial examples, carefully perturbed input samples aimed to mislead detection at test time, which have been applied to…

Machine Learning · Computer Science 2022-01-11 Jamie Hayes

Training Private Models That Know What They Don't Know

Training reliable deep learning models which avoid making overconfident but incorrect predictions is a longstanding challenge. This challenge is further exacerbated when learning has to be differentially private: protection provided to…

Machine Learning · Computer Science 2023-05-31 Stephan Rabanser , Anvith Thudi , Abhradeep Thakurta , Krishnamurthy Dvijotham , Nicolas Papernot

The Trade-Offs of Private Prediction

Machine learning models leak information about their training data every time they reveal a prediction. This is problematic when the training data needs to remain private. Private prediction methods limit how much information about the…

Machine Learning · Computer Science 2020-07-13 Laurens van der Maaten , Awni Hannun