Related papers: Differentially Private Model Compression

Differentially Private Fine-tuning of Language Models

We give simpler, sparser, and faster algorithms for differentially private fine-tuning of large-scale pre-trained language models, which achieve the state-of-the-art privacy versus utility tradeoffs on many standard NLP tasks. We propose a…

Machine Learning · Computer Science 2022-07-18 Da Yu, Saurabh Naik, Arturs Backurs, Sivakanth Gopi, Huseyin A. Inan, Gautam Kamath, Janardhan Kulkarni, Yin Tat Lee, Andre Manoel, Lukas Wutschitz, Sergey Yekhanin, Huishuai Zhang

Exploring Extreme Parameter Compression for Pre-trained Language Models

Recent work explored the potential of large-scale Transformer-based pre-trained models, especially Pre-trained Language Models (PLMs) in natural language processing. This raises many concerns from various perspectives, e.g., financial costs…

Computation and Language · Computer Science 2022-05-23 Yuxin Ren, Benyou Wang, Lifeng Shang, Xin Jiang, Qun Liu

Compressing Large-Scale Transformer-Based Models: A Case Study on BERT

Pre-trained Transformer-based models have achieved state-of-the-art performance for various Natural Language Processing (NLP) tasks. However, these models often have billions of parameters, and, thus, are too resource-hungry and…

Machine Learning · Computer Science 2021-09-29 Prakhar Ganesh, Yao Chen, Xin Lou, Mohammad Ali Khan, Yin Yang, Hassan Sajjad, Preslav Nakov, Deming Chen, Marianne Winslett

Differential Privacy, Linguistic Fairness, and Training Data Influence: Impossibility and Possibility Theorems for Multilingual Language Models

Language models such as mBERT, XLM-R, and BLOOM aim to achieve multilingual generalization or compression to facilitate transfer to a large number of (potentially unseen) languages. However, these models should ideally also be private,…

Computation and Language · Computer Science 2023-08-21 Phillip Rust, Anders Søgaard

A Short Study on Compressing Decoder-Based Language Models

Pre-trained Language Models (PLMs) have been successful for a wide range of natural language processing (NLP) tasks. The state-of-the-art of PLMs, however, are extremely large to be used on edge devices. As a result, the topic of model…

Computation and Language · Computer Science 2021-10-19 Tianda Li, Yassir El Mesbahi, Ivan Kobyzev, Ahmad Rashid, Atif Mahmud, Nithin Anchuri, Habib Hajimolahoseini, Yang Liu, Mehdi Rezagholizadeh

Benchmarking Differential Privacy and Federated Learning for BERT Models

Natural Language Processing (NLP) techniques can be applied to help with the diagnosis of medical conditions such as depression, using a collection of a person's utterances. Depression is a serious medical illness that can have adverse…

Computation and Language · Computer Science 2022-06-17 Priyam Basu, Tiasa Singha Roy, Rakshit Naidu, Zumrut Muftuoglu, Sahib Singh, Fatemehsadat Mireshghallah

Privacy-preserving Fine-tuning of Large Language Models through Flatness

The privacy concerns associated with the use of Large Language Models (LLMs) have grown recently with the development of LLMs such as ChatGPT. Differential Privacy (DP) techniques are explored in existing work to mitigate their privacy…

Artificial Intelligence · Computer Science 2024-03-08 Tiejin Chen, Longchao Da, Huixue Zhou, Pingzhi Li, Kaixiong Zhou, Tianlong Chen, Hua Wei

Natural Language Understanding with Privacy-Preserving BERT

Privacy preservation remains a key challenge in data mining and Natural Language Understanding (NLU). Previous research shows that the input text or even text embeddings can leak private information. This concern motivates our research on…

Computation and Language · Computer Science 2021-08-23 Chen Qu, Weize Kong, Liu Yang, Mingyang Zhang, Michael Bendersky, Marc Najork

Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Fine-tuning large language models (LLMs) for specific tasks introduces privacy risks, as models may inadvertently memorise and leak sensitive training data. While Differential Privacy (DP) offers a solution to mitigate these risks, it…

Machine Learning · Computer Science 2024-11-26 Olivia Ma, Jonathan Passerat-Palmbach, Dmitrii Usynin

Differentially Private Decoding in Large Language Models

Recent large-scale natural language processing (NLP) systems use a pre-trained Large Language Model (LLM) on massive and diverse corpora as a headstart. In practice, the pre-trained model is adapted to a wide array of tasks via fine-tuning…

Computation and Language · Computer Science 2022-09-12 Jimit Majmudar, Christophe Dupuy, Charith Peris, Sami Smaili, Rahul Gupta, Richard Zemel

One size does not fit all: Investigating strategies for differentially-private learning across NLP tasks

Preserving privacy in contemporary NLP models allows us to work with sensitive data, but unfortunately comes at a price. We know that stricter privacy guarantees in differentially-private stochastic gradient descent (DP-SGD) generally…

Computation and Language · Computer Science 2023-02-01 Manuel Senge, Timour Igamberdiev, Ivan Habernal

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

Increasing model size when pretraining natural language representations often results in improved performance on downstream tasks. However, at some point further model increases become harder due to GPU/TPU memory limitations and longer…

Computation and Language · Computer Science 2020-02-11 Zhenzhong Lan, Mingda Chen, Sebastian Goodman, Kevin Gimpel, Piyush Sharma, Radu Soricut

Privately Fine-Tuning Large Language Models with Differential Privacy

Pre-trained Large Language Models (LLMs) are an integral part of modern AI that have led to breakthrough performances in complex AI tasks. Major AI companies with expensive infrastructures are able to develop and train these large models…

Cryptography and Security · Computer Science 2023-05-02 Rouzbeh Behnia, Mohamamdreza Ebrahimi, Jason Pacheco, Balaji Padmanabhan

Fine-Tuning Large Language Models with User-Level Differential Privacy

We investigate practical and scalable algorithms for training large language models (LLMs) with user-level differential privacy (DP) in order to provably safeguard all the examples contributed by each user. We study two variants of DP-SGD…

Machine Learning · Computer Science 2024-07-11 Zachary Charles, Arun Ganesh, Ryan McKenna, H. Brendan McMahan, Nicole Mitchell, Krishna Pillutla, Keith Rush

Efficient Fine-Tuning of Compressed Language Models with Learners

Fine-tuning BERT-based models is resource-intensive in memory, computation, and time. While many prior works aim to improve inference efficiency via compression techniques, e.g., pruning, these works do not explicitly address the…

Computation and Language · Computer Science 2022-08-04 Danilo Vucetic, Mohammadreza Tayaranian, Maryam Ziaeefard, James J. Clark, Brett H. Meyer, Warren J. Gross

Differentially Private Distributed Learning for Language Modeling Tasks

One of the big challenges in machine learning applications is that training data can be different from the real-world data faced by the algorithm. In language modeling, users' language (e.g. in private messaging) could change in a year and…

Computation and Language · Computer Science 2018-03-07 Vadim Popov, Mikhail Kudinov, Irina Piontkovskaya, Petr Vytovtov, Alex Nevidomsky

How to Prune Your Language Model: Recovering Accuracy on the "Sparsity May Cry" Benchmark

Pruning large language models (LLMs) from the BERT family has emerged as a standard compression benchmark, and several pruning methods have been proposed for this task. The recent "Sparsity May Cry" (SMC) benchmark put into question the…

Computation and Language · Computer Science 2023-12-22 Eldar Kurtic, Torsten Hoefler, Dan Alistarh

Large Language Models Can Be Strong Differentially Private Learners

Differentially Private (DP) learning has seen limited success for building large deep learning models of text, and straightforward attempts at applying Differentially Private Stochastic Gradient Descent (DP-SGD) to NLP tasks have resulted…

Machine Learning · Computer Science 2022-11-11 Xuechen Li, Florian Tramèr, Percy Liang, Tatsunori Hashimoto

Towards the Anonymization of the Language Modeling

Rapid advances in Natural Language Processing (NLP) have revolutionized many fields, including healthcare. However, these advances raise significant privacy concerns, especially when pre-trained models fine-tuned and specialized on…

Computation and Language · Computer Science 2026-05-21 Antoine Boutet, Lucas Magnana, Juliette Sénéchal

Characterizing Stereotypical Bias from Privacy-preserving Pre-Training

Differential Privacy (DP) can be applied to raw text by exploiting the spatial arrangement of words in an embedding space. We investigate the implications of such text privatization on Language Models (LMs) and their tendency towards…

Computation and Language · Computer Science 2024-07-02 Stefan Arnold, Rene Gröbner, Annika Schreiner