Related papers: Differentially Private Attention Computation

Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy…

Computation and Language · Computer Science 2024-08-19 Lynn Chua , Badih Ghazi , Yangsibo Huang , Pritish Kamath , Ravi Kumar , Daogao Liu , Pasin Manurangsi , Amer Sinha , Chiyuan Zhang

Can Differentially Private Fine-tuning LLMs Protect Against Privacy Attacks?

Fine-tuning large language models (LLMs) has become an essential strategy for adapting them to specialized tasks; however, this process introduces significant privacy challenges, as sensitive training data may be inadvertently memorized and…

Cryptography and Security · Computer Science 2025-05-02 Hao Du , Shang Liu , Yang Cao

Privacy Preserving In-Context-Learning Framework for Large Language Models

Large language models (LLMs) have significantly transformed natural language understanding and generation, but they raise privacy concerns due to potential exposure of sensitive information. Studies have highlighted the risk of information…

Machine Learning · Computer Science 2025-11-20 Bishnu Bhusal , Manoj Acharya , Ramneet Kaur , Colin Samplawski , Anirban Roy , Adam D. Cobb , Rohit Chadha , Susmit Jha

Improved Algorithms for Differentially Private Language Model Alignment

Language model alignment is crucial for ensuring that large language models (LLMs) align with human preferences, yet it often involves sensitive user data, raising significant privacy concerns. While prior work has integrated differential…

Cryptography and Security · Computer Science 2025-05-15 Keyu Chen , Hao Tang , Qinglin Liu , Yizhao Xu

Privately Fine-Tuning Large Language Models with Differential Privacy

Pre-trained Large Language Models (LLMs) are an integral part of modern AI that have led to breakthrough performances in complex AI tasks. Major AI companies with expensive infrastructures are able to develop and train these large models…

Cryptography and Security · Computer Science 2023-05-02 Rouzbeh Behnia , Mohamamdreza Ebrahimi , Jason Pacheco , Balaji Padmanabhan

Privacy-Preserving Large Language Models: Mechanisms, Applications, and Future Directions

The rapid advancement of large language models (LLMs) has revolutionized natural language processing, enabling applications in diverse domains such as healthcare, finance and education. However, the growing reliance on extensive data for…

Cryptography and Security · Computer Science 2024-12-10 Guoshenghui Zhao , Eric Song

Efficient and Private: Memorisation under differentially private parameter-efficient fine-tuning in language models

Fine-tuning large language models (LLMs) for specific tasks introduces privacy risks, as models may inadvertently memorise and leak sensitive training data. While Differential Privacy (DP) offers a solution to mitigate these risks, it…

Machine Learning · Computer Science 2024-11-26 Olivia Ma , Jonathan Passerat-Palmbach , Dmitrii Usynin

Privacy-preserving Fine-tuning of Large Language Models through Flatness

The privacy concerns associated with the use of Large Language Models (LLMs) have grown recently with the development of LLMs such as ChatGPT. Differential Privacy (DP) techniques are explored in existing work to mitigate their privacy…

Artificial Intelligence · Computer Science 2024-03-08 Tiejin Chen , Longchao Da , Huixue Zhou , Pingzhi Li , Kaixiong Zhou , Tianlong Chen , Hua Wei

Differentially Private Decoding in Large Language Models

Recent large-scale natural language processing (NLP) systems use a pre-trained Large Language Model (LLM) on massive and diverse corpora as a headstart. In practice, the pre-trained model is adapted to a wide array of tasks via fine-tuning…

Computation and Language · Computer Science 2022-09-12 Jimit Majmudar , Christophe Dupuy , Charith Peris , Sami Smaili , Rahul Gupta , Richard Zemel

DPrivBench: Benchmarking LLMs' Reasoning for Differential Privacy

Differential privacy (DP) has a wide range of applications for protecting data privacy, but designing and verifying DP algorithms requires expert-level reasoning, creating a high barrier for non-expert practitioners. Prior works either rely…

Machine Learning · Computer Science 2026-05-19 Erchi Wang , Pengrun Huang , Eli Chien , Om Thakkar , Kamalika Chaudhuri , Yu-Xiang Wang , Ruihan Wu

Context-Aware Differential Privacy for Language Modeling

The remarkable ability of language models (LMs) has also brought challenges at the interface of AI and security. A critical challenge pertains to how much information these models retain and leak about the training data. This is…

Machine Learning · Computer Science 2023-01-31 My H. Dinh , Ferdinando Fioretto

Public Data Assisted Differentially Private In-Context Learning

In-context learning (ICL) in Large Language Models (LLMs) has shown remarkable performance across various tasks without requiring fine-tuning. However, recent studies have highlighted the risk of private data leakage through the prompt in…

Artificial Intelligence · Computer Science 2025-09-16 Seongho Joo , Hyukhun Koh , Kyomin Jung

DP-MemArc: Differential Privacy Transfer Learning for Memory Efficient Language Models

Large language models have repeatedly shown outstanding performance across diverse applications. However, deploying these models can inadvertently risk user privacy. The significant memory demands during training pose a major challenge in…

Cryptography and Security · Computer Science 2025-02-21 Yanming Liu , Xinyue Peng , Yuwei Zhang , Xiaolan Ke , Songhang Deng , Jiannan Cao , Chen Ma , Mengchen Fu , Tianyu Du , Sheng Cheng , Xun Wang , Jianwei Yin , Xuhong Zhang

Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions

Large Language Models (LLMs) represent a significant advancement in artificial intelligence, finding applications across various domains. However, their reliance on massive internet-sourced datasets for training brings notable privacy…

Cryptography and Security · Computer Science 2025-02-11 Michele Miranda , Elena Sofia Ruzzetti , Andrea Santilli , Fabio Massimo Zanzotto , Sébastien Bratières , Emanuele Rodolà

Differentially Private Natural Language Models: Recent Advances and Future Directions

Recent developments in deep learning have led to great success in various natural language processing (NLP) tasks. However, these applications may involve data that contain sensitive information. Therefore, how to achieve good performance…

Computation and Language · Computer Science 2023-10-24 Lijie Hu , Ivan Habernal , Lei Shen , Di Wang

Scaling Laws for Differentially Private Language Models

Scaling laws have emerged as important components of large language model (LLM) training as they can predict performance gains through scale, and provide guidance on important hyper-parameter choices that would otherwise be expensive. LLMs…

Machine Learning · Computer Science 2025-02-03 Ryan McKenna , Yangsibo Huang , Amer Sinha , Borja Balle , Zachary Charles , Christopher A. Choquette-Choo , Badih Ghazi , George Kaissis , Ravi Kumar , Ruibo Liu , Da Yu , Chiyuan Zhang

Fine-Tuning Large Language Models with User-Level Differential Privacy

We investigate practical and scalable algorithms for training large language models (LLMs) with user-level differential privacy (DP) in order to provably safeguard all the examples contributed by each user. We study two variants of DP-SGD…

Machine Learning · Computer Science 2024-07-11 Zachary Charles , Arun Ganesh , Ryan McKenna , H. Brendan McMahan , Nicole Mitchell , Krishna Pillutla , Keith Rush

Differentially Private Language Models Benefit from Public Pre-training

Language modeling is a keystone task in natural language processing. When training a language model on sensitive information, differential privacy (DP) allows us to quantify the degree to which our private data is protected. However,…

Machine Learning · Computer Science 2020-10-27 Gavin Kerrigan , Dylan Slack , Jens Tuyls

Privacy Preserving Large Language Models: ChatGPT Case Study Based Vision and Framework

The generative Artificial Intelligence (AI) tools based on Large Language Models (LLMs) use billions of parameters to extensively analyse large datasets and extract critical private information such as, context, specific details,…

Cryptography and Security · Computer Science 2023-10-20 Imdad Ullah , Najm Hassan , Sukhpal Singh Gill , Basem Suleiman , Tariq Ahamed Ahanger , Zawar Shah , Junaid Qadir , Salil S. Kanhere

Does Differential Privacy Impact Bias in Pretrained NLP Models?

Differential privacy (DP) is applied when fine-tuning pre-trained large language models (LLMs) to limit leakage of training examples. While most DP research has focused on improving a model's privacy-utility tradeoff, some find that DP can…

Computation and Language · Computer Science 2024-10-25 Md. Khairul Islam , Andrew Wang , Tianhao Wang , Yangfeng Ji , Judy Fox , Jieyu Zhao