Related papers: Differentially Private Decoding in Large Language …

Privacy-Preserving Large Language Models: Mechanisms, Applications, and Future Directions

The rapid advancement of large language models (LLMs) has revolutionized natural language processing, enabling applications in diverse domains such as healthcare, finance and education. However, the growing reliance on extensive data for…

Cryptography and Security · Computer Science 2024-12-10 Guoshenghui Zhao , Eric Song

Privately Fine-Tuning Large Language Models with Differential Privacy

Pre-trained Large Language Models (LLMs) are an integral part of modern AI that have led to breakthrough performances in complex AI tasks. Major AI companies with expensive infrastructures are able to develop and train these large models…

Cryptography and Security · Computer Science 2023-05-02 Rouzbeh Behnia , Mohamamdreza Ebrahimi , Jason Pacheco , Balaji Padmanabhan

Privacy Preserving In-Context-Learning Framework for Large Language Models

Large language models (LLMs) have significantly transformed natural language understanding and generation, but they raise privacy concerns due to potential exposure of sensitive information. Studies have highlighted the risk of information…

Machine Learning · Computer Science 2025-11-20 Bishnu Bhusal , Manoj Acharya , Ramneet Kaur , Colin Samplawski , Anirban Roy , Adam D. Cobb , Rohit Chadha , Susmit Jha

Differentially Private Language Models Benefit from Public Pre-training

Language modeling is a keystone task in natural language processing. When training a language model on sensitive information, differential privacy (DP) allows us to quantify the degree to which our private data is protected. However,…

Machine Learning · Computer Science 2020-10-27 Gavin Kerrigan , Dylan Slack , Jens Tuyls

Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions

Large Language Models (LLMs) represent a significant advancement in artificial intelligence, finding applications across various domains. However, their reliance on massive internet-sourced datasets for training brings notable privacy…

Cryptography and Security · Computer Science 2025-02-11 Michele Miranda , Elena Sofia Ruzzetti , Andrea Santilli , Fabio Massimo Zanzotto , Sébastien Bratières , Emanuele Rodolà

Privacy-Preserving Models for Legal Natural Language Processing

Pre-training large transformer models with in-domain data improves domain adaptation and helps gain performance on the domain-specific downstream tasks. However, sharing models pre-trained on potentially sensitive data is prone to…

Computation and Language · Computer Science 2025-08-14 Ying Yin , Ivan Habernal

Differentially Private Natural Language Models: Recent Advances and Future Directions

Recent developments in deep learning have led to great success in various natural language processing (NLP) tasks. However, these applications may involve data that contain sensitive information. Therefore, how to achieve good performance…

Computation and Language · Computer Science 2023-10-24 Lijie Hu , Ivan Habernal , Lei Shen , Di Wang

Can Differentially Private Fine-tuning LLMs Protect Against Privacy Attacks?

Fine-tuning large language models (LLMs) has become an essential strategy for adapting them to specialized tasks; however, this process introduces significant privacy challenges, as sensitive training data may be inadvertently memorized and…

Cryptography and Security · Computer Science 2025-05-02 Hao Du , Shang Liu , Yang Cao

On Protecting the Data Privacy of Large Language Models (LLMs): A Survey

Large language models (LLMs) are complex artificial intelligence systems capable of understanding, generating and translating human language. They learn language patterns by analyzing large amounts of text data, allowing them to perform…

Cryptography and Security · Computer Science 2024-03-15 Biwei Yan , Kun Li , Minghui Xu , Yueyan Dong , Yue Zhang , Zhaochun Ren , Xiuzhen Cheng

Privacy Preserving Large Language Models: ChatGPT Case Study Based Vision and Framework

The generative Artificial Intelligence (AI) tools based on Large Language Models (LLMs) use billions of parameters to extensively analyse large datasets and extract critical private information such as, context, specific details,…

Cryptography and Security · Computer Science 2023-10-20 Imdad Ullah , Najm Hassan , Sukhpal Singh Gill , Basem Suleiman , Tariq Ahamed Ahanger , Zawar Shah , Junaid Qadir , Salil S. Kanhere

Locally Differentially Private In-Context Learning

Large pretrained language models (LLMs) have shown surprising In-Context Learning (ICL) ability. An important application in deploying large language models is to augment LLMs with a private database for some specific task. The main problem…

Cryptography and Security · Computer Science 2024-05-09 Chunyan Zheng , Keke Sun , Wenhao Zhao , Haibo Zhou , Lixin Jiang , Shaoyang Song , Chunlai Zhou

You Are What You Write: Preserving Privacy in the Era of Large Language Models

Large scale adoption of large language models has introduced a new era of convenient knowledge transfer for a slew of natural language processing tasks. However, these models also run the risk of undermining user trust by exposing unwanted…

Computation and Language · Computer Science 2022-04-21 Richard Plant , Valerio Giuffrida , Dimitra Gkatzia

Does Differential Privacy Impact Bias in Pretrained NLP Models?

Differential privacy (DP) is applied when fine-tuning pre-trained large language models (LLMs) to limit leakage of training examples. While most DP research has focused on improving a model's privacy-utility tradeoff, some find that DP can…

Computation and Language · Computer Science 2024-10-25 Md. Khairul Islam , Andrew Wang , Tianhao Wang , Yangfeng Ji , Judy Fox , Jieyu Zhao

Privacy in Fine-tuning Large Language Models: Attacks, Defenses, and Future Directions

Fine-tuning has emerged as a critical process in leveraging Large Language Models (LLMs) for specific downstream tasks, enabling these models to achieve state-of-the-art performance across various domains. However, the fine-tuning process…

Artificial Intelligence · Computer Science 2025-04-08 Hao Du , Shang Liu , Lele Zheng , Yang Cao , Atsuyoshi Nakamura , Lei Chen

Open LLMs are Necessary for Current Private Adaptations and Outperform their Closed Alternatives

While open Large Language Models (LLMs) have made significant progress, they still fall short of matching the performance of their closed, proprietary counterparts, making the latter attractive even for the use on highly private data.…

Machine Learning · Computer Science 2024-11-18 Vincent Hanke , Tom Blanchard , Franziska Boenisch , Iyiola Emmanuel Olatunji , Michael Backes , Adam Dziedzic

Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy…

Computation and Language · Computer Science 2024-08-19 Lynn Chua , Badih Ghazi , Yangsibo Huang , Pritish Kamath , Ravi Kumar , Daogao Liu , Pasin Manurangsi , Amer Sinha , Chiyuan Zhang

Assessing and Mitigating Data Memorization Risks in Fine-Tuned Large Language Models

Large Language Models (LLMs) have demonstrated remarkable capabilities across diverse natural language processing tasks, but their tendency to memorize training data poses significant privacy risks, particularly during fine-tuning…

Computation and Language · Computer Science 2025-08-21 Badrinath Ramakrishnan , Akshaya Balaji

DP-SelFT: Differentially Private Selective Fine-Tuning for Large Language Models

Large language models (LLMs) are commonly adapted to downstream tasks through fine-tuning, but fine-tuning data often contains sensitive information that may be leaked by the resulting model. Differential privacy (DP) offers formal…

Machine Learning · Computer Science 2026-05-19 Haichao Sha , Zihao Wang , Yuncheng Wu , Hong Chen , Wei Dong

Differentially Private Attention Computation

Large language models (LLMs), especially those based on the Transformer architecture, have had a profound impact on various aspects of daily life, such as natural language processing, content generation, research methodologies, and more.…

Machine Learning · Computer Science 2024-10-15 Yeqi Gao , Zhao Song , Xin Yang , Yufa Zhou

Learning Differentially Private Recurrent Language Models

We demonstrate that it is possible to train large recurrent language models with user-level differential privacy guarantees with only a negligible cost in predictive accuracy. Our work builds on recent advances in the training of deep…

Machine Learning · Computer Science 2018-02-27 H. Brendan McMahan , Daniel Ramage , Kunal Talwar , Li Zhang