Related papers: Differentially Private Set Union

Differentially private partition selection

Many data analysis operations can be expressed as a GROUP BY query on an unbounded set of partitions, followed by a per-partition aggregation. To make such a query differentially private, adding noise to each aggregation is not enough: we…

Cryptography and Security · Computer Science 2021-11-01 Damien Desfontaines , James Voss , Bryant Gipson , Chinmoy Mandayam

Practical Differentially Private Top-$k$ Selection with Pay-what-you-get Composition

We study the problem of top-$k$ selection over a large domain universe subject to user-level differential privacy. Typically, the exponential mechanism or report noisy max are the algorithms used to solve this problem. However, these…

Cryptography and Security · Computer Science 2019-09-19 David Durfee , Ryan Rogers

Scalable Private Partition Selection via Adaptive Weighting

In the differentially private partition selection problem (a.k.a. private set union, private key discovery), users hold subsets of items from an unbounded universe. The goal is to output as many items as possible from the union of the…

Data Structures and Algorithms · Computer Science 2025-08-12 Justin Y. Chen , Vincent Cohen-Addad , Alessandro Epasto , Morteza Zadimoghaddam

Learning discrete distributions: user vs item-level privacy

Much of the literature on differential privacy focuses on item-level privacy, where loosely speaking, the goal is to provide privacy per item or training example. However, recently many practical applications such as federated learning…

Machine Learning · Computer Science 2021-01-13 Yuhan Liu , Ananda Theertha Suresh , Felix Yu , Sanjiv Kumar , Michael Riley

Differentially Private n-gram Extraction

We revisit the problem of $n$-gram extraction in the differential privacy setting. In this problem, given a corpus of private text data, the goal is to release as many $n$-grams as possible while preserving user level privacy. Extracting…

Machine Learning · Computer Science 2021-08-09 Kunho Kim , Sivakanth Gopi , Janardhan Kulkarni , Sergey Yekhanin

Self-adaptive Privacy Concern Detection for User-generated Content

To protect user privacy in data analysis, a state-of-the-art strategy is differential privacy in which scientific noise is injected into the real analysis output. The noise masks individual's sensitive information contained in the dataset.…

Cryptography and Security · Computer Science 2018-06-20 Xuan-Son Vu , Lili Jiang

A Comparative Analysis of Word-Level Metric Differential Privacy: Benchmarking The Privacy-Utility Trade-off

The application of Differential Privacy to Natural Language Processing techniques has emerged in relevance in recent years, with an increasing number of studies published in established NLP outlets. In particular, the adaptation of…

Computation and Language · Computer Science 2024-04-05 Stephen Meisenbacher , Nihildev Nandakumar , Alexandra Klymenko , Florian Matthes

DP-SIPS: A simpler, more scalable mechanism for differentially private partition selection

Partition selection, or set union, is an important primitive in differentially private mechanism design: in a database where each user contributes a list of items, the goal is to publish as many of these items as possible under differential…

Cryptography and Security · Computer Science 2023-06-23 Marika Swanberg , Damien Desfontaines , Samuel Haney

Small Count Privacy and Large Count Utility in Data Publishing

While the introduction of differential privacy has been a major breakthrough in the study of privacy preserving data publication, some recent work has pointed out a number of cases where it is not possible to limit inference about…

Databases · Computer Science 2012-02-16 Ada Wai-Chee Fu , Jia Wang , Ke Wang , Raymond Chi-Wing Wong

Mind the Privacy Unit! User-Level Differential Privacy for Language Model Fine-Tuning

Large language models (LLMs) have emerged as powerful tools for tackling complex tasks across diverse domains, but they also raise privacy concerns when fine-tuned on sensitive data due to potential memorization. While differential privacy…

Computation and Language · Computer Science 2024-08-19 Lynn Chua , Badih Ghazi , Yangsibo Huang , Pritish Kamath , Ravi Kumar , Daogao Liu , Pasin Manurangsi , Amer Sinha , Chiyuan Zhang

Differentially Private Histogram, Predecessor, and Set Cardinality under Continual Observation

Differential privacy is the de-facto privacy standard in data analysis. The classic model of differential privacy considers the data to be static. The dynamic setting, called differential privacy under continual observation, captures many…

Data Structures and Algorithms · Computer Science 2023-06-21 Monika Henzinger , A. R. Sricharan , Teresa Anna Steiner

The Large Margin Mechanism for Differentially Private Maximization

A basic problem in the design of privacy-preserving algorithms is the private maximization problem: the goal is to pick an item from a universe that (approximately) maximizes a data-dependent function, all under the constraint of…

Machine Learning · Computer Science 2014-09-09 Kamalika Chaudhuri , Daniel Hsu , Shuang Song

Differential Privacy for Sets in Euclidean Space

As multi-agent systems become more numerous and more data-driven, novel forms of privacy are needed in order to protect data types that are not accounted for by existing privacy frameworks. In this paper, we present a new form of privacy…

Optimization and Control · Mathematics 2017-10-04 Matthew T. Hale

Data Analytics with Differential Privacy

Differential privacy is the state-of-the-art definition for privacy, guaranteeing that any analysis performed on a sensitive dataset leaks no information about the individuals whose data are contained therein. In this thesis, we develop…

Machine Learning · Computer Science 2023-11-29 Vassilis Digalakis

A Data- and Workload-Aware Algorithm for Range Queries Under Differential Privacy

We describe a new algorithm for answering a given set of range queries under $\epsilon$-differential privacy which often achieves substantially lower error than competing methods. Our algorithm satisfies differential privacy by adding noise…

Databases · Computer Science 2014-10-02 Chao Li , Michael Hay , Gerome Miklau , Yue Wang

Differential Private Noise Adding Mechanism and Its Application on Consensus

Differential privacy is a formal mathematical {stand-ard} for quantifying the degree of that individual privacy in a statistical database is preserved. To guarantee differential privacy, a typical method is adding random noise to the…

Information Theory · Computer Science 2017-03-08 Jianping He , Lin Cai

Differential Privacy: An Economic Method for Choosing Epsilon

Differential privacy is becoming a gold standard for privacy research; it offers a guaranteed bound on loss of privacy due to release of query results, even under worst-case assumptions. The theory of differential privacy is an active…

Databases · Computer Science 2018-03-16 Justin Hsu , Marco Gaboardi , Andreas Haeberlen , Sanjeev Khanna , Arjun Narayan , Benjamin C. Pierce , Aaron Roth

Learning Numeric Optimal Differentially Private Truncated Additive Mechanisms

Differentially private (DP) mechanisms face the challenge of providing accurate results while protecting their inputs: the privacy-utility trade-off. A simple but powerful technique for DP adds noise to sensitivity-bounded query outputs to…

Cryptography and Security · Computer Science 2021-07-28 David M. Sommer , Lukas Abfalterer , Sheila Zingg , Esfandiar Mohammadi

Selective Differential Privacy for Language Modeling

With the increasing applications of language models, it has become crucial to protect these models from leaking private information. Previous work has attempted to tackle this challenge by training RNN-based language models with…

Computation and Language · Computer Science 2022-07-19 Weiyan Shi , Aiqi Cui , Evan Li , Ruoxi Jia , Zhou Yu

Constrained Differential Privacy for Count Data

Concern about how to aggregate sensitive user data without compromising individual privacy is a major barrier to greater availability of data. The model of differential privacy has emerged as an accepted model to release sensitive…

Databases · Computer Science 2017-10-03 Graham Cormode , Tejas Kulkarni , Divesh Srivastava