Related papers: Differentially Private Weighted Sampling

Differentially Private Variational Inference for Non-conjugate Models

Many machine learning applications are based on data collected from people, such as their tastes and behaviour as well as biological traits and genetic data. Regardless of how important the application might be, one has to make sure…

Machine Learning · Statistics 2017-04-11 Joonas Jälkö , Onur Dikmen , Antti Honkela

Mitigating Statistical Bias within Differentially Private Synthetic Data

Increasing interest in privacy-preserving machine learning has led to new and evolved approaches for generating private synthetic data from undisclosed real data. However, mechanisms of privacy preservation can significantly reduce the…

Machine Learning · Statistics 2022-05-23 Sahra Ghalebikesabi , Harrison Wilde , Jack Jewson , Arnaud Doucet , Sebastian Vollmer , Chris Holmes

Private sampling: a noiseless approach for generating differentially private synthetic data

In a world where artificial intelligence and data science become omnipresent, data sharing is increasingly locking horns with data-privacy concerns. Differential privacy has emerged as a rigorous framework for protecting individual privacy…

Cryptography and Security · Computer Science 2022-06-06 March Boedihardjo , Thomas Strohmer , Roman Vershynin

Personalized Privacy Amplification via Importance Sampling

For scalable machine learning on large data sets, subsampling a representative subset is a common approach for efficient model training. This is often achieved through importance sampling, whereby informative data points are sampled more…

Cryptography and Security · Computer Science 2025-03-31 Dominik Fay , Sebastian Mair , Jens Sjölund

Privacy Amplification by Subsampling: Tight Analyses via Couplings and Divergences

Differential privacy comes equipped with multiple analytical tools for the design of private data analyses. One important tool is the so-called "privacy amplification by subsampling" principle, which ensures that a differentially private…

Machine Learning · Computer Science 2018-11-26 Borja Balle , Gilles Barthe , Marco Gaboardi

Fair and Differentially Private Distributed Frequency Estimation

In order to remain competitive, Internet companies collect and analyse user data for the purpose of improving user experiences. Frequency estimation is a widely used statistical tool which could potentially conflict with the relevant…

Cryptography and Security · Computer Science 2021-04-14 Mengmeng Yang , Ivan Tjuawinata , Kwok-Yan Lam , Tianqing Zhu , Jun Zhao

Differentially Private Finite Population Estimation via Survey Weight Regularization

In general, it is challenging to release differentially private versions of survey-weighted statistics with low error for acceptable privacy loss. This is because weighted statistics from complex sample survey data can be more sensitive to…

Cryptography and Security · Computer Science 2024-11-08 Jeremy Seeman , Yajuan Si , Jerome P Reiter

$(\epsilon, \delta)$-Differentially Private Partial Least Squares Regression

As data-privacy requirements are becoming increasingly stringent and statistical models based on sensitive data are being deployed and used more routinely, protecting data-privacy becomes pivotal. Partial Least Squares (PLS) regression is…

Machine Learning · Statistics 2024-12-13 Ramin Nikzad-Langerodi , Mohit Kumar , Du Nguyen Duy , Mahtab Alghasi

Efficient privacy loss accounting for subsampling and random allocation

We consider the privacy amplification properties of a sampling scheme in which a user's data is used in $k$ steps chosen randomly and uniformly from a sequence (or set) of $t$ steps. This sampling scheme has been recently applied in the…

Machine Learning · Computer Science 2026-02-20 Vitaly Feldman , Moshe Shenfeld

Impact of Sampling on Locally Differentially Private Data Collection

With the recent bloom of data, there is a huge surge in threats against individuals' private information. Various techniques for optimizing privacy-preserving data analysis are at the focus of research in the recent years. In this paper, we…

Cryptography and Security · Computer Science 2022-11-11 Sayan Biswas , Graham Cormode , Carsten Maple

ML-PWS: Estimating the Mutual Information Between Experimental Time Series Using Neural Networks

The ability to quantify information transmission is crucial for the analysis and design of natural and engineered systems. The information transmission rate is the fundamental measure for systems with time-varying signals, yet computing it…

Biological Physics · Physics 2025-09-26 Manuel Reinhardt , Gašper Tkačik , Pieter Rein ten Wolde

Differential Privacy: An Economic Method for Choosing Epsilon

Differential privacy is becoming a gold standard for privacy research; it offers a guaranteed bound on loss of privacy due to release of query results, even under worst-case assumptions. The theory of differential privacy is an active…

Databases · Computer Science 2018-03-16 Justin Hsu , Marco Gaboardi , Andreas Haeberlen , Sanjeev Khanna , Arjun Narayan , Benjamin C. Pierce , Aaron Roth

DPSW-Sketch: A Differentially Private Sketch Framework for Frequency Estimation over Sliding Windows (Technical Report)

The sliding window model of computation captures scenarios in which data are continually arriving in the form of a stream, and only the most recent $w$ items are used for analysis. In this setting, an algorithm needs to accurately track…

Cryptography and Security · Computer Science 2024-06-13 Yiping Wang , Yanhao Wang , Cen Chen

Differentially Private Statistical Inference through $\beta$-Divergence One Posterior Sampling

Differential privacy guarantees allow the results of a statistical analysis involving sensitive data to be released without compromising the privacy of any individual taking part. Achieving such guarantees generally requires the injection…

Machine Learning · Statistics 2023-10-31 Jack Jewson , Sahra Ghalebikesabi , Chris Holmes

Privacy-Preserving Data Sharing for Genome-Wide Association Studies

Traditional statistical methods for confidentiality protection of statistical databases do not scale well to deal with GWAS (genome-wide association studies) databases especially in terms of guarantees regarding protection from linkage to…

Methodology · Statistics 2012-05-04 Caroline Uhler , Aleksandra B. Slavkovic , Stephen E. Fienberg

Differentially Private Covariate Balancing Causal Inference

Differential privacy is the leading mathematical framework for privacy protection, providing a probabilistic guarantee that safeguards individuals' private information when publishing statistics from a dataset. This guarantee is achieved by…

Methodology · Statistics 2025-08-19 Yuki Ohnishi , Jordan Awan

Improving Statistical Privacy by Subsampling

Differential privacy (DP) considers a scenario, where an adversary has almost complete information about the entries of a database This worst-case assumption is likely to overestimate the privacy thread for an individual in real life.…

Cryptography and Security · Computer Science 2025-04-16 Dennis Breutigam , Rüdiger Reischuk

Private Set Generation with Discriminative Information

Differentially private data generation techniques have become a promising solution to the data privacy challenge -- it enables sharing of data while complying with rigorous privacy guarantees, which is essential for scientific progress in…

Cryptography and Security · Computer Science 2022-11-09 Dingfan Chen , Raouf Kerkouche , Mario Fritz

Weighted Sampled Split Learning (WSSL): Balancing Privacy, Robustness, and Fairness in Distributed Learning Environments

This study presents Weighted Sampled Split Learning (WSSL), an innovative framework tailored to bolster privacy, robustness, and fairness in distributed machine learning systems. Unlike traditional approaches, WSSL disperses the learning…

Machine Learning · Computer Science 2023-10-31 Manish Osti , Aashray Thakuri , Basheer Qolomany , Aos Mulahuwaish

Differentially Private Confidence Intervals for Proportions under Stratified Random Sampling

Confidence intervals are a fundamental tool for quantifying the uncertainty of parameters of interest. With the increase of data privacy awareness, developing a private version of confidence intervals has gained growing attention from both…

Methodology · Statistics 2024-04-12 Shurong Lin , Mark Bun , Marco Gaboardi , Eric D. Kolaczyk , Adam Smith