Related papers: Simulation-based, Finite-sample Inference for Priv…

200 papers

This article presents a novel, general, and effective simulation-inspired approach, called repro samples method, to conduct statistical inference. The approach studies the performance of artificial samples, referred to as repro…

Methodology · Statistics 2022-06-15 Min-ge Xie, Peng Wang

Many modern statistical analysis and machine learning applications require training models on sensitive user data. Under a formal definition of privacy protection, differentially private algorithms inject calibrated noise into the…

Machine Learning · Statistics 2025-04-01 Yifei Xiong, Nianqiao Phyllis Ju, Sanguo Zhang

We consider the task of constructing confidence intervals with differential privacy. We propose two private variants of the non-parametric bootstrap, which privately compute the median of the results of multiple "little" bootstraps run on…

Machine Learning · Statistics 2024-06-05 Karan Chadha, John Duchi, Rohith Kuditipudi

Rapid advancements in data science require us to have fundamentally new frameworks to tackle prevalent but highly non-trivial "irregular" inference problems, to which the large sample central limit theorem does not apply. Typical examples…

Methodology · Statistics 2026-02-11 Minge Xie, Peng Wang

In this paper, we present a novel and effective inference approach to conduct both finite- and large-sample inference for high-dimensional linear regression models. This approach is developed under the so-called repro samples framework, in…

Methodology · Statistics 2025-12-01 Peng Wang, Min-Ge Xie, Linjun Zhang

We design a debiased parametric bootstrap framework for statistical inference from differentially private data. Existing usage of the parametric bootstrap on privatized data ignored or avoided handling possible biases introduced by the…

Methodology · Statistics 2026-04-10 Zhanyu Wang, Arin Chang, Jordan Awan

Differential privacy guarantees allow the results of a statistical analysis involving sensitive data to be released without compromising the privacy of any individual taking part. Achieving such guarantees generally requires the injection…

Machine Learning · Statistics 2023-10-31 Jack Jewson, Sahra Ghalebikesabi, Chris Holmes

The task of statistical inference, which includes the building of confidence intervals and tests for parameters and effects of interest to a researcher, is still an open area of investigation in a differentially private (DP) setting.…

Methodology · Statistics 2025-07-17 Ogonnaya Michael Romanus, Younes Boulaguiem, Roberto Molinari

The increased use of differential privacy (DP) has allowed the sharing of large amounts of data while reducing the risk of disclosure of sensitive information at the individual level. However, the noise introduced by DP methods makes…

Methodology · Statistics 2026-04-29 Jordan Awan, Xi Chen, Roberto Molinari

We study the problem of estimating finite sample confidence intervals of the mean of a normal population under the constraint of differential privacy. We consider both the known and unknown variance cases and construct differentially…

Cryptography and Security · Computer Science 2017-11-13 Vishesh Karwa, Salil Vadhan

This paper presents a novel method to make statistical inferences for both the model support and regression coefficients in a high-dimensional logistic regression model. Our method is based on the repro samples framework, in which we…

Methodology · Statistics 2024-03-18 Xiaotian Hou, Linjun Zhang, Peng Wang, Min-ge Xie

The process of data mining with differential privacy produces results that are affected by two types of noise: sampling noise due to data collection and privacy noise that is designed to prevent the reconstruction of sensitive information.…

Machine Learning · Computer Science 2018-04-12 Yue Wang, Daniel Kifer, Jaewoo Lee

Differential privacy comes equipped with multiple analytical tools for the design of private data analyses. One important tool is the so-called "privacy amplification by subsampling" principle, which ensures that a differentially private…

Machine Learning · Computer Science 2018-11-26 Borja Balle, Gilles Barthe, Marco Gaboardi

Survival analysis is widely used in applications involving sensitive individual-level data, yet differentially private hypothesis testing for right-censored data remains largely undeveloped. We initiate a finite-sample theory of private…

Statistics Theory · Mathematics 2026-05-19 Elly K. H. Hung, Yi Yu

Local differential privacy is a differential privacy paradigm in which individuals first apply a privacy mechanism to their data (often by adding noise) before transmitting the result to a curator. The noise for privacy results in…

Methodology · Statistics 2023-10-17 Yuki Ohnishi, Jordan Awan

Differential Privacy (DP) is a mathematical framework for releasing information with formal privacy guarantees. While numerous DP procedures have been developed for statistical analysis and machine learning, valid statistical inference…

Methodology · Statistics 2025-06-27 Ruyu Zhou, Fang Liu

Bootstrap is a common tool for quantifying uncertainty in data analysis. However, besides additional computational costs in the application of the bootstrap on massive data, a challenging problem in bootstrap based inference under…

Machine Learning · Statistics 2025-05-05 Holger Dette, Carina Graw

This paper aims to construct a valid and efficient confidence interval for the extrema of parameters under privacy protection. The usual statistical inference on the extrema of parameters often suffers from the selection bias issue, and the…

Methodology · Statistics 2023-03-07 Xiaowen Fu, Yang Xiang, Xinzhou Guo

Confidence intervals are a fundamental tool for quantifying the uncertainty of parameters of interest. With the increase of data privacy awareness, developing a private version of confidence intervals has gained growing attention from both…

Methodology · Statistics 2024-04-12 Shurong Lin, Mark Bun, Marco Gaboardi, Eric D. Kolaczyk, Adam Smith

In modern settings of data analysis, we may be running our algorithms on datasets that are sensitive in nature. However, classical machine learning and statistical algorithms were not designed with these risks in mind, and it has been…

Data Structures and Algorithms · Computer Science 2021-08-21 Huanyu Zhang