English
Related papers

Related papers: Private High-Dimensional Hypothesis Testing

200 papers

In this work we present novel differentially private identity (goodness-of-fit) testers for natural and widely studied classes of multivariate product distributions: Gaussians in $\mathbb{R}^d$ with known covariance and product…

Data Structures and Algorithms · Computer Science 2022-03-07 Clément L. Canonne , Gautam Kamath , Audra McMillan , Jonathan Ullman , Lydia Zakynthinou

We present a fast, differentially private algorithm for high-dimensional covariance-aware mean estimation with nearly optimal sample complexity. Only exponential-time estimators were previously known to achieve this guarantee. Given $n$…

Machine Learning · Computer Science 2025-11-26 Gavin Brown , Samuel B. Hopkins , Adam Smith

In this work, we give efficient algorithms for privately estimating a Gaussian distribution in both pure and approximate differential privacy (DP) models with optimal dependence on the dimension in the sample complexity. In the pure DP…

Data Structures and Algorithms · Computer Science 2023-06-02 Daniel Alabi , Pravesh K. Kothari , Pranay Tankala , Prayaag Venkat , Fred Zhang

We present differentially private algorithms for high-dimensional mean estimation. Previous private estimators on distributions over $\mathbb{R}^d$ suffer from a curse of dimensionality, as they require $\Omega(d^{1/2})$ samples to achieve…

Machine Learning · Computer Science 2024-11-04 Yuval Dagan , Michael I. Jordan , Xuelin Yang , Lydia Zakynthinou , Nikita Zhivotovskiy

Given a dataset of $n$ i.i.d. samples from an unknown distribution $P$, we consider the problem of generating a sample from a distribution that is close to $P$ in total variation distance, under the constraint of differential privacy (DP).…

Data Structures and Algorithms · Computer Science 2023-06-23 Badih Ghazi , Xiao Hu , Ravi Kumar , Pasin Manurangsi

We present two sample-efficient differentially private mean estimators for $d$-dimensional (sub)Gaussian distributions with unknown covariance. Informally, given $n \gtrsim d/\alpha^2$ samples from such a distribution with mean $\mu$ and…

Machine Learning · Computer Science 2024-03-27 Gavin Brown , Marco Gaboardi , Adam Smith , Jonathan Ullman , Lydia Zakynthinou

The increasing prevalence of high-dimensional data across various applications has raised significant privacy concerns in statistical inference. In this paper, we propose a differentially private integrated statistic for testing…

Methodology · Statistics 2025-06-04 Shiwei Sang , Yicheng Zeng , Xuehu Zhu , Shurong Zheng

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour , Piotr Indyk , Ronitt Rubinfeld , Sandeep Silwal

We present novel, computationally efficient, and differentially private algorithms for two fundamental high-dimensional learning problems: learning a multivariate Gaussian and learning a product distribution over the Boolean hypercube in…

Data Structures and Algorithms · Computer Science 2019-05-31 Gautam Kamath , Jerry Li , Vikrant Singhal , Jonathan Ullman

We provide the first $\widetilde{\mathcal{O}}\left(d\right)$-sample algorithm for sampling from unbounded Gaussian distributions under the constraint of $\left(\varepsilon, \delta\right)$-differential privacy. This is a quadratic…

Data Structures and Algorithms · Computer Science 2025-03-04 Valentio Iverson , Gautam Kamath , Argyris Mouzakis

We study the fundamental problems of identity testing (goodness of fit), and closeness testing (two sample test) of distributions over $k$ elements, under differential privacy. While the problems have a long history in statistics, finite…

Machine Learning · Computer Science 2017-11-01 Jayadev Acharya , Ziteng Sun , Huanyu Zhang

We investigate the problems of identity and closeness testing over a discrete population from random samples. Our goal is to develop efficient testers while guaranteeing Differential Privacy to the individuals of the population. We describe…

Machine Learning · Computer Science 2017-07-19 Maryam Aliakbarpour , Ilias Diakonikolas , Ronitt Rubinfeld

We provide a differentially private algorithm for hypothesis selection. Given samples from an unknown probability distribution $P$ and a set of $m$ probability distributions $\mathcal{H}$, the goal is to output, in a…

Data Structures and Algorithms · Computer Science 2021-01-05 Mark Bun , Gautam Kamath , Thomas Steinke , Zhiwei Steven Wu

We study the problem of testing discrete distributions with a focus on the high probability regime. Specifically, given samples from one or more discrete distributions, a property $\mathcal{P}$, and parameters $0< \epsilon, \delta <1$, we…

Data Structures and Algorithms · Computer Science 2020-09-15 Ilias Diakonikolas , Themis Gouleakis , Daniel M. Kane , John Peebles , Eric Price

We develop differentially private hypothesis testing methods for the small sample regime. Given a sample $\cal D$ from a categorical distribution $p$ over some domain $\Sigma$, an explicitly described distribution $q$ over $\Sigma$, some…

Data Structures and Algorithms · Computer Science 2017-06-08 Bryan Cai , Constantinos Daskalakis , Gautam Kamath

We present new algorithms for estimating and testing \emph{collision probability}, a fundamental measure of the spread of a discrete distribution that is widely used in many scientific fields. We describe an algorithm that satisfies…

Machine Learning · Statistics 2025-04-21 Robert Busa-Fekete , Umar Syed

We provide optimal lower bounds for two well-known parameter estimation (also known as statistical estimation) tasks in high dimensions with approximate differential privacy. First, we prove that for any $\alpha \le O(1)$, estimating the…

Statistics Theory · Mathematics 2024-01-05 Shyam Narayanan

We extend the framework of augmented distribution testing (Aliakbarpour, Indyk, Rubinfeld, and Silwal, NeurIPS 2024) to the differentially private setting. This captures scenarios where a data analyst must perform hypothesis testing tasks…

Machine Learning · Computer Science 2025-03-20 Maryam Aliakbarpour , Arnav Burudgunte , Clément Cannone , Ronitt Rubinfeld

We study the following fundamental hypothesis testing problem, which we term Gaussian mean testing. Given i.i.d. samples from a distribution $p$ on $\mathbb{R}^d$, the task is to distinguish, with high probability, between the following…

Statistics Theory · Mathematics 2022-10-26 Ilias Diakonikolas , Daniel M. Kane , Ankit Pensia

We consider the task of privately obtaining prediction error guarantees in ordinary least-squares regression problems with Gaussian covariates (with unknown covariance structure). We provide the first sample-optimal polynomial time…

Data Structures and Algorithms · Computer Science 2025-04-01 Prashanti Anderson , Ainesh Bakshi , Mahbod Majid , Stefan Tiegel
‹ Prev 1 2 3 10 Next ›