English
Related papers

Related papers: Private Hypothesis Selection

200 papers

We initiate the study of hypothesis selection under local differential privacy. Given samples from an unknown probability distribution $p$ and a set of $k$ probability distributions $\mathcal{Q}$, we aim to output, under the constraints of…

Data Structures and Algorithms · Computer Science 2020-06-23 Sivakanth Gopi , Gautam Kamath , Janardhan Kulkarni , Aleksandar Nikolov , Zhiwei Steven Wu , Huanyu Zhang

Estimating the density of a distribution from its samples is a fundamental problem in statistics. Hypothesis selection addresses the setting where, in addition to a sample set, we are given $n$ candidate distributions -- referred to as…

Data Structures and Algorithms · Computer Science 2025-10-23 Maryam Aliakbarpour , Zhan Shi , Ria Stevens , Vincent X. Wang

We study the problem of hypothesis selection under the constraint of local differential privacy. Given a class $\mathcal{F}$ of $k$ distributions and a set of i.i.d. samples from an unknown distribution $h$, the goal of hypothesis selection…

Machine Learning · Statistics 2026-03-05 Alireza F. Pour , Hassan Ashtiani , Shahab Asoodeh

We initiate an investigation of private sampling from distributions. Given a dataset with $n$ independent observations from an unknown distribution $P$, a sampling algorithm must output a single observation from a distribution that is close…

Machine Learning · Computer Science 2022-11-16 Sofya Raskhodnikova , Satchit Sivakumar , Adam Smith , Marika Swanberg

Many algorithms have been developed to estimate probability distributions subject to differential privacy (DP): such an algorithm takes as input independent samples from a distribution and estimates the density function in a way that is…

Cryptography and Security · Computer Science 2024-12-17 Albert Cheu , Debanuj Nayak

Hypothesis testing plays a central role in statistical inference, and is used in many settings where privacy concerns are paramount. This work answers a basic question about privately testing simple hypotheses: given two distributions $P$…

Data Structures and Algorithms · Computer Science 2019-04-04 Clément L. Canonne , Gautam Kamath , Audra McMillan , Adam Smith , Jonathan Ullman

We revisit the problem of differentially private release of classification queries. In this problem, the goal is to design an algorithm that can accurately answer a sequence of classification queries based on a private training set while…

Machine Learning · Computer Science 2019-12-05 Anupama Nandi , Raef Bassily

Given a dataset of $n$ i.i.d. samples from an unknown distribution $P$, we consider the problem of generating a sample from a distribution that is close to $P$ in total variation distance, under the constraint of differential privacy (DP).…

Data Structures and Algorithms · Computer Science 2023-06-23 Badih Ghazi , Xiao Hu , Ravi Kumar , Pasin Manurangsi

In modern settings of data analysis, we may be running our algorithms on datasets that are sensitive in nature. However, classical machine learning and statistical algorithms were not designed with these risks in mind, and it has been…

Data Structures and Algorithms · Computer Science 2021-08-21 Huanyu Zhang

We construct a universally Bayes consistent learning rule that satisfies differential privacy (DP). We first handle the setting of binary classification and then extend our rule to the more general setting of density estimation (with…

Machine Learning · Computer Science 2022-12-09 Olivier Bousquet , Haim Kaplan , Aryeh Kontorovich , Yishay Mansour , Shay Moran , Menachem Sadigurschi , Uri Stemmer

We consider the problem of hypothesis testing for discrete distributions. In the standard model, where we have sample access to an underlying distribution $p$, extensive research has established optimal bounds for uniformity testing,…

Machine Learning · Computer Science 2024-12-03 Maryam Aliakbarpour , Piotr Indyk , Ronitt Rubinfeld , Sandeep Silwal

We consider the problem of model selection in a high-dimensional sparse linear regression model under privacy constraints. We propose a differentially private (DP) best subset selection method with strong statistical utility properties by…

Machine Learning · Statistics 2024-10-30 Saptarshi Roy , Zehua Wang , Ambuj Tewari

We provide improved differentially private algorithms for identity testing of high-dimensional distributions. Specifically, for $d$-dimensional Gaussian distributions with known covariance $\Sigma$, we can test whether the distribution…

Data Structures and Algorithms · Computer Science 2022-07-26 Shyam Narayanan

Differential Privacy (DP) is a probabilistic framework that protects privacy while preserving data utility. To protect the privacy of the individuals in the dataset, DP requires adding a precise amount of noise to a statistic of interest;…

Computation · Statistics 2025-05-05 Yu-Wei Chen , Pranav Sanghi , Jordan Awan

We provide sample complexity upper bounds for agnostically learning multivariate Gaussians under the constraint of approximate differential privacy. These are the first finite sample upper bounds for general Gaussians which do not impose…

Machine Learning · Statistics 2020-10-21 Ishaq Aden-Ali , Hassan Ashtiani , Gautam Kamath

Differentially Private algorithms often need to select the best amongst many candidate options. Classical works on this selection problem require that the candidates' goodness, measured as a real-valued score function, does not change by…

Data Structures and Algorithms · Computer Science 2018-11-21 Jingcheng Liu , Kunal Talwar

We propose an algorithm with improved query-complexity for the problem of hypothesis selection under local differential privacy constraints. Given a set of $k$ probability distributions $Q$, we describe an algorithm that satisfies local…

Data Structures and Algorithms · Computer Science 2026-01-16 Gautam Kamath , Alireza F. Pour , Matthew Regehr , David P. Woodruff

A private learner is trained on a sample of labeled points and generates a hypothesis that can be used for predicting the labels of newly sampled points while protecting the privacy of the training set [Kasiviswannathan et al., FOCS 2008].…

Machine Learning · Computer Science 2023-05-17 Moni Naor , Kobbi Nissim , Uri Stemmer , Chao Yan

In this work, we give efficient algorithms for privately estimating a Gaussian distribution in both pure and approximate differential privacy (DP) models with optimal dependence on the dimension in the sample complexity. In the pure DP…

Data Structures and Algorithms · Computer Science 2023-06-02 Daniel Alabi , Pravesh K. Kothari , Pranay Tankala , Prayaag Venkat , Fred Zhang

Hypothesis Selection is a fundamental distribution learning problem where given a comparator-class $Q=\{q_1,\ldots, q_n\}$ of distributions, and a sampling access to an unknown target distribution $p$, the goal is to output a distribution…

Machine Learning · Computer Science 2021-08-19 Olivier Bousquet , Mark Braverman , Klim Efremenko , Gillat Kol , Shay Moran
‹ Prev 1 2 3 10 Next ›