Related papers: The Sample Complexity of Simple Binary Hypothesis …

The Sample Complexity of Distributed Simple Binary Hypothesis Testing under Information Constraints

This paper resolves two open problems from a recent paper, arXiv:2403.16981, concerning the sample complexity of distributed simple binary hypothesis testing under information constraints. The first open problem asks whether interaction…

Information Theory · Computer Science 2025-06-18 Hadi Kazemi, Ankit Pensia, Varun Jog

The Structure of Optimal Private Tests for Simple Hypotheses

Hypothesis testing plays a central role in statistical inference, and is used in many settings where privacy concerns are paramount. This work answers a basic question about privately testing simple hypotheses: given two distributions $P$…

Data Structures and Algorithms · Computer Science 2019-04-04 Clément L. Canonne, Gautam Kamath, Audra McMillan, Adam Smith, Jonathan Ullman

Communication-constrained hypothesis testing: Optimality, robustness, and reverse data processing inequalities

We study hypothesis testing under communication constraints, where each sample is quantized before being revealed to a statistician. Without communication constraints, it is well known that the sample complexity of simple binary hypothesis…

Statistics Theory · Mathematics 2023-12-19 Ankit Pensia, Varun Jog, Po-Ling Loh

Simple Binary Hypothesis Testing under Local Differential Privacy and Communication Constraints

We study simple binary hypothesis testing under both local differential privacy (LDP) and communication constraints. We qualify our results as either minimax optimal or instance optimal: the former hold for the set of distribution pairs…

Statistics Theory · Mathematics 2023-12-19 Ankit Pensia, Amir R. Asadi, Varun Jog, Po-Ling Loh

The Price of Tolerance in Distribution Testing

We revisit the problem of tolerant distribution testing. That is, given samples from an unknown distribution $p$ over $\{1, \dots, n\}$, is it $\varepsilon_1$-close to or $\varepsilon_2$-far from a reference distribution $q$ (in total…

Data Structures and Algorithms · Computer Science 2021-11-10 Clément L. Canonne, Ayush Jain, Gautam Kamath, Jerry Li

Sample Complexity of Composite Quantum Hypothesis Testing

This paper investigates symmetric composite binary quantum hypothesis testing (QHT), where the goal is to determine which of two uncertainty sets contains an unknown quantum state. While asymptotic error exponents for this problem are…

Quantum Physics · Physics 2026-04-13 Jacob Paul Simpson, Efstratios Palias, Sharu Theresa Jose

Testing Mixtures of Discrete Distributions

There has been significant study on the sample complexity of testing properties of distributions over large domains. For many properties, it is known that the sample complexity can be substantially smaller than the domain size. For example,…

Statistics Theory · Mathematics 2019-07-09 Maryam Aliakbarpour, Ravi Kumar, Ronitt Rubinfeld

An invitation to the sample complexity of quantum hypothesis testing

Quantum hypothesis testing (QHT) has been traditionally studied from the information-theoretic perspective, wherein one is interested in the optimal decay rate of error probabilities as a function of the number of samples of an unknown…

Quantum Physics · Physics 2025-06-17 Hao-Chung Cheng, Nilanjana Datta, Nana Liu, Theshani Nuradha, Robert Salzmann, Mark M. Wilde

Optimal Algorithms for Testing Closeness of Discrete Distributions

We study the question of closeness testing for two discrete distributions. More precisely, given samples from two distributions $p$ and $q$ over an $n$-element set, we wish to distinguish whether $p=q$ versus $p$ is at least $\varepsilon$-far from…

Data Structures and Algorithms · Computer Science 2013-08-20 Siu-On Chan, Ilias Diakonikolas, Gregory Valiant, Paul Valiant

On the Sample Complexity of Robust Binary Hypothesis Testing

We study the sample complexity of robust binary hypothesis testing under three standard contamination models: $\varepsilon$-additive (Huber), $\varepsilon$-subtractive, and $\varepsilon$-total variation (TV), denoted by…

Statistics Theory · Mathematics 2026-05-26 Shankar Vallinayagam, Ankit Pensia, Varun Jog

Bias Reduction for Sum Estimation

In classical statistics and distribution testing, it is often assumed that elements can be sampled from some distribution $P$, and that when an element $x$ is sampled, the probability $P(x)$ of sampling $x$ is also known. Recent work in…

Data Structures and Algorithms · Computer Science 2022-08-03 Talya Eden, Jakob Bæk Tejs Houen, Shyam Narayanan, Will Rosenbaum, Jakub Tětek

Kernel Two-Sample Hypothesis Testing Using Kernel Set Classification

The two-sample hypothesis testing problem is studied for the challenging scenario of high dimensional data sets with small sample sizes. We show that the two-sample hypothesis testing problem can be posed as a one-class set classification…

Machine Learning · Statistics 2017-11-15 Hamed Masnadi-Shirazi

Entropy Equivalence Testing

We introduce the problem of entropy equivalence testing for probability distributions, a relaxation of the well-studied closeness testing problem, where the distribution testing algorithm is now only required to distinguish, given…

Data Structures and Algorithms · Computer Science 2026-05-25 Clément L. Canonne, Yash Pote, Jonathan Scarlett, Joy Qiping Yang

Hypothesis Testing over Observable Regimes in Singular Models

Hypothesis testing in singular statistical models is often regarded as inherently problematic due to non-identifiability and degeneracy of the Fisher information. We show that the fundamental obstruction to testing in such models is not…

Statistics Theory · Mathematics 2026-03-02 Sean Plummer

Likelihood-free hypothesis testing

Consider the problem of binary hypothesis testing. Given $Z$ coming from either $\mathbb P^{\otimes m}$ or $\mathbb Q^{\otimes m}$, to decide between the two with small probability of error it is sufficient, and in many cases necessary, to…

Statistics Theory · Mathematics 2024-03-11 Patrik Róbert Gerber, Yury Polyanskiy

Sharp Bounds for Generalized Uniformity Testing

We study the problem of generalized uniformity testing [BC17] of a discrete probability distribution: Given samples from a probability distribution $p$ over an unknown discrete domain $\mathbf{\Omega}$, we want to distinguish,…

Data Structures and Algorithms · Computer Science 2017-09-08 Ilias Diakonikolas, Daniel M. Kane, Alistair Stewart

Priv'IT: Private and Sample Efficient Identity Testing

We develop differentially private hypothesis testing methods for the small sample regime. Given a sample $\mathcal{D}$ from a categorical distribution $p$ over some domain $\Sigma$, an explicitly described distribution $q$ over $\Sigma$, some…

Data Structures and Algorithms · Computer Science 2017-06-08 Bryan Cai, Constantinos Daskalakis, Gautam Kamath

The Sample Complexity of Robust Covariance Testing

We study the problem of testing the covariance matrix of a high-dimensional Gaussian in a robust setting, where the input distribution has been corrupted in Huber's contamination model. Specifically, we are given i.i.d. samples from a…

Machine Learning · Computer Science 2021-01-01 Ilias Diakonikolas, Daniel M. Kane

Which Distribution Distances are Sublinearly Testable?

Given samples from an unknown distribution $p$ and a description of a distribution $q$, are $p$ and $q$ close or far? This question of "identity testing" has received significant attention in the case of testing whether $p$ and $q$ are…

Data Structures and Algorithms · Computer Science 2017-11-01 Constantinos Daskalakis, Gautam Kamath, John Wright

On Robust Hypothesis Testing with respect to the Hellinger Distance

We study a variant of the simple hypothesis testing problem where observed samples do not necessarily come from either of the specified distributions, but rather from a close variant of them. In this setting, we require a test that is…

Statistics Theory · Mathematics 2026-04-21 Eeshan Modak, Sivaraman Balakrishnan, Ananda Theertha Suresh