Related papers: Robust Kernel Hypothesis Testing under Data Corrup…

Kernel Robust Hypothesis Testing

The problem of robust hypothesis testing is studied, where under the null and the alternative hypotheses, the data-generating distributions are assumed to be in some uncertainty sets, and the goal is to design a test that performs well…

Signal Processing · Electrical Eng. & Systems 2023-08-08 Zhongchang Sun, Shaofeng Zou

Differentially Private Permutation Tests: Applications to Kernel Methods

Recent years have witnessed growing concerns about the privacy of sensitive data. In response to these concerns, differential privacy has emerged as a rigorous framework for privacy protection, gaining widespread recognition in both…

Statistics Theory · Mathematics 2024-01-09 Ilmun Kim, Antonin Schrab

A Unified View of Optimal Kernel Hypothesis Testing

This paper provides a unifying view of optimal kernel hypothesis testing across the MMD two-sample, HSIC independence, and KSD goodness-of-fit frameworks. Minimax optimal separation rates in the kernel and $L^2$ metrics are presented, with…

Machine Learning · Statistics 2025-12-30 Antonin Schrab

Minimax optimality of permutation tests

Permutation tests are widely used in statistics, providing a finite-sample guarantee on the type I error rate whenever the distribution of the samples under the null hypothesis is invariant to some rearrangement. Despite its increasing…

Statistics Theory · Mathematics 2022-05-26 Ilmun Kim, Sivaraman Balakrishnan, Larry Wasserman

Robust Risk Minimization for Statistical Learning

We consider a general statistical learning problem where an unknown fraction of the training data is corrupted. We develop a robust learning method that only requires specifying an upper bound on the corrupted data fraction. The method…

Machine Learning · Statistics 2020-02-10 Muhammad Osama, Dave Zachariah, Peter Stoica

A fast and effective kernel two-sample test for large-scale data

Kernel two-sample tests have been widely used, and the development of efficient methods for high-dimensional, large-scale data is receiving increasing attention in the big data era. However, existing methods, such as the maximum mean…

Methodology · Statistics 2025-10-03 Hoseung Song, Hao Chen

Robust Hypothesis Testing Using Wasserstein Uncertainty Sets

We develop a novel computationally efficient and general framework for robust hypothesis testing. The new framework features a new way to construct uncertainty sets under the null and the alternative distributions, which are sets centered…

Machine Learning · Statistics 2018-05-29 Rui Gao, Liyan Xie, Yao Xie, Huan Xu

Robust Multi-Hypothesis Testing with Moment Constrained Uncertainty Sets

The problem of robust binary hypothesis testing is studied. Under both hypotheses, the data-generating distributions are assumed to belong to uncertainty sets constructed through moments; in particular, the sets contain distributions whose…

Statistics Theory · Mathematics 2024-01-09 Akshayaa Magesh, Zhongchang Sun, Venugopal V. Veeravalli, Shaofeng Zou

Robust Estimation Under Heterogeneous Corruption Rates

We study the problem of robust estimation under heterogeneous corruption rates, where each sample may be independently corrupted with a known but non-identical probability. This setting arises naturally in distributed and federated…

Machine Learning · Computer Science 2025-10-02 Syomantak Chaudhuri, Jerry Li, Thomas A. Courtade

Robust Hypothesis Testing with Wasserstein Uncertainty Sets

We consider a data-driven robust hypothesis test where the optimal test will minimize the worst-case performance regarding distributions that are close to the empirical distributions with respect to the Wasserstein distance. This leads to a…

Statistics Theory · Mathematics 2021-06-01 Liyan Xie, Rui Gao, Yao Xie

A conformal test of linear models via permutation-augmented regressions

Permutation tests are widely recognized as robust alternatives to tests based on normal theory. Random permutation tests have been frequently employed to assess the significance of variables in linear models. Despite their widespread use,…

Methodology · Statistics 2023-12-29 Leying Guan

Composite Goodness-of-fit Tests with Kernels

Model misspecification can create significant challenges for the implementation of probabilistic models, and this has led to the development of a range of robust methods which directly account for this issue. However, whether these more…

Machine Learning · Statistics 2025-04-22 Oscar Key, Arthur Gretton, François-Xavier Briol, Tamara Fernandez

Online and Distributed Robust Regressions under Adversarial Data Corruption

In today's era of big data, robust least-squares regression becomes a more challenging problem when considering adversarial corruption along with the explosive growth of datasets. Traditional robust methods can handle the noise but suffer…

Data Structures and Algorithms · Computer Science 2017-10-04 Xuchao Zhang, Liang Zhao, Arnold P. Boedihardjo, Chang-Tien Lu

On Interaction Between Augmentations and Corruptions in Natural Corruption Robustness

Invariance to a broad array of image corruptions, such as warping, noise, or color shifts, is an important aspect of building robust models in computer vision. Recently, several new data augmentations have been proposed that significantly…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Eric Mintun, Alexander Kirillov, Saining Xie

On Robustness of the Normalized Subgradient Method with Randomly Corrupted Subgradients

Numerous modern optimization and machine learning algorithms rely on subgradient information being trustworthy, and hence they may fail to converge when such information is corrupted. In this paper, we consider the setting where subgradient…

Optimization and Control · Mathematics 2021-03-23 Berkay Turan, Cesar A. Uribe, Hoi-To Wai, Mahnoosh Alizadeh

A general framework for the analysis of kernel-based tests

Kernel-based tests provide a simple yet effective framework that uses the theory of reproducing kernel Hilbert spaces to design non-parametric testing procedures. In this paper we propose new theoretical tools that can be used to study the…

Statistics Theory · Mathematics 2022-09-02 Tamara Fernández, Nicolás Rivera

Finite-Sample Two-Group Composite Hypothesis Testing via Machine Learning

In the problem of composite hypothesis testing, identifying the potential uniformly most powerful (UMP) unbiased test is of great interest. Beyond typical hypothesis settings with exponential families, it is usually challenging to prove the…

Methodology · Statistics 2022-08-03 Tianyu Zhan, Jian Kang

On Robust Mean Estimation under Coordinate-level Corruption

We study the problem of robust mean estimation and introduce a novel Hamming distance-based measure of distribution shift for coordinate-level corruptions. We show that this measure yields adversary models that capture more realistic…

Machine Learning · Computer Science 2021-06-14 Zifan Liu, Jongho Park, Theodoros Rekatsinas, Christos Tzamos

A robust and p-hacking-proof significance test under variance uncertainty

P-hacking poses challenges to traditional hypothesis testing. In this paper, we propose a robust method for the one-sample significance test that can protect against p-hacking from sample manipulation. Precisely, assuming a sequential…

Statistics Theory · Mathematics 2025-02-18 Xifeng Li, Shuzhen Yang, Jianfeng Yao

Non-Convex Robust Hypothesis Testing using Sinkhorn Uncertainty Sets

We present a new framework to address the non-convex robust hypothesis testing problem, wherein the goal is to seek the optimal detector that minimizes the maximum of worst-case type-I and type-II risk functions. The distributional…

Machine Learning · Statistics 2024-03-25 Jie Wang, Rui Gao, Yao Xie