Related papers: Conditional Testing based on Localized Conformal p…

Testing for Outliers with Conformal p-values

This paper studies the construction of p-values for nonparametric outlier detection, taking a multiple-testing perspective. The goal is to test whether new independent samples belong to the same distribution as a reference data set or are…

Methodology · Statistics 2024-03-12 Stephen Bates, Emmanuel Candès, Lihua Lei, Yaniv Romano, Matteo Sesia

Integrative conformal p-values for powerful out-of-distribution testing with labeled outliers

This paper develops novel conformal methods to test whether a new observation was sampled from the same distribution as a reference set. Blending inductive and transductive conformal inference in an innovative way, the described methods can…

Methodology · Statistics 2022-08-26 Ziyi Liang, Matteo Sesia, Wenguang Sun

Prediction-Powered Conditional Inference

We study prediction-powered conditional inference in the setting where labeled data are scarce, unlabeled covariates are abundant, and a black-box machine-learning predictor is available. The goal is to perform statistical inference on…

Machine Learning · Statistics 2026-03-09 Yang Sui, Jin Zhou, Hua Zhou, Xiaowu Dai

Selection by Prediction with Conformal p-values

Decision making or scientific discovery pipelines such as job hiring and drug discovery often involve multiple stages: before any resource-intensive step, there is often an initial screening that uses predictions from a machine learning…

Methodology · Statistics 2023-05-30 Ying Jin, Emmanuel J. Candès

A Two-Sample Conditional Distribution Test Using Conformal Prediction and Weighted Rank Sum

We consider the problem of testing the equality of conditional distributions of a response variable given a vector of covariates between two populations. Such a hypothesis testing problem can be motivated from various machine learning and…

Methodology · Statistics 2023-02-24 Xiaoyu Hu, Jing Lei

Model-free selective inference under covariate shift via weighted conformal p-values

This paper introduces novel weighted conformal p-values and methods for model-free selective inference. The problem is as follows: given test units with covariates $X$ and missing responses $Y$, how do we select units for which the…

Methodology · Statistics 2023-09-27 Ying Jin, Emmanuel J. Candès

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or…

Machine Learning · Computer Science 2026-03-10 Mohamed Salem

Deploying the Conditional Randomization Test in High Multiplicity Problems

This paper introduces the sequential CRT, which is a variable selection procedure that combines the conditional randomization test (CRT) and Selective SeqStep+. Valid p-values are constructed via the flexible CRT, which are then ordered and…

Methodology · Statistics 2022-04-08 Shuangning Li, Emmanuel J. Candès

Conformal prediction with localization

We propose a new method called localized conformal prediction, where we can perform conformal inference using only a local region around a new test sample to construct its confidence interval. Localized conformal inference is a natural…

Statistics Theory · Mathematics 2020-07-08 Leying Guan

Randomized p-values for multiple testing and their application in replicability analysis

We are concerned with testing replicability hypotheses for many endpoints simultaneously. This constitutes a multiple test problem with composite null hypotheses. Traditional $p$-values, which are computed under least favourable parameter…

Methodology · Statistics 2020-02-26 Anh-Tuan Hoang, Thorsten Dickhaus

Kernel conditional tests from learning-theoretic bounds

We propose a framework for hypothesis testing on conditional probability distributions, which we then use to construct statistical tests of functionals of conditional distributions. These tests identify the inputs where the functionals…

Machine Learning · Computer Science 2025-11-03 Pierre-François Massiani, Christian Fiedler, Lukas Haverbeck, Friedrich Solowjow, Sebastian Trimpe

Sequential Specification Tests to Choose a Model: A Change-Point Approach

Researchers faced with a sequence of candidate model specifications must often choose the best specification that does not violate a testable identification assumption. One option in this scenario is sequential specification tests:…

Methodology · Statistics 2023-07-25 Adam C. Sales

A Variational Estimator for $L_p$ Calibration Errors

Calibration, the problem of ensuring that predicted probabilities align with observed class frequencies, is a basic desideratum for reliable prediction with machine learning systems. Calibration error is…

Machine Learning · Statistics 2026-03-02 Eugène Berta, Sacha Braun, David Holzmüller, Francis Bach, Michael I. Jordan

Model-agnostic Selective Labeling with Provable Statistical Guarantees

Obtaining high-quality labels for large datasets is expensive, requiring massive annotations from human experts. While AI models offer a cost-effective alternative by predicting labels, their label quality is compromised by the unavoidable…

Machine Learning · Computer Science 2026-02-17 Huipeng Huang, Wenbo Liao, Huajun Xi, Hao Zeng, Mengchen Zhao, Hongxin Wei

Localized Conformal Prediction: A Generalized Inference Framework for Conformal Prediction

We propose a new inference framework called localized conformal prediction. It generalizes the framework of conformal prediction by offering a single-test-sample adaptive construction that emphasizes a local region around this test sample,…

Statistics Theory · Mathematics 2022-03-02 Leying Guan

Conditional independence testing: a predictive perspective

Conditional independence testing is a key problem required by many machine learning and statistics tools. In particular, it is one way of evaluating the usefulness of some features on a supervised prediction problem. We propose a novel…

Machine Learning · Statistics 2019-08-02 Marco Henrique de Almeida Inácio, Rafael Izbicki, Rafael Bassi Stern

FDR Control via Neural Networks under Covariate-Dependent Symmetric Nulls

In modern multiple hypothesis testing, the availability of covariate information alongside the primary test statistics has motivated the development of more powerful and adaptive inference methods. However, most existing approaches rely on…

Methodology · Statistics 2025-11-20 Taehyoung Kim, Seohwa Hwang, Junyong Park

Conformal Prediction with Learned Features

In this paper, we focus on the problem of conformal prediction with conditional guarantees. Prior work has shown that it is impossible to construct nontrivial prediction sets with full conditional coverage guarantees. A wealth of research…

Machine Learning · Computer Science 2024-04-29 Shayan Kiyani, George Pappas, Hamed Hassani

Individualized Conformal

The problem of individualized prediction can be addressed using variants of conformal prediction, obtaining the intervals to which the actual values of the variables of interest belong. Here we present a method based on detecting the…

Methodology · Statistics 2023-04-12 Fernando Delbianco, Fernando Tohmé

Multiple testing of composite null hypotheses for discrete data using randomized $p$-values

$P$-values that are derived from continuously distributed test statistics are typically uniformly distributed on $(0,1)$ under least favorable parameter configurations (LFCs) in the null hypothesis. Conservativeness of a $p$-value $P$…

Methodology · Statistics 2023-03-13 Daniel Ochieng, Anh-Tuan Hoang, Thorsten Dickhaus