Related papers: Deploying the Conditional Randomization Test in Hi…

Fast and Powerful Conditional Randomization Testing via Distillation

We consider the problem of conditional independence testing: given a response Y and covariates (X,Z), we test the null hypothesis that Y is independent of X given Z. The conditional randomization test (CRT) was recently proposed as a way to…

Methodology · Statistics 2021-06-07 Molei Liu, Eugene Katsevich, Lucas Janson, Aaditya Ramdas

Conditional Randomization Rank Test

We propose a new method named the Conditional Randomization Rank Test (CRRT) for testing conditional independence of a response variable Y and a covariate variable X, conditional on the rest of the covariates Z. The new method generalizes…

Methodology · Statistics 2021-12-02 Yanjie Zhong, Todd Kuffner, Soumendra Lahiri

A Conditional Randomization Test for Sparse Logistic Regression in High-Dimension

Identifying the relevant variables for a classification model with correct confidence levels is a central but difficult task in high-dimension. Despite the core role of sparse logistic regression in statistics and machine learning, it still…

Machine Learning · Statistics 2022-05-31 Binh T. Nguyen, Bertrand Thirion, Sylvain Arlot

The Control of the False Discovery Rate in Fixed Sequence Multiple Testing

Controlling the false discovery rate (FDR) is a powerful approach to multiple testing. In many applications, the tested hypotheses have an inherent hierarchical structure. In this paper, we focus on the fixed sequence structure where the…

Methodology · Statistics 2016-11-11 Gavin Lynch, Wenge Guo, Sanat K. Sarkar, Helmut Finner

Nearest-Neighbor Sampling Based Conditional Independence Testing

The conditional randomization test (CRT) was recently proposed to test whether two random variables X and Y are conditionally independent given random variables Z. The CRT assumes that the conditional distribution of X given Z is known…

Machine Learning · Computer Science 2023-04-11 Shuai Li, Ziqi Chen, Hongtu Zhu, Christina Dan Wang, Wang Wen

High-Dimensional False Discovery Rate Control for Dependent Variables

Algorithms that ensure reproducible findings from large-scale, high-dimensional data are pivotal in numerous signal processing applications. In recent years, multivariate false discovery rate (FDR) controlling methods have emerged,…

Methodology · Statistics 2024-01-31 Jasin Machkour, Michael Muma, Daniel P. Palomar

Sequential FDR and pFDR control under arbitrary dependence, with application to pharmacovigilance database monitoring

We propose sequential multiple testing procedures which control the false discovery rate (FDR) or the positive false discovery rate (pFDR) under arbitrary dependence between the data streams. This is accomplished by "optimizing" an upper…

Methodology · Statistics 2024-11-27 Michael Hankin, Jay Bartroff

Setwise Hierarchical Variable Selection and the Generalized Linear Step-Up Procedure for False Discovery Rate Control

Controlling the false discovery rate (FDR) in variable selection becomes challenging when predictors are correlated, as existing methods often exclude all members of correlated groups and consequently perform poorly for prediction. We…

Methodology · Statistics 2026-03-03 Sarah Organ, Toby Kenney, Hong Gu

A Power Analysis of the Conditional Randomization Test and Knockoffs

In many scientific problems, researchers try to relate a response variable $Y$ to a set of potential explanatory variables $X = (X_1,\dots,X_p)$, and start by trying to identify variables that contribute to this relationship. In statistical…

Statistics Theory · Mathematics 2020-10-07 Wenshuo Wang, Lucas Janson

FDR Control for Complex-Valued Data with Application in Single Snapshot Multi-Source Detection and DOA Estimation

False discovery rate (FDR) control is a popular approach for maintaining the integrity of statistical analyses, especially in high-dimensional data settings, where multiple comparisons increase the risk of false positives. FDR control has…

Signal Processing · Electrical Eng. & Systems 2026-03-03 Fabian Scheidt, Jasin Machkour, Michael Muma

Testing Multivariate Conditional Independence Using Exchangeable Sampling and Sufficient Statistics

We consider testing multivariate conditional independence between a response Y and a covariate vector X given additional variables Z. We introduce the Multivariate Sufficient Statistic Conditional Randomization Test (MS-CRT), which…

Methodology · Statistics 2025-04-10 Xiaotong Lin, Jie Xie, Fangqiao Tian, Dongming Huang

Valid Feature-Level Inference for Tabular Foundation Models via the Conditional Randomization Test

Modern machine learning models are highly expressive but notoriously difficult to analyze statistically. In particular, while black-box predictors can achieve strong empirical performance, they rarely provide valid hypothesis tests or…

Machine Learning · Computer Science 2026-03-10 Mohamed Salem

Controlling the False Discovery Rate in Subspace Selection

Controlling the false discovery rate (FDR) is a popular approach to multiple testing, variable selection, and related problems of simultaneous inference. In many contemporary applications, models are not specified by discrete variables,…

Statistics Theory · Mathematics 2024-04-16 Mateo Díaz, Venkat Chandrasekaran

False Discovery Rate Control for Confounder Selection Using Mirror Statistics

While data-driven confounder selection requires careful consideration, it is frequently employed in observational studies. Widely recognized criteria for confounder selection include the minimal-set approach, which involves selecting…

Methodology · Statistics 2025-08-21 Kazuharu Harada, Masataka Taguri

False discovery rate control with multivariate $p$-values

Multivariate statistics are often available as well as necessary in hypothesis tests. We study how to use such statistics to control not only false discovery rate (FDR) but also positive FDR (pFDR) with good power. We show that FDR can be…

Statistics Theory · Mathematics 2008-05-21 Zhiyi Chi

A Computationally Efficient Approach to False Discovery Rate Control and Power Maximisation via Randomisation and Mirror Statistic

Simultaneously performing variable selection and inference in high-dimensional regression models is an open challenge in statistics and machine learning. The increasing availability of vast amounts of variables requires the adoption of…

Methodology · Statistics 2025-05-08 Marco Molinari, Magne Thoresen

Hypothesis Testing in Sequentially Sampled Data: AdapRT to Maximize Power Beyond iid Sampling

Testing whether a variable of interest affects the outcome is one of the most fundamental problems in statistics and is often the main scientific question of interest. To tackle this problem, the conditional randomization test (CRT) is…

Methodology · Statistics 2023-05-26 Dae Woong Ham, Jiaze Qiu

Sequential Tests of Multiple Hypotheses Controlling False Discovery and Nondiscovery Rates

We propose a general and flexible procedure for testing multiple hypotheses about sequential (or streaming) data that simultaneously controls both the false discovery rate (FDR) and false nondiscovery rate (FNR) under minimal assumptions…

Methodology · Statistics 2019-01-14 Jay Bartroff, Jinlin Song

Conditional Diffusion Models Based Conditional Independence Testing

Conditional independence (CI) testing is a fundamental task in modern statistics and machine learning. The conditional randomization test (CRT) was recently introduced to test whether two random variables, $X$ and $Y$, are conditionally…

Machine Learning · Statistics 2024-12-19 Yanfeng Yang, Shuai Li, Yingjie Zhang, Zhuoran Sun, Hai Shu, Ziqi Chen, Renming Zhang

The Terminating-Random Experiments Selector: Fast High-Dimensional Variable Selection with False Discovery Rate Control

We propose the Terminating-Random Experiments (T-Rex) selector, a fast variable selection method for high-dimensional data. The T-Rex selector controls a user-defined target false discovery rate (FDR) while maximizing the number of selected…

Methodology · Statistics 2024-03-14 Jasin Machkour, Michael Muma, Daniel P. Palomar