Related papers: Conformalized Multiple Testing under Unknown Null …

Estimating the null distribution for conditional inference and genome-scale screening

In a novel approach to the multiple testing problem, Efron (2004; 2007) formulated estimators of the distribution of test statistics or nominal p-values under a null distribution suitable for modeling the data of thousands of unaffected…

Methodology · Statistics 2012-10-30 David R. Bickel

Empirical null and false discovery rate inference for exponential families

In large scale multiple testing, the use of an empirical null distribution rather than the theoretical null distribution can be critical for correct inference. This paper proposes a "mode matching" method for fitting an empirical null…

Applications · Statistics 2009-01-27 Armin Schwartzman

An adaptive significance threshold criterion for massive multiple hypotheses testing

This research deals with massive multiple hypothesis testing. First, regarding multiple tests as an estimation problem under a proper population model, an error measurement called Erroneous Rejection Ratio (ERR) is introduced and related to…

Statistics Theory · Mathematics 2007-06-13 Cheng Cheng

False Discovery Rate Control For Structured Multiple Testing: Asymmetric Rules And Conformal Q-values

The effective utilization of structural information in data while ensuring statistical validity poses a significant challenge in false discovery rate (FDR) analyses. Conformal inference provides rigorous theory for grounding complex machine…

Methodology · Statistics 2024-06-18 Zinan Zhao, Wenguang Sun

Semi-supervised multiple testing

An important limitation of standard multiple testing procedures is that the null distribution should be known. Here, we consider a null distribution-free approach for multiple testing in the following semi-supervised setting: the user does…

Statistics Theory · Mathematics 2022-12-08 David Mary, Etienne Roquain

Empirical Bayes large-scale multiple testing for high-dimensional binary outcome data

This paper explores the multiple testing problem for sparse high-dimensional data with binary outcomes. We propose novel empirical Bayes multiple testing procedures based on a spike-and-slab posterior and then evaluate their performance in…

Statistics Theory · Mathematics 2025-06-16 Yu-Chien Bo Ning

Bias corrected estimators for proportion of true null hypotheses under exponential model: Application of adaptive FDR-controlling in segmented failure data

Two recently introduced model based bias corrected estimators for proportion of true null hypotheses ($\pi_0$) under multiple hypotheses testing scenario have been restructured for exponentially distributed random observations available for…

Statistics Theory · Mathematics 2020-07-28 Aniket Biswas, Gaurangadeb Chattopadhyay, Aditya Chatterjee

Exploratory data analysis for large-scale multiple testing problems and its application in gene expression studies

In large scale multiple testing problems, a two-class empirical Bayes approach can be used to control the false discovery rate (Fdr) for the entire array of hypotheses under study. A sample splitting step is incorporated to modify that…

Computation · Statistics 2019-12-13 Paramita Chakraborty, Chong Ma, John Grego, James Lynch

Empirical Bayes methods for controlling the false discovery rate with dependent data

False discovery rate (FDR) has been widely used as an error measure in large scale multiple testing problems, but most research in the area has been focused on procedures for controlling the FDR based on independent test statistics or the…

Methodology · Statistics 2009-09-29 Weihua Tang, Cun-Hui Zhang

Empirical Bayes Method for Large Scale Multiple Testing with Heteroscedastic Errors

In this paper, we address the normal mean inference problem, which involves testing multiple means of normal random variables with heteroscedastic variances. Most existing empirical Bayes methods for this setting are developed under…

Methodology · Statistics 2026-01-01 Kwangok Seo, Johan Lim, Kaiwen Wang, Dohwan Park, Shota Katayama, Xinlei Wang

On spike and slab empirical Bayes multiple testing

This paper explores a connection between empirical Bayes posterior distributions and false discovery rate (FDR) control. In the Gaussian sequence model, this work shows that empirical Bayes-calibrated spike and slab posterior distributions…

Statistics Theory · Mathematics 2019-06-18 Ismael Castillo, Etienne Roquain

Derandomized Novelty Detection with FDR Control via Conformal E-values

Conformal inference provides a general distribution-free method to rigorously calibrate the output of any machine learning algorithm for novelty detection. While this approach has many strengths, it has the limitation of being randomized,…

Machine Learning · Computer Science 2023-10-25 Meshi Bashari, Amir Epstein, Yaniv Romano, Matteo Sesia

Asymptotically optimal sequential FDR and pFDR control with (or without) prior information on the number of signals

We investigate asymptotically optimal multiple testing procedures for streams of sequential data in the context of prior information on the number of false null hypotheses ("signals"). We show that the "gap" and "gap-intersection"…

Methodology · Statistics 2020-05-04 Xinrui He, Jay Bartroff

A Conformalized Empirical Bayes Method for Multiple Testing with Side Information

This article presents a Conformalized Locally Adaptive Weighting (CLAW) approach to multiple testing with side information. The proposed method employs innovative data-driven strategies to construct pairwise exchangeable scores, which are…

Methodology · Statistics 2025-02-28 Zinan Zhao, Wenguang Sun

A robust and powerful method for assessing replicability of high dimensional data

Identifying signals that replicate across multiple studies is essential for establishing robust scientific evidence, yet existing methods for high-dimensional replicability analysis either rely on restrictive modeling assumptions, are…

Methodology · Statistics 2026-03-05 Haochen Lei, Yan Li, Hongyuan Cao

Null-free False Discovery Rate Control Using Decoy Permutations

The traditional approaches to false discovery rate (FDR) control in multiple hypothesis testing are usually based on the null distribution of a test statistic. However, all types of null distributions, including the theoretical,…

Methodology · Statistics 2021-04-13 Kun He, Mengjie Li, Yan Fu, Fuzhou Gong, Xiaoming Sun

BONuS: Multiple multivariate testing with a data-adaptive test statistic

We propose a new adaptive empirical Bayes framework, the Bag-Of-Null-Statistics (BONuS) procedure, for multiple testing where each hypothesis testing problem is itself multivariate or nonparametric. BONuS is an adaptive and interactive…

Methodology · Statistics 2021-07-05 Chiao-Yu Yang, Lihua Lei, Nhat Ho, Will Fithian

Imprecise Subset Simulation

The objective of this work is to quantify the uncertainty in probability of failure estimates resulting from incomplete knowledge of the probability distributions for the input random variables. We propose a framework that couples the…

Methodology · Statistics 2021-10-26 Dimitris G. Giovanis, Michael Shields

Solving the Empirical Bayes Normal Means Problem with Correlated Noise

The Normal Means problem plays a fundamental role in many areas of modern high-dimensional statistics, both in theory and practice. And the Empirical Bayes (EB) approach to solving this problem has been shown to be highly effective, again…

Methodology · Statistics 2018-12-27 Lei Sun, Matthew Stephens

Unified Conformalized Multiple Testing with Full Data Efficiency

Conformalized multiple testing offers a model-free way to control predictive uncertainty in decision-making. Existing methods typically use only part of the available data to build score functions tailored to specific settings. We propose a…

Methodology · Statistics 2026-05-22 Yuyang Huo, Xiaoyang Wu, Changliang Zou, Haojie Ren