Related papers: Multiple testing procedures under confounding

Modelling overdispersion heterogeneity in differential expression analysis using mixtures

Next-generation sequencing technologies now constitute a method of choice to measure gene expression. Data to analyze are read counts, commonly modeled using Negative Binomial distributions. A relevant issue associated with this…

Methodology · Statistics 2014-11-10 Elisabetta Bonafede, Franck Picard, Stéphane Robin, Cinzia Viroli

Estimation and testing for multiple regulation of multivariate mixed outcomes

Considerable interest has recently been focused on studying multiple phenotypes simultaneously in both epidemiological and genomic studies, either to capture the multidimensionality of complex disorders or to understand shared etiology of…

Methodology · Statistics 2015-11-26 Denis Agniel, Katherine P. Liao, Tianxi Cai

A Practitioner's Guide to Multiple Testing Error Rates

It is quite common in modern research, for a researcher to test many hypotheses. The statistical (frequentist) hypothesis testing framework, does not scale with the number of hypotheses in the sense that naively performing many hypothesis…

Methodology · Statistics 2013-06-26 Jonathan Rosenblatt

Capturing the Severity of Type II Errors in High-Dimensional Multiple Testing

The severity of type II errors is frequently ignored when deriving a multiple testing procedure, even though utilizing it properly can greatly help in making correct decisions. This paper puts forward a theory behind developing a multiple…

Methodology · Statistics 2014-03-25 Li He, Sanat K. Sarkar, Zhigen Zhao

Using Multiple Imputation to Classify Potential Outcomes Subgroups

With medical tests becoming increasingly available, concerns about over-testing and over-treatment dramatically increase. Hence, it is important to understand the influence of testing on treatment selection in general practice. Most…

Methodology · Statistics 2020-08-11 Yun Li, Irina Bondarenko, Michael R. Elliott, Timothy P. Hofer, Jeremy M. G. Taylor

Multiple Hypothesis Testing in Pattern Discovery

The problem of multiple hypothesis testing arises when there are more than one hypothesis to be tested simultaneously for statistical significance. This is a very common situation in many data mining applications. For instance, assessing…

Machine Learning · Statistics 2009-06-30 Sami Hanhijärvi, Kai Puolamäki, Gemma C. Garriga

Confounder Adjustment in Multiple Hypothesis Testing

We consider large-scale studies in which thousands of significance tests are performed simultaneously. In some of these studies, the multiple testing procedure can be severely biased by latent confounding factors such as batch effects and…

Methodology · Statistics 2016-06-21 Jingshu Wang, Qingyuan Zhao, Trevor Hastie, Art B. Owen

Gene profiling for determining pluripotent genes in a time course microarray experiment

In microarray experiments, it is often of interest to identify genes which have a pre-specified gene expression profile with respect to time. Methods available in the literature are, however, typically not stringent enough in identifying…

Applications · Statistics 2009-01-18 J. Tuke, G. F. V. Glonek, P. J. Solomon

Dealing with multiple testing: To adjust or not to adjust

Multiple testing problems arise naturally in scientific studies because of the need to capture or convey more information with more variables. The literature is enormous, but the emphasis is primarily methodological, providing numerous…

Other Statistics · Statistics 2020-10-07 Yudi Pawitan, Arvid Sjölander

Multiple bias-calibration for adjusting selection bias of non-probability samples using data integration

Valid statistical inference is challenging when the sample is subject to unknown selection bias. Data integration can be used to correct for selection bias when we have a parallel probability sample from the same population with some common…

Methodology · Statistics 2023-07-24 Zhonglei Wang, Shu Yang, Jae Kwang Kim

Simultaneous inference for generalized linear models with unmeasured confounders

Tens of thousands of simultaneous hypothesis tests are routinely performed in genomic studies to identify differentially expressed genes. However, due to unmeasured confounders, many standard statistical approaches may be substantially…

Methodology · Statistics 2025-03-18 Jin-Hong Du, Larry Wasserman, Kathryn Roeder

Incorporating increased variability in testing for cancer DNA methylation

Cancer development is associated with aberrant DNA methylation, including increased stochastic variability. Statistical tests for discovering cancer methylation biomarkers have focused on changes in mean methylation. To improve the power of…

Methodology · Statistics 2023-06-27 James Y. Dai, Heng Chen, Xiaoyu Wang, Wei Sun, Ying Huang, William M. Grady, Ziding Feng

Multiple Testing in Genome-Wide Association Studies via Hierarchical Hidden Markov Models

The problems of large-scale multiple testing are often encountered in modern scientific researches. Conventional multiple testing procedures usually suffer considerable loss of testing efficiency due to the lack of consideration of…

Methodology · Statistics 2022-12-21 Pengfei Wang, Zhaofeng Tian

Graphical and numerical diagnostic tools to assess multiple imputation models by posterior predictive checking

Missing data are often dealt with multiple imputation. A crucial part of the multiple imputation process is selecting sensible models to generate plausible values for incomplete data. A method based on posterior predictive checking is…

Computation · Statistics 2026-05-14 Mingyang Cai, Stef van Buuren, Gerko Vink

Controlling the False Discovery Proportion in Matched Observational Studies

We provide an approach to exploratory data analysis in matched observational studies with a single intervention and multiple endpoints. In such settings, the researcher would like to explore evidence for actual treatment effects among these…

Methodology · Statistics 2025-12-10 Mengqi Lin, Colin Fogarty

Learning predictive models for combinations of heterogeneous proteomic data sources

Multiple technologies that measure expression levels of protein mixtures in the human body offer a potential for detection and understanding the disease. The recent increase of these technologies prompts researchers to evaluate the…

Machine Learning · Computer Science 2026-05-12 Michal Valko, Richard Pelikan, Miloš Hauskrecht

Multiple testing for signal-agnostic searches of new physics with machine learning

In this work, we address the question of how to enhance signal-agnostic searches by leveraging multiple testing strategies. Specifically, we consider hypothesis tests relying on machine learning, where model selection can introduce a bias…

High Energy Physics - Phenomenology · Physics 2024-08-23 Gaia Grosso, Marco Letizia

Addressing Confounding and Continuous Exposure Measurement Error Using Corrected Score Functions

Confounding and exposure measurement error can introduce bias when drawing inference about the marginal effect of an exposure on an outcome of interest. While there are broad methodologies for addressing each source of bias individually,…

Methodology · Statistics 2025-01-29 Brian D. Richardson, Bryan S. Blette, Peter B. Gilbert, Michael G. Hudgens

The Blessings of Multiple Treatments and Outcomes in Treatment Effect Estimation

Assessing causal effects in the presence of unobserved confounding is a challenging problem. Existing studies leveraged proxy variables or multiple treatments to adjust for the confounding bias. In particular, the latter approach attributes…

Methodology · Statistics 2023-10-17 Yong Wu, Mingzhou Liu, Jing Yan, Yanwei Fu, Shouyan Wang, Yizhou Wang, Xinwei Sun

Multiple Regression Analysis of Unmeasured Confounding

Whereas confidence intervals are used to assess uncertainty due to unmeasured individuals, confounding intervals can be used to assess uncertainty due to unmeasured attributes. Previously, we have introduced a methodology for computing…

Methodology · Statistics 2025-08-13 Brian Knaeble, R Mitchell Hughes