Related papers: Multiple multi-sample testing under arbitrary cova…

Multiple two-sample testing under arbitrary covariance dependency with an application in imaging mass spectrometry

Large-scale hypothesis testing has become a ubiquitous problem in high-dimensional statistical inference, with broad applications in various scientific disciplines. One relevant application is constituted by imaging mass spectrometry (IMS)…

Methodology · Statistics 2021-08-19 Vladimir Vutov , Thorsten Dickhaus

Exact Multivariate Tests - A New Effective Principle of Controlled Model Choice

High-dimensional tests are applied to find relevant sets of variables and relevant models. If variables are selected by analyzing the sums of products matrices and a corresponding mean-value test is performed, there is the danger that the…

Methodology · Statistics 2012-02-10 Juergen Laeuter , Maciej Rosolowski , Ekkehard Glimm

Admissibility of Adaptive Monotone Step-Down Multiple Testing Procedures Under Arbitrary Covariance Dependence

In this paper, we consider the problem of simultaneous testing of multivariate normal means under arbitrary covariance dependence. Specifically, let $\boldsymbol{X}\sim N_n(\boldsymbol{\theta},\boldsymbol{\Sigma})$, where…

Statistics Theory · Mathematics 2026-05-29 Prasenjit Ghosh , Arijit Chakrabarti

Cross-modal dependence analysis with asynchronous longitudinal multimodal data

We propose a Bayesian latent variable model to estimate covariate-assisted dependence structures across multiple modalities of multivariate data that may be observed asynchronously. This setting commonly arises in longitudinal biomedical…

Methodology · Statistics 2026-05-27 Kun Qian , Hyung G. Park

A note on marginal correlation based screening

Independence screening methods such as the two sample $t$-test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh dimensional data sets. In this short note, simple…

Methodology · Statistics 2020-11-17 Run Wang , Somak Dutta , Vivekananda Roy

Independence Test for High Dimensional Random Vectors

This paper proposes a new mutual independence test for a large number of high dimensional random vectors. The test statistic is based on the characteristic function of the empirical spectral distribution of the sample covariance matrix. The…

Statistics Theory · Mathematics 2012-05-31 G. M. Pan , J. Gao , Y. Yang , M. Guo

Data-adaptive statistics for multiple hypothesis testing in high-dimensional settings

Current statistical inference problems in areas like astronomy, genomics, and marketing routinely involve the simultaneous testing of thousands -- even millions -- of null hypotheses. For high-dimensional multivariate distributions, these…

Methodology · Statistics 2017-04-25 Weixin Cai , Nima S. Hejazi , Alan E. Hubbard

Graphical-model Based Multiple Testing under Dependence, with Applications to Genome-wide Association Studies

Large-scale multiple testing tasks often exhibit dependence, and leveraging the dependence between individual tests is still one challenging and important problem in statistics. With recent advances in graphical models, it is feasible to…

Methodology · Statistics 2012-10-19 Jie Liu , Chunming Zhang , Catherine McCarty , Peggy Peissig , Elizabeth Burnside , David Page

Testing independence with high-dimensional correlated samples

Testing independence among a number of (ultra) high-dimensional random samples is a fundamental and challenging problem. By arranging $n$ identically distributed $p$-dimensional random vectors into a $p \times n$ data matrix, we investigate…

Statistics Theory · Mathematics 2017-03-28 Xi Chen , Weidong Liu

A sequential rejection testing method for high-dimensional regression with correlated variables

We propose a general, modular method for significance testing of groups (or clusters) of variables in a high-dimensional linear model. In presence of high correlations among the covariables, due to serious problems of identifiability, it is…

Statistics Theory · Mathematics 2015-02-12 Jacopo Mandozzi , Peter Bühlmann

High-dimensional variable selection via tilting

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho , Piotr Fryzlewicz

Statistical inferences for complex dependence of multimodal imaging data

Statistical analysis of multimodal imaging data is a challenging task, since the data involves high-dimensionality, strong spatial correlations and complex data structures. In this paper, we propose rigorous statistical testing procedures…

Methodology · Statistics 2023-03-08 Jinyuan Chang , Jing He , Jian Kang , Mingcong Wu

Multiple Testing under High-dimensional Dynamic Factor Model

Large-scale multiple testing under static factor models is widely used to detect sparse signals in high-dimensional data. However, static factor models are arguably too stringent because they ignore serial correlation, which seriously…

Statistics Theory · Mathematics 2025-04-04 Xinxin Yang , Lilun Du

High-dimensional Feature Screening for Nonlinear Associations With Survival Outcome Using Restricted Mean Survival Time

Feature screening is an important tool in analyzing ultrahigh-dimensional data, particularly in the field of Omics and oncology studies. However, most attention has been focused on identifying features that have a linear or monotonic impact…

Methodology · Statistics 2023-05-10 Yaxian Chen , KF Lam , Zhonghua Liu

Multi-label Classification under Uncertainty: A Tree-based Conformal Prediction Approach

Multi-label classification is a common challenge in various machine learning applications, where a single data instance can be associated with multiple classes simultaneously. The current paper proposes a novel tree-based method for…

Methodology · Statistics 2024-05-01 Chhavi Tyagi , Wenge Guo

Permutation-based multiple testing when fitting many generalized linear models

In many applied sciences a popular analysis strategy for high-dimensional data is to fit many multivariate generalized linear models in parallel. This paper presents a novel approach to address the resulting multiple testing problem by…

Statistics Theory · Mathematics 2024-10-07 Riccardo De Santis , Jelle J. Goeman , Samuel Davenport , Jesse Hemerik , Livio Finos

Using Multiple Imputation to Classify Potential Outcomes Subgroups

With medical tests becoming increasingly available, concerns about over-testing and over-treatment dramatically increase. Hence, it is important to understand the influence of testing on treatment selection in general practice. Most…

Methodology · Statistics 2020-08-11 Yun Li , Irina Bondarenko , Michael R. Elliott , Timothy P. Hofer , Jeremy M. G. Taylor

A Multivariate Bernoulli-Based Sampling Method for Multi-Label Data with Application to Meta-Research

Datasets may contain observations with multiple labels. If the labels are not mutually exclusive, and if the labels vary greatly in frequency, obtaining a sample that includes sufficient observations with scarcer labels to make inferences…

Machine Learning · Computer Science 2026-05-27 Simon Chung , Colby J. Vorland , Donna L. Maney , Andrew W. Brown

Joint estimation of sparse multivariate regression and conditional graphical models

Multivariate regression model is a natural generalization of the classical univariate regression model for fitting multiple responses. In this paper, we propose a high-dimensional multivariate conditional regression model for…

Machine Learning · Statistics 2016-11-26 Junhui Wang

Scalable Multilabel Prediction via Randomized Methods

Modeling the dependence between outputs is a fundamental challenge in multilabel classification. In this work we show that a generic regularized nonlinearity mapping independent predictions to joint predictions is sufficient to achieve…

Machine Learning · Computer Science 2015-04-22 Nikos Karampatziakis , Paul Mineiro