Related papers: Multiple multi-sample testing under arbitrary cova…

200 papers

Large-scale hypothesis testing has become a ubiquitous problem in high-dimensional statistical inference, with broad applications in various scientific disciplines. One relevant application is constituted by imaging mass spectrometry (IMS)…

Methodology · Statistics 2021-08-19 Vladimir Vutov, Thorsten Dickhaus

High-dimensional tests are applied to find relevant sets of variables and relevant models. If variables are selected by analyzing the sums of products matrices and a corresponding mean-value test is performed, there is the danger that the…

Methodology · Statistics 2012-02-10 Juergen Laeuter, Maciej Rosolowski, Ekkehard Glimm

In this paper, we consider the problem of simultaneous testing of multivariate normal means under arbitrary covariance dependence. Specifically, let $\boldsymbol{X}\sim N_n(\boldsymbol{\theta},\boldsymbol{\Sigma})$, where…

Statistics Theory · Mathematics 2026-05-29 Prasenjit Ghosh, Arijit Chakrabarti

We propose a Bayesian latent variable model to estimate covariate-assisted dependence structures across multiple modalities of multivariate data that may be observed asynchronously. This setting commonly arises in longitudinal biomedical…

Methodology · Statistics 2026-05-27 Kun Qian, Hyung G. Park

Independence screening methods such as the two sample $t$-test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh dimensional data sets. In this short note, simple…

Methodology · Statistics 2020-11-17 Run Wang, Somak Dutta, Vivekananda Roy

This paper proposes a new mutual independence test for a large number of high dimensional random vectors. The test statistic is based on the characteristic function of the empirical spectral distribution of the sample covariance matrix. The…

Statistics Theory · Mathematics 2012-05-31 G. M. Pan, J. Gao, Y. Yang, M. Guo

Current statistical inference problems in areas like astronomy, genomics, and marketing routinely involve the simultaneous testing of thousands -- even millions -- of null hypotheses. For high-dimensional multivariate distributions, these…

Methodology · Statistics 2017-04-25 Weixin Cai, Nima S. Hejazi, Alan E. Hubbard

Large-scale multiple testing tasks often exhibit dependence, and leveraging the dependence between individual tests is still one challenging and important problem in statistics. With recent advances in graphical models, it is feasible to…

Methodology · Statistics 2012-10-19 Jie Liu, Chunming Zhang, Catherine McCarty, Peggy Peissig, Elizabeth Burnside, David Page

Testing independence among a number of (ultra) high-dimensional random samples is a fundamental and challenging problem. By arranging $n$ identically distributed $p$-dimensional random vectors into a $p \times n$ data matrix, we investigate…

Statistics Theory · Mathematics 2017-03-28 Xi Chen, Weidong Liu

We propose a general, modular method for significance testing of groups (or clusters) of variables in a high-dimensional linear model. In presence of high correlations among the covariables, due to serious problems of identifiability, it is…

Statistics Theory · Mathematics 2015-02-12 Jacopo Mandozzi, Peter Bühlmann

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho, Piotr Fryzlewicz

Statistical analysis of multimodal imaging data is a challenging task, since the data involves high-dimensionality, strong spatial correlations and complex data structures. In this paper, we propose rigorous statistical testing procedures…

Methodology · Statistics 2023-03-08 Jinyuan Chang, Jing He, Jian Kang, Mingcong Wu

Large-scale multiple testing under static factor models is widely used to detect sparse signals in high-dimensional data. However, static factor models are arguably too stringent because they ignore serial correlation, which seriously…

Statistics Theory · Mathematics 2025-04-04 Xinxin Yang, Lilun Du

Feature screening is an important tool in analyzing ultrahigh-dimensional data, particularly in the field of Omics and oncology studies. However, most attention has been focused on identifying features that have a linear or monotonic impact…

Methodology · Statistics 2023-05-10 Yaxian Chen, KF Lam, Zhonghua Liu

Multi-label classification is a common challenge in various machine learning applications, where a single data instance can be associated with multiple classes simultaneously. The current paper proposes a novel tree-based method for…

Methodology · Statistics 2024-05-01 Chhavi Tyagi, Wenge Guo

In many applied sciences a popular analysis strategy for high-dimensional data is to fit many multivariate generalized linear models in parallel. This paper presents a novel approach to address the resulting multiple testing problem by…

Statistics Theory · Mathematics 2024-10-07 Riccardo De Santis, Jelle J. Goeman, Samuel Davenport, Jesse Hemerik, Livio Finos

With medical tests becoming increasingly available, concerns about over-testing and over-treatment dramatically increase. Hence, it is important to understand the influence of testing on treatment selection in general practice. Most…

Methodology · Statistics 2020-08-11 Yun Li, Irina Bondarenko, Michael R. Elliott, Timothy P. Hofer, Jeremy M. G. Taylor

Datasets may contain observations with multiple labels. If the labels are not mutually exclusive, and if the labels vary greatly in frequency, obtaining a sample that includes sufficient observations with scarcer labels to make inferences…

Machine Learning · Computer Science 2026-05-27 Simon Chung, Colby J. Vorland, Donna L. Maney, Andrew W. Brown

Multivariate regression model is a natural generalization of the classical univariate regression model for fitting multiple responses. In this paper, we propose a high-dimensional multivariate conditional regression model for…

Machine Learning · Statistics 2016-11-26 Junhui Wang

Modeling the dependence between outputs is a fundamental challenge in multilabel classification. In this work we show that a generic regularized nonlinearity mapping independent predictions to joint predictions is sufficient to achieve…

Machine Learning · Computer Science 2015-04-22 Nikos Karampatziakis, Paul Mineiro