Related papers: Large Scale Partial Correlation Screening with Unc…

Predictive Correlation Screening: Application to Two-stage Predictor Design in High Dimension

We introduce a new approach to variable selection, called Predictive Correlation Screening, for predictor design. Predictive Correlation Screening (PCS) implements false positive control on the selected variables, is well suited to small…

Machine Learning · Statistics 2013-04-11 Hamed Firouzi, Bala Rajaratnam, Alfred Hero

Large Scale Correlation Screening

This paper treats the problem of screening for variables with high correlations in high dimensional data in which there can be many fewer samples than variables. We focus on threshold-based correlation screening methods for three related…

Machine Learning · Statistics 2015-03-18 Alfred O. Hero, Bala Rajaratnam

A unified framework for correlation mining in ultra-high dimension

Many applications benefit from theory relevant to the identification of variables having large correlations or partial correlations in high dimension. Recently there has been progress in the ultra-high dimensional setting when the sample…

Statistics Theory · Mathematics 2022-08-25 Yun Wei, Bala Rajaratnam, Alfred O. Hero

Distributed Conditional Feature Screening via Pearson Partial Correlation with FDR Control

This paper studies the distributed conditional feature screening for massive data with ultrahigh-dimensional features. Specifically, three distributed partial correlation feature screening methods (SAPS, ACPS and JDPS methods) are firstly…

Methodology · Statistics 2024-03-12 Naiwen Pang, Xiaochao Xia

Partial Distance Correlation Screening for High Dimensional Time Series

High dimensional time series datasets are becoming increasingly common in various fields such as economics, finance, meteorology, and neuroscience. Given this ubiquity of time series data, it is surprising that very few works on variable…

Methodology · Statistics 2018-04-17 Kashif Yousuf, Yang Feng

Inference for Large Panel Data with Many Covariates

This paper proposes a novel testing procedure for selecting a sparse set of covariates that explains a large dimensional panel. Our selection method provides correct false detection control while having higher power than existing…

Econometrics · Economics 2023-03-09 Markus Pelger, Jiacheng Zou

A Robust Partial Correlation-based Screening Approach

As a computationally fast and working efficient tool, sure independence screening has received much attention in solving ultrahigh dimensional problems. This paper contributes two robust sure screening approaches that simultaneously take…

Methodology · Statistics 2021-07-27 Xiaochao Xia

Identifying the Complete Correlation Structure in Large-Scale High-Dimensional Data Sets with Local False Discovery Rates

The identification of the dependent components in multiple data sets is a fundamental problem in many practical applications. The challenge in these applications is that often the data sets are high-dimensional with few observations or…

Methodology · Statistics 2023-06-02 Martin Gölz, Tanuj Hasija, Michael Muma, Abdelhak M. Zoubir

Predictive Quantile Regression with High-Dimensional Predictors: The Variable Screening Approach

This paper advances a variable screening approach to enhance conditional quantile forecasts using high-dimensional predictors. We have refined and augmented the quantile partial correlation (QPC)-based variable screening proposed by Ma et…

Econometrics · Economics 2024-10-22 Hongqi Chen, Ji Hyung Lee

Contextual Online False Discovery Rate Control

Multiple hypothesis testing, a situation when we wish to consider many hypotheses, is a core problem in statistical inference that arises in almost every scientific field. In this setting, controlling the false discovery rate (FDR), which…

Statistics Theory · Mathematics 2019-03-19 Shiyun Chen, Shiva Kasiviswanathan

Penalized linear regression with high-dimensional pairwise screening

In variable selection, most existing screening methods focus on marginal effects and ignore dependence between covariates. To improve the performance of selection, we incorporate pairwise effects in covariates for screening and…

Methodology · Statistics 2019-02-12 Siliang Gong, Kai Zhang, Yufeng Liu

The Control of the False Discovery Rate in Fixed Sequence Multiple Testing

Controlling the false discovery rate (FDR) is a powerful approach to multiple testing. In many applications, the tested hypotheses have an inherent hierarchical structure. In this paper, we focus on the fixed sequence structure where the…

Methodology · Statistics 2016-11-11 Gavin Lynch, Wenge Guo, Sanat K. Sarkar, Helmut Finner

Copula-based Partial Correlation Screening: a Joint and Robust Approach

Screening for ultrahigh dimensional features may encounter complicated issues such as outlying observations, heteroscedasticity or heavy-tailed distribution, multi-collinearity and confounding effects. Standard correlation-based marginal…

Statistics Theory · Mathematics 2018-12-27 Xiaochao Xia, Jialiang Li

Estimating False Discovery Proportion Under Arbitrary Covariance Dependence

Multiple hypothesis testing is a fundamental problem in high dimensional inference, with wide applications in many scientific fields. In genome-wide association studies, tens of thousands of tests are performed simultaneously to find if any…

Methodology · Statistics 2011-11-16 Jianqing Fan, Xu Han, Weijie Gu

A covariate-adaptive test for replicability across multiple studies with false discovery rate control

Replicability is a lynchpin for credible discoveries. The partial conjunction (PC) p-value, which combines individual base p-values from multiple similar studies, can gauge whether a feature of interest exhibits replicated signals across…

Methodology · Statistics 2025-07-29 Ninh Tran, Dennis Leung

A note of feature screening via rank-based coefficient of correlation

Feature screening is useful and popular to detect informative predictors for ultrahigh-dimensional data before developing proceeding statistical analysis or constructing statistical models. While a large body of feature screening procedures…

Methodology · Statistics 2020-08-12 Li-Pang Chen

False discovery rate regression: an application to neural synchrony detection in primary visual cortex

Many approaches for multiple testing begin with the assumption that all tests in a given study should be combined into a global false-discovery-rate analysis. But this may be inappropriate for many of today's large-scale screening problems,…

Methodology · Statistics 2014-06-10 James G. Scott, Ryan C. Kelly, Matthew A. Smith, Pengcheng Zhou, Robert E. Kass

Confounder Selection via Support Intersection

Confounding matters in almost all observational studies that focus on causality. In order to eliminate bias caused by confounders, oftentimes a substantial number of features need to be collected in the analysis. In this case, large p…

Statistics Theory · Mathematics 2019-12-30 Shinyuu Lee, Yuru Zhu

Sparse Functional Principal Component Analysis in High Dimensions

Functional principal component analysis (FPCA) is a fundamental tool and has attracted increasing attention in recent decades, while existing methods are restricted to data with a single or finite number of random functions (much smaller…

Methodology · Statistics 2021-01-22 Xiaoyu Hu, Fang Yao

Sparse PCA with False Discovery Rate Controlled Variable Selection

Sparse principal component analysis (PCA) aims at mapping large dimensional data to a linear subspace of lower dimension. By imposing loading vectors to be sparse, it performs the double duty of dimension reduction and variable selection.…

Machine Learning · Statistics 2024-01-17 Jasin Machkour, Arnaud Breloy, Michael Muma, Daniel P. Palomar, Frédéric Pascal