Related papers: High-dimensional Variable Screening via Conditiona…

Conditional Sure Independence Screening

Independence screening is a powerful method for variable selection for `Big Data' when the number of variables is massive. Commonly used independence screening methods are based on marginal correlations or variations of it. In many…

Statistics Theory · Mathematics 2012-11-02 Emre Barut , Jianqing Fan , Anneleen Verhasselt

Conditional variable screening for ultra-high dimensional longitudinal data with time interactions

In recent years we have been able to gather large amounts of genomic data at a fast rate, creating situations where the number of variables greatly exceeds the number of observations. In these situations, most models that can handle a…

Methodology · Statistics 2025-02-07 Andrea Bratsberg , Abhik Ghosh , Magne Thoresen

Sufficient variable screening via directional regression with censored response

We in this paper propose a directional regression based approach for ultrahigh dimensional sufficient variable screening with censored responses. The new method is designed in a model-free manner and thus can be adapted to various complex…

Methodology · Statistics 2018-02-28 Menghao Xu , Zhou Yu , Jun Shao

Robust Sure Independence Screening for Non-polynomial dimensional Generalized Linear Models

We consider the problem of variable screening in ultra-high dimensional generalized linear models (GLMs) of non-polynomial orders. Since the popular SIS approach is extremely unstable in the presence of contamination and noise, we discuss a…

Statistics Theory · Mathematics 2022-11-15 Abhik Ghosh , Erica Ponzi , Torkjel Sandanger , Magne Thoresen

Quantile-adaptive model-free variable screening for high-dimensional heterogeneous data

We introduce a quantile-adaptive framework for nonlinear variable screening with high-dimensional heterogeneous data. This framework has two distinctive features: (1) it allows the set of active variables to vary across quantiles, thus…

Statistics Theory · Mathematics 2013-12-12 Xuming He , Lan Wang , Hyokyoung Grace Hong

Regularization after retention in ultrahigh dimensional linear regression models

In ultrahigh dimensional setting, independence screening has been both theoretically and empirically proved a useful variable selection framework with low computation cost. In this work, we propose a two-step framework by using marginal…

Methodology · Statistics 2017-08-11 Haolei Weng , Yang Feng , Xingye Qiao

Variable screening with multiple studies

Advancement in technology has generated abundant high-dimensional data that allows integration of multiple relevant studies. Due to their huge computational advantage, variable screening methods based on marginal correlation have become…

Methodology · Statistics 2017-10-12 Tianzhou Ma , Zhao Ren , George C. Tseng

Multiscale Cochran-Mantel-Haenszel Scanning for Conditional Dependency

We propose a nonparametric approach to testing conditional independence and estimating conditional association, generalizing the Cochran-Mantel-Haenszel (CMH) test and odds-ratio estimator to continuous sample spaces. It leverages a…

Methodology · Statistics 2026-04-22 Gyeonghun Kang , Jialiang Mao , Li Ma

A note on marginal correlation based screening

Independence screening methods such as the two sample $t$-test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh dimensional data sets. In this short note, simple…

Methodology · Statistics 2020-11-17 Run Wang , Somak Dutta , Vivekananda Roy

Efficient High-Dimensional Conditional Independence Testing

This article deals with the problem of testing conditional independence between two random vectors ${\bf X}$ and ${\bf Y}$ given a confounding random vector ${\bf Z}$. Several authors have considered this problem for multivariate data.…

Statistics Theory · Mathematics 2025-09-16 Bilol Banerjee

Robust rank correlation based screening

Independence screening is a variable selection method that uses a ranking criterion to select significant variables, particularly for statistical models with nonpolynomial dimensionality or "large p, small n" paradigms when p can be as…

Methodology · Statistics 2012-10-18 Gaorong Li , Heng Peng , Jun Zhang , Lixing Zhu

Partial Distance Correlation Screening for High Dimensional Time Series

High dimensional time series datasets are becoming increasingly common in various fields such as economics, finance, meteorology, and neuroscience. Given this ubiquity of time series data, it is surprising that very few works on variable…

Methodology · Statistics 2018-04-17 Kashif Yousuf , Yang Feng

Model-Free Conditional Feature Screening with Exposure Variables

In high dimensional analysis, effects of explanatory variables on responses sometimes rely on certain exposure variables, such as time or environmental factors. In this paper, to characterize the importance of each predictor, we utilize its…

Methodology · Statistics 2018-04-11 Yeqing Zhou , Jingyuan Liu , Zhihui Hao , Liping Zhu

Conditional Mean and Quantile Dependence Testing in High Dimension

Motivated by applications in biological science, we propose a novel test to assess the conditional mean dependence of a response variable on a large number of covariates. Our procedure is built on the martingale difference divergence…

Statistics Theory · Mathematics 2017-01-31 Xianyang Zhang , Shun Yao , Xiaofeng Shao

Predictive Quantile Regression with High-Dimensional Predictors: The Variable Screening Approach

This paper advances a variable screening approach to enhance conditional quantile forecasts using high-dimensional predictors. We have refined and augmented the quantile partial correlation (QPC)-based variable screening proposed by Ma et…

Econometrics · Economics 2024-10-22 Hongqi Chen , Ji Hyung Lee

Nonparametric Independence Screening in Sparse Ultra-High Dimensional Varying Coefficient Models

The varying-coefficient model is an important nonparametric statistical model that allows us to examine how the effects of covariates vary with exposure variables. When the number of covariates is big, the issue of variable selection…

Statistics Theory · Mathematics 2013-03-05 Jianqing Fan , Yunbei Ma , Wei Dai

On the consistency theory of high dimensional variable screening

Variable screening is a fast dimension reduction technique for assisting high dimensional feature selection. As a preselection method, it selects a moderate size subset of candidate variables for further refining via feature selection to…

Statistics Theory · Mathematics 2015-06-09 Xiangyu Wang , Chenlei Leng , David B. Dunson

Model-free screening procedure for ultrahigh-dimensional survival data based on Hilbert-Schmidt independence criterion

How to select the active variables which have significant impact on the event of interest is a very important and meaningful problem in the statistical analysis of ultrahigh-dimensional data. Sure independent screening procedure has been…

Methodology · Statistics 2023-03-28 Xuerui Li , Yanyan Liu , Yankai Peng , Jing Zhang

Distribution-free and Model-free Multivariate Feature Screening via Multivariate Rank Distance Correlation

Feature screening approaches are effective in selecting active features from data with ultrahigh dimensionality and increasing complexity; however, the majority of existing feature screening approaches are either restricted to a univariate…

Methodology · Statistics 2023-05-09 Shaofei Zhao , Guifang Fu

High-dimensional variable selection via tilting

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho , Piotr Fryzlewicz