Related papers: Greedy Forward Regression for Variable Screening

Robust variable screening for regression using factor profiling

Sure Independence Screening is a fast procedure for variable selection in ultra-high dimensional regression analysis. Unfortunately, its performance greatly deteriorates with increasing dependence among the predictors. To solve this issue,…

Methodology · Statistics 2018-11-15 Yixin Wang , Stefan Van Aelst

Variable screening with multiple studies

Advancement in technology has generated abundant high-dimensional data that allows integration of multiple relevant studies. Due to their huge computational advantage, variable screening methods based on marginal correlation have become…

Methodology · Statistics 2017-10-12 Tianzhou Ma , Zhao Ren , George C. Tseng

Faithful Variable Screening for High-Dimensional Convex Regression

We study the problem of variable selection in convex nonparametric regression. Under the assumption that the true regression function is convex and sparse, we develop a screening procedure to select a subset of variables that contains the…

Statistics Theory · Mathematics 2014-11-19 Min Xu , Minhua Chen , John Lafferty

A robust variable screening procedure for ultra-high dimensional data

Variable selection in ultra-high dimensional regression problems has become an important issue. In such situations, penalized regression models may face computational problems and some pre screening of the variables may be necessary. A…

Methodology · Statistics 2020-05-01 Abhik Ghosh , Magne Thoresen

Conditional Sure Independence Screening

Independence screening is a powerful method for variable selection for `Big Data' when the number of variables is massive. Commonly used independence screening methods are based on marginal correlations or variations of it. In many…

Statistics Theory · Mathematics 2012-11-02 Emre Barut , Jianqing Fan , Anneleen Verhasselt

Variable Screening for High Dimensional Time Series

Variable selection is a widely studied problem in high dimensional statistics, primarily since estimating the precise relationship between the covariates and the response is of great importance in many scientific disciplines. However, most…

Methodology · Statistics 2018-03-12 Kashif Yousuf

Ultrahigh dimensional variable selection: beyond the linear model

Variable selection in high-dimensional space characterizes many contemporary problems in scientific discovery and decision making. Many frequently-used techniques are based on independence screening; examples include correlation ranking…

Methodology · Statistics 2008-12-18 Jianqing Fan , Richard Samworth , Yichao Wu

Forward Regression via Gram-Schmidt Orthogonalization for Ultra-High Dimensional Linear Models

Forward regression is a classical and effective tool for variable screening in ultra-high dimensional linear models, but its standard projection-based implementation can be computationally costly and numerically unstable when predictors are…

Methodology · Statistics 2026-03-20 Jialuo Chen , Zhaoxing Gao , Yifan Jiang , Ruey S. Tsay

Independent screening for single-index hazard rate models with ultra-high dimensional features

In data sets with many more features than observations, independent screening based on all univariate regression models leads to a computationally convenient variable selection method. Recent efforts have shown that in the case of…

Machine Learning · Statistics 2011-08-12 Anders Gorst-Rasmussen , Thomas H. Scheike

Screening methods for linear errors-in-variables models in high dimensions

Microarray studies, in order to identify genes associated with an outcome of interest, usually produce noisy measurements for a large number of gene expression features from a small number of subjects. One common approach to analyzing such…

Methodology · Statistics 2021-04-21 Linh Nghiem , Francis K. C. Hui , Samuel Mueller , A. H. Welsh

Nonparametric Independence Screening in Sparse Ultra-High Dimensional Varying Coefficient Models

The varying-coefficient model is an important nonparametric statistical model that allows us to examine how the effects of covariates vary with exposure variables. When the number of covariates is big, the issue of variable selection…

Statistics Theory · Mathematics 2013-03-05 Jianqing Fan , Yunbei Ma , Wei Dai

Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models

A variable screening procedure via correlation learning was proposed Fan and Lv (2008) to reduce dimensionality in sparse ultra-high dimensional models. Even when the true model is linear, the marginal regression can be highly nonlinear. To…

Methodology · Statistics 2011-01-19 Jianqing Fan , Yang Feng , Rui Song

Forward regression is a crucial methodology for automatically identifying important predictors from a large pool of potential covariates. In contexts with moderate predictor correlation, forward selection techniques can achieve screening…

Methodology · Statistics 2024-08-23 Xuejun Jiang , Yue Ma , Haofeng Wang

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression analysis when the number of predictors is much larger than the sample size. Factor analysis is used to eliminate multicollinearity among predictors, which improves the…

Methodology · Statistics 2025-10-28 Shuntaro Tanaka , Hidetoshi Matsui

Sure Independence Screening for Ultra-High Dimensional Feature Space

Variable selection plays an important role in high dimensional statistical modeling which nowadays appears in many areas and is key to various scientific discoveries. For problems of large scale or dimensionality $p$, estimation accuracy…

Statistics Theory · Mathematics 2008-08-27 Jianqing Fan , Jinchi Lv

Sure independence screening in generalized linear models with NP-dimensionality

Ultrahigh-dimensional variable selection plays an increasingly important role in contemporary scientific discoveries and statistical research. Among others, Fan and Lv [J. R. Stat. Soc. Ser. B Stat. Methodol. 70 (2008) 849-911] propose an…

Methodology · Statistics 2012-11-14 Jianqing Fan , Rui Song

Bayesian Variable Selection Under High-dimensional Settings With Grouped Covariates

Consider the normal linear regression setup when the number of covariates p is much larger than the sample size n, and the covariates form correlated groups. The response variable y is not related to an entire group of covariates in all or…

Methodology · Statistics 2023-09-06 Pranay Agarwal , Subhajit Dutta , Minerva Mukhopadhyay

Regularization after retention in ultrahigh dimensional linear regression models

In ultrahigh dimensional setting, independence screening has been both theoretically and empirically proved a useful variable selection framework with low computation cost. In this work, we propose a two-step framework by using marginal…

Methodology · Statistics 2017-08-11 Haolei Weng , Yang Feng , Xingye Qiao

Model-Free, Monotone Invariant and Computationally Efficient Feature Screening with Data-adaptive Threshold

Feature screening for ultrahigh-dimension, in general, proceeds with two essential steps. The first step is measuring and ranking the marginal dependence between response and covariates, and the second is determining the threshold. We…

Methodology · Statistics 2022-07-28 Linsui Deng , Yilin Zhang

Forward variable selection for sparse ultra-high dimensional varying coefficient models

Varying coefficient models have numerous applications in a wide scope of scientific areas. While enjoying nice interpretability, they also allow flexibility in modeling dynamic impacts of the covariates. But, in the new era of big data, it…

Methodology · Statistics 2014-10-27 Ming-Yen Cheng , Toshio Honda , Jin-Ting Zhang