Related papers: Screening Methods for Classification Based on Non-…

Variable screening with multiple studies

Advancement in technology has generated abundant high-dimensional data that allows integration of multiple relevant studies. Due to their huge computational advantage, variable screening methods based on marginal correlation have become…

Methodology · Statistics 2017-10-12 Tianzhou Ma , Zhao Ren , George C. Tseng

Linear screening for high-dimensional computer experiments

In this paper we propose a linear variable screening method for computer experiments when the number of input variables is larger than the number of runs. This method uses a linear model to model the nonlinear data, and screens the…

Methodology · Statistics 2020-06-16 Chunya Li , Daijun Chen , Shifeng Xiong

Nonparametric Independence Screening in Sparse Ultra-High Dimensional Varying Coefficient Models

The varying-coefficient model is an important nonparametric statistical model that allows us to examine how the effects of covariates vary with exposure variables. When the number of covariates is big, the issue of variable selection…

Statistics Theory · Mathematics 2013-03-05 Jianqing Fan , Yunbei Ma , Wei Dai

Are screening methods useful in feature selection? An empirical study

Filter or screening methods are often used as a preprocessing step for reducing the number of variables used by a learning algorithm in obtaining a classification or regression model. While there are many such filter methods, there is a…

Machine Learning · Statistics 2019-09-13 Mingyuan Wang , Adrian Barbu

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression analysis when the number of predictors is much larger than the sample size. Factor analysis is used to eliminate multicollinearity among predictors, which improves the…

Methodology · Statistics 2025-10-28 Shuntaro Tanaka , Hidetoshi Matsui

A Model-free Variable Screening Method Based on Leverage Score

With rapid advances in information technology, massive datasets are collected in all fields of science, such as biology, chemistry, and social science. Useful or meaningful information is extracted from these data often through statistical…

Methodology · Statistics 2021-09-22 Wenxuan Zhong , Yiwen Liu , Peng Zeng

Conditional Sure Independence Screening

Independence screening is a powerful method for variable selection for `Big Data' when the number of variables is massive. Commonly used independence screening methods are based on marginal correlations or variations of it. In many…

Statistics Theory · Mathematics 2012-11-02 Emre Barut , Jianqing Fan , Anneleen Verhasselt

RaSE: A Variable Screening Framework via Random Subspace Ensembles

Variable screening methods have been shown to be effective in dimension reduction under the ultra-high dimensional setting. Most existing screening methods are designed to rank the predictors according to their individual contributions to…

Methodology · Statistics 2022-02-08 Ye Tian , Yang Feng

Accelerated Sparse Bayesian Learning via Screening Test and Its Applications

In high-dimensional settings, sparse structures are critical for efficiency in term of memory and computation complexity. For a linear system, to find the sparsest solution provided with an over-complete dictionary of features directly is…

Machine Learning · Statistics 2020-07-09 Yiping Jiang , Tianshi Chen

Determination of class-specific variables in nonparametric multiple-class classification

As technology advanced, collecting data via automatic collection devices become popular, thus we commonly face data sets with lengthy variables, especially when these data sets are collected without specific research goals beforehand. It…

Machine Learning · Statistics 2022-05-10 Wan-Ping Nicole Chen , Yuan-chin Ivan Chang

Conditional nonparametric variable screening by neural factor regression

High-dimensional covariates often admit linear factor structure. To effectively screen correlated covariates in high-dimension, we propose a conditional variable screening test based on non-parametric regression using neural networks due to…

Econometrics · Economics 2024-08-21 Jianqing Fan , Weining Wang , Yue Zhao

Nonparametric Variable Screening with Optimal Decision Stumps

Decision trees and their ensembles are endowed with a rich set of diagnostic tools for ranking and screening variables in a predictive model. Despite the widespread use of tree based variable importance measures, pinning down their…

Machine Learning · Statistics 2020-12-14 Jason M. Klusowski , Peter M. Tian

Distributed variable screening for generalized linear models

In this article, we develop a distributed variable screening method for generalized linear models. This method is designed to handle situations where both the sample size and the number of covariates are large. Specifically, the proposed…

Methodology · Statistics 2024-05-09 Tianbo Diao , Lianqiang Qu , Bo Li , Liuquan Sun

Nonparametric Independence Screening in Sparse Ultra-High Dimensional Additive Models

A variable screening procedure via correlation learning was proposed Fan and Lv (2008) to reduce dimensionality in sparse ultra-high dimensional models. Even when the true model is linear, the marginal regression can be highly nonlinear. To…

Methodology · Statistics 2011-01-19 Jianqing Fan , Yang Feng , Rui Song

On the consistency theory of high dimensional variable screening

Variable screening is a fast dimension reduction technique for assisting high dimensional feature selection. As a preselection method, it selects a moderate size subset of candidate variables for further refining via feature selection to…

Statistics Theory · Mathematics 2015-06-09 Xiangyu Wang , Chenlei Leng , David B. Dunson

Random Partitioning and Distribution-based Thresholding for Iterative Variable Screening in High Dimensions

In big data analysis, a simple task such as linear regression can become very challenging as the variable dimension $p$ grows. As a result, variable screening is inevitable in many scientific studies. In recent years, randomized algorithms…

Methodology · Statistics 2019-02-13 Yu-Hsiang Cheng , Tzee-Ming Huang , Su-Yun Huang

Bayesian iterative screening in ultra-high dimensional linear regressions

Variable selection in ultra-high dimensional linear regression is often preceded by a screening step to significantly reduce the dimension. Here we develop a Bayesian variable screening method (BITS) guided by the posterior model…

Methodology · Statistics 2025-02-28 Run Wang , Somak Dutta , Vivekananda Roy

Robust variable screening for regression using factor profiling

Sure Independence Screening is a fast procedure for variable selection in ultra-high dimensional regression analysis. Unfortunately, its performance greatly deteriorates with increasing dependence among the predictors. To solve this issue,…

Methodology · Statistics 2018-11-15 Yixin Wang , Stefan Van Aelst

A Paradigmatic Regression Algorithm for Gene Selection Problems

Motivation: Gene selection has become a common task in most gene expression studies. The objective of such research is often to identify the smallest possible set of genes that can still achieve good predictive performance. The problem of…

Methodology · Statistics 2015-11-25 Stéphane Guerrier , Nabil Mili , Roberto Molinari , Samuel Orso , Marco Avella-Medina , Yanyuan Ma

Greedy Forward Regression for Variable Screening

Two popular variable screening methods under the ultra-high dimensional setting with the desirable sure screening property are the sure independence screening (SIS) and the forward regression (FR). Both are classical variable screening…

Methodology · Statistics 2015-11-05 Ming-Yen Cheng , Sanying Feng , Gaorong Li , Heng Lian