Related papers: High-dimensional iterative variable selection for …

Iterative variable selection for high-dimensional data with binary outcomes

We propose an iterative variable selection scheme for high-dimensional data with binary outcomes. The scheme adopts a structured screen-and-select framework and uses non-local prior-based Bayesian model selection within the same. The…

Methodology · Statistics 2022-11-08 Nilotpal Sanyal

High-dimensional variable selection for Cox's proportional hazards model

Variable selection in high dimensional space has challenged many contemporary statistical problems from many frontiers of scientific disciplines. Recent technology advance has made it possible to collect a huge amount of covariate…

Machine Learning · Statistics 2010-05-20 Jianqing Fan, Yang Feng, Yichao Wu

High-dimensional variable selection via tilting

The paper considers variable selection in linear regression models where the number of covariates is possibly much larger than the number of observations. High dimensionality of the data brings in many complications, such as (possibly…

Methodology · Statistics 2016-11-29 Haeran Cho, Piotr Fryzlewicz

High-dimensional Feature Screening for Nonlinear Associations With Survival Outcome Using Restricted Mean Survival Time

Feature screening is an important tool in analyzing ultrahigh-dimensional data, particularly in the field of Omics and oncology studies. However, most attention has been focused on identifying features that have a linear or monotonic impact…

Methodology · Statistics 2023-05-10 Yaxian Chen, KF Lam, Zhonghua Liu

Bayesian variable and hazard structure selection in the General Hazard model

The proportional hazards (PH) and accelerated failure time (AFT) models are the most widely used hazard structures for analysing time-to-event data. When the goal is to identify variables associated with event times, variable selection is…

Methodology · Statistics 2026-02-04 Yulong Chen, Jim Griffin, Francisco Javier Rubio

Efficient Test-based Variable Selection for High-dimensional Linear Models

Variable selection plays a fundamental role in high-dimensional data analysis. Various methods have been developed for variable selection in recent years. Well-known examples are forward stepwise regression (FSR) and least angle regression…

Methodology · Statistics 2018-02-01 Siliang Gong, Kai Zhang, Yufeng Liu

Variable Selection for High-dimensional Generalized Linear Models using an Iterated Conditional Modes/Medians Algorithm

High-dimensional linear and nonlinear models have been extensively used to identify associations between response and explanatory variables. The variable selection problem is commonly of interest in the presence of massive and complex data.…

Methodology · Statistics 2017-08-10 Vitara Pungpapong, Min Zhang, Dabao Zhang

Ultrahigh dimensional variable selection: beyond the linear model

Variable selection in high-dimensional space characterizes many contemporary problems in scientific discovery and decision making. Many frequently-used techniques are based on independence screening; examples include correlation ranking…

Methodology · Statistics 2008-12-18 Jianqing Fan, Richard Samworth, Yichao Wu

Variable Selection for Survival Data with A Class of Adaptive Elastic Net Techniques

The accelerated failure time (AFT) models have proved useful in many contexts, though heavy censoring (as for example in cancer survival) and high dimensionality (as for example in microarray data) cause difficulties for model fitting and…

Methodology · Statistics 2013-12-10 Md Hasinur Rahaman Khan, J. Ewart H. Shaw

Penalized integrative analysis under the accelerated failure time model

For survival data with high-dimensional covariates, results generated in the analysis of a single dataset are often unsatisfactory because of the small sample size. Integrative analysis pools raw data from multiple independent studies with…

Methodology · Statistics 2015-01-13 Qingzhao Zhang, Sanguo Zhang, Jin Liu, Jian Huang, Shuangge Ma

High-dimensional variable selection

This paper explores the following question: what kind of statistical guarantees can be given when doing variable selection in high-dimensional models? In particular, we look at the error rates and power of some multi-stage regression…

Statistics Theory · Mathematics 2009-08-20 Larry Wasserman, Kathryn Roeder

Optimal model averaging forecasting in high-dimensional survival analysis

This article considers ultrahigh-dimensional forecasting problems with survival response variables. We propose a two-step model averaging procedure for improving the forecasting accuracy of the true conditional mean of a survival response…

Methodology · Statistics 2022-11-28 Xiaodong Yan, Hongni Wang, Wei Wang, Jinhan Xie, Yanyan Ren, Xinjun Wang

Robust Estimation and Variable Selection for the Accelerated Failure Time Model

This paper considers robust modeling of the survival time for cancer patients. Accurate prediction can be helpful for developing therapeutic and care strategies. We propose a unified Expectation-Maximization approach combined with the…

Methodology · Statistics 2019-12-23 Yi Li, Muxuan Liang, Lu Mao, Sijian Wang

Comparison study of variable selection procedures in high-dimensional Gaussian linear regression

We propose an extensive simulation study to compare some variable selection procedures in a high-dimensional framework. Assuming that the relationship between the active variables and the response variable is linear, the high-dimensional…

Applications · Statistics 2025-03-21 Perrine Lacroix, Mélina Gallopin, Marie-Laure Martin

Scalable Variational Bayes Inference for Dynamic Variable Selection

We develop a variational Bayes approach for dynamic variable selection in high-dimensional regression models with time-varying parameters and predictors that exhibit a predefined group structure. Through comprehensive simulation studies, we…

Methodology · Statistics 2025-04-16 Nicolas Bianco, Mauro Bernardi, Daniele Bianchi

Iterated Feature Screening based on Distance Correlation for Ultrahigh-Dimensional Censored Data with Covariates Measurement Error

Feature screening is an important method to reduce the dimension and capture informative variables in ultrahigh-dimensional data analysis. Many methods have been developed for feature screening. These methods, however, are challenged by…

Methodology · Statistics 2019-01-08 Li-Pang Chen

Bayesian iterative screening in ultra-high dimensional linear regressions

Variable selection in ultra-high dimensional linear regression is often preceded by a screening step to significantly reduce the dimension. Here we develop a Bayesian variable screening method (BITS) guided by the posterior model…

Methodology · Statistics 2025-02-28 Run Wang, Somak Dutta, Vivekananda Roy

Linear screening for high-dimensional computer experiments

In this paper we propose a linear variable screening method for computer experiments when the number of input variables is larger than the number of runs. This method uses a linear model to model the nonlinear data, and screens the…

Methodology · Statistics 2020-06-16 Chunya Li, Daijun Chen, Shifeng Xiong

Conditional variable screening for ultra-high dimensional longitudinal data with time interactions

In recent years we have been able to gather large amounts of genomic data at a fast rate, creating situations where the number of variables greatly exceeds the number of observations. In these situations, most models that can handle a…

Methodology · Statistics 2025-02-07 Andrea Bratsberg, Abhik Ghosh, Magne Thoresen

Improving variable selection properties with data integration and transfer learning

We study variable selection (also called support recovery) in high-dimensional sparse linear regression when one has external information on which variables are likely to be associated with the response. Consistent recovery is only possible…

Statistics Theory · Mathematics 2026-02-16 Paul Rognon-Vael, David Rossell, Piotr Zwiernik