English
Related papers

Related papers: Some Two-Step Procedures for Variable Selection in…

200 papers

We consider the problem of model selection and estimation in sparse high dimensional linear regression models with strongly correlated variables. First, we study the theoretical properties of the dual Lasso solution, and we show that joint…

Applications · Statistics 2017-03-21 Niharika Gauraha

The least absolute shrinkage and selection operator (Lasso) is a popular method for high-dimensional statistics. However, it is known that the Lasso often has estimation bias and prediction error. To address such disadvantages, many…

Methodology · Statistics 2026-04-29 Guo Liu

High-dimensional prediction typically comprises two steps: variable selection and subsequent least-squares refitting on the selected variables. However, the standard variable selection procedures, such as the lasso, hinge on tuning…

Methodology · Statistics 2017-06-07 Didier Chételat , Johannes Lederer , Joseph Salmon

This paper explores the validity of the two-stage estimation procedure for sparse linear models in high-dimensional settings with possibly many endogenous regressors. In particular, the number of endogenous regressors in the main equation…

Statistics Theory · Mathematics 2013-09-18 Ying Zhu

Inference for high-dimensional logistic regression models using penalized methods has been a challenging research problem. As an illustration, a major difficulty is the significant bias of the Lasso estimator, which limits its direct…

Methodology · Statistics 2024-10-29 Yuming Zhang , Stéphane Guerrier , Runze Li

We show that the two-stage adaptive Lasso procedure (Zou, 2006) is consistent for high-dimensional model selection in linear and Gaussian graphical models. Our conditions for consistency cover more general situations than those accomplished…

Statistics Theory · Mathematics 2009-03-17 Shuheng Zhou , Sara van de Geer , Peter Bühlmann

In regression problems where covariates can be naturally grouped, the group Lasso is an attractive method for variable selection since it respects the grouping structure in the data. We study the selection and estimation properties of the…

Statistics Theory · Mathematics 2010-11-30 Fengrong Wei , Jian Huang

The Lasso is an attractive technique for regularization and variable selection for high-dimensional data, where the number of predictor variables $p_n$ is potentially much larger than the number of samples $n$. However, it was recently…

Statistics Theory · Mathematics 2009-03-02 Nicolai Meinshausen , Bin Yu

We study regression discontinuity designs in which many predetermined covariates, possibly much more than the number of observations, can be used to increase the precision of treatment effect estimates. We consider a two-step estimator…

Econometrics · Economics 2022-05-06 Alexander Kreiß , Christoph Rothe

This paper investigates the two-step estimation of a high dimensional additive regression model, in which the number of nonparametric additive components is potentially larger than the sample size but the number of significant additive…

Statistics Theory · Mathematics 2013-01-30 Kengo Kato

We propose a two step algorithm based on $\ell_1/\ell_0$ regularization for the detection and estimation of parameters of a high dimensional change point regression model and provide the corresponding rates of convergence for the change…

Methodology · Statistics 2019-01-18 Abhishek Kaul , Venkata K. Jandhyala , Stergios B. Fotopoulos

This paper concerns statistical inference for the components of a high-dimensional regression parameter despite possible endogeneity of each regressor. Given a first-stage linear model for the endogenous regressors and a second-stage linear…

Statistics Theory · Mathematics 2019-11-25 David Gold , Johannes Lederer , Jing Tao

Among the most popular variable selection procedures in high-dimensional regression, Lasso provides a solution path to rank the variables and determines a cut-off position on the path to select variables and estimate coefficients. In this…

Methodology · Statistics 2018-06-19 X. Jessie Jeng , Huimin Peng , Wenbin Lu

We consider regression problems where the number of predictors greatly exceeds the number of observations. We propose a method for variable selection that first estimates the regression function, yielding a "pre-conditioned" response…

Statistics Theory · Mathematics 2013-04-16 Debashis Paul , Eric Bair , Trevor Hastie , Robert Tibshirani

A great deal of interest has recently focused on conducting inference on the parameters in a high-dimensional linear model. In this paper, we consider a simple and very na\"{i}ve two-step procedure for this task, in which we (i) fit a lasso…

Methodology · Statistics 2020-07-02 Sen Zhao , Daniela Witten , Ali Shojaie

This paper proposes a multi-stage projection-based Lasso procedure for the semiparametric sample selection model in high-dimensional settings under a weak nonparametric restriction on the selection correction. In particular, the number of…

Statistics Theory · Mathematics 2014-11-13 Ying Zhu

Heavy-tailed high-dimensional data are commonly encountered in various scientific fields and pose great challenges to modern statistical analysis. A natural procedure to address this problem is to use penalized quantile regression with…

Statistics Theory · Mathematics 2015-03-20 Jianqing Fan , Yingying Fan , Emre Barut

We consider a high-dimensional regression model with a possible change-point due to a covariate threshold and develop the Lasso estimator of regression coefficients as well as the threshold parameter. Our Lasso estimator not only selects…

Statistics Theory · Mathematics 2019-08-23 Sokbae Lee , Myung Hwan Seo , Youngki Shin

We consider selection of random predictors for high-dimensional regression problem with binary response for a general loss function. Important special case is when the binary model is semiparametric and the response function is misspecified…

Statistics Theory · Mathematics 2020-02-19 Mariusz Kubkowski , Jan Mielniczuk

In the high-dimensional regression model a response variable is linearly related to $p$ covariates, but the sample size $n$ is smaller than $p$. We assume that only a small subset of covariates is `active' (i.e., the corresponding…

Statistics Theory · Mathematics 2013-05-03 Adel Javanmard , Andrea Montanari
‹ Prev 1 2 3 10 Next ›