Related papers: Near-ideal model selection by $\ell_1$ minimizatio…

Thresholded Lasso for high dimensional variable selection

Given $n$ noisy samples with $p$ dimensions, where $n \ll p$, we show that the multi-step thresholding procedure based on the Lasso -- we call it the {\it Thresholded Lasso}, can accurately estimate a sparse vector $\beta \in {\mathbb R}^p$…

Statistics Theory · Mathematics 2025-10-28 Shuheng Zhou

Thresholded Lasso for high dimensional variable selection and statistical estimation

Given $n$ noisy samples with $p$ dimensions, where $n \ll p$, we show that the multi-step thresholding procedure based on the Lasso -- we call it the {\it Thresholded Lasso}, can accurately estimate a sparse vector $\beta \in \R^p$ in a…

Statistics Theory · Mathematics 2010-02-11 Shuheng Zhou

Lasso-type recovery of sparse representations for high-dimensional data

The Lasso is an attractive technique for regularization and variable selection for high-dimensional data, where the number of predictor variables $p_n$ is potentially much larger than the number of samples $n$. However, it was recently…

Statistics Theory · Mathematics 2009-03-02 Nicolai Meinshausen , Bin Yu

How well can we estimate a sparse vector?

The estimation of a sparse vector in the linear model is a fundamental problem in signal processing, statistics, and compressive sensing. This paper establishes a lower bound on the mean-squared error, which holds regardless of the…

Information Theory · Computer Science 2013-03-04 Emmanuel J. Candès , Mark A. Davenport

Nearly Optimal Sample Size in Hypothesis Testing for High-Dimensional Regression

We consider the problem of fitting the parameters of a high-dimensional linear regression model. In the regime where the number of parameters $p$ is comparable to or exceeds the sample size $n$, a successful approach uses an…

Statistics Theory · Mathematics 2013-11-04 Adel Javanmard , Andrea Montanari

The Dantzig selector: Statistical estimation when $p$ is much larger than $n$

In many important statistical applications, the number of variables or parameters $p$ is much larger than the number of observations $n$. Suppose then that we have observations $y=X\beta+z$, where $\beta\in\mathbf{R}^p$ is a parameter…

Statistics Theory · Mathematics 2009-09-29 Emmanuel Candes , Terence Tao

Sparse recovery with unknown variance: a LASSO-type approach

We address the issue of estimating the regression vector $\beta$ in the generic $s$-sparse linear model $y = X\beta+z$, with $\beta\in\R^{p}$, $y\in\R^{n}$, $z\sim\mathcal N(0,\sg^2 I)$ and $p> n$ when the variance $\sg^{2}$ is unknown. We…

Statistics Theory · Mathematics 2012-11-06 Stéphane Chrétien , Sébastien Darses

Model-Consistent Sparse Estimation through the Bootstrap

We consider the least-square linear regression problem with regularization by the $\ell^1$-norm, a problem usually referred to as the Lasso. In this paper, we first present a detailed asymptotic analysis of model consistency of the Lasso in…

Machine Learning · Computer Science 2009-01-22 Francis Bach

A new perspective on least squares under convex constraint

Consider the problem of estimating the mean of a Gaussian random vector when the mean vector is assumed to be in a given convex set. The most natural solution is to take the Euclidean projection of the data vector on to this convex set; in…

Statistics Theory · Mathematics 2014-11-21 Sourav Chatterjee

The Smooth-Lasso and other $\ell_1+\ell_2$-penalized methods

We consider a linear regression problem in a high dimensional setting where the number of covariates $p$ can be much larger than the sample size $n$. In such a situation, one often assumes sparsity of the regression vector, \textit i.e.,…

Statistics Theory · Mathematics 2011-10-12 Mohamed Hebiri , Sara A. Van De Geer

Oracle Inequalities and Optimal Inference under Group Sparsity

We consider the problem of estimating a sparse linear regression vector $\beta^*$ under a gaussian noise model, for the purpose of both prediction and model selection. We assume that prior knowledge is available on the sparsity pattern,…

Statistics Theory · Mathematics 2012-08-21 Karim Lounici , Massimiliano Pontil , Alexandre B. Tsybakov , Sara van de Geer

Model selection with lasso-zero: adding straw to the haystack to better find needles

The high-dimensional linear model $y = X \beta^0 + \epsilon$ is considered and the focus is put on the problem of recovering the support $S^0$ of the sparse vector $\beta^0.$ We introduce Lasso-Zero, a new $\ell_1$-based estimator whose…

Methodology · Statistics 2019-04-15 Pascaline Descloux , Sylvain Sardy

A Study of Error Variance Estimation in Lasso Regression

Variance estimation in the linear model when $p > n$ is a difficult problem. Standard least squares estimation techniques do not apply. Several variance estimators have been proposed in the literature, all with accompanying asymptotic…

Methodology · Statistics 2014-01-30 Stephen Reid , Robert Tibshirani , Jerome Friedman

Lasso with Latents: Efficient Estimation, Covariate Rescaling, and Computational-Statistical Gaps

It is well-known that the statistical performance of Lasso can suffer significantly when the covariates of interest have strong correlations. In particular, the prediction error of Lasso becomes much worse than computationally inefficient…

Machine Learning · Statistics 2024-02-26 Jonathan Kelner , Frederic Koehler , Raghu Meka , Dhruv Rohatgi

On the number of variables to use in principal component regression

We study least squares linear regression over $N$ uncorrelated Gaussian features that are selected in order of decreasing variance. When the number of selected features $p$ is at most the sample size $n$, the estimator under consideration…

Statistics Theory · Mathematics 2019-10-04 Ji Xu , Daniel Hsu

High-Dimensional Regression with Binary Coefficients. Estimating Squared Error and a Phase Transition

We consider a sparse linear regression model Y=X\beta^{*}+W where X has a Gaussian entries, W is the noise vector with mean zero Gaussian entries, and \beta^{*} is a binary vector with support size (sparsity) k. Using a novel conditional…

Machine Learning · Statistics 2019-09-26 David Gamarnik , Ilias Zadik

Lasso and Partially-Rotated Designs

We consider the sparse linear regression model $\mathbf{y} = X \beta +\mathbf{w}$, where $X \in \mathbb{R}^{n \times d}$ is the design, $\beta \in \mathbb{R}^{d}$ is a $k$-sparse secret, and $\mathbf{w} \sim N(0, I_n)$ is the noise. Given…

Statistics Theory · Mathematics 2025-05-19 Rares-Darius Buhai

Sparse High-Dimensional Linear Regression. Algorithmic Barriers and a Local Search Algorithm

We consider a sparse high dimensional regression model where the goal is to recover a $k$-sparse unknown vector $\beta^*$ from $n$ noisy linear observations of the form $Y=X\beta^*+W \in \mathbb{R}^n$ where $X \in \mathbb{R}^{n \times p}$…

Statistics Theory · Mathematics 2019-09-24 David Gamarnik , Ilias Zadik

The out-of-sample prediction error of the square-root-LASSO and related estimators

We study the classical problem of predicting an outcome variable, $Y$, using a linear combination of a $d$-dimensional covariate vector, $\mathbf{X}$. We are interested in linear predictors whose coefficients solve: % \begin{align*}…

Statistics Theory · Mathematics 2024-04-10 José Luis Montiel Olea , Cynthia Rush , Amilcar Velez , Johannes Wiesel

On the design-dependent suboptimality of the Lasso

This paper investigates the effect of the design matrix on the ability (or inability) to estimate a sparse parameter in linear regression. More specifically, we characterize the optimal rate of estimation when the smallest singular value of…

Statistics Theory · Mathematics 2024-02-02 Reese Pathak , Cong Ma