Thresholded Lasso for high dimensional variable selection
Abstract
Given noisy samples with dimensions, where , we show that the multi-step thresholding procedure based on the Lasso -- we call it the {\it Thresholded Lasso}, can accurately estimate a sparse vector in a linear model , where is a design matrix normalized to have column -norm , and . We show that under the restricted eigenvalue (RE) condition, it is possible to achieve the loss within a logarithmic factor of the ideal mean square error one would achieve with an while selecting a sufficiently sparse model -- hence achieving ; the oracle would supply perfect information about which coordinates are non-zero and which are above the noise level. We also show for the Gauss-Dantzig selector (Cand\`{e}s-Tao 07), if obeys a uniform uncertainty principle, one will achieve the sparse oracle inequalities as above, while allowing at most irrelevant variables in the model in the worst case, where is the smallest integer such that for , . Our simulation results on the Thresholded Lasso match our theoretical analysis excellently.
Cite
@article{arxiv.2309.15355,
title = {Thresholded Lasso for high dimensional variable selection},
author = {Shuheng Zhou},
journal= {arXiv preprint arXiv:2309.15355},
year = {2025}
}
Comments
arXiv admin note: substantial text overlap with arXiv:1002.1583