Related papers: Honest variable selection in linear and logistic r…

Tuning parameter calibration for $\ell_1$-regularized logistic regression

Feature selection is a standard approach to understanding and modeling high-dimensional classification data, but the corresponding statistical methods hinge on tuning parameters that are difficult to calibrate. In particular, existing…

Methodology · Statistics 2019-03-01 Wei Li , Johannes Lederer

Simultaneous support recovery in high dimensions: Benefits and perils of block $\ell_1/\ell_\infty$-regularization

Consider the use of $\ell_{1}/\ell_{\infty}$-regularized regression for joint estimation of a $\pdim \times \numreg$ matrix of regression coefficients. We analyze the high-dimensional scaling of $\ell_1/\ell_\infty$-regularized quadratic…

Statistics Theory · Mathematics 2009-05-12 S. Negahban , M. J. Wainwright

Selective inference after convex clustering with $\ell_1$ penalization

Classical inference methods notoriously fail when applied to data-driven test hypotheses or inference targets. Instead, dedicated methodologies are required to obtain statistical guarantees for these selective inference problems. Selective…

Methodology · Statistics 2025-11-11 François Bachoc , Cathy Maugis-Rabusseau , Pierre Neuvial

Monte Carlo Simulation for Lasso-Type Problems by Estimator Augmentation

Regularized linear regression under the $\ell_1$ penalty, such as the Lasso, has been shown to be effective in variable selection and sparse modeling. The sampling distribution of an $\ell_1$-penalized estimator $\hat{\beta}$ is hard to…

Methodology · Statistics 2014-12-24 Qing Zhou

Algorithms for Sparse Support Vector Machines

Many problems in classification involve huge numbers of irrelevant features. Model selection reveals the crucial features, reduces the dimensionality of feature space, and improves model interpretation. In the support vector machine…

Methodology · Statistics 2021-10-18 Alfonso Landeros , Kenneth Lange

Selection of variables and dimension reduction in high-dimensional non-parametric regression

We consider a $l_1$-penalization procedure in the non-parametric Gaussian regression model. In many concrete examples, the dimension $d$ of the input variable $X$ is very large (sometimes depending on the number of observations). Estimation…

Statistics Theory · Mathematics 2008-12-16 Karine Bertin , Guillaume Lecué

Penalized Euclidean Distance Regression

A new method is proposed for variable screening, variable selection and prediction in linear regression problems where the number of predictors can be much larger than the number of observations. The method involves minimizing a penalized…

Statistics Theory · Mathematics 2017-09-14 D. Vasiliu , T. Dey , I. L. Dryden

Penalized Composite Quasi-Likelihood for Ultrahigh-Dimensional Variable Selection

In high-dimensional model selection problems, penalized simple least-square approaches have been extensively used. This paper addresses the question of both robustness and efficiency of penalized model selection methods, and proposes a…

Methodology · Statistics 2011-07-06 Jelena Bradic , Jianqing Fan , Weiwei Wang

Variable selection using pseudo-variables

Penalized regression has become a standard tool for model building across a wide range of application domains. Common practice is to tune the amount of penalization to tradeoff bias and variance or to optimize some other measure of…

Methodology · Statistics 2018-04-05 Wenhao Hu , Eric Laber , Leonard Stefanski

Selection of variables and decision boundaries for functional data via bi-level selection

Sparsity-inducing penalties are useful tools for variable selection and they are also effective for regression settings where the data are functions. We consider the problem of selecting not only variables but also decision boundaries in…

Methodology · Statistics 2020-06-01 Hidetoshi Matsui

Variable selection for model-based clustering using the integrated complete-data likelihood

Variable selection in cluster analysis is important yet challenging. It can be achieved by regularization methods, which realize a trade-off between the clustering accuracy and the number of selected variables by using a lasso-type penalty.…

Methodology · Statistics 2016-12-23 Marbac Matthieu , Sedki Mohammed

Consistent selection of tuning parameters via variable selection stability

Penalized regression models are popularly used in high-dimensional data analysis to conduct variable selection and model fitting simultaneously. Whereas success has been widely reported in literature, their performances largely depend on…

Machine Learning · Statistics 2013-12-16 Wei Sun , Junhui Wang , Yixin Fang

Robust penalized empirical likelihood in high dimensional longitudinal data analysis

As an effective nonparametric method, empirical likelihood (EL) is appealing in combining estimating equations flexibly and adaptively for incorporating data information. To select important variables and estimating equations in the sparse…

Methodology · Statistics 2021-07-02 Jiaqi Li , Liya Fu

Tuning parameter selection in high dimensional penalized likelihood

Determining how to appropriately select the tuning parameter is essential in penalized likelihood methods for high-dimensional data analysis. We examine this problem in the setting of penalized likelihood methods for generalized linear…

Methodology · Statistics 2016-05-12 Yingying Fan , Cheng Yong Tang

Improving variable selection properties with data integration and transfer learning

We study variable selection (also called support recovery) in high-dimensional sparse linear regression when one has external information on which variables are likely to be associated with the response. Consistent recovery is only possible…

Statistics Theory · Mathematics 2026-02-16 Paul Rognon-Vael , David Rossell , Piotr Zwiernik

M-estimation with the Trimmed l1 Penalty

We study high-dimensional estimators with the trimmed $\ell_1$ penalty, which leaves the $h$ largest parameter entries penalty-free. While optimization techniques for this nonconvex penalty have been studied, the statistical properties have…

Statistics Theory · Mathematics 2019-05-14 Jihun Yun , Peng Zheng , Eunho Yang , Aurelie Lozano , Aleksandr Aravkin

Estimation And Selection Via Absolute Penalized Convex Minimization And Its Multistage Adaptive Applications

The $\ell_1$-penalized method, or the Lasso, has emerged as an important tool for the analysis of large data sets. Many important results have been obtained for the Lasso in linear regression which have led to a deeper understanding of…

Machine Learning · Statistics 2011-12-30 Jian Huang , Cun-Hui Zhang

Estimation and variable selection in high dimension in nonlinear mixed-effects models

We consider nonlinear mixed effects models including high-dimensional covariates to model individual parameters variability. The objective is to identify relevant covariates among a large set under sparsity assumption and to estimate model…

Statistics Theory · Mathematics 2025-08-06 Antoine Caillebotte , Estelle Kuhn , Sarah Lemler

A review and recommendations on variable selection methods in regression models for binary data

The selection of essential variables in logistic regression is vital because of its extensive use in medical studies, finance, economics and related fields. In this paper, we explore four main typologies (test-based, penalty-based,…

Methodology · Statistics 2022-05-17 Souvik Bag , Kapil Gupta , Soudeep Deb

The Smooth-Lasso and other $\ell_1+\ell_2$-penalized methods

We consider a linear regression problem in a high dimensional setting where the number of covariates $p$ can be much larger than the sample size $n$. In such a situation, one often assumes sparsity of the regression vector, \textit i.e.,…

Statistics Theory · Mathematics 2011-10-12 Mohamed Hebiri , Sara A. Van De Geer