Related papers: Loss-guided Stability Selection

Stability Selection

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with…

Methodology · Statistics 2009-05-16 Nicolai Meinshausen , Peter Buehlmann

Stability Selection for Structured Variable Selection

In variable or graph selection problems, finding a right-sized model or controlling the number of false positives is notoriously difficult. Recently, a meta-algorithm called Stability Selection was proposed that can provide reliable…

Machine Learning · Statistics 2017-12-14 George Philipp , Seunghak Lee , Eric P. Xing

Trimming Stability Selection increases variable selection robustness

Contamination can severely distort an estimator unless the estimation procedure is suitably robust. This is a well-known issue and has been addressed in Robust Statistics, however, the relation of contamination and distorted variable…

Statistics Theory · Mathematics 2022-07-15 Tino Werner

Controlling false discoveries in high-dimensional situations: Boosting with stability selection

Modern biotechnologies often result in high-dimensional data sets with much more variables than observations (n $\ll$ p). These data sets pose new challenges to statistical analysis: Variable selection becomes one of the most important…

Machine Learning · Statistics 2014-11-06 Benjamin Hofner , Luigi Boccuto , Markus Göker

A Stable Lasso

The Lasso has been widely used as a method for variable selection, valued for its simplicity and empirical performance. However, Lasso's selection stability deteriorates in the presence of correlated predictors. Several approaches have been…

Methodology · Statistics 2025-11-05 Mahdi Nouraie , Houying Zhu , Samuel Muller

Probing for sparse and fast variable selection with model-based boosting

We present a new variable selection method based on model-based gradient boosting and randomly permuted variables. Model-based boosting is a tool to fit a statistical model while performing variable selection at the same time. A drawback of…

Machine Learning · Statistics 2017-02-16 Janek Thomas , Tobias Hepp , Andreas Mayr , Bernd Bischl

On an improvement of LASSO by scaling

A sparse modeling is a major topic in machine learning and statistics. LASSO (Least Absolute Shrinkage and Selection Operator) is a popular sparse modeling method while it has been known to yield unexpected large bias especially at a sparse…

Machine Learning · Computer Science 2018-08-23 Katsuyuki Hagiwara

Selection consistency of Lasso-based procedures for misspecified high-dimensional binary model and random regressors

We consider selection of random predictors for high-dimensional regression problem with binary response for a general loss function. Important special case is when the binary model is semiparametric and the response function is misspecified…

Statistics Theory · Mathematics 2020-02-19 Mariusz Kubkowski , Jan Mielniczuk

Enhanced variable selection for boosting sparser and less complex models in distributional copula regression

Structured additive distributional copula regression allows to model the joint distribution of multivariate outcomes by relating all distribution parameters to covariates. Estimation via statistical boosting enables accounting for…

Methodology · Statistics 2024-06-07 Annika Strömer , Nadja Klein , Christian Staerk , Florian Faschingbauer , Hannah Klinkhammer , Andreas Mayr

2D Stability Selection: Design Jittering for Doubly Stable Feature Selection

We study feature selection in high-dimensional regression under two distinct sources of instability: sampling variability and measurement error in the design matrix. Stability Selection addresses the former through sub-sampling and…

Methodology · Statistics 2026-05-05 Mahdi Nouraie , Houying Zhu , Samuel Muller

The Loss Rank Criterion for Variable Selection in Linear Regression Analysis

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

Methodology · Statistics 2014-02-26 Minh-Ngoc Tran

On the Selection Stability of Stability Selection and Its Applications

Stability selection is a widely adopted resampling-based framework for high-dimensional variable selection. This paper seeks to broaden the use of an established stability estimator to evaluate the overall stability of the stability…

Methodology · Statistics 2025-06-04 Mahdi Nouraie , Samuel Muller

Cluster Stability Selection

Stability selection (Meinshausen and Buhlmann, 2010) makes any feature selection method more stable by returning only those features that are consistently selected across many subsamples. We prove (in what is, to our knowledge, the first…

Methodology · Statistics 2022-01-04 Gregory Faletto , Jacob Bien

A note on selection stability: combining stability and prediction

Recently, many regularized procedures have been proposed for variable selection in linear regression, but their performance depends on the tuning parameter selection. Here a criterion for the tuning parameter selection is proposed, which…

Methodology · Statistics 2013-01-31 Yixin Fang , Junhui Wang , Wei Sun

Stability

Reproducibility is imperative for any scientific discovery. More often than not, modern scientific findings rely on statistical analysis of high-dimensional data. At a minimum, reproducibility manifests itself in stability of statistical…

Statistics Theory · Mathematics 2013-10-02 Bin Yu

Stabilizing black-box model selection with the inflated argmax

Model selection is the process of choosing from a class of candidate models given data. For instance, methods such as the LASSO and sparse identification of nonlinear dynamics (SINDy) formulate model selection as finding a sparse solution…

Machine Learning · Statistics 2025-12-02 Melissa Adrian , Jake A. Soloff , Rebecca Willett

Sparse Learning for Variable Selection with Structures and Nonlinearities

In this thesis we discuss machine learning methods performing automated variable selection for learning sparse predictive models. There are multiple reasons for promoting sparsity in the predictive models. By relying on a limited set of…

Machine Learning · Computer Science 2019-03-27 Magda Gregorova

Improving Lasso for model selection and prediction

It is known that the Thresholded Lasso (TL), SCAD or MCP correct intrinsic estimation bias of the Lasso. In this paper we propose an alternative method of improving the Lasso for predictive models with general convex loss functions which…

Statistics Theory · Mathematics 2021-01-26 Piotr Pokarowski , Wojciech Rejchel , Agnieszka Soltys , Michal Frej , Jan Mielniczuk

Adaptive post-Dantzig estimation and prediction for non-sparse "large $p$ and small $n$" models

For consistency (even oracle properties) of estimation and model prediction, almost all existing methods of variable/feature selection critically depend on sparsity of models. However, for ``large $p$ and small $n$" models sparsity…

Methodology · Statistics 2010-08-10 Lu Lin , Lixing Zhu , Yujie Gai

Efficient estimation and correction of selection-induced bias with order statistics

Model selection aims to identify a sufficiently well performing model that is possibly simpler than the most complex model among a pool of candidates. However, the decision-making process itself can inadvertently introduce non-negligible…

Methodology · Statistics 2024-08-08 Yann McLatchie , Aki Vehtari