English
Related papers

Related papers: Loss-guided Stability Selection

200 papers

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with…

Methodology · Statistics 2009-05-16 Nicolai Meinshausen , Peter Buehlmann

In variable or graph selection problems, finding a right-sized model or controlling the number of false positives is notoriously difficult. Recently, a meta-algorithm called Stability Selection was proposed that can provide reliable…

Machine Learning · Statistics 2017-12-14 George Philipp , Seunghak Lee , Eric P. Xing

Contamination can severely distort an estimator unless the estimation procedure is suitably robust. This is a well-known issue and has been addressed in Robust Statistics, however, the relation of contamination and distorted variable…

Statistics Theory · Mathematics 2022-07-15 Tino Werner

Modern biotechnologies often result in high-dimensional data sets with much more variables than observations (n $\ll$ p). These data sets pose new challenges to statistical analysis: Variable selection becomes one of the most important…

Machine Learning · Statistics 2014-11-06 Benjamin Hofner , Luigi Boccuto , Markus Göker

The Lasso has been widely used as a method for variable selection, valued for its simplicity and empirical performance. However, Lasso's selection stability deteriorates in the presence of correlated predictors. Several approaches have been…

Methodology · Statistics 2025-11-05 Mahdi Nouraie , Houying Zhu , Samuel Muller

We present a new variable selection method based on model-based gradient boosting and randomly permuted variables. Model-based boosting is a tool to fit a statistical model while performing variable selection at the same time. A drawback of…

Machine Learning · Statistics 2017-02-16 Janek Thomas , Tobias Hepp , Andreas Mayr , Bernd Bischl

A sparse modeling is a major topic in machine learning and statistics. LASSO (Least Absolute Shrinkage and Selection Operator) is a popular sparse modeling method while it has been known to yield unexpected large bias especially at a sparse…

Machine Learning · Computer Science 2018-08-23 Katsuyuki Hagiwara

We consider selection of random predictors for high-dimensional regression problem with binary response for a general loss function. Important special case is when the binary model is semiparametric and the response function is misspecified…

Statistics Theory · Mathematics 2020-02-19 Mariusz Kubkowski , Jan Mielniczuk

Structured additive distributional copula regression allows to model the joint distribution of multivariate outcomes by relating all distribution parameters to covariates. Estimation via statistical boosting enables accounting for…

We study feature selection in high-dimensional regression under two distinct sources of instability: sampling variability and measurement error in the design matrix. Stability Selection addresses the former through sub-sampling and…

Methodology · Statistics 2026-05-05 Mahdi Nouraie , Houying Zhu , Samuel Muller

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

Methodology · Statistics 2014-02-26 Minh-Ngoc Tran

Stability selection is a widely adopted resampling-based framework for high-dimensional variable selection. This paper seeks to broaden the use of an established stability estimator to evaluate the overall stability of the stability…

Methodology · Statistics 2025-06-04 Mahdi Nouraie , Samuel Muller

Stability selection (Meinshausen and Buhlmann, 2010) makes any feature selection method more stable by returning only those features that are consistently selected across many subsamples. We prove (in what is, to our knowledge, the first…

Methodology · Statistics 2022-01-04 Gregory Faletto , Jacob Bien

Recently, many regularized procedures have been proposed for variable selection in linear regression, but their performance depends on the tuning parameter selection. Here a criterion for the tuning parameter selection is proposed, which…

Methodology · Statistics 2013-01-31 Yixin Fang , Junhui Wang , Wei Sun

Reproducibility is imperative for any scientific discovery. More often than not, modern scientific findings rely on statistical analysis of high-dimensional data. At a minimum, reproducibility manifests itself in stability of statistical…

Statistics Theory · Mathematics 2013-10-02 Bin Yu

Model selection is the process of choosing from a class of candidate models given data. For instance, methods such as the LASSO and sparse identification of nonlinear dynamics (SINDy) formulate model selection as finding a sparse solution…

Machine Learning · Statistics 2025-12-02 Melissa Adrian , Jake A. Soloff , Rebecca Willett

In this thesis we discuss machine learning methods performing automated variable selection for learning sparse predictive models. There are multiple reasons for promoting sparsity in the predictive models. By relying on a limited set of…

Machine Learning · Computer Science 2019-03-27 Magda Gregorova

It is known that the Thresholded Lasso (TL), SCAD or MCP correct intrinsic estimation bias of the Lasso. In this paper we propose an alternative method of improving the Lasso for predictive models with general convex loss functions which…

Statistics Theory · Mathematics 2021-01-26 Piotr Pokarowski , Wojciech Rejchel , Agnieszka Soltys , Michal Frej , Jan Mielniczuk

For consistency (even oracle properties) of estimation and model prediction, almost all existing methods of variable/feature selection critically depend on sparsity of models. However, for ``large $p$ and small $n$" models sparsity…

Methodology · Statistics 2010-08-10 Lu Lin , Lixing Zhu , Yujie Gai

Model selection aims to identify a sufficiently well performing model that is possibly simpler than the most complex model among a pool of candidates. However, the decision-making process itself can inadvertently introduce non-negligible…

Methodology · Statistics 2024-08-08 Yann McLatchie , Aki Vehtari
‹ Prev 1 2 3 10 Next ›