English
Related papers

Related papers: Variable selection with error control: Another loo…

200 papers

Stability selection (Meinshausen and Buhlmann, 2010) makes any feature selection method more stable by returning only those features that are consistently selected across many subsamples. We prove (in what is, to our knowledge, the first…

Methodology · Statistics 2022-01-04 Gregory Faletto , Jacob Bien

We introduce extensions of stability selection, a method to stabilise variable selection methods introduced by Meinshausen and B\"uhlmann (J R Stat Soc 72:417-473, 2010). We propose to apply a base selection method repeatedly to random…

Methodology · Statistics 2016-10-26 Andre Beinrucker , Ürün Dogan , Gilles Blanchard

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with…

Methodology · Statistics 2009-05-16 Nicolai Meinshausen , Peter Buehlmann

In variable or graph selection problems, finding a right-sized model or controlling the number of false positives is notoriously difficult. Recently, a meta-algorithm called Stability Selection was proposed that can provide reliable…

Machine Learning · Statistics 2017-12-14 George Philipp , Seunghak Lee , Eric P. Xing

Recently, many regularized procedures have been proposed for variable selection in linear regression, but their performance depends on the tuning parameter selection. Here a criterion for the tuning parameter selection is proposed, which…

Methodology · Statistics 2013-01-31 Yixin Fang , Junhui Wang , Wei Sun

Stability selection is a widely adopted resampling-based framework for high-dimensional variable selection. This paper seeks to broaden the use of an established stability estimator to evaluate the overall stability of the stability…

Methodology · Statistics 2025-06-04 Mahdi Nouraie , Samuel Muller

In modern data analysis, sparse model selection becomes inevitable once the number of predictors variables is very high. It is well-known that model selection procedures like the Lasso or Boosting tend to overfit on real data. The…

Machine Learning · Computer Science 2022-02-11 Tino Werner

Stability selection is a versatile framework for structure estimation and variable selection in high-dimensional setting, primarily grounded in frequentist principles. In this paper, we propose an enhanced methodology that integrates…

Methodology · Statistics 2026-05-05 Mahdi Nouraie , Connor Smith , Samuel Muller

Stability selection is a popular method for improving feature selection algorithms. One of its key attributes is that it provides theoretical upper bounds on the expected number of false positives, E(FP), enabling false positive control in…

Methodology · Statistics 2025-07-18 Omar Melikechi , Jeffrey W. Miller

Penalized regression models are popularly used in high-dimensional data analysis to conduct variable selection and model fitting simultaneously. Whereas success has been widely reported in literature, their performances largely depend on…

Machine Learning · Statistics 2013-12-16 Wei Sun , Junhui Wang , Yixin Fang

The Lasso has been widely used as a method for variable selection, valued for its simplicity and empirical performance. However, Lasso's selection stability deteriorates in the presence of correlated predictors. Several approaches have been…

Methodology · Statistics 2025-11-05 Mahdi Nouraie , Houying Zhu , Samuel Muller

Variable selection in cluster analysis is important yet challenging. It can be achieved by regularization methods, which realize a trade-off between the clustering accuracy and the number of selected variables by using a lasso-type penalty.…

Methodology · Statistics 2016-12-23 Marbac Matthieu , Sedki Mohammed

Modern biotechnologies often result in high-dimensional data sets with much more variables than observations (n $\ll$ p). These data sets pose new challenges to statistical analysis: Variable selection becomes one of the most important…

Machine Learning · Statistics 2014-11-06 Benjamin Hofner , Luigi Boccuto , Markus Göker

The Lasso is a prominent algorithm for variable selection. However, its instability in the presence of correlated variables in the high-dimensional setting is well-documented. Although previous research has attempted to address this issue…

Methodology · Statistics 2025-05-28 Mahdi Nouraie , Connor Smith , Samuel Muller

The instability in the selection of models is a major concern with data sets containing a large number of covariates. We focus on stability selection which is used as a technique to improve variable selection performance for a range of…

Methodology · Statistics 2016-04-26 Md Hasinur Rahaman Khan , Anamika Bhadra , Tamanna Howlader

Stability is an important aspect of a classification procedure because unstable predictions can potentially reduce users' trust in a classification system and also harm the reproducibility of scientific conclusions. The major goal of our…

Machine Learning · Statistics 2017-01-23 Will Wei Sun , Guang Cheng , Yufeng Liu

A number of attempts have been made to improve accuracy and/or scalability of the PC (Peter and Clark) algorithm, some well known (Buhlmann, et al., 2010; Kalisch and Buhlmann, 2007; 2008; Zhang, 2012, to give some examples). We add here…

Artificial Intelligence · Computer Science 2016-10-06 Joseph Ramsey

While variable selection is essential to optimize the learning complexity by prioritizing features, automating the selection process is preferred since it requires laborious efforts with intensive analysis otherwise. However, it is not an…

Machine Learning · Computer Science 2019-10-29 Makiya Nakashima , Alex Sim , Youngsoo Kim , Jonghyun Kim , Jinoh Kim

Contamination can severely distort an estimator unless the estimation procedure is suitably robust. This is a well-known issue and has been addressed in Robust Statistics, however, the relation of contamination and distorted variable…

Statistics Theory · Mathematics 2022-07-15 Tino Werner

Most scientific publications follow the familiar recipe of (i) obtain data, (ii) fit a model, and (iii) comment on the scientific relevance of the effects of particular covariates in that model. This approach, however, ignores the fact that…

Methodology · Statistics 2021-03-08 Nicholas Kissel , Lucas Mentch
‹ Prev 1 2 3 10 Next ›