Related papers: Post-Selection Inference via Algorithmic Stability

200 papers

Algorithmic stability is a central concept in statistics and learning theory that measures how sensitive an algorithm's output is to small changes in the training data. Stability plays a crucial role in understanding generalization,…

Statistics Theory · Mathematics 2026-01-21 Abhinav Chakraborty, Yuetian Luo, Rina Foygel Barber

We consider the problem of inference for parameters selected to report only after some algorithm, the canonical example being inference for model parameters after a model selection procedure. The conditional correction for selection…

Methodology · Statistics 2019-01-30 Jelena Markovic, Jonathan Taylor, Jeremy Taylor

Feature selection, as a vital dimension reduction technique, reduces data dimension by identifying an essential subset of input features, which can facilitate interpretable insights into learning and inference processes. Algorithmic…

Machine Learning · Computer Science 2022-01-06 Xinxing Wu, Qiang Cheng

Post-selection inference consists in providing statistical guarantees, based on a data set, that are robust to a prior model selection step on the same data set. In this paper, we address an instance of the post-selection-inference problem,…

Statistics Theory · Mathematics 2025-06-16 Dominique Bontemps, François Bachoc, Pierre Neuvial

Estimation of structure, such as in variable selection, graphical modelling or cluster analysis, is notoriously difficult, especially for high-dimensional data. We introduce stability selection. It is based on subsampling in combination with…

Methodology · Statistics 2009-05-16 Nicolai Meinshausen, Peter Buehlmann

Model averaging techniques based on resampling methods (such as bootstrapping or subsampling) have been utilized across many areas of statistics, often with the explicit goal of promoting stability in the resulting output. We provide a…

Statistics Theory · Mathematics 2024-05-28 Jake A. Soloff, Rina Foygel Barber, Rebecca Willett

It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides…

Statistics Theory · Mathematics 2013-06-06 Richard Berk, Lawrence Brown, Andreas Buja, Kai Zhang, Linda Zhao

The notion of replicable algorithms was introduced in Impagliazzo et al. [STOC '22] to describe randomized algorithms that are stable under the resampling of their inputs. More precisely, a replicable algorithm gives the same output with…

Machine Learning · Computer Science 2023-03-28 Mark Bun, Marco Gaboardi, Max Hopkins, Russell Impagliazzo, Rex Lei, Toniann Pitassi, Satchit Sivakumar, Jessica Sorrell

Algorithmic stability is a concept from learning theory that expresses the degree to which changes to the input data (e.g., removal of a single data point) may affect the outputs of a regression algorithm. Knowing an algorithm's stability…

Machine Learning · Computer Science 2022-12-23 Byol Kim, Rina Foygel Barber

Stability selection is a versatile framework for structure estimation and variable selection in high-dimensional settings, primarily grounded in frequentist principles. In this paper, we propose an enhanced methodology that integrates…

Methodology · Statistics 2026-05-05 Mahdi Nouraie, Connor Smith, Samuel Muller

We say that an algorithm is stable if small changes in the input result in small changes in the output. This kind of algorithm stability is particularly relevant when analyzing and visualizing time-varying data. Stability in general plays…

Data Structures and Algorithms · Computer Science 2025-03-10 Wouter Meulemans, Bettina Speckmann, Kevin Verbeek, Jules Wulms

Post-selection inference is a statistical technique for determining salient variables after model or variable selection. Recently, selective inference, a kind of post-selection inference framework, has garnered attention in the…

Methodology · Statistics 2019-06-28 Yuta Umezu, Ichiro Takeuchi

We introduce a statistical physics inspired supervised machine learning algorithm for classification and regression problems. The method is based on the invariances or stability of predicted results when known data is represented as…

Machine Learning · Statistics 2018-11-19 Patrick Chao, Tahereh Mazaheri, Bo Sun, Nicholas B. Weingartner, Zohar Nussinov

We introduce a new notion of the stability of computations, which holds under post-processing and adaptive composition. We show that the notion is both necessary and sufficient to ensure generalization in the face of adaptivity, for any…

Machine Learning · Computer Science 2020-01-01 Katrina Ligett, Moshe Shenfeld

We introduce a notion of algorithmic stability of learning algorithms (which we term argument stability) that captures stability of the hypothesis output by the learning algorithm in the normed space of functions from which…

Machine Learning · Statistics 2017-08-04 Tongliang Liu, Gábor Lugosi, Gergely Neu, Dacheng Tao

Research on bias in machine learning algorithms has generally been concerned with the impact of bias on predictive accuracy. We believe that there are other factors that should also play a role in the evaluation of bias. One such factor is…

Machine Learning · Computer Science 2007-05-23 Peter D. Turney

We study the stability of posterior predictive inferences to the specification of the likelihood model and perturbations of the data generating process. In modern big data analyses, useful broad structural judgements may be elicited from…

Methodology · Statistics 2024-04-30 Jack Jewson, Jim Q. Smith, Chris Holmes

In modern data analysis, sparse model selection becomes inevitable once the number of predictor variables is very high. It is well known that model selection procedures like the Lasso or Boosting tend to overfit on real data. The…

Machine Learning · Computer Science 2022-02-11 Tino Werner

Stochastic optimization of the Area Under the Precision-Recall Curve (AUPRC) is a crucial problem for machine learning. Although various algorithms have been extensively studied for AUPRC optimization, the generalization is only guaranteed…

Machine Learning · Computer Science 2022-09-28 Peisong Wen, Qianqian Xu, Zhiyong Yang, Yuan He, Qingming Huang

It is common to show the confidence intervals or p-values of selected features, or predictor variables in regression, but they often involve selection bias. The selective inference approach solves this bias by conditioning on the…

Methodology · Statistics 2022-06-02 Yoshikazu Terada, Hidetoshi Shimodaira