Related papers: Valid post-selection inference

Valid Post-selection Inference in Assumption-lean Linear Regression

Construction of valid statistical inference for estimators based on data-driven selection has received a lot of attention in the recent times. Berk et al. (2013) is possibly the first work to provide valid inference for Gaussian…

Methodology · Statistics 2018-06-12 Arun Kumar Kuchibhotla , Lawrence D. Brown , Andreas Buja , Edward I. George , Linda Zhao

Locally Simultaneous Inference

Selective inference is the problem of giving valid answers to statistical questions chosen in a data-driven manner. A standard solution to selective inference is simultaneous inference, which delivers valid answers to the set of all…

Methodology · Statistics 2024-05-03 Tijana Zrnic , William Fithian

Inference post region selection

Post-selection inference consists in providing statistical guarantees, based on a data set, that are robust to a prior model selection step on the same data set. In this paper, we address an instance of the post-selection-inference problem,…

Statistics Theory · Mathematics 2025-06-16 Dominique Bontemps , François Bachoc , Pierre Neuvial

Exact post-selection inference, with application to the lasso

We develop a general approach to valid inference after model selection. At the core of our framework is a result that characterizes the distribution of a post-selection estimator conditioned on the selection event. We specialize the…

Statistics Theory · Mathematics 2016-05-04 Jason D. Lee , Dennis L. Sun , Yuekai Sun , Jonathan E. Taylor

Uniformly valid confidence intervals post-model-selection

We suggest general methods to construct asymptotically uniformly valid confidence intervals post-model-selection. The constructions are based on principles recently proposed by Berk et al. (2013). In particular the candidate models used can…

Statistics Theory · Mathematics 2017-11-15 François Bachoc , David Preinerstorfer , Lukas Steinberger

Selective Inference for Additive and Linear Mixed Models

This work addresses the problem of conducting valid inference for additive and linear mixed models after model selection. One possible solution to overcome overconfident inference results after model selection is selective inference, which…

Methodology · Statistics 2020-12-22 David Rügamer , Philipp F. M. Baumann , Sonja Greven

Evaluating methods for Lasso selective inference in biomedical research by a comparative simulation study

Variable selection for regression models plays a key role in the analysis of biomedical data. However, inference after selection is not covered by classical statistical frequentist theory which assumes a fixed set of covariates in the…

Methodology · Statistics 2021-07-21 Michael Kammer , Daniela Dunkler , Stefan Michiels , Georg Heinze

Post-selection Inference in Regression Models for Group Testing Data

We develop methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the…

Methodology · Statistics 2025-04-17 Qinyan Shen , Karl Gregory , Xianzheng Huang

Sparsified Simultaneous Confidence Intervals for High-Dimensional Linear Models

Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. A critical question remains unsettled; that is, is it possible…

Methodology · Statistics 2025-01-06 Xiaorui Zhu , Yichen Qin , Peng Wang

Selective inference after likelihood- or test-based model selection in linear models

Statistical inference after model selection requires an inference framework that takes the selection into account in order to be valid. Following recent work on selective inference, we derive analytical expressions for inference after…

Methodology · Statistics 2017-09-26 David Rügamer , Sonja Greven

Anytime-Valid Linear Models and Regression Adjusted Causal Inference in Randomized Experiments

Linear models are foundational tools in statistics and ubiquitous across the applied sciences. However, conventional statistical inference -- such as $t$-tests and $F$-tests -- are only valid at fixed sample sizes, making them unsuitable…

Methodology · Statistics 2025-07-08 Michael Lindon , Dae Woong Ham , Martin Tingley , Iavor Bojinov

Conditional predictive inference post model selection

We give a finite-sample analysis of predictive inference procedures after model selection in regression with random design. The analysis is focused on a statistically challenging scenario where the number of potentially important…

Statistics Theory · Mathematics 2009-08-26 Hannes Leeb

More powerful post-selection inference, with application to the Lasso

Investigators often use the data to generate interesting hypotheses and then perform inference for the generated hypotheses. P-values and confidence intervals must account for this explorative data analysis. A fruitful method for doing so…

Methodology · Statistics 2018-02-06 Keli Liu , Jelena Markovic , Robert Tibshirani

Preserving Statistical Validity in Adaptive Data Analysis

A great deal of effort has been devoted to reducing the risk of spurious scientific discoveries, from the use of sophisticated validation techniques, to deep statistical methods for controlling the false discovery rate in multiple…

Machine Learning · Computer Science 2016-03-03 Cynthia Dwork , Vitaly Feldman , Moritz Hardt , Toniann Pitassi , Omer Reingold , Aaron Roth

Interval Estimation of Coefficients in Penalized Regression Models of Insurance Data

The Tweedie exponential dispersion family is a popular choice among many to model insurance losses that consist of zero-inflated semicontinuous data. In such data, it is often important to obtain credibility (inference) of the most…

Methodology · Statistics 2025-07-17 Alokesh Manna , Zijian Huang , Dipak K. Dey , Yuwen Gu , Robin He

Post-Selection Confidence Bounds for Prediction Performance

In machine learning, the selection of a promising model from a potentially large number of competing models and the assessment of its generalization performance are critical tasks that need careful consideration. Typically, model selection…

Machine Learning · Statistics 2023-02-06 Pascal Rink , Werner Brannath

Post-Selection Inference via Algorithmic Stability

When the target of statistical inference is chosen in a data-driven manner, the guarantees provided by classical theories vanish. We propose a solution to the problem of inference after selection by building on the framework of algorithmic…

Statistics Theory · Mathematics 2022-03-16 Tijana Zrnic , Michael I. Jordan

On Post-Selection Inference in A/B Tests

When interpreting A/B tests, we typically focus only on the statistically significant results and take them by face value. This practice, termed post-selection inference in the statistical literature, may negatively affect both point…

Applications · Statistics 2021-06-01 Alex Deng , Yicheng Li , Jiannan Lu , Vivek Ramamurthy

Post-selection inference for L1-penalized likelihood models

We present a new method for post-selection inference for L1 (lasso)-penalized likelihood models, including generalized regression models. Our approach generalizes the post-selection framework presented in Lee et al (2014). The method…

Methodology · Statistics 2016-10-17 Jonathan Taylor , Robert Tibshirani

Valid confidence intervals for post-model-selection predictors

We consider inference post-model-selection in linear regression. In this setting, Berk et al.(2013) recently introduced a class of confidence sets, the so-called PoSI intervals, that cover a certain non-standard quantity of interest with a…

Statistics Theory · Mathematics 2019-02-14 François Bachoc , Hannes Leeb , Benedikt M. Pötscher