Related papers: Valid post-selection inference

200 papers

Construction of valid statistical inference for estimators based on data-driven selection has received a lot of attention in recent times. Berk et al. (2013) is possibly the first work to provide valid inference for Gaussian…

Selective inference is the problem of giving valid answers to statistical questions chosen in a data-driven manner. A standard solution to selective inference is simultaneous inference, which delivers valid answers to the set of all…

Methodology · Statistics 2024-05-03 Tijana Zrnic, William Fithian

Post-selection inference consists in providing statistical guarantees, based on a data set, that are robust to a prior model selection step on the same data set. In this paper, we address an instance of the post-selection-inference problem,…

Statistics Theory · Mathematics 2025-06-16 Dominique Bontemps, François Bachoc, Pierre Neuvial

We develop a general approach to valid inference after model selection. At the core of our framework is a result that characterizes the distribution of a post-selection estimator conditioned on the selection event. We specialize the…

Statistics Theory · Mathematics 2016-05-04 Jason D. Lee, Dennis L. Sun, Yuekai Sun, Jonathan E. Taylor

We suggest general methods to construct asymptotically uniformly valid confidence intervals post-model-selection. The constructions are based on principles recently proposed by Berk et al. (2013). In particular the candidate models used can…

Statistics Theory · Mathematics 2017-11-15 François Bachoc, David Preinerstorfer, Lukas Steinberger

This work addresses the problem of conducting valid inference for additive and linear mixed models after model selection. One possible solution to overcome overconfident inference results after model selection is selective inference, which…

Methodology · Statistics 2020-12-22 David Rügamer, Philipp F. M. Baumann, Sonja Greven

Variable selection for regression models plays a key role in the analysis of biomedical data. However, inference after selection is not covered by classical statistical frequentist theory which assumes a fixed set of covariates in the…

Methodology · Statistics 2021-07-21 Michael Kammer, Daniela Dunkler, Stefan Michiels, Georg Heinze

We develop methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the…

Methodology · Statistics 2025-04-17 Qinyan Shen, Karl Gregory, Xianzheng Huang

Statistical inference of the high-dimensional regression coefficients is challenging because the uncertainty introduced by the model selection procedure is hard to account for. A critical question remains unsettled; that is, is it possible…

Methodology · Statistics 2025-01-06 Xiaorui Zhu, Yichen Qin, Peng Wang

Statistical inference after model selection requires an inference framework that takes the selection into account in order to be valid. Following recent work on selective inference, we derive analytical expressions for inference after…

Methodology · Statistics 2017-09-26 David Rügamer, Sonja Greven

Linear models are foundational tools in statistics and ubiquitous across the applied sciences. However, conventional statistical inference -- such as $t$-tests and $F$-tests -- are only valid at fixed sample sizes, making them unsuitable…

Methodology · Statistics 2025-07-08 Michael Lindon, Dae Woong Ham, Martin Tingley, Iavor Bojinov

We give a finite-sample analysis of predictive inference procedures after model selection in regression with random design. The analysis is focused on a statistically challenging scenario where the number of potentially important…

Statistics Theory · Mathematics 2009-08-26 Hannes Leeb

Investigators often use the data to generate interesting hypotheses and then perform inference for the generated hypotheses. P-values and confidence intervals must account for this explorative data analysis. A fruitful method for doing so…

Methodology · Statistics 2018-02-06 Keli Liu, Jelena Markovic, Robert Tibshirani

A great deal of effort has been devoted to reducing the risk of spurious scientific discoveries, from the use of sophisticated validation techniques, to deep statistical methods for controlling the false discovery rate in multiple…

Machine Learning · Computer Science 2016-03-03 Cynthia Dwork, Vitaly Feldman, Moritz Hardt, Toniann Pitassi, Omer Reingold, Aaron Roth

The Tweedie exponential dispersion family is a popular choice among many to model insurance losses that consist of zero-inflated semicontinuous data. In such data, it is often important to obtain credibility (inference) of the most…

Methodology · Statistics 2025-07-17 Alokesh Manna, Zijian Huang, Dipak K. Dey, Yuwen Gu, Robin He

In machine learning, the selection of a promising model from a potentially large number of competing models and the assessment of its generalization performance are critical tasks that need careful consideration. Typically, model selection…

Machine Learning · Statistics 2023-02-06 Pascal Rink, Werner Brannath

When the target of statistical inference is chosen in a data-driven manner, the guarantees provided by classical theories vanish. We propose a solution to the problem of inference after selection by building on the framework of algorithmic…

Statistics Theory · Mathematics 2022-03-16 Tijana Zrnic, Michael I. Jordan

When interpreting A/B tests, we typically focus only on the statistically significant results and take them by face value. This practice, termed post-selection inference in the statistical literature, may negatively affect both point…

Applications · Statistics 2021-06-01 Alex Deng, Yicheng Li, Jiannan Lu, Vivek Ramamurthy

We present a new method for post-selection inference for L1 (lasso)-penalized likelihood models, including generalized regression models. Our approach generalizes the post-selection framework presented in Lee et al (2014). The method…

Methodology · Statistics 2016-10-17 Jonathan Taylor, Robert Tibshirani

We consider inference post-model-selection in linear regression. In this setting, Berk et al. (2013) recently introduced a class of confidence sets, the so-called PoSI intervals, that cover a certain non-standard quantity of interest with a…

Statistics Theory · Mathematics 2019-02-14 François Bachoc, Hannes Leeb, Benedikt M. Pötscher