Related papers: Optimal Inference After Model Selection

Selective inference after likelihood- or test-based model selection in linear models

Statistical inference after model selection requires an inference framework that takes the selection into account in order to be valid. Following recent work on selective inference, we derive analytical expressions for inference after…

Methodology · Statistics 2017-09-26 David Rügamer , Sonja Greven

Selective Inference via Marginal Screening for High Dimensional Classification

Post-selection inference is a statistical technique for determining salient variables after model or variable selection. Recently, selective inference, a kind of post-selection inference framework, has garnered the attention in the…

Methodology · Statistics 2019-06-28 Yuta Umezu , Ichiro Takeuchi

Integrative Methods for Post-Selection Inference Under Convex Constraints

Inference after model selection has been an active research topic in the past few years, with numerous works offering different approaches to addressing the perils of the reuse of data. In particular, major progress has been made recently…

Methodology · Statistics 2020-06-02 Snigdha Panigrahi , Jonathan Taylor , Asaf Weinstein

Selective Inference for Additive and Linear Mixed Models

This work addresses the problem of conducting valid inference for additive and linear mixed models after model selection. One possible solution to overcome overconfident inference results after model selection is selective inference, which…

Methodology · Statistics 2020-12-22 David Rügamer , Philipp F. M. Baumann , Sonja Greven

Methods of Selective Inference for Linear Mixed Models: a Review and Empirical Comparison

Selective inference aims at providing valid inference after a data-driven selection of models or hypotheses. It is essential to avoid overconfident results and replicability issues. While significant advances have been made in this area for…

Methodology · Statistics 2025-03-14 Matteo D'Alessandro , Magne Thoresen

Evaluating methods for Lasso selective inference in biomedical research by a comparative simulation study

Variable selection for regression models plays a key role in the analysis of biomedical data. However, inference after selection is not covered by classical statistical frequentist theory which assumes a fixed set of covariates in the…

Methodology · Statistics 2021-07-21 Michael Kammer , Daniela Dunkler , Stefan Michiels , Georg Heinze

Inference-Time Selective Debiasing to Enhance Fairness in Text Classification Models

We propose selective debiasing -- an inference-time safety mechanism designed to enhance the overall model quality in terms of prediction performance and fairness, especially in scenarios where retraining the model is impractical. The…

Computation and Language · Computer Science 2025-03-12 Gleb Kuzmin , Neemesh Yadav , Ivan Smirnov , Timothy Baldwin , Artem Shelmanov

Post-selection Inference in Regression Models for Group Testing Data

We develop methodology for valid inference after variable selection in logistic regression when the responses are partially observed, that is, when one observes a set of error-prone testing outcomes instead of the true values of the…

Methodology · Statistics 2025-04-17 Qinyan Shen , Karl Gregory , Xianzheng Huang

Selective Inference in Propensity Score Analysis

Selective inference (post-selection inference) is a methodology that has attracted much attention in recent years in the fields of statistics and machine learning. Naive inference based on data that are also used for model selection tends…

Methodology · Statistics 2021-11-25 Yoshiyuki Ninomiya , Yuta Umezu , Ichiro Takeuchi

Valid post-selection inference

It is common practice in statistical data analysis to perform data-driven variable selection and derive statistical inference from the resulting model. Such inference enjoys none of the guarantees that classical statistical theory provides…

Statistics Theory · Mathematics 2013-06-06 Richard Berk , Lawrence Brown , Andreas Buja , Kai Zhang , Linda Zhao

Selective Randomization Inference for Adaptive Experiments

Adaptive experiments use preliminary analyses of the data to inform further course of action and are commonly used in many disciplines including medical and social sciences. Because the null hypothesis and experimental design are…

Methodology · Statistics 2026-05-26 Tobias Freidling , Qingyuan Zhao , Zijun Gao

Interval Estimation of Coefficients in Penalized Regression Models of Insurance Data

The Tweedie exponential dispersion family is a popular choice among many to model insurance losses that consist of zero-inflated semicontinuous data. In such data, it is often important to obtain credibility (inference) of the most…

Methodology · Statistics 2025-07-17 Alokesh Manna , Zijian Huang , Dipak K. Dey , Yuwen Gu , Robin He

Exact Post-Selection Inference for Sequential Regression Procedures

We propose new inference tools for forward stepwise regression, least angle regression, and the lasso. Assuming a Gaussian model for the observation vector y, we first describe a general scheme to perform valid inference after any selection…

Methodology · Statistics 2015-10-13 Ryan J. Tibshirani , Jonathan Taylor , Richard Lockhart , Robert Tibshirani

Selective inference is easier with p-values

Selective inference is a subfield of statistics that enables valid inference after selection of a data-dependent question. In this paper, we introduce selectively dominant p-values, a class of p-values that allow practitioners to easily…

Methodology · Statistics 2024-11-22 Anav Sood

Selective inference with a randomized response

Inspired by sample splitting and the reusable holdout introduced in the field of differential privacy, we consider selective inference with a randomized response. We discuss two major advantages of using a randomized response for model…

Statistics Theory · Mathematics 2016-12-01 Xiaoying Tian , Jonathan E. Taylor

Selective Inference for Hierarchical Clustering

Classical tests for a difference in means control the type I error rate when the groups are defined a priori. However, when the groups are instead defined via clustering, then applying a classical test yields an extremely inflated type I…

Methodology · Statistics 2022-11-01 Lucy L. Gao , Jacob Bien , Daniela Witten

Conditional predictive inference post model selection

We give a finite-sample analysis of predictive inference procedures after model selection in regression with random design. The analysis is focused on a statistically challenging scenario where the number of potentially important…

Statistics Theory · Mathematics 2009-08-26 Hannes Leeb

Splitting strategies for post-selection inference

We consider the problem of providing valid inference for a selected parameter in a sparse regression setting. It is well known that classical regression tools can be unreliable in this context due to the bias generated in the selection…

Methodology · Statistics 2022-12-07 Daniel G. Rasines , G. Alastair Young

Selective inference using randomized group lasso estimators for general models

Selective inference methods are developed for group lasso estimators for use with a wide class of distributions and loss functions. The method includes the use of exponential family distributions, as well as quasi-likelihood modeling for…

Methodology · Statistics 2024-03-28 Yiling Huang , Sarah Pirenne , Snigdha Panigrahi , Gerda Claeskens

More Powerful Selective Kernel Tests for Feature Selection

Refining one's hypotheses in the light of data is a common scientific practice; however, the dependency on the data introduces selection bias and can lead to specious statistical analysis. An approach for addressing this is via conditioning…

Machine Learning · Computer Science 2020-03-03 Jen Ning Lim , Makoto Yamada , Wittawat Jitkrittum , Yoshikazu Terada , Shigeyuki Matsui , Hidetoshi Shimodaira