Related papers: Selective Correlations - the conditional estimator…

Selective Confidence Intervals for Martingale Regression Model

In this paper we consider the problem of constructing confidence intervals for coefficients of martingale regression models (in particular, time series models) after variable selection. Although constructing confidence intervals are common…

Statistics Theory · Mathematics 2020-05-19 Ka Wai Tsang , Wei Dai

Efficient estimation and correction of selection-induced bias with order statistics

Model selection aims to identify a sufficiently well performing model that is possibly simpler than the most complex model among a pool of candidates. However, the decision-making process itself can inadvertently introduce non-negligible…

Methodology · Statistics 2024-08-08 Yann McLatchie , Aki Vehtari

The Aleatoric Uncertainty Estimation Using a Separate Formulation with Virtual Residuals

We propose a new optimization framework for aleatoric uncertainty estimation in regression problems. Existing methods can quantify the error in the target estimation, but they tend to underestimate it. To obtain the predictive uncertainty…

Computer Vision and Pattern Recognition · Computer Science 2021-03-12 Takumi Kawashima , Qing Yu , Akari Asai , Daiki Ikami , Kiyoharu Aizawa

Training and Testing with Multiple Splits: A Central Limit Theorem for Split-Sample Estimators

As predictive algorithms grow in popularity, using the same dataset to both train and test a new model has become routine across research, policy, and industry. Sample-splitting attains valid inference on model properties by using separate…

Econometrics · Economics 2025-11-27 Bruno Fava

Exact post-selection inference, with application to the lasso

We develop a general approach to valid inference after model selection. At the core of our framework is a result that characterizes the distribution of a post-selection estimator conditioned on the selection event. We specialize the…

Statistics Theory · Mathematics 2016-05-04 Jason D. Lee , Dennis L. Sun , Yuekai Sun , Jonathan E. Taylor

Splitting strategies for post-selection inference

We consider the problem of providing valid inference for a selected parameter in a sparse regression setting. It is well known that classical regression tools can be unreliable in this context due to the bias generated in the selection…

Methodology · Statistics 2022-12-07 Daniel G. Rasines , G. Alastair Young

Inference post region selection

Post-selection inference consists in providing statistical guarantees, based on a data set, that are robust to a prior model selection step on the same data set. In this paper, we address an instance of the post-selection-inference problem,…

Statistics Theory · Mathematics 2025-06-16 Dominique Bontemps , François Bachoc , Pierre Neuvial

Locally Simultaneous Inference

Selective inference is the problem of giving valid answers to statistical questions chosen in a data-driven manner. A standard solution to selective inference is simultaneous inference, which delivers valid answers to the set of all…

Methodology · Statistics 2024-05-03 Tijana Zrnic , William Fithian

Inference in Linear Dyadic Data Models with Network Spillovers

When using dyadic data (i.e., data indexed by pairs of units), researchers typically assume a linear model, estimate it using Ordinary Least Squares and conduct inference using ``dyadic-robust" variance estimators. The latter assumes that…

Econometrics · Economics 2024-11-20 Nathan Canen , Ko Sugiura

Evaluating methods for Lasso selective inference in biomedical research by a comparative simulation study

Variable selection for regression models plays a key role in the analysis of biomedical data. However, inference after selection is not covered by classical statistical frequentist theory which assumes a fixed set of covariates in the…

Methodology · Statistics 2021-07-21 Michael Kammer , Daniela Dunkler , Stefan Michiels , Georg Heinze

Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Prediction intervals are a machine- and human-interpretable way to represent predictive uncertainty in a regression analysis. In this paper, we present a method for generating prediction intervals along with point estimates from an ensemble…

Machine Learning · Statistics 2020-07-21 Tárik S. Salem , Helge Langseth , Heri Ramampiaro

Variable Selection for Linear Regression Imputation in Surveys

Survey sampling is concerned with the estimation of finite population parameters. In practice, survey data suffer from item nonresponse, which is commonly handled through imputation, i.e., replacing missing values with predicted values. As…

Methodology · Statistics 2026-03-06 Ziming An , Mehdi Dagdoug , David Haziza

A Statistical Model with Qualitative Input

A statistical estimation model with qualitative input provides a mechanism to fuse human intuition in the form of qualitative information into a statistical model. We investigate the statistical properties of this model and devise a…

Applications · Statistics 2025-10-21 Seksan Kiatsupaibul , Pariyakorn Maneekul

Reliable uncertainties in indirect measurements

In this article we present very intuitive, easy to follow, yet mathematically rigorous, approach to the so called data fitting process. Rather than minimizing the distance between measured and simulated data points, we prefer to find such…

Data Analysis, Statistics and Probability · Physics 2017-08-07 Marek W. Gutowski

Estimation of neural connections from partially observed neural spikes

Plasticity is one of the most important properties of the nervous system, which enables animals to adjust their behavior to the ever-changing external environment. Changes in synaptic efficacy between neurons constitute one of the major…

Neurons and Cognition · Quantitative Biology 2018-01-23 Taishi Iwasaki , Hideitsu Hino , Masami Tatsuno , Shotaro Akaho , Noboru Murata

Selective Inference in Propensity Score Analysis

Selective inference (post-selection inference) is a methodology that has attracted much attention in recent years in the fields of statistics and machine learning. Naive inference based on data that are also used for model selection tends…

Methodology · Statistics 2021-11-25 Yoshiyuki Ninomiya , Yuta Umezu , Ichiro Takeuchi

Spurious Correlations and Where to Find Them

Spurious correlations occur when a model learns unreliable features from the data and are a well-known drawback of data-driven learning. Although there are several algorithms proposed to mitigate it, we are yet to jointly derive the…

Machine Learning · Computer Science 2023-08-23 Gautam Sreekumar , Vishnu Naresh Boddeti

Selective inference using randomized group lasso estimators for general models

Selective inference methods are developed for group lasso estimators for use with a wide class of distributions and loss functions. The method includes the use of exponential family distributions, as well as quasi-likelihood modeling for…

Methodology · Statistics 2024-03-28 Yiling Huang , Sarah Pirenne , Snigdha Panigrahi , Gerda Claeskens

How to Fix a Broken Confidence Estimator: Evaluating Post-hoc Methods for Selective Classification with Deep Neural Networks

This paper addresses the problem of selective classification for deep neural networks, where a model is allowed to abstain from low-confidence predictions to avoid potential errors. We focus on so-called post-hoc methods, which replace the…

Machine Learning · Computer Science 2025-06-23 Luís Felipe P. Cattelan , Danilo Silva

Inference for Forecasting Accuracy: Pooled versus Individual Estimators in High-dimensional Panel Data

Panels with large time $(T)$ and cross-sectional $(N)$ dimensions are a key data structure in social sciences and other fields. A central question in panel data analysis is whether to pool data across individuals or to estimate separate…

Methodology · Statistics 2025-12-18 Tim Kutta , Martin Schumann , Holger Dette