Related papers: Conditional predictive inference for stable algori…

Conformal Prediction Intervals for Neural Networks Using Cross Validation

Neural networks are among the most powerful nonlinear models used to address supervised learning problems. Similar to most machine learning algorithms, neural networks produce point predictions and do not provide any prediction interval…

Machine Learning · Statistics 2020-07-01 Saeed Khaki , Dan Nettleton

Cross-validation Confidence Intervals for Test Error

This work develops central limit theorems for cross-validation and consistent estimators of its asymptotic variance under weak stability conditions on the learning algorithm. Together, these results provide practical, asymptotically-exact…

Machine Learning · Statistics 2020-11-03 Pierre Bayle , Alexandre Bayle , Lucas Janson , Lester Mackey

Training-conditional coverage for distribution-free predictive inference

The field of distribution-free predictive inference provides tools for provably valid prediction without any assumptions on the distribution of the data, which can be paired with any regression algorithm to provide accurate and reliable…

Statistics Theory · Mathematics 2023-01-19 Michael Bian , Rina Foygel Barber

Conditional predictive inference post model selection

We give a finite-sample analysis of predictive inference procedures after model selection in regression with random design. The analysis is focused on a statistically challenging scenario where the number of potentially important…

Statistics Theory · Mathematics 2009-08-26 Hannes Leeb

Algorithmic stability implies training-conditional coverage for distribution-free prediction methods

In a supervised learning problem, given a predicted value that is the output of some trained model, how can we quantify our uncertainty around this prediction? Distribution-free predictive inference aims to construct prediction intervals…

Statistics Theory · Mathematics 2025-07-01 Ruiting Liang , Rina Foygel Barber

Cross-validation: what does it estimate and how well does it do it?

Cross-validation is a widely-used technique to estimate prediction error, but its behavior is complex and not fully understood. Ideally, one would like to think that cross-validation estimates the prediction error for the model at hand, fit…

Methodology · Statistics 2024-03-12 Stephen Bates , Trevor Hastie , Robert Tibshirani

On the bias of K-fold cross validation with stable learners

This paper investigates the efficiency of the K-fold cross-validation (CV) procedure and a debiased version thereof as a means of estimating the generalization risk of a learning algorithm. We work under the general assumption of uniform…

Statistics Theory · Mathematics 2023-06-13 Anass Aghbalou , François Portier , Anne Sabourin

Cross-Validation with Confidence

Cross-validation is one of the most popular model selection methods in statistics and machine learning. Despite its wide applicability, traditional cross validation methods tend to select overfitting models, due to the ignorance of the…

Methodology · Statistics 2017-12-25 Jing Lei

Irredundant $k$-Fold Cross-Validation

In traditional k-fold cross-validation, each instance is used ($k-1$) times for training and once for testing, leading to redundancy that lets many instances disproportionately influence the learning phase. We introduce Irredundant $k$-fold…

Machine Learning · Computer Science 2025-08-29 Jesus S. Aguilar-Ruiz

Conditional validity and a fast approximation formula of full conformal prediction sets

Prediction sets based on full conformal prediction have seen an increasing interest in statistical learning due to their universal marginal coverage guarantees. However, practitioners have refrained from using it in applications for two…

Statistics Theory · Mathematics 2025-08-08 Nicolai Amann

Distributional conformal prediction

We propose a robust method for constructing conditionally valid prediction intervals based on models for conditional distributions such as quantile and distribution regression. Our approach can be applied to important prediction problems…

Econometrics · Economics 2022-01-26 Victor Chernozhukov , Kaspar Wüthrich , Yinchu Zhu

The limits of distribution-free conditional predictive inference

We consider the problem of distribution-free predictive inference, with the goal of producing predictive coverage guarantees that hold conditionally rather than marginally. Existing methods such as conformal prediction offer marginal…

Statistics Theory · Mathematics 2020-04-16 Rina Foygel Barber , Emmanuel J. Candès , Aaditya Ramdas , Ryan J. Tibshirani

Predictive Performance Test based on the Exhaustive Nested Cross-Validation for High-dimensional data

It is crucial to assess the predictive performance of a model to establish its practicality and relevance in real-world scenarios, particularly for high-dimensional data analysis. Among data splitting or resampling methods, cross-validation…

Methodology · Statistics 2025-11-26 Iris Ivy Gauran , Hernando Ombao , Zhaoxia Yu

Approximation to Object Conditional Validity with Inductive Conformal Predictors

Conformal predictors are machine learning algorithms that output prediction sets that have a guarantee of marginal validity for finite samples with minimal distributional assumptions. This is a property that makes conformal predictors…

Machine Learning · Computer Science 2021-03-03 Anthony Bellotti

The Structure of Cross-Validation Error: Stability, Covariance, and Minimax Limits

Despite ongoing theoretical research on cross-validation (CV), many theoretical questions remain widely open. This motivates our investigation into how properties of algorithm-distribution pairs can affect the choice for the number of folds…

Statistics Theory · Mathematics 2026-01-09 Ido Nachum , Rüdiger Urbanke , Thomas Weinberger

Robust Validation: Confident Predictions Even When Distributions Shift

While the traditional viewpoint in machine learning and statistics assumes training and testing samples come from the same population, practice belies this fiction. One strategy -- coming from robust statistics and optimization -- is thus…

Machine Learning · Statistics 2024-07-08 Maxime Cauchois , Suyash Gupta , Alnur Ali , John C. Duchi

Adjusting Regression Models for Conditional Uncertainty Calibration

Conformal Prediction methods have finite-sample distribution-free marginal coverage guarantees. However, they generally do not offer conditional coverage guarantees, which can be important for high-stakes decisions. In this paper, we…

Machine Learning · Statistics 2024-09-27 Ruijiang Gao , Mingzhang Yin , James McInerney , Nathan Kallus

Confidence Intervals for Low-Dimensional Parameters in High-Dimensional Linear Models

The purpose of this paper is to propose methodologies for statistical inference of low-dimensional parameters with high-dimensional data. We focus on constructing confidence intervals for individual coefficients and linear combinations of…

Methodology · Statistics 2012-11-05 Cun-Hui Zhang , Stephanie S. Zhang

Cross-Validation, Risk Estimation, and Model Selection

Cross-validation is a popular non-parametric method for evaluating the accuracy of a predictive rule. The usefulness of cross-validation depends on the task we want to employ it for. In this note, I discuss a simple non-parametric setting,…

Methodology · Statistics 2019-09-27 Stefan Wager

A Few Observations on Sample-Conditional Coverage in Conformal Prediction

We revisit the problem of constructing predictive confidence sets for which we wish to obtain some type of conditional validity. We provide new arguments showing how ``split conformal'' methods achieve near desired coverage levels with high…

Statistics Theory · Mathematics 2025-03-04 John C. Duchi