Related papers: Prediction-Powered E-Values

Extending Prediction-Powered Inference through Conformal Prediction

Prediction-powered inference is a recent methodology for the safe use of black-box ML models to impute missing data, strengthening inference of statistical parameters. However, many applications require strong properties besides valid…

Methodology · Statistics 2025-10-21 Daniel Csillag , Pedro Dall'Antonia , Claudio José Struchiner , Guilherme Tegoni Goedert

Prediction-Powered Inference

Prediction-powered inference is a framework for performing valid statistical inference when an experimental dataset is supplemented with predictions from a machine-learning system. The framework yields simple algorithms for computing…

Machine Learning · Statistics 2023-11-10 Anastasios N. Angelopoulos , Stephen Bates , Clara Fannjiang , Michael I. Jordan , Tijana Zrnic

E-Values Expand the Scope of Conformal Prediction

Conformal prediction is a powerful framework for distribution-free uncertainty quantification. The standard approach to conformal prediction relies on comparing the ranks of prediction scores: under exchangeability, the rank of a future…

Machine Learning · Statistics 2025-05-07 Etienne Gauthier , Francis Bach , Michael I. Jordan

e-Values for Real-Time Residential Electricity Demand Forecast Model Selection

With the growing number of forecasting techniques and the increasing significance of forecast-based operation - particularly in the rapidly evolving energy sector - selecting the most effective forecasting model has become a critical task.…

Systems and Control · Electrical Eng. & Systems 2024-10-24 Fabian Backhaus , Karoline Brucke , Peter Ruckdeschel , Sunke Schlüters

Valid sequential inference on probability forecast performance

Probability forecasts for binary events play a central role in many applications. Their quality is commonly assessed with proper scoring rules, which assign forecasts a numerical score such that a correct forecast achieves a minimal…

Methodology · Statistics 2022-07-04 Alexander Henzi , Johanna F. Ziegel

Prediction-Powered Conditional Inference

We study prediction-powered conditional inference in the setting where labeled data are scarce, unlabeled covariates are abundant, and a black-box machine-learning predictor is available. The goal is to perform statistical inference on…

Machine Learning · Statistics 2026-03-09 Yang Sui , Jin Zhou , Hua Zhou , Xiaowu Dai

E-values as statistical evidence: A comparison to Bayes factors, likelihoods, and p-values

A recurring debate in the philosophy of statistics concerns what, exactly, should count as a measure of evidence for or against a given hypothesis. P-values, likelihood ratios, and Bayes factors all have their defenders. In this paper we…

Methodology · Statistics 2026-03-26 Ben Chugg , Aaditya Ramdas , Peter Grünwald

Empirical Likelihood Meets Prediction-Powered Inference

We study inference with a small labeled sample, a large unlabeled sample, and high-quality predictions from an external model. We link prediction-powered inference with empirical likelihood by stacking supervised estimating equations based…

Methodology · Statistics 2025-12-19 Guanghui Wang , Mengtao Wen , Changliang Zou

Regularized e-processes: anytime valid inference with knowledge-based efficiency gains

Classical statistical methods have theoretical justification when the sample size is predetermined. In applications, however, it's often the case that sample sizes are data-dependent rather than predetermined. The aforementioned methods…

Statistics Theory · Mathematics 2026-05-06 Ryan Martin

Cross-Prediction-Powered Inference

While reliable data-driven decision-making hinges on high-quality labeled data, the acquisition of quality labels often involves laborious human annotations or slow and expensive scientific measurements. Machine learning is becoming an…

Machine Learning · Statistics 2024-03-01 Tijana Zrnic , Emmanuel J. Candès

Feature Selection using e-values

In the context of supervised parametric models, we introduce the concept of e-values. An e-value is a scalar quantity that represents the proximity of the sampling distribution of parameter estimates in a model trained on a subset of…

Machine Learning · Statistics 2022-07-19 Subhabrata Majumdar , Snigdhansu Chatterjee

Conformal e-prediction

This paper discusses a counterpart of conformal prediction for e-values, conformal e-prediction. Conformal e-prediction is conceptually simpler and had been developed in the 1990s as a precursor of conformal prediction. When conformal…

Machine Learning · Computer Science 2025-05-20 Vladimir Vovk

Beyond Neyman-Pearson: e-values enable hypothesis testing with a data-driven alpha

A standard practice in statistical hypothesis testing is to mention the p-value alongside the accept/reject decision. We show the advantages of mentioning an e-value instead. With p-values, it is not clear how to use an extreme observation…

Methodology · Statistics 2024-04-04 Peter Grünwald

Anytime-valid, Bayes-assisted, Prediction-Powered Inference

Given a large pool of unlabelled data and a smaller amount of labels, prediction-powered inference (PPI) leverages machine learning predictions to increase the statistical efficiency of confidence interval procedures based solely on…

Machine Learning · Statistics 2025-10-27 Valentin Kilian , Stefano Cortinovis , François Caron

Selective inference is easier with p-values

Selective inference is a subfield of statistics that enables valid inference after selection of a data-dependent question. In this paper, we introduce selectively dominant p-values, a class of p-values that allow practitioners to easily…

Methodology · Statistics 2024-11-22 Anav Sood

Sequentially valid tests for forecast calibration

Forecasting and forecast evaluation are inherently sequential tasks. Predictions are often issued on a regular basis, such as every hour, day, or month, and their quality is monitored continuously. However, the classical statistical tools…

Methodology · Statistics 2022-07-04 Sebastian Arnold , Alexander Henzi , Johanna F. Ziegel

The Value Added of Machine Learning to Causal Inference: Evidence from Revisited Studies

A new and rapidly growing econometric literature is making advances in the problem of using machine learning methods for causal inference questions. Yet, the empirical economics literature has not started to fully exploit the strengths of…

General Economics · Economics 2021-01-05 Anna Baiardi , Andrea A. Naghi

Post-Hoc Large-Sample Statistical Inference

We derive inferential procedures for large sample sizes that remain valid under data-dependent significance levels (so-called "post-hoc valid inference"). Classical statistical tools require that the significance level -- the "type-I error"…

Statistics Theory · Mathematics 2026-03-10 Ben Chugg , Etienne Gauthier , Michael I. Jordan , Aaditya Ramdas , Ian Waudby-Smith

Do More Predictions Improve Statistical Inference? Filtered Prediction-Powered Inference

Recent advances in artificial intelligence have enabled the generation of large-scale, low-cost predictions with increasingly high fidelity. As a result, the primary challenge in statistical inference has shifted from data scarcity to data…

Statistics Theory · Mathematics 2026-02-12 Shirong Xu , Will Wei Sun

Confidence and discoveries with e-values

We discuss systematically two versions of confidence regions: those based on p-values and those based on e-values, a recent alternative to p-values. Both versions can be applied to multiple hypothesis testing, and in this paper we are…

Statistics Theory · Mathematics 2024-03-05 Vladimir Vovk , Ruodu Wang