Related papers: PPI++: Efficient Prediction-Powered Inference

Regression for the Mean: Auto-Evaluation and Inference with Few Labels through Post-hoc Regression

The availability of machine learning systems that can effectively perform arbitrary tasks has led to synthetic labels from these systems being used in applications of statistical inference, such as data analysis or model evaluation. The…

Machine Learning · Computer Science 2025-07-09 Benjamin Eyre, David Madras

Bayesian Prediction-Powered Inference

Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data. Specifically, PPI methods provide tighter confidence intervals by combining small amounts of human-labeled data with…

Machine Learning · Computer Science 2024-05-13 R. Alex Hofer, Joshua Maynez, Bhuwan Dhingra, Adam Fisch, Amir Globerson, William W. Cohen

Anytime-valid, Bayes-assisted, Prediction-Powered Inference

Given a large pool of unlabelled data and a smaller amount of labels, prediction-powered inference (PPI) leverages machine learning predictions to increase the statistical efficiency of confidence interval procedures based solely on…

Machine Learning · Statistics 2025-10-27 Valentin Kilian, Stefano Cortinovis, François Caron

Prediction-Powered Inference with Inverse Probability Weighting

Prediction-powered inference (PPI) is a recent framework for valid statistical inference with partially labeled data, combining model-based predictions on a large unlabeled set with bias correction from a smaller labeled subset. Building on…

Machine Learning · Statistics 2026-03-25 Jyotishka Datta, Nicholas G. Polson

Demystifying Prediction Powered Inference

Machine learning predictions are increasingly used to supplement incomplete or costly-to-measure outcomes in fields such as biomedical research, environmental science, and social science. However, treating predictions as ground truth…

Machine Learning · Statistics 2026-01-29 Yilin Song, Dan M. Kluger, Harsh Parikh, Tian Gu

Local Prediction-Powered Inference

To infer a function value on a specific point $x$, it is essential to assign higher weights to the points closer to $x$, which is called local polynomial / multivariable regression. In many practical cases, a limited sample size may ruin…

Machine Learning · Statistics 2024-09-30 Yanwu Gu, Dong Xia

Do More Predictions Improve Statistical Inference? Filtered Prediction-Powered Inference

Recent advances in artificial intelligence have enabled the generation of large-scale, low-cost predictions with increasingly high fidelity. As a result, the primary challenge in statistical inference has shifted from data scarcity to data…

Statistics Theory · Mathematics 2026-02-12 Shirong Xu, Will Wei Sun

Prediction-Powered Semi-Supervised Learning with Online Power Tuning

Prediction-Powered Inference (PPI) is a recently proposed statistical inference technique for parameter estimation that leverages pseudo-labels on both labeled and unlabeled data to construct an unbiased, low-variance estimator. In this…

Machine Learning · Computer Science 2025-10-28 Noa Shoham, Ron Dorfman, Shalev Shaer, Kfir Y. Levy, Yaniv Romano

Generalized Prediction-Powered Inference, with Application to Binary Classifier Evaluation

In the partially-observed outcome setting, a recent set of proposals known as "prediction-powered inference" (PPI) involve (i) applying a pre-trained machine learning model to predict the response, and then (ii) using these predictions to…

Methodology · Statistics 2026-02-12 Runjia Zou, Daniela Witten, Brian Williamson

Calibeating Prediction-Powered Inference

We study semisupervised mean estimation with a small labeled sample, a large unlabeled sample, and a black-box prediction model whose output may be miscalibrated. A standard approach in this setting is augmented inverse-probability…

Machine Learning · Statistics 2026-04-24 Lars van der Laan, Mark Van Der Laan

Stratified Prediction-Powered Inference for Hybrid Language Model Evaluation

Prediction-powered inference (PPI) is a method that improves statistical estimates based on limited human-labeled data. PPI achieves this by combining small amounts of human-labeled data with larger amounts of data labeled by a reasonably…

Machine Learning · Computer Science 2024-12-05 Adam Fisch, Joshua Maynez, R. Alex Hofer, Bhuwan Dhingra, Amir Globerson, William W. Cohen

Multiple-Prediction-Powered Inference

Statistical estimation often involves tradeoffs between expensive, high-quality measurements and a variety of lower-quality proxies. We introduce Multiple-Prediction-Powered Inference (MultiPPI): a general framework for constructing…

Statistics Theory · Mathematics 2026-03-31 Charlie Cowen-Breen, Alekh Agarwal, Stephen Bates, William W. Cohen, Jacob Eisenstein, Amir Globerson, Adam Fisch

Prediction-Powered Adaptive Shrinkage Estimation

Prediction-Powered Inference (PPI) is a powerful framework for enhancing statistical estimates by combining limited gold-standard data with machine learning (ML) predictions. While prior work has demonstrated PPI's benefits for individual…

Machine Learning · Statistics 2025-11-10 Sida Li, Nikolaos Ignatiadis

No Free Lunch: Non-Asymptotic Analysis of Prediction-Powered Inference

Prediction-Powered Inference (PPI) is a popular strategy for combining gold-standard and possibly noisy pseudo-labels to perform statistical estimation. Prior work has shown an asymptotic "free lunch" for PPI++, an adaptive form of PPI,…

Machine Learning · Statistics 2025-05-27 Pranav Mani, Peng Xu, Zachary C. Lipton, Michael Oberst

Active Multiple-Prediction-Powered Inference

Post-deployment monitoring of healthcare AI requires statistically valid, label-efficient methods, but gold-standard labels from clinician chart review are expensive. Prediction-powered inference (PPI) and active statistical inference (ASI)…

Machine Learning · Statistics 2026-05-12 Nicholas Brawand, Nima Leclerc, Anhthy Ngo, Matthew Peterson, Sriram Vishwanath, Laith Alhussein, Ben Wellner

FAB-PPI: Frequentist, Assisted by Bayes, Prediction-Powered Inference

Prediction-powered inference (PPI) enables valid statistical inference by combining experimental data with machine learning predictions. When a sufficient number of high-quality predictions is available, PPI results in more accurate…

Machine Learning · Statistics 2025-08-18 Stefano Cortinovis, François Caron

Power Analysis for Prediction-Powered Inference

Modern studies increasingly leverage outcomes predicted by machine learning and artificial intelligence (AI/ML) models, and recent work, such as prediction-powered inference (PPI), has developed valid downstream statistical inference…

Methodology · Statistics 2026-03-18 Yiqun T. Chen, Moran Guo, Shengy Li

Empirical Likelihood Meets Prediction-Powered Inference

We study inference with a small labeled sample, a large unlabeled sample, and high-quality predictions from an external model. We link prediction-powered inference with empirical likelihood by stacking supervised estimating equations based…

Methodology · Statistics 2025-12-19 Guanghui Wang, Mengtao Wen, Changliang Zou

Semi-Supervised Learning via Cross-Prediction-Powered Inference for Wireless Systems

In many wireless application scenarios, acquiring labeled data can be prohibitively costly, requiring complex optimization processes or measurement campaigns. Semi-supervised learning leverages unlabeled samples to augment the available…

Information Theory · Computer Science 2024-10-08 Houssem Sifaou, Osvaldo Simeone

Prediction-Powered Inference Across Many Tasks for AI Evaluation & Social Science Research

Many applications require statistically valid inference across many related tasks, while using only a handful of high-quality labels per hypothesis. In AI evaluation, these tasks may correspond to model behaviors across prompts, subgroups,…

Machine Learning · Statistics 2026-05-29 Nicolas Emmenegger, Ellery Stahler, Chara Podimata