English

Learning U-Statistics with Active Inference

Machine Learning 2026-05-13 v1 Machine Learning

Abstract

UU-statistics play a central role in statistical inference. In many modern applications, however, acquiring the labels required for UU-statistics is costly. Motivated by recent advances in active inference, we develop an active inference framework for UU-statistics that selectively queries informative labels to improve estimation efficiency under a fixed labeling budget, while preserving valid statistical inference. Our approach is built on the augmented inverse probability weighting UU-statistic, which is designed to incorporate the sampling rule and machine learning predictions. We characterize the optimal sampling rule that minimizes its variance and design practical sampling strategies. We further extend the framework to UU-statistic-based empirical risk minimization. Experiments on real datasets demonstrate substantial gains in estimation efficiency over baseline methods, while maintaining target coverage.

Keywords

Cite

@article{arxiv.2605.11638,
  title  = {Learning U-Statistics with Active Inference},
  author = {Xiaoning Wang and Yuyang Huo and Liuhua Peng and Changliang Zou},
  journal= {arXiv preprint arXiv:2605.11638},
  year   = {2026}
}