Data-driven Error Estimation: Excess Risk Bounds without Class Complexity as Input

Sanath Kumar Krishnamurthy; Anna Lyubarskaja; Emma Brunskill; Susan Athey

Data-driven Error Estimation: Excess Risk Bounds without Class Complexity as Input

Machine Learning 2026-02-05 v4 Machine Learning

Authors: Sanath Kumar Krishnamurthy , Anna Lyubarskaja , Emma Brunskill , Susan Athey

Abstract

Constructing confidence intervals that are simultaneously valid across a class of estimates is central to tasks such as multiple mean estimation, generalization guarantees, and adaptive experimental design. We frame this as an ``error estimation problem," where the goal is to determine a high-probability upper bound on the maximum error for a class of estimates. We propose an entirely data-driven approach that derives such bounds for both finite and infinite class settings, naturally adapting to a potentially unknown correlation structure of random errors. Notably, our method does not require class complexity as an input, overcoming a major limitation of existing approaches. We present our simple yet general solution and demonstrate applications to simultaneous confidence intervals, excess-risk control and optimizing exploration in contextual bandit algorithms.

Keywords

generalization bounds machine learning theory classification

Cite

@article{arxiv.2405.04636,
  title  = {Data-driven Error Estimation: Excess Risk Bounds without Class Complexity as Input},
  author = {Sanath Kumar Krishnamurthy and Anna Lyubarskaja and Emma Brunskill and Susan Athey},
  journal= {arXiv preprint arXiv:2405.04636},
  year   = {2026}
}

Data-driven Error Estimation: Excess Risk Bounds without Class Complexity as Input

Abstract

Keywords

Cite

Related papers