English

Learning Theory for Kernel Bilevel Optimization

Machine Learning 2025-11-18 v3

Abstract

Bilevel optimization has emerged as a technique for addressing a wide range of machine learning problems that involve an outer objective implicitly determined by the minimizer of an inner problem. While prior works have primarily focused on the parametric setting, a learning-theoretic foundation for bilevel optimization in the nonparametric case remains relatively unexplored. In this paper, we take a first step toward bridging this gap by studying Kernel Bilevel Optimization (KBO), where the inner objective is optimized over a reproducing kernel Hilbert space. This setting enables rich function approximation while providing a foundation for rigorous theoretical analysis. In this context, we derive novel finite-sample generalization bounds for KBO, leveraging tools from empirical process theory. These bounds further allow us to assess the statistical accuracy of gradient-based methods applied to the empirical discretization of KBO. We numerically illustrate our theoretical findings on a synthetic instrumental variable regression task.

Keywords

Cite

@article{arxiv.2502.08457,
  title  = {Learning Theory for Kernel Bilevel Optimization},
  author = {Fares El Khoury and Edouard Pauwels and Samuel Vaiter and Michael Arbel},
  journal= {arXiv preprint arXiv:2502.08457},
  year   = {2025}
}

Comments

47 pages, 3 figures. Accepted at NeurIPS 2025

R2 v1 2026-06-28T21:41:46.428Z