English

Differentially Private Bilevel Optimization

Machine Learning 2026-01-15 v3 Cryptography and Security Optimization and Control

Abstract

We present differentially private (DP) algorithms for bilevel optimization, a problem class that received significant attention lately in various machine learning applications. These are the first algorithms for such problems under standard DP constraints, and are also the first to avoid Hessian computations which are prohibitive in large-scale settings. Under the well-studied setting in which the upper-level is not necessarily convex and the lower-level problem is strongly-convex, our proposed gradient-based (ϵ,δ)(\epsilon,\delta)-DP algorithm returns a point with hypergradient norm at most O~((dup/ϵn)1/2+(dlow/ϵn)1/3)\widetilde{\mathcal{O}}\left((\sqrt{d_\mathrm{up}}/\epsilon n)^{1/2}+(\sqrt{d_\mathrm{low}}/\epsilon n)^{1/3}\right) where nn is the dataset size, and dup/dlowd_\mathrm{up}/d_\mathrm{low} are the upper/lower level dimensions. Our analysis covers constrained and unconstrained problems alike, accounts for mini-batch gradients, and applies to both empirical and population losses. As an application, we specialize our analysis to derive a simple private rule for tuning a regularization hyperparameter.

Keywords

Cite

@article{arxiv.2409.19800,
  title  = {Differentially Private Bilevel Optimization},
  author = {Guy Kornowski},
  journal= {arXiv preprint arXiv:2409.19800},
  year   = {2026}
}

Comments

Accepted to ALT 2026; some fixes following reviews