Related papers: How much data is sufficient to learn high-performi…

Learning to Optimize Computational Resources: Frugal Training with Generalization Guarantees

Algorithms typically come with tunable parameters that have a considerable impact on the computational resources they consume. Too often, practitioners must hand-tune the parameters, a tedious and error-prone task. A recent line of research…

Machine Learning · Computer Science 2020-11-24 Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

Data-Driven Performance Guarantees for Classical and Learned Optimizers

We introduce a data-driven approach to analyze the performance of continuous optimization algorithms using generalization guarantees from statistical learning theory. We study classical and learned optimizers to solve families of parametric…

Optimization and Control · Mathematics 2025-10-07 Rajiv Sambharya, Bartolomeo Stellato

Provably Data-driven Multiple Hyper-parameter Tuning with Structured Loss Function

Data-driven algorithm design automates hyperparameter tuning, but its statistical foundations remain limited because model performance can depend on hyperparameters in implicit and highly non-smooth ways. Existing guarantees focus on the…

Machine Learning · Statistics 2026-05-13 Tung Quoc Le, Anh Tuan Nguyen, Viet Anh Nguyen

Data-driven Algorithm Design

Data-driven algorithm design is an important aspect of modern data science and algorithm design. Rather than using off-the-shelf algorithms that only have worst-case performance guarantees, practitioners often optimize over large families…

Data Structures and Algorithms · Computer Science 2020-11-17 Maria-Florina Balcan

An Information-Theoretic Approach to Generalization Theory

We investigate the in-distribution generalization of machine learning algorithms. We depart from traditional complexity-based approaches by analyzing information-theoretic bounds that quantify the dependence between a learning algorithm and…

Machine Learning · Statistics 2024-08-27 Borja Rodríguez-Gálvez, Ragnar Thobaben, Mikael Skoglund

Refined bounds for algorithm configuration: The knife-edge of dual class approximability

Automating algorithm configuration is growing increasingly necessary as algorithms come with more and more tunable parameters. It is common to tune parameters using machine learning, optimizing performance metrics such as runtime and…

Artificial Intelligence · Computer Science 2020-12-25 Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

Information-Theoretic Bounds on the Moments of the Generalization Error of Learning Algorithms

Generalization error bounds are critical to understanding the performance of machine learning models. In this work, building upon a new bound of the expected value of an arbitrary function of the population and empirical risk of a learning…

Information Theory · Computer Science 2021-05-07 Gholamali Aminian, Laura Toni, Miguel R. D. Rodrigues

Which Algorithms Have Tight Generalization Bounds?

We study which machine learning algorithms have tight generalization bounds. First, we present conditions that preclude the existence of tight generalization bounds. Specifically, we show that algorithms that have certain inductive biases…

Machine Learning · Computer Science 2024-10-04 Michael Gastpar, Ido Nachum, Jonathan Shafer, Thomas Weinberger

Generalization Bounds for Data-Driven Numerical Linear Algebra

Data-driven algorithms can adapt their internal structure or parameters to inputs from unknown application-specific distributions, by learning from a training sample of inputs. Several recent works have applied this approach to problems in…

Machine Learning · Computer Science 2022-06-17 Peter Bartlett, Piotr Indyk, Tal Wagner

Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many…

Machine Learning · Computer Science 2023-11-22 Jing Xu, Jiaye Teng, Yang Yuan, Andrew Chi-Chih Yao

Unlock the Power of Algorithm Features: A Generalization Analysis for Algorithm Selection

In the algorithm selection research, the discussion surrounding algorithm features has been significantly overshadowed by the emphasis on problem features. Although a few empirical studies have yielded evidence regarding the effectiveness…

Machine Learning · Computer Science 2024-06-04 Xingyu Wu, Yan Zhong, Jibin Wu, Yuxiao Huang, Sheng-hao Wu, Kay Chen Tan

Stability, Complexity and Data-Dependent Worst-Case Generalization Bounds

Providing generalization guarantees for stochastic optimization algorithms remains a key challenge in learning theory. Recently, numerous works demonstrated the impact of the geometric properties of optimization trajectories on…

Machine Learning · Computer Science 2026-01-23 Mario Tuci, Lennart Bastian, Benjamin Dupuis, Nassir Navab, Tolga Birdal, Umut Şimşekli

Practical Bayesian Optimization of Machine Learning Algorithms

Machine learning algorithms frequently require careful tuning of model hyperparameters, regularization terms, and optimization parameters. Unfortunately, this tuning is often a "black art" that requires expert experience, unwritten rules of…

Machine Learning · Statistics 2012-08-30 Jasper Snoek, Hugo Larochelle, Ryan P. Adams

Fantastic Generalization Measures are Nowhere to be Found

We study the notion of a generalization bound being uniformly tight, meaning that the difference between the bound and the population loss is small for all learning algorithms and all population distributions. Numerous generalization bounds…

Machine Learning · Computer Science 2023-11-29 Michael Gastpar, Ido Nachum, Jonathan Shafer, Thomas Weinberger

Upper Bounds on the Generalization Error of Private Algorithms for Discrete Data

In this work, we study the generalization capability of algorithms from an information-theoretic perspective. It has been shown that the expected generalization error of an algorithm is bounded from above by a function of the relative…

Information Theory · Computer Science 2021-10-27 Borja Rodríguez-Gálvez, Germán Bassi, Mikael Skoglund

Generalizable control for quantum parameter estimation through reinforcement learning

Measurement and estimation of parameters are essential for science and engineering, where one of the main quests is to find systematic schemes that can achieve high precision. While conventional schemes for quantum parameter estimation…

Quantum Physics · Physics 2021-04-29 Han Xu, Junning Li, Liqiang Liu, Yu Wang, Haidong Yuan, Xin Wang

Practicality of generalization guarantees for unsupervised domain adaptation with neural networks

Understanding generalization is crucial to confidently engineer and deploy machine learning models, especially when deployment implies a shift in the data domain. For such domain adaptation problems, we seek generalization bounds which are…

Machine Learning · Computer Science 2023-03-16 Adam Breitholtz, Fredrik D. Johansson

The Limits of Assumption-free Tests for Algorithm Performance

Algorithm evaluation and comparison are fundamental questions in machine learning and statistics: how well does an algorithm perform at a given modeling task, and which algorithm performs best? Many methods have been developed to assess…

Statistics Theory · Mathematics 2025-11-25 Yuetian Luo, Rina Foygel Barber

Generalization in portfolio-based algorithm selection

Portfolio-based algorithm selection has seen tremendous practical success over the past two decades. This algorithm configuration procedure works by first selecting a portfolio of diverse algorithm parameter settings, and then, on a given…

Artificial Intelligence · Computer Science 2020-12-25 Maria-Florina Balcan, Tuomas Sandholm, Ellen Vitercik

Separating Geometry from Probability in the Analysis of Generalization

The goal of machine learning is to find models that minimize prediction error on data that has not yet been seen. Its operational paradigm assumes access to a dataset $S$ and articulates a scheme for evaluating how well a given model…

Machine Learning · Computer Science 2026-04-22 Maxim Raginsky, Benjamin Recht