Related papers: Generalization bounds for averaged classifiers

Learning to Abstain from Binary Prediction

A binary classifier capable of abstaining from making a label prediction has two goals in tension: minimizing errors, and avoiding abstaining unnecessarily often. In this work, we exactly characterize the best achievable tradeoff between…

Machine Learning · Computer Science 2016-11-30 Akshay Balsubramani

The Utility of Abstaining in Binary Classification

We explore the problem of binary classification in machine learning, with a twist - the classifier is allowed to abstain on any datum, professing ignorance about the true class label without committing to any prediction. This is directly…

Machine Learning · Computer Science 2015-12-29 Akshay Balsubramani

Algorithmic stability and hypothesis complexity

We introduce a notion of algorithmic stability of learning algorithms---that we term \emph{argument stability}---that captures stability of the hypothesis output by the learning algorithm in the normed space of functions from which…

Machine Learning · Statistics 2017-08-04 Tongliang Liu , Gábor Lugosi , Gergely Neu , Dacheng Tao

Robustness and Generalization

We derive generalization bounds for learning algorithms based on their robustness: the property that if a testing sample is "similar" to a training sample, then the testing error is close to the training error. This provides a novel…

Machine Learning · Computer Science 2015-03-17 Huan Xu , Shie Mannor

Algebraic Machine Learning

Machine learning algorithms use error function minimization to fit a large set of parameters in a preexisting model. However, error minimization eventually leads to a memorization of the training dataset, losing the ability to generalize to…

Machine Learning · Computer Science 2018-03-16 Fernando Martin-Maroto , Gonzalo G. de Polavieja

A Generalization Result for Convergence in Learning-to-Optimize

Learning-to-optimize leverages machine learning to accelerate optimization algorithms. While empirical results show tremendous improvements compared to classical optimization algorithms, theoretical guarantees are mostly lacking, such that…

Machine Learning · Computer Science 2025-06-02 Michael Sucker , Peter Ochs

Improving classifier decision boundaries using nearest neighbors

Neural networks are not learning optimal decision boundaries. We show that decision boundaries are situated in areas of low training data density. They are impacted by few training samples which can easily lead to overfitting. We provide a…

Machine Learning · Computer Science 2023-10-09 Johannes Schneider

Uniform Generalization, Concentration, and Adaptive Learning

One fundamental goal in any learning algorithm is to mitigate its risk for overfitting. Mathematically, this requires that the learning algorithm enjoys a small generalization risk, which is defined either in expectation or in probability.…

Machine Learning · Computer Science 2016-10-04 Ibrahim Alabdulmohsin

Learning Not to Learn: Training Deep Neural Networks with Biased Data

We propose a novel regularization algorithm to train deep neural networks, in which data at training time is severely biased. Since a neural network efficiently learns data distribution, a network is likely to learn the bias information to…

Computer Vision and Pattern Recognition · Computer Science 2019-04-16 Byungju Kim , Hyunwoo Kim , Kyungsu Kim , Sungjin Kim , Junmo Kim

Learning Weighted Representations for Generalization Across Designs

Predictive models that generalize well under distributional shift are often desirable and sometimes crucial to building robust and reliable machine learning applications. We focus on distributional shift that arises in causal inference from…

Machine Learning · Statistics 2018-02-27 Fredrik D. Johansson , Nathan Kallus , Uri Shalit , David Sontag

The Generalization Ability of Online Algorithms for Dependent Data

We study the generalization performance of online learning algorithms trained on samples coming from a dependent source of data. We show that the generalization error of any stable online algorithm concentrates around its regret--an easily…

Machine Learning · Statistics 2012-06-08 Alekh Agarwal , John C. Duchi

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

The key distinguishing property of a Bayesian approach is marginalization, rather than using a single setting of weights. Bayesian marginalization can particularly improve the accuracy and calibration of modern deep neural networks, which…

Machine Learning · Computer Science 2022-03-31 Andrew Gordon Wilson , Pavel Izmailov

Adaptation Algorithm and Theory Based on Generalized Discrepancy

We present a new algorithm for domain adaptation improving upon a discrepancy minimization algorithm previously shown to outperform a number of algorithms for this task. Unlike many previous algorithms for domain adaptation, our algorithm…

Machine Learning · Computer Science 2015-02-24 Corinna Cortes , Mehryar Mohri , Andres Muñoz Medina

Generalizability of Neural Networks Minimizing Empirical Risk Based on Expressive Ability

The primary objective of learning methods is generalization. Classic uniform generalization bounds, which rely on VC-dimension or Rademacher complexity, fail to explain the significant attribute that over-parameterized models in deep…

Machine Learning · Computer Science 2025-03-07 Lijia Yu , Yibo Miao , Yifan Zhu , Xiao-Shan Gao , Lijun Zhang

The Optimal Choice of Hypothesis Is the Weakest, Not the Shortest

If $A$ and $B$ are sets such that $A \subset B$, generalisation may be understood as the inference from $A$ of a hypothesis sufficient to construct $B$. One might infer any number of hypotheses from $A$, yet only some of those may…

Artificial Intelligence · Computer Science 2024-04-12 Michael Timothy Bennett

Unlock the Power of Algorithm Features: A Generalization Analysis for Algorithm Selection

In the algorithm selection research, the discussion surrounding algorithm features has been significantly overshadowed by the emphasis on problem features. Although a few empirical studies have yielded evidence regarding the effectiveness…

Machine Learning · Computer Science 2024-06-04 Xingyu Wu , Yan Zhong , Jibin Wu , Yuxiao Huang , Sheng-hao Wu , Kay Chen Tan

Towards optimally abstaining from prediction with OOD test examples

A common challenge across all areas of machine learning is that training data is not distributed like test data, due to natural shifts, "blind spots," or adversarial examples; such test examples are referred to as out-of-distribution (OOD)…

Machine Learning · Computer Science 2021-10-29 Adam Tauman Kalai , Varun Kanade

Do highly over-parameterized neural networks generalize since bad solutions are rare?

We study over-parameterized classifiers where Empirical Risk Minimization (ERM) for learning leads to zero training error. In these over-parameterized settings there are many global minima with zero training error, some of which generalize…

Machine Learning · Computer Science 2023-12-05 Julius Martinetz , Thomas Martinetz

Learning from networked examples

Many machine learning algorithms are based on the assumption that training examples are drawn independently. However, this assumption does not hold anymore when learning from a networked sample because two or more training examples may…

Artificial Intelligence · Computer Science 2017-06-06 Yuyi Wang , Jan Ramon , Zheng-Chu Guo

Compressive Sensing and Neural Networks from a Statistical Learning Perspective

Various iterative reconstruction algorithms for inverse problems can be unfolded as neural networks. Empirically, this approach has often led to improved results, but theoretical guarantees are still scarce. While some progress on…

Statistics Theory · Mathematics 2021-08-16 Arash Behboodi , Holger Rauhut , Ekkehard Schnoor