Related papers: Supervised classification via minimax probabilisti…

Minimax risk classifiers with 0-1 loss

Supervised classification techniques use training samples to learn a classification rule with small expected 0-1 loss (error probability). Conventional methods enable tractable learning and provide out-of-sample generalization by using…

Machine Learning · Statistics 2023-08-21 Santiago Mazuelas , Mauricio Romero , Peter Grünwald

Minimax Classification with 0-1 Loss and Performance Guarantees

Supervised classification techniques use training samples to find classification rules with small expected 0-1 loss. Conventional methods achieve efficient learning and out-of-sample generalization by minimizing surrogate losses over…

Machine Learning · Statistics 2021-08-12 Santiago Mazuelas , Andrea Zanoni , Aritz Perez

Generalized Maximum Entropy for Supervised Classification

The maximum entropy principle advocates to evaluate events' probabilities using a distribution that maximizes entropy among those that satisfy certain expectations' constraints. Such principle can be generalized for arbitrary decision…

Machine Learning · Statistics 2021-12-16 Santiago Mazuelas , Yuan Shen , Aritz Pérez

General Supervision via Probabilistic Transformations

Different types of training data have led to numerous schemes for supervised classification. Current learning techniques are tailored to one specific scheme and cannot handle general ensembles of training data. This paper presents a…

Machine Learning · Statistics 2020-10-20 Santiago Mazuelas , Aritz Perez

Efficient Large-Scale Learning of Minimax Risk Classifiers

Supervised learning with large-scale data usually leads to complex optimization problems, especially for classification tasks with multiple classes. Stochastic subgradient methods can enable efficient learning with a large number of samples…

Machine Learning · Computer Science 2025-11-25 Kartheek Bondugula , Santiago Mazuelas , Aritz Pérez

A Minimax Approach to Supervised Learning

Given a task of predicting $Y$ from $X$, a loss function $L$, and a set of probability distributions $\Gamma$ on $(X,Y)$, what is the optimal decision rule minimizing the worst-case expected loss over $\Gamma$? In this paper, we address…

Machine Learning · Statistics 2017-07-05 Farzan Farnia , David Tse

Unsupervised representation learning with recognition-parametrised probabilistic models

We introduce a new approach to probabilistic unsupervised learning based on the recognition-parametrised model (RPM): a normalised semi-parametric hypothesis class for joint distributions over observed and latent variables. Under the key…

Machine Learning · Computer Science 2023-04-21 William I. Walker , Hugo Soulat , Changmin Yu , Maneesh Sahani

Risk-based Calibration for Generative Classifiers

Generative classifiers are constructed on the basis of a joint probability distribution and are typically learned using closed-form procedures that rely on data statistics and maximize scores related to data fitting. However, these scores…

Machine Learning · Computer Science 2025-03-31 Aritz Pérez , Carlos Echegoyen , Guzmán Santafé

Unsupervised Representation Learning - an Invariant Risk Minimization Perspective

We propose a novel unsupervised framework for \emph{Invariant Risk Minimization} (IRM), extending the concept of invariance to settings where labels are unavailable. Traditional IRM methods rely on labeled data to learn representations that…

Machine Learning · Computer Science 2026-03-05 Yotam Norman , Ron Meir

On the Error Resistance of Hinge Loss Minimization

Commonly used classification algorithms in machine learning, such as support vector machines, minimize a convex surrogate loss on training examples. In practice, these algorithms are surprisingly robust to errors in the training data. In…

Machine Learning · Computer Science 2020-12-03 Kunal Talwar

Probabilistic Invariant Learning with Randomized Linear Classifiers

Designing models that are both expressive and preserve known invariances of tasks is an increasingly hard problem. Existing solutions tradeoff invariance for computational or memory resources. In this work, we show how to leverage…

Machine Learning · Computer Science 2023-09-29 Leonardo Cotta , Gal Yehuda , Assaf Schuster , Chris J. Maddison

Robust Unsupervised Learning via L-Statistic Minimization

Designing learning algorithms that are resistant to perturbations of the underlying data distribution is a problem of wide practical and theoretical importance. We present a general approach to this problem focusing on unsupervised…

Machine Learning · Computer Science 2021-02-22 Andreas Maurer , Daniela A. Parletta , Andrea Paudice , Massimiliano Pontil

A Convex Relaxation for Weakly Supervised Classifiers

This paper introduces a general multi-class approach to weakly supervised classification. Inferring the labels and learning the parameters of the model is usually done jointly through a block-coordinate descent algorithm such as…

Machine Learning · Computer Science 2012-07-03 Armand Joulin , Francis Bach

On Excess Risk Convergence Rates of Neural Network Classifiers

The recent success of neural networks in pattern recognition and classification problems suggests that neural networks possess qualities distinct from other more classical classifiers such as SVMs or boosting classifiers. This paper studies…

Machine Learning · Statistics 2023-09-27 Hyunouk Ko , Namjoon Suh , Xiaoming Huo

Optimal learning via local entropies and sample compression

The aim of this paper is to provide several novel upper bounds on the excess risk with a primal focus on classification problems. We suggest two approaches and the obtained bounds are represented via the distribution dependent local…

Statistics Theory · Mathematics 2018-03-13 Nikita Zhivotovskiy

Beyond the EM Algorithm: Constrained Optimization Methods for Latent Class Model

Latent class model (LCM), which is a finite mixture of different categorical distributions, is one of the most widely used models in statistics and machine learning fields. Because of its non-continuous nature and the flexibility in shape,…

Machine Learning · Statistics 2021-03-23 Hao Chen , Lanshan Han , Alvin Lim

Efficient Learning of Minimax Risk Classifiers in High Dimensions

High-dimensional data is common in multiple areas, such as health care and genomics, where the number of features can be tens of thousands. In such scenarios, the large number of features often leads to inefficient learning. Constraint…

Machine Learning · Statistics 2023-06-13 Kartheek Bondugula , Santiago Mazuelas , Aritz Pérez

Implicitly Constrained Semi-Supervised Linear Discriminant Analysis

Semi-supervised learning is an important and active topic of research in pattern recognition. For classification using linear discriminant analysis specifically, several semi-supervised variants have been proposed. Using any one of these…

Machine Learning · Statistics 2014-11-18 Jesse H. Krijthe , Marco Loog

Group zero-norm regularized robust loss minimization: proximal MM method and statistical error bound

This study focuses on solving group zero-norm regularized robust loss minimization problems. We propose a proximal Majorization-Minimization (PMM) algorithm to address a class of equivalent Difference-of-Convex (DC) surrogate optimization…

Optimization and Control · Mathematics 2025-05-30 Ling Liang , Shujun Bi

Regularization via Mass Transportation

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce…

Optimization and Control · Mathematics 2019-07-15 Soroosh Shafieezadeh-Abadeh , Daniel Kuhn , Peyman Mohajerin Esfahani