Related papers: Information, Divergence and Risk for Binary Experi…

Multiclass Classification, Information, Divergence, and Surrogate Risk

We provide a unifying view of statistical information measures, multi-way Bayesian hypothesis testing, loss functions for multi-class classification problems, and multi-distribution $f$-divergences, elaborating equivalence results between…

Statistics Theory · Mathematics 2017-09-12 John C. Duchi, Khashayar Khosravi, Feng Ruan

Calibrated Surrogate Losses for Classification with Label-Dependent Costs

We present surrogate regret bounds for arbitrary surrogate losses in the context of binary classification with label-dependent costs. Such bounds relate a classifier's risk, assessed with respect to a surrogate loss, to its cost-sensitive…

Machine Learning · Statistics 2010-09-15 Clayton Scott

Composite Binary Losses

We study losses for binary classification and class probability estimation and extend the understanding of them from margin losses to general composite losses which are the composition of a proper loss with a link function. We characterise…

Machine Learning · Statistics 2009-12-18 Mark D. Reid, Robert C. Williamson

Surrogate regret bounds for generalized classification performance metrics

We consider optimization of generalized performance metrics for binary classification by means of surrogate losses. We focus on a class of metrics, which are linear-fractional functions of the false positive and false negative rates…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski, Krzysztof Dembczyński

On the Error Resistance of Hinge Loss Minimization

Commonly used classification algorithms in machine learning, such as support vector machines, minimize a convex surrogate loss on training examples. In practice, these algorithms are surprisingly robust to errors in the training data. In…

Machine Learning · Computer Science 2020-12-03 Kunal Talwar

Adversarial Surrogate Risk Bounds for Binary Classification

A central concern in classification is the vulnerability of machine learning models to adversarial attacks. Adversarial training is one of the most popular techniques for training robust classifiers, which involves minimizing an adversarial…

Machine Learning · Computer Science 2025-10-09 Natalie S. Frank

Unifying Lower Bounds on Prediction Dimension of Consistent Convex Surrogates

Given a prediction task, understanding when one can and cannot design a consistent convex surrogate loss, particularly a low-dimensional one, is an important and active area of machine learning research. The prediction task may be given as…

Machine Learning · Computer Science 2021-02-17 Jessie Finocchiaro, Rafael Frongillo, Bo Waggoner

A Note on Reverse Pinsker Inequalities

A simple method is shown to provide optimal variational bounds on $f$-divergences with possible constraints on relative information extremums. Known results are refined or proved to be optimal as particular cases.

Information Theory · Computer Science 2019-02-05 Olivier Binette

Constrained Classification and Policy Learning

Modern machine learning approaches to classification, including AdaBoost, support vector machines, and deep neural networks, utilize surrogate loss techniques to circumvent the computational complexity of minimizing empirical classification…

Econometrics · Economics 2023-07-26 Toru Kitagawa, Shosei Sakaguchi, Aleksey Tetenov

Structure-aware error bounds for linear classification with the zero-one loss

We prove risk bounds for binary classification in high-dimensional settings when the sample size is allowed to be smaller than the dimensionality of the training set observations. In particular, we prove upper bounds for both 'compressive…

Statistics Theory · Mathematics 2017-09-29 Ata Kaban, Robert J. Durrant

Fast Large-Scale Discrete Optimization Based on Principal Coordinate Descent

Binary optimization, a representative subclass of discrete optimization, plays an important role in mathematical optimization and has various applications in computer vision and machine learning. Usually, binary optimization problems are…

Optimization and Control · Mathematics 2021-05-18 Huan Xiong, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Empirically Estimable Classification Bounds Based on a New Divergence Measure

Information divergence functions play a critical role in statistics and information theory. In this paper we show that a non-parametric f-divergence measure can be used to provide improved bounds on the minimum binary classification…

Information Theory · Computer Science 2015-02-11 Visar Berisha, Alan Wisler, Alfred O. Hero, Andreas Spanias

Minimizing The Misclassification Error Rate Using a Surrogate Convex Loss

We carefully study how well minimizing convex surrogate loss functions corresponds to minimizing the misclassification error rate for the problem of binary classification with linear predictors. In particular, we show that amongst all…

Machine Learning · Computer Science 2012-07-03 Shai Ben-David, David Loker, Nathan Srebro, Karthik Sridharan

Convergence Rates for Empirical Estimation of Binary Classification Bounds

Bounding the best achievable error probability for binary classification problems is relevant to many applications including machine learning, signal processing, and information theory. Many bounds on the Bayes binary classification error…

Information Theory · Computer Science 2018-10-03 Salimeh Yasaei Sekeh, Morteza Noshad, Kevin R. Moon, Alfred O. Hero

Fundamental Novel Consistency Theory: $H$-Consistency Bounds

In machine learning, the loss functions optimized during training often differ from the target loss that defines task performance due to computational intractability or lack of differentiability. We present an in-depth study of the target…

Machine Learning · Computer Science 2025-12-30 Yutao Zhong

On $f$-Divergences: Integral Representations, Local Behavior, and Inequalities

This paper is focused on $f$-divergences, consisting of three main contributions. The first one introduces integral representations of a general $f$-divergence by means of the relative information spectrum. The second part provides a new…

Information Theory · Computer Science 2018-07-04 Igal Sason

A Minimax Surrogate Loss Approach to Conditional Difference Estimation

We present a new machine learning approach to estimate personalized treatment effects in the classical potential outcomes framework with binary outcomes. To overcome the problem that both treatment and control outcomes for the same unit are…

Machine Learning · Statistics 2018-05-07 Siong Thye Goh, Cynthia Rudin

BreGMN: scaled-Bregman Generative Modeling Networks

The family of f-divergences is ubiquitously applied to generative modeling in order to adapt the distribution of the model to that of the data. Well-definedness of f-divergences, however, requires the distributions of the data and model to…

Machine Learning · Statistics 2019-06-04 Akash Srivastava, Kristjan Greenewald, Farzaneh Mirzazadeh

The Adversarial Consistency of Surrogate Risks for Binary Classification

We study the consistency of surrogate risks for robust binary classification. It is common to learn robust classifiers by adversarial training, which seeks to minimize the expected $0$-$1$ loss when each example can be maliciously corrupted…

Machine Learning · Computer Science 2025-10-09 Natalie Frank, Jonathan Niles-Weed

Large-Margin Classification with Multiple Decision Rules

Binary classification is a common statistical learning problem in which a model is estimated on a set of covariates for some outcome indicating the membership of one of two classes. In the literature, there exists a distinction between hard…

Machine Learning · Statistics 2014-11-20 Patrick K. Kimes, D. Neil Hayes, J. S. Marron, Yufeng Liu