Related papers: Multiclass Classification, Information, Divergence…

sigmoidF1: A Smooth F1 Score Surrogate Loss for Multilabel Classification

Multiclass multilabel classification is the task of attributing multiple labels to examples via predictions. Current models formulate a reduction of the multilabel setting into either multiple binary classifications or multiclass…

Machine Learning · Computer Science 2022-11-01 Gabriel Bénédict , Vincent Koops , Daan Odijk , Maarten de Rijke

Multiclass Classification Calibration Functions

In this paper we refine the process of computing calibration functions for a number of multiclass classification surrogate losses. Calibration functions are a powerful tool for easily converting bounds for the surrogate risk (which can be…

Machine Learning · Statistics 2016-09-22 Bernardo Ávila Pires , Csaba Szepesvári

Information, Divergence and Risk for Binary Experiments

We unify f-divergences, Bregman divergences, surrogate loss bounds (regret bounds), proper scoring rules, matching losses, cost curves, ROC-curves and information. We do this by systematically studying integral and variational…

Machine Learning · Statistics 2009-01-06 Mark D. Reid , Robert C. Williamson

Unified Binary and Multiclass Margin-Based Classification

The notion of margin loss has been central to the development and analysis of algorithms for binary classification. To date, however, there remains no consensus as to the analogue of the margin loss for multiclass classification. In this…

Machine Learning · Statistics 2024-05-20 Yutong Wang , Clayton Scott

On surrogate loss functions and $f$-divergences

The goal of binary classification is to estimate a discriminant function $\gamma$ from observations of covariate vectors and corresponding binary labels. We consider an elaboration of this problem in which the covariates are not available…

Statistics Theory · Mathematics 2009-09-29 XuanLong Nguyen , Martin J. Wainwright , Michael I. Jordan

The Convexity and Design of Composite Multiclass Losses

We consider composite loss functions for multiclass prediction comprising a proper (i.e., Fisher-consistent) loss over probability distributions and an inverse link function. We establish conditions for their (strong) convexity and explore…

Machine Learning · Computer Science 2012-06-22 Mark Reid , Robert Williamson , Peng Sun

A surrogate loss function for optimization of $F_\beta$ score in binary classification with imbalanced data

The $F_\beta$ score is a commonly used measure of classification performance, which plays crucial roles in classification tasks with imbalanced data sets. However, the $F_\beta$ score cannot be used as a loss function by gradient-based…

Machine Learning · Computer Science 2021-04-06 Namgil Lee , Heejung Yang , Hojin Yoo

Information Processing Equalities and the Information-Risk Bridge

We introduce two new classes of measures of information for statistical experiments which generalise and subsume $\phi$-divergences, integral probability metrics, $\mathfrak{N}$-distances (MMD), and $(f,\Gamma)$ divergences between two or…

Machine Learning · Computer Science 2023-09-11 Robert C. Williamson , Zac Cranko

Intra-Class Uncertainty Loss Function for Classification

Most classification models can be considered as the process of matching templates. However, when intra-class uncertainty/variability is not considered, especially for datasets containing unbalanced classes, this may lead to classification…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 He Zhu , Shan Yu

Convex Calibration Dimension for Multiclass Loss Matrices

We study consistency properties of surrogate loss functions for general multiclass learning problems, defined by a general multiclass loss matrix. We extend the notion of classification calibration, which has been studied for binary and…

Machine Learning · Computer Science 2015-08-25 Harish G. Ramaswamy , Shivani Agarwal

Theory and Algorithms for Learning with Multi-Class Abstention and Multi-Expert Deferral

Large language models (LLMs) have achieved remarkable performance but face critical challenges: hallucinations and high inference costs. Leveraging multiple experts offers a solution: deferring uncertain inputs to more capable experts…

Machine Learning · Computer Science 2025-12-30 Anqi Mao

$f$-Divergence Based Classification: Beyond the Use of Cross-Entropy

In deep learning, classification tasks are formalized as optimization problems often solved via the minimization of the cross-entropy. However, recent advancements in the design of objective functions allow the usage of the $f$-divergence…

Machine Learning · Computer Science 2024-05-17 Nicola Novello , Andrea M. Tonello

On loss functions and regret bounds for multi-category classification

We develop new approaches in multi-class settings for constructing proper scoring rules and hinge-like losses and establishing corresponding regret bounds with respect to the zero-one or cost-weighted classification loss. Our construction…

Statistics Theory · Mathematics 2021-05-18 Zhiqiang Tan , Xinwei Zhang

Consistent Multilabel Ranking through Univariate Losses

We consider the problem of rank loss minimization in the setting of multilabel classification, which is usually tackled by means of convex surrogate losses defined on pairs of labels. Very recently, this approach was put into question by a…

Machine Learning · Computer Science 2012-07-03 Krzysztof Dembczynski , Wojciech Kotlowski , Eyke Huellermeier

On the Calibration of Multiclass Classification with Rejection

We investigate the problem of multiclass classification with rejection, where a classifier can choose not to make a prediction to avoid critical misclassification. First, we consider an approach based on simultaneous training of a…

Machine Learning · Statistics 2019-10-31 Chenri Ni , Nontawat Charoenphakdee , Junya Honda , Masashi Sugiyama

Multi-Label Learning with Stronger Consistency Guarantees

We present a detailed study of surrogate losses and algorithms for multi-label learning, supported by $H$-consistency bounds. We first show that, for the simplest form of multi-label loss (the popular Hamming loss), the well-known…

Machine Learning · Computer Science 2024-07-19 Anqi Mao , Mehryar Mohri , Yutao Zhong

Multi-Label Contrastive Learning : A Comprehensive Study

Multi-label classification, which involves assigning multiple labels to a single input, has emerged as a key area in both research and industry due to its wide-ranging applications. Designing effective loss functions is crucial for…

Machine Learning · Computer Science 2025-01-06 Alexandre Audibert , Aurélien Gauffre , Massih-Reza Amini

An In-Depth Examination of Risk Assessment in Multi-Class Classification Algorithms

Advanced classification algorithms are being increasingly used in safety-critical applications like health-care, engineering, etc. In such applications, miss-classifications made by ML algorithms can result in substantial financial or…

Machine Learning · Computer Science 2024-12-06 Disha Ghandwani , Neeraj Sarna , Yuanyuan Li , Yang Lin

Balancing Methods for Multi-label Text Classification with Long-Tailed Class Distribution

Multi-label text classification is a challenging task because it requires capturing label dependencies. It becomes even more challenging when class distribution is long-tailed. Resampling and re-weighting are common approaches used for…

Computation and Language · Computer Science 2021-10-19 Yi Huang , Buse Giledereli , Abdullatif Köksal , Arzucan Özgür , Elif Ozkirimli

Revisiting Discriminative vs. Generative Classifiers: Theory and Implications

A large-scale deep model pre-trained on massive labeled or unlabeled data transfers well to downstream tasks. Linear evaluation freezes parameters in the pre-trained model and trains a linear classifier separately, which is efficient and…

Machine Learning · Computer Science 2023-05-30 Chenyu Zheng , Guoqiang Wu , Fan Bao , Yue Cao , Chongxuan Li , Jun Zhu