Related papers: A Tunable Loss Function for Binary Classification

A Tunable Loss Function for Robust Classification: Calibration, Landscape, and Generalization

We introduce a tunable loss function called $\alpha$-loss, parameterized by $\alpha \in (0,\infty]$, which interpolates between the exponential loss ($\alpha = 1/2$), the log-loss ($\alpha = 1$), and the 0-1 loss ($\alpha = \infty$), for…

Machine Learning · Computer Science 2022-12-22 Tyler Sypherd, Mario Diaz, John Kevin Cava, Gautam Dasarathy, Peter Kairouz, Lalitha Sankar
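
To make the interpolation claimed in the abstract above concrete, here is a minimal sketch of $\alpha$-loss in its margin-based form. The abstract shown here is truncated and gives no formula, so the closed form below is an assumption based on the definition commonly given in the $\alpha$-loss literature. The function names (`alpha_loss`, `sigmoid`, `margin_alpha_loss`) are hypothetical and chosen only for illustration. Note that under this form, $\alpha = \infty$ yields the smooth quantity $1 - p$ (a soft version of the 0-1 loss), not a hard indicator.

```python
import math


def alpha_loss(p_true: float, alpha: float) -> float:
    """Assumed alpha-loss for probability p_true placed on the correct label.

    Assumed closed form (not stated in the truncated abstract):
        alpha / (alpha - 1) * (1 - p_true ** ((alpha - 1) / alpha))
    with the limits -log(p_true) at alpha = 1 and 1 - p_true at alpha = inf.
    """
    if not 0.0 < p_true <= 1.0:
        raise ValueError("p_true must lie in (0, 1]")
    if alpha <= 0.0:
        raise ValueError("alpha must lie in (0, inf]")
    if alpha == 1.0:
        return -math.log(p_true)      # log-loss
    if math.isinf(alpha):
        return 1.0 - p_true           # soft 0-1 loss
    return alpha / (alpha - 1.0) * (1.0 - p_true ** ((alpha - 1.0) / alpha))


def sigmoid(z: float) -> float:
    """Logistic link mapping a real-valued margin to a probability."""
    return 1.0 / (1.0 + math.exp(-z))


def margin_alpha_loss(margin: float, alpha: float) -> float:
    """alpha-loss as a function of the margin y * f(x), using a sigmoid link."""
    return alpha_loss(sigmoid(margin), alpha)


if __name__ == "__main__":
    z = 0.7  # an arbitrary example margin
    print(margin_alpha_loss(z, 0.5), math.exp(-z))                   # exponential loss
    print(margin_alpha_loss(z, 1.0), math.log(1.0 + math.exp(-z)))   # logistic loss
    print(margin_alpha_loss(z, math.inf), 1.0 - sigmoid(z))          # sigmoid (soft 0-1) loss
```

Each printed pair should agree up to floating-point error, which is one way to check that the three named losses are recovered at $\alpha = 1/2$, $\alpha = 1$, and $\alpha = \infty$ under the assumed form.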

An Analysis of Loss Functions for Binary Classification and Regression

This paper explores connections between margin-based loss functions and consistency in binary classification and regression applications. It is shown that a large class of margin-based loss functions for binary classification/regression…

Machine Learning · Statistics 2023-01-30 Jeffrey Buzas

On the alpha-loss Landscape in the Logistic Model

We analyze the optimization landscape of a recently introduced tunable class of loss functions called $\alpha$-loss, $\alpha \in (0,\infty]$, in the logistic model. This family encapsulates the exponential loss ($\alpha = 1/2$), the…

Machine Learning · Computer Science 2022-12-22 Tyler Sypherd, Mario Diaz, Lalitha Sankar, Gautam Dasarathy

Xtreme Margin: A Tunable Loss Function for Binary Classification Problems

Loss functions drive the optimization of machine learning algorithms. The choice of a loss function can have a significant impact on the training of a model, and how the model learns the data. Binary classification is one of the major…

Machine Learning · Computer Science 2022-11-02 Rayan Wali

On the Universality of the Logistic Loss Function

A loss function measures the discrepancy between the true values (observations) and their estimated fits, for a given instance of data. A loss function is said to be proper (unbiased, Fisher consistent) if the fits are defined over a unit…

Information Theory · Computer Science 2018-05-11 Amichai Painsky, Gregory W. Wornell

Non-parametric Binary regression in metric spaces with KL loss

We propose a non-parametric variant of binary regression, where the hypothesis is regularized to be a Lipschitz function taking a metric space to [0,1] and the loss is logarithmic. This setting presents novel computational and statistical…

Machine Learning · Computer Science 2020-10-21 Ariel Avital, Klim Efremenko, Aryeh Kontorovich, David Toplin, Bo Waggoner

Second-Order Bounds for [0,1]-Valued Regression via Betting Loss

We consider the $[0,1]$-valued regression problem in the i.i.d. setting. In a related problem called cost-sensitive classification, Foster and Krishnamurthy (2021) have shown that the log loss minimizer achieves an improved generalization bound…

Machine Learning · Computer Science 2025-07-18 Yinan Li, Kwang-Sung Jun

Composite Binary Losses

We study losses for binary classification and class probability estimation and extend the understanding of them from margin losses to general composite losses which are the composition of a proper loss with a link function. We characterise…

Machine Learning · Statistics 2009-12-18 Mark D. Reid, Robert C. Williamson

The fundamental nature of the log loss function

The standard loss functions used in the literature on probabilistic prediction are the log loss function, the Brier loss function, and the spherical loss function; however, any computable proper loss function can be used for comparison of…

Machine Learning · Computer Science 2015-06-30 Vladimir Vovk

The Central Role of the Loss Function in Reinforcement Learning

This paper illustrates the central role of loss functions in data-driven decision making, providing a comprehensive survey on their influence in cost-sensitive classification (CSC) and reinforcement learning (RL). We demonstrate how…

Machine Learning · Statistics 2025-04-07 Kaiwen Wang, Nathan Kallus, Wen Sun

On surrogate loss functions and $f$-divergences

The goal of binary classification is to estimate a discriminant function $\gamma$ from observations of covariate vectors and corresponding binary labels. We consider an elaboration of this problem in which the covariates are not available…

Statistics Theory · Mathematics 2009-09-29 XuanLong Nguyen, Martin J. Wainwright, Michael I. Jordan

Label Smoothing is Robustification against Model Misspecification

Label smoothing (LS) adopts smoothed targets in classification tasks. For example, in binary classification, instead of the one-hot target $(1,0)^\top$ used in conventional logistic regression (LR), LR with LS (LSLR) uses the smoothed…

Machine Learning · Statistics 2023-05-16 Ryoya Yamasaki, Toshiyuki Tanaka

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel, Arindam Banerjee, Yug Oswal, Geraldine Bessie Amali D, Venkataraman Muthiah-Nakarajan

Surrogate regret bounds for generalized classification performance metrics

We consider optimization of generalized performance metrics for binary classification by means of surrogate losses. We focus on a class of metrics, which are linear-fractional functions of the false positive and false negative rates…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski, Krzysztof Dembczyński

Logistic Regression: Tight Bounds for Stochastic and Online Optimization

The logistic loss function is often advocated in machine learning and statistics as a smooth and strictly convex surrogate for the 0-1 loss. In this paper we investigate the question of whether these smoothness and convexity properties make…

Machine Learning · Computer Science 2014-05-16 Elad Hazan, Tomer Koren, Kfir Y. Levy

A surrogate loss function for optimization of $F_\beta$ score in binary classification with imbalanced data

The $F_\beta$ score is a commonly used measure of classification performance, which plays crucial roles in classification tasks with imbalanced data sets. However, the $F_\beta$ score cannot be used as a loss function by gradient-based…

Machine Learning · Computer Science 2021-04-06 Namgil Lee, Heejung Yang, Hojin Yoo

Bregman Divergence Bounds and Universality Properties of the Logarithmic Loss

A loss function measures the discrepancy between the true values and their estimated fits, for a given instance of data. In classification problems, a loss function is said to be proper if a minimizer of the expected loss is the true…

Information Theory · Computer Science 2020-01-03 Amichai Painsky, Gregory W. Wornell

Loss Functions and Operators Generated by f-Divergences

The logistic loss (a.k.a. cross-entropy loss) is one of the most popular loss functions used for multiclass classification. It is also the loss function of choice for next-token prediction in language modeling. It is associated with the…

Machine Learning · Computer Science 2025-06-16 Vincent Roulet, Tianlin Liu, Nino Vieillard, Michael E. Sander, Mathieu Blondel

Implicitly Maximizing Margins with the Hinge Loss

A new loss function is proposed for neural networks on classification tasks which extends the hinge loss by assigning gradients to its critical points. We will show that for a linear classifier on linearly separable data with fixed step…

Machine Learning · Computer Science 2020-06-26 Justin Lizama

Signal to Noise Ratio Loss Function

This work proposes a new loss function targeting classification problems, utilizing a source of information overlooked by cross entropy loss. First, we derive a series of the tightest upper and lower bounds for the probability of a random…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Ali Ghobadzadeh, Amir Lashkari