English
Related papers

Related papers: A Tunable Loss Function for Binary Classification

200 papers

We introduce a tunable loss function called $\alpha$-loss, parameterized by $\alpha \in (0,\infty]$, which interpolates between the exponential loss ($\alpha = 1/2$), the log-loss ($\alpha = 1$), and the 0-1 loss ($\alpha = \infty$), for…

Machine Learning · Computer Science 2022-12-22 Tyler Sypherd , Mario Diaz , John Kevin Cava , Gautam Dasarathy , Peter Kairouz , Lalitha Sankar

This paper explores connections between margin-based loss functions and consistency in binary classification and regression applications. It is shown that a large class of margin-based loss functions for binary classification/regression…

Machine Learning · Statistics 2023-01-30 Jeffrey Buzas

We analyze the optimization landscape of a recently introduced tunable class of loss functions called $\alpha$-loss, $\alpha \in (0,\infty]$, in the logistic model. This family encapsulates the exponential loss ($\alpha = 1/2$), the…

Machine Learning · Computer Science 2022-12-22 Tyler Sypherd , Mario Diaz , Lalitha Sankar , Gautam Dasarathy

Loss functions drive the optimization of machine learning algorithms. The choice of a loss function can have a significant impact on the training of a model, and how the model learns the data. Binary classification is one of the major…

Machine Learning · Computer Science 2022-11-02 Rayan Wali

A loss function measures the discrepancy between the true values (observations) and their estimated fits, for a given instance of data. A loss function is said to be proper (unbiased, Fisher consistent) if the fits are defined over a unit…

Information Theory · Computer Science 2018-05-11 Amichai Painsky , Gregory W. Wornell

We propose a non-parametric variant of binary regression, where the hypothesis is regularized to be a Lipschitz function taking a metric space to [0,1] and the loss is logarithmic. This setting presents novel computational and statistical…

Machine Learning · Computer Science 2020-10-21 Ariel Avital , Klim Efremenko , Aryeh Kontorovich , David Toplin , Bo Waggoner

We consider the $[0,1]$-valued regression problem in the i.i.d. setting. In a related problem called cost-sensitive classification, \citet{foster21efficient} have shown that the log loss minimizer achieves an improved generalization bound…

Machine Learning · Computer Science 2025-07-18 Yinan Li , Kwang-Sung Jun

We study losses for binary classification and class probability estimation and extend the understanding of them from margin losses to general composite losses which are the composition of a proper loss with a link function. We characterise…

Machine Learning · Statistics 2009-12-18 Mark D. Reid , Robert C. Williamson

The standard loss functions used in the literature on probabilistic prediction are the log loss function, the Brier loss function, and the spherical loss function; however, any computable proper loss function can be used for comparison of…

Machine Learning · Computer Science 2015-06-30 Vladimir Vovk

This paper illustrates the central role of loss functions in data-driven decision making, providing a comprehensive survey on their influence in cost-sensitive classification (CSC) and reinforcement learning (RL). We demonstrate how…

Machine Learning · Statistics 2025-04-07 Kaiwen Wang , Nathan Kallus , Wen Sun

The goal of binary classification is to estimate a discriminant function $\gamma$ from observations of covariate vectors and corresponding binary labels. We consider an elaboration of this problem in which the covariates are not available…

Statistics Theory · Mathematics 2009-09-29 XuanLong Nguyen , Martin J. Wainwright , Michael I. Jordan

Label smoothing (LS) adopts smoothed targets in classification tasks. For example, in binary classification, instead of the one-hot target $(1,0)^\top$ used in conventional logistic regression (LR), LR with LS (LSLR) uses the smoothed…

Machine Learning · Statistics 2023-05-16 Ryoya Yamasaki , Toshiyuki Tanaka

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

We consider optimization of generalized performance metrics for binary classification by means of surrogate losses. We focus on a class of metrics, which are linear-fractional functions of the false positive and false negative rates…

Machine Learning · Computer Science 2016-10-10 Wojciech Kotłowski , Krzysztof Dembczyński

The logistic loss function is often advocated in machine learning and statistics as a smooth and strictly convex surrogate for the 0-1 loss. In this paper we investigate the question of whether these smoothness and convexity properties make…

Machine Learning · Computer Science 2014-05-16 Elad Hazan , Tomer Koren , Kfir Y. Levy

The $F_\beta$ score is a commonly used measure of classification performance, which plays crucial roles in classification tasks with imbalanced data sets. However, the $F_\beta$ score cannot be used as a loss function by gradient-based…

Machine Learning · Computer Science 2021-04-06 Namgil Lee , Heejung Yang , Hojin Yoo

A loss function measures the discrepancy between the true values and their estimated fits, for a given instance of data. In classification problems, a loss function is said to be proper if a minimizer of the expected loss is the true…

Information Theory · Computer Science 2020-01-03 Amichai Painsky , Gregory W. Wornell

The logistic loss (a.k.a. cross-entropy loss) is one of the most popular loss functions used for multiclass classification. It is also the loss function of choice for next-token prediction in language modeling. It is associated with the…

Machine Learning · Computer Science 2025-06-16 Vincent Roulet , Tianlin Liu , Nino Vieillard , Michael E. Sander , Mathieu Blondel

A new loss function is proposed for neural networks on classification tasks which extends the hinge loss by assigning gradients to its critical points. We will show that for a linear classifier on linearly separable data with fixed step…

Machine Learning · Computer Science 2020-06-26 Justin Lizama

This work proposes a new loss function targeting classification problems, utilizing a source of information overlooked by cross entropy loss. First, we derive a series of the tightest upper and lower bounds for the probability of a random…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Ali Ghobadzadeh , Amir Lashkari
‹ Prev 1 2 3 10 Next ›