Related papers: On Expected Accuracy

Mixing between the Cross Entropy and the Expectation Loss Terms

The cross entropy loss is widely used due to its effectiveness and solid theoretical grounding. However, as training progresses, the loss tends to focus on hard to classify samples, which may prevent the network from obtaining gains in…

Machine Learning · Computer Science 2021-09-14 Barak Battash , Lior Wolf , Tamir Hazan

Enhancing Cross Entropy with a Linearly Adaptive Loss Function for Optimized Classification Performance

We propose the Linearly Adaptive Cross Entropy Loss function. This is a novel measure derived from the information theory. In comparison to the standard cross entropy loss function, the proposed one has an additional term that depends on…

Machine Learning · Computer Science 2025-07-16 Jae Wan Shim

Anchor Loss: Modulating Loss Scale based on Prediction Difficulty

We propose a novel loss function that dynamically rescales the cross entropy based on prediction difficulty regarding a sample. Deep neural network architectures in image classification tasks struggle to disambiguate visually similar…

Computer Vision and Pattern Recognition · Computer Science 2019-09-26 Serim Ryou , Seong-Gyun Jeong , Pietro Perona

Signal to Noise Ratio Loss Function

This work proposes a new loss function targeting classification problems, utilizing a source of information overlooked by cross entropy loss. First, we derive a series of the tightest upper and lower bounds for the probability of a random…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Ali Ghobadzadeh , Amir Lashkari

Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

In deep neural network, the cross-entropy loss function is commonly used for classification. Minimizing cross-entropy is equivalent to maximizing likelihood under assumptions of uniform feature and class distributions. It belongs to…

Machine Learning · Computer Science 2018-05-01 Donglai Zhu , Hengshuai Yao , Bei Jiang , Peng Yu

Being Bayesian about Categorical Probability

Neural networks utilize the softmax as a building block in classification tasks, which contains an overconfidence problem and lacks an uncertainty representation ability. As a Bayesian alternative to the softmax, we consider a random…

Machine Learning · Computer Science 2020-06-30 Taejong Joo , Uijung Chung , Min-Gwan Seo

Loss Functions for Classification using Structured Entropy

Cross-entropy loss is the standard metric used to train classification models in deep learning and gradient boosting. It is well-known that this loss function fails to account for similarities between the different values of the target. We…

Machine Learning · Statistics 2022-06-16 Brian Lucena

EXACT: How to Train Your Accuracy

Classification tasks are usually evaluated in terms of accuracy. However, accuracy is discontinuous and cannot be directly optimized using gradient ascent. Popular methods minimize cross-entropy, hinge loss, or other surrogate losses, which…

Machine Learning · Computer Science 2024-07-25 Ivan Karpukhin , Stanislav Dereka , Sergey Kolesnikov

Towards noise contrastive estimation with soft targets for conditional models

Soft targets combined with the cross-entropy loss have shown to improve generalization performance of deep neural networks on supervised classification tasks. The standard cross-entropy loss however assumes data to be categorically…

Machine Learning · Computer Science 2024-07-16 Johannes Hugger , Virginie Uhlmann

Cross-Entropy Loss Functions: Theoretical Analysis and Applications

Cross-entropy is a widely used loss function in applications. It coincides with the logistic loss applied to the outputs of a neural network, when the softmax is used. But, what guarantees can we rely on when using cross-entropy as a…

Machine Learning · Computer Science 2023-06-21 Anqi Mao , Mehryar Mohri , Yutao Zhong

Calibrating Segmentation Networks with Margin-based Label Smoothing

Despite the undeniable progress in visual recognition tasks fueled by deep neural networks, there exists recent evidence showing that these models are poorly calibrated, resulting in over-confident predictions. The standard practices of…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Balamurali Murugesan , Bingyuan Liu , Adrian Galdran , Ismail Ben Ayed , Jose Dolz

Alternate Loss Functions for Classification and Robust Regression Can Improve the Accuracy of Artificial Neural Networks

All machine learning algorithms use a loss, cost, utility or reward function to encode the learning objective and oversee the learning process. This function that supervises learning is a frequently unrecognized hyperparameter that…

Neural and Evolutionary Computing · Computer Science 2024-11-06 Mathew Mithra Noel , Arindam Banerjee , Yug Oswal , Geraldine Bessie Amali D , Venkataraman Muthiah-Nakarajan

Using Error Decay Prediction to Overcome Practical Issues of Deep Active Learning for Named Entity Recognition

Existing deep active learning algorithms achieve impressive sampling efficiency on natural language processing tasks. However, they exhibit several weaknesses in practice, including (a) inability to use uncertainty sampling with black-box…

Computation and Language · Computer Science 2020-07-22 Haw-Shiuan Chang , Shankar Vembu , Sunil Mohan , Rheeya Uppaal , Andrew McCallum

Exploring Alternatives to Softmax Function

Softmax function is widely used in artificial neural networks for multiclass classification, multilabel classification, attention mechanisms, etc. However, its efficacy is often questioned in literature. The log-softmax loss has been shown…

Machine Learning · Computer Science 2020-11-24 Kunal Banerjee , Vishak Prasad C , Rishi Raj Gupta , Karthik Vyas , Anushree H , Biswajit Mishra

Variational Classification

We present a latent variable model for classification that provides a novel probabilistic interpretation of neural network softmax classifiers. We derive a variational objective to train the model, analogous to the evidence lower bound…

Machine Learning · Computer Science 2024-01-10 Shehzaad Dhuliawala , Mrinmaya Sachan , Carl Allen

Noise-Sampling Cross Entropy Loss: Improving Disparity Regression Via Cost Volume Aware Regularizer

Recent end-to-end deep neural networks for disparity regression have achieved the state-of-the-art performance. However, many well-acknowledged specific properties of disparity estimation are omitted in these deep learning algorithms.…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Yang Chen , Zongqing Lu , Xuechen Zhang , Lei Chen , Qingmin Liao

Deep Neural Network-aided Soft-Demapping in Optical Coherent Systems: Regression versus Classification

We examine here what type of predictive modelling, classification, or regression, using neural networks (NN), fits better the task of soft-demapping based post-processing in coherent optical communications, where the transmission channel is…

Signal Processing · Electrical Eng. & Systems 2022-08-23 Pedro J. Freire , Jaroslaw E. Prilepsky , Yevhenii Osadchuk , Sergei K. Turitsyn , Vahid Aref

Imbalanced Image Classification with Complement Cross Entropy

Recently, deep learning models have achieved great success in computer vision applications, relying on large-scale class-balanced datasets. However, imbalanced class distributions still limit the wide applicability of these models due to…

Computer Vision and Pattern Recognition · Computer Science 2021-08-05 Yechan Kim , Younkwan Lee , Moongu Jeon

Calibration of Ordinal Regression Networks

Recent studies have shown that deep neural networks are not well-calibrated and often produce over-confident predictions. The miscalibration issue primarily stems from using cross-entropy in classifications, which aims to align predicted…

Machine Learning · Computer Science 2025-02-05 Daehwan Kim , Haejun Chung , Ikbeom Jang

Unified Interpretation of Softmax Cross-Entropy and Negative Sampling: With Case Study for Knowledge Graph Embedding

In knowledge graph embedding, the theoretical relationship between the softmax cross-entropy and negative sampling loss functions has not been investigated. This makes it difficult to fairly compare the results of the two different loss…

Machine Learning · Computer Science 2022-03-17 Hidetaka Kamigaito , Katsuhiko Hayashi