Related papers: Loss Functions for Classification using Structured…

On the relation between Loss Functions and T-Norms

Deep learning has been shown to achieve impressive results in several domains like computer vision and natural language processing. A key element of this success has been the development of new loss functions, like the popular cross-entropy…

Machine Learning · Computer Science 2019-07-19 Francesco Giannini , Giuseppe Marra , Michelangelo Diligenti , Marco Maggini , Marco Gori

Mixing between the Cross Entropy and the Expectation Loss Terms

The cross entropy loss is widely used due to its effectiveness and solid theoretical grounding. However, as training progresses, the loss tends to focus on hard to classify samples, which may prevent the network from obtaining gains in…

Machine Learning · Computer Science 2021-09-14 Barak Battash , Lior Wolf , Tamir Hazan

Improving Deep Regression with Ordinal Entropy

In computer vision, it is often observed that formulating regression problems as a classification task often yields better performance. We investigate this curious phenomenon and provide a derivation to show that classification, with the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Shihao Zhang , Linlin Yang , Michael Bi Mi , Xiaoxu Zheng , Angela Yao

Enhancing Cross Entropy with a Linearly Adaptive Loss Function for Optimized Classification Performance

We propose the Linearly Adaptive Cross Entropy Loss function. This is a novel measure derived from the information theory. In comparison to the standard cross entropy loss function, the proposed one has an additional term that depends on…

Machine Learning · Computer Science 2025-07-16 Jae Wan Shim

Evaluation of Neural Architectures Trained with Square Loss vs Cross-Entropy in Classification Tasks

Modern neural architectures for classification tasks are trained using the cross-entropy loss, which is widely believed to be empirically superior to the square loss. In this work we provide evidence indicating that this belief may not be…

Machine Learning · Computer Science 2021-10-26 Like Hui , Mikhail Belkin

Cut your Losses with Squentropy

Nearly all practical neural models for classification are trained using cross-entropy loss. Yet this ubiquitous choice is supported by little theoretical or empirical evidence. Recent work (Hui & Belkin, 2020) suggests that training using…

Machine Learning · Computer Science 2023-02-09 Like Hui , Mikhail Belkin , Stephen Wright

Entropy-based Guidance of Deep Neural Networks for Accelerated Convergence and Improved Performance

Neural networks have dramatically increased our capacity to learn from large, high-dimensional datasets across innumerable disciplines. However, their decisions are not easily interpretable, their computational costs are high, and building…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Mackenzie J. Meni , Ryan T. White , Michael Mayo , Kevin Pilkiewicz

Uses and Abuses of the Cross-Entropy Loss: Case Studies in Modern Deep Learning

Modern deep learning is primarily an experimental science, in which empirical advances occasionally come at the expense of probabilistic rigor. Here we focus on one such example; namely the use of the categorical cross-entropy loss to model…

Machine Learning · Statistics 2020-11-11 Elliott Gordon-Rodriguez , Gabriel Loaiza-Ganem , Geoff Pleiss , John P. Cunningham

A unifying mutual information view of metric learning: cross-entropy vs. pairwise losses

Recently, substantial research efforts in Deep Metric Learning (DML) focused on designing complex pairwise-distance losses, which require convoluted schemes to ease optimization, such as sample mining or pair weighting. The standard…

Machine Learning · Computer Science 2021-11-29 Malik Boudiaf , Jérôme Rony , Imtiaz Masud Ziko , Eric Granger , Marco Pedersoli , Pablo Piantanida , Ismail Ben Ayed

Cross-Entropy Loss Functions: Theoretical Analysis and Applications

Cross-entropy is a widely used loss function in applications. It coincides with the logistic loss applied to the outputs of a neural network, when the softmax is used. But, what guarantees can we rely on when using cross-entropy as a…

Machine Learning · Computer Science 2023-06-21 Anqi Mao , Mehryar Mohri , Yutao Zhong

PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions

Cross-entropy loss and focal loss are the most common choices when training deep neural networks for classification problems. Generally speaking, however, a good loss function can take on much more flexible forms, and should be tailored for…

Computer Vision and Pattern Recognition · Computer Science 2022-05-12 Zhaoqi Leng , Mingxing Tan , Chenxi Liu , Ekin Dogus Cubuk , Xiaojie Shi , Shuyang Cheng , Dragomir Anguelov

CC-Loss: Channel Correlation Loss For Image Classification

The loss function is a key component in deep learning models. A commonly used loss function for classification is the cross entropy loss, which is a simple yet effective application of information theory for classification problems. Based…

Computer Vision and Pattern Recognition · Computer Science 2020-10-13 Zeyu Song , Dongliang Chang , Zhanyu Ma , Xiaoxu Li , Zheng-Hua Tan

Learning to Learn and Predict: A Meta-Learning Approach for Multi-Label Classification

Many tasks in natural language processing can be viewed as multi-label classification problems. However, most of the existing models are trained with the standard cross-entropy loss function and use a fixed prediction policy (e.g., a…

Computation and Language · Computer Science 2019-09-11 Jiawei Wu , Wenhan Xiong , William Yang Wang

A Surprising Linear Relationship Predicts Test Performance in Deep Networks

Given two networks with the same training loss on a dataset, when would they have drastically different test losses and errors? Better understanding of this question of generalization may improve practical applications of deep networks. In…

Machine Learning · Computer Science 2018-07-26 Qianli Liao , Brando Miranda , Andrzej Banburski , Jack Hidary , Tomaso Poggio

Cross-Entropy Loss and Low-Rank Features Have Responsibility for Adversarial Examples

State-of-the-art neural networks are vulnerable to adversarial examples; they can easily misclassify inputs that are imperceptibly different than their training and test data. In this work, we establish that the use of cross-entropy loss…

Machine Learning · Computer Science 2019-01-25 Kamil Nar , Orhan Ocal , S. Shankar Sastry , Kannan Ramchandran

Negative Log Likelihood Ratio Loss for Deep Neural Network Classification

In deep neural network, the cross-entropy loss function is commonly used for classification. Minimizing cross-entropy is equivalent to maximizing likelihood under assumptions of uniform feature and class distributions. It belongs to…

Machine Learning · Computer Science 2018-05-01 Donglai Zhu , Hengshuai Yao , Bei Jiang , Peng Yu

Using Focal Loss to Fight Shallow Heuristics: An Empirical Analysis of Modulated Cross-Entropy in Natural Language Inference

There is no such thing as a perfect dataset. In some datasets, deep neural networks discover underlying heuristics that allow them to take shortcuts in the learning process, resulting in poor generalization capability. Instead of using…

Computation and Language · Computer Science 2022-11-28 Frano Rajič , Ivan Stresec , Axel Marmet , Tim Poštuvan

SimLoss: Class Similarities in Cross Entropy

One common loss function in neural network classification tasks is Categorical Cross Entropy (CCE), which punishes all misclassifications equally. However, classes often have an inherent structure. For instance, classifying an image of a…

Machine Learning · Computer Science 2020-03-09 Konstantin Kobs , Michael Steininger , Albin Zehe , Florian Lautenschlager , Andreas Hotho

Multi-View Non-negative Matrix Factorization Discriminant Learning via Cross Entropy Loss

Multi-view learning accomplishes the task objectives of classification by leverag-ing the relationships between different views of the same object. Most existing methods usually focus on consistency and complementarity between multiple…

Machine Learning · Computer Science 2022-01-14 Jian-wei Liu , Yuan-fang Wang , Run-kun Lu , Xionglin Luo

Towards noise contrastive estimation with soft targets for conditional models

Soft targets combined with the cross-entropy loss have shown to improve generalization performance of deep neural networks on supervised classification tasks. The standard cross-entropy loss however assumes data to be categorically…

Machine Learning · Computer Science 2024-07-16 Johannes Hugger , Virginie Uhlmann