Loss Functions for Classification using Structured Entropy

Brian Lucena

Loss Functions for Classification using Structured Entropy

Machine Learning 2022-06-16 v1 Computer Vision and Pattern Recognition Information Theory Machine Learning math.IT

Authors: Brian Lucena

Abstract

Cross-entropy loss is the standard metric used to train classification models in deep learning and gradient boosting. It is well-known that this loss function fails to account for similarities between the different values of the target. We propose a generalization of entropy called {\em structured entropy} which uses a random partition to incorporate the structure of the target variable in a manner which retains many theoretical properties of standard entropy. We show that a structured cross-entropy loss yields better results on several classification problems where the target variable has an a priori known structure. The approach is simple, flexible, easily computable, and does not rely on a hierarchically defined notion of structure.

Keywords

deep learning statistical learning theory multi-task learning

Cite

@article{arxiv.2206.07122,
  title  = {Loss Functions for Classification using Structured Entropy},
  author = {Brian Lucena},
  journal= {arXiv preprint arXiv:2206.07122},
  year   = {2022}
}

Related papers

View all related →