GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

Zhao Chen; Vincent Casser; Henrik Kretzschmar; Dragomir Anguelov

GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

Machine Learning 2022-01-20 v2 Computer Vision and Pattern Recognition

Authors: Zhao Chen , Vincent Casser , Henrik Kretzschmar , Dragomir Anguelov

Abstract

We propose GradTail, an algorithm that uses gradients to improve model performance on the fly in the face of long-tailed training data distributions. Unlike conventional long-tail classifiers which operate on converged - and possibly overfit - models, we demonstrate that an approach based on gradient dot product agreement can isolate long-tailed data early on during model training and improve performance by dynamically picking higher sample weights for that data. We show that such upweighting leads to model improvements for both classification and regression models, the latter of which are relatively unexplored in the long-tail literature, and that the long-tail examples found by gradient alignment are consistent with our semantic expectations.

Keywords

long-tailed recognition machine learning theory neural network training

Cite

@article{arxiv.2201.05938,
  title  = {GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting},
  author = {Zhao Chen and Vincent Casser and Henrik Kretzschmar and Dragomir Anguelov},
  journal= {arXiv preprint arXiv:2201.05938},
  year   = {2022}
}

Comments

15 pages (including Appendix), 8 figures

GradTail: Learning Long-Tailed Data Using Gradient-based Sample Weighting

Abstract

Keywords

Cite

Comments

Related papers