English

Continuous Weight Balancing

Machine Learning 2021-04-01 v1 Artificial Intelligence

Abstract

We propose a simple method by which to choose sample weights for problems with highly imbalanced or skewed traits. Rather than naively discretizing regression labels to find binned weights, we take a more principled approach -- we derive sample weights from the transfer function between an estimated source and specified target distributions. Our method outperforms both unweighted and discretely-weighted models on both regression and classification tasks. We also open-source our implementation of this method (https://github.com/Daniel-Wu/Continuous-Weight-Balancing) to the scientific community.

Keywords

Cite

@article{arxiv.2103.16591,
  title  = {Continuous Weight Balancing},
  author = {Daniel J. Wu and Avoy Datta},
  journal= {arXiv preprint arXiv:2103.16591},
  year   = {2021}
}

Comments

4 pages, 2 figures, presented at the S2D-OLAD workshop at ICLR 2021