English

Diverse mini-batch Active Learning

Machine Learning 2019-01-18 v1 Machine Learning

Abstract

We study the problem of reducing the amount of labeled training data required to train supervised classification models. We approach it by leveraging Active Learning, through sequential selection of examples which benefit the model most. Selecting examples one by one is not practical for the amount of training examples required by the modern Deep Learning models. We consider the mini-batch Active Learning setting, where several examples are selected at once. We present an approach which takes into account both informativeness of the examples for the model, as well as the diversity of the examples in a mini-batch. By using the well studied K-means clustering algorithm, this approach scales better than the previously proposed approaches, and achieves comparable or better performance.

Keywords

Cite

@article{arxiv.1901.05954,
  title  = {Diverse mini-batch Active Learning},
  author = {Fedor Zhdanov},
  journal= {arXiv preprint arXiv:1901.05954},
  year   = {2019}
}