English

SMART: An Open Source Data Labeling Platform for Supervised Learning

Machine Learning 2023-06-26 v1 Machine Learning

Abstract

SMART is an open source web application designed to help data scientists and research teams efficiently build labeled training data sets for supervised machine learning tasks. SMART provides users with an intuitive interface for creating labeled data sets, supports active learning to help reduce the required amount of labeled data, and incorporates inter-rater reliability statistics to provide insight into label quality. SMART is designed to be platform agnostic and easily deployable to meet the needs of as many different research teams as possible. The project website contains links to the code repository and extensive user documentation.

Keywords

Cite

@article{arxiv.1812.06591,
  title  = {SMART: An Open Source Data Labeling Platform for Supervised Learning},
  author = {Rob Chew and Michael Wenger and Caroline Kery and Jason Nance and Keith Richards and Emily Hadley and Peter Baumgartner},
  journal= {arXiv preprint arXiv:1812.06591},
  year   = {2023}
}

Comments

5 pages, 1 figure

R2 v1 2026-06-23T06:44:07.728Z