Using system context information to complement weakly labeled data

Machine Learning 2021-07-22 v1

Abstract

Real-world datasets collected with sensor networks often contain incomplete and uncertain labels as well as artefacts arising from the system environment. Complete and reliable labeling is often infeasible for large-scale and long-term sensor network deployments because of the labor and time overhead, the limited availability of experts, and missing ground truth. In addition, if the machine learning method used for analysis is sensitive to certain features of a deployment, labeling and learning need to be repeated for every new deployment. To address these challenges, we propose to make use of system context information formalized in an information graph and to embed it in the learning process via contrastive learning. Based on real-world data, we show that this approach increases accuracy when data is weakly labeled and improves the robustness and transferability of the classifier to new sensor locations.

Cite

@article{arxiv.2107.10236,
  title  = {Using system context information to complement weakly labeled data},
  author = {Matthias Meyer and Michaela Wenner and Clément Hibert and Fabian Walter and Lothar Thiele},
  journal= {arXiv preprint arXiv:2107.10236},
  year   = {2021}
}

Comments

Also appears in "Proceedings of the First Workshop on Weakly Supervised Learning (WeaSuL)" arXiv:2107.03690
