Rethinking Streaming Machine Learning Evaluation

Shreya Shankar; Bernease Herman; Aditya G. Parameswaran

Rethinking Streaming Machine Learning Evaluation

Machine Learning 2022-05-24 v1 Artificial Intelligence Machine Learning

Authors: Shreya Shankar , Bernease Herman , Aditya G. Parameswaran

Abstract

While most work on evaluating machine learning (ML) models focuses on computing accuracy on batches of data, tracking accuracy alone in a streaming setting (i.e., unbounded, timestamp-ordered datasets) fails to appropriately identify when models are performing unexpectedly. In this position paper, we discuss how the nature of streaming ML problems introduces new real-world challenges (e.g., delayed arrival of labels) and recommend additional metrics to assess streaming ML performance.

Rethinking Streaming Machine Learning Evaluation

Abstract

Keywords

Cite

Comments

Related papers