Related papers: Automatically detecting data drift in machine lear…

Detection of data drift and outliers affecting machine learning model performance over time

A trained ML model is deployed on another `test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a…

Applications · Statistics 2022-09-07 Samuel Ackerman , Eitan Farchi , Orna Raz , Marcel Zalmanovici , Parijat Dube

Uncovering Drift in Textual Data: An Unsupervised Method for Detecting and Mitigating Drift in Machine Learning Models

Drift in machine learning refers to the phenomenon where the statistical properties of data or context, in which the model operates, change over time leading to a decrease in its performance. Therefore, maintaining a constant monitoring…

Computation and Language · Computer Science 2023-09-08 Saeed Khaki , Akhouri Abhinav Aditya , Zohar Karnin , Lan Ma , Olivia Pan , Samarth Marudheri Chandrashekar

On the Reliable Detection of Concept Drift from Streaming Unlabeled Data

Classifiers deployed in the real world operate in a dynamic environment, where the data distribution can change over time. These changes, referred to as concept drift, can cause the predictive performance of the classifier to drop over…

Machine Learning · Statistics 2017-04-04 Tegjyot Singh Sethi , Mehmed Kantardzic

Automating concept-drift detection by self-evaluating predictive model degradation

A key aspect of automating predictive machine learning entails the capability of properly triggering the update of the trained model. To this aim, suitable automatic solutions to self-assess the prediction quality and the data distribution…

Machine Learning · Computer Science 2019-07-19 Tania Cerquitelli , Stefano Proto , Francesco Ventura , Daniele Apiletti , Elena Baralis

Machine Learning Model Drift Detection Via Weak Data Slices

Detecting drift in performance of Machine Learning (ML) models is an acknowledged challenge. For ML models to become an integral part of business applications it is essential to detect when an ML model drifts away from acceptable operation.…

Machine Learning · Computer Science 2021-08-12 Samuel Ackerman , Parijat Dube , Eitan Farchi , Orna Raz , Marcel Zalmanovici

Confident Learning: Estimating Uncertainty in Dataset Labels

Learning exists in the context of data, yet notions of confidence typically focus on model predictions, not label quality. Confident learning (CL) is an alternative approach which focuses instead on label quality by characterizing and…

Machine Learning · Statistics 2022-08-23 Curtis G. Northcutt , Lu Jiang , Isaac L. Chuang

Automatic Learning to Detect Concept Drift

Many methods have been proposed to detect concept drift, i.e., the change in the distribution of streaming data, due to concept drift causes a decrease in the prediction accuracy of algorithms. However, the most of current detection methods…

Artificial Intelligence · Computer Science 2021-05-05 Hang Yu , Tianyu Liu , Jie Lu , Guangquan Zhang

Beta Distribution Drift Detection for Adaptive Classifiers

With today's abundant streams of data, the only constant we can rely on is change. For stream classification algorithms, it is necessary to adapt to concept drift. This can be achieved by monitoring the model error, and triggering counter…

Machine Learning · Computer Science 2020-12-09 Lukas Fleckenstein , Sebastian Kauschke , Johannes Fürnkranz

Task-Sensitive Concept Drift Detector with Constraint Embedding

Detecting drifts in data is essential for machine learning applications, as changes in the statistics of processed data typically has a profound influence on the performance of trained models. Most of the available drift detection methods…

Machine Learning · Computer Science 2024-10-28 Andrea Castellani , Sebastian Schmitt , Barbara Hammer

AutoML for Multi-Class Anomaly Compensation of Sensor Drift

Addressing sensor drift is essential in industrial measurement systems, where precise data output is necessary for maintaining accuracy and reliability in monitoring processes, as it progressively degrades the performance of machine…

Machine Learning · Computer Science 2025-02-27 Melanie Schaller , Mathis Kruse , Antonio Ortega , Marius Lindauer , Bodo Rosenhahn

Sequential changepoint detection in classification data under label shift

Classifier predictions often rely on the assumption that new observations come from the same distribution as training data. When the underlying distribution changes, so does the optimal classification rule, and performance may degrade. We…

Methodology · Statistics 2021-09-01 Ciaran Evans , Max G'Sell

Monitoring the calibration of probability forecasts with an application to concept drift detection involving image classification

Machine learning approaches for image classification have led to impressive advances in that field. For example, convolutional neural networks are able to achieve remarkable image classification accuracy across a wide range of applications…

Machine Learning · Statistics 2025-10-30 Christopher T. Franck , Anne R. Driscoll , Zoe Szajnfarber , William H. Woodall

On The Reliability Of Machine Learning Applications In Manufacturing Environments

The increasing deployment of advanced digital technologies such as Internet of Things (IoT) devices and Cyber-Physical Systems (CPS) in industrial environments is enabling the productive use of machine learning (ML) algorithms in the…

Machine Learning · Computer Science 2021-12-21 Nicolas Jourdan , Sagar Sen , Erik Johannes Husom , Enrique Garcia-Ceja , Tobias Biegel , Joachim Metternich

Bias Assessment and Data Drift Detection in Medical Image Analysis: A Survey

Machine Learning (ML) models have gained popularity in medical imaging analysis given their expert level performance in many medical domains. To enhance the trustworthiness, acceptance, and regulatory compliance of medical imaging models…

Human-Computer Interaction · Computer Science 2025-06-06 Mischa Dombrowski , Andrea Prenner , Bernhard Kainz

A Multi-Criteria Automated MLOps Pipeline for Cost-Effective Cloud-Based Classifier Retraining in Response to Data Distribution Shifts

The performance of machine learning (ML) models often deteriorates when the underlying data distribution changes over time, a phenomenon known as data distribution drift. When this happens, ML models need to be retrained and redeployed. ML…

Machine Learning · Computer Science 2025-12-15 Emmanuel K. Katalay , David O. Dimandja , Jordan F. Masakuna

Failing Loudly: An Empirical Study of Methods for Detecting Dataset Shift

We might hope that when faced with unexpected inputs, well-designed software systems would fire off warnings. Machine learning (ML) systems, however, which depend strongly on properties of their inputs (e.g. the i.i.d. assumption), tend to…

Machine Learning · Statistics 2019-10-29 Stephan Rabanser , Stephan Günnemann , Zachary C. Lipton

A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data

Uncertain changes in data streams present challenges for machine learning models to dynamically adapt and uphold performance in real-time. Particularly, classification boundary change, also known as real concept drift, is the major cause of…

Machine Learning · Computer Science 2024-05-24 Feng Gu , Jie Lu , Zhen Fang , Kun Wang , Guangquan Zhang

Sequential Drift Detection in Deep Learning Classifiers

We utilize neural network embeddings to detect data drift by formulating the drift detection within an appropriate sequential decision framework. This enables control of the false alarm rate although the statistical tests are repeatedly…

Applications · Statistics 2020-08-03 Samuel Ackerman , Parijat Dube , Eitan Farchi

Precise Change Point Detection using Spectral Drift Detection

The notion of concept drift refers to the phenomenon that the data generating distribution changes over time; as a consequence machine learning models may become inaccurate and need adjustment. In this paper we consider the problem of…

Machine Learning · Computer Science 2022-05-16 Fabian Hinder , André Artelt , Valerie Vaquet , Barbara Hammer

Time to Retrain? Detecting Concept Drifts in Machine Learning Systems

With the boom of machine learning (ML) techniques, software practitioners build ML systems to process the massive volume of streaming data for diverse software engineering tasks such as failure prediction in AIOps. Trained using historical…

Machine Learning · Computer Science 2025-08-05 Tri Minh Triet Pham , Karthikeyan Premkumar , Mohamed Naili , Jinqiu Yang