Related papers: Explaining Drift using Shapley Values

Unified Shapley Framework to Explain Prediction Drift

Predictions are the currency of a machine learning model, and to understand the model's behavior over segments of a dataset, or over time, is an important problem in machine learning research and practice. There currently is no systematic…

Machine Learning · Computer Science 2021-02-17 Aalok Shanbhag , Avijit Ghosh , Josh Rubin

From XAI to MLOps: Explainable Concept Drift Detection with Profile Drift Detection

Predictive models often degrade in performance due to evolving data distributions, a phenomenon known as data drift. Among its forms, concept drift, where the relationship between explanatory variables and the response variable changes, is…

Machine Learning · Statistics 2026-05-18 Ugur Dar , Mustafa Cavus

Model Based Explanations of Concept Drift

The notion of concept drift refers to the phenomenon that the distribution generating the observed data changes over time. If drift is present, machine learning models can become inaccurate and need adjustment. While there do exist methods…

Machine Learning · Computer Science 2023-03-17 Fabian Hinder , Valerie Vaquet , Johannes Brinkrolf , Barbara Hammer

Uncovering Drift in Textual Data: An Unsupervised Method for Detecting and Mitigating Drift in Machine Learning Models

Drift in machine learning refers to the phenomenon where the statistical properties of data or context, in which the model operates, change over time leading to a decrease in its performance. Therefore, maintaining a constant monitoring…

Computation and Language · Computer Science 2023-09-08 Saeed Khaki , Akhouri Abhinav Aditya , Zohar Karnin , Lan Ma , Olivia Pan , Samarth Marudheri Chandrashekar

Interpretable Model Drift Detection

Data in the real world often has an evolving distribution. Thus, machine learning models trained on such data get outdated over time. This phenomenon is called model drift. Knowledge of this drift serves two purposes: (i) Retain an accurate…

Machine Learning · Computer Science 2025-03-11 Pranoy Panda , Kancheti Sai Srinivas , Vineeth N Balasubramanian , Gaurav Sinha

Unsupervised Concept Drift Detection from Deep Learning Representations in Real-time

Concept drift is the phenomenon in which the underlying data distributions and statistical properties of a target domain change over time, leading to a degradation in model performance. Consequently, production models require continuous…

Machine Learning · Computer Science 2025-08-07 Salvatore Greco , Bartolomeo Vacchetti , Daniele Apiletti , Tania Cerquitelli

Efficient computation and analysis of distributional Shapley values

Distributional data Shapley value (DShapley) has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. DShapley develops the foundational game theory concept of Shapley values…

Machine Learning · Statistics 2021-02-19 Yongchan Kwon , Manuel A. Rivas , James Zou

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

Drift-Based Dataset Stability Benchmark

Machine learning (ML) represents an efficient and popular approach for network traffic classification. However, network traffic classification is a challenging domain, and trained models may degrade soon after deployment due to the obsolete…

Machine Learning · Computer Science 2026-01-01 Dominik Soukup , Richard Plný , Daniel Vašata , Tomáš Čejka

Learning under Concept Drift: A Review

Concept drift describes unforeseeable changes in the underlying distribution of streaming data over time. Concept drift research involves the development of methodologies and techniques for drift detection, understanding and adaptation.…

Machine Learning · Computer Science 2020-04-14 Jie Lu , Anjin Liu , Fan Dong , Feng Gu , Joao Gama , Guangquan Zhang

Reliable and Interpretable Drift Detection in Streams of Short Texts

Data drift is the change in model input data that is one of the key factors leading to machine learning models performance degradation over time. Monitoring drift helps detecting these issues and preventing their harmful consequences.…

Computation and Language · Computer Science 2023-05-30 Ella Rabinovich , Matan Vetzler , Samuel Ackerman , Ateret Anaby-Tavor

Detection of data drift and outliers affecting machine learning model performance over time

A trained ML model is deployed on another `test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a…

Applications · Statistics 2022-09-07 Samuel Ackerman , Eitan Farchi , Orna Raz , Marcel Zalmanovici , Parijat Dube

A Neighbor-Searching Discrepancy-based Drift Detection Scheme for Learning Evolving Data

Uncertain changes in data streams present challenges for machine learning models to dynamically adapt and uphold performance in real-time. Particularly, classification boundary change, also known as real concept drift, is the major cause of…

Machine Learning · Computer Science 2024-05-24 Feng Gu , Jie Lu , Zhen Fang , Kun Wang , Guangquan Zhang

The many Shapley values for model explanation

The Shapley value has become a popular method to attribute the prediction of a machine-learning model on an input to its base features. The use of the Shapley value is justified by citing [16] showing that it is the \emph{unique} method…

Artificial Intelligence · Computer Science 2020-02-10 Mukund Sundararajan , Amir Najmi

"Who experiences large model decay and why?" A Hierarchical Framework for Diagnosing Heterogeneous Performance Drift

Machine learning (ML) models frequently experience performance degradation when deployed in new contexts. Such degradation is rarely uniform: some subgroups may suffer large performance decay while others may not. Understanding where and…

Machine Learning · Computer Science 2025-06-03 Harvineet Singh , Fan Xia , Alexej Gossmann , Andrew Chuang , Julian C. Hong , Jean Feng

Characterizing Concept Drift

Most machine learning models are static, but the world is dynamic, and increasing online deployment of learned models gives increasing urgency to the development of efficient and effective mechanisms to address learning in the context of…

Machine Learning · Computer Science 2016-11-16 Geoffrey I. Webb , Roy Hyde , Hong Cao , Hai Long Nguyen , Francois Petitjean

Switching Scheme: A Novel Approach for Handling Incremental Concept Drift in Real-World Data Sets

Machine learning models nowadays play a crucial role for many applications in business and industry. However, models only start adding value as soon as they are deployed into production. One challenge of deployed models is the effect of…

Machine Learning · Computer Science 2020-11-06 Lucas Baier , Vincent Kellner , Niklas Kühl , Gerhard Satzger

A feature selection method based on Shapley values robust to concept shift in regression

Feature selection is one of the most relevant processes in any methodology for creating a statistical learning model. Usually, existing algorithms establish some criterion to select the most influential variables, discarding those that do…

Machine Learning · Statistics 2024-05-10 Carlos Sebastián , Carlos E. González-Guillén

Unsupervised Model Drift Estimation with Batch Normalization Statistics for Dataset Shift Detection and Model Selection

While many real-world data streams imply that they change frequently in a nonstationary way, most of deep learning methods optimize neural networks on training data, and this leads to severe performance degradation when dataset shift…

Machine Learning · Computer Science 2021-07-02 Wonju Lee , Seok-Yong Byun , Jooeun Kim , Minje Park , Kirill Chechil

Counterfactual Explanations of Concept Drift

The notion of concept drift refers to the phenomenon that the distribution, which is underlying the observed data, changes over time; as a consequence machine learning models may become inaccurate and need adjustment. While there do exist…

Machine Learning · Computer Science 2020-06-24 Fabian Hinder , Barbara Hammer