Related papers: Explaining Drift using Shapley Values

200 papers

Predictions are the currency of a machine learning model, and to understand the model's behavior over segments of a dataset, or over time, is an important problem in machine learning research and practice. There currently is no systematic…

Machine Learning · Computer Science 2021-02-17 Aalok Shanbhag , Avijit Ghosh , Josh Rubin

Predictive models often degrade in performance due to evolving data distributions, a phenomenon known as data drift. Among its forms, concept drift, where the relationship between explanatory variables and the response variable changes, is…

Machine Learning · Statistics 2026-05-18 Ugur Dar , Mustafa Cavus

The notion of concept drift refers to the phenomenon that the distribution generating the observed data changes over time. If drift is present, machine learning models can become inaccurate and need adjustment. While there do exist methods…

Machine Learning · Computer Science 2023-03-17 Fabian Hinder , Valerie Vaquet , Johannes Brinkrolf , Barbara Hammer

Drift in machine learning refers to the phenomenon where the statistical properties of data or context, in which the model operates, change over time leading to a decrease in its performance. Therefore, maintaining a constant monitoring…

Computation and Language · Computer Science 2023-09-08 Saeed Khaki , Akhouri Abhinav Aditya , Zohar Karnin , Lan Ma , Olivia Pan , Samarth Marudheri Chandrashekar

Data in the real world often has an evolving distribution. Thus, machine learning models trained on such data get outdated over time. This phenomenon is called model drift. Knowledge of this drift serves two purposes: (i) Retain an accurate…

Machine Learning · Computer Science 2025-03-11 Pranoy Panda , Kancheti Sai Srinivas , Vineeth N Balasubramanian , Gaurav Sinha

Concept drift is the phenomenon in which the underlying data distributions and statistical properties of a target domain change over time, leading to a degradation in model performance. Consequently, production models require continuous…

Machine Learning · Computer Science 2025-08-07 Salvatore Greco , Bartolomeo Vacchetti , Daniele Apiletti , Tania Cerquitelli

Distributional data Shapley value (DShapley) has recently been proposed as a principled framework to quantify the contribution of individual datum in machine learning. DShapley develops the foundational game theory concept of Shapley values…

Machine Learning · Statistics 2021-02-19 Yongchan Kwon , Manuel A. Rivas , James Zou

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

Machine learning (ML) represents an efficient and popular approach for network traffic classification. However, network traffic classification is a challenging domain, and trained models may degrade soon after deployment due to the obsolete…

Machine Learning · Computer Science 2026-01-01 Dominik Soukup , Richard Plný , Daniel Vašata , Tomáš Čejka

Concept drift describes unforeseeable changes in the underlying distribution of streaming data over time. Concept drift research involves the development of methodologies and techniques for drift detection, understanding and adaptation.…

Machine Learning · Computer Science 2020-04-14 Jie Lu , Anjin Liu , Fan Dong , Feng Gu , Joao Gama , Guangquan Zhang

Data drift is the change in model input data that is one of the key factors leading to machine learning models' performance degradation over time. Monitoring drift helps detect these issues and prevent their harmful consequences.…

Computation and Language · Computer Science 2023-05-30 Ella Rabinovich , Matan Vetzler , Samuel Ackerman , Ateret Anaby-Tavor

A trained ML model is deployed on another 'test' dataset where target feature values (labels) are unknown. Drift is distribution change between the training and deployment data, which is concerning if model performance changes. For a…

Applications · Statistics 2022-09-07 Samuel Ackerman , Eitan Farchi , Orna Raz , Marcel Zalmanovici , Parijat Dube

Uncertain changes in data streams present challenges for machine learning models to dynamically adapt and uphold performance in real-time. Particularly, classification boundary change, also known as real concept drift, is the major cause of…

Machine Learning · Computer Science 2024-05-24 Feng Gu , Jie Lu , Zhen Fang , Kun Wang , Guangquan Zhang

The Shapley value has become a popular method to attribute the prediction of a machine-learning model on an input to its base features. The use of the Shapley value is justified by citing [16] showing that it is the unique method…

Artificial Intelligence · Computer Science 2020-02-10 Mukund Sundararajan , Amir Najmi

Machine learning (ML) models frequently experience performance degradation when deployed in new contexts. Such degradation is rarely uniform: some subgroups may suffer large performance decay while others may not. Understanding where and…

Machine Learning · Computer Science 2025-06-03 Harvineet Singh , Fan Xia , Alexej Gossmann , Andrew Chuang , Julian C. Hong , Jean Feng

Most machine learning models are static, but the world is dynamic, and increasing online deployment of learned models gives increasing urgency to the development of efficient and effective mechanisms to address learning in the context of…

Machine Learning · Computer Science 2016-11-16 Geoffrey I. Webb , Roy Hyde , Hong Cao , Hai Long Nguyen , Francois Petitjean

Machine learning models nowadays play a crucial role for many applications in business and industry. However, models only start adding value as soon as they are deployed into production. One challenge of deployed models is the effect of…

Machine Learning · Computer Science 2020-11-06 Lucas Baier , Vincent Kellner , Niklas Kühl , Gerhard Satzger

Feature selection is one of the most relevant processes in any methodology for creating a statistical learning model. Usually, existing algorithms establish some criterion to select the most influential variables, discarding those that do…

Machine Learning · Statistics 2024-05-10 Carlos Sebastián , Carlos E. González-Guillén

While many real-world data streams imply that they change frequently in a nonstationary way, most of deep learning methods optimize neural networks on training data, and this leads to severe performance degradation when dataset shift…

Machine Learning · Computer Science 2021-07-02 Wonju Lee , Seok-Yong Byun , Jooeun Kim , Minje Park , Kirill Chechil

The notion of concept drift refers to the phenomenon that the distribution, which is underlying the observed data, changes over time; as a consequence machine learning models may become inaccurate and need adjustment. While there do exist…

Machine Learning · Computer Science 2020-06-24 Fabian Hinder , Barbara Hammer