Related papers: Sequential Harmful Shift Detection Without Labels

Tracking the risk of a deployed model and detecting harmful distribution shifts

When deployed in the real world, machine learning models inevitably encounter changes in the data distribution, and certain -- but not all -- distribution shifts could result in significant performance degradation. In practice, it may make…

Machine Learning · Statistics 2022-05-06 Aleksandr Podkopaev , Aaditya Ramdas

Sequential changepoint detection in classification data under label shift

Classifier predictions often rely on the assumption that new observations come from the same distribution as training data. When the underlying distribution changes, so does the optimal classification rule, and performance may degrade. We…

Methodology · Statistics 2021-09-01 Ciaran Evans , Max G'Sell

Flexible and Efficient Drift Detection without Labels

Machine learning models are being increasingly used to automate decisions in almost every domain, and ensuring the performance of these models is crucial for ensuring high quality machine learning enabled services. Ensuring concept drift is…

Machine Learning · Statistics 2025-09-30 Nelvin Tan , Yu-Ching Shih , Dong Yang , Amol Salunkhe

Label-Free Detection of Governance Evidence Degradation in Risk Decision Systems

Risk decision systems in fraud detection and credit scoring operate under structural label absence: ground truth arrives weeks to months after decisions are made. During this blind period, model performance may degrade silently, eroding the…

Computers and Society · Computer Science 2026-04-21 Oleg Solozobov

Concept Drift and Covariate Shift Detection Ensemble with Lagged Labels

In model serving, having one fixed model during the entire often life-long inference process is usually detrimental to model performance, as data distribution evolves over time, resulting in lack of reliability of the model trained on…

Artificial Intelligence · Computer Science 2020-12-16 Yiming Xu , Diego Klabjan

Reliably Detecting Model Failures in Deployment Without Labels

The distribution of data changes over time; models operating in dynamic environments need retraining. But knowing when to retrain, without access to labels, is an open challenge since some, but not all shifts degrade model performance. This…

Machine Learning · Computer Science 2025-11-05 Viet Nguyen , Changjian Shui , Vijay Giri , Siddharth Arya , Amol Verma , Fahad Razak , Rahul G. Krishnan

Online Adaptation to Label Distribution Shift

Machine learning models often encounter distribution shifts when deployed in the real world. In this paper, we focus on adaptation to label distribution shift in the online setting, where the test-time label distribution is continually…

Machine Learning · Computer Science 2022-01-06 Ruihan Wu , Chuan Guo , Yi Su , Kilian Q. Weinberger

An Adaptive Method for Weak Supervision with Drifting Data

We introduce an adaptive method with formal quality guarantees for weak supervision in a non-stationary setting. Our goal is to infer the unknown labels of a sequence of data by using weak supervision sources that provide independent noisy…

Machine Learning · Computer Science 2025-05-05 Alessio Mazzetto , Reza Esfandiarpoor , Akash Singirikonda , Eli Upfal , Stephen H. Bach

Online Distribution Shift Detection via Recency Prediction

When deploying modern machine learning-enabled robotic systems in high-stakes applications, detecting distribution shift is critical. However, most existing methods for detecting distribution shift are not well-suited to robotics settings,…

Robotics · Computer Science 2024-05-21 Rachel Luo , Rohan Sinha , Yixiao Sun , Ali Hindy , Shengjia Zhao , Silvio Savarese , Edward Schmerling , Marco Pavone

On Continuous Monitoring of Risk Violations under Unknown Shift

Machine learning systems deployed in the real world must operate under dynamic and often unpredictable distribution shifts. This challenges the validity of statistical safety assurances on the system's risk established beforehand. Common…

Machine Learning · Statistics 2025-06-23 Alexander Timans , Rajeev Verma , Eric Nalisnick , Christian A. Naesseth

Task-Sensitive Concept Drift Detector with Constraint Embedding

Detecting drifts in data is essential for machine learning applications, as changes in the statistics of processed data typically has a profound influence on the performance of trained models. Most of the available drift detection methods…

Machine Learning · Computer Science 2024-10-28 Andrea Castellani , Sebastian Schmitt , Barbara Hammer

Label-free estimation of clinically relevant performance metrics under distribution shifts

Performance monitoring is essential for safe clinical deployment of image classification models. However, because ground-truth labels are typically unavailable in the target dataset, direct assessment of real-world model performance is…

Machine Learning · Computer Science 2025-07-31 Tim Flühmann , Alceu Bissoto , Trung-Dung Hoang , Lisa M. Koch

Unsupervised Accuracy Estimation of Deep Visual Models using Domain-Adaptive Adversarial Perturbation without Source Samples

Deploying deep visual models can lead to performance drops due to the discrepancies between source and target distributions. Several approaches leverage labeled source data to estimate target domain accuracy, but accessing labeled source…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 JoonHo Lee , Jae Oh Woo , Hankyu Moon , Kwonho Lee

Estimating Generalization under Distribution Shifts via Domain-Invariant Representations

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under…

Machine Learning · Computer Science 2020-07-08 Ching-Yao Chuang , Antonio Torralba , Stefanie Jegelka

Efficient Inference under Label Shift in Unsupervised Domain Adaptation

In many real-world applications, researchers aim to deploy models trained in a source domain to a target domain, where obtaining labeled data is often expensive, time-consuming, or even infeasible. While most existing literature assumes…

Methodology · Statistics 2025-08-26 Seong-ho Lee , Yanyuan Ma , Jiwei Zhao

Unsupervised Supervised Learning II: Training Margin Based Classifiers without Labels

Many popular linear classifiers, such as logistic regression, boosting, or SVM, are trained by optimizing a margin-based risk function. Traditionally, these risk functions are computed based on a labeled dataset. We develop a novel…

Machine Learning · Computer Science 2010-07-23 Krishnakumar Balasubramanian , Pinar Donmez , Guy Lebanon

On the Reliable Detection of Concept Drift from Streaming Unlabeled Data

Classifiers deployed in the real world operate in a dynamic environment, where the data distribution can change over time. These changes, referred to as concept drift, can cause the predictive performance of the classifier to drop over…

Machine Learning · Statistics 2017-04-04 Tegjyot Singh Sethi , Mehmed Kantardzic

A Self-Organizing Clustering System for Unsupervised Distribution Shift Detection

Modeling non-stationary data is a challenging problem in the field of continual learning, and data distribution shifts may result in negative consequences on the performance of a machine learning model. Classic learning tools are often…

Machine Learning · Computer Science 2024-10-23 Sebastián Basterrech , Line Clemmensen , Gerardo Rubino

Precise Change Point Detection using Spectral Drift Detection

The notion of concept drift refers to the phenomenon that the data generating distribution changes over time; as a consequence machine learning models may become inaccurate and need adjustment. In this paper we consider the problem of…

Machine Learning · Computer Science 2022-05-16 Fabian Hinder , André Artelt , Valerie Vaquet , Barbara Hammer

Learning Time Series Detection Models from Temporally Imprecise Labels

In this paper, we consider a new low-quality label learning problem: learning time series detection models from temporally imprecise labels. In this problem, the data consist of a set of input time series, and supervision is provided by a…

Machine Learning · Statistics 2017-04-14 Roy J. Adams , Benjamin M. Marlin