Related papers: Towards Practicable Sequential Shift Detectors

Tracking the risk of a deployed model and detecting harmful distribution shifts

When deployed in the real world, machine learning models inevitably encounter changes in the data distribution, and certain -- but not all -- distribution shifts could result in significant performance degradation. In practice, it may make…

Machine Learning · Statistics 2022-05-06 Aleksandr Podkopaev , Aaditya Ramdas

Ensembling Shift Detectors: an Extensive Empirical Evaluation

The term dataset shift refers to the situation where the data used to train a machine learning model is different from where the model operates. While several types of shifts naturally occur, existing shift detectors are usually designed to…

Machine Learning · Computer Science 2021-06-29 Simona Maggio , Léo Dreyfus-Schmidt

On Distribution Shift in Learning-based Bug Detectors

Deep learning has recently achieved initial success in program analysis tasks such as bug detection. Lacking real bugs, most existing works construct training and test data by injecting synthetic bugs into correct programs. Despite…

Machine Learning · Computer Science 2022-06-22 Jingxuan He , Luca Beurer-Kellner , Martin Vechev

Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests

While previous distribution shift detection approaches can identify if a shift has occurred, these approaches cannot localize which specific features have caused a distribution shift -- a critical step in diagnosing or fixing any underlying…

Machine Learning · Computer Science 2021-07-16 Sean Kulinski , Saurabh Bagchi , David I. Inouye

Are Concept Drift Detectors Reliable Alarming Systems? -- A Comparative Study

As machine learning models increasingly replace traditional business logic in the production system, their lifecycle management is becoming a significant concern. Once deployed into production, the machine learning models are constantly…

Machine Learning · Computer Science 2022-11-24 Lorena Poenaru-Olaru , Luis Cruz , Arie van Deursen , Jan S. Rellermeyer

Sequential changepoint detection in classification data under label shift

Classifier predictions often rely on the assumption that new observations come from the same distribution as training data. When the underlying distribution changes, so does the optimal classification rule, and performance may degrade. We…

Methodology · Statistics 2021-09-01 Ciaran Evans , Max G'Sell

Deep Hypothesis Tests Detect Clinically Relevant Subgroup Shifts in Medical Images

Distribution shifts remain a fundamental problem for the safe application of machine learning systems. If undetected, they may impact the real-world performance of such systems or will at least render original performance claims invalid. In…

Machine Learning · Computer Science 2023-03-10 Lisa M. Koch , Christian M. Schürch , Christian F. Baumgartner , Arthur Gretton , Philipp Berens

Sequential Change Point Detection with FDR Control in Reconfigurable Sensor Networks

This paper investigates sequential change-point detection in reconfigurable sensor networks. In this problem, data from multiple sensors are observed sequentially. Each sensor can have a unique change point, and the data distribution…

Methodology · Statistics 2025-04-10 Seungwon Lee , Yunxiao Chen , Xiaoou Li

A unified framework for dataset shift diagnostics

Supervised learning techniques typically assume training data originates from the target population. Yet, in reality, dataset shift frequently arises, which, if not adequately taken into account, may decrease the performance of their…

Machine Learning · Statistics 2023-09-14 Felipe Maia Polo , Rafael Izbicki , Evanildo Gomes Lacerda , Juan Pablo Ibieta-Jimenez , Renato Vicente

Sequential change-point detection: Computation versus statistical performance

Change-point detection studies the problem of detecting the changes in the underlying distribution of the data stream as soon as possible after the change happens. Modern large-scale, high-dimensional, and complex streaming data call for…

Statistics Theory · Mathematics 2023-06-05 Haoyun Wang , Yao Xie

Control+Shift: Generating Controllable Distribution Shifts

We propose a new method for generating realistic datasets with distribution shifts using any decoder-based generative model. Our approach systematically creates datasets with varying intensities of distribution shifts, facilitating a…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Roy Friedman , Rhea Chowers

Sequential change-point detection when unknown parameters are present in the pre-change distribution

In the sequential change-point detection literature, most research specifies a required frequency of false alarms at a given pre-change distribution $f_{\theta}$ and tries to minimize the detection delay for every possible post-change…

Statistics Theory · Mathematics 2007-06-13 Yajun Mei

Data+Shift: Supporting visual investigation of data distribution shifts by data scientists

Machine learning on data streams is increasingly more present in multiple domains. However, there is often data distribution shift that can lead machine learning models to make incorrect decisions. While there are automatic methods to…

Machine Learning · Computer Science 2022-05-02 João Palmeiro , Beatriz Malveiro , Rita Costa , David Polido , Ricardo Moreira , Pedro Bizarro

Towards Explaining Distribution Shifts

A distribution shift can have fundamental consequences such as signaling a change in the operating environment or significantly reducing the accuracy of downstream models. Thus, understanding distribution shifts is critical for examining…

Machine Learning · Computer Science 2023-06-21 Sean Kulinski , David I. Inouye

Sequential (Quickest) Change Detection: Classical Results and New Directions

Online detection of changes in stochastic systems, referred to as sequential change detection or quickest change detection, is an important research topic in statistics, signal processing, and information theory, and has a wide range of…

Statistics Theory · Mathematics 2021-04-12 Liyan Xie , Shaofeng Zou , Yao Xie , Venugopal V. Veeravalli

Adapting to Continuous Covariate Shift via Online Density Ratio Estimation

Dealing with distribution shifts is one of the central challenges for modern machine learning. One fundamental situation is the covariate shift, where the input distributions of data change from training to testing stages while the…

Machine Learning · Computer Science 2024-05-28 Yu-Jie Zhang , Zhen-Yu Zhang , Peng Zhao , Masashi Sugiyama

Data Distribution Shifts in (Industrial) Federated Learning as a Privacy Issue

We consider industrial federated learning, a collaboration between a small number of powerful, potentially competing industrial players, mediated by a third party aspiring to improve the service it provides to its customers. We argue that…

Machine Learning · Computer Science 2024-09-24 David Brunner , Alessio Montuoro

Sequence Transferability and Task Order Selection in Continual Learning

In continual learning, understanding the properties of task sequences and their relationships to model performance is important for developing advanced algorithms with better accuracy. However, efforts in this direction remain…

Machine Learning · Computer Science 2025-02-11 Thinh Nguyen , Cuong N. Nguyen , Quang Pham , Binh T. Nguyen , Savitha Ramasamy , Xiaoli Li , Cuong V. Nguyen

Sequential Harmful Shift Detection Without Labels

We introduce a novel approach for detecting distribution shifts that negatively impact the performance of machine learning models in continuous production environments, which requires no access to ground truth data labels. It builds upon…

Machine Learning · Statistics 2024-12-18 Salim I. Amoukou , Tom Bewley , Saumitra Mishra , Freddy Lecue , Daniele Magazzeni , Manuela Veloso

Transfer Learning for High-dimensional Quantile Regression with Distribution Shift

Information from related source studies can often enhance the findings of a target study. However, the distribution shift between target and source studies can severely impact the efficiency of knowledge transfer. In the high-dimensional…

Methodology · Statistics 2025-11-26 Ruiqi Bai , Yijiao Zhang , Hanbo Yang , Zhongyi Zhu