Related papers: Sequential changepoint detection in classification…

Semi-supervised sequence classification through change point detection

Sequential sensor data is generated in a wide variety of practical applications. A fundamental challenge involves learning effective classifiers for such sequential data. While deep learning has led to impressive performance gains in recent…

Machine Learning · Computer Science 2020-10-07 Nauman Ahad , Mark A. Davenport

Test-time Adaptation with Calibration of Medical Image Classification Nets for Label Distribution Shift

Class distribution plays an important role in learning deep classifiers. When the proportion of each class in the test set differs from the training set, the performance of classification nets usually degrades. Such a label distribution…

Image and Video Processing · Electrical Eng. & Systems 2022-07-12 Wenao Ma , Cheng Chen , Shuang Zheng , Jing Qin , Huimao Zhang , Qi Dou

Deep learning model solves change point detection for multiple change types

A change points detection aims to catch an abrupt disorder in data distribution. Common approaches assume that there are only two fixed distributions for data: one before and another after a change point. Real-world data are richer than…

Machine Learning · Computer Science 2022-04-18 Alexander Stepikin , Evgenia Romanenkova , Alexey Zaytsev

Sequential Harmful Shift Detection Without Labels

We introduce a novel approach for detecting distribution shifts that negatively impact the performance of machine learning models in continuous production environments, which requires no access to ground truth data labels. It builds upon…

Machine Learning · Statistics 2024-12-18 Salim I. Amoukou , Tom Bewley , Saumitra Mishra , Freddy Lecue , Daniele Magazzeni , Manuela Veloso

Sequential change-point detection when unknown parameters are present in the pre-change distribution

In the sequential change-point detection literature, most research specifies a required frequency of false alarms at a given pre-change distribution $f_{\theta}$ and tries to minimize the detection delay for every possible post-change…

Statistics Theory · Mathematics 2007-06-13 Yajun Mei

Sequential change-point detection: Computation versus statistical performance

Change-point detection studies the problem of detecting the changes in the underlying distribution of the data stream as soon as possible after the change happens. Modern large-scale, high-dimensional, and complex streaming data call for…

Statistics Theory · Mathematics 2023-06-05 Haoyun Wang , Yao Xie

Test-time Recalibration of Conformal Predictors Under Distribution Shift Based on Unlabeled Examples

Modern image classifiers are very accurate, but the predictions come without uncertainty estimates. Conformal predictors provide uncertainty estimates by computing a set of classes containing the correct class with a user-specified…

Machine Learning · Computer Science 2023-06-06 Fatih Furkan Yilmaz , Reinhard Heckel

Labeled Data Selection for Category Discovery

Category discovery methods aim to find novel categories in unlabeled visual data. At training time, a set of labeled and unlabeled images are provided, where the labels correspond to the categories present in the images. The labeled data…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Bingchen Zhao , Nico Lang , Serge Belongie , Oisin Mac Aodha

Tracking the risk of a deployed model and detecting harmful distribution shifts

When deployed in the real world, machine learning models inevitably encounter changes in the data distribution, and certain -- but not all -- distribution shifts could result in significant performance degradation. In practice, it may make…

Machine Learning · Statistics 2022-05-06 Aleksandr Podkopaev , Aaditya Ramdas

Nonparametric sequential change-point detection for multivariate time series based on empirical distribution functions

The aim of sequential change-point detection is to issue an alarm when it is thought that certain probabilistic properties of the monitored observations have changed. This work is concerned with nonparametric, closed-end testing procedures…

Methodology · Statistics 2020-10-27 Ivan Kojadinovic , Ghislain Verdier

Training on Test Data with Bayesian Adaptation for Covariate Shift

When faced with distribution shift at test time, deep neural networks often make inaccurate predictions with unreliable uncertainty estimates. While improving the robustness of neural networks is one promising approach to mitigate this…

Machine Learning · Computer Science 2021-09-28 Aurick Zhou , Sergey Levine

Post-detection inference for sequential changepoint localization

This paper addresses a fundamental but largely unexplored challenge in sequential changepoint analysis: conducting inference following a detected change. We develop a very general framework to construct confidence sets for the unknown…

Machine Learning · Statistics 2026-05-12 Aytijhya Saha , Aaditya Ramdas

Sequential Change Diagnosis Revisited and the Adaptive Matrix CuSum

The problem of sequential change diagnosis is considered, where observations are obtained on-line, an abrupt change occurs in their distribution, and the goal is to quickly detect the change and accurately identify the post-change…

Statistics Theory · Mathematics 2022-11-24 Austin Warner , Georgios Fellouris

Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests

While previous distribution shift detection approaches can identify if a shift has occurred, these approaches cannot localize which specific features have caused a distribution shift -- a critical step in diagnosing or fixing any underlying…

Machine Learning · Computer Science 2021-07-16 Sean Kulinski , Saurabh Bagchi , David I. Inouye

Undersampling is a Minimax Optimal Robustness Intervention in Nonparametric Classification

While a broad range of techniques have been proposed to tackle distribution shift, the simple baseline of training on an $\textit{undersampled}$ balanced dataset often achieves close to state-of-the-art-accuracy across several popular…

Machine Learning · Computer Science 2023-06-21 Niladri S. Chatterji , Saminul Haque , Tatsunori Hashimoto

Precise Change Point Detection using Spectral Drift Detection

The notion of concept drift refers to the phenomenon that the data generating distribution changes over time; as a consequence machine learning models may become inaccurate and need adjustment. In this paper we consider the problem of…

Machine Learning · Computer Science 2022-05-16 Fabian Hinder , André Artelt , Valerie Vaquet , Barbara Hammer

A Nonparametric Approach for Multiple Change Point Analysis of Multivariate Data

Change point analysis has applications in a wide variety of fields. The general problem concerns the inference of a change in distribution for a set of time-ordered observations. Sequential detection is an online version in which new data…

Methodology · Statistics 2013-10-16 David S. Matteson , Nicholas A. James

Coping with Label Shift via Distributionally Robust Optimisation

The label shift problem refers to the supervised learning setting where the train and test label distributions do not match. Existing work addressing label shift usually assumes access to an \emph{unlabelled} test sample. This sample may be…

Machine Learning · Computer Science 2021-08-18 Jingzhao Zhang , Aditya Menon , Andreas Veit , Srinadh Bhojanapalli , Sanjiv Kumar , Suvrit Sra

Semiparametric Learning from Open-Set Label Shift Data

We study the open-set label shift problem, where the test data may include a novel class absent from training. This setting is challenging because both the class proportions and the distribution of the novel class are not identifiable…

Methodology · Statistics 2025-09-19 Siyan Liu , Yukun Liu , Qinglong Tian , Pengfei Li , Jing Qin

A generalized approach to label shift: the Conditional Probability Shift Model

In many practical applications of machine learning, a discrepancy often arises between a source distribution from which labeled training examples are drawn and a target distribution for which only unlabeled data is observed. Traditionally,…

Machine Learning · Statistics 2025-03-05 Paweł Teisseyre , Jan Mielniczuk