Related papers: Performance Prediction Under Dataset Shift

Estimating Generalization under Distribution Shifts via Domain-Invariant Representations

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under…

Machine Learning · Computer Science 2020-07-08 Ching-Yao Chuang , Antonio Torralba , Stefanie Jegelka

Explanation Shift: How Did the Distribution Shift Impact the Model?

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

Evaluating Predictive Uncertainty and Robustness to Distributional Shift Using Real World Data

Most machine learning models operate under the assumption that the training, testing and deployment data is independent and identically distributed (i.i.d.). This assumption doesn't generally hold true in a natural setting. Usually, the…

Machine Learning · Computer Science 2021-12-14 Kumud Lakara , Akshat Bhandari , Pratinav Seth , Ujjwal Verma

Explanation Shift: Detecting distribution shifts on tabular data via the explanation space

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to…

Machine Learning · Computer Science 2022-10-25 Carlos Mougan , Klaus Broelemann , Gjergji Kasneci , Thanassis Tiropanis , Steffen Staab

"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar , Zhicheng Guo , Cynthia Rudin

Assessing the Generalizability of a Performance Predictive Model

A key component of automated algorithm selection and configuration, which in most cases are performed using supervised machine learning (ML) methods is a good-performing predictive model. The predictive model uses the feature representation…

Machine Learning · Computer Science 2023-06-02 Ana Nikolikj , Gjorgjina Cenikj , Gordana Ispirova , Diederick Vermetten , Ryan Dieter Lang , Andries Petrus Engelbrecht , Carola Doerr , Peter Korošec , Tome Eftimov

Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift

Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive {\em uncertainty}. Quantifying…

Machine Learning · Statistics 2019-12-24 Yaniv Ovadia , Emily Fertig , Jie Ren , Zachary Nado , D Sculley , Sebastian Nowozin , Joshua V. Dillon , Balaji Lakshminarayanan , Jasper Snoek

Robust Predictive Modeling Under Unseen Data Distribution Shifts: A Methodological Commentary

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores

Machine learning models are typically deployed in a test setting that differs from the training setting, potentially leading to decreased model performance because of domain shift. If we could estimate the performance that a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Zeju Li , Konstantinos Kamnitsas , Mobarakol Islam , Chen Chen , Ben Glocker

Learning Weighted Representations for Generalization Across Designs

Predictive models that generalize well under distributional shift are often desirable and sometimes crucial to building robust and reliable machine learning applications. We focus on distributional shift that arises in causal inference from…

Machine Learning · Statistics 2018-02-27 Fredrik D. Johansson , Nathan Kallus , Uri Shalit , David Sontag

Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability

Use of machine learning to perform database operations, such as indexing, cardinality estimation, and sorting, is shown to provide substantial performance benefits. However, when datasets change and data distribution shifts, empirical…

Machine Learning · Computer Science 2024-11-12 Sepanta Zeighami , Cyrus Shahahbi

An Empirical Framework for Domain Generalization in Clinical Settings

Clinical machine learning models experience significantly degraded performance in datasets not seen during training, e.g., new hospitals or populations. Recent developments in domain generalization offer a promising solution to this problem…

Machine Learning · Computer Science 2021-04-16 Haoran Zhang , Natalie Dullerud , Laleh Seyyed-Kalantari , Quaid Morris , Shalmali Joshi , Marzyeh Ghassemi

Learning Prediction Intervals for Model Performance

Understanding model performance on unlabeled data is a fundamental challenge of developing, deploying, and maintaining AI systems. Model performance is typically evaluated using test sets or periodic manual quality assessments, both of…

Machine Learning · Computer Science 2020-12-17 Benjamin Elder , Matthew Arnold , Anupama Murthi , Jiri Navratil

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

Understanding Behavior of Clinical Models under Domain Shifts

The hypothesis that computational models can be reliable enough to be adopted in prognosis and patient care is revolutionizing healthcare. Deep learning, in particular, has been a game changer in building predictive models, thus leading to…

Machine Learning · Statistics 2019-06-17 Jayaraman J. Thiagarajan , Deepta Rajan , Prasanna Sattigeri

A unified framework for dataset shift diagnostics

Supervised learning techniques typically assume training data originates from the target population. Yet, in reality, dataset shift frequently arises, which, if not adequately taken into account, may decrease the performance of their…

Machine Learning · Statistics 2023-09-14 Felipe Maia Polo , Rafael Izbicki , Evanildo Gomes Lacerda , Juan Pablo Ibieta-Jimenez , Renato Vicente

From Concept Drift to Model Degradation: An Overview on Performance-Aware Drift Detectors

The dynamicity of real-world systems poses a significant challenge to deployed predictive machine learning (ML) models. Changes in the system on which the ML model has been trained may lead to performance degradation during the system's…

Machine Learning · Computer Science 2022-03-22 Firas Bayram , Bestoun S. Ahmed , Andreas Kassler

Identifying the Context Shift between Test Benchmarks and Production Data

Machine learning models are often brittle on production data despite achieving high accuracy on benchmark datasets. Benchmark datasets have traditionally served dual purposes: first, benchmarks offer a standard on which machine learning…

Machine Learning · Computer Science 2022-09-26 Matthew Groh

Extended Abstract of Performance Analysis and Prediction of Model Transformation

In the software development process, model transformation is increasingly assimilated. However, systems being developed with model transformation sometimes grow in size and become complex. Meanwhile, the performance of model transformation…

Software Engineering · Computer Science 2020-04-21 Vijayshree Vijayshree , Markus Frank , Steffen Becker

Inside-Out: Measuring Generalization in Vision Transformers Through Inner Workings

Reliable generalization metrics are fundamental to the evaluation of machine learning models. Especially in high-stakes applications where labeled target data are scarce, evaluation of models' generalization performance under distribution…

Machine Learning · Computer Science 2026-04-10 Yunxiang Peng , Mengmeng Ma , Ziyu Yao , Xi Peng