Related papers: Diagnosing Model Performance Under Distribution Sh…

Explanation Shift: How Did the Distribution Shift Impact the Model?

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

Explanation Shift: Detecting distribution shifts on tabular data via the explanation space

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to…

Machine Learning · Computer Science 2022-10-25 Carlos Mougan , Klaus Broelemann , Gjergji Kasneci , Thanassis Tiropanis , Steffen Staab

"Why did the Model Fail?": Attributing Model Performance Changes to Distribution Shifts

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

Feature Shift Detection: Localizing Which Features Have Shifted via Conditional Distribution Tests

While previous distribution shift detection approaches can identify if a shift has occurred, these approaches cannot localize which specific features have caused a distribution shift -- a critical step in diagnosing or fixing any underlying…

Machine Learning · Computer Science 2021-07-16 Sean Kulinski , Saurabh Bagchi , David I. Inouye

Evaluating Predictive Uncertainty and Robustness to Distributional Shift Using Real World Data

Most machine learning models operate under the assumption that the training, testing and deployment data is independent and identically distributed (i.i.d.). This assumption doesn't generally hold true in a natural setting. Usually, the…

Machine Learning · Computer Science 2021-12-14 Kumud Lakara , Akshat Bhandari , Pratinav Seth , Ujjwal Verma

Decision-Focused Evaluation of Worst-Case Distribution Shift

Distribution shift is a key challenge for predictive models in practice, creating the need to identify potentially harmful shifts in advance of deployment. Existing work typically defines these worst-case shifts as ones that most degrade…

Machine Learning · Computer Science 2024-07-08 Kevin Ren , Yewon Byun , Bryan Wilder

Estimating Model Performance under Domain Shifts with Class-Specific Confidence Scores

Machine learning models are typically deployed in a test setting that differs from the training setting, potentially leading to decreased model performance because of domain shift. If we could estimate the performance that a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Zeju Li , Konstantinos Kamnitsas , Mobarakol Islam , Chen Chen , Ben Glocker

Estimating Generalization under Distribution Shifts via Domain-Invariant Representations

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under…

Machine Learning · Computer Science 2020-07-08 Ching-Yao Chuang , Antonio Torralba , Stefanie Jegelka

Fairness Hub Technical Briefs: Definition and Detection of Distribution Shift

Distribution shift is a common situation in machine learning tasks, where the data used for training a model is different from the data the model is applied to in the real world. This issue arises across multiple technical settings: from…

Machine Learning · Computer Science 2024-05-24 Nicolas Acevedo , Carmen Cortez , Chris Brooks , Rene Kizilcec , Renzhe Yu

Improving and extending the testing of distributions for shape-restricted properties

Distribution testing deals with what information can be deduced about an unknown distribution over $\{1,\ldots,n\}$, where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In…

Computational Complexity · Computer Science 2016-09-23 Eldar Fischer , Oded Lachish , Yadu Vasudev

Ask Your Distribution Shift if Pre-Training is Right for You

Pre-training is a widely used approach to develop models that are robust to distribution shifts. However, in practice, its effectiveness varies: fine-tuning a pre-trained model improves robustness significantly in some cases but not at all…

Machine Learning · Computer Science 2024-12-24 Benjamin Cohen-Wang , Joshua Vendrow , Aleksander Madry

Forecasting the Past: Gradient-Based Distribution Shift Detection in Trajectory Prediction

Trajectory prediction models often fail in real-world automated driving due to distributional shifts between training and test conditions. Such distributional shifts, whether behavioural or environmental, pose a critical risk by causing the…

Machine Learning · Computer Science 2026-04-15 Michele De Vita , Julian Wiederer , Vasileios Belagiannis

"Who experiences large model decay and why?" A Hierarchical Framework for Diagnosing Heterogeneous Performance Drift

Machine learning (ML) models frequently experience performance degradation when deployed in new contexts. Such degradation is rarely uniform: some subgroups may suffer large performance decay while others may not. Understanding where and…

Machine Learning · Computer Science 2025-06-03 Harvineet Singh , Fan Xia , Alexej Gossmann , Andrew Chuang , Julian C. Hong , Jean Feng

Performance Prediction Under Dataset Shift

ML models deployed in production often have to face unknown domain changes, fundamentally different from their training settings. Performance prediction models carry out the crucial task of measuring the impact of these changes on model…

Machine Learning · Computer Science 2022-06-23 Simona Maggio , Victor Bouvier , Léo Dreyfus-Schmidt

Theoretical Analysis of Learned Database Operations under Distribution Shift through Distribution Learnability

Use of machine learning to perform database operations, such as indexing, cardinality estimation, and sorting, is shown to provide substantial performance benefits. However, when datasets change and data distribution shifts, empirical…

Machine Learning · Computer Science 2024-11-12 Sepanta Zeighami , Cyrus Shahahbi

Learning the Distribution Map in Reverse Causal Performative Prediction

In numerous predictive scenarios, the predictive model affects the sampling distribution; for example, job applicants often meticulously craft their resumes to navigate through a screening systems. Such shifts in distribution are…

Machine Learning · Statistics 2025-04-14 Daniele Bracale , Subha Maity , Moulinath Banerjee , Yuekai Sun

Improving Regression Performance with Distributional Losses

There is growing evidence that converting targets to soft targets in supervised learning can provide considerable gains in performance. Much of this work has considered classification, converting hard zero-one values to soft labels---such…

Machine Learning · Statistics 2018-06-13 Ehsan Imani , Martha White

On conditional diffusion models for PDE simulations

Modelling partial differential equations (PDEs) is of crucial importance in science and engineering, and it includes tasks ranging from forecasting to inverse problems, such as data assimilation. However, most previous numerical and machine…

Machine Learning · Computer Science 2024-10-23 Aliaksandra Shysheya , Cristiana Diaconu , Federico Bergamin , Paris Perdikaris , José Miguel Hernández-Lobato , Richard E. Turner , Emile Mathieu

Predicting with Confidence on Unseen Distributions

Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting…

Machine Learning · Computer Science 2021-08-23 Devin Guillory , Vaishaal Shankar , Sayna Ebrahimi , Trevor Darrell , Ludwig Schmidt

Mandoline: Model Evaluation under Distribution Shift

Machine learning models are often deployed in different settings than they were trained and validated on, posing a challenge to practitioners who wish to predict how well the deployed model will perform on a target distribution. If an…

Machine Learning · Computer Science 2022-04-12 Mayee Chen , Karan Goel , Nimit S. Sohoni , Fait Poms , Kayvon Fatahalian , Christopher Ré