Related papers: Evaluating Predictive Uncertainty and Robustness t…

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for…

Machine Learning · Computer Science 2022-02-14 Andrey Malinin , Neil Band , Ganshin , Alexander , German Chesnokov , Yarin Gal , Mark J. F. Gales , Alexey Noskov , Andrey Ploskonosov , Liudmila Prokhorenkova , Ivan Provilkov , Vatsal Raina , Vyas Raina , Roginskiy , Denis , Mariya Shmatova , Panos Tigas , Boris Yangel

Robust Predictive Modeling Under Unseen Data Distribution Shifts: A Methodological Commentary

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

How Reliable is Your Regression Model's Uncertainty Under Real-World Distribution Shifts?

Many important computer vision applications are naturally formulated as regression problems. Within medical imaging, accurate regression models have the potential to automate various tasks, helping to lower costs and improve patient…

Machine Learning · Computer Science 2023-11-08 Fredrik K. Gustafsson , Martin Danelljan , Thomas B. Schön

Robust Validation: Confident Predictions Even When Distributions Shift

While the traditional viewpoint in machine learning and statistics assumes training and testing samples come from the same population, practice belies this fiction. One strategy -- coming from robust statistics and optimization -- is thus…

Machine Learning · Statistics 2024-07-08 Maxime Cauchois , Suyash Gupta , Alnur Ali , John C. Duchi

Shifts 2.0: Extending The Dataset of Real Distributional Shifts

Distributional shift, or the mismatch between training and deployment data, is a significant obstacle to the usage of machine learning in high-stakes industrial applications, such as autonomous driving and medicine. This creates a need to…

Machine Learning · Computer Science 2022-09-16 Andrey Malinin , Andreas Athanasopoulos , Muhamed Barakovic , Meritxell Bach Cuadra , Mark J. F. Gales , Cristina Granziera , Mara Graziani , Nikolay Kartashev , Konstantinos Kyriakopoulos , Po-Jui Lu , Nataliia Molchanova , Antonis Nikitakis , Vatsal Raina , Francesco La Rosa , Eli Sivena , Vasileios Tsarsitalidis , Efi Tsompopoulou , Elena Volf

Can You Trust Your Model's Uncertainty? Evaluating Predictive Uncertainty Under Dataset Shift

Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive {\em uncertainty}. Quantifying…

Machine Learning · Statistics 2019-12-24 Yaniv Ovadia , Emily Fertig , Jie Ren , Zachary Nado , D Sculley , Sebastian Nowozin , Joshua V. Dillon , Balaji Lakshminarayanan , Jasper Snoek

Evaluating Model Robustness and Stability to Dataset Shift

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria

Tracking Adaptation Time: Metrics for Temporal Distribution Shift

Evaluating robustness under temporal distribution shift remains an open challenge. Existing metrics quantify the average decline in performance, but fail to capture how models adapt to evolving data. As a result, temporal degradation is…

Machine Learning · Computer Science 2026-04-09 Lorenzo Iovine , Giacomo Ziffer , Emanuele Della Valle

Robust Calibration For Improved Weather Prediction Under Distributional Shift

In this paper, we present results on improving out-of-domain weather prediction and uncertainty estimation as part of the \texttt{Shifts Challenge on Robustness and Uncertainty under Real-World Distributional Shift} challenge. We find that…

Machine Learning · Computer Science 2024-01-10 Sankalp Gilda , Neel Bhandari , Wendy Mak , Andrea Panizza

Explanation Shift: How Did the Distribution Shift Impact the Model?

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

Assessing the Impact of Distribution Shift on Reinforcement Learning Performance

Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful…

Machine Learning · Computer Science 2024-02-07 Ted Fujimoto , Joshua Suetterlein , Samrat Chatterjee , Auroop Ganguly

Performance Prediction Under Dataset Shift

ML models deployed in production often have to face unknown domain changes, fundamentally different from their training settings. Performance prediction models carry out the crucial task of measuring the impact of these changes on model…

Machine Learning · Computer Science 2022-06-23 Simona Maggio , Victor Bouvier , Léo Dreyfus-Schmidt

Distributionally robust and generalizable inference

We discuss recently developed methods that quantify the stability and generalizability of statistical findings under distributional changes. In many practical problems, the data is not drawn i.i.d. from the target population. For example,…

Methodology · Statistics 2023-10-05 Dominik Rothenhäusler , Peter Bühlmann

Multiply Robust Estimation for Local Distribution Shifts with Multiple Domains

Distribution shifts are ubiquitous in real-world machine learning applications, posing a challenge to the generalization of models trained on one data distribution to another. We focus on scenarios where data distributions vary across…

Machine Learning · Statistics 2024-06-05 Steven Wilkins-Reeves , Xu Chen , Qi Ma , Christine Agarwal , Aude Hofleitner

Quantifying Distribution Shifts and Uncertainties for Enhanced Model Robustness in Machine Learning Applications

Distribution shifts, where statistical properties differ between training and test datasets, present a significant challenge in real-world machine learning applications where they directly impact model generalization and robustness. In this…

Machine Learning · Computer Science 2024-05-06 Vegard Flovik

Measuring Robustness to Natural Distribution Shifts in Image Classification

We study how robust current ImageNet models are to distribution shifts arising from natural variations in datasets. Most research on robustness focuses on synthetic image perturbations (noise, simulated weather artifacts, adversarial…

Machine Learning · Computer Science 2020-09-15 Rohan Taori , Achal Dave , Vaishaal Shankar , Nicholas Carlini , Benjamin Recht , Ludwig Schmidt

Beyond Point Estimates: Distributional Uncertainty in Machine Learning Performance Evaluation

Machine learning models are often evaluated using point estimates of performance metrics such as accuracy, F1 score, or mean squared error. Such summaries fail to capture the inherent variability induced by stochastic elements of the…

Machine Learning · Computer Science 2026-05-13 Christoph Lehmann , Yahor Paromau

The Many Faces of Robustness: A Critical Analysis of Out-of-Distribution Generalization

We introduce four new real-world distribution shift datasets consisting of changes in image style, image blurriness, geographic location, camera operation, and more. With our new datasets, we take stock of previously proposed methods for…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Dan Hendrycks , Steven Basart , Norman Mu , Saurav Kadavath , Frank Wang , Evan Dorundo , Rahul Desai , Tyler Zhu , Samyak Parajuli , Mike Guo , Dawn Song , Jacob Steinhardt , Justin Gilmer

A Fine-Grained Analysis on Distribution Shift

Robustness to distribution shifts is critical for deploying machine learning models in the real world. Despite this necessity, there has been little work in defining the underlying mechanisms that cause these shifts and evaluating the…

Machine Learning · Computer Science 2021-11-29 Olivia Wiles , Sven Gowal , Florian Stimberg , Sylvestre Alvise-Rebuffi , Ira Ktena , Krishnamurthy Dvijotham , Taylan Cemgil

Predicting with Confidence on Unseen Distributions

Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting…

Machine Learning · Computer Science 2021-08-23 Devin Guillory , Vaishaal Shankar , Sayna Ebrahimi , Trevor Darrell , Ludwig Schmidt