English
Related papers

Related papers: Evaluating Predictive Uncertainty and Robustness t…

200 papers

There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for…

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

Many important computer vision applications are naturally formulated as regression problems. Within medical imaging, accurate regression models have the potential to automate various tasks, helping to lower costs and improve patient…

Machine Learning · Computer Science 2023-11-08 Fredrik K. Gustafsson , Martin Danelljan , Thomas B. Schön

While the traditional viewpoint in machine learning and statistics assumes training and testing samples come from the same population, practice belies this fiction. One strategy -- coming from robust statistics and optimization -- is thus…

Machine Learning · Statistics 2024-07-08 Maxime Cauchois , Suyash Gupta , Alnur Ali , John C. Duchi

Distributional shift, or the mismatch between training and deployment data, is a significant obstacle to the usage of machine learning in high-stakes industrial applications, such as autonomous driving and medicine. This creates a need to…

Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive {\em uncertainty}. Quantifying…

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria

Evaluating robustness under temporal distribution shift remains an open challenge. Existing metrics quantify the average decline in performance, but fail to capture how models adapt to evolving data. As a result, temporal degradation is…

Machine Learning · Computer Science 2026-04-09 Lorenzo Iovine , Giacomo Ziffer , Emanuele Della Valle

In this paper, we present results on improving out-of-domain weather prediction and uncertainty estimation as part of the \texttt{Shifts Challenge on Robustness and Uncertainty under Real-World Distributional Shift} challenge. We find that…

Machine Learning · Computer Science 2024-01-10 Sankalp Gilda , Neel Bhandari , Wendy Mak , Andrea Panizza

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

Research in machine learning is making progress in fixing its own reproducibility crisis. Reinforcement learning (RL), in particular, faces its own set of unique challenges. Comparison of point estimates, and plots that show successful…

Machine Learning · Computer Science 2024-02-07 Ted Fujimoto , Joshua Suetterlein , Samrat Chatterjee , Auroop Ganguly

ML models deployed in production often have to face unknown domain changes, fundamentally different from their training settings. Performance prediction models carry out the crucial task of measuring the impact of these changes on model…

Machine Learning · Computer Science 2022-06-23 Simona Maggio , Victor Bouvier , Léo Dreyfus-Schmidt

We discuss recently developed methods that quantify the stability and generalizability of statistical findings under distributional changes. In many practical problems, the data is not drawn i.i.d. from the target population. For example,…

Methodology · Statistics 2023-10-05 Dominik Rothenhäusler , Peter Bühlmann

Distribution shifts are ubiquitous in real-world machine learning applications, posing a challenge to the generalization of models trained on one data distribution to another. We focus on scenarios where data distributions vary across…

Machine Learning · Statistics 2024-06-05 Steven Wilkins-Reeves , Xu Chen , Qi Ma , Christine Agarwal , Aude Hofleitner

Distribution shifts, where statistical properties differ between training and test datasets, present a significant challenge in real-world machine learning applications where they directly impact model generalization and robustness. In this…

Machine Learning · Computer Science 2024-05-06 Vegard Flovik

We study how robust current ImageNet models are to distribution shifts arising from natural variations in datasets. Most research on robustness focuses on synthetic image perturbations (noise, simulated weather artifacts, adversarial…

Machine Learning · Computer Science 2020-09-15 Rohan Taori , Achal Dave , Vaishaal Shankar , Nicholas Carlini , Benjamin Recht , Ludwig Schmidt

Machine learning models are often evaluated using point estimates of performance metrics such as accuracy, F1 score, or mean squared error. Such summaries fail to capture the inherent variability induced by stochastic elements of the…

Machine Learning · Computer Science 2026-05-13 Christoph Lehmann , Yahor Paromau

We introduce four new real-world distribution shift datasets consisting of changes in image style, image blurriness, geographic location, camera operation, and more. With our new datasets, we take stock of previously proposed methods for…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Dan Hendrycks , Steven Basart , Norman Mu , Saurav Kadavath , Frank Wang , Evan Dorundo , Rahul Desai , Tyler Zhu , Samyak Parajuli , Mike Guo , Dawn Song , Jacob Steinhardt , Justin Gilmer

Robustness to distribution shifts is critical for deploying machine learning models in the real world. Despite this necessity, there has been little work in defining the underlying mechanisms that cause these shifts and evaluating the…

Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting…

Machine Learning · Computer Science 2021-08-23 Devin Guillory , Vaishaal Shankar , Sayna Ebrahimi , Trevor Darrell , Ludwig Schmidt
‹ Prev 1 2 3 10 Next ›