English
Related papers

Related papers: Performance Prediction Under Dataset Shift

200 papers

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under…

Machine Learning · Computer Science 2020-07-08 Ching-Yao Chuang , Antonio Torralba , Stefanie Jegelka

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

Most machine learning models operate under the assumption that the training, testing and deployment data is independent and identically distributed (i.i.d.). This assumption doesn't generally hold true in a natural setting. Usually, the…

Machine Learning · Computer Science 2021-12-14 Kumud Lakara , Akshat Bhandari , Pratinav Seth , Ujjwal Verma

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to…

Machine Learning · Computer Science 2022-10-25 Carlos Mougan , Klaus Broelemann , Gjergji Kasneci , Thanassis Tiropanis , Steffen Staab

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar , Zhicheng Guo , Cynthia Rudin

A key component of automated algorithm selection and configuration, which in most cases are performed using supervised machine learning (ML) methods is a good-performing predictive model. The predictive model uses the feature representation…

Modern machine learning methods including deep learning have achieved great success in predictive accuracy for supervised learning tasks, but may still fall short in giving useful estimates of their predictive {\em uncertainty}. Quantifying…

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

Machine learning models are typically deployed in a test setting that differs from the training setting, potentially leading to decreased model performance because of domain shift. If we could estimate the performance that a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Zeju Li , Konstantinos Kamnitsas , Mobarakol Islam , Chen Chen , Ben Glocker

Predictive models that generalize well under distributional shift are often desirable and sometimes crucial to building robust and reliable machine learning applications. We focus on distributional shift that arises in causal inference from…

Machine Learning · Statistics 2018-02-27 Fredrik D. Johansson , Nathan Kallus , Uri Shalit , David Sontag

Use of machine learning to perform database operations, such as indexing, cardinality estimation, and sorting, is shown to provide substantial performance benefits. However, when datasets change and data distribution shifts, empirical…

Machine Learning · Computer Science 2024-11-12 Sepanta Zeighami , Cyrus Shahahbi

Clinical machine learning models experience significantly degraded performance in datasets not seen during training, e.g., new hospitals or populations. Recent developments in domain generalization offer a promising solution to this problem…

Machine Learning · Computer Science 2021-04-16 Haoran Zhang , Natalie Dullerud , Laleh Seyyed-Kalantari , Quaid Morris , Shalmali Joshi , Marzyeh Ghassemi

Understanding model performance on unlabeled data is a fundamental challenge of developing, deploying, and maintaining AI systems. Model performance is typically evaluated using test sets or periodic manual quality assessments, both of…

Machine Learning · Computer Science 2020-12-17 Benjamin Elder , Matthew Arnold , Anupama Murthi , Jiri Navratil

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

The hypothesis that computational models can be reliable enough to be adopted in prognosis and patient care is revolutionizing healthcare. Deep learning, in particular, has been a game changer in building predictive models, thus leading to…

Machine Learning · Statistics 2019-06-17 Jayaraman J. Thiagarajan , Deepta Rajan , Prasanna Sattigeri

Supervised learning techniques typically assume training data originates from the target population. Yet, in reality, dataset shift frequently arises, which, if not adequately taken into account, may decrease the performance of their…

The dynamicity of real-world systems poses a significant challenge to deployed predictive machine learning (ML) models. Changes in the system on which the ML model has been trained may lead to performance degradation during the system's…

Machine Learning · Computer Science 2022-03-22 Firas Bayram , Bestoun S. Ahmed , Andreas Kassler

Machine learning models are often brittle on production data despite achieving high accuracy on benchmark datasets. Benchmark datasets have traditionally served dual purposes: first, benchmarks offer a standard on which machine learning…

Machine Learning · Computer Science 2022-09-26 Matthew Groh

In the software development process, model transformation is increasingly assimilated. However, systems being developed with model transformation sometimes grow in size and become complex. Meanwhile, the performance of model transformation…

Software Engineering · Computer Science 2020-04-21 Vijayshree Vijayshree , Markus Frank , Steffen Becker

Reliable generalization metrics are fundamental to the evaluation of machine learning models. Especially in high-stakes applications where labeled target data are scarce, evaluation of models' generalization performance under distribution…

Machine Learning · Computer Science 2026-04-10 Yunxiang Peng , Mengmeng Ma , Ziyu Yao , Xi Peng
‹ Prev 1 2 3 10 Next ›