English
Related papers

Related papers: Diagnosing Model Performance Under Distribution Sh…

200 papers

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In practice, new input data tend to come without target labels. Then, state-of-the-art techniques model input data distributions…

Machine Learning · Computer Science 2023-09-08 Carlos Mougan , Klaus Broelemann , David Masip , Gjergji Kasneci , Thanassis Thiropanis , Steffen Staab

As input data distributions evolve, the predictive performance of machine learning models tends to deteriorate. In the past, predictive performance was considered the key indicator to monitor. However, explanation aspects have come to…

Machine Learning · Computer Science 2022-10-25 Carlos Mougan , Klaus Broelemann , Gjergji Kasneci , Thanassis Tiropanis , Steffen Staab

Machine learning models frequently experience performance drops under distribution shifts. The underlying cause of such shifts may be multiple simultaneous factors such as changes in data quality, differences in specific covariate…

Machine Learning · Computer Science 2023-06-07 Haoran Zhang , Harvineet Singh , Marzyeh Ghassemi , Shalmali Joshi

While previous distribution shift detection approaches can identify if a shift has occurred, these approaches cannot localize which specific features have caused a distribution shift -- a critical step in diagnosing or fixing any underlying…

Machine Learning · Computer Science 2021-07-16 Sean Kulinski , Saurabh Bagchi , David I. Inouye

Most machine learning models operate under the assumption that the training, testing and deployment data is independent and identically distributed (i.i.d.). This assumption doesn't generally hold true in a natural setting. Usually, the…

Machine Learning · Computer Science 2021-12-14 Kumud Lakara , Akshat Bhandari , Pratinav Seth , Ujjwal Verma

Distribution shift is a key challenge for predictive models in practice, creating the need to identify potentially harmful shifts in advance of deployment. Existing work typically defines these worst-case shifts as ones that most degrade…

Machine Learning · Computer Science 2024-07-08 Kevin Ren , Yewon Byun , Bryan Wilder

Machine learning models are typically deployed in a test setting that differs from the training setting, potentially leading to decreased model performance because of domain shift. If we could estimate the performance that a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Zeju Li , Konstantinos Kamnitsas , Mobarakol Islam , Chen Chen , Ben Glocker

When machine learning models are deployed on a test distribution different from the training distribution, they can perform poorly, but overestimate their performance. In this work, we aim to better estimate a model's performance under…

Machine Learning · Computer Science 2020-07-08 Ching-Yao Chuang , Antonio Torralba , Stefanie Jegelka

Distribution shift is a common situation in machine learning tasks, where the data used for training a model is different from the data the model is applied to in the real world. This issue arises across multiple technical settings: from…

Machine Learning · Computer Science 2024-05-24 Nicolas Acevedo , Carmen Cortez , Chris Brooks , Rene Kizilcec , Renzhe Yu

Distribution testing deals with what information can be deduced about an unknown distribution over $\{1,\ldots,n\}$, where the algorithm is only allowed to obtain a relatively small number of independent samples from the distribution. In…

Computational Complexity · Computer Science 2016-09-23 Eldar Fischer , Oded Lachish , Yadu Vasudev

Pre-training is a widely used approach to develop models that are robust to distribution shifts. However, in practice, its effectiveness varies: fine-tuning a pre-trained model improves robustness significantly in some cases but not at all…

Machine Learning · Computer Science 2024-12-24 Benjamin Cohen-Wang , Joshua Vendrow , Aleksander Madry

Trajectory prediction models often fail in real-world automated driving due to distributional shifts between training and test conditions. Such distributional shifts, whether behavioural or environmental, pose a critical risk by causing the…

Machine Learning · Computer Science 2026-04-15 Michele De Vita , Julian Wiederer , Vasileios Belagiannis

Machine learning (ML) models frequently experience performance degradation when deployed in new contexts. Such degradation is rarely uniform: some subgroups may suffer large performance decay while others may not. Understanding where and…

Machine Learning · Computer Science 2025-06-03 Harvineet Singh , Fan Xia , Alexej Gossmann , Andrew Chuang , Julian C. Hong , Jean Feng

ML models deployed in production often have to face unknown domain changes, fundamentally different from their training settings. Performance prediction models carry out the crucial task of measuring the impact of these changes on model…

Machine Learning · Computer Science 2022-06-23 Simona Maggio , Victor Bouvier , Léo Dreyfus-Schmidt

Use of machine learning to perform database operations, such as indexing, cardinality estimation, and sorting, is shown to provide substantial performance benefits. However, when datasets change and data distribution shifts, empirical…

Machine Learning · Computer Science 2024-11-12 Sepanta Zeighami , Cyrus Shahahbi

In numerous predictive scenarios, the predictive model affects the sampling distribution; for example, job applicants often meticulously craft their resumes to navigate through a screening systems. Such shifts in distribution are…

Machine Learning · Statistics 2025-04-14 Daniele Bracale , Subha Maity , Moulinath Banerjee , Yuekai Sun

There is growing evidence that converting targets to soft targets in supervised learning can provide considerable gains in performance. Much of this work has considered classification, converting hard zero-one values to soft labels---such…

Machine Learning · Statistics 2018-06-13 Ehsan Imani , Martha White

Modelling partial differential equations (PDEs) is of crucial importance in science and engineering, and it includes tasks ranging from forecasting to inverse problems, such as data assimilation. However, most previous numerical and machine…

Recent work has shown that the performance of machine learning models can vary substantially when models are evaluated on data drawn from a distribution that is close to but different from the training distribution. As a result, predicting…

Machine Learning · Computer Science 2021-08-23 Devin Guillory , Vaishaal Shankar , Sayna Ebrahimi , Trevor Darrell , Ludwig Schmidt

Machine learning models are often deployed in different settings than they were trained and validated on, posing a challenge to practitioners who wish to predict how well the deployed model will perform on a target distribution. If an…

Machine Learning · Computer Science 2022-04-12 Mayee Chen , Karan Goel , Nimit S. Sohoni , Fait Poms , Kayvon Fatahalian , Christopher Ré
‹ Prev 1 2 3 10 Next ›