English
Related papers

Related papers: Shift Happens: Adjusting Classifiers

200 papers

We propose a simple, statistically principled, and theoretically justified method to improve supervised learning when the training set is not representative, a situation known as covariate shift. We build upon a well-established methodology…

Machine Learning · Statistics 2025-03-12 Maximilian Autenrieth , David A. van Dyk , Roberto Trotta , David C. Stenning

Classifier-guided diffusion models generate conditional samples by augmenting the reverse-time score with the gradient of the log-probability predicted by a probabilistic classifier. In practice, this classifier is usually obtained by…

Machine Learning · Statistics 2026-02-06 Sharan Sahu , Arisina Banerjee , Yuchen Wu

Generalization under distribution shift remains a core challenge in modern machine learning, yet existing learning bound theory is limited to narrow, idealized settings and is non-estimable from samples. In this paper, we bridge the gap…

Machine Learning · Statistics 2025-08-25 Hongbo Chen , Li Charlie Xia

In machine learning, if the training data is an unbiased sample of an underlying distribution, then the learned classification function will make accurate predictions for new samples. However, if the training data is not an unbiased sample,…

Machine Learning · Computer Science 2019-01-15 Wouter M. Kouw , Marco Loog

The phenomenon of data distribution evolving over time has been observed in a range of applications, calling the needs of adaptive learning algorithms. We thus study the problem of supervised gradual domain adaptation, where labeled data…

Machine Learning · Computer Science 2022-11-15 Jing Dong , Shiji Zhou , Baoxiang Wang , Han Zhao

For the binary prevalence quantification problem under prior probability shift, we determine the asymptotic variance of the maximum likelihood estimator. We find that it is a function of the Brier score for the regression of the class label…

Machine Learning · Statistics 2021-09-23 Dirk Tasche

An open scientific challenge is how to classify events with reliable measures of uncertainty, when we have a mechanistic model of the data-generating process but the distribution over both labels and latent nuisance parameters is different…

Machine Learning · Statistics 2024-07-02 Luca Masserano , Alex Shen , Michele Doro , Tommaso Dorigo , Rafael Izbicki , Ann B. Lee

As a fundamental problem in machine learning, dataset shift induces a paradigm to learn and transfer knowledge under changing environment. Previous methods assume the changes are induced by covariate, which is less practical for complex…

Machine Learning · Computer Science 2022-03-01 You-Wei Luo , Chuan-Xian Ren

The balanced loss is a widely adopted objective for multi-class classification under class imbalance. By assigning equal importance to all classes, regardless of their frequency, it promotes fairness and ensures that minority classes are…

Machine Learning · Computer Science 2026-01-01 Corinna Cortes , Mehryar Mohri , Yutao Zhong

Classifier predictions often rely on the assumption that new observations come from the same distribution as training data. When the underlying distribution changes, so does the optimal classification rule, and performance may degrade. We…

Methodology · Statistics 2021-09-01 Ciaran Evans , Max G'Sell

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen , Jason M. Klusowski , Michael W. Mahoney

With increased interest in adopting AI methods for clinical diagnosis, a vital step towards safe deployment of such tools is to ensure that the models not only produce accurate predictions but also do not generalize to data regimes where…

Machine Learning · Computer Science 2021-03-08 Vivek Narayanaswamy , Jayaraman J. Thiagarajan , Deepta Rajan , Andreas Spanias

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce…

Optimization and Control · Mathematics 2019-07-15 Soroosh Shafieezadeh-Abadeh , Daniel Kuhn , Peyman Mohajerin Esfahani

This paper proposes an easy-to-compute upper bound for the overlap index between two probability distributions without requiring any knowledge of the distribution models. The computation of our bound is time-efficient and memory-efficient…

Machine Learning · Computer Science 2024-12-02 Hao Fu , Prashanth Krishnamurthy , Siddharth Garg , Farshad Khorrami

The generalization ability of machine learning models degrades significantly when the test distribution shifts away from the training distribution. We investigate the problem of training models that are robust to shifts caused by changes in…

Machine Learning · Computer Science 2023-09-19 Jiaheng Wei , Harikrishna Narasimhan , Ehsan Amid , Wen-Sheng Chu , Yang Liu , Abhishek Kumar

For many applications, an ensemble of base classifiers is an effective solution. The tuning of its parameters(number of classes, amount of data on which each classifier is to be trained on, etc.) requires G, the generalization error of a…

An interesting phenomenon arises: Empirical Risk Minimization (ERM) sometimes outperforms methods specifically designed for out-of-distribution tasks. This motivates an investigation into the reasons behind such behavior beyond algorithmic…

Machine Learning · Computer Science 2026-01-21 Hong Zheng , Fei Teng

Learning with identical train and test distributions has been extensively investigated both practically and theoretically. Much remains to be understood, however, in statistical learning under distribution shifts. This paper focuses on a…

Machine Learning · Computer Science 2024-11-01 Omar Montasser , Han Shao , Emmanuel Abbe

Proposition. Let $f$ be a predictor trained on a distribution $P$ and evaluated on a shifted distribution $Q$. Under verifiable regularity and complexity constraints, the excess risk under shift admits an explicit upper bound determined by…

Machine Learning · Computer Science 2026-02-23 Chandrasekhar Gokavarapu , Sudhakar Gadde , Y. Rajasekhar , S. R. Bhargava

A stream of algorithmic advances has steadily increased the popularity of the Bayesian approach as an inference paradigm, both from the theoretical and applied perspective. Even with apparent successes in numerous application fields, a…

Methodology · Statistics 2020-07-10 Owen Thomas , Henri Pesonen , Jukka Corander
‹ Prev 1 2 3 10 Next ›