Related papers: Shift Happens: Adjusting Classifiers

Stratified Learning: A General-Purpose Statistical Method for Improved Learning under Covariate Shift

We propose a simple, statistically principled, and theoretically justified method to improve supervised learning when the training set is not representative, a situation known as covariate shift. We build upon a well-established methodology…

Machine Learning · Statistics 2025-03-12 Maximilian Autenrieth , David A. van Dyk , Roberto Trotta , David C. Stenning

Provably Reliable Classifier Guidance via Cross-Entropy Control

Classifier-guided diffusion models generate conditional samples by augmenting the reverse-time score with the gradient of the log-probability predicted by a probabilistic classifier. In practice, this classifier is usually obtained by…

Machine Learning · Statistics 2026-02-06 Sharan Sahu , Arisina Banerjee , Yuchen Wu

General and Estimable Learning Bound Unifying Covariate and Concept Shifts

Generalization under distribution shift remains a core challenge in modern machine learning, yet existing learning bound theory is limited to narrow, idealized settings and is non-estimable from samples. In this paper, we bridge the gap…

Machine Learning · Statistics 2025-08-25 Hongbo Chen , Li Charlie Xia

An introduction to domain adaptation and transfer learning

In machine learning, if the training data is an unbiased sample of an underlying distribution, then the learned classification function will make accurate predictions for new samples. However, if the training data is not an unbiased sample,…

Machine Learning · Computer Science 2019-01-15 Wouter M. Kouw , Marco Loog

Algorithms and Theory for Supervised Gradual Domain Adaptation

The phenomenon of data distribution evolving over time has been observed in a range of applications, calling the needs of adaptive learning algorithms. We thus study the problem of supervised gradual domain adaptation, where labeled data…

Machine Learning · Computer Science 2022-11-15 Jing Dong , Shiji Zhou , Baoxiang Wang , Han Zhao

Minimising quantifier variance under prior probability shift

For the binary prevalence quantification problem under prior probability shift, we determine the asymptotic variance of the maximum likelihood estimator. We find that it is a function of the Brier score for the regression of the class label…

Machine Learning · Statistics 2021-09-23 Dirk Tasche

Classification under Nuisance Parameters and Generalized Label Shift in Likelihood-Free Inference

An open scientific challenge is how to classify events with reliable measures of uncertainty, when we have a mechanistic model of the data-generating process but the distribution over both labels and latent nuisance parameters is different…

Machine Learning · Statistics 2024-07-02 Luca Masserano , Alex Shen , Michele Doro , Tommaso Dorigo , Rafael Izbicki , Ann B. Lee

Generalized Label Shift Correction via Minimum Uncertainty Principle: Theory and Algorithm

As a fundamental problem in machine learning, dataset shift induces a paradigm to learn and transfer knowledge under changing environment. Previous methods assume the changes are induced by covariate, which is less practical for complex…

Machine Learning · Computer Science 2022-03-01 You-Wei Luo , Chuan-Xian Ren

Improved Balanced Classification with Theoretically Grounded Loss Functions

The balanced loss is a widely adopted objective for multi-class classification under class imbalance. By assigning equal importance to all classes, regardless of their frequency, it promotes fairness and ensures that minority classes are…

Machine Learning · Computer Science 2026-01-01 Corinna Cortes , Mehryar Mohri , Yutao Zhong

Sequential changepoint detection in classification data under label shift

Classifier predictions often rely on the assumption that new observations come from the same distribution as training data. When the underlying distribution changes, so does the optimal classification rule, and performance may degrade. We…

Methodology · Statistics 2021-09-01 Ciaran Evans , Max G'Sell

Good Classifiers are Abundant in the Interpolating Regime

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen , Jason M. Klusowski , Michael W. Mahoney

Loss Estimators Improve Model Generalization

With increased interest in adopting AI methods for clinical diagnosis, a vital step towards safe deployment of such tools is to ensure that the models not only produce accurate predictions but also do not generalize to data regimes where…

Machine Learning · Computer Science 2021-03-08 Vivek Narayanaswamy , Jayaraman J. Thiagarajan , Deepta Rajan , Andreas Spanias

Regularization via Mass Transportation

The goal of regression and classification methods in supervised learning is to minimize the empirical risk, that is, the expectation of some loss function quantifying the prediction error under the empirical distribution. When facing scarce…

Optimization and Control · Mathematics 2019-07-15 Soroosh Shafieezadeh-Abadeh , Daniel Kuhn , Peyman Mohajerin Esfahani

An Upper Bound for the Distribution Overlap Index and Its Applications

This paper proposes an easy-to-compute upper bound for the overlap index between two probability distributions without requiring any knowledge of the distribution models. The computation of our bound is time-efficient and memory-efficient…

Machine Learning · Computer Science 2024-12-02 Hao Fu , Prashanth Krishnamurthy , Siddharth Garg , Farshad Khorrami

Distributionally Robust Post-hoc Classifiers under Prior Shifts

The generalization ability of machine learning models degrades significantly when the test distribution shifts away from the training distribution. We investigate the problem of training models that are robust to shifts caused by changes in…

Machine Learning · Computer Science 2023-09-19 Jiaheng Wei , Harikrishna Narasimhan , Ehsan Amid , Wen-Sheng Chu , Yang Liu , Abhishek Kumar

Efficient Estimation of Generalization Error and Bias-Variance Components of Ensembles

For many applications, an ensemble of base classifiers is an effective solution. The tuning of its parameters(number of classes, amount of data on which each classifier is to be trained on, etc.) requires G, the generalization error of a…

Machine Learning · Computer Science 2017-11-16 Dhruv Mahajan , Vivek Gupta , S Sathiya Keerthi , Sellamanickam Sundararajan , Shravan Narayanamurthy , Rahul Kidambi

Distribution Shift Is Key to Learning Invariant Prediction

An interesting phenomenon arises: Empirical Risk Minimization (ERM) sometimes outperforms methods specifically designed for out-of-distribution tasks. This motivates an investigation into the reasons behind such behavior beyond algorithmic…

Machine Learning · Computer Science 2026-01-21 Hong Zheng , Fei Teng

Transformation-Invariant Learning and Theoretical Guarantees for OOD Generalization

Learning with identical train and test distributions has been extensively investigated both practically and theoretically. Much remains to be understood, however, in statistical learning under distribution shifts. This paper focuses on a…

Machine Learning · Computer Science 2024-11-01 Omar Montasser , Han Shao , Emmanuel Abbe

Certified Learning under Distribution Shift: Sound Verification and Identifiable Structure

Proposition. Let $f$ be a predictor trained on a distribution $P$ and evaluated on a shifted distribution $Q$. Under verifiable regularity and complexity constraints, the excess risk under shift admits an explicit upper bound determined by…

Machine Learning · Computer Science 2026-02-23 Chandrasekhar Gokavarapu , Sudhakar Gadde , Y. Rajasekhar , S. R. Bhargava

Generalised Bayes Updates with $f$-divergences through Probabilistic Classifiers

A stream of algorithmic advances has steadily increased the popularity of the Bayesian approach as an inference paradigm, both from the theoretical and applied perspective. Even with apparent successes in numerous application fields, a…

Methodology · Statistics 2020-07-10 Owen Thomas , Henri Pesonen , Jukka Corander