English
Related papers

Related papers: Robust Label Shift Quantification

200 papers

The label shift problem refers to the supervised learning setting where the train and test label distributions do not match. Existing work addressing label shift usually assumes access to an \emph{unlabelled} test sample. This sample may be…

Machine Learning · Computer Science 2021-08-18 Jingzhao Zhang , Aditya Menon , Andreas Veit , Srinadh Bhojanapalli , Sanjiv Kumar , Suvrit Sra

Quantification learning deals with the task of estimating the target label distribution under label shift. In this paper, we first present a unifying framework, distribution feature matching (DFM), that recovers as particular instances…

Machine Learning · Statistics 2023-07-04 Bastien Dussap , Gilles Blanchard , Badr-Eddine Chérief-Abdellatif

Label shift refers to the phenomenon where the prior class probability p(y) changes between the training and test distributions, while the conditional probability p(x|y) stays fixed. Label shift arises in settings like medical diagnosis,…

Machine Learning · Computer Science 2020-06-30 Amr Alexandari , Anshul Kundaje , Avanti Shrikumar

We address the challenge of minimizing true risk in multi-node distributed learning. These systems are frequently exposed to both inter-node and intra-node label shifts, which present a critical obstacle to effectively optimizing model…

Machine Learning · Computer Science 2025-02-05 Zhiyuan Wu , Changkyu Choi , Xiangcheng Cao , Volkan Cevher , Ali Ramezani-Kebrya

Investigation of machine learning algorithms robust to changes between the training and test distributions is an active area of research. In this paper we explore a special type of dataset shift which we call class-dependent domain shift.…

Machine Learning · Computer Science 2020-07-13 Tigran Galstyan , Hrant Khachatrian , Greg Ver Steeg , Aram Galstyan

The quantification problem consists of determining the prevalence of a given label in a target population. However, one often has access to the labels in a sample from the training population but not in the target population. A common…

Machine Learning · Statistics 2019-04-08 Afonso Fernandes Vaz , Rafael Izbicki , Rafael Bassi Stern

We show when maximizing a properly defined $f$-divergence measure with respect to a classifier's predictions and the supervised labels is robust with label noise. Leveraging its variational form, we derive a nice decoupling property for a…

Machine Learning · Computer Science 2021-08-20 Jiaheng Wei , Yang Liu

An assumption often made in supervised learning is that the training and testing sets have the same label distribution. However, in real-life scenarios, this assumption rarely holds. For example, medical diagnosis result distributions…

Machine Learning · Computer Science 2026-04-03 Yunrui Zhang , Gustavo Batista , Salil S. Kanhere

Based on existing ideas in the field of imprecise probabilities, we present a new approach for assessing the reliability of the individual predictions of a generative probabilistic classifier. We call this approach robustness…

Machine Learning · Computer Science 2025-04-11 Adrián Detavernier , Jasper De Bock

Under label shift, the label distribution p(y) might change but the class-conditional distributions p(x|y) do not. There are two dominant approaches for estimating the label marginal. BBSE, a moment-matching approach based on confusion…

Machine Learning · Computer Science 2020-10-20 Saurabh Garg , Yifan Wu , Sivaraman Balakrishnan , Zachary C. Lipton

We study the robustness of conformal prediction, a powerful tool for uncertainty quantification, to label noise. Our analysis tackles both regression and classification problems, characterizing when and how it is possible to construct…

Machine Learning · Computer Science 2024-11-27 Bat-Sheva Einbinder , Shai Feldman , Stephen Bates , Anastasios N. Angelopoulos , Asaf Gendler , Yaniv Romano

A fundamental question in adversarial machine learning is whether a robust classifier exists for a given task. A line of research has made some progress towards this goal by studying the concentration of measure, but we argue standard…

Machine Learning · Computer Science 2022-03-18 Xiao Zhang , David Evans

We benchmark the robustness of maximum likelihood based uncertainty estimation methods to outliers in training data for regression tasks. Outliers or noisy labels in training data results in degraded performances as well as incorrect…

Machine Learning · Computer Science 2022-02-09 Deebul S. Nair , Nico Hochgeschwender , Miguel A. Olivares-Mendez

Trustworthy deployment of ML models requires a proper measure of uncertainty, especially in safety-critical applications. We focus on uncertainty quantification (UQ) for classification problems via two avenues -- prediction sets using…

Machine Learning · Statistics 2021-07-08 Aleksandr Podkopaev , Aaditya Ramdas

We study the open-set label shift problem, where the test data may include a novel class absent from training. This setting is challenging because both the class proportions and the distribution of the novel class are not identifiable…

Methodology · Statistics 2025-09-19 Siyan Liu , Yukun Liu , Qinglong Tian , Pengfei Li , Jing Qin

In this paper, we investigate the robust models for $\Lambda$-quantiles with partial information regarding the loss distribution, where $\Lambda$-quantiles extend the classical quantiles by replacing the fixed probability level with a…

Mathematical Finance · Quantitative Finance 2025-05-28 Xia Han , Peng Liu

The parameters of the log-logistic distribution are generally estimated based on classical methods such as maximum likelihood estimation, whereas these methods usually result in severe biased estimates when the data contain outliers. In…

Methodology · Statistics 2022-09-16 Zhuanzhuan Ma , Min Wang , Chanseok Park

Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will…

Machine Learning · Statistics 2022-05-13 Amanda Olmin , Fredrik Lindsten

Label switching is a phenomenon arising in mixture model posterior inference that prevents one from meaningfully assessing posterior statistics using standard Monte Carlo procedures. This issue arises due to invariance of the posterior…

Machine Learning · Computer Science 2019-11-12 Pierre Monteiller , Sebastian Claici , Edward Chien , Farzaneh Mirzazadeh , Justin Solomon , Mikhail Yurochkin

As the volume of data continues to expand, it becomes increasingly common for data to be aggregated from multiple sources. Leveraging multiple sources for model training typically achieves better predictive performance on test datasets.…

Methodology · Statistics 2025-03-05 Congbin Xu , Chengde Qian , Zhaojun Wang , Changliang Zou
‹ Prev 1 2 3 10 Next ›