English
Related papers

Related papers: Robust Binary Hypothesis Testing Under Contaminate…

200 papers

In supervised learning, automatically assessing the quality of the labels before any learning takes place remains an open research question. In certain particular cases, hypothesis testing procedures have been proposed to assess whether a…

Machine Learning · Computer Science 2023-12-19 Weisong Yang , Rafael Poyiadzi , Niall Twomey , Raul Santos Rodriguez

Learning with label dependent label noise has been extensively explored in both theory and practice; however, dealing with instance (i.e., feature) and label dependent label noise continues to be a challenging task. The difficulty arises…

Machine Learning · Statistics 2023-06-07 Hyungki Im , Paul Grigas

In this paper we provide machine learning practitioners with tools to answer the question: is there class-conditional noise in my labels? In particular, we present hypothesis tests to check whether a given dataset of instance-label pairs…

Machine Learning · Computer Science 2021-06-02 Rafael Poyiadzi , Weisong Yang , Niall Twomey , Raul Santos-Rodriguez

We study online classification of features into labels with general hypothesis classes. In our setting, true labels are determined by some function within the hypothesis class but are corrupted by unknown stochastic noise, and the features…

Machine Learning · Computer Science 2024-09-27 Changlong Wu , Ananth Grama , Wojciech Szpankowski

In this paper, we study a classification problem in which sample labels are randomly corrupted. In this scenario, there is an unobservable sample with noise-free labels. However, before being observed, the true labels are independently…

Machine Learning · Statistics 2015-07-21 Tongliang Liu , Dacheng Tao

We investigate the problem of classification in the presence of unknown class-conditional label noise in which the labels observed by the learner have been corrupted with some unknown class dependent probability. In order to obtain finite…

Machine Learning · Statistics 2019-06-11 Henry W J Reeve , Ata Kaban

The problem of robust binary hypothesis testing is studied. Under both hypotheses, the data-generating distributions are assumed to belong to uncertainty sets constructed through moments; in particular, the sets contain distributions whose…

Statistics Theory · Mathematics 2024-01-09 Akshayaa Magesh , Zhongchang Sun , Venugopal V. Veeravalli , Shaofeng Zou

This paper considers the design of a minimax test for two hypotheses where the actual probability densities of the observations are located in neighborhoods obtained by placing a bound on the relative entropy between actual and nominal…

Information Theory · Computer Science 2016-11-18 Bernard C. Levy

We study the robustness of conformal prediction, a powerful tool for uncertainty quantification, to label noise. Our analysis tackles both regression and classification problems, characterizing when and how it is possible to construct…

Machine Learning · Computer Science 2024-11-27 Bat-Sheva Einbinder , Shai Feldman , Stephen Bates , Anastasios N. Angelopoulos , Asaf Gendler , Yaniv Romano

Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will…

Machine Learning · Statistics 2022-05-13 Amanda Olmin , Fredrik Lindsten

Noisy labels damage the performance of deep networks. For robust learning, a prominent two-stage pipeline alternates between eliminating possible incorrect labels and semi-supervised training. However, discarding part of noisy labels could…

Machine Learning · Computer Science 2023-01-09 Mingcai Chen , Hao Cheng , Yuntao Du , Ming Xu , Wenyu Jiang , Chongjun Wang

Positive-unlabeled learning (PU learning) is known as a special case of semi-supervised binary classification where only a fraction of positive examples are labeled. The challenge is then to find the correct classifier despite this lack of…

Statistics Theory · Mathematics 2022-01-19 Olivier Coudray , Christine Keribin , Pascal Massart , Patrick Pamphile

Large datasets in NLP suffer from noisy labels, due to erroneous automatic and human annotation procedures. We study the problem of text classification with label noise, and aim to capture this noise through an auxiliary noise model over…

Computation and Language · Computer Science 2022-06-22 Siddhant Garg , Goutham Ramakrishnan , Varun Thumbe

In a binary classification problem where the goal is to fit an accurate predictor, the presence of corrupted labels in the training data set may create an additional challenge. However, in settings where likelihood maximization is poorly…

Statistics Theory · Mathematics 2021-06-18 Yonghoon Lee , Rina Foygel Barber

We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to…

Machine Learning · Computer Science 2019-06-04 Duc Tam Nguyen , Thi-Phuong-Nhung Ngo , Zhongyu Lou , Michael Klar , Laura Beggel , Thomas Brox

Binary classification is a task that involves the classification of data into one of two distinct classes. It is widely utilized in various fields. However, conventional classifiers tend to make overconfident predictions for data that…

Machine Learning · Computer Science 2025-03-13 Shoma Yokura , Akihisa Ichiki

In many real-world classification problems, the labels of training examples are randomly corrupted. Most previous theoretical work on classification with label noise assumes that the two classes are separable, that the label noise is…

Machine Learning · Statistics 2016-08-08 Gilles Blanchard , Marek Flaska , Gregory Handy , Sara Pozzi , Clayton Scott

The classical binary hypothesis testing problem is revisited. We notice that when one of the hypotheses is composite, there is an inherent difficulty in defining an optimality criterion that is both informative and well-justified. For…

Statistics Theory · Mathematics 2021-03-29 Michael Bell , Yuval Kochman

A natural way of estimating heteroscedastic label noise in regression is to model the observed (potentially noisy) target as a sample from a normal distribution, whose parameters can be learned by minimizing the negative log-likelihood.…

Machine Learning · Computer Science 2023-08-15 Erik Englesson , Amir Mehrpanah , Hossein Azizpour

In this letter, the optimality of the likelihood ratio test (LRT) is investigated for binary hypothesis testing problems in the presence of a behavioral decision-maker. By utilizing prospect theory, a behavioral decision-maker is modeled to…

Information Theory · Computer Science 2018-12-05 Sinan Gezici , Pramod K. Varshney
‹ Prev 1 2 3 10 Next ›