Related papers: Robust Optimal Classification Trees under Noisy La…

Multiclass Optimal Classification Trees with SVM-splits

In this paper we present a novel mathematical optimization-based methodology to construct tree-shaped classification rules for multiclass instances. Our approach consists of building Classification Trees in which, except for the leaf nodes,…

Optimization and Control · Mathematics 2021-11-17 Víctor Blanco, Alberto Japón, Justo Puerto

Robust Learning Under Label Noise With Iterative Noise-Filtering

We consider the problem of training a model under the presence of label noise. Current approaches identify samples with potentially incorrect labels and reduce their influence on the learning process by either assigning lower weights to…

Machine Learning · Computer Science 2019-06-04 Duc Tam Nguyen, Thi-Phuong-Nhung Ngo, Zhongyu Lou, Michael Klar, Laura Beggel, Thomas Brox

A Mathematical Programming approach to Binary Supervised Classification with Label Noise

In this paper we propose novel methodologies to construct Support Vector Machine-based classifiers that take into account that label noise occurs in the training sample. We propose different alternatives based on solving Mixed Integer…

Machine Learning · Computer Science 2020-04-22 Víctor Blanco, Alberto Japón, Justo Puerto

On the Robustness of Decision Tree Learning under Label Noise

In most practical problems of classifier learning, the training data suffers from label noise. Hence, it is important to understand how robust a learning algorithm is to such label noise. This paper presents some theoretical analysis to…

Machine Learning · Computer Science 2016-08-29 Aritra Ghosh, Naresh Manwani, P. S. Sastry

Exploring Loss Design Techniques For Decision Tree Robustness To Label Noise

In the real world, data is often noisy, affecting not only the quality of features but also the accuracy of labels. Current research on mitigating label errors stems primarily from advances in deep learning, and a gap exists in exploring…

Machine Learning · Computer Science 2024-05-29 Lukasz Sztukiewicz, Jack Henry Good, Artur Dubrawski

Mixed-Integer Linear Optimization for Semi-Supervised Optimal Classification Trees

Decision trees are one of the most popular methods for solving classification problems, mainly because of their good interpretability properties. Moreover, due to advances in recent years in mixed-integer optimization, several models have…

Optimization and Control · Mathematics 2026-05-29 Jan Pablo Burgard, Maria Eduarda Pinheiro, Martin Schmidt

Towards Robustness to Label Noise in Text Classification via Noise Modeling

Large datasets in NLP suffer from noisy labels, due to erroneous automatic and human annotation procedures. We study the problem of text classification with label noise, and aim to capture this noise through an auxiliary noise model over…

Computation and Language · Computer Science 2022-06-22 Siddhant Garg, Goutham Ramakrishnan, Varun Thumbe

Learning with Noisy Labels: Interconnection of Two Expectation-Maximizations

Labor-intensive labeling becomes a bottleneck in developing computer vision algorithms based on deep learning. For this reason, dealing with imperfect labels has increasingly gained attention and has become an active field of study. We…

Computer Vision and Pattern Recognition · Computer Science 2024-01-10 Heewon Kim, Hyun Sung Chang, Kiho Cho, Jaeyun Lee, Bohyung Han

Multiple Kernel Learning from Noisy Labels by Stochastic Programming

We study the problem of multiple kernel learning from noisy labels. This is in contrast to most of the previous studies on multiple kernel learning that mainly focus on developing efficient algorithms and assume perfectly labeled training…

Machine Learning · Computer Science 2012-06-22 Tianbao Yang, Mehrdad Mahdavi, Rong Jin, Lijun Zhang, Yang Zhou

Robustness of Accuracy Metric and its Inspirations in Learning with Noisy Labels

For multi-class classification under class-conditional label noise, we prove that the accuracy metric itself can be robust. We concretize this finding's inspiration in two essential aspects: training and validation, with which we address…

Machine Learning · Computer Science 2020-12-09 Pengfei Chen, Junjie Ye, Guangyong Chen, Jingwei Zhao, Pheng-Ann Heng

Error-Bounded Correction of Noisy Labels

To collect large scale annotated data, it is inevitable to introduce label noise, i.e., incorrect class labels. To be robust against label noise, many successful methods rely on the noisy classifiers (i.e., models trained on the noisy…

Computer Vision and Pattern Recognition · Computer Science 2020-11-23 Songzhu Zheng, Pengxiang Wu, Aman Goswami, Mayank Goswami, Dimitris Metaxas, Chao Chen

Distilling Effective Supervision from Severe Label Noise

Collecting large-scale data with clean labels for supervised training of neural networks is practically challenging. Although noisy labels are usually cheap to acquire, existing methods suffer a lot from label noise. This paper targets at…

Machine Learning · Computer Science 2020-06-16 Zizhao Zhang, Han Zhang, Sercan O. Arik, Honglak Lee, Tomas Pfister

Optimized Gradient Clipping for Noisy Label Learning

Previous research has shown that constraining the gradient of loss function with respect to model-predicted probabilities can enhance the model robustness against noisy labels. These methods typically specify a fixed optimal threshold for…

Machine Learning · Computer Science 2024-12-24 Xichen Ye, Yifan Wu, Weizhong Zhang, Xiaoqiang Li, Yifan Chen, Cheng Jin

Robust Feature Learning Against Noisy Labels

Supervised learning of deep neural networks heavily relies on large-scale datasets annotated by high-quality labels. In contrast, mislabeled samples can significantly degrade the generalization of models and result in memorizing samples,…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Tsung-Ming Tai, Yun-Jie Jhang, Wen-Jyi Hwang

Learning from Noisy Labels with Deep Neural Networks: A Survey

Deep learning has achieved remarkable success in numerous domains with help from large amounts of big data. However, the quality of data labels is a concern because of the lack of high-quality labels in many real-world scenarios. As noisy…

Machine Learning · Computer Science 2022-03-11 Hwanjun Song, Minseok Kim, Dongmin Park, Yooju Shin, Jae-Gil Lee

Learning from Noisy Labels with Noise Modeling Network

Multi-label image classification has generated significant interest in recent years and the performance of such systems often suffers from the not so infrequent occurrence of incorrect or missing labels in the training data. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2020-05-05 Zhuolin Jiang, Jan Silovsky, Man-Hung Siu, William Hartmann, Herbert Gish, Sancar Adali

Robustness and Reliability When Training With Noisy Labels

Labelling of data for supervised learning can be costly and time-consuming and the risk of incorporating label noise in large data sets is imminent. When training a flexible discriminative model using a strictly proper loss, such noise will…

Machine Learning · Statistics 2022-05-13 Amanda Olmin, Fredrik Lindsten

Semi-Supervised Cascaded Clustering for Classification of Noisy Label Data

The performance of supervised classification techniques often deteriorates when the data has noisy labels. Even the semi-supervised classification approaches have largely focused only on the problem of handling missing labels. Most of the…

Machine Learning · Computer Science 2022-05-05 Ashit Gupta, Anirudh Deodhar, Tathagata Mukherjee, Venkataramana Runkana

A robust approach to model-based classification based on trimming and constraints

In a standard classification framework a set of trustworthy learning data are employed to build a decision rule, with the final aim of classifying unlabelled units belonging to the test set. Therefore, unreliable labelled observations,…

Applications · Statistics 2019-11-20 Andrea Cappozzo, Francesca Greselin, Thomas Brendan Murphy

Optimal Decision Tree and Adaptive Submodular Ranking with Noisy Outcomes

In pool-based active learning, the learner is given an unlabeled data set and aims to efficiently learn the unknown hypothesis by querying the labels of the data points. This can be formulated as the classical Optimal Decision Tree (ODT)…

Machine Learning · Computer Science 2024-08-01 Su Jia, Fatemeh Navidi, Viswanath Nagarajan, R. Ravi