Related papers: Boosting Classifiers with Noisy Inference

Boosting in the presence of label noise

Boosting is known to be sensitive to label noise. We studied two approaches to improve AdaBoost's robustness against labelling errors. One is to employ a label-noise robust classifier as a base learner, while the other is to modify the…

Machine Learning · Computer Science 2013-09-27 Jakramate Bootkrajang, Ata Kaban

Boosting, Voting Classifiers and Randomized Sample Compression Schemes

In boosting, we aim to leverage multiple weak learners to produce a strong learner. At the center of this paradigm lies the concept of building the strong learner as a voting classifier, which outputs a weighted majority vote of the weak…

Machine Learning · Computer Science 2024-12-23 Arthur da Cunha, Kasper Green Larsen, Martin Ritzert

Advances in Boosting (Invited Talk)

Boosting is a general method of generating many simple classification rules and combining them into a single, highly accurate rule. In this talk, I will review the AdaBoost boosting algorithm and some of its underlying theory, and then look…

Machine Learning · Computer Science 2013-01-07 Robert E. Schapire

Robust and Efficient Boosting Method using the Conditional Risk

Well-known for its simplicity and effectiveness in classification, AdaBoost, however, suffers from overfitting when class-conditional distributions have significant overlap. Moreover, it is very sensitive to noise that appears in the…

Machine Learning · Statistics 2018-06-22 Zhi Xiao, Zhe Luo, Bo Zhong, Xin Dang

The Many Faces of Optimal Weak-to-Strong Learning

Boosting is an extremely successful idea, allowing one to combine multiple low accuracy classifiers into a much more accurate voting classifier. In this work, we present a new and surprisingly simple Boosting algorithm that obtains a…

Machine Learning · Computer Science 2024-09-02 Mikael Møller Høgsgaard, Kasper Green Larsen, Markus Engelund Mathiasen

ITBoost: Information-Theoretic Trust for Robust Boosting

Gradient boosting remains a strong and widely used method for tabular data learning, but its performance often degrades when training labels are noisy. This behavior is largely related to the way boosting algorithms emphasize samples with…

Machine Learning · Computer Science 2026-05-12 Ye Su, Longlong Zhao, Diego Garcia-Gil, Jipeng Guo, Gangchun Zhang, Jinxin Chen, Jinsong Chen

Revisiting Agnostic Boosting

Boosting is a key method in statistical learning, allowing for converting weak learners into strong ones. While well studied in the realizable case, the statistical properties of weak-to-strong learning remain less understood in the…

Machine Learning · Computer Science 2026-01-01 Arthur da Cunha, Mikael Møller Høgsgaard, Andrea Paudice, Yuxin Sun

A Resilient Distributed Boosting Algorithm

Given a learning task where the data is distributed among several parties, communication is one of the fundamental resources which the parties would like to minimize. We present a distributed boosting algorithm which is resilient to a…

Machine Learning · Computer Science 2022-06-14 Yuval Filmus, Idan Mehalel, Shay Moran

Untangling AdaBoost-based Cost-Sensitive Classification. Part II: Empirical Analysis

A lot of approaches, each following a different strategy, have been proposed in the literature to provide AdaBoost with cost-sensitive properties. In the first part of this series of two papers, we have presented these algorithms in a…

Computer Vision and Pattern Recognition · Computer Science 2016-07-25 Iago Landesa-Vázquez, José Luis Alba-Castro

Untangling AdaBoost-based Cost-Sensitive Classification. Part I: Theoretical Perspective

Boosting algorithms have been widely used to tackle a plethora of problems. In the last few years, a lot of approaches have been proposed to provide standard AdaBoost with cost-sensitive capabilities, each with a different focus. However,…

Computer Vision and Pattern Recognition · Computer Science 2016-07-25 Iago Landesa-Vázquez, José Luis Alba-Castro

Analysis of classifiers robust to noisy labels

We explore contemporary robust classification algorithms for overcoming class-dependent labelling noise: Forward, Importance Re-weighting and T-revision. The classifiers are trained and evaluated on class-conditional random label noise data…

Machine Learning · Computer Science 2021-06-02 Alex Díaz, Damian Steele

Vote-boosting ensembles

Vote-boosting is a sequential ensemble learning method in which the individual classifiers are built on different weighted versions of the training data. To build a new classifier, the weight of each training instance is determined in terms…

Machine Learning · Computer Science 2018-02-22 Maryam Sabzevari, Gonzalo Martínez-Muñoz, Alberto Suárez

Quantum Boosting

Suppose we have a weak learning algorithm $\mathcal{A}$ for a Boolean-valued problem: $\mathcal{A}$ produces hypotheses whose bias $\gamma$ is small, only slightly better than random guessing (this could, for instance, be due to…

Quantum Physics · Physics 2020-08-18 Srinivasan Arunachalam, Reevu Maity

A Bayesian Boosting Model

We offer a novel view of AdaBoost in a statistical setting. We propose a Bayesian model for binary classification in which label noise is modeled hierarchically. Using variational inference to optimize a dynamic evidence lower bound, we…

Machine Learning · Statistics 2012-09-11 Alexander Lorbert, David M. Blei, Robert E. Schapire, Peter J. Ramadge

Towards Robustness to Label Noise in Text Classification via Noise Modeling

Large datasets in NLP suffer from noisy labels, due to erroneous automatic and human annotation procedures. We study the problem of text classification with label noise, and aim to capture this noise through an auxiliary noise model over…

Computation and Language · Computer Science 2022-06-22 Siddhant Garg, Goutham Ramakrishnan, Varun Thumbe

Robust Product Classification with Instance-Dependent Noise

Noisy labels in large E-commerce product data (i.e., product items are placed into incorrect categories) are a critical issue for product categorization task because they are unavoidable, non-trivial to remove and degrade prediction…

Computation and Language · Computer Science 2022-09-16 Huy Nguyen, Devashish Khatwani

When Analytic Calculus Cracks AdaBoost Code

The principle of boosting in supervised learning involves combining multiple weak classifiers to obtain a stronger classifier. AdaBoost has the reputation to be a perfect example of this approach. This study analyzes the (two classes)…

Machine Learning · Computer Science 2024-02-08 Jean-Marc Brossier, Olivier Lafitte, Lenny Réthoré

Communication Efficient Distributed Agnostic Boosting

We consider the problem of learning from distributed data in the agnostic setting, i.e., in the presence of arbitrary forms of noise. Our main contribution is a general distributed boosting-based procedure for learning an arbitrary concept…

Machine Learning · Computer Science 2016-11-21 Shang-Tse Chen, Maria-Florina Balcan, Duen Horng Chau

AdaBoost is not an Optimal Weak to Strong Learner

AdaBoost is a classic boosting algorithm for combining multiple inaccurate classifiers produced by a weak learner, to produce a strong learner with arbitrarily high accuracy when given enough training data. Determining the optimal number of…

Machine Learning · Computer Science 2025-08-12 Mikael Møller Høgsgaard, Kasper Green Larsen, Martin Ritzert

Training a Neural Network in a Low-Resource Setting on Automatically Annotated Noisy Data

Manually labeled corpora are expensive to create and often not available for low-resource languages or domains. Automatic labeling approaches are an alternative way to obtain labeled data in a quicker and cheaper way. However, these labels…

Machine Learning · Computer Science 2018-07-24 Michael A. Hedderich, Dietrich Klakow