Related papers: batchboost: regularization for stabilizing trainin…

Why Mixup Improves the Model Performance

Machine learning techniques are used in a wide range of domains. However, machine learning models often suffer from the problem of over-fitting. Many data augmentation methods have been proposed to tackle such a problem, and one of them is…

Machine Learning · Statistics 2021-06-21 Masanari Kimura

Split-Boost Neural Networks

The calibration and training of a neural network is a complex and time-consuming procedure that requires significant computational resources to achieve satisfactory results. Key obstacles are a large number of hyperparameters to select and…

Machine Learning · Computer Science 2023-09-07 Raffaele Giuseppe Cestari , Gabriele Maroni , Loris Cannelli , Dario Piga , Simone Formentin

MixBoost: Synthetic Oversampling with Boosted Mixup for Handling Extreme Imbalance

Training a classification model on a dataset where the instances of one class outnumber those of the other class is a challenging problem. Such imbalanced datasets are standard in real-world situations such as fraud detection, medical…

Machine Learning · Computer Science 2020-09-04 Anubha Kabra , Ayush Chopra , Nikaash Puri , Pinkesh Badjatiya , Sukriti Verma , Piyush Gupta , Balaji K

Adaptive Batch Normalization for Training Data with Heterogeneous Features

Batch Normalization (BN) is an important preprocessing step to many deep learning applications. Since it is a data-dependent process, for some homogeneous datasets it is a redundant or even a performance-degrading process. In this paper, we…

Machine Learning · Computer Science 2022-12-01 Wael Alsobhi , Tarik Alafif , Alaa Abdel-Hakim , Weiwei Zong

PatternBoost: Constructions in Mathematics with a Little Help from AI

We introduce PatternBoost, a flexible method for finding interesting constructions in mathematics. Our algorithm alternates between two phases. In the first ``local'' phase, a classical search algorithm is used to produce many desirable…

Combinatorics · Mathematics 2024-11-04 François Charton , Jordan S. Ellenberg , Adam Zsolt Wagner , Geordie Williamson

How Does Mixup Help With Robustness and Generalization?

Mixup is a popular data augmentation technique based on taking convex combinations of pairs of examples and their labels. This simple technique has been shown to substantially improve both the robustness and the generalization of the…

Machine Learning · Computer Science 2021-03-19 Linjun Zhang , Zhun Deng , Kenji Kawaguchi , Amirata Ghorbani , James Zou

The Many Faces of Optimal Weak-to-Strong Learning

Boosting is an extremely successful idea, allowing one to combine multiple low accuracy classifiers into a much more accurate voting classifier. In this work, we present a new and surprisingly simple Boosting algorithm that obtains a…

Machine Learning · Computer Science 2024-09-02 Mikael Møller Høgsgaard , Kasper Green Larsen , Markus Engelund Mathiasen

When and How Mixup Improves Calibration

In many machine learning applications, it is important for the model to provide confidence scores that accurately capture its prediction uncertainty. Although modern learning methods have achieved great success in predictive accuracy,…

Machine Learning · Computer Science 2022-07-12 Linjun Zhang , Zhun Deng , Kenji Kawaguchi , James Zou

Multicalibration Boosting: Theory, Convergence, and Transferability

Multicalibration extends classical calibration by requiring predictions to be unbiased over a rich collection of functions, encompassing both prediction slices and subpopulations. It has emerged as a powerful framework for fairness,…

Machine Learning · Statistics 2026-05-26 Hanxuan Ye , Hongzhe Li

ProBoost: a Boosting Method for Probabilistic Classifiers

ProBoost, a new boosting algorithm for probabilistic classifiers, is proposed in this work. This algorithm uses the epistemic uncertainty of each training sample to determine the most challenging/uncertain ones; the relevance of these…

Machine Learning · Computer Science 2025-08-05 Fábio Mendonça , Sheikh Shanawaz Mostafa , Fernando Morgado-Dias , Antonio G. Ravelo-García , Mário A. T. Figueiredo

Re-scale boosting for regression and classification

Boosting is a learning scheme that combines weak prediction rules to produce a strong composite estimator, with the underlying intuition that one can obtain accurate prediction rules by combining "rough" ones. Although boosting is proved to…

Machine Learning · Computer Science 2015-05-07 Shaobo Lin , Yao Wang , Lin Xu

On Mixup Regularization

Mixup is a data augmentation technique that creates new examples as convex combinations of training points and labels. This simple technique has empirically shown to improve the accuracy of many state-of-the-art models in different settings…

Machine Learning · Computer Science 2026-05-28 Luigi Carratino , Moustapha Cissé , Rodolphe Jenatton , Jean-Philippe Vert

MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

Class imbalance problem has been a challenging research problem in the fields of machine learning and data mining as most real life datasets are imbalanced. Several existing machine learning algorithms try to maximize the accuracy…

Machine Learning · Computer Science 2018-06-21 Farshid Rayhan , Sajid Ahmed , Asif Mahbub , Md. Rafsan Jani , Swakkhar Shatabda , Dewan Md. Farid , Chowdhury Mofizur Rahman

MetaMix: Improved Meta-Learning with Interpolation-based Consistency Regularization

Model-Agnostic Meta-Learning (MAML) and its variants are popular few-shot classification methods. They train an initializer across a variety of sampled learning tasks (also known as episodes) such that the initialized model can adapt…

Computer Vision and Pattern Recognition · Computer Science 2020-10-13 Yangbin Chen , Yun Ma , Tom Ko , Jianping Wang , Qing Li

Hybrid Dual-Batch and Cyclic Progressive Learning for Efficient Distributed Training

Distributed machine learning is critical for training deep learning models on large datasets with numerous parameters. Current research primarily focuses on leveraging additional hardware resources and powerful computing units to accelerate…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-03 Kuan-Wei Lu , Ding-Yong Hong , Pangfeng Liu , Jan-Jan Wu

RandomBoost: Simplified Multi-class Boosting through Randomization

We propose a novel boosting approach to multi-class classification problems, in which multiple classes are distinguished by a set of random projection matrices in essence. The approach uses random projections to alleviate the proliferation…

Machine Learning · Computer Science 2013-02-06 Sakrapee Paisitkriangkrai , Chunhua Shen , Qinfeng Shi , Anton van den Hengel

A Boosting Approach to Constructing an Ensemble Stack

An approach to evolutionary ensemble learning for classification is proposed in which boosting is used to construct a stack of programs. Each application of boosting identifies a single champion and a residual dataset, i.e. the training…

Neural and Evolutionary Computing · Computer Science 2023-11-27 Zhilei Zhou , Ziyu Qiu , Brad Niblett , Andrew Johnston , Jeffrey Schwartzentruber , Nur Zincir-Heywood , Malcolm Heywood

Meta-Learning Requires Meta-Augmentation

Meta-learning algorithms aim to learn two components: a model that predicts targets for a task, and a base learner that quickly updates that model when given examples from a new task. This additional level of learning can be powerful, but…

Machine Learning · Computer Science 2020-11-05 Janarthanan Rajendran , Alex Irpan , Eric Jang

Progressive Boosting for Class Imbalance

Pattern recognition applications often suffer from skewed data distributions between classes, which may vary during operations w.r.t. the design data. Two-class classification systems designed using skewed data tend to recognize the…

Machine Learning · Computer Science 2019-12-02 Roghayeh Soleymani , Eric Granger , Giorgio Fumera

Balanced-MixUp for Highly Imbalanced Medical Image Classification

Highly imbalanced datasets are ubiquitous in medical image classification problems. In such problems, it is often the case that rare classes associated to less prevalent diseases are severely under-represented in labeled databases,…

Computer Vision and Pattern Recognition · Computer Science 2022-09-15 Adrian Galdran , Gustavo Carneiro , Miguel A. González Ballester