Related papers: Adaptive XGBoost for Evolving Data Streams

A Boosting Approach to Constructing an Ensemble Stack

An approach to evolutionary ensemble learning for classification is proposed in which boosting is used to construct a stack of programs. Each application of boosting identifies a single champion and a residual dataset, i.e. the training…

Neural and Evolutionary Computing · Computer Science 2023-11-27 Zhilei Zhou , Ziyu Qiu , Brad Niblett , Andrew Johnston , Jeffrey Schwartzentruber , Nur Zincir-Heywood , Malcolm Heywood

A Comparative Analysis of XGBoost

XGBoost is a scalable ensemble technique based on gradient boosting that has demonstrated to be a reliable and efficient machine learning challenge solver. This work proposes a practical analysis of how this novel technique works in terms…

Machine Learning · Computer Science 2023-05-05 Candice Bentéjac , Anna Csörgő , Gonzalo Martínez-Muñoz

Gradient Boosting on Stochastic Data Streams

Boosting is a popular ensemble algorithm that generates more powerful learners by linearly combining base models from a simpler hypothesis class. In this work, we investigate the problem of adapting batch gradient boosting for minimizing…

Machine Learning · Computer Science 2017-03-02 Hanzhang Hu , Wen Sun , Arun Venkatraman , Martial Hebert , J. Andrew Bagnell

Dynamic Meta-Learning for Adaptive XGBoost-Neural Ensembles

This paper introduces a novel adaptive ensemble framework that synergistically combines XGBoost and neural networks through sophisticated meta-learning. The proposed method leverages advanced uncertainty quantification techniques and…

Machine Learning · Computer Science 2025-10-07 Arthur Sedek

XGenBoost: Synthesizing Small and Large Tabular Datasets with XGBoost

Tree ensembles such as XGBoost are often preferred for discriminative tasks in mixed-type tabular data, due to their inductive biases, minimal hyperparameter tuning, and training efficiency. We argue that these qualities, when leveraged…

Machine Learning · Computer Science 2026-03-10 Jim Achterberg , Marcel Haas , Bram van Dijk , Marco Spruit

Quantile Extreme Gradient Boosting for Uncertainty Quantification

As the availability, size and complexity of data have increased in recent years, machine learning (ML) techniques have become popular for modeling. Predictions resulting from applying ML models are often used for inference, decision-making,…

Machine Learning · Statistics 2023-04-25 Xiaozhe Yin , Masoud Fallah-Shorshani , Rob McConnell , Scott Fruin , Yao-Yi Chiang , Meredith Franklin

Accelerated Gradient Boosting

Gradient tree boosting is a prediction algorithm that sequentially produces a model in the form of linear combinations of decision trees, by solving an infinite-dimensional optimization problem. We combine gradient boosting and Nesterov's…

Machine Learning · Statistics 2018-03-07 Gérard Biau , Benoît Cadre , Laurent Rouvìère

XGBoost: A Scalable Tree Boosting System

Tree boosting is a highly effective and widely used machine learning method. In this paper, we describe a scalable end-to-end tree boosting system called XGBoost, which is used widely by data scientists to achieve state-of-the-art results…

Machine Learning · Computer Science 2016-06-14 Tianqi Chen , Carlos Guestrin

A Simple and Fast Baseline for Tuning Large XGBoost Models

XGBoost, a scalable tree boosting algorithm, has proven effective for many prediction tasks of practical interest, especially using tabular datasets. Hyperparameter tuning can further improve the predictive performance, but unlike neural…

Machine Learning · Computer Science 2021-11-16 Sanyam Kapoor , Valerio Perrone

An ensemble consists of a set of individually trained classifiers (such as neural networks or decision trees) whose predictions are combined when classifying novel instances. Previous research has shown that an ensemble is often more…

Artificial Intelligence · Computer Science 2011-06-02 R. Maclin , D. Opitz

Scaling Up Diffusion and Flow-based XGBoost Models

Novel machine learning methods for tabular data generation are often developed on small datasets which do not match the scale required for scientific applications. We investigate a recent proposal to use XGBoost as the function approximator…

Machine Learning · Computer Science 2024-08-30 Jesse C. Cresswell , Taewoo Kim

An Ensemble Classification Algorithm Based on Information Entropy for Data Streams

Data stream mining problem has caused widely concerns in the area of machine learning and data mining. In some recent studies, ensemble classification has been widely used in concept drift detection, however, most of them regard…

Data Structures and Algorithms · Computer Science 2017-08-14 Junhong Wang , Shuliang Xu , Bingqian Duan , Caifeng Liu , Jiye Liang

XGBoost: Scalable GPU Accelerated Learning

We describe the multi-GPU gradient boosting algorithm implemented in the XGBoost library (https://github.com/dmlc/xgboost). Our algorithm allows fast, scalable training on multi-GPU systems with all of the features of the XGBoost library.…

Machine Learning · Computer Science 2018-07-02 Rory Mitchell , Andrey Adinets , Thejaswi Rao , Eibe Frank

Progressive Boosting for Class Imbalance

Pattern recognition applications often suffer from skewed data distributions between classes, which may vary during operations w.r.t. the design data. Two-class classification systems designed using skewed data tend to recognize the…

Machine Learning · Computer Science 2019-12-02 Roghayeh Soleymani , Eric Granger , Giorgio Fumera

Dynamic Model Switching for Improved Accuracy in Machine Learning

In the dynamic landscape of machine learning, where datasets vary widely in size and complexity, selecting the most effective model poses a significant challenge. Rather than fixating on a single model, our research propels the field…

Machine Learning · Computer Science 2024-05-01 Syed Tahir Abbas Hasani

XGBOD: Improving Supervised Outlier Detection with Unsupervised Representation Learning

A new semi-supervised ensemble algorithm called XGBOD (Extreme Gradient Boosting Outlier Detection) is proposed, described and demonstrated for the enhanced detection of outliers from normal observations in various practical datasets. The…

Machine Learning · Computer Science 2020-09-22 Yue Zhao , Maciej K. Hryniewicki

BoostTree and BoostForest for Ensemble Learning

Bootstrap aggregating (Bagging) and boosting are two popular ensemble learning approaches, which combine multiple base learners to generate a composite model for more accurate and more reliable performance. They have been widely used in…

Machine Learning · Computer Science 2022-12-07 Changming Zhao , Dongrui Wu , Jian Huang , Ye Yuan , Hai-Tao Zhang , Ruimin Peng , Zhenhua Shi

Boosted Generative Models

We propose a novel approach for using unsupervised boosting to create an ensemble of generative models, where models are trained in sequence to correct earlier mistakes. Our meta-algorithmic framework can leverage any existing base learner…

Machine Learning · Computer Science 2017-12-25 Aditya Grover , Stefano Ermon

Combining Gradient Boosting Machines with Collective Inference to Predict Continuous Values

Gradient boosting of regression trees is a competitive procedure for learning predictive models of continuous data that fits the data with an additive non-parametric model. The classic version of gradient boosting assumes that the data is…

Machine Learning · Computer Science 2016-07-04 Iman Alodah , Jennifer Neville

Tree Boosting Methods for Balanced andImbalanced Classification and their Robustness Over Time in Risk Assessment

Most real-world classification problems deal with imbalanced datasets, posing a challenge for Artificial Intelligence (AI), i.e., machine learning algorithms, because the minority class, which is of extreme interest, often proves difficult…

Machine Learning · Computer Science 2025-04-28 Gissel Velarde , Michael Weichert , Anuj Deshmunkh , Sanjay Deshmane , Anindya Sudhir , Khushboo Sharma , Vaibhav Joshi