Related papers: MIBoost: A gradient boosting algorithm for variabl…

Multiple Imputation Through XGBoost

The use of multiple imputation (MI) is becoming increasingly popular for addressing missing data. Although some conventional MI approaches have been well studied and have shown empirical validity, they have limitations when processing large…

Methodology · Statistics 2023-07-31 Yongshi Deng , Thomas Lumley

Unified Robust Boosting

Boosting is a popular algorithm in supervised machine learning with wide applications in regression and classification problems. It combines weak learners, such as regression trees, to obtain accurate predictions. However, in the presence…

Computation · Statistics 2025-02-06 Zhu Wang

Boosting with Structural Sparsity: A Differential Inclusion Approach

Boosting as gradient descent algorithms is one popular method in machine learning. In this paper a novel Boosting-type algorithm is proposed based on restricted gradient descent with structural sparsity control whose underlying dynamics are…

Machine Learning · Statistics 2017-04-18 Chendi Huang , Xinwei Sun , Jiechao Xiong , Yuan Yao

Gradient Boosting for Linear Mixed Models

Gradient boosting from the field of statistical learning is widely known as a powerful framework for estimation and selection of predictor effects in various regression models by adapting concepts from classification theory. Current…

Methodology · Statistics 2020-11-03 Colin Griesbach , Benjamin Säfken , Elisabeth Waldmann

Bayesian MI-LASSO for Variable Selection on Multiply-Imputed Data

Multiple imputation is widely used for handling missing data in real-world applications. For variable selection on multiply-imputed datasets, however, if selection is performed on each imputed dataset separately, it can result in different…

Methodology · Statistics 2025-08-07 Jungang Zou , Sijian Wang , Qixuan Chen

StructBoost: Boosting Methods for Predicting Structured Output Variables

Boosting is a method for learning a single accurate predictor by linearly combining a set of less accurate weak learners. Recently, structured learning has found many applications in computer vision. Inspired by structured support vector…

Machine Learning · Computer Science 2020-03-10 Chunhua Shen , Guosheng Lin , Anton van den Hengel

Robust Boosting for Regression Problems

Gradient boosting algorithms construct a regression predictor using a linear combination of ``base learners''. Boosting also offers an approach to obtaining robust non-parametric regression estimators that are scalable to applications with…

Methodology · Statistics 2020-08-11 Xiaomeng Ju , Matías Salibián-Barrera

A comparison of strategies for selecting auxiliary variables for multiple imputation

Multiple imputation (MI) is a popular method for handling missing data. Auxiliary variables can be added to the imputation model(s) to improve MI estimates. However, the choice of which auxiliary variables to include in the imputation model…

Methodology · Statistics 2022-04-01 Rheanna M. Mainzer , Cattram D. Nguyen , John B. Carlin , Margarita Moreno-Betancur , Ian R. White , Katherine J. Lee

Variable selection with multiply-imputed datasets: choosing between stacked and grouped methods

Penalized regression methods, such as lasso and elastic net, are used in many biomedical applications when simultaneous regression coefficient estimation and variable selection is desired. However, missing data complicates the…

Methodology · Statistics 2020-03-18 Jiacong Du , Jonathan Boss , Peisong Han , Lauren J Beesley , Stephen A Goutman , Stuart Batterman , Eva L Feldman , Bhramar Mukherjee

Optimization by gradient boosting

Gradient boosting is a state-of-the-art prediction technique that sequentially produces a model in the form of linear combinations of simple predictors---typically decision trees---by solving an infinite-dimensional convex optimization…

Statistics Theory · Mathematics 2017-07-18 Gérard Biau , Benoît Cadre

uBoost: A boosting method for producing uniform selection efficiencies from multivariate classifiers

The use of multivariate classifiers, especially neural networks and decision trees, has become commonplace in particle physics. Typically, a series of classifiers is trained rather than just one to enhance the performance; this is known as…

Nuclear Experiment · Physics 2015-06-16 Justin Stevens , Mike Williams

The Many Faces of Optimal Weak-to-Strong Learning

Boosting is an extremely successful idea, allowing one to combine multiple low accuracy classifiers into a much more accurate voting classifier. In this work, we present a new and surprisingly simple Boosting algorithm that obtains a…

Machine Learning · Computer Science 2024-09-02 Mikael Møller Høgsgaard , Kasper Green Larsen , Markus Engelund Mathiasen

Who wins the Miss Contest for Imputation Methods? Our Vote for Miss BooPF

Missing data is an expected issue when large amounts of data is collected, and several imputation techniques have been proposed to tackle this problem. Beneath classical approaches such as MICE, the application of Machine Learning…

Machine Learning · Statistics 2017-12-01 Burim Ramosaj , Markus Pauly

MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

Class imbalance problem has been a challenging research problem in the fields of machine learning and data mining as most real life datasets are imbalanced. Several existing machine learning algorithms try to maximize the accuracy…

Machine Learning · Computer Science 2018-06-21 Farshid Rayhan , Sajid Ahmed , Asif Mahbub , Md. Rafsan Jani , Swakkhar Shatabda , Dewan Md. Farid , Chowdhury Mofizur Rahman

ProBoost: a Boosting Method for Probabilistic Classifiers

ProBoost, a new boosting algorithm for probabilistic classifiers, is proposed in this work. This algorithm uses the epistemic uncertainty of each training sample to determine the most challenging/uncertain ones; the relevance of these…

Machine Learning · Computer Science 2025-08-05 Fábio Mendonça , Sheikh Shanawaz Mostafa , Fernando Morgado-Dias , Antonio G. Ravelo-García , Mário A. T. Figueiredo

Bayesian Boosting for Linear Mixed Models

Boosting methods are widely used in statistical learning to deal with high-dimensional data due to their variable selection feature. However, those methods lack straightforward ways to construct estimators for the precision of the…

Methodology · Statistics 2021-06-10 Boyao Zhang , Colin Griesbach , Cora Kim , Nadia Müller-Voggel , Elisabeth Bergherr

Addressing cluster-constant covariates in mixed effects models via likelihood-based boosting techniques

Boosting techniques from the field of statistical learning have grown to be a popular tool for estimating and selecting predictor effects in various regression models and can roughly be separated in two general approaches, namely gradient…

Methodology · Statistics 2019-12-16 Colin Griesbach , Andreas Groll , Elisabeth Waldmann

RandomBoost: Simplified Multi-class Boosting through Randomization

We propose a novel boosting approach to multi-class classification problems, in which multiple classes are distinguished by a set of random projection matrices in essence. The approach uses random projections to alleviate the proliferation…

Machine Learning · Computer Science 2013-02-06 Sakrapee Paisitkriangkrai , Chunhua Shen , Qinfeng Shi , Anton van den Hengel

Proximal boosting: aggregating weak learners to minimize non-differentiable losses

Gradient boosting is a prediction method that iteratively combines weak learners to produce a complex and accurate model. From an optimization point of view, the learning procedure of gradient boosting mimics a gradient descent on a…

Machine Learning · Computer Science 2022-11-30 Erwan Fouillen , Claire Boyer , Maxime Sangnier

Gradient tree boosting with random output projections for multi-label classification and multi-output regression

In many applications of supervised learning, multiple classification or regression outputs have to be predicted jointly. We consider several extensions of gradient boosting to address such problems. We first propose a straightforward…

Machine Learning · Statistics 2019-05-21 Arnaud Joly , Louis Wehenkel , Pierre Geurts