Related papers

Related papers: StructureBoost: Efficient Gradient Boosting for St…

200 papers

Boosting is a method for learning a single accurate predictor by linearly combining a set of less accurate weak learners. Recently, structured learning has found many applications in computer vision. Inspired by structured support vector…

Machine Learning · Computer Science 2020-03-10 Chunhua Shen, Guosheng Lin, Anton van den Hengel

Gradient Boosted Decision Tree (GBDT) is a widely-used machine learning algorithm that has been shown to achieve state-of-the-art results on many standard data science problems. We are interested in its application to multioutput problems…

Machine Learning · Computer Science 2022-11-24 Leonid Iosipoi, Anton Vakhrushev

In this paper we present CatBoost, a new open-sourced gradient boosting library that successfully handles categorical features and outperforms existing publicly available implementations of gradient boosting in terms of quality on a set of…

Machine Learning · Computer Science 2018-10-29 Anna Veronika Dorogush, Vasily Ershov, Andrey Gulin

Technology and collaboration enable dramatic increases in the size of psychological and psychiatric data collections, but finding structure in these large data sets with many collected variables is challenging. Decision tree ensembles like…

Machine Learning · Statistics 2017-02-15 Patrick J. Miller, Gitta H. Lubke, Daniel B. McArtor, C. S. Bergeman

Gradient boosted decision trees are a popular machine learning technique, in part because of their ability to give good accuracy with small models. We describe two extensions to the standard tree boosting algorithm designed to increase this…

Machine Learning · Statistics 2017-11-01 Natalia Ponomareva, Thomas Colthurst, Gilbert Hendry, Salem Haykal, Soroush Radpour

This paper presents the key algorithmic techniques behind CatBoost, a new gradient boosting toolkit. Their combination leads to CatBoost outperforming other publicly available boosting implementations in terms of quality on a variety of…

Machine Learning · Computer Science 2019-01-23 Liudmila Prokhorenkova, Gleb Gusev, Aleksandr Vorobev, Anna Veronika Dorogush, Andrey Gulin

Gradient Boosted Decision Trees (GBDT) is a very successful ensemble learning algorithm widely used across a variety of applications. Recently, several variants of GBDT training algorithms and implementations have been designed and heavily…

Machine Learning · Computer Science 2019-06-27 Yu Shi, Jian Li, Zhize Li

Standard methods of using categorical variables as predictors either endow them with an ordinal structure or assume they have no structure at all. However, categorical variables often possess structure that is more complicated than a linear…

Machine Learning · Statistics 2020-04-17 Brian Lucena

agtboost is an R package implementing fast gradient tree boosting computations in a manner similar to other established frameworks such as xgboost and LightGBM, but with significant decreases in computation time and required mathematical…

Machine Learning · Statistics 2020-08-31 Berent Ånund Strømnes Lunde, Tore Selland Kleppe

Additive models, such as produced by gradient boosting, and full interaction models, such as classification and regression trees (CART), are widely used algorithms that have been investigated largely in isolation. We show that these models…

Gradient tree boosting is a prediction algorithm that sequentially produces a model in the form of linear combinations of decision trees, by solving an infinite-dimensional optimization problem. We combine gradient boosting and Nesterov's…

Machine Learning · Statistics 2018-03-07 Gérard Biau, Benoît Cadre, Laurent Rouvière

We propose PathBoost, a gradient tree boosting method for graph-level classification and regression that learns discriminative path-based features directly from the input graph structure. Building on a previous work, which was tailored to a…

Machine Learning · Computer Science 2026-05-12 Claudio Meggio, Johan Pensar, Riccardo De Bin

Uplift modeling comprises a collection of machine learning techniques designed for managers to predict the incremental impact of specific actions on customer outcomes. However, accurately estimating this incremental impact poses significant…

Machine Learning · Computer Science 2025-02-10 Junjie Gao, Xiangyu Zheng, DongDong Wang, Zhixiang Huang, Bangqi Zheng, Kai Yang

Despite the rise to dominance of deep learning in unstructured data domains, tree-based methods such as Random Forests (RF) and Gradient Boosted Decision Trees (GBDT) are still the workhorses for handling discriminative tasks on tabular…

Machine Learning · Computer Science 2025-04-21 João Bravo

Gradient boosting decision tree (GBDT) is an ensemble machine learning algorithm, which is widely used in industry, due to its good performance and easy interpretation. Due to the problem of data isolation and the requirement of privacy,…

Machine Learning · Computer Science 2024-06-21 Tao Fan, Weijing Chen, Guoqiang Ma, Yan Kang, Lixin Fan, Qiang Yang

Gradient boosting, a method of building additive ensembles from weak learners, has established itself as a practical and theoretically-motivated approach to approximate functions, especially using decision tree weak learners. Comparable…

Machine Learning · Computer Science 2026-03-26 Abhijit Chowdhary, Elizabeth Newman, Deepanshu Verma

Gradient boosting is a state-of-the-art prediction technique that sequentially produces a model in the form of linear combinations of simple predictors (typically decision trees) by solving an infinite-dimensional convex optimization…

Statistics Theory · Mathematics 2017-07-18 Gérard Biau, Benoît Cadre

Boosting Trees are one of the most successful statistical learning approaches that involve sequentially growing an ensemble of simple regression trees (i.e., "weak learners"). However, gradient boosted trees are not yet available for…

Boosted decision trees are a very powerful machine learning technique. After introducing specific concepts of machine learning in the high-energy physics context and describing ways to quantify the performance and training quality of…

Data Analysis, Statistics and Probability · Physics 2022-06-22 Yann Coadou

Real world datasets contain incorrectly labeled instances that hamper the performance of the model and, in particular, the ability to generalize out of distribution. Also, each example might have different contribution towards learning.…
