Related papers: A tree-based varying coefficient model

Tree Boosted Varying Coefficient Models

This paper investigates the integration of gradient boosted decision trees and varying coefficient models. We introduce the tree boosted varying coefficient framework which justifies the implementation of decision tree boosting as the…

Methodology · Statistics 2019-04-03 Yichen Zhou , Giles Hooker

Feature Importance in Gradient Boosting Trees with Cross-Validation Feature Selection

Gradient Boosting Machines (GBM) are among the go-to algorithms on tabular data, which produce state of the art results in many prediction tasks. Despite its popularity, the GBM framework suffers from a fundamental flaw in its base…

Machine Learning · Computer Science 2021-09-14 Afek Ilay Adler , Amichai Painsky

Variational Boosted Soft Trees

Gradient boosting machines (GBMs) based on decision trees consistently demonstrate state-of-the-art results on regression and classification tasks with tabular data, often outperforming deep neural networks. However, these models do not…

Machine Learning · Computer Science 2023-02-23 Tristan Cinquin , Tammo Rukat , Philipp Schmidt , Martin Wistuba , Artur Bekasov

Tree-Structured Modelling of Varying Coefficients

The varying-coefficient model is a strong tool for the modelling of interactions in generalized regression. It is easy to apply if both the variables that are modified as well as the effect modifiers are known. However, in general one has a…

Methodology · Statistics 2017-05-25 Moritz Berger , Gerhard Tutz , Matthias Schmid

Randomized Gradient Boosting Machine

Gradient Boosting Machine (GBM) introduced by Friedman is a powerful supervised learning algorithm that is very widely used in practice---it routinely features as a leading algorithm in machine learning competitions such as Kaggle and the…

Machine Learning · Computer Science 2020-09-17 Haihao Lu , Rahul Mazumder

Probabilistic Gradient Boosting Machines for Large-Scale Probabilistic Regression

Gradient Boosting Machines (GBM) are hugely popular for solving tabular data problems. However, practitioners are not only interested in point predictions, but also in probabilistic predictions in order to quantify the uncertainty of the…

Machine Learning · Computer Science 2021-06-08 Olivier Sprangers , Sebastian Schelter , Maarten de Rijke

Gradient boosting machine with partially randomized decision trees

The gradient boosting machine is a powerful ensemble-based machine learning method for solving regression problems. However, one of the difficulties of its using is a possible discontinuity of the regression function, which arises when…

Machine Learning · Computer Science 2020-06-22 Andrei V. Konstantinov , Lev V. Utkin

VCBART: Bayesian trees for varying coefficients

The linear varying coefficient models posits a linear relationship between an outcome and covariates in which the covariate effects are modeled as functions of additional effect modifiers. Despite a long history of study and use in…

Methodology · Statistics 2024-09-26 Sameer K. Deshpande , Ray Bai , Cecilia Balocchi , Jennifer E. Starling , Jordan Weiss

Generalized Tree-Informed Mixed Model Regression

The standard regression tree method applied to observations within clusters poses both methodological and implementation challenges. Effectively leveraging these data requires methods that account for both individual-level and sample-level…

Methodology · Statistics 2025-03-05 Jeremiah Allis , Xin Jin , Riddhi Ghosh

Distributed Bayesian Varying Coefficient Modeling Using a Gaussian Process Prior

Varying coefficient models (VCMs) are widely used for estimating nonlinear regression functions for functional data. Their Bayesian variants using Gaussian process priors on the functional coefficients, however, have received limited…

Methodology · Statistics 2022-03-01 Rajarshi Guhaniyogi , Cheng Li , Terrance D. Savitsky , Sanvesh Srivastava

Soft Gradient Boosting Machine

Gradient Boosting Machine has proven to be one successful function approximator and has been widely used in a variety of areas. However, since the training procedure of each base learner has to take the sequential order, it is infeasible to…

Machine Learning · Computer Science 2020-06-09 Ji Feng , Yi-Xuan Xu , Yuan Jiang , Zhi-Hua Zhou

Reliable and scalable variable importance estimation via warm-start and early stopping

As opaque black-box predictive models become more prevalent, the need to develop interpretations for these models is of great interest. The concept of variable importance and Shapley values are interpretability measures that applies to any…

Machine Learning · Statistics 2025-03-10 Zexuan Sun , Garvesh Raskutti

From Point to probabilistic gradient boosting for claim frequency and severity prediction

Gradient boosting for decision tree algorithms are increasingly used in actuarial applications as they show superior predictive performance over traditional generalised linear models. Many enhancements to the first gradient boosting machine…

Machine Learning · Statistics 2025-08-05 Dominik Chevalier , Marie-Pier Côté

Distributional Gradient Boosting Machines

We present a unified probabilistic gradient boosting framework for regression tasks that models and predicts the entire conditional distribution of a univariate response variable as a function of covariates. Our likelihood-based approach…

Machine Learning · Statistics 2022-04-05 Alexander März , Thomas Kneib

AGBoost: Attention-based Modification of Gradient Boosting Machine

A new attention-based model for the gradient boosting machine (GBM) called AGBoost (the attention-based gradient boosting) is proposed for solving regression problems. The main idea behind the proposed AGBoost model is to assign attention…

Machine Learning · Computer Science 2022-07-13 Andrei Konstantinov , Lev Utkin , Stanislav Kirpichenko

Scalable and Communication-Efficient Varying Coefficient Mixed Effect Models: Methodology, Theory, and Applications

Human migration exhibits complex spatiotemporal dependence driven by environmental and socioeconomic forces. Modeling such patterns at scale requires methods that accommodate many random effects while remaining feasible when raw data or…

Methodology · Statistics 2026-05-29 Lida Chalangar Jalili Dehkharghani , Li-Hsiang Lin

Gradient Boosted Mixed Models: Flexible Joint Estimation of Mean and Variance Components for Clustered Data

Linear mixed models are widely used for clustered data, but their reliance on parametric forms limits flexibility in complex and high-dimensional settings. In contrast, gradient boosting methods achieve high predictive accuracy through…

Machine Learning · Statistics 2025-11-04 Mitchell L. Prevett , Francis K. C. Hui , Zhi Yang Tho , A. H. Welsh , Anton H. Westveld

The Bayesian Regularized Quantile Varying Coefficient Model

The quantile varying coefficient (VC) model can flexibly capture dynamical patterns of regression coefficients. In addition, due to the quantile check loss function, it is robust against outliers and heavy-tailed distributions of the…

Methodology · Statistics 2023-07-11 Fei Zhou , Jie Ren , Shuangge Ma , Cen Wu

Accelerated Componentwise Gradient Boosting using Efficient Data Representation and Momentum-based Optimization

Componentwise boosting (CWB), also known as model-based boosting, is a variant of gradient boosting that builds on additive models as base learners to ensure interpretability. CWB is thus often used in research areas where models are…

Computation · Statistics 2021-11-01 Daniel Schalk , Bernd Bischl , David Rügamer

Statistical Inference for Explainable Boosting Machines

Explainable boosting machines (EBMs) are popular "glass-box" models that learn a set of univariate functions using boosting trees. These achieve explainability through visualizations of each feature's effect. However, unlike linear model…

Machine Learning · Statistics 2026-03-31 Haimo Fang , Kevin Tan , Jonathan Pipping-Gamon , Giles Hooker