Related papers: Trees-Based Models for Correlated Data

FREEtree: A Tree-based Approach for High Dimensional Longitudinal Data With Correlated Features

This paper proposes FREEtree, a tree-based method for high dimensional longitudinal data with correlated features. Popular machine learning approaches, like Random Forests, commonly used for variable selection do not perform well when there…

Machine Learning · Statistics 2020-06-18 Yuancheng Xu , Athanasse Zafirov , R. Michael Alvarez , Dan Kojis , Min Tan , Christina M. Ramirez

Tree-Structured Boosting: Connections Between Gradient Boosted Stumps and Full Decision Trees

Additive models, such as produced by gradient boosting, and full interaction models, such as classification and regression trees (CART), are widely used algorithms that have been investigated largely in isolation. We show that these models…

Machine Learning · Statistics 2017-11-21 José Marcio Luna , Eric Eaton , Lyle H. Ungar , Eric Diffenderfer , Shane T. Jensen , Efstathios D. Gennatas , Mateo Wirth , Charles B. Simone , Timothy D. Solberg , Gilmer Valdes

Probabilistic Modeling Using Tree Linear Cascades

We introduce tree linear cascades, a class of linear structural equation models for which the error variables are uncorrelated but need not be Gaussian nor independent. We show that, in spite of this weak assumption, the tree structure of…

Methodology · Statistics 2022-02-16 Nicholas C. Landolfi , Sanjay Lall

Analyze Additive and Interaction Effects via Collaborative Trees

We present Collaborative Trees, a novel tree model designed for regression prediction, along with its bagging version, which aims to analyze complex statistical associations between features and uncover potential patterns inherent in the…

Methodology · Statistics 2024-05-21 Chien-Ming Chi

Energy Trees: Regression and Classification With Structured and Mixed-Type Covariates

The increasing complexity of data requires methods and models that can effectively handle intricate structures, as simplifying them would result in loss of information. While several analytical tools have been developed to work with complex…

Methodology · Statistics 2023-06-16 Riccardo Giubilei , Tullia Padellini , Pierpaolo Brutti

Uncertain Trees: Dealing with Uncertain Inputs in Regression Trees

Tree-based ensemble methods, as Random Forests and Gradient Boosted Trees, have been successfully used for regression in many applications and research studies. Furthermore, these methods have been extended in order to deal with uncertainty…

Machine Learning · Computer Science 2018-11-20 Myriam Tami , Marianne Clausel , Emilie Devijver , Adrien Dulac , Eric Gaussier , Stefan Janaqi , Meriam Chebre

Contrast Trees and Distribution Boosting

Often machine learning methods are applied and results reported in cases where there is little to no information concerning accuracy of the output. Simply because a computer program returns a result does not insure its validity. If…

Machine Learning · Statistics 2022-05-25 Jerome H. Friedman

Classification and regression tree methods for incomplete data from sample surveys

Analysis of sample survey data often requires adjustments to account for missing data in the outcome variables of principal interest. Standard adjustment methods based on item imputation or on propensity weighting factors rely heavily on…

Methodology · Statistics 2016-03-08 Wei-Yin Loh , John Eltinge , MoonJung Cho , Yuanzhi Li

Dive into Decision Trees and Forests: A Theoretical Demonstration

Based on decision trees, many fields have arguably made tremendous progress in recent years. In simple words, decision trees use the strategy of "divide-and-conquer" to divide the complex problem on the dependency between input features and…

Machine Learning · Computer Science 2021-01-22 Jinxiong Zhang

Evaluation of Tree Based Regression over Multiple Linear Regression for Non-normally Distributed Data in Battery Performance

Battery performance datasets are typically non-normal and multicollinear. Extrapolating such datasets for model predictions needs attention to such characteristics. This study explores the impact of data normality in building machine…

Machine Learning · Computer Science 2021-11-05 Shovan Chowdhury , Yuxiao Lin , Boryann Liaw , Leslie Kerby

A cautionary tale on fitting decision trees to data from additive models: generalization lower bounds

Decision trees are important both as interpretable models amenable to high-stakes decision-making, and as building blocks of ensemble methods such as random forests and gradient boosting. Their statistical properties, however, are not well…

Machine Learning · Statistics 2021-10-20 Yan Shuo Tan , Abhineet Agarwal , Bin Yu

Regularized regression on compositional trees with application to MRI analysis

A compositional tree refers to a tree structure on a set of random variables where each random variable is a node and composition occurs at each non-leaf node of the tree. As a generalization of compositional data, compositional trees…

Methodology · Statistics 2021-04-20 Bingkai Wang , Brian S. Caffo , Xi Luo , Chin-Fu Liu , Andreia V. Faria , Michael I. Miller , Yi Zhao

Tree-Structured Modelling of Categorical Predictors in Regression

Generalized linear and additive models are very efficient regression tools but the selection of relevant terms becomes difficult if higher order interactions are needed. In contrast, tree-based methods also known as recursive partitioning…

Methodology · Statistics 2015-04-21 Gerhard Tutz , Moritz Berger

Tree Boosted Varying Coefficient Models

This paper investigates the integration of gradient boosted decision trees and varying coefficient models. We introduce the tree boosted varying coefficient framework which justifies the implementation of decision tree boosting as the…

Methodology · Statistics 2019-04-03 Yichen Zhou , Giles Hooker

Forecasting with Hyper-Trees

We introduce Hyper-Trees as a novel framework for modeling time series data using gradient boosted trees. Unlike conventional tree-based approaches that forecast time series directly, Hyper-Trees learn the parameters of a target time series…

Machine Learning · Computer Science 2026-02-09 Alexander März , Kashif Rasul

Flexible tree-structured regression for clustered data with an application to quality of life in older adults

Tree-structured models are a powerful alternative to parametric regression models if non-linear effects and interactions are present in the data. Yet, classical tree-structured models might not be appropriate if data comes in clusters of…

Methodology · Statistics 2025-01-23 Nikolai Spuck , Matthias Schmid , Moritz Berger

Dynamic Trees for Learning and Design

Dynamic regression trees are an attractive option for automatic regression and classification with complicated response surfaces in on-line application settings. We create a sequential tree model whose state changes in time with the…

Methodology · Statistics 2010-11-23 Matthew A. Taddy , Robert B. Gramacy , Nicholas G. Polson

Multivariate Boosted Trees and Applications to Forecasting and Control

Gradient boosted trees are competition-winning, general-purpose, non-parametric regressors, which exploit sequential model fitting and gradient descent to minimize a specific loss function. The most popular implementations are tailored to…

Machine Learning · Computer Science 2022-08-23 Lorenzo Nespoli , Vasco Medici

Growing Regression Forests by Classification: Applications to Object Pose Estimation

In this work, we propose a novel node splitting method for regression trees and incorporate it into the regression forest framework. Unlike traditional binary splitting, where the splitting rule is selected from a predefined set of binary…

Computer Vision and Pattern Recognition · Computer Science 2014-07-16 Kota Hara , Rama Chellappa

Distributional Adaptive Soft Regression Trees

Random forests are an ensemble method relevant for many problems, such as regression or classification. They are popular due to their good predictive performance (compared to, e.g., decision trees) requiring only minimal tuning of…

Methodology · Statistics 2022-10-20 Nikolaus Umlauf , Nadja Klein