Related papers: On Computing Optimal Tree Ensembles

Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound

Computing an optimal classification tree that provably maximizes training performance within a given size limit, is NP-hard, and in practice, most state-of-the-art methods do not scale beyond computing optimal trees of depth three.…

Machine Learning · Computer Science 2025-01-15 Catalin E. Brita , Jacobus G. M. van der Linden , Emir Demirović

Optimal Generalized Decision Trees via Integer Programming

Decision trees have been a very popular class of predictive models for decades due to their interpretability and good performance on categorical features. However, they are not always robust and tend to overfit the data. Additionally, if…

Machine Learning · Computer Science 2019-08-14 Oktay Gunluk , Jayant Kalagnanam , Minhan Li , Matt Menickelly , Katya Scheinberg

Born-Again Tree Ensembles

The use of machine learning algorithms in finance, medicine, and criminal justice can deeply impact human lives. As a consequence, research into interpretable machine learning has rapidly grown in an attempt to better control and fix…

Machine Learning · Computer Science 2021-02-02 Thibaut Vidal , Toni Pacheco , Maximilian Schiffer

Large Random Forests: Optimisation for Rapid Evaluation

Random Forests are one of the most popular classifiers in machine learning. The larger they are, the more precise is the outcome of their predictions. However, this comes at a cost: their running time for classification grows linearly with…

Machine Learning · Computer Science 2019-12-24 Frederik Gossen , Bernhard Steffen

Optimal Decision Tree Pruning Revisited: Algorithms and Complexity

We present a comprehensive classical and parameterized complexity analysis of decision tree pruning operations, extending recent research on the complexity of learning small decision trees. Thereby, we offer new insights into the…

Machine Learning · Computer Science 2025-03-06 Juha Harviainen , Frank Sommer , Manuel Sorge , Stefan Szeider

Shrub Ensembles for Online Classification

Online learning algorithms have become a ubiquitous tool in the machine learning toolbox and are frequently used in small, resource-constraint environments. Among the most successful online learning methods are Decision Tree (DT) ensembles.…

Machine Learning · Computer Science 2021-12-08 Sebastian Buschjäger , Sibylle Hess , Katharina Morik

Optimal survival trees ensemble

Recent studies have adopted an approach of selecting accurate and diverse trees based on individual or collective performance within an ensemble for classification and regression problems. This work follows in the wake of these…

Applications · Statistics 2020-05-20 Naz Gul , Nosheen Faiz , Dan Brawn , Rafal Kulakowski , Zardad Khan , Berthold Lausen

The Influence of Dimensions on the Complexity of Computing Decision Trees

A decision tree recursively splits a feature space $\mathbb{R}^{d}$ and then assigns class labels based on the resulting partition. Decision trees have been part of the basic machine-learning toolkit for decades. A large body of work treats…

Computational Complexity · Computer Science 2022-06-03 Stephen G. Kobourov , Maarten Löffler , Fabrizio Montecchiani , Marcin Pilipczuk , Ignaz Rutter , Raimund Seidel , Manuel Sorge , Jules Wulms

Experiments with Optimal Model Trees

Model trees provide an appealing way to perform interpretable machine learning for both classification and regression problems. In contrast to ``classic'' decision trees with constant values in their leaves, model trees can use linear…

Machine Learning · Computer Science 2026-03-11 Sabino Francesco Roselli , Eibe Frank

MurTree: Optimal Classification Trees via Dynamic Programming and Search

Decision tree learning is a widely used approach in machine learning, favoured in applications that require concise and interpretable models. Heuristic methods are traditionally used to quickly produce models with reasonably high accuracy.…

Machine Learning · Computer Science 2022-06-30 Emir Demirović , Anna Lukina , Emmanuel Hebrard , Jeffrey Chan , James Bailey , Christopher Leckie , Kotagiri Ramamohanarao , Peter J. Stuckey

Efficient non-greedy optimization of decision trees

Decision trees and randomized forests are widely used in computer vision and machine learning. Standard algorithms for decision tree induction optimize the split functions one node at a time according to some splitting criteria. This greedy…

Machine Learning · Computer Science 2015-11-13 Mohammad Norouzi , Maxwell D. Collins , Matthew Johnson , David J. Fleet , Pushmeet Kohli

Near Optimal Decision Trees in a SPLIT Second

Decision tree optimization is fundamental to interpretable machine learning. The most popular approach is to greedily search for the best feature at every decision point, which is fast but provably suboptimal. Recent approaches find the…

Machine Learning · Computer Science 2025-11-19 Varun Babbar , Hayden McTavish , Cynthia Rudin , Margo Seltzer

Optimally Pruning Decision Tree Ensembles With Feature Cost

We consider the problem of learning decision rules for prediction with feature budget constraint. In particular, we are interested in pruning an ensemble of decision trees to reduce expected feature cost while maintaining high prediction…

Machine Learning · Statistics 2016-01-06 Feng Nan , Joseph Wang , Venkatesh Saligrama

Optimization of Tree Ensembles

Tree ensemble models such as random forests and boosted trees are among the most widely used and practically successful predictive models in applied machine learning and business analytics. Although such models have been used to make…

Optimization and Control · Mathematics 2019-10-11 Velibor V. Mišić

Dive into Decision Trees and Forests: A Theoretical Demonstration

Based on decision trees, many fields have arguably made tremendous progress in recent years. In simple words, decision trees use the strategy of "divide-and-conquer" to divide the complex problem on the dependency between input features and…

Machine Learning · Computer Science 2021-01-22 Jinxiong Zhang

The Max-Cut Decision Tree: Improving on the Accuracy and Running Time of Decision Trees

Decision trees are a widely used method for classification, both by themselves and as the building blocks of multiple different ensemble learning methods. The Max-Cut decision tree involves novel modifications to a standard, baseline model…

Machine Learning · Computer Science 2020-06-26 Jonathan Bodine , Dorit S. Hochbaum

Data Selection: A General Principle for Building Small Interpretable Models

We present convincing empirical evidence for an effective and general strategy for building accurate small models. Such models are attractive for interpretability and also find use in resource-constrained environments. The strategy is to…

Machine Learning · Computer Science 2024-04-30 Abhishek Ghose

Big Data Classification Using Augmented Decision Trees

We present an algorithm for classification tasks on big data. Experiments conducted as part of this study indicate that the algorithm can be as accurate as ensemble methods such as random forests or gradient boosted trees. Unlike ensemble…

Machine Learning · Statistics 2017-10-27 Rajiv Sambasivan , Sourish Das

Prediction Algorithms Achieving Bayesian Decision Theoretical Optimality Based on Decision Trees as Data Observation Processes

In the field of decision trees, most previous studies have difficulty ensuring the statistical optimality of a prediction of new data and suffer from overfitting because trees are usually used only to represent prediction functions to be…

Machine Learning · Computer Science 2023-06-13 Yuta Nakahara , Shota Saito , Naoki Ichijo , Koki Kazama , Toshiyasu Matsushima

ForestPrune: Compact Depth-Controlled Tree Ensembles

Tree ensembles are powerful models that achieve excellent predictive performances, but can grow to unwieldy sizes. These ensembles are often post-processed (pruned) to reduce memory footprint and improve interpretability. We present…

Machine Learning · Statistics 2023-05-26 Brian Liu , Rahul Mazumder