Related papers: Sparsity in Optimal Randomized Classification Tree…

Generalized and Scalable Optimal Sparse Decision Trees

Decision tree optimization is notoriously difficult from a computational perspective but essential for the field of interpretable machine learning. Despite efforts over the past 40 years, only recently have optimization breakthroughs been…

Machine Learning · Computer Science 2022-11-24 Jimmy Lin , Chudi Zhong , Diane Hu , Cynthia Rudin , Margo Seltzer

On multivariate randomized classification trees: $l_0$-based sparsity, VC~dimension and decomposition methods

Decision trees are widely-used classification and regression models because of their interpretability and good accuracy. Classical methods such as CART are based on greedy approaches but a growing attention has recently been devoted to…

Machine Learning · Computer Science 2021-12-16 Edoardo Amaldi , Antonio Consolo , Andrea Manno

Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design

Sparse decision trees are one of the most common forms of interpretable models. While recent advances have produced algorithms that fully optimize sparse decision trees for prediction, that work does not address policy design, because the…

Machine Learning · Computer Science 2022-10-27 Ali Behrouz , Mathias Lecuyer , Cynthia Rudin , Margo Seltzer

Efficient non-greedy optimization of decision trees

Decision trees and randomized forests are widely used in computer vision and machine learning. Standard algorithms for decision tree induction optimize the split functions one node at a time according to some splitting criteria. This greedy…

Machine Learning · Computer Science 2015-11-13 Mohammad Norouzi , Maxwell D. Collins , Matthew Johnson , David J. Fleet , Pushmeet Kohli

Optimal randomized classification trees

Classification and Regression Trees (CARTs) are off-the-shelf techniques in modern Statistics and Machine Learning. CARTs are traditionally built by means of a greedy procedure, sequentially deciding the splitting predictor variable(s) and…

Machine Learning · Statistics 2021-10-25 Rafael Blanquero , Emilio Carrizosa , Cristina Molero-Río , Dolores Romero Morales

Near Optimal Decision Trees in a SPLIT Second

Decision tree optimization is fundamental to interpretable machine learning. The most popular approach is to greedily search for the best feature at every decision point, which is fast but provably suboptimal. Recent approaches find the…

Machine Learning · Computer Science 2025-11-19 Varun Babbar , Hayden McTavish , Cynthia Rudin , Margo Seltzer

Convergence Rates of Oblique Regression Trees for Flexible Function Libraries

We develop a theoretical framework for the analysis of oblique decision trees, where the splits at each decision node occur at linear combinations of the covariates (as opposed to conventional tree constructions that force axis-aligned…

Statistics Theory · Mathematics 2023-09-01 Matias D. Cattaneo , Rajita Chandak , Jason M. Klusowski

Experiments with Optimal Model Trees

Model trees provide an appealing way to perform interpretable machine learning for both classification and regression problems. In contrast to ``classic'' decision trees with constant values in their leaves, model trees can use linear…

Machine Learning · Computer Science 2026-03-11 Sabino Francesco Roselli , Eibe Frank

Optimal Sparse Decision Trees

Decision tree algorithms have been among the most popular algorithms for interpretable (transparent) machine learning since the early 1980's. The problem that has plagued decision tree algorithms since their inception is their lack of…

Machine Learning · Computer Science 2023-09-28 Xiyang Hu , Cynthia Rudin , Margo Seltzer

MurTree: Optimal Classification Trees via Dynamic Programming and Search

Decision tree learning is a widely used approach in machine learning, favoured in applications that require concise and interpretable models. Heuristic methods are traditionally used to quickly produce models with reasonably high accuracy.…

Machine Learning · Computer Science 2022-06-30 Emir Demirović , Anna Lukina , Emmanuel Hebrard , Jeffrey Chan , James Bailey , Christopher Leckie , Kotagiri Ramamohanarao , Peter J. Stuckey

Optimal Sparse Regression Trees

Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees,…

Machine Learning · Computer Science 2023-04-11 Rui Zhang , Rui Xin , Margo Seltzer , Cynthia Rudin

Fast Sparse Decision Tree Optimization via Reference Ensembles

Sparse decision tree optimization has been one of the most fundamental problems in AI since its inception and is a challenge at the core of interpretable machine learning. Sparse decision tree optimization is computationally hard, and…

Machine Learning · Computer Science 2022-07-07 Hayden McTavish , Chudi Zhong , Reto Achermann , Ilias Karimalis , Jacques Chen , Cynthia Rudin , Margo Seltzer

Optimal Sparse Survival Trees

Interpretability is crucial for doctors, hospitals, pharmaceutical companies and biotechnology corporations to analyze and make decisions for high stakes problems that involve human health. Tree-based methods have been widely adopted for…

Machine Learning · Computer Science 2024-05-24 Rui Zhang , Rui Xin , Margo Seltzer , Cynthia Rudin

Sparse learning with CART

Decision trees with binary splits are popularly constructed using Classification and Regression Trees (CART) methodology. For regression models, this approach recursively divides the data into two near-homogenous daughter nodes according to…

Machine Learning · Statistics 2020-11-20 Jason M. Klusowski

A cautionary tale on fitting decision trees to data from additive models: generalization lower bounds

Decision trees are important both as interpretable models amenable to high-stakes decision-making, and as building blocks of ensemble methods such as random forests and gradient boosting. Their statistical properties, however, are not well…

Machine Learning · Statistics 2021-10-20 Yan Shuo Tan , Abhineet Agarwal , Bin Yu

Exploiting random projections and sparsity with random forests and gradient boosting methods -- Application to multi-label and multi-output learning, random forest model compression and leveraging input sparsity

Within machine learning, the supervised learning field aims at modeling the input-output relationship of a system, from past observations of its behavior. Decision trees characterize the input-output relationship through a series of nested…

Machine Learning · Statistics 2019-05-20 Arnaud Joly

Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound

Computing an optimal classification tree that provably maximizes training performance within a given size limit, is NP-hard, and in practice, most state-of-the-art methods do not scale beyond computing optimal trees of depth three.…

Machine Learning · Computer Science 2025-01-15 Catalin E. Brita , Jacobus G. M. van der Linden , Emir Demirović

Bayesian Regression Tree Ensembles that Adapt to Smoothness and Sparsity

Ensembles of decision trees are a useful tool for obtaining for obtaining flexible estimates of regression functions. Examples of these methods include gradient boosted decision trees, random forests, and Bayesian CART. Two potential…

Methodology · Statistics 2018-09-18 Antonio Ricardo Linero , Yun Yang

Efficient Tree Generation for Globally Optimal Decisions under Probabilistic Outcomes

Many real-world problems require making sequences of decisions where the outcomes of each decision are probabilistic and uncertain, and the availability of different actions is constrained by the outcomes of previous actions. There is a…

Optimization and Control · Mathematics 2025-04-28 Berk Ozturk , She'ifa Punla-Green , Les Servi

Approximating splits for decision trees quickly in sparse data streams

Decision trees are one of the most popular classifiers in the machine learning literature. While the most common decision tree learning algorithms treat data as a batch, numerous algorithms have been proposed to construct decision trees…

Machine Learning · Computer Science 2026-01-21 Nikolaj Tatti