Related papers: Superfast Selection for Decision Tree Algorithms

Optimal hypersurface decision trees

The study of optimal decision trees has gained increasing attention in recent years; however, despite substantial progress, it still suffers from two major challenges: First, trees constructed by existing optimal decision tree (ODT)…

Machine Learning · Computer Science 2026-05-04 Xi He

A Novel Splitting Criterion Inspired by Geometric Mean Metric Learning for Decision Tree

Decision tree (DT) attracts persistent research attention due to its impressive empirical performance and interpretability in numerous applications. However, the growth of traditional yet widely-used univariate decision trees (UDTs) is…

Machine Learning · Computer Science 2022-06-22 Dan Li, Songcan Chen

Approximating splits for decision trees quickly in sparse data streams

Decision trees are one of the most popular classifiers in the machine learning literature. While the most common decision tree learning algorithms treat data as a batch, numerous algorithms have been proposed to construct decision trees…

Machine Learning · Computer Science 2026-01-21 Nikolaj Tatti

An Eager Splitting Strategy for Online Decision Trees

Decision tree ensembles are widely used in practice. In this work, we study in ensemble settings the effectiveness of replacing the split strategy for the state-of-the-art online tree learner, Hoeffding Tree, with a rigorous but more eager…

Machine Learning · Computer Science 2021-08-03 Chaitanya Manapragada, Heitor M Gomes, Mahsa Salehi, Albert Bifet, Geoffrey I Webb

HyperFast: Instant Classification for Tabular Data

Training deep learning models and performing hyperparameter tuning can be computationally demanding and time-consuming. Meanwhile, traditional machine learning methods like gradient-boosting algorithms remain the preferred choice for most…

Machine Learning · Computer Science 2024-02-23 David Bonet, Daniel Mas Montserrat, Xavier Giró-i-Nieto, Alexander G. Ioannidis

DFDT: Dynamic Fast Decision Tree for IoT Data Stream Mining on Edge Devices

The Internet of Things generates massive data streams, with edge computing emerging as a key enabler for online IoT applications and 5G networks. Edge solutions facilitate real-time machine learning inference, but also require continuous…

Machine Learning · Computer Science 2025-12-09 Afonso Lourenço, João Rodrigo, João Gama, Goreti Marreiros

Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound

Computing an optimal classification tree that provably maximizes training performance within a given size limit is NP-hard, and in practice most state-of-the-art methods do not scale beyond computing optimal trees of depth three.…

Machine Learning · Computer Science 2025-01-15 Catalin E. Brita, Jacobus G. M. van der Linden, Emir Demirović

The Max-Cut Decision Tree: Improving on the Accuracy and Running Time of Decision Trees

Decision trees are a widely used method for classification, both by themselves and as the building blocks of multiple different ensemble learning methods. The Max-Cut decision tree involves novel modifications to a standard, baseline model…

Machine Learning · Computer Science 2020-06-26 Jonathan Bodine, Dorit S. Hochbaum

ZTree: A Subgroup Identification Based Decision Tree Learning Framework

Decision trees are a commonly used class of machine learning models valued for their interpretability and versatility, capable of both classification and regression. We propose ZTree, a novel decision tree learning framework that replaces…

Machine Learning · Computer Science 2025-09-17 Eric Cheng, Jie Cheng

Strict Very Fast Decision Tree: a memory conservative algorithm for data stream mining

Dealing with memory and time constraints is a current challenge when learning from data streams with a massive amount of data. Many algorithms have been proposed to handle these difficulties, among them the Very Fast Decision Tree (VFDT)…

Artificial Intelligence · Computer Science 2018-05-18 Victor Guilherme Turrisi da Costa, André Carlos Ponce de Leon Ferreira de Carvalho, Sylvio Barbon Junior

Online Learning of Decision Trees with Thompson Sampling

Decision Trees are prominent prediction models for interpretable Machine Learning. They have been thoroughly researched, mostly in the batch setting with a fixed labelled dataset, leading to popular algorithms such as C4.5, ID3 and CART.…

Machine Learning · Computer Science 2024-06-24 Ayman Chaouki, Jesse Read, Albert Bifet

Near Optimal Decision Trees in a SPLIT Second

Decision tree optimization is fundamental to interpretable machine learning. The most popular approach is to greedily search for the best feature at every decision point, which is fast but provably suboptimal. Recent approaches find the…

Machine Learning · Computer Science 2025-11-19 Varun Babbar, Hayden McTavish, Cynthia Rudin, Margo Seltzer

Optimal Decision Diagrams for Classification

Decision diagrams for classification have some notable advantages over decision trees, as their internal connections can be determined at training time and their width is not bound to grow exponentially with their depth. Accordingly,…

Machine Learning · Computer Science 2022-05-31 Alexandre M. Florio, Pedro Martins, Maximilian Schiffer, Thiago Serra, Thibaut Vidal

Utility-Diversity Aware Online Batch Selection for LLM Supervised Fine-tuning

Supervised fine-tuning (SFT) is a commonly used technique to adapt large language models (LLMs) to downstream tasks. In practice, SFT on a full dataset is computationally expensive and sometimes suffers from overfitting or bias…

Machine Learning · Computer Science 2026-02-03 Heming Zou, Yixiu Mao, Yun Qu, Qi Wang, Xiangyang Ji

A Novel Hyperparameter-free Approach to Decision Tree Construction that Avoids Overfitting by Design

Decision trees are an extremely popular machine learning technique. Unfortunately, overfitting in decision trees still remains an open issue that sometimes prevents achieving good performance. In this work, we present a novel approach for…

Machine Learning · Computer Science 2019-06-05 Rafael Garcia Leiva, Antonio Fernandez Anta, Vincenzo Mancuso, Paolo Casari

A Simple Approximation Algorithm for Optimal Decision Tree

Optimal decision tree (ODT) is a fundamental problem arising in applications such as active learning, entity identification, and medical diagnosis. An instance of ODT is given by m hypotheses, out of which an unknown "true" hypothesis…

Data Structures and Algorithms · Computer Science 2025-05-22 Zhengjia Zhuo, Viswanath Nagarajan

Branches: Efficiently Seeking Optimal Sparse Decision Trees with AO*

Decision Tree (DT) Learning is a fundamental problem in Interpretable Machine Learning, yet it poses a formidable optimisation challenge. Practical algorithms have recently emerged, primarily leveraging Dynamic Programming and Branch &…

Machine Learning · Computer Science 2025-05-13 Ayman Chaouki, Jesse Read, Albert Bifet

Anytime Optimal Decision Tree Learning with Continuous Features

In recent years, significant progress has been made on algorithms for learning optimal decision trees, primarily in the context of binary features. Extending these methods to continuous features remains substantially more challenging due to…

Machine Learning · Computer Science 2026-01-22 Harold Kiossou, Pierre Schaus, Siegfried Nijssen

Exact Distributed Training: Random Forest with Billions of Examples

We introduce an exact distributed algorithm to train Random Forest models as well as other decision forest models without relying on approximating best split search. We explain the proposed algorithm and compare it to related approaches for…

Machine Learning · Computer Science 2018-04-19 Mathieu Guillame-Bert, Olivier Teytaud

Breiman meets Bellman: Non-Greedy Decision Trees with MDPs

In supervised learning, decision trees are valued for their interpretability and performance. While greedy decision tree algorithms like CART remain widely used due to their computational efficiency, they often produce sub-optimal solutions…

Machine Learning · Computer Science 2025-06-03 Hector Kohler, Riad Akrour, Philippe Preux