Related papers: Permutation Decision Trees

Regularized impurity reduction: Accurate decision trees with complexity guarantees

Decision trees are popular classification models, providing high accuracy and intuitive explanations. However, as the tree size grows the model interpretability deteriorates. Traditional tree-induction algorithms, such as C4.5 and CART,…

Machine Learning · Computer Science 2022-11-29 Guangyi Zhang, Aristides Gionis

A Hybrid Tsallis-Polarization Impurity Measure for Decision Trees: Theoretical Foundations and Empirical Evaluation

We introduce the Integrated Tsallis Combination (ITC), a hybrid impurity measure for decision tree learning that combines normalized Tsallis entropy with an exponential polarization component. While many existing measures sacrifice…

Machine Learning · Statistics 2026-03-17 Edouard Lansiaux, Idriss Jairi, Hayfa Zgaya-Biau

Learning accurate and interpretable tree-based models

Decision trees and their ensembles are popular in machine learning as easy-to-understand models. Several techniques have been proposed in the literature for learning tree-based classifiers, with different techniques working well for data…

Machine Learning · Computer Science 2025-05-20 Maria-Florina Balcan, Dravyansh Sharma

Decision Tree Embedding by Leaf-Means

Decision trees and random forest remain highly competitive for classification on medium-sized, standard datasets due to their robustness, minimal preprocessing requirements, and interpretability. However, a single tree suffers from high…

Machine Learning · Statistics 2025-12-02 Cencheng Shen, Yuexiao Dong, Carey E. Priebe

Profit Driven Decision Trees for Churn Prediction

Customer retention campaigns increasingly rely on predictive models to detect potential churners in a vast customer base. From the perspective of machine learning, the task of predicting customer churn can be presented as a binary…

Machine Learning · Statistics 2017-12-22 Sebastiaan Höppner, Eugen Stripling, Bart Baesens, Seppe vanden Broucke, Tim Verdonck

Analyzing CART

Decision trees with binary splits are popularly constructed using Classification and Regression Trees (CART) methodology. For binary classification and regression models, this approach recursively divides the data into two near-homogenous…

Machine Learning · Statistics 2020-08-17 Jason M. Klusowski

Improved Weighted Random Forest for Classification Problems

Several studies have shown that combining machine learning models in an appropriate way will introduce improvements in the individual predictions made by the base models. The key to make well-performing ensemble model is in the diversity of…

Machine Learning · Computer Science 2021-03-01 Mohsen Shahhosseini, Guiping Hu

ZTree: A Subgroup Identification Based Decision Tree Learning Framework

Decision trees are a commonly used class of machine learning models valued for their interpretability and versatility, capable of both classification and regression. We propose ZTree, a novel decision tree learning framework that replaces…

Machine Learning · Computer Science 2025-09-17 Eric Cheng, Jie Cheng

The permutation entropy rate equals the metric entropy rate for ergodic information sources and ergodic dynamical systems

Permutation entropy quantifies the diversity of possible orderings of the values a random or deterministic system can take, as Shannon entropy quantifies the diversity of values. We show that the metric and permutation entropy…

Chaotic Dynamics · Physics 2016-08-16 Jose M. Amigo, Matthew B. Kennel, Ljupco Kocarev

Can a Single Tree Outperform an Entire Forest?

The prevailing mindset is that a single decision tree underperforms classic random forests in testing accuracy, despite its advantages in interpretability and lightweight structure. This study challenges such a mindset by significantly…

Machine Learning · Computer Science 2024-11-27 Qiangqiang Mao, Yankai Cao

On Random Tree Structures, Their Entropy, and Compression

Measuring the complexity of tree structures can be beneficial in areas that use tree data structures for storage, communication, and processing purposes. This complexity can then be used to compress tree data structures to their…

Information Theory · Computer Science 2023-09-19 Amirmohammad Farzaneh, Mihai-Alin Badiu, Justin P. Coon

Rectified Decision Trees: Towards Interpretability, Compression and Empirical Soundness

How to obtain a model with good interpretability and performance has always been an important research topic. In this paper, we propose rectified decision trees (ReDT), a knowledge distillation based decision trees rectification with high…

Machine Learning · Computer Science 2020-08-25 Jiawang Bai, Yiming Li, Jiawei Li, Yong Jiang, Shutao Xia

Determinism, Complexity, and Predictability in Computer Performance

Computers are deterministic dynamical systems (CHAOS 19:033124, 2009). Among other things, that implies that one should be able to use deterministic forecast rules to predict their behavior. That statement is sometimes, but not always, true.…

Chaotic Dynamics · Physics 2013-05-24 Joshua Garland, Ryan James, Elizabeth Bradley

Classification Trees with Valid Inference via the Exponential Mechanism

Decision trees are widely used for non-linear modeling, as they capture interactions between predictors while producing inherently interpretable models. Despite their popularity, performing inference on the non-linear fit remains largely…

Methodology · Statistics 2026-04-14 Soham Bakshi, Snigdha Panigrahi

Discovering Closed and Maximal Embedded Patterns from Large Tree Data

We address the problem of summarizing embedded tree patterns extracted from large data trees. We do so by defining and mining closed and maximal embedded unordered tree patterns from a single large data tree. We design an embedded frequent…

Databases · Computer Science 2022-01-11 Xiaoying Wu, Dimitri Theodoratos, Nikos Mamoulis

Tree in Tree: from Decision Trees to Decision Graphs

Decision trees have been widely used as classifiers in many machine learning applications thanks to their lightweight and interpretable decision process. This paper introduces Tree in Tree decision graph (TnT), a framework that extends the…

Machine Learning · Computer Science 2021-11-01 Bingzhao Zhu, Mahsa Shoaran

Ents: An Efficient Three-party Training Framework for Decision Trees by Communication Optimization

Multi-party training frameworks for decision trees based on secure multi-party computation enable multiple parties to train high-performance models on distributed private data with privacy preservation. The training process essentially…

Cryptography and Security · Computer Science 2024-07-04 Guopeng Lin, Weili Han, Wenqiang Ruan, Ruisheng Zhou, Lushan Song, Bingshuai Li, Yunfeng Shao

Fine-Tuning Pretrained Language Models: Weight Initializations, Data Orders, and Early Stopping

Fine-tuning pretrained contextual word embedding models to supervised downstream tasks has become commonplace in natural language processing. This process, however, is often brittle: even with the same hyperparameter values, distinct random…

Computation and Language · Computer Science 2020-02-19 Jesse Dodge, Gabriel Ilharco, Roy Schwartz, Ali Farhadi, Hannaneh Hajishirzi, Noah Smith

TREE: Tree Regularization for Efficient Execution

The rise of machine learning methods on heavily resource constrained devices requires not only the choice of a suitable model architecture for the target platform, but also the optimization of the chosen model with regard to execution time…

Machine Learning · Computer Science 2024-06-19 Lena Schmid, Daniel Biebert, Christian Hakert, Kuan-Hsun Chen, Michel Lang, Markus Pauly, Jian-Jia Chen

A Probabilistic Framework for Mutation Testing in Deep Neural Networks

Context: Mutation Testing (MT) is an important tool in traditional Software Engineering (SE) white-box testing. It aims to artificially inject faults in a system to evaluate a test suite's capability to detect them, assuming that the test…

Software Engineering · Computer Science 2023-01-16 Florian Tambon, Foutse Khomh, Giuliano Antoniol