Related papers: Beyond Trees: Classification with Sparse Pairwise …

On multivariate randomized classification trees: $l_0$-based sparsity, VC dimension and decomposition methods

Decision trees are widely-used classification and regression models because of their interpretability and good accuracy. Classical methods such as CART are based on greedy approaches but a growing attention has recently been devoted to…

Machine Learning · Computer Science 2021-12-16 Edoardo Amaldi, Antonio Consolo, Andrea Manno

Sparse Density Trees and Lists: An Interpretable Alternative to High-Dimensional Histograms

We present sparse tree-based and list-based density estimation methods for binary/categorical data. Our density estimation models are higher dimensional analogies to variable bin width histograms. In each leaf of the tree (or list), the…

Machine Learning · Statistics 2023-11-16 Siong Thye Goh, Lesia Semenova, Cynthia Rudin

Classification algorithms using adaptive partitioning

Algorithms for binary classification based on adaptive tree partitioning are formulated and analyzed for both their risk performance and their friendliness to numerical implementation. The algorithms can be viewed as generating a set…

Statistics Theory · Mathematics 2014-11-05 Peter Binev, Albert Cohen, Wolfgang Dahmen, Ronald DeVore

Probabilistic Scores of Classifiers, Calibration is not Enough

In binary classification tasks, accurate representation of probabilistic predictions is essential for various real-world applications such as predicting payment defaults or assessing medical risks. The model must then be well-calibrated to…

Machine Learning · Computer Science 2024-08-08 Agathe Fernandes Machado, Arthur Charpentier, Emmanuel Flachaire, Ewen Gallic, François Hu

Locally Optimized Random Forests

Standard supervised learning procedures are validated against a test set that is assumed to have come from the same distribution as the training data. However, in many problems, the test data may have come from a different distribution. We…

Machine Learning · Statistics 2019-08-28 Tim Coleman, Kimberly Kaufeld, Mary Frances Dorn, Lucas Mentch

Challenges learning from imbalanced data using tree-based models: Prevalence estimates systematically depend on hyperparameters and can be upwardly biased

When using machine learning for imbalanced binary classification problems, it is common to subsample the majority class to create a (more) balanced training dataset. This biases the model's predictions because the model learns from data…

Machine Learning · Computer Science 2025-11-03 Nathan Phelps, Daniel J. Lizotte, Douglas G. Woolford

Semi-supervised Predictive Clustering Trees for (Hierarchical) Multi-label Classification

Semi-supervised learning (SSL) is a common approach to learning predictive models using not only labeled examples, but also unlabeled examples. While SSL for the simple tasks of classification and regression has received a lot of attention…

Machine Learning · Computer Science 2024-04-02 Jurica Levatić, Michelangelo Ceci, Dragi Kocev, Sašo Džeroski

Mixed-Integer Linear Optimization for Semi-Supervised Optimal Classification Trees

Decision trees are one of the most popular methods for solving classification problems, mainly because of their good interpretability properties. Moreover, due to advances in recent years in mixed-integer optimization, several models have…

Optimization and Control · Mathematics 2026-05-29 Jan Pablo Burgard, Maria Eduarda Pinheiro, Martin Schmidt

Mixed-Integer Linear Optimization for Cardinality-Constrained Random Forests

Random forests are among the most famous algorithms for solving classification problems, in particular for large-scale data sets. Considering a set of labeled points and several decision trees, the method takes the majority vote to classify…

Optimization and Control · Mathematics 2025-01-24 Jan Pablo Burgard, Maria Eduarda Pinheiro, Martin Schmidt

Treelets--An adaptive multi-scale basis for sparse unordered data

In many modern applications, including analysis of gene expression and text documents, the data are noisy, high-dimensional, and unordered--with no particular meaning to the given order of the variables. Yet, successful learning is often…

Methodology · Statistics 2008-07-25 Ann B. Lee, Boaz Nadler, Larry Wasserman

Classification Trees for Imbalanced and Sparse Data: Surface-to-Volume Regularization

Classification algorithms face difficulties when one or more classes have limited training data. We are particularly interested in classification trees, due to their interpretability and flexibility. When data are limited in one or more of…

Methodology · Statistics 2021-06-15 Yichen Zhu, Cheng Li, David B. Dunson

Multiclass Optimal Classification Trees with SVM-splits

In this paper we present a novel mathematical optimization-based methodology to construct tree-shaped classification rules for multiclass instances. Our approach consists of building Classification Trees in which, except for the leaf nodes,…

Optimization and Control · Mathematics 2021-11-17 Víctor Blanco, Alberto Japón, Justo Puerto

Exploring Correlation between Labels to improve Multi-Label Classification

This paper attempts multi-label classification by extending the idea of independent binary classification models for each output label, and exploring how the inherent correlation between output labels can be used to improve predictions.…

Machine Learning · Computer Science 2015-11-26 Amit Garg, Jonathan Noyola, Romil Verma, Ashutosh Saxena, Aditya Jami

Interpolating Conditional Density Trees

Joint distributions over many variables are frequently modeled by decomposing them into products of simpler, lower-dimensional conditional distributions, such as in sparsely connected Bayesian networks. However, automatically learning such…

Machine Learning · Computer Science 2013-01-07 Scott Davies, Andrew Moore

Preference Analysis Using Random Spanning Trees: A Stochastic Sampling Approach to Inconsistent Pairwise Comparisons

Eliciting preferences from human judgements is inherently imprecise, yet most decision analysis methods force a single priority vector from pairwise comparisons, discarding the information embedded in inconsistencies. We instead leverage…

General Economics · Economics 2026-02-27 Salvatore Greco, Sajid Siraj, Michele Lundy

Sort by Structure: Language Model Ranking as Dependency Probing

Making an informed choice of pre-trained language model (LM) is critical for performance, yet environmentally costly, and as such widely underexplored. The field of Computer Vision has begun to tackle encoder ranking, with promising forays…

Computation and Language · Computer Science 2022-06-13 Max Müller-Eberstein, Rob van der Goot, Barbara Plank

Unbiased and Efficient Sampling of Dependency Trees

Most computational models of dependency syntax consist of distributions over spanning trees. However, the majority of dependency treebanks require that every valid dependency tree has a single edge coming out of the ROOT node, a constraint…

Computation and Language · Computer Science 2022-11-29 Miloš Stanojević

Sparse Variable Selection on High Dimensional Heterogeneous Data with Tree Structured Responses

We consider the problem of sparse variable selection on high dimension heterogeneous data sets, which has been taking on renewed interest recently due to the growth of biological and medical data sets with complex, non-i.i.d. structures and…

Methodology · Statistics 2024-04-22 Hui Liu, Xiang Liu, Jing Diao, Wenting Ye, Xueling Liu, Dehui Wei

Learning Discriminative Multilevel Structured Dictionaries for Supervised Image Classification

Sparse representations using overcomplete dictionaries have proved to be a powerful tool in many signal processing applications such as denoising, super-resolution, inpainting, compression or classification. The sparsity of the…

Machine Learning · Statistics 2018-03-01 Jeremy Aghaei Mazaheri, Elif Vural, Claude Labit, Christine Guillemot

Bayesian Pairwise Estimation Under Dependent Informative Sampling

An informative sampling design leads to the selection of units whose inclusion probabilities are correlated with the response variable of interest. Model inference performed on the resulting observed sample will be biased for the population…

Methodology · Statistics 2018-06-29 Matthew R. Williams, Terrance D. Savitsky