Related papers: Multi-Scale Vector Quantization with Reconstructio…

Random projection trees for vector quantization

A simple and computationally efficient scheme for tree-structured vector quantization is presented. Unlike previous methods, its quantization error depends only on the intrinsic dimension of the data distribution, rather than the apparent…

Machine Learning · Statistics 2008-05-12 Sanjoy Dasgupta , Yoav Freund

Regularized Vector Quantization for Tokenized Image Synthesis

Quantizing images into discrete representations has been a fundamental problem in unified generative modeling. Predominant approaches learn the discrete representation either in a deterministic manner by selecting the best-matching token or…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Jiahui Zhang , Fangneng Zhan , Christian Theobalt , Shijian Lu

Exploring data subsets with vtree

Variable trees are a new method for the exploration of discrete multivariate data. They display nested subsets and corresponding frequencies and percentages. Manual calculation of these quantities can be laborious, especially when there are…

Computation · Statistics 2021-02-08 Nick Barrowman , Richard J. Webster

Random Planted Forest: a directly interpretable tree ensemble

We introduce a novel interpretable tree based algorithm for prediction in a regression setting. Our motivation is to estimate the unknown regression function from a functional decomposition perspective in which the functional components…

Machine Learning · Statistics 2023-08-04 Munir Hiabu , Enno Mammen , Joseph T. Meyer

Transformation Forests

Regression models for supervised learning problems with a continuous target are commonly understood as models for the conditional mean of the target given predictors. This notion is simple and therefore appealing for interpretation and…

Methodology · Statistics 2018-01-09 Torsten Hothorn , Achim Zeileis

Scalable $k$-d trees for distributed data

Data structures known as $k$-d trees have numerous applications in scientific computing, particularly in areas of modern statistics and data science such as range search in decision trees, clustering, nearest neighbors search, local…

Data Structures and Algorithms · Computer Science 2022-01-21 Aritra Chakravorty , William S. Cleveland , Patrick J. Wolfe

Des-q: a quantum algorithm to provably speedup retraining of decision trees

Decision trees are widely adopted machine learning models due to their simplicity and explainability. However, as training data size grows, standard methods become increasingly slow, scaling polynomially with the number of training…

Quantum Physics · Physics 2025-01-23 Niraj Kumar , Romina Yalovetzky , Changhao Li , Pierre Minssen , Marco Pistoia

Hierarchical Uncertainty Exploration via Feedforward Posterior Trees

When solving ill-posed inverse problems, one often desires to explore the space of potential solutions rather than be presented with a single plausible reconstruction. Valuable insights into these feasible solutions and their associated…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Elias Nehme , Rotem Mulayoff , Tomer Michaeli

Tree-Structured Random Vector Quantization for Limited-Feedback Wireless Channels

We consider the quantization of a transmit beamforming vector in multiantenna channels and of a signature vector in code division multiple access (CDMA) systems. Assuming perfect channel knowledge, the receiver selects for a transmitter the…

Information Theory · Computer Science 2015-10-16 Wiroonsak Santipach , Kritsada Mamat

Quantifying Sources of Uncertainty in Deep Learning-Based Image Reconstruction

Image reconstruction methods based on deep neural networks have shown outstanding performance, equalling or exceeding the state-of-the-art results of conventional approaches, but often do not provide uncertainty information about the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-01 Riccardo Barbano , Željko Kereta , Chen Zhang , Andreas Hauptmann , Simon Arridge , Bangti Jin

Uncertain Trees: Dealing with Uncertain Inputs in Regression Trees

Tree-based ensemble methods, as Random Forests and Gradient Boosted Trees, have been successfully used for regression in many applications and research studies. Furthermore, these methods have been extended in order to deal with uncertainty…

Machine Learning · Computer Science 2018-11-20 Myriam Tami , Marianne Clausel , Emilie Devijver , Adrien Dulac , Eric Gaussier , Stefan Janaqi , Meriam Chebre

Regularized Classification-Aware Quantization

Traditionally, quantization is designed to minimize the reconstruction error of a data source. When considering downstream classification tasks, other measures of distortion can be of interest; such as the 0-1 classification loss.…

Machine Learning · Computer Science 2021-07-22 Daniel Severo , Elad Domanovitz , Ashish Khisti

Decision trees as partitioning machines to characterize their generalization properties

Decision trees are popular machine learning models that are simple to build and easy to interpret. Even though algorithms to learn decision trees date back to almost 50 years, key properties affecting their generalization error are still…

Machine Learning · Computer Science 2020-10-16 Jean-Samuel Leboeuf , Frédéric LeBlanc , Mario Marchand

Quantifying probabilistic robustness of tree-based classifiers against natural distortions

The concept of trustworthy AI has gained widespread attention lately. One of the aspects relevant to trustworthy AI is robustness of ML models. In this study, we show how to probabilistically quantify robustness against naturally occurring…

Machine Learning · Computer Science 2022-11-30 Christoph Schweimer , Sebastian Scher

Randomized Incremental Construction of Net-Trees

Net-trees are a general purpose data structure for metric data that have been used to solve a wide range of algorithmic problems. We give a simple randomized algorithm to construct net-trees on doubling metrics using $O(n\log n)$ time in…

Computational Geometry · Computer Science 2018-09-06 Mahmoodreza Jahanseir , Donald R. Sheehy

Quantification and visualization of variation in anatomical trees

This paper presents two approaches to quantifying and visualizing variation in datasets of trees. The first approach localizes subtrees in which significant population differences are found through hypothesis testing and sparse classifiers…

Applications · Statistics 2014-10-10 Nina Amenta , Manasi Datar , Asger Dirksen , Marleen de Bruijne , Aasa Feragen , Xiaoyin Ge , Jesper Holst Pedersen , Marylesa Howard , Megan Owen , Jens Petersen , Jie Shi , Qiuping Xu

Measure Inducing Classification and Regression Trees for Functional Data

We propose a tree-based algorithm for classification and regression problems in the context of functional data analysis, which allows to leverage representation learning and multiple splitting rules at the node level, reducing…

Machine Learning · Statistics 2020-11-03 Edoardo Belli , Simone Vantini

Cluster-Based Random Forest Visualization and Interpretation

Random forests are a machine learning method used to automatically classify datasets and consist of a multitude of decision trees. While these random forests often have higher performance and generalize better than a single decision tree,…

Machine Learning · Computer Science 2025-07-31 Max Sondag , Christofer Meinecke , Dennis Collaris , Tatiana von Landesberger , Stef van den Elzen

Data Structures for Finite Downsets of Natural Vectors: Theory and Practice

Manipulating downward-closed sets of vectors forms the basis of so-called antichain-based algorithms in verification. In that context, the dimension of the vectors is intimately tied to the size of the input structure to be verified. In…

Logic in Computer Science · Computer Science 2025-02-14 Michaël Cadilhac , Vanessa Flügel , Guillermo A. Pérez , Shrisha Rao

Achieving Reliable Causal Inference with Data-Mined Variables: A Random Forest Approach to the Measurement Error Problem

Combining machine learning with econometric analysis is becoming increasingly prevalent in both research and practice. A common empirical strategy involves the application of predictive modeling techniques to 'mine' variables of interest…

Econometrics · Economics 2020-12-22 Mochen Yang , Edward McFowland , Gordon Burtch , Gediminas Adomavicius