Related papers: The Binary Space Partitioning-Tree Process

Binary Space Partitioning Forests

The Binary Space Partitioning (BSP)-Tree process is proposed to produce flexible 2-D partition structures which are originally used as a Bayesian nonparametric prior for relational modelling. It can hardly be applied to other learning tasks…

Machine Learning · Statistics 2019-03-25 Xuhui Fan, Bin Li, Scott Anthony Sisson

Random Tessellation Forests

Space partitioning methods such as random forests and the Mondrian process are powerful machine learning methods for multi-dimensional and relational data, and are based on recursively cutting a domain. The flexibility of these methods is…

Machine Learning · Statistics 2019-12-03 Shufei Ge, Shijia Wang, Yee Whye Teh, Liangliang Wang, Lloyd T. Elliott

Online Binary Space Partitioning Forests

The Binary Space Partitioning-Tree (BSP-Tree) process was recently proposed as an efficient strategy for space partitioning tasks. Because it uses more than one dimension to partition the space, the BSP-Tree Process is more efficient and…

Machine Learning · Statistics 2020-03-03 Xuhui Fan, Bin Li, Scott A. Sisson

Time Complexity Analysis of Binary Space Partitioning Scheme for Image Compression

Segmentation-based image coding methods provide high compression ratios when compared with traditional image coding approaches such as transform and subband coding for low bit-rate compression applications. In this paper, a…

Computer Vision and Pattern Recognition · Computer Science 2012-11-12 Rehna V. J., M. K. Jeyakumar

Shape Modeling with Spline Partitions

Shape modelling (with methods that output shapes) is a new and important task in Bayesian nonparametrics and bioinformatics. In this work, we focus on Bayesian nonparametric methods for capturing shapes by partitioning a space using curves.…

Machine Learning · Statistics 2022-11-08 Shufei Ge, Shijia Wang, Lloyd Elliott

The Mondrian Process for Machine Learning

This report is concerned with the Mondrian process and its applications in machine learning. The Mondrian process is a guillotine-partition-valued stochastic process that possesses an elegant self-consistency property. The first part of the…

Machine Learning · Statistics 2015-07-21 Matej Balog, Yee Whye Teh

Stochastic geometry to generalize the Mondrian Process

The stable under iterated tessellation (STIT) process is a stochastic process that produces a recursive partition of space with cut directions drawn independently from a distribution over the sphere. The case of random axis-aligned cuts is…

Machine Learning · Statistics 2021-09-15 Eliza O'Reilly, Ngoc Tran

Bayesian Nonparametric Space Partitions: A Survey

Bayesian nonparametric space partition (BNSP) models provide a variety of strategies for partitioning a $D$-dimensional space into a set of blocks. In this way, the data points that lie in the same block share certain kinds of homogeneity.…

Machine Learning · Statistics 2021-03-02 Xuhui Fan, Bin Li, Ling Luo, Scott A. Sisson

Rectangular Bounding Process

Stochastic partition models divide a multi-dimensional space into a number of rectangular regions, such that the data within each region exhibit certain types of homogeneity. Due to the nature of their partition strategy, existing partition…

Machine Learning · Statistics 2019-03-12 Xuhui Fan, Bin Li, Scott Anthony Sisson

Consistent Bayesian Spatial Domain Partitioning Using Predictive Spanning Tree Methods

Bayesian model-based spatial clustering methods are widely used for their flexibility in estimating latent clusters with an unknown number of clusters while accounting for spatial proximity. Many existing methods are designed for clustering…

Methodology · Statistics 2025-08-13 Kun Huang, Huiyan Sang

Partition Tree Weighting

This paper introduces the Partition Tree Weighting technique, an efficient meta-algorithm for piecewise stationary sources. The technique works by performing Bayesian model averaging over a large class of possible partitions of the data…

Information Theory · Computer Science 2012-11-22 Joel Veness, Martha White, Michael Bowling, András György

Parallel Approaches to Accelerate Bayesian Decision Trees

Markov Chain Monte Carlo (MCMC) is a well-established family of algorithms primarily used in Bayesian statistics to sample from a target distribution when direct sampling is challenging. Existing work on Bayesian decision trees uses MCMC.…

Computation · Statistics 2023-01-24 Efthyvoulos Drousiotis, Paul G. Spirakis, Simon Maskell

Nonuniform Dynamic Discretization in Hybrid Networks

We consider probabilistic inference in general hybrid networks, which include continuous and discrete variables in an arbitrary topology. We reexamine the question of variable discretization in a hybrid network aiming at minimizing the…

Artificial Intelligence · Computer Science 2013-02-08 Alexander V. Kozlov, Daphne Koller

Mondrian Forests: Efficient Online Random Forests

Ensembles of randomized decision trees, usually referred to as random forests, are widely used for classification and regression tasks in machine learning and statistics. Random forests achieve competitive predictive performance and are…

Machine Learning · Statistics 2015-02-17 Balaji Lakshminarayanan, Daniel M. Roy, Yee Whye Teh

TOAST: Fast and scalable auto-partitioning based on principled static analysis

Partitioning large machine learning models across distributed accelerator systems is a complex process, requiring a series of interdependent decisions that are further complicated by internal sharding ambiguities. Consequently, existing…

Machine Learning · Computer Science 2025-08-26 Sami Alabed, Dominik Grewe, Norman Alexander Rink, Masha Samsikova, Timur Sitdikov, Agnieszka Swietlik, Dimitrios Vytiniotis, Daniel Belov

Random trees between two walls: Exact partition function

We derive the exact partition function for a discrete model of random trees embedded in a one-dimensional space. These trees have vertices labeled by integers representing their position in the target space, with the SOS constraint that…

Statistical Mechanics · Physics 2007-05-23 J. Bouttier, P. Di Francesco, E. Guitter

Variational Bayesian Methods for a Tree-Structured Stick-Breaking Process Mixture of Gaussians by Application of the Bayes Codes for Context Tree Models

The tree-structured stick-breaking process (TS-SBP) mixture model is a non-parametric Bayesian model that can represent tree-like hierarchical structures among the mixture components. For TS-SBP mixture models, only a Markov chain Monte…

Machine Learning · Statistics 2024-09-12 Yuta Nakahara

Statistical Advantages of Oblique Randomized Decision Trees and Forests

This work studies the statistical implications of using features comprised of general linear combinations of covariates to partition the data in randomized decision tree and forest regression algorithms. Using random tessellation theory in…

Statistics Theory · Mathematics 2025-11-05 Eliza O'Reilly

Minimax optimal rates for Mondrian trees and forests

Introduced by Breiman, Random Forests are widely used classification and regression algorithms. While being initially designed as batch algorithms, several variants have been proposed to handle online learning. One particular instance of…

Machine Learning · Statistics 2019-04-10 Jaouad Mourtada, Stéphane Gaïffas, Erwan Scornet

Spectral Clustering, Bayesian Spanning Forest, and Forest Process

Spectral clustering views the similarity matrix as a weighted graph, and partitions the data by minimizing a graph-cut loss. Since it minimizes the across-cluster similarity, there is no need to model the distribution within each cluster.…

Methodology · Statistics 2023-04-14 Leo L. Duan, Arkaprava Roy