Related papers: The Mondrian Kernel

The Uniformly Rotated Mondrian Kernel

Random feature maps are used to decrease the computational cost of kernel machines in large-scale problems. The Mondrian kernel is one such example of a fast random feature approximation of the Laplace kernel, generated by a computationally…

Machine Learning · Computer Science 2025-03-13 Calvin Osborne , Eliza O'Reilly

The Mondrian Process for Machine Learning

This report is concerned with the Mondrian process and its applications in machine learning. The Mondrian process is a guillotine-partition-valued stochastic process that possesses an elegant self-consistency property. The first part of the…

Machine Learning · Statistics 2015-07-21 Matej Balog , Yee Whye Teh

Mondrian Forests: Efficient Online Random Forests

Ensembles of randomized decision trees, usually referred to as random forests, are widely used for classification and regression tasks in machine learning and statistics. Random forests achieve competitive predictive performance and are…

Machine Learning · Statistics 2015-02-17 Balaji Lakshminarayanan , Daniel M. Roy , Yee Whye Teh

Minimax optimal rates for Mondrian trees and forests

Introduced by Breiman, Random Forests are widely used classification and regression algorithms. While being initially designed as batch algorithms, several variants have been proposed to handle online learning. One particular instance of…

Machine Learning · Statistics 2019-04-10 Jaouad Mourtada , Stéphane Gaïffas , Erwan Scornet

Inference with Mondrian Random Forests

Random forests are popular methods for regression and classification analysis, and many different variants have been proposed in recent years. One interesting example is the Mondrian random forest, in which the underlying constituent trees…

Statistics Theory · Mathematics 2025-11-10 Matias D. Cattaneo , Jason M. Klusowski , William G. Underwood

The Random Forest Kernel and other kernels for big data from random partitions

We present Random Partition Kernels, a new class of kernels derived by demonstrating a natural connection between random partitions of objects and kernels between those objects. We show how the construction can be used to create kernels…

Machine Learning · Statistics 2014-02-19 Alex Davies , Zoubin Ghahramani

Making Sense of Random Forest Probabilities: a Kernel Perspective

A random forest is a popular tool for estimating probabilities in machine learning classification tasks. However, the means by which this is accomplished is unprincipled: one simply counts the fraction of trees in a forest that vote for a…

Machine Learning · Statistics 2018-12-17 Matthew A. Olson , Abraham J. Wyner

Random forests and kernel methods

Random forests are ensemble methods which grow trees as base learners and combine their predictions by averaging. Random forests are known for their good practical performance, particularly in high dimensional set-tings. On the theoretical…

Statistics Theory · Mathematics 2015-09-18 Erwan Scornet

Quadrature-based features for kernel approximation

We consider the problem of improving kernel approximation via randomized feature maps. These maps arise as Monte Carlo approximation to integral representations of kernel functions and scale up kernel methods for larger datasets. Based on…

Machine Learning · Computer Science 2018-10-31 Marina Munkhoeva , Yermek Kapushev , Evgeny Burnaev , Ivan Oseledets

Random Features for Kernel Approximation: A Survey on Algorithms, Theory, and Beyond

Random features is one of the most popular techniques to speed up kernel methods in large-scale problems. Related works have been recognized by the NeurIPS Test-of-Time award in 2017 and the ICML Best Paper Finalist in 2019. The body of…

Machine Learning · Statistics 2021-07-13 Fanghui Liu , Xiaolin Huang , Yudong Chen , Johan A. K. Suykens

Learning Random Kernel Approximations for Object Recognition

Approximations based on random Fourier features have recently emerged as an efficient and formally consistent methodology to design large-scale kernel machines. By expressing the kernel as a Fourier expansion, features are generated based…

Computer Vision and Pattern Recognition · Computer Science 2012-03-08 Eduard Gabriel Băzăvan , Fuxin Li , Cristian Sminchisescu

Non-asymptotic Properties of Generalized Mondrian Forests in Statistical Learning

Random Forests have been extensively used in regression and classification, inspiring the development of various forest-based methods. Among these, Mondrian Forests, derived from the Mondrian process, mark a significant advancement.…

Statistics Theory · Mathematics 2025-02-28 Haoran Zhan , Jingli Wang , Yingcun Xia

Random Tessellation Forests

Space partitioning methods such as random forests and the Mondrian process are powerful machine learning methods for multi-dimensional and relational data, and are based on recursively cutting a domain. The flexibility of these methods is…

Machine Learning · Statistics 2019-12-03 Shufei Ge , Shijia Wang , Yee Whye Teh , Liangliang Wang , Lloyd T. Elliott

Similarity plays a fundamental role in many areas, including data mining, machine learning, statistics and various applied domains. Inspired by the success of ensemble methods and the flexibility of trees, we propose to learn a similarity…

Machine Learning · Computer Science 2019-08-29 Donghui Yan , Songxiang Gu , Ying Xu , Zhiwei Qin

Scaling up Kernel Ridge Regression via Locality Sensitive Hashing

Random binning features, introduced in the seminal paper of Rahimi and Recht (2007), are an efficient method for approximating a kernel matrix using locality sensitive hashing. Random binning features provide a very simple and efficient way…

Machine Learning · Statistics 2020-03-24 Michael Kapralov , Navid Nouri , Ilya Razenshteyn , Ameya Velingker , Amir Zandieh

Improving Random Forests by Smoothing

Random forest regression is a powerful non-parametric method that adapts to local data characteristics through data-driven partitioning, making it effective across diverse application domains. However, the piecewise constant nature of…

Machine Learning · Computer Science 2026-05-19 Ziyi Liu , Phuc Luong , Mario Boley , Daniel F. Schmidt

Universal consistency and minimax rates for online Mondrian Forests

We establish the consistency of an algorithm of Mondrian Forests, a randomized classification algorithm that can be implemented online. First, we amend the original Mondrian Forest algorithm, that considers a fixed lifetime parameter.…

Machine Learning · Statistics 2017-11-09 Jaouad Mourtada , Stéphane Gaïffas , Erwan Scornet

Random Forest (RF) Kernel for Regression, Classification and Survival

Breiman's random forest (RF) can be interpreted as an implicit kernel generator,where the ensuing proximity matrix represents the data-driven RF kernel. Kernel perspective on the RF has been used to develop a principled framework for…

Machine Learning · Statistics 2020-09-02 Dai Feng , Richard Baumgartner

AMF: Aggregated Mondrian Forests for Online Learning

Random Forests (RF) is one of the algorithms of choice in many supervised learning applications, be it classification or regression. The appeal of such tree-ensemble methods comes from a combination of several characteristics: a remarkable…

Machine Learning · Statistics 2020-05-18 Jaouad Mourtada , Stéphane Gaïffas , Erwan Scornet

Feature maps for the Laplacian kernel and its generalizations

Recent applications of kernel methods in machine learning have seen a renewed interest in the Laplacian kernel, due to its stability to the bandwidth hyperparameter in comparison to the Gaussian kernel, as well as its expressivity being…

Machine Learning · Statistics 2025-02-24 Sudhendu Ahir , Parthe Pandit