Related papers: Scalable $k$-d trees for distributed data

In-memory Multidimensional Indexing Using the skd-tree

In this paper, we revisit the problem of indexing multi-dimensional data in memory for the efficient support of multi-dimensional range queries and nearest neighbor queries. This is a classic problem in main-memory databases, where there is…

Databases · Computer Science 2026-05-06 Achilleas Michalopoulos, Dimitrios Tsitsigkos, Nikos Mamoulis

(Nearest) Neighbors You Can Rely On: Formally Verified k-d Tree Construction and Search in Coq

The k-d tree is a classic binary space-partitioning tree used to organize points in k-dimensional space. While used in computational geometry and graphics, the data structure has a long history of application in nearest neighbor search. The…

Logic in Computer Science · Computer Science 2023-11-21 Nadeem Abdul Hamid

Randomized partition trees for exact nearest neighbor search

The k-d tree was one of the first spatial data structures proposed for nearest neighbor search. Its efficacy is diminished in high-dimensional spaces, but several variants, with randomization and overlapping cells, have proved to be…

Data Structures and Algorithms · Computer Science 2013-02-11 Sanjoy Dasgupta, Kaushik Sinha

Spanning Trees in Random Satisfiability Problems

Working with tree graphs is always easier than with loopy ones and spanning trees are the closest tree-like structures to a given graph. We find a correspondence between the solutions of random K-satisfiability problem and those of spanning…

Disordered Systems and Neural Networks · Physics 2009-11-11 A. Ramezanpour, S. Moghimi-Araghi

Review of Three Algorithms That Build k-d Trees

The original description of the k-d tree recognized that rebalancing techniques, such as used to build an AVL tree or a red-black tree, are not applicable to a k-d tree. Hence, in order to build a balanced k-d tree, it is necessary to find…

Data Structures and Algorithms · Computer Science 2025-12-30 Russell A. Brown

Approximate k-nearest neighbour based spatial clustering using k-d tree

Different spatial objects that vary in their characteristics, such as molecular biology and geography, are presented in spatial areas. Methods to organize, manage, and maintain those objects in a structured manner are required. Data mining…

Databases · Computer Science 2013-03-11 Dr. Mohammed Otair

Analysis of approximate nearest neighbor searching with clustered point sets

We present an empirical analysis of data structures for approximate nearest neighbor searching. We compare the well-known optimized kd-tree splitting method against two alternative splitting methods. The first, called the sliding-midpoint…

Computational Geometry · Computer Science 2007-05-23 Songrit Maneewongvatana, David M. Mount

The Effect of Points Dispersion on the $k$-nn Search in Random Projection Forests

Partitioning trees are efficient data structures for $k$-nearest neighbor search. Machine learning libraries commonly use a special type of partitioning trees called $k$d-trees to perform $k$-nn search. Unfortunately, $k$d-trees can be…

Machine Learning · Computer Science 2023-02-28 Mashaan Alshammari, John Stavrakakis, Adel F. Ahmed, Masahiro Takatsuka

Which Spatial Partition Trees are Adaptive to Intrinsic Dimension?

Recent theory work has found that a special type of spatial partition tree - called a random projection tree - is adaptive to the intrinsic dimension of the data from which it is built. Here we examine this same question, with a combination…

Machine Learning · Statistics 2025-03-27 Nakul Verma, Samory Kpotufe, Sanjoy Dasgupta

Multi-Scale Vector Quantization with Reconstruction Trees

We propose and study a multi-scale approach to vector quantization. We develop an algorithm, dubbed reconstruction trees, inspired by decision trees. Here the objective is parsimonious reconstruction of unsupervised data, rather than…

Machine Learning · Computer Science 2019-09-05 Enrico Cecini, Ernesto De Vito, Lorenzo Rosasco

K-tree: Large Scale Document Clustering

We introduce K-tree in an information retrieval context. It is an efficient approximation of the k-means clustering algorithm. Unlike k-means it forms a hierarchy of clusters. It has been extended to address issues with sparse…

Information Retrieval · Computer Science 2010-01-07 Christopher M. De Vries, Shlomo Geva

Highly Efficient Structural Learning of Sparse Staged Trees

Several structural learning algorithms for staged tree models, an asymmetric extension of Bayesian networks, have been defined. However, they do not scale efficiently as the number of variables considered increases. Here we introduce the…

Machine Learning · Statistics 2022-06-15 Manuele Leonelli, Gherardo Varando

Bigger Buffer k-d Trees on Multi-Many-Core Systems

A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-12-10 Fabian Gieseke, Cosmin Eugen Oancea, Ashish Mahabal, Christian Igel, Tom Heskes

Optimal Sparse Regression Trees

Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees,…

Machine Learning · Computer Science 2023-04-11 Rui Zhang, Rui Xin, Margo Seltzer, Cynthia Rudin

Building a Balanced k-d Tree with MapReduce

The original description of the k-d tree recognized that rebalancing techniques, such as are used to build an AVL tree or a red-black tree, are not applicable to a k-d tree. Hence, in order to build a balanced k-d tree, it is necessary to…

Data Structures and Algorithms · Computer Science 2020-11-03 Russell A. Brown

Accelerating k-Means Clustering with Cover Trees

The k-means clustering algorithm is a popular algorithm that partitions data into k clusters. There are many improvements to accelerate the standard algorithm. Most current research employs upper and lower bounds on point-to-cluster…

Machine Learning · Computer Science 2024-10-22 Andreas Lang, Erich Schubert

Dynamic Trees: A Structured Variational Method Giving Efficient Propagation Rules

Dynamic trees are mixtures of tree structured belief networks. They solve some of the problems of fixed tree networks at the cost of making exact inference intractable. For this reason approximate methods such as sampling or mean field…

Machine Learning · Computer Science 2013-01-18 Amos J. Storkey

Near-optimal linear decision trees for k-SUM and related problems

We construct near optimal linear decision trees for a variety of decision problems in combinatorics and discrete geometry. For example, for any constant $k$, we construct linear decision trees that solve the $k$-SUM problem on $n$ elements…

Computational Geometry · Computer Science 2017-05-05 Daniel M. Kane, Shachar Lovett, Shay Moran

Statistically-Consistent k-mer Methods for Phylogenetic Tree Reconstruction

Frequencies of $k$-mers in sequences are sometimes used as a basis for inferring phylogenetic trees without first obtaining a multiple sequence alignment. We show that a standard approach of using the squared-Euclidean distance between…

Populations and Evolution · Quantitative Biology 2016-01-15 Elizabeth S. Allman, John A. Rhodes, Seth Sullivant

Tree-Structured Stick Breaking Processes for Hierarchical Data

Many data are naturally modeled by an unobserved hierarchical structure. In this paper we propose a flexible nonparametric prior over unknown data hierarchies. The approach uses nested stick-breaking processes to allow for trees of…

Methodology · Statistics 2010-06-08 Ryan Prescott Adams, Zoubin Ghahramani, Michael I. Jordan