English
Related papers

Related papers: Scalable $k$-d trees for distributed data

200 papers

In this paper, we revisit the problem of indexing multi-dimensional data in memory for the efficient support of multi-dimensional range queries and nearest neighbor queries. This is a classic problem in main-memory databases, where there is…

Databases · Computer Science 2026-05-06 Achilleas Michalopoulos , Dimitrios Tsitsigkos , Nikos Mamoulis

The k-d tree is a classic binary space-partitioning tree used to organize points in k-dimensional space. While used in computational geometry and graphics, the data structure has a long history of application in nearest neighbor search. The…

Logic in Computer Science · Computer Science 2023-11-21 Nadeem Abdul Hamid

The k-d tree was one of the first spatial data structures proposed for nearest neighbor search. Its efficacy is diminished in high-dimensional spaces, but several variants, with randomization and overlapping cells, have proved to be…

Data Structures and Algorithms · Computer Science 2013-02-11 Sanjoy Dasgupta , Kaushik Sinha

Working with tree graphs is always easier than with loopy ones and spanning trees are the closest tree-like structures to a given graph. We find a correspondence between the solutions of random K-satisfiability problem and those of spanning…

Disordered Systems and Neural Networks · Physics 2009-11-11 A. Ramezanpour , S. Moghimi-Araghi

The original description of the k-d tree recognized that rebalancing techniques, such as used to build an AVL tree or a red-black tree, are not applicable to a k-d tree. Hence, in order to build a balanced k-d tree, it is necessary to find…

Data Structures and Algorithms · Computer Science 2025-12-30 Russell A. Brown

Different spatial objects that vary in their characteristics, such as molecular biology and geography, are presented in spatial areas. Methods to organize, manage, and maintain those objects in a structured manner are required. Data mining…

Databases · Computer Science 2013-03-11 Dr. Mohammed Otair

We present an empirical analysis of data structures for approximate nearest neighbor searching. We compare the well-known optimized kd-tree splitting method against two alternative splitting methods. The first, called the sliding-midpoint…

Computational Geometry · Computer Science 2007-05-23 Songrit Maneewongvatana , David M. Mount

Partitioning trees are efficient data structures for $k$-nearest neighbor search. Machine learning libraries commonly use a special type of partitioning trees called $k$d-trees to perform $k$-nn search. Unfortunately, $k$d-trees can be…

Machine Learning · Computer Science 2023-02-28 Mashaan Alshammari , John Stavrakakis , Adel F. Ahmed , Masahiro Takatsuka

Recent theory work has found that a special type of spatial partition tree - called a random projection tree - is adaptive to the intrinsic dimension of the data from which it is built. Here we examine this same question, with a combination…

Machine Learning · Statistics 2025-03-27 Nakul Verma , Samory Kpotufe , Sanjoy Dasgupta

We propose and study a multi-scale approach to vector quantization. We develop an algorithm, dubbed reconstruction trees, inspired by decision trees. Here the objective is parsimonious reconstruction of unsupervised data, rather than…

Machine Learning · Computer Science 2019-09-05 Enrico Cecini , Ernesto De Vito , Lorenzo Rosasco

We introduce K-tree in an information retrieval context. It is an efficient approximation of the k-means clustering algorithm. Unlike k-means it forms a hierarchy of clusters. It has been extended to address issues with sparse…

Information Retrieval · Computer Science 2010-01-07 Christopher M. De Vries , Shlomo Geva

Several structural learning algorithms for staged tree models, an asymmetric extension of Bayesian networks, have been defined. However, they do not scale efficiently as the number of variables considered increases. Here we introduce the…

Machine Learning · Statistics 2022-06-15 Manuele Leonelli , Gherardo Varando

A buffer k-d tree is a k-d tree variant for massively-parallel nearest neighbor search. While providing valuable speed-ups on modern many-core devices in case both a large number of reference and query points are given, buffer k-d trees are…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-12-10 Fabian Gieseke , Cosmin Eugen Oancea , Ashish Mahabal , Christian Igel , Tom Heskes

Regression trees are one of the oldest forms of AI models, and their predictions can be made without a calculator, which makes them broadly useful, particularly for high-stakes applications. Within the large literature on regression trees,…

Machine Learning · Computer Science 2023-04-11 Rui Zhang , Rui Xin , Margo Seltzer , Cynthia Rudin

The original description of the k-d tree recognized that rebalancing techniques, such as are used to build an AVL tree or a red-black tree, are not applicable to a k-d tree. Hence, in order to build a balanced k-d tree, it is necessary to…

Data Structures and Algorithms · Computer Science 2020-11-03 Russell A. Brown

The k-means clustering algorithm is a popular algorithm that partitions data into k clusters. There are many improvements to accelerate the standard algorithm. Most current research employs upper and lower bounds on point-to-cluster…

Machine Learning · Computer Science 2024-10-22 Andreas Lang , Erich Schubert

Dynamic trees are mixtures of tree structured belief networks. They solve some of the problems of fixed tree networks at the cost of making exact inference intractable. For this reason approximate methods such as sampling or mean field…

Machine Learning · Computer Science 2013-01-18 Amos J. Storkey

We construct near optimal linear decision trees for a variety of decision problems in combinatorics and discrete geometry. For example, for any constant $k$, we construct linear decision trees that solve the $k$-SUM problem on $n$ elements…

Computational Geometry · Computer Science 2017-05-05 Daniel M. Kane , Shachar Lovett , Shay Moran

Frequencies of $k$-mers in sequences are sometimes used as a basis for inferring phylogenetic trees without first obtaining a multiple sequence alignment. We show that a standard approach of using the squared-Euclidean distance between…

Populations and Evolution · Quantitative Biology 2016-01-15 Elizabeth S. Allman , John A. Rhodes , Seth Sullivant

Many data are naturally modeled by an unobserved hierarchical structure. In this paper we propose a flexible nonparametric prior over unknown data hierarchies. The approach uses nested stick-breaking processes to allow for trees of…

Methodology · Statistics 2010-06-08 Ryan Prescott Adams , Zoubin Ghahramani , Michael I. Jordan
‹ Prev 1 2 3 10 Next ›