Related papers: Learning-Augmented Search Data Structures

Learning Augmented Binary Search Trees

A treap is a classic randomized binary search tree data structure that is easy to implement and supports O(\log n) expected time access. However, classic treaps do not take advantage of the input distribution or patterns in the input. Given…

Data Structures and Algorithms · Computer Science 2022-06-27 Honghao Lin , Tian Luo , David P. Woodruff

Learning Tree-Structured Composition of Data Augmentation

Data augmentation is widely used for training a neural network given little labeled data. A common practice of augmentation training is applying a composition of multiple transformations sequentially to the data. Existing augmentation…

Machine Learning · Computer Science 2024-08-27 Dongyue Li , Kailai Chen , Predrag Radivojac , Hongyang R. Zhang

Robust Learning-Augmented Dictionaries

We present the first learning-augmented data structure for implementing dictionaries with optimal consistency and robustness. Our data structure, named RobustSL, is a skip list augmented by predictions of access frequencies of elements in a…

Data Structures and Algorithms · Computer Science 2024-02-16 Ali Zeynali , Shahin Kamali , Mohammad Hajiesmaili

On the Power of Learning-Augmented Search Trees

We study learning-augmented binary search trees (BSTs) via Treaps with carefully designed priorities. The result is a simple search tree in which the depth of each item $x$ is determined by its predicted weight $w_x$. Specifically, each…

Data Structures and Algorithms · Computer Science 2025-05-16 Jingbang Chen , Xinyuan Cao , Alicia Stepin , Li Chen

From Specific to Generic Learned Sorted Set Dictionaries: A Theoretically Sound Paradigm Yelding Competitive Data Structural Boosters in Practice

This research concerns Learned Data Structures, a recent area that has emerged at the crossroad of Machine Learning and Classic Data Structures. It is methodologically important and with a high practical impact. We focus on Learned Indexes,…

Data Structures and Algorithms · Computer Science 2023-09-06 Domenico Amato , Giosué Lo Bosco , Raffaele Giancarlo

Compound Binary Search Tree and Algorithms

The Binary Search Tree (BST) is average in computer science which supports a compact data structure in memory and oneself even conducts a row of quick algorithms, by which people often apply it in dynamical circumstance. Besides these…

Data Structures and Algorithms · Computer Science 2018-10-05 Yong Tan

Forgetful Forests: high performance learning data structures for streaming data under concept drift

Database research can help machine learning performance in many ways. One way is to design better data structures. This paper combines the use of incremental computation and sequential and probabilistic filtering to enable "forgetful"…

Machine Learning · Computer Science 2022-12-16 Zhehu Yuan , Yinqi Sun , Dennis Shasha

Lazy Search Trees

We introduce the lazy search tree data structure. The lazy search tree is a comparison-based data structure on the pointer machine that supports order-based operations such as rank, select, membership, predecessor, successor, minimum, and…

Data Structures and Algorithms · Computer Science 2020-10-20 Bryce Sandlund , Sebastian Wild

Enhancing In-Memory Spatial Indexing with Learned Search

Spatial data is ubiquitous. Massive amounts of data are generated every day from a plethora of sources such as billions of GPS-enabled devices (e.g., cell phones, cars, and sensors), consumer-based applications (e.g., Uber and Strava), and…

Databases · Computer Science 2023-09-13 Varun Pandey , Alexander van Renen , Eleni Tzirita Zacharatou , Andreas Kipf , Ibrahim Sabek , Jialin Ding , Volker Markl , Alfons Kemper

Learned Sorted Table Search and Static Indexes in Small Model Space

Machine Learning Techniques, properly combined with Data Structures, have resulted in Learned Static Indexes, innovative and powerful tools that speed-up Binary Search, with the use of additional space with respect to the table being…

Information Retrieval · Computer Science 2022-09-20 Domenico Amato , Giosuè Lo Bosco , Raffaele Giancarlo

The Case for Learned Spatial Indexes

Spatial data is ubiquitous. Massive amounts of data are generated every day from billions of GPS-enabled devices such as cell phones, cars, sensors, and various consumer-based applications such as Uber, Tinder, location-tagged posts in…

Databases · Computer Science 2020-08-25 Varun Pandey , Alexander van Renen , Andreas Kipf , Ibrahim Sabek , Jialin Ding , Alfons Kemper

Double Coverage with Machine-Learned Advice

We study the fundamental online k-server problem in a learning-augmented setting. While in the traditional online model, an algorithm has no information about the request sequence, we assume that there is given some advice (e.g.…

Machine Learning · Computer Science 2021-11-17 Alexander Lindermayr , Nicole Megow , Bertrand Simon

Deterministic Iteratively Built KD-Tree with KNN Search for Exact Applications

K-Nearest Neighbors (KNN) search is a fundamental algorithm in artificial intelligence software with applications in robotics, and autonomous vehicles. These wide-ranging applications utilize KNN either directly for simple classification or…

Software Engineering · Computer Science 2021-06-08 Aryan Naim , Joseph Bowkett , Sisir Karumanchi , Peyman Tavallali , Brett Kennedy

Learning to Search Better Than Your Teacher

Methods for learning to search for structured prediction typically imitate a reference policy, with existing theoretical guarantees demonstrating low regret compared to that reference. This is unsatisfactory in many applications where the…

Machine Learning · Computer Science 2015-05-21 Kai-Wei Chang , Akshay Krishnamurthy , Alekh Agarwal , Hal Daumé , John Langford

Updatable Balanced Index for Fast On-device Search with Auto-selection Model

Diverse types of edge data, such as 2D geo-locations and 3D point clouds, are collected by sensors like lidar and GPS receivers on edge devices. On-device searches, such as k-nearest neighbor (kNN) search and radius search, are commonly…

Databases · Computer Science 2025-12-16 Yushuai Ji , Sheng Wang , Zhiyu Chen , Yuan Sun , Zhiyong Peng

Learning from Data to Speed-up Sorted Table Search Procedures: Methodology and Practical Guidelines

Sorted Table Search Procedures are the quintessential query-answering tool, with widespread usage that now includes also Web Applications, e.g, Search Engines (Google Chrome) and ad Bidding Systems (AppNexus). Speeding them up, at very…

Machine Learning · Computer Science 2020-07-31 Domenico Amato , Giosué Lo Bosco , Raffaele Giancarlo

Technical Report: Bundling Linked Data Structures for Linearizable Range Queries

We present bundled references, a new building block to provide linearizable range query operations for highly concurrent lock-based linked data structures. Bundled references allow range queries to traverse a path through the data structure…

Data Structures and Algorithms · Computer Science 2022-06-30 Jacob Nelson-Slivon , Ahmed Hassan , Roberto Palmieri

Towards Searching Efficient and Accurate Neural Network Architectures in Binary Classification Problems

In recent years, deep neural networks have had great success in machine learning and pattern recognition. Architecture size for a neural network contributes significantly to the success of any neural network. In this study, we optimize the…

Machine Learning · Computer Science 2021-01-19 Yigit Alparslan , Ethan Jacob Moyer , Isamu Mclean Isozaki , Daniel Schwartz , Adam Dunlop , Shesh Dave , Edward Kim

Maximum Inner-Product Search using Tree Data-structures

The problem of {\em efficiently} finding the best match for a query in a given set with respect to the Euclidean distance or the cosine similarity has been extensively studied in literature. However, a closely related problem of efficiently…

Computational Geometry · Computer Science 2021-06-24 Parikshit Ram , Alexander G. Gray

Using Multi-Core HW/SW Co-design Architecture for Accelerating K-means Clustering Algorithm

The capability of classifying and clustering a desired set of data is an essential part of building knowledge from data. However, as the size and dimensionality of input data increases, the run-time for such clustering algorithms is…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-07-25 Hadi Mardani Kamali