Related papers: FITing-Tree: A Data-aware Index Structure

Benchmarking Learned Indexes

Recent advancements in learned index structures propose replacing existing index structures, like B-Trees, with approximate learned models. In this work, we present a unified benchmark that compares well-tuned implementations of three…

Databases · Computer Science 2023-03-28 Ryan Marcus , Andreas Kipf , Alexander van Renen , Mihail Stoian , Sanchit Misra , Alfons Kemper , Thomas Neumann , Tim Kraska

The Case for Learned Index Structures

Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model…

Databases · Computer Science 2018-05-01 Tim Kraska , Alex Beutel , Ed H. Chi , Jeffrey Dean , Neoklis Polyzotis

Evaluating Learned Indexes in LSM-tree Systems: Benchmarks,Insights and Design Choices

LSM-tree-based data stores are widely used in industry due to their exceptional performance. However, as data volumes grow, efficiently querying large-scale databases becomes increasingly challenging. To address this, recent studies…

Databases · Computer Science 2025-06-11 Junfeng Liu , Jiarui Ye , Mengshi Chen , Meng Li , Siqiang Luo

BITS-Tree-An Efficient Data Structure for Segment Storage and Query Processing

In this paper, a new and novel data structure is proposed to dynamically insert and delete segments. Unlike the standard segment trees[3], the proposed data structure permits insertion of a segment with interval range beyond the interval…

Computational Geometry · Computer Science 2015-01-15 K. S. Easwarakumar , T. Hema

A Scalable Learned Index Scheme in Storage Systems

Index structures are important for efficient data access, which have been widely used to improve the performance in many in-memory systems. Due to high in-memory overheads, traditional index structures become difficult to process the…

Databases · Computer Science 2019-05-16 Pengfei Li , Yu Hua , Pengfei Zuo , Jingnan Jia

CARMI: A Cache-Aware Learned Index with a Cost-based Construction Algorithm

Learned indexes, which use machine learning models to replace traditional index structures, have shown promising results in recent studies. However, existing learned indexes exhibit a performance gap between synthetic and real-world…

Databases · Computer Science 2022-05-20 Jiaoyi Zhang , Yihan Gao

Cracking In-Memory Database Index A Case Study for Adaptive Radix Tree Index

Indexes provide a method to access data in databases quickly. It can improve the response speed of subsequent queries by building a complete index in advance. However, it also leads to a huge overhead of the continuous updating during…

Databases · Computer Science 2019-11-27 Gang Wu , Yidong Song , Guodong Zhao , Wei Sun , Donghong Han , Baiyou Qiao , Guoren Wang , Ye Yuan

Updatable Learned Index with Precise Positions

Index plays an essential role in modern database engines to accelerate the query processing. The new paradigm of "learned index" has significantly changed the way of designing index structures in DBMS. The key insight is that indexes could…

Databases · Computer Science 2021-04-14 Jiacheng Wu , Yong Zhang , Shimin Chen , Jin Wang , Yu Chen , Chunxiao Xing

ALEX: An Updatable Adaptive Learned Index

Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be…

Databases · Computer Science 2020-06-25 Jialin Ding , Umar Farooq Minhas , Jia Yu , Chi Wang , Jaeyoung Do , Yinan Li , Hantian Zhang , Badrish Chandramouli , Johannes Gehrke , Donald Kossmann , David Lomet , Tim Kraska

Shift-Table: A Low-latency Learned Index for Range Queries using Model Correction

Indexing large-scale databases in main memory is still challenging today. Learned index structures -- in which the core components of classical indexes are replaced with machine learning models -- have recently been suggested to…

Databases · Computer Science 2021-01-27 Ali Hadian , Thomas Heinis

Learned Adaptive Indexing

Indexes can significantly improve search performance in relational databases. However, if the query workload changes frequently or new data updates occur continuously, it may not be worthwhile to build a conventional index upfront for query…

Databases · Computer Science 2025-08-06 Suvam Kumar Das , Suprio Ray

The PGM-index: a multicriteria, compressed and learned approach to data indexing

The recent introduction of learned indexes has shaken the foundations of the decades-old field of indexing data structures. Combining, or even replacing, classic design elements such as B-tree nodes with machine learning models has proven…

Data Structures and Algorithms · Computer Science 2020-05-08 Paolo Ferragina , Giorgio Vinciguerra

HIRE: A Hybrid Learned Index for Robust and Efficient Performance under Mixed Workloads

Indexes are critical for efficient data retrieval and updates in modern databases. Recent advances in machine learning have led to the development of learned indexes, which model the cumulative distribution function of data to predict…

Databases · Computer Science 2026-04-27 Xinyi Zhang , Liang Liang , Anastasia Ailamaki , Jianliang Xu

PIM-tree: A Skew-resistant Index for Processing-in-Memory

The performance of today's in-memory indexes is bottlenecked by the memory latency/bandwidth wall. Processing-in-memory (PIM) is an emerging approach that potentially mitigates this bottleneck, by enabling low-latency memory access whose…

Databases · Computer Science 2022-11-22 Hongbo Kang , Yiwei Zhao , Guy E. Blelloch , Laxman Dhulipala , Yan Gu , Charles McGuffey , Phillip B. Gibbons

COAX: Correlation-Aware Indexing on Multidimensional Data with Soft Functional Dependencies

Recent work proposed learned index structures, which learn the distribution of the underlying dataset to improve performance. The initial work on learned indexes has shown that by learning the cumulative distribution function of the data,…

Databases · Computer Science 2021-02-03 Ali Hadian , Behzad Ghaffari , Taiyi Wang , Thomas Heinis

Joint Optimization of Tree-based Index and Deep Model for Recommender Systems

Large-scale industrial recommender systems are usually confronted with computational problems due to the enormous corpus size. To retrieve and recommend the most relevant items to users under response time limits, resorting to an efficient…

Information Retrieval · Computer Science 2019-11-21 Han Zhu , Daqing Chang , Ziru Xu , Pengye Zhang , Xiang Li , Jie He , Han Li , Jian Xu , Kun Gai

AirIndex: Versatile Index Tuning Through Data and Storage

The end-to-end lookup latency of a hierarchical index -- such as a B-tree or a learned index -- is determined by its structure such as the number of layers, the kinds of branching functions appearing in each layer, the amount of data we…

Databases · Computer Science 2023-09-06 Supawit Chockchowwat , Wenjie Liu , Yongjoo Park

Progressive Neural Index Search for Database System

As a key ingredient of the DBMS, index plays an important role in the query optimization and processing. However, it is a non-trivial task to apply existing indexes or design new indexes for new applications, where both data distribution…

Databases · Computer Science 2020-03-05 Sai Wu , Xinyi Yu , Xiaojie Feng , Feifei Li , Wei Cao , Gang Chen

OSM-tree: A Sortedness-Aware Index

Indexes facilitate efficient querying when the selection predicate is on an indexed key. As a result, when loading data, if we anticipate future selective (point or range) queries, we typically maintain an index that is gradually populated…

Databases · Computer Science 2022-02-10 Aneesh Raman , Subhadeep Sarkar , Matthaios Olma , Manos Athanassoulis

How Good Are Multi-dimensional Learned Indices? An Experimental Survey

Efficient indexing is fundamental for multi-dimensional data management and analytics. An emerging tendency is to directly learn the storage layout of multi-dimensional data by simple machine learning models, yielding the concept of Learned…

Databases · Computer Science 2024-05-10 Qiyu Liu , Maocheng Li , Yuxiang Zeng , Yanyan Shen , Lei Chen