Related papers: FITing-Tree: A Data-aware Index Structure

200 papers

Recent advancements in learned index structures propose replacing existing index structures, like B-Trees, with approximate learned models. In this work, we present a unified benchmark that compares well-tuned implementations of three…

Indexes are models: a B-Tree-Index can be seen as a model to map a key to the position of a record within a sorted array, a Hash-Index as a model to map a key to a position of a record within an unsorted array, and a BitMap-Index as a model…

Databases · Computer Science 2018-05-01 Tim Kraska, Alex Beutel, Ed H. Chi, Jeffrey Dean, Neoklis Polyzotis

LSM-tree-based data stores are widely used in industry due to their exceptional performance. However, as data volumes grow, efficiently querying large-scale databases becomes increasingly challenging. To address this, recent studies…

Databases · Computer Science 2025-06-11 Junfeng Liu, Jiarui Ye, Mengshi Chen, Meng Li, Siqiang Luo

In this paper, a new and novel data structure is proposed to dynamically insert and delete segments. Unlike the standard segment trees[3], the proposed data structure permits insertion of a segment with interval range beyond the interval…

Computational Geometry · Computer Science 2015-01-15 K. S. Easwarakumar, T. Hema

Index structures are important for efficient data access, which have been widely used to improve the performance in many in-memory systems. Due to high in-memory overheads, traditional index structures become difficult to process the…

Databases · Computer Science 2019-05-16 Pengfei Li, Yu Hua, Pengfei Zuo, Jingnan Jia

Learned indexes, which use machine learning models to replace traditional index structures, have shown promising results in recent studies. However, existing learned indexes exhibit a performance gap between synthetic and real-world…

Databases · Computer Science 2022-05-20 Jiaoyi Zhang, Yihan Gao

Indexes provide a method to access data in databases quickly. It can improve the response speed of subsequent queries by building a complete index in advance. However, it also leads to a huge overhead of the continuous updating during…

Databases · Computer Science 2019-11-27 Gang Wu, Yidong Song, Guodong Zhao, Wei Sun, Donghong Han, Baiyou Qiao, Guoren Wang, Ye Yuan

Index plays an essential role in modern database engines to accelerate the query processing. The new paradigm of "learned index" has significantly changed the way of designing index structures in DBMS. The key insight is that indexes could…

Databases · Computer Science 2021-04-14 Jiacheng Wu, Yong Zhang, Shimin Chen, Jin Wang, Yu Chen, Chunxiao Xing

Recent work on "learned indexes" has changed the way we look at the decades-old field of DBMS indexing. The key idea is that indexes can be thought of as "models" that predict the position of a key in a dataset. Indexes can, thus, be…

Indexing large-scale databases in main memory is still challenging today. Learned index structures -- in which the core components of classical indexes are replaced with machine learning models -- have recently been suggested to…

Databases · Computer Science 2021-01-27 Ali Hadian, Thomas Heinis

Indexes can significantly improve search performance in relational databases. However, if the query workload changes frequently or new data updates occur continuously, it may not be worthwhile to build a conventional index upfront for query…

Databases · Computer Science 2025-08-06 Suvam Kumar Das, Suprio Ray

The recent introduction of learned indexes has shaken the foundations of the decades-old field of indexing data structures. Combining, or even replacing, classic design elements such as B-tree nodes with machine learning models has proven…

Data Structures and Algorithms · Computer Science 2020-05-08 Paolo Ferragina, Giorgio Vinciguerra

Indexes are critical for efficient data retrieval and updates in modern databases. Recent advances in machine learning have led to the development of learned indexes, which model the cumulative distribution function of data to predict…

Databases · Computer Science 2026-04-27 Xinyi Zhang, Liang Liang, Anastasia Ailamaki, Jianliang Xu

The performance of today's in-memory indexes is bottlenecked by the memory latency/bandwidth wall. Processing-in-memory (PIM) is an emerging approach that potentially mitigates this bottleneck, by enabling low-latency memory access whose…

Recent work proposed learned index structures, which learn the distribution of the underlying dataset to improve performance. The initial work on learned indexes has shown that by learning the cumulative distribution function of the data,…

Databases · Computer Science 2021-02-03 Ali Hadian, Behzad Ghaffari, Taiyi Wang, Thomas Heinis

Large-scale industrial recommender systems are usually confronted with computational problems due to the enormous corpus size. To retrieve and recommend the most relevant items to users under response time limits, resorting to an efficient…

Information Retrieval · Computer Science 2019-11-21 Han Zhu, Daqing Chang, Ziru Xu, Pengye Zhang, Xiang Li, Jie He, Han Li, Jian Xu, Kun Gai

The end-to-end lookup latency of a hierarchical index -- such as a B-tree or a learned index -- is determined by its structure such as the number of layers, the kinds of branching functions appearing in each layer, the amount of data we…

Databases · Computer Science 2023-09-06 Supawit Chockchowwat, Wenjie Liu, Yongjoo Park

As a key ingredient of the DBMS, index plays an important role in the query optimization and processing. However, it is a non-trivial task to apply existing indexes or design new indexes for new applications, where both data distribution…

Databases · Computer Science 2020-03-05 Sai Wu, Xinyi Yu, Xiaojie Feng, Feifei Li, Wei Cao, Gang Chen

Indexes facilitate efficient querying when the selection predicate is on an indexed key. As a result, when loading data, if we anticipate future selective (point or range) queries, we typically maintain an index that is gradually populated…

Databases · Computer Science 2022-02-10 Aneesh Raman, Subhadeep Sarkar, Matthaios Olma, Manos Athanassoulis

Efficient indexing is fundamental for multi-dimensional data management and analytics. An emerging tendency is to directly learn the storage layout of multi-dimensional data by simple machine learning models, yielding the concept of Learned…

Databases · Computer Science 2024-05-10 Qiyu Liu, Maocheng Li, Yuxiang Zeng, Yanyan Shen, Lei Chen