Related papers: Hippo: A Fast, yet Scalable, Database Indexing App…

Forecasting the cost of processing multi-join queries via hashing for main-memory databases (Extended version)

Database management systems (DBMSs) carefully optimize complex multi-join queries to avoid expensive disk I/O. As servers today feature tens or hundreds of gigabytes of RAM, a significant fraction of many analytic databases becomes…

Databases · Computer Science 2015-07-22 Feilong Liu, Spyros Blanas

Indexes in Microsoft SQL Server

Indexes are the most apposite choice for quickly retrieving records. They work by cutting down the number of disk I/Os. Instead of scanning the complete table for the results, we can decrease the number of I/Os or page fetches…

Databases · Computer Science 2019-03-21 Sourav Mukherjee

FITing-Tree: A Data-aware Index Structure

Index structures are one of the most important tools that DBAs leverage to improve the performance of analytics and transactional workloads. However, building several indexes over large datasets can often become prohibitive and consume…

Databases · Computer Science 2020-03-26 Alex Galakatos, Michael Markovitch, Carsten Binnig, Rodrigo Fonseca, Tim Kraska

On the Scalability of Multidimensional Databases

It is commonly accepted in the practice of on-line analytical processing of databases that the multidimensional database organization is less scalable than the relational one. It is easy to see that the size of the multidimensional…

Databases · Computer Science 2011-04-27 István Szépkúti

DualTable: A Hybrid Storage Model for Update Optimization in Hive

Hive is the most mature and prevalent data warehouse tool providing SQL-like interface in the Hadoop ecosystem. It is successfully used in many Internet companies and shows its value for big data processing in traditional industries.…

Databases · Computer Science 2014-12-02 Songlin Hu, Wantao Liu, Tilmann Rabl, Shuo Huang, Ying Liang, Zheng Xiao, Hans-Arno Jacobsen, Xubin Pei, Jiye Wang

HIRE: A Hybrid Learned Index for Robust and Efficient Performance under Mixed Workloads

Indexes are critical for efficient data retrieval and updates in modern databases. Recent advances in machine learning have led to the development of learned indexes, which model the cumulative distribution function of data to predict…

Databases · Computer Science 2026-04-27 Xinyi Zhang, Liang Liang, Anastasia Ailamaki, Jianliang Xu

Learned Adaptive Indexing

Indexes can significantly improve search performance in relational databases. However, if the query workload changes frequently or new data updates occur continuously, it may not be worthwhile to build a conventional index upfront for query…

Databases · Computer Science 2025-08-06 Suvam Kumar Das, Suprio Ray

Bridging the Gap Between Theory and Practice on Insertion-Intensive Database

With the prevalence of online platforms, today, data is being generated and accessed by users at a very high rate. Besides, applications such as stock trading or high frequency trading require guaranteed low delays for performing an…

Databases · Computer Science 2020-03-03 Sepanta Zeighami, Raymond Chi-Wing Wong

Updatable Learned Indexes Meet Disk-Resident DBMS -- From Evaluations to Design Choices

Although many updatable learned indexes have been proposed in recent years, whether they can outperform traditional approaches on disk remains unknown. In this study, we revisit and implement four state-of-the-art updatable learned indexes…

Databases · Computer Science 2023-05-03 Hai Lan, Zhifeng Bao, J. Shane Culpepper, Renata Borovica-Gajic

A Simple Yet High-Performing On-disk Learned Index: Can We Have Our Cake and Eat it Too?

While in-memory learned indexes have shown promising performance as compared to B+-tree, most widely used databases in real applications still rely on disk-based operations. Based on our experiments, we observe that directly applying the…

Databases · Computer Science 2023-06-06 Hai Lan, Zhifeng Bao, J. Shane Culpepper, Renata Borovica-Gajic, Yu Dong

CompassDB: Pioneering High-Performance Key-Value Store with Perfect Hash

Modern mainstream persistent key-value storage engines utilize Log-Structured Merge tree (LSM-tree) based designs, optimizing read/write performance by leveraging sequential disk I/O. However, the advent of SSDs, with their significant…

Databases · Computer Science 2024-06-27 Jin Jiang, Dongsheng He, Yu Hu, Dong Liu, Chenfan Xiao, Hongxiao Bi, Yusong Zhang, Chaoqu Jiang, Zhijun Fu

Object-Relational Database Representations for Text Indexing

One of the distinctive features of Information Retrieval systems comparing to Database Management systems, is that they offer better compression for posting lists, resulting in better I/O performance and thus faster query evaluation. In…

Information Retrieval · Computer Science 2009-06-18 Panagiotis Papadakos, Yannis Theoharis, Yannis Marketakis, Nikos Armenatzoglou, Yannis Tzitzikas

Stochastic Database Cracking: Towards Robust Adaptive Indexing in Main-Memory Column-Stores

Modern business applications and scientific databases call for inherently dynamic data storage environments. Such environments are characterized by two challenging features: (a) they have little idle system time to devote on physical…

Databases · Computer Science 2012-03-02 Felix Halim, Stratos Idreos, Panagiotis Karras, Roland H. C. Yap

Cuckoo++ Hash Tables: High-Performance Hash Tables for Networking Applications

Hash tables are an essential data-structure for numerous networking applications (e.g., connection tracking, firewalls, network address translators). Among these, cuckoo hash tables provide excellent performance by allowing lookups to be…

Networking and Internet Architecture · Computer Science 2017-12-29 Nicolas Le Scouarnec

B+-tree Index Optimization by Exploiting Internal Parallelism of Flash-based Solid State Drives

Previous research addressed the potential problems of the hard-disk oriented design of DBMSs of flashSSDs. In this paper, we focus on exploiting potential benefits of flashSSDs. First, we examine the internal parallelism issues of flashSSDs…

Databases · Computer Science 2012-01-04 Hongchan Roh, Sanghyun Park, Sungho Kim, Mincheol Shin, Sang-Won Lee

Griffin: Fast Transactional Database Index with Hash and B+-Tree

Index access is one of the dominant performance factors in transactional database systems. Many systems use a B+-tree or one of its variants to handle point and range operations. This access pattern has room for performance improvement.…

Databases · Computer Science 2024-10-23 Sho Nakazono, Yutaro Bessho, Hideyuki Kawashima, Tatsuhiro Nakamori

Updatable Learned Index with Precise Positions

Index plays an essential role in modern database engines to accelerate the query processing. The new paradigm of "learned index" has significantly changed the way of designing index structures in DBMS. The key insight is that indexes could…

Databases · Computer Science 2021-04-14 Jiacheng Wu, Yong Zhang, Shimin Chen, Jin Wang, Yu Chen, Chunxiao Xing

Wormhole: A Fast Ordered Index for In-memory Data Management

In-memory data management systems, such as key-value stores, have become an essential infrastructure in today's big-data processing and cloud computing. They rely on efficient index structures to access data. While unordered indexes, such…

Databases · Computer Science 2019-02-19 Xingbo Wu, Fan Ni, Song Jiang

Ingestion, Indexing and Retrieval of High-Velocity Multidimensional Sensor Data on a Single Node

Multidimensional data are becoming more prevalent, partly due to the rise of the Internet of Things (IoT), and with that the need to ingest and analyze data streams at rates higher than before. Some industrial IoT applications require…

Databases · Computer Science 2017-07-05 Juan A. Colmenares, Reza Dorrigiv, Daniel G. Waddington

Indexing Join Inputs for Fast Queries and Maintenance

In database systems, joins are often expensive despite many years of research producing numerous join algorithms. Precomputed and materialized join views deliver the best query performance, whereas traditional indexes, used as pre-sorted…

Databases · Computer Science 2025-02-18 Wenhui Lyu, Goetz Graefe