English
Related papers

Related papers: An LSM-based Tuple Compaction Framework for Apache…

200 papers

In the last decade, document store database systems have gained more traction for storing and querying large volumes of semi-structured data. However, the flexibility of the document stores' data models has limited their ability to store…

Databases · Computer Science 2021-11-24 Wail Y. Alkowaileet , Michael J. Carey

In recent years, the Log Structured Merge (LSM) tree has been widely adopted by NoSQL and NewSQL systems for its superior write performance. Despite its popularity, however, most existing work has focused on LSM-based key-value stores with…

Databases · Computer Science 2019-01-08 Chen Luo , Michael J. Carey

Compaction is a necessary, but often costly background process in write-optimized data structures like LSM-trees that reorganizes incoming data that is sequentially appended to logs. In this paper, we introduce Transformation-Embedded…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-11 Holly Casaletto , Jeff Lefevre , Aldrin Montana , Peter Alvaro

Log-structured merge (LSM) trees offer efficient ingestion by appending incoming data, and thus, are widely used as the storage layer of production NoSQL data stores. To enable competitive read performance, LSM-trees periodically…

Databases · Computer Science 2022-03-01 Subhadeep Sarkar , Dimitris Staratzis , Zichen Zhu , Manos Athanassoulis

The Log-Structured Merge-Tree (LSM-tree) has been widely adopted for use in modern NoSQL systems for its superior write performance. Despite the popularity of LSM-trees, they have been criticized for suffering from write stalls and large…

Databases · Computer Science 2020-04-14 Chen Luo , Michael J. Carey

The proliferation of small files in data lakes poses significant challenges, including degraded query performance, increased storage costs, and scalability bottlenecks in distributed storage systems. Log-structured table formats (LSTs) such…

Log-Structured Merge-trees (LSM-trees) have been widely used in modern NoSQL systems. Due to their out-of-place update design, LSM-trees have introduced memory walls among the memory components of multiple LSM-trees and between the write…

Databases · Computer Science 2020-07-16 Chen Luo , Michael J. Carey

Modern key-value storage engines built on Log-Structured Merge-trees (LSM-trees), such as RocksDB and LevelDB, rely heavily on the performance of their compaction operations, which are impacted by a complex set of interdependent…

Performance · Computer Science 2026-02-16 Jiabiao Ding , Yina Lv , Qiao Li , Zhirong Shen , Chun Jason Xue

Big Data today is being generated at an unprecedented rate from various sources such as sensors, applications, and devices, and it often needs to be enriched based on other reference information to support complex analytical queries.…

Databases · Computer Science 2020-08-18 Xikui Wang , Michael J. Carey

Parallel shared-nothing data management systems have been widely used to exploit a cluster of machines for efficient and scalable data processing. When a cluster needs to be dynamically scaled in or out, data must be efficiently rebalanced.…

Databases · Computer Science 2021-05-25 Chen Luo , Michael J. Carey

Many applications process a stream of tuples over a window duration, and require the results within a specified deadline after the end of the window. For such scenarios, processing tuples intermittently (in batches) instead of eagerly…

Databases · Computer Science 2026-05-19 Saranya Chandrasekaran , S. Sudarshan

This paper introduces a novel approach to schema inference as an on-demand function integrated directly within a DBMS, targeting NoSQL databases where schema flexibility can create challenges. Unlike previous methods relying on external…

Databases · Computer Science 2024-11-21 Calvin Dani , Shiva Jahangiri , Thomas Hütter

LSM-tree has been widely used in cloud computing systems by Google, Facebook, and Amazon, to achieve high performance for write-intensive workloads. However, in LSM-tree, random key-value queries can experience long latency and low…

Data Structures and Algorithms · Computer Science 2016-06-08 Lei Guo , Dejun Teng , Rubao Lee , Feng Chen , Siyuan Ma , Xiaodong Zhang

The development of high-speed storage devices such as NVMe SSDs has shifted the primary I/O bottleneck from hardware to software. Modern database systems also rely on kernel-based I/O paths, where frequent system call invocations and…

Databases · Computer Science 2026-03-06 Hongsu Byun , Seungjae Lee , Honghyeon Yoo , Myoungjoon Kim , Sungyong Park

Log-Structured Merge trees (LSM trees) are increasingly used as the storage engines behind several data systems, frequently deployed in the cloud. Similar to other database architectures, LSM trees take into account information about the…

Databases · Computer Science 2021-11-04 Andy Huynh , Harshal A. Chaudhari , Evimaria Terzi , Manos Athanassoulis

Analyzing the increasingly large volumes of data that are available today, possibly including the application of custom machine learning models, requires the utilization of distributed frameworks. This can result in serious productivity…

Databases · Computer Science 2019-08-20 Phanwadee Sinthong , Michael J. Carey

LSM-tree is a widely adopted data structure in modern key-value store systems that optimizes write performance in write-heavy applications by using append writes to achieve sequential writes. However, the unpredictability of LSM-tree…

Databases · Computer Science 2025-07-15 Yina Lv , Qiao Li , Quanqing Xu , Congming Gao , Chuanhui Yang , Xiaoli Wang , Chun Jason Xue

Unstructured data formats account for over 80% of the data currently stored, and extracting value from such formats remains a considerable challenge. In particular, current approaches for managing unstructured documents do not support…

In the current context of Big Data, a multitude of new NoSQL solutions for storing, managing, and extracting information and patterns from semi-structured data have been proposed and implemented. These solutions were developed to relieve…

This study proposes a novel storage engine, SynchroStore, designed to address the inefficiency of update operations in columnar storage systems based on Log-Structured Merge Trees (LSM-Trees) under hybrid workload scenarios. While columnar…

Databases · Computer Science 2025-03-25 Yinan Zhang , Huiqi Hu , Xuan Zhou
‹ Prev 1 2 3 10 Next ›