English
Related papers

Related papers: MorphStore: Analytical Query Engine with a Holisti…

200 papers

With the prevalence of in-database AI-powered analytics, there is an increasing demand for database systems to efficiently manage the ever-expanding number and size of deep learning models. However, existing database systems typically store…

Databases · Computer Science 2025-09-16 Siqi Xiang , Sheng Wang , Xiaokui Xiao , Cong Yue , Zhanhao Zhao , Beng Chin Ooi

This study proposes a novel storage engine, SynchroStore, designed to address the inefficiency of update operations in columnar storage systems based on Log-Structured Merge Trees (LSM-Trees) under hybrid workload scenarios. While columnar…

Databases · Computer Science 2025-03-25 Yinan Zhang , Huiqi Hu , Xuan Zhou

This work presents an abstract model for the computations performed by analytic column stores or columnar query processors. The model is based on circuits whose wires carry columns rather than scalar values, and whose nodes apply operators…

Databases · Computer Science 2019-11-13 Eyal Rozenberg

Data compression is widely used in contemporary column-oriented DBMSes to lower space usage and to speed up query processing. Pioneering systems have introduced compression to tackle the disk bandwidth bottleneck by trading CPU processing…

Databases · Computer Science 2021-05-20 Alexander Slesarev , Evgeniy Klyuchikov , Kirill Smirnov , George Chernishev

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

Data-centric ML pipelines extend traditional machine learning (ML) pipelines -- of feature transformations and ML model training -- by outer loops for data cleaning, augmentation, and feature engineering to create high-quality input data.…

Databases · Computer Science 2025-04-16 Sebastian Baunsgaard , Matthias Boehm

Modern in-memory databases are typically used for high-performance workloads, therefore they have to be optimized for small memory footprint and high query speed at the same time. Data compression has the potential to reduce memory…

Databases · Computer Science 2022-09-07 Marcell Fehér , Daniel E. Lucani , Ioannis Chatzigeorgiou

Compressing integer keys is a fundamental operation among multiple communities, such as database management (DB), information retrieval (IR), and high-performance computing (HPC). Recent advances in \emph{learned indexes} have inspired the…

Databases · Computer Science 2024-12-17 Qiyu Liu , Siyuan Han , Jianwei Liao , Jin Li , Jingshu Peng , Jun Du , Lei Chen

Modern applications commonly leverage large, multi-modal foundation models. These applications often feature complex workflows that demand the storage and usage of similar models in multiple precisions. A straightforward approach is to…

Databases · Computer Science 2025-10-21 Raunak Shah , Zhaoheng Li , Yongjoo Park

Modern AI models are growing rapidly in size and redundancy, leading to significant storage and distribution challenges in model hubs. We present TStore, a tensor-centric system for reducing storage overhead through fine-grained…

Distributed, Parallel, and Cluster Computing · Computer Science 2026-05-14 Tingfeng Lan , Zirui Wang , Yunjia Zheng , Zhaoyuan Su , Juncheng Yang , Yue Cheng

Modern RDBMSs support the ability to compress data using methods such as null suppression and dictionary encoding. Data compression offers the promise of significantly reducing storage requirements and improving I/O performance for decision…

Databases · Computer Science 2011-09-06 Hideaki Kimura , Vivek Narasayya , Manoj Syamala

In recent years, resource elasticity and cost optimization have become essential for RDBMSs. While cloud-native RDBMSs provide elastic computing resources via disaggregated computing and storage, storage costs remain a critical user…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-11-26 Qingda Hu , Xinjun Yang , Feifei Li , Junru Li , Ya Lin , Yuqi Zhou , Yicong Zhu , Junwei Zhang , Rongbiao Xie , Ling Zhou , Bin Wu , Wenchao Zhou

Storing tabular data to balance storage and query efficiency is a long-standing research question in the database community. In this work, we argue and show that a novel DeepMapping abstraction, which relies on the impressive memorization…

Databases · Computer Science 2024-09-27 Lixi Zhou , K. Selçuk Candan , Jia Zou

Analyzing database access logs is a key part of performance tuning, intrusion detection, benchmark development, and many other database administration tasks. Unfortunately, it is common for production databases to deal with millions or even…

Databases · Computer Science 2018-10-02 Ting Xie , Oliver Kennedy , Varun Chandola

Tracking data lineage is important for data integrity, reproducibility, and debugging data science workflows. However, fine-grained lineage (i.e., at a cell level) is challenging to store, even for the smallest datasets. This paper…

Databases · Computer Science 2024-05-29 Jinjin Zhao , Sanjay Krishnan

Existing data storage systems offer a wide range of functionalities to accommodate an equally diverse range of applications. However, new classes of applications have emerged, e.g., blockchain and collaborative analytics, featuring data…

In recent years, column stores (or C-stores for short) have emerged as a novel approach to deal with read-mostly data warehousing applications. Experimental evidence suggests that, for certain types of queries, the new features of C-stores…

Databases · Computer Science 2009-09-15 Nicolas Bruno

The increasing demand for deep neural inference within database environments has driven the emergence of AI-native DBMSs. However, existing solutions either rely on model-centric designs requiring developers to manually select, configure,…

Column-oriented database systems have been a real game changer for the industry in recent years. Highly tuned and performant systems have evolved that provide users with the possibility of answering ad hoc queries over large datasets in an…

Databases · Computer Science 2012-08-02 Alexander Hall , Olaf Bachmann , Robert Büssow , Silviu Gănceanu , Marc Nunkesser

In column-oriented query processing, a materialization strategy determines when lightweight positions (row IDs) are translated into tuples. It is an important part of column-store architecture, since it defines the class of supported query…

Databases · Computer Science 2023-04-19 Evgeniy Klyuchikov , Elena Mikhailova , George Chernishev
‹ Prev 1 2 3 10 Next ›