Related papers: Finding a Second Wind: Speeding Up Graph Traversal…

Speeding up SQL subqueries via decoupling of non-correlated predicate (extended version)

In this paper, we discuss a novel technique for processing correlated subqueries in SQL. The core idea is to isolate the non-correlated part of the predicate and use it to reduce the number of evaluations of the correlated part. We begin by…

Databases · Computer Science 2025-10-08 Dmitrii Radivonchik , Yakov Kuzin , Anton Chizhov , Dmitriy Shcheka , Mikhail Firsov , Kirill Smirnov , George Chernishev

Hybrid Materialization in a Disk-Based Column-Store

In column-oriented query processing, a materialization strategy determines when lightweight positions (row IDs) are translated into tuples. It is an important part of column-store architecture, since it defines the class of supported query…

Databases · Computer Science 2023-04-19 Evgeniy Klyuchikov , Elena Mikhailova , George Chernishev

Columnar Storage and List-based Processing for Graph Database Management Systems

We revisit column-oriented storage and query processing techniques in the context of contemporary graph database management systems (GDBMSs). Similar to column-oriented RDBMSs, GDBMSs support read-heavy analytical workloads that however…

Databases · Computer Science 2021-10-29 Pranjal Gupta , Amine Mhedhbi , Semih Salihoglu

Column-Oriented Storage Techniques for MapReduce

Users of MapReduce often run into performance problems when they scale up their workloads. Many of the problems they encounter can be overcome by applying techniques learned from over three decades of research on parallel DBMSs. However,…

Databases · Computer Science 2011-05-24 Avrilia Floratou , Jignesh Patel , Eugene Shekita , Sandeep Tata

Relational Memory: Native In-Memory Accesses on Rows and Columns

Analytical database systems are typically designed to use a column-first data layout to access only the desired fields. On the other hand, storing data row-first works great for accessing, inserting, or updating entire rows. Transforming…

Databases · Computer Science 2022-02-08 Shahin Roozkhosh , Denis Hoornaert , Ju Hyoung Mun , Tarikul Islam Papon , Ahmed Sanaullah , Ulrich Drepper , Renato Mancuso , Manos Athanassoulis

Processing a Trillion Cells per Mouse Click

Column-oriented database systems have been a real game changer for the industry in recent years. Highly tuned and performant systems have evolved that provide users with the possibility of answering ad hoc queries over large datasets in an…

Databases · Computer Science 2012-08-02 Alexander Hall , Olaf Bachmann , Robert Büssow , Silviu Gănceanu , Marc Nunkesser

Designing Succinct Secondary Indexing Mechanism by Exploiting Column Correlations (Extended Version)

Database administrators construct secondary indexes on data tables to accelerate query processing in relational database management systems (RDBMSs). These indexes are built on top of the most frequently queried columns according to the…

Databases · Computer Science 2019-04-03 Yingjun Wu , Jia Yu , Yuanyuan Tian , Richard Sidle , Ronald Barber

Revisiting Data Compression in Column-Stores

Data compression is widely used in contemporary column-oriented DBMSes to lower space usage and to speed up query processing. Pioneering systems have introduced compression to tackle the disk bandwidth bottleneck by trading CPU processing…

Databases · Computer Science 2021-05-20 Alexander Slesarev , Evgeniy Klyuchikov , Kirill Smirnov , George Chernishev

Cohort Query Processing

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang , Qingchao Cai , Gang Chen , H. V. Jagadish , Beng Chin Ooi , Kian-Lee Tan , Anthony K. H. Tung

Robust Recursive Query Parallelism in Graph Database Management Systems

Efficient multi-core parallel processing of recursive join queries is critical for achieving good performance in graph database management systems (GDBMSs). Prior work adopts two broad approaches. First is the state of the art morsel-driven…

Databases · Computer Science 2025-08-28 Anurag Chakraborty , Semih Salihoğlu

Efficient and Effective Table-Centric Table Union Search in Data Lakes

In data lakes, information on the same subject is often fragmented across multiple tables. Table union search aims to find the top-k tables that can be unioned with a query table to extend it with more rows, without relying on metadata or…

Databases · Computer Science 2026-03-19 Yongkang Sun , Zhihao Ding , Huiqiang Wang , Reynold Cheng , Jieming Shi

Column-Oriented Datalog Materialization for Large Knowledge Graphs (Extended Technical Report)

The evaluation of Datalog rules over large Knowledge Graphs (KGs) is essential for many applications. In this paper, we present a new method of materializing Datalog inferences, which combines a column-based memory layout with novel…

Databases · Computer Science 2016-02-12 Jacopo Urbani , Ceriel Jacobs , Markus Krötzsch

Fast and Scalable Memristive In-Memory Sorting with Column-Skipping Algorithm

Memristive in-memory sorting has been proposed recently to improve hardware sorting efficiency. Using iterative in-memory min computations, data movements between memory and external processing units can be eliminated for improved latency…

Hardware Architecture · Computer Science 2022-02-22 Lianfeng Yu , Zhaokun Jing , Yuchao Yang , Yaoyu Tao

Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs

Differential computation (DC) is a highly general incremental computation/view maintenance technique that can maintain the output of an arbitrary and possibly recursive dataflow computation upon changes to its base inputs. As such, it is a…

Databases · Computer Science 2022-08-02 Khaled Ammar , Siddhartha Sahu , Semih Salihoglu , M. Tamer Ozsu

Multi Query Optimization in GLADE

SQL-on-Hadoop systems, query optimization, data distribution over multiple nodes and parallelization techniques are few of the areas under extreme research these days. Big names like Amazon, Google, Microsoft and many more are working on…

Databases · Computer Science 2016-08-17 Abdur Rafay

Optimizations and Heuristics to improve Compression in Columnar Database Systems

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

Improving Seek Time for Column Store Using MMH Algorithm

Hash based search has, proven excellence on large data warehouses stored in column store. Data distribution has significant impact on hash based search. To reduce impact of data distribution, we have proposed Memory Managed Hash (MMH)…

Databases · Computer Science 2012-04-10 Tejaswini Apte , Dr. Maya Ingle , Dr. A. K. Goyal

Reordering Columns for Smaller Indexes

Column-oriented indexes-such as projection or bitmap indexes-are compressed by run-length encoding to reduce storage and increase speed. Sorting the tables improves compression. On realistic data sets, permuting the columns in the right…

Databases · Computer Science 2015-03-13 Daniel Lemire , Owen Kaser

ByteStore: Hybrid Layouts for Main-Memory Column Stores

The performance of main memory column stores highly depends on the scan and lookup operations on the base column layouts. Existing column-stores adopt a homogeneous column layout, leading to sub-optimal performance on real workloads since…

Databases · Computer Science 2022-09-02 Pengfei Zhang , Ziqiang Feng , Eric Lo , Hailin Qin

Scaling-Up In-Memory Datalog Processing: Observations and Techniques

Recursive query processing has experienced a recent resurgence, as a result of its use in many modern application domains, including data integration, graph analytics, security, program analysis, networking and decision making. Due to the…

Databases · Computer Science 2018-12-11 Zhiwei Fan , Jianqiao Zhu , Zuyu Zhang , Aws Albarghouthi , Paraschos Koutris , Jignesh Patel