Related papers: Columnar Storage and List-based Processing for Gra…

A computational model for analytic column stores

This work presents an abstract model for the computations performed by analytic column stores or columnar query processors. The model is based on circuits whose wires carry columns rather than scalar values, and whose nodes apply operators…

Databases · Computer Science 2019-11-13 Eyal Rozenberg

Finding a Second Wind: Speeding Up Graph Traversal Queries in RDBMSs Using Column-Oriented Processing

Recursive queries and recursive derived tables constitute an important part of the SQL standard. Their efficient processing is important for many real-life applications that rely on graph or hierarchy traversal. Position-enabled…

Databases · Computer Science 2023-08-21 Mikhail Firsov, Michael Polyntsov, Kirill Smirnov, George Chernishev

Mainlining Databases: Supporting Fast Transactional Workloads on Universal Columnar Data File Formats

The proliferation of modern data processing tools has given rise to open-source columnar data formats. The advantage of these formats is that they help organizations avoid repeatedly converting data to a new format for each application.…

Databases · Computer Science 2020-05-01 Tianyu Li, Matthew Butrovich, Amadou Ngom, Wan Shen Lim, Wes McKinney, Andrew Pavlo

Optimizations and Heuristics to improve Compression in Columnar Database Systems

In-memory columnar databases have become mainstream over the last decade and have vastly improved the fast processing of large volumes of data through multi-core parallelism and in-memory compression thereby eliminating the usual…

Databases · Computer Science 2016-09-27 Jayanth Jayanth

System G Distributed Graph Database

Motivated by the need to extract knowledge and value from interconnected data, graph analytics on big data is a very active area of research in both industry and academia. To support graph analytics efficiently a large number of in memory…

Databases · Computer Science 2018-07-10 Gabriel Tanase, Toyotaro Suzumura, Jinho Lee, Chun-Fu Chen, Jason Crawford, Hiroki Kanezashi, Song Zhang, Warut D. Vijitbenjaronk

Making RDBMSs Efficient on Graph Workloads Through Predefined Joins

Joins in native graph database management systems (GDBMSs) are predefined to the system as edges, which are indexed in adjacency list indices and serve as pointers. This contrasts with and can be more performant than value-based joins in…

Databases · Computer Science 2021-08-25 Guodong Jin, Semih Salihoglu

Query Optimization Techniques In Graph Databases

Graph databases (GDB) have recently arisen to overcome the limits of traditional databases for storing and managing data with graph-like structure. Today, they represent a requirement for many applications that manage graph-like data,…

Databases · Computer Science 2016-09-08 Ali Ben Ammar

Control Flow Duplication for Columnar Arrays in a Dynamic Compiler

Columnar databases are an established way to speed up online analytical processing (OLAP) queries. Nowadays, data processing (e.g., storage, visualization, and analytics) is often performed at the programming language level, hence it is…

Programming Languages · Computer Science 2023-02-21 Sebastian Kloibhofer, Lukas Makor, David Leopoldseder, Daniele Bonetta, Lukas Stadler, Hanspeter Mössenböck

Revisiting Data Compression in Column-Stores

Data compression is widely used in contemporary column-oriented DBMSes to lower space usage and to speed up query processing. Pioneering systems have introduced compression to tackle the disk bandwidth bottleneck by trading CPU processing…

Databases · Computer Science 2021-05-20 Alexander Slesarev, Evgeniy Klyuchikov, Kirill Smirnov, George Chernishev

Optimizing Differentially-Maintained Recursive Queries on Dynamic Graphs

Differential computation (DC) is a highly general incremental computation/view maintenance technique that can maintain the output of an arbitrary and possibly recursive dataflow computation upon changes to its base inputs. As such, it is a…

Databases · Computer Science 2022-08-02 Khaled Ammar, Siddhartha Sahu, Semih Salihoglu, M. Tamer Ozsu

Fast Updates on Read-Optimized Databases Using Multi-Core CPUs

Read-optimized columnar databases use differential updates to handle writes by maintaining a separate write-optimized delta partition which is periodically merged with the read-optimized and compressed main partition. This merge process…

Databases · Computer Science 2015-03-19 Jens Krueger, Changkyu Kim, Martin Grund, Nadathur Satish, David Schwalb, Jatin Chhugani, Hasso Plattner, Pradeep Dubey, Alexander Zeier

MorphStore: Analytical Query Engine with a Holistic Compression-Enabled Processing Model

In this paper, we present MorphStore, an open-source in-memory columnar analytical query engine with a novel holistic compression-enabled processing model. Basically, compression using lightweight integer compression algorithms already…

Databases · Computer Science 2020-04-21 Patrick Damme, Annett Ungethüm, Johannes Pietrzyk, Alexander Krause, Dirk Habich, Wolfgang Lehner

Views: a hardware-friendly graph database model for storing semantic information

The graph database (GDB) is an increasingly common storage model for data involving relationships between entries. Beyond its widespread usage in database industries, the advantages of GDBs indicate a strong potential in constructing…

Databases · Computer Science 2025-11-17 Yanjun Yang, Adrian Wheeldon, Yihan Pan, Themis Prodromakis, Alex Serb

Columnar Formats for Schemaless LSM-based Document Stores

In the last decade, document store database systems have gained more traction for storing and querying large volumes of semi-structured data. However, the flexibility of the document stores' data models has limited their ability to store…

Databases · Computer Science 2021-11-24 Wail Y. Alkowaileet, Michael J. Carey

Cohort Query Processing

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang, Qingchao Cai, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung

Relational Memory: Native In-Memory Accesses on Rows and Columns

Analytical database systems are typically designed to use a column-first data layout to access only the desired fields. On the other hand, storing data row-first works great for accessing, inserting, or updating entire rows. Transforming…

Databases · Computer Science 2022-02-08 Shahin Roozkhosh, Denis Hoornaert, Ju Hyoung Mun, Tarikul Islam Papon, Ahmed Sanaullah, Ulrich Drepper, Renato Mancuso, Manos Athanassoulis

Column-Oriented Storage Techniques for MapReduce

Users of MapReduce often run into performance problems when they scale up their workloads. Many of the problems they encounter can be overcome by applying techniques learned from over three decades of research on parallel DBMSs. However,…

Databases · Computer Science 2011-05-24 Avrilia Floratou, Jignesh Patel, Eugene Shekita, Sandeep Tata

Power Flow Analysis Using Graph based Combination of Iterative Methods and Vertex Contraction Approach

Compared with a relational database (RDB), a graph database (GDB) is a more intuitive expression of the real world. Each node in the GDB is both a storage and a logic unit. Since it is connected to its neighboring nodes through edges, and its…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-09-06 Chen Yuan, Guangyi Liu, Renchang Dai, Zhiwei Wang

The Graph Database Interface: Scaling Online Transactional and Analytical Graph Workloads to Hundreds of Thousands of Cores

Graph databases (GDBs) are crucial in academic and industry applications. The key challenges in developing GDBs are achieving high performance, scalability, programmability, and portability. To tackle these challenges, we harness…

Databases · Computer Science 2023-11-21 Maciej Besta, Robert Gerstenberger, Marc Fischer, Michał Podstawski, Nils Blach, Berke Egeli, Georgy Mitenkov, Wojciech Chlapek, Marek Michalewicz, Hubert Niewiadomski, Jürgen Müller, Torsten Hoefler

Processing a Trillion Cells per Mouse Click

Column-oriented database systems have been a real game changer for the industry in recent years. Highly tuned and performant systems have evolved that provide users with the possibility of answering ad hoc queries over large datasets in an…

Databases · Computer Science 2012-08-02 Alexander Hall, Olaf Bachmann, Robert Büssow, Silviu Gănceanu, Marc Nunkesser