Related papers: FDB: A Query Engine for Factorised Relational Data…

Aggregation and Ordering in Factorised Databases

A common approach to data analysis involves understanding and manipulating succinct representations of data. In earlier work, we put forward a succinct representation system for relational data called factorised databases and reported on…

Databases · Computer Science 2013-07-02 Nurzhan Bakibayev , Tomáš Kočiský , Dan Olteanu , Jakub Závodný

Fast Factorized Learning: Powered by In-Memory Database Systems

Learning models over factorized joins avoids redundant computations by identifying and pre-computing shared cofactors. Previous work has investigated the performance gain when computing cofactors on traditional disk-based database systems.…

Databases · Computer Science 2025-12-11 Bernhard Stöckl , Maximilian E. Schüle

Towards a New Extracting and Querying Approach of Fuzzy Summaries

Diversification of DB applications highlighted the limitations of relational database management system (RDBMS) particularly on the modeling plan. In fact, in the real world, we are increasingly faced with the situation where applications…

Databases · Computer Science 2019-04-30 Ines Benali-Sougui , Minyar Sassi Hidri , Amel Grissa-Touzi

Query Optimization Techniques In Graph Databases

Graph databases (GDB) have recently been arisen to overcome the limits of traditional databases for storing and managing data with graph-like structure. Today, they represent a requirement for many applications that manage graph-like data,…

Databases · Computer Science 2016-09-08 Ali Ben Ammar

FactorBase: SQL for Learning A Multi-Relational Graphical Model

We describe FactorBase, a new SQL-based framework that leverages a relational database management system to support multi-relational model discovery. A multi-relational statistical model provides an integrated analysis of the heterogeneous…

Databases · Computer Science 2015-08-12 Oliver Schulte , Zhensong Qian

Multi-agent Databases via Independent Learning

Machine learning is rapidly being used in database research to improve the effectiveness of numerous tasks included but not limited to query optimization, workload scheduling, physical design, etc. Currently, the research focus has been on…

Databases · Computer Science 2022-08-08 Chi Zhang , Olga Papaemmanouil , Josiah P. Hanna , Aditya Akella

An Overview of Query Processing on Crowdsourced Databases

Crowd-sourcing is a powerful solution for finding correct answers to expensive and unanswered queries in databases, including those with uncertain and incomplete data. Attempts to use crowd-sourcing to exploit human abilities to process…

Databases · Computer Science 2022-04-19 Marwa B. Swidan , Ali A. Alwan , Yonis Gulzar , Abedallah Zaid Abualkishik

COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Data analysis often involves comparing subsets of data across many dimensions for finding unusual trends and patterns. While the comparison between subsets of data can be expressed using SQL, they tend to be complex to write, and suffer…

Databases · Computer Science 2021-07-28 Tarique Siddiqui , Surajit Chaudhuri , Vivek Narasayya

HRDBMS: Combining the Best of Modern and Traditional Relational Databases

HRDBMS is a novel distributed relational database that uses a hybrid model combining the best of traditional distributed relational databases and Big Data analytics platforms such as Hive. This allows HRDBMS to leverage years worth of…

Databases · Computer Science 2019-01-28 Jason Arnold , Boris Glavic , Ioan Raicu

KathDB: Explainable Multimodal Database Management System with Human-AI Collaboration

Traditional DBMSs execute user- or application-provided SQL queries over relational data with strong semantic guarantees and advanced query optimization, but writing complex SQL is hard and focuses only on structured tables. Contemporary…

Databases · Computer Science 2025-12-15 Guorui Xiao , Enhao Zhang , Nicole Sullivan , Will Hansen , Magdalena Balazinska

Storage, Indexing, Query Processing, and Benchmarking in Centralized and Distributed RDF Engines: A Survey

The recent advancements of the Semantic Web and Linked Data have changed the working of the traditional web. There is significant adoption of the Resource Description Framework (RDF) format for saving of web-based data. This massive…

Databases · Computer Science 2020-09-24 Waqas Ali , Muhammad Saleem , Bin Yao , Aidan Hogan , Axel-Cyrille Ngonga Ngomo

Intensional RDB for Big Data Interoperability

A new family of Intensional RDBs (IRDBs), introduced in [1], extends the traditional RDBs with the Big Data and flexible and 'Open schema' features, able to preserve the user-defined relational database schemas and all preexisting user's…

Databases · Computer Science 2014-04-11 Zoran Majkic

One button machine for automating feature engineering in relational databases

Feature engineering is one of the most important and time consuming tasks in predictive analytics projects. It involves understanding domain knowledge and data exploration to discover relevant hand-crafted features from raw data. In this…

Databases · Computer Science 2017-06-02 Hoang Thanh Lam , Johann-Michael Thiebaut , Mathieu Sinn , Bei Chen , Tiep Mai , Oznur Alkan

iPDB -- Optimizing Semantic SQL Queries

Structured Query Language (SQL) has remained the standard query language for databases. SQL is highly optimized for processing structured data laid out in relations. Meanwhile, in the present application development landscape, it is highly…

Databases · Computer Science 2026-04-24 Udesh Kumarasinghe , Tyler Liu , Ahmed R. Mahmood , Chunwei Liu , Walid G. Aref

PathDB: A system for evaluating regular path queries

PathDB is a Java-based graph database designed for in-memory data loading and querying. By utilizing Regular Path Queries (RPQ) and a closed path algebra, PathDB processes paths through its three main components: the parser, the logical…

Databases · Computer Science 2025-07-04 Roberto García , Renzo Angles , Vicente Rojas , Sebastián Ferrada

Deductive Optimization of Relational Data Storage

Optimizing the physical data storage and retrieval of data are two key database management problems. In this paper, we propose a language that can express a wide range of physical database layouts, going well beyond the row- and…

Programming Languages · Computer Science 2020-02-07 John K. Feser , Samuel Madden , Nan Tang , Armando Solar-Lezama

Extending Databases to Support Data Manipulation with Functional Dependencies: a Vision Paper

In the current paper, we propose to fuse together stored data (tables) and their functional dependencies (FDs) inside a DBMS. We aim to make FDs first-class citizens: objects which can be queried and used to query data. Our idea is to allow…

Databases · Computer Science 2020-05-19 Nikita Bobrov , Kirill Smirnov , George Chernishev

GenDB: The Next Generation of Query Processing -- Synthesized, Not Engineered

Traditional query processing relies on engines that are carefully optimized and engineered by many experts. However, new techniques and user requirements evolve rapidly, and existing systems often cannot keep pace. At the same time, these…

Databases · Computer Science 2026-03-03 Jiale Lao , Immanuel Trummer

Push vs. Pull-Based Loop Fusion in Query Engines

Database query engines use pull-based or push-based approaches to avoid the materialization of data across query operators. In this paper, we study these two types of query engines in depth and present the limitations and advantages of each…

Databases · Computer Science 2016-10-31 Amir Shaikhha , Mohammad Dashti , Christoph Koch

SharedDB: Killing One Thousand Queries With One Stone

Traditional database systems are built around the query-at-a-time model. This approach tries to optimize performance in a best-effort way. Unfortunately, best effort is not good enough for many modern applications. These applications…

Databases · Computer Science 2012-03-02 Georgios Giannikis , Gustavo Alonso , Donald Kossmann