Related papers: COMPARE: Accelerating Groupwise Comparison in Rela…

Subset Queries in Relational Databases

In this paper, we motivated the need for relational database systems to support subset query processing. We defined new operators in relational algebra, and new constructs in SQL for expressing subset queries. We also illustrated the…

Databases · Computer Science 2007-05-23 Satyanarayana R Valluri , Kamalakar Karlapalem

Aggregation and Ordering in Factorised Databases

A common approach to data analysis involves understanding and manipulating succinct representations of data. In earlier work, we put forward a succinct representation system for relational data called factorised databases and reported on…

Databases · Computer Science 2013-07-02 Nurzhan Bakibayev , Tomáš Kočiský , Dan Olteanu , Jakub Závodný

An improved method of delta summation for faster current value selection across filtered subsets of interval and temporal relational data

Aggregation in relational databases is accomplished through hashing and sorting interval data, which is computationally expensive and scales poorly as the data volumes grow. In this paper, we show how quantitative interval and time-series…

Databases · Computer Science 2022-11-14 Derek Colley , Md Asaduzzaman

Cortex: Harnessing Correlations to Boost Query Performance

Databases employ indexes to filter out irrelevant records, which reduces scan overhead and speeds up query execution. However, this optimization is only available to queries that filter on the indexed attribute. To extend these speedups to…

Databases · Computer Science 2020-12-15 Vikram Nathan , Jialin Ding , Tim Kraska , Mohammad Alizadeh

Cohort Query Processing

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang , Qingchao Cai , Gang Chen , H. V. Jagadish , Beng Chin Ooi , Kian-Lee Tan , Anthony K. H. Tung

A Formal Definition for Configuration

There exists a wide set of techniques to perform keyword-based search over relational databases but all of them match the keywords in the users' queries to elements of the databases to be queried as first step. The matching process is a…

Databases · Computer Science 2016-11-14 María Carmen Calvo Yanguas , Carmen Elvira Donázar , Raquel Trillo Lado

Scalable Relational Query Processing on Big Matrix Data

The use of large-scale machine learning methods is becoming ubiquitous in many applications ranging from business intelligence to self-driving cars. These methods require a complex computation pipeline consisting of various types of…

Databases · Computer Science 2021-11-10 Yongyang Yu , Mingjie Tang , Walid G. Aref

Relational Algebras for Subset Selection and Optimisation

The database community lacks a unified relational query language for subset selection and optimisation queries, limiting both user expression and query optimiser reasoning about such problems. Decades of research (latterly under the rubric…

Databases · Computer Science 2025-09-09 David Robert Pratten , Luke Mathieson , Fahimeh Ramezani

FDB: A Query Engine for Factorised Relational Databases

Factorised databases are relational databases that use compact factorised representations at the physical layer to reduce data redundancy and boost query performance. This paper introduces FDB, an in-memory query engine for…

Databases · Computer Science 2012-03-14 Nurzhan Bakibayev , Dan Olteanu , Jakub Závodný

Flare: Native Compilation for Heterogeneous Workloads in Apache Spark

The need for modern data analytics to combine relational, procedural, and map-reduce-style functional processing is widely recognized. State-of-the-art systems like Spark have added SQL front-ends and relational query optimization, which…

Databases · Computer Science 2017-03-27 Grégory M. Essertel , Ruby Y. Tahboub , James M. Decker , Kevin J. Brown , Kunle Olukotun , Tiark Rompf

Fast In-Memory SQL Analytics on Graphs

We study a class of graph analytics SQL queries, which we call relationship queries. Relationship queries are a wide superset of fixed-length graph reachability queries and of tree pattern queries. Intuitively, it discovers target entities…

Databases · Computer Science 2016-04-12 Chunbin Lin , Benjamin Mandel , Yannis Papakonstantinou , Matthias Springer

Improving Schema Matching with Linked Data

With today's public data sets containing billions of data items, more and more companies are looking to integrate external data with their traditional enterprise data to improve business intelligence analysis. These distributed data sources…

Databases · Computer Science 2012-05-16 Ahmad Assaf , Eldad Louw , Aline Senart , Corentin Follenfant , Raphaël Troncy , David Trastour

Fast and Simple Relational Processing of Uncertain Data

This paper introduces U-relations, a succinct and purely relational representation system for uncertain databases. U-relations support attribute-level uncertainty using vertical partitioning. If we consider positive relational algebra…

Databases · Computer Science 2007-07-12 Lyublena Antova , Thomas Jansen , Christoph Koch , Dan Olteanu

Modeling and In-Database Management of Relational, Data-Aware Processes (Extended Version)

During the last two decades, it has been increasingly acknowledged that the engineering of information systems usually requires a huge effort in integrating master data and business processes. This has led to a plethora of proposals, both…

Databases · Computer Science 2019-07-10 Diego Calvanese , Marco Montali , Fabio Patrizi , Andrey Rivkin

Scripting Relational Database Engine Using Transducer

We allow database user to script a parallel relational database engine with a procedural language. Procedural language code is executed as a user defined relational query operator called transducer. Transducer is tightly integrated with…

Databases · Computer Science 2018-05-14 Feng Tian

High-concurrency Custom-build Relational Database System's design and SQL parser design based on Turing-complete automata

Database system is an indispensable part of software projects. It plays an important role in data organization and storage. Its performance and efficiency are directly related to the performance of software. Nowadays, we have many general…

Databases · Computer Science 2020-08-12 WanHong Huang

Improving Database Performance by Application-side Transaction Merging

This paper explores a new opportunity to improve the performance of transaction processing at the application side by merging structurely similar statements or transactions. Concretely, we re-write transactions to 1) merge similar…

Databases · Computer Science 2026-01-16 Xueyuan Ren , Frank Li , Yang Wang

COMPARE: A Taxonomy and Dataset of Comparison Discussions in Peer Reviews

Comparing research papers is a conventional method to demonstrate progress in experimental research. We present COMPARE, a taxonomy and a dataset of comparison discussions in peer reviews of research papers in the domain of experimental…

Computation and Language · Computer Science 2021-08-11 Shruti Singh , Mayank Singh , Pawan Goyal

Optimizing Relational Queries over Array-Valued Data in Columnar Systems

Modern analytical workloads increasingly combine relational data with array-valued attributes. While columnar database systems efficiently process such workloads, their ability to optimize queries that interleave relational operators with…

Databases · Computer Science 2026-04-03 Maroua Zeblah , Etienne Couritas , Sarah Chlyah , Pierre Genevès , Nils Gesbert , Nabil Layaïda

Combining Heterogeneous Classifiers for Relational Databases

Most enterprise data is distributed in multiple relational databases with expert-designed schema. Using traditional single-table machine learning techniques over such data not only incur a computational penalty for converting to a 'flat'…

Machine Learning · Computer Science 2012-03-14 Geetha Manjunatha , M Narasimha Murty , Dinkar Sitaram