Related papers: Optimizing Cursor Loops in Relational Databases

Memory-Efficient Group-by Aggregates over Multi-Way Joins

Aggregate computation in relational databases has long been done using the standard unary aggregation and binary join operators. These implement the classical model of computing joins between relations two at a time, materializing the…

Databases · Computer Science 2019-06-18 Konstantinos Xirogiannopoulos, Amol Deshpande

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

Query Optimization remains an open problem for Big Data Management Systems. Traditional optimizers are cost-based and use statistical estimates of intermediate result cardinalities to assign costs and pick the best plan. However, such…

Databases · Computer Science 2020-10-07 Christina Pavlopoulou, Michael J. Carey, Vassilis J. Tsotras

Guided Automated Learning for query workload re-Optimization

Query optimization is a hallmark of database systems enabling complex SQL queries of today's applications to be run efficiently. The query optimizer often fails to find the best plan, when logical subtleties in business queries and schemas…

Databases · Computer Science 2019-05-23 Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte

Optimization of Imperative Programs in a Relational Database

For decades, RDBMSs have supported declarative SQL as well as imperative functions and procedures as ways for users to express data processing tasks. While the evaluation of declarative SQL has received a lot of attention resulting in…

Databases · Computer Science 2019-08-21 Karthik Ramachandra, Kwanghyun Park, K. Venkatesh Emani, Alan Halverson, Cesar Galindo-Legaria, Conor Cunningham

Multi-layer Optimizations for End-to-End Data Analytics

We consider the problem of training machine learning models over multi-relational data. The mainstream approach is to first construct the training dataset using a feature extraction query over input database and then use a statistical…

Programming Languages · Computer Science 2020-01-13 Amir Shaikhha, Maximilian Schleich, Alexandru Ghita, Dan Olteanu

Collectively Simplifying Trajectories in a Database: A Query Accuracy Driven Approach

Increasing and massive volumes of trajectory data are being accumulated that may serve a variety of applications, such as mining popular routes or identifying ridesharing candidates. As storing and querying massive trajectory data is…

Databases · Computer Science 2023-12-14 Zheng Wang, Cheng Long, Gao Cong, Christian S. Jensen

Ranked Enumeration of Join Queries with Projections

Join query evaluation with ordering is a fundamental data processing task in relational database management systems. SQL and custom graph query languages such as Cypher offer this functionality by allowing users to specify the order via the…

Databases · Computer Science 2022-01-25 Shaleen Deep, Xiao Hu, Paraschos Koutris

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization has shown…

Databases · Computer Science 2023-07-12 Ryan Marcus

Scalable Relational Query Processing on Big Matrix Data

The use of large-scale machine learning methods is becoming ubiquitous in many applications ranging from business intelligence to self-driving cars. These methods require a complex computation pipeline consisting of various types of…

Databases · Computer Science 2021-11-10 Yongyang Yu, Mingjie Tang, Walid G. Aref

SOFA: An Extensible Logical Optimizer for UDF-heavy Dataflows

Recent years have seen an increased interest in large-scale analytical dataflows on non-relational data. These dataflows are compiled into execution graphs scheduled on large compute clusters. In many novel application areas the predominant…

Databases · Computer Science 2013-11-26 Astrid Rheinländer, Arvid Heise, Fabian Hueske, Ulf Leser, Felix Naumann

Optimizing Queries with Many-to-Many Joins

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin, Amol Deshpande

Deploying a Steered Query Optimizer in Production at Microsoft

Modern analytical workloads are highly heterogeneous and massively complex, making generic query optimizers untenable for many customers and scenarios. As a result, it is important to specialize these optimizers to instances of the…

Databases · Computer Science 2022-10-26 Wangda Zhang, Matteo Interlandi, Paul Mineiro, Shi Qiao, Nasim Ghazanfari, Karlen Lie, Marc Friedman, Rafah Hosn, Hiren Patel, Alekh Jindal

COMPARE: Accelerating Groupwise Comparison in Relational Databases for Data Analytics

Data analysis often involves comparing subsets of data across many dimensions for finding unusual trends and patterns. While the comparison between subsets of data can be expressed using SQL, they tend to be complex to write, and suffer…

Databases · Computer Science 2021-07-28 Tarique Siddiqui, Surajit Chaudhuri, Vivek Narasayya

Towards a Hands-Free Query Optimizer through Deep Learning

Query optimization remains one of the most important and well-studied problems in database systems. However, traditional query optimizers are complex heuristically-driven systems, requiring large amounts of time to tune for a particular…

Databases · Computer Science 2018-12-19 Ryan Marcus, Olga Papaemmanouil

Juggling Functions Inside a Database

We define and study the Functional Aggregate Query (FAQ) problem, which captures common computational tasks across a very wide range of domains including relational databases, logic, matrix and tensor computation, probabilistic graphical…

Databases · Computer Science 2017-03-10 Mahmoud Abo Khamis, Hung Q. Ngo, Atri Rudra

Beyond Relevant Documents: A Knowledge-Intensive Approach for Query-Focused Summarization using Large Language Models

Query-focused summarization (QFS) is a fundamental task in natural language processing with broad applications, including search engines and report generation. However, traditional approaches assume the availability of relevant documents,…

Computation and Language · Computer Science 2024-08-21 Weijia Zhang, Jia-Hong Huang, Svitlana Vakulenko, Yumo Xu, Thilina Rajapakse, Evangelos Kanoulas

Approximate Computation for Big Data Analytics

Over the past few years, research and development has made significant progress on big data analytics. A fundamental issue for big data analytics is efficiency. If the optimal solution is unable to attain or not required or has a…

Databases · Computer Science 2019-01-03 Shuai Ma, Jinpeng Huai

Cohort Query Processing

Modern Internet applications often produce a large volume of user activity records. Data analysts are interested in cohort analysis, or finding unusual user behavioral trends, in these large tables of activity records. In a traditional…

Databases · Computer Science 2016-05-05 Dawei Jiang, Qingchao Cai, Gang Chen, H. V. Jagadish, Beng Chin Ooi, Kian-Lee Tan, Anthony K. H. Tung

Push vs. Pull-Based Loop Fusion in Query Engines

Database query engines use pull-based or push-based approaches to avoid the materialization of data across query operators. In this paper, we study these two types of query engines in depth and present the limitations and advantages of each…

Databases · Computer Science 2016-10-31 Amir Shaikhha, Mohammad Dashti, Christoph Koch

Efficient Query Re-optimization with Judicious Subquery Selections

Query re-optimization is an adaptive query processing technique that re-invokes the optimizer at certain points in query execution. The goal is to dynamically correct the cardinality estimation errors using the statistics collected at…

Databases · Computer Science 2023-06-23 Junyi Zhao, Huanchen Zhang, Yihan Gao