Related papers: Efficient Query Re-optimization with Judicious Sub…

How I Learned to Stop Worrying and Love Re-optimization

Cost-based query optimizers remain one of the most important components of database management systems for analytic workloads. Though modern optimizers select plans close to optimal performance in the common case, a small number of queries…

Databases · Computer Science 2019-03-20 Matthew Perron, Zeyuan Shang, Tim Kraska, Michael Stonebraker

Sampling-Based Query Re-Optimization

Despite decades of work, query optimizers still make mistakes on "difficult" queries because of bad cardinality estimates, often due to the interaction of multiple predicates and correlations in the data. In this paper, we propose a…

Databases · Computer Science 2016-01-22 Wentao Wu, Jeffrey F. Naughton, Harneet Singh

Redefining The Query Optimization Process

Traditionally, query optimizers have been designed for computer systems that share a common architecture, consisting of a CPU, main memory and disk subsystem. The efficiency of query optimizers and their successful employment relied on the…

Databases · Computer Science 2022-03-03 K. F. D. Rietveld, H. A. G. Wijshoff

Enabling Incremental Query Re-Optimization

As declarative query processing techniques expand in scope (to the Web, data streams, network routers, and cloud platforms), there is an increasing need for adaptive query processing techniques that can re-plan in the presence of…

Databases · Computer Science 2014-09-23 Mengmeng Liu, Zachary G. Ives, Boon Thau Loo

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

Query Optimization remains an open problem for Big Data Management Systems. Traditional optimizers are cost-based and use statistical estimates of intermediate result cardinalities to assign costs and pick the best plan. However, such…

Databases · Computer Science 2020-10-07 Christina Pavlopoulou, Michael J. Carey, Vassilis J. Tsotras

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization has shown…

Databases · Computer Science 2023-07-12 Ryan Marcus

Adaptive Recursive Query Optimization

Performance-critical industrial applications, including large-scale program, network, and distributed system analyses, are increasingly reliant on recursive queries for data analysis. Yet traditional relational algebra-based query…

Databases · Computer Science 2024-03-20 Anna Herlihy, Guillaume Martres, Anastasia Ailamaki, Martin Odersky

Towards a Hands-Free Query Optimizer through Deep Learning

Query optimization remains one of the most important and well-studied problems in database systems. However, traditional query optimizers are complex heuristically-driven systems, requiring large amounts of time to tune for a particular…

Databases · Computer Science 2018-12-19 Ryan Marcus, Olga Papaemmanouil

GenJoin: Conditional Generative Plan-to-Plan Query Optimizer that Learns from Subplan Hints

Query optimization has become a research area where classical algorithms are being challenged by machine learning algorithms. At the same time, recent trends in learned query optimizers have shown that it is prudent to take advantage of…

Databases · Computer Science 2025-06-16 Pavel Sulimov, Claude Lehmann, Kurt Stockinger

Parallelizing Query Optimization on Shared-Nothing Architectures

Data processing systems offer an ever increasing degree of parallelism on the levels of cores, CPUs, and processing nodes. Query optimization must exploit high degrees of parallelism in order not to gradually become the bottleneck of query…

Databases · Computer Science 2015-11-06 Immanuel Trummer, Christoph Koch

Pseudo-Query Reformulation

Automatic query reformulation refers to rewriting a user's original query in order to improve the ranking of retrieval results compared to the original query. We present a general framework for automatic query reformulation based on…

Information Retrieval · Computer Science 2015-07-15 Fernando Diaz

Optimizing Queries with Many-to-Many Joins

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin, Amol Deshpande

QUEST: Query Optimization in Unstructured Document Analysis

Most recently, researchers have started building large language model (LLM)-powered data systems that allow users to analyze unstructured text documents as if working with a database, because LLMs are very effective in extracting attributes…

Databases · Computer Science 2025-07-14 Zhaoze Sun, Qiyan Deng, Chengliang Chai, Kaisen Jin, Xinyu Guo, Han Han, Ye Yuan, Guoren Wang, Lei Cao

Optimizing Subgraph Queries by Combining Binary and Worst-Case Optimal Joins

We study the problem of optimizing subgraph queries using the new worst-case optimal join plans. Worst-case optimal plans evaluate queries by matching one query vertex at a time using multiway intersections. The core problem in optimizing…

Databases · Computer Science 2019-06-04 Amine Mhedhbi, Semih Salihoglu

Probably Approximately Optimal Query Optimization

Evaluating query predicates on data samples is the only way to estimate their selectivity in certain scenarios. Finding a guaranteed optimal query plan is not a reasonable optimization goal in those cases as it might require an infinite…

Databases · Computer Science 2015-11-06 Immanuel Trummer, Christoph Koch

LQRS: Learned Query Re-optimization Framework for Spark SQL

The query optimizer is a fundamental component of database management systems. Recent studies have shown that learned query optimizers outperform traditional cost-based query optimizers. However, they fail to exploit valuable runtime…

Databases · Computer Science 2026-03-05 Jiahao He, Yutao Cui, Cuiping Li, Jikang Jiang, Yuheng Hou, Hong Chen

Learned Offline Query Planning via Bayesian Optimization

Analytics database workloads often contain queries that are executed repeatedly. Existing optimization techniques generally prioritize keeping optimization cost low, normally well below the time it takes to execute a single instance of a…

Databases · Computer Science 2025-02-11 Jeffrey Tao, Natalie Maus, Haydn Jones, Yimeng Zeng, Jacob R. Gardner, Ryan Marcus

Deep Reinforcement Learning for Join Order Enumeration

Join order selection plays a significant role in query performance. However, modern query optimizers typically employ static join enumeration algorithms that do not receive any feedback about the quality of the resulting plan. Hence,…

Databases · Computer Science 2018-09-28 Ryan Marcus, Olga Papaemmanouil

One Join Order Does Not Fit All: Reducing Intermediate Results with Per-Split Query Plans

Minimizing intermediate results is critical for efficient multi-join query processing. Although the seminal Yannakakis algorithm offers strong guarantees for acyclic queries, cyclic queries remain an open challenge. In this paper, we…

Databases · Computer Science 2025-10-30 Yujun He, Hangdong Zhao, Simon Frisk, Yifei Yang, Kevin Kristensen, Paraschos Koutris, Xiangyao Yu

ReSplit: Improving the Structure of Jupyter Notebooks by Re-Splitting Their Cells

Jupyter notebooks represent a unique format for programming - a combination of code and Markdown with rich formatting, separated into individual cells. We propose to perceive a Jupyter Notebook cell as a simplified and raw version of a…

Software Engineering · Computer Science 2022-01-03 Sergey Titov, Yaroslav Golubev, Timofey Bryksin