Related papers: Sampling-Based Query Re-Optimization

A Survey on Advancing the DBMS Query Optimizer: Cardinality Estimation, Cost Model, and Plan Enumeration

The query optimizer is at the heart of database systems. The cost-based optimizer studied in this paper is adopted in almost all current database systems. A cost-based optimizer introduces a plan enumeration algorithm to find a (sub)plan, and…

Databases · Computer Science 2021-01-06 Hai Lan, Zhifeng Bao, Yuwei Peng

Adaptive Cardinality Estimation

In this paper we address the cardinality estimation problem, which is an important subproblem in query optimization. Query optimization is the part of every relational DBMS responsible for finding the best way to execute a given query.…

Databases · Computer Science 2017-11-23 Oleg Ivanov, Sergey Bartunov

How I Learned to Stop Worrying and Love Re-optimization

Cost-based query optimizers remain one of the most important components of database management systems for analytic workloads. Though modern optimizers select plans close to optimal performance in the common case, a small number of queries…

Databases · Computer Science 2019-03-20 Matthew Perron, Zeyuan Shang, Tim Kraska, Michael Stonebraker

Efficient Query Re-optimization with Judicious Subquery Selections

Query re-optimization is an adaptive query processing technique that re-invokes the optimizer at certain points in query execution. The goal is to dynamically correct the cardinality estimation errors using the statistics collected at…

Databases · Computer Science 2023-06-23 Junyi Zhao, Huanchen Zhang, Yihan Gao

Revisiting Runtime Dynamic Optimization for Join Queries in Big Data Management Systems

Query Optimization remains an open problem for Big Data Management Systems. Traditional optimizers are cost-based and use statistical estimates of intermediate result cardinalities to assign costs and pick the best plan. However, such…

Databases · Computer Science 2020-10-07 Christina Pavlopoulou, Michael J. Carey, Vassilis J. Tsotras

Least expected cost query optimization: an exercise in utility

We identify two unreasonable, though standard, assumptions made by database query optimizers that can adversely affect the quality of the chosen evaluation plans. One assumption is that it is enough to optimize for the expected case---that…

Databases · Computer Science 2007-05-23 Francis C. Chu, Joseph Y. Halpern, Praveen Seshadri

Learning to Sample: Counting with Complex Queries

We study the problem of efficiently estimating counts for queries involving complex filters, such as user-defined functions, or predicates involving self-joins and correlated subqueries. For such queries, traditional sampling techniques may…

Databases · Computer Science 2020-01-01 Brett Walenz, Stavros Sintos, Sudeepa Roy, Jun Yang

Making Databases Faster with LLM Evolutionary Sampling

Traditional query optimization relies on cost-based optimizers that estimate execution cost (e.g., runtime, memory, and I/O) using predefined heuristics and statistical models. Improving these heuristics requires substantial engineering…

Databases · Computer Science 2026-02-12 Mehmet Hamza Erol, Xiangpeng Hao, Federico Bianchi, Ciro Greco, Jacopo Tagliabue, James Zou

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization has shown…

Databases · Computer Science 2023-07-12 Ryan Marcus

Probably Approximately Optimal Query Optimization

Evaluating query predicates on data samples is the only way to estimate their selectivity in certain scenarios. Finding a guaranteed optimal query plan is not a reasonable optimization goal in those cases as it might require an infinite…

Databases · Computer Science 2015-11-06 Immanuel Trummer, Christoph Koch

Analyzing Query Optimizer Performance in the Presence and Absence of Cardinality Estimates

Most query optimizers rely on cardinality estimates to determine optimal execution plans. While traditional databases such as PostgreSQL, Oracle, and Db2 utilize many types of synopses -- including histograms, samples, and sketches --…

Databases · Computer Science 2023-11-30 Asoke Datta, Brian Tsan, Yesdaulet Izenov, Florin Rusu

Cardinality Estimation Meets Good-Turing

Cardinality estimation algorithms receive a stream of elements whose order might be arbitrary, with possible repetitions, and return the number of distinct elements. Such algorithms usually seek to minimize the required storage and…

Data Structures and Algorithms · Computer Science 2015-08-26 Reuven Cohen, Liran Katzir, Aviv Yehezkel

Random Postprocessing for Combinatorial Bayesian Optimization

Model-based sequential approaches to discrete "black-box" optimization, including Bayesian optimization techniques, often access the same points multiple times for a given objective function of interest, resulting in many steps to find the…

Machine Learning · Computer Science 2023-12-29 Keisuke Morita, Yoshihiko Nishikawa, Masayuki Ohzeki

PLANSIEVE: Real-time Suboptimal Query Plan Detection Through Incremental Refinements

Cardinality estimation remains a fundamental challenge in query optimization, often resulting in sub-optimal execution plans and degraded performance. While errors in cardinality estimation are inevitable, existing methods for identifying…

Databases · Computer Science 2025-01-29 Asoke Datta, Yesdaulet Izenov, Brian Tsan, Abylay Amanbayev, Florin Rusu

A Simple Algorithm for Worst-Case Optimal Join and Sampling

We present an elementary branch and bound algorithm with a simple analysis of why it achieves worst-case optimality for join queries on classes of databases defined respectively by cardinality or acyclic degree constraints. We then show that…

Databases · Computer Science 2024-09-24 Florent Capelli, Oliver Irwin, Sylvain Salvati

Enabling Incremental Query Re-Optimization

As declarative query processing techniques expand in scope --- to the Web, data streams, network routers, and cloud platforms --- there is an increasing need for adaptive query processing techniques that can re-plan in the presence of…

Databases · Computer Science 2014-09-23 Mengmeng Liu, Zachary G. Ives, Boon Thau Loo

Learned Offline Query Planning via Bayesian Optimization

Analytics database workloads often contain queries that are executed repeatedly. Existing optimization techniques generally prioritize keeping optimization cost low, normally well below the time it takes to execute a single instance of a…

Databases · Computer Science 2025-02-11 Jeffrey Tao, Natalie Maus, Haydn Jones, Yimeng Zeng, Jacob R. Gardner, Ryan Marcus

Optimizing Queries with Many-to-Many Joins

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin, Amol Deshpande

Iterative Optimization and Simplification of Hierarchical Clusterings

Clustering is often used for discovering structure in data. Clustering systems differ in the objective function used to evaluate clustering quality and the control strategy used to search the space of clusterings. Ideally, the search…

Artificial Intelligence · Computer Science 2014-11-17 D. Fisher

Adaptive Recursive Query Optimization

Performance-critical industrial applications, including large-scale program, network, and distributed system analyses, are increasingly reliant on recursive queries for data analysis. Yet traditional relational algebra-based query…

Databases · Computer Science 2024-03-20 Anna Herlihy, Guillaume Martres, Anastasia Ailamaki, Martin Odersky