Related papers: Online Sketch-based Query Optimization

200 papers

Sketches have shown high accuracy in multi-way join cardinality estimation, a critical problem in cost-based query optimization. Accurately estimating the cardinality of a join operation -- analogous to its computational cost -- allows the…

Databases · Computer Science 2025-06-18 Brian Tsan, Abylay Amanbayev, Asoke Datta, Florin Rusu

With the increasing rate of data generated by critical systems, estimating functions on streaming data has become essential. This demand has driven numerous advancements in algorithms designed to efficiently query and analyze one or more…

Databases · Computer Science 2024-05-16 Mike Heddes, Igor Nunes, Tony Givargis, Alex Nicolau

In this paper we address the cardinality estimation problem, an important subproblem in query optimization. Query optimization is the part of every relational DBMS responsible for finding the best way to execute a given query.…

Databases · Computer Science 2017-11-23 Oleg Ivanov, Sergey Bartunov

The query optimizer is at the heart of database systems. The cost-based optimizer studied in this paper is adopted in almost all current database systems. A cost-based optimizer introduces a plan enumeration algorithm to find a (sub)plan, and…

Databases · Computer Science 2021-01-06 Hai Lan, Zhifeng Bao, Yuwei Peng

Join ordering is a key factor in query performance, yet traditional cost-based optimizers often produce sub-optimal plans due to inaccurate cardinality estimates in multi-predicate, multi-join queries. Existing alternatives such as…

Databases · Computer Science 2025-08-26 David Justen, Matthias Boehm

Interactive analytics increasingly involves querying for quantiles over sub-populations of high cardinality datasets. Data processing engines such as Druid and Spark use mergeable summaries to estimate quantiles, but summary merge times can…

Databases · Computer Science 2018-07-17 Edward Gan, Jialin Ding, Kai Sheng Tai, Vatsal Sharan, Peter Bailis

Cost-based query optimizers remain one of the most important components of database management systems for analytic workloads. Though modern optimizers select plans close to optimal performance in the common case, a small number of queries…

Databases · Computer Science 2019-03-20 Matthew Perron, Zeyuan Shang, Tim Kraska, Michael Stonebraker

A key need in different disciplines is to perform analytics over fast-paced data streams, similar in nature to the traditional OLAP analytics in relational databases, i.e., with filters and aggregates. Storing unbounded streams, however, is…

Databases · Computer Science 2023-09-13 Wieger R. Punter, Odysseas Papapetrou, Minos Garofalakis

The principal component of conventional database query optimizers is a cost model that is used to estimate the expected performance of query plans. The accuracy of the cost model has a direct impact on the optimality of execution plans selected…

Databases · Computer Science 2024-09-26 Nikita Vasilenko, Alexander Demin, Denis Ponomaryov

Unreliable cardinality estimation remains a critical performance bottleneck in database management systems (DBMSs). Adaptive Query Processing (AQP) strategies address this limitation by providing a more robust query execution mechanism.…

Databases · Computer Science 2025-11-21 Pei Mu, Anderson Chaves Carniel, Antonio Barbalace, Amir Shaikhha

Despite decades of work, query optimizers still make mistakes on "difficult" queries because of bad cardinality estimates, often due to the interaction of multiple predicates and correlations in the data. In this paper, we propose a…

Databases · Computer Science 2016-01-22 Wentao Wu, Jeffrey F. Naughton, Harneet Singh

Sketch-based streaming algorithms allow efficient processing of big data. These algorithms use small fixed-size storage to store a summary ("sketch") of the input data, and use probabilistic algorithms to estimate the desired quantity.…

Databases · Computer Science 2016-11-08 Reuven Cohen, Liran Katzir, Aviv Yehezkel

HPC systems expose many configuration parameters that jointly drive competing objectives. Existing tools such as autotuners recommend good configurations but do not identify minimal changes for a near-miss configuration to meet a…

Performance · Computer Science 2026-04-28 Ankur Lahiry, Banooqa Banday, Yugesh Bhattarai, Mohammad Zaeed, Tanzima Z. Islam

We study a class of graph analytics SQL queries, which we call relationship queries. Relationship queries are a wide superset of fixed-length graph reachability queries and of tree pattern queries. Intuitively, a relationship query discovers target entities…

Databases · Computer Science 2016-04-12 Chunbin Lin, Benjamin Mandel, Yannis Papakonstantinou, Matthias Springer

In modern large-scale distributed systems, analytics jobs submitted by various users often share similar work, for example scanning and processing the same subset of data. Instead of optimizing jobs independently, which may result in…

Databases · Computer Science 2018-05-23 Pietro Michiardi, Damiano Carra, Sara Migliorini

We study the problem of optimizing subgraph queries using the new worst-case optimal join plans. Worst-case optimal plans evaluate queries by matching one query vertex at a time using multiway intersections. The core problem in optimizing…

Databases · Computer Science 2019-06-04 Amine Mhedhbi, Semih Salihoglu

Data sketches balance resource efficiency with controllable approximations for extracting features in high-volume, high-rate data. Two important points of interest are highlighted separately in recent works; namely, to (1) answer multiple…

Data Structures and Algorithms · Computer Science 2025-07-08 Martin Hilgendorf, Marina Papatriantafilou

Estimating cardinality, i.e., the number of distinct elements, of a data stream is a fundamental problem in areas like databases, computer networks, and information retrieval. This study delves into a broader scenario where each element…

Databases · Computer Science 2024-06-28 Yiyan Qi, Rundong Li, Pinghui Wang, Yufang Sun, Rui Xing

Selectivity estimation remains a critical task in query optimization even after decades of research and industrial development. Optimizers rely on accurate selectivities when generating execution plans. They maintain a large range of…

Databases · Computer Science 2019-01-08 Jun Hyung Shin, Florin Rusu, Alex Suhan

Over the past few years, research and development has made significant progress on big data analytics. A fundamental issue for big data analytics is efficiency. If the optimal solution is unattainable, not required, or has a…

Databases · Computer Science 2019-01-03 Shuai Ma, Jinpeng Huai