English
Related papers

Related papers: Forecasting SQL Query Cost at Twitter

200 papers

Formulating efficient SQL queries requires several cycles of tuning and execution, particularly for inexperienced users. We examine methods that can accelerate and improve this interaction by providing insights about SQL queries prior to…

Databases · Computer Science 2020-02-24 Zainab Zolaktaf , Mostafa Milani , Rachel Pottinger

The ability to estimate resource consumption of SQL queries is crucial for a number of tasks in a database system such as admission control, query scheduling and costing during query optimization. Recent work has explored the use of…

Databases · Computer Science 2012-08-02 Jiexing Li , Arnd Christian König , Vivek Narasayya , Surajit Chaudhuri

Lakehouse systems enable the same data to be queried with multiple execution engines. However, selecting the engine best suited to run a SQL query still requires a priori knowledge of the query computational requirements and an engine…

Databases · Computer Science 2025-06-04 András Strausz , Niels Pardon , Ioana Giurgiu

Having access to realistic workloads for a given database instance is extremely important to enable stress and vulnerability testing, as well as to optimize for cost and performance. Recent advances in learned cost models have shown that…

Query processing over big data is ubiquitous in modern clouds, where the system takes care of picking both the physical query execution plans and the resources needed to run those plans, using a cost-based query optimizer. A good cost…

Databases · Computer Science 2020-03-02 Tarique Siddiqui , Alekh Jindal , Shi Qiao , Hiren Patel , Wangchao le

Accurate query runtime prediction is a critical component of effective query optimization in modern database systems. Traditional cost models, such as those used in PostgreSQL, rely on static heuristics that often fail to reflect actual…

Databases · Computer Science 2025-10-08 Utsav Pathak , Amit Mankodi

Predicting query execution time is a fundamental issue underlying many database management tasks. Existing predictors rely on information such as cardinality estimates and system performance constants that are difficult to know exactly. As…

Databases · Computer Science 2014-08-29 Wentao Wu , Xi Wu , Hakan Hacıgümüş , Jeffrey F. Naughton

Query performance prediction, the task of predicting the latency of a query, is one of the most challenging problem in database management systems. Existing approaches rely on features and performance models engineered by human experts, but…

Databases · Computer Science 2020-04-09 Ryan Marcus , Olga Papaemmanouil

Cloud data warehouses bill compute based on slot-time consumed. In shared multi-tenant environments, query cost is highly variable and hard to estimate before execution, causing budget overruns and degraded scheduling. Static query-planner…

Databases · Computer Science 2026-04-23 Prashant Kumar Pathak

We present an efficient, parametric modeling framework for predictive resource allocations, focusing on the amount of computational resources, that can optimize for a range of price-performance objectives for data analytics in serverless…

Databases · Computer Science 2021-12-17 Rathijit Sen , Abhishek Roy , Alekh Jindal

While Text-to-SQL systems achieve high accuracy, existing efficiency metrics like the Valid Efficiency Score prioritize execution time, a metric we show is fundamentally decoupled from consumption-based cloud billing. This paper evaluates…

Databases · Computer Science 2026-03-10 Saurabh Deochake , Debajyoti Mukhopadhyay

Query optimization is a pivotal part of every database management system (DBMS) since it determines the efficiency of query execution. Numerous works have introduced Machine Learning (ML) techniques to cost modeling, cardinality estimation,…

Databases · Computer Science 2023-04-11 Xianghong Xu , Zhibing Zhao , Tieying Zhang , Rong Kang , Luming Sun , Jianjun Chen

Optimizing resource allocation for analytical workloads is vital for reducing costs of cloud-data services. At the same time, it is incredibly hard for users to allocate resources per query in serverless processing systems, and they…

Traditionally, query optimizers rely on cost models to choose the best execution plan from several candidates, making precise cost estimates critical for efficient query execution. In recent years, cost models based on machine learning have…

Databases · Computer Science 2025-07-22 Roman Heinrich , Manisha Luthra , Johannes Wehrstein , Harald Kornmayer , Carsten Binnig

Predictive models are often used for real-time decision making. However, typical machine learning techniques ignore feature evaluation cost, and focus solely on the accuracy of the machine learning models obtained utilizing all the features…

Machine Learning · Computer Science 2014-08-19 Leilani Battle , Edward Benson , Aditya Parameswaran , Eugene Wu

Data lakes hold a growing amount of cold data that is infrequently accessed, yet require interactive response times. Serverless functions are seen as a way to address this use case since they offer an appealing alternative to maintaining…

Databases · Computer Science 2022-08-23 Simon Kassing , Ingo Müller , Gustavo Alonso

Predicting the execution time of queries is an important problem with applications in scheduling, service level agreements and error detection. During query planning, a cost is associated with the chosen execution plan and used to rank…

Databases · Computer Science 2019-05-03 Anthony Kleerekoper , Javier Navaridas , Mikel Lujan

The principal component of conventional database query optimizers is a cost model that is used to estimate expected performance of query plans. The accuracy of the cost model has direct impact on the optimality of execution plans selected…

Databases · Computer Science 2024-09-26 Nikita Vasilenko , Alexander Demin , Denis Ponomaryov

Modern database systems rely on cost-based query optimizers to come up with good execution plans for input queries. Such query optimizers rely on cost models to estimate the costs of candidate query execution plans. A cost model represents…

Databases · Computer Science 2024-04-02 Wentao Wu , Chi Wang

Database management systems (DBMSs) carefully optimize complex multi-join queries to avoid expensive disk I/O. As servers today feature tens or hundreds of gigabytes of RAM, a significant fraction of many analytic databases becomes…

Databases · Computer Science 2015-07-22 Feilong Liu , Spyros Blanas
‹ Prev 1 2 3 10 Next ›