LQRS: Learned Query Re-optimization Framework for Spark SQL
Abstract
The query optimizer is a fundamental component of database management systems. Recent studies have shown that learned query optimizers outperform traditional cost-based query optimizers. However, they fail to exploit valuable runtime observations generated during query execution to dynamically re-optimize the plan, thereby limiting further improvements in query performance. To address this issue, we propose learned query re-optimization, which allows optimization decisions to be deferred to execution time and guided by actual runtime observations. We realize this idea through LQRS, a learned query re-optimization framework that builds upon Spark SQL, exploiting runtime observations for dynamic plan refinement. Specifically, LQRS employs a curriculum reinforcement learning strategy and jointly supports pre-execution and in-execution optimization, allowing knowledge learned during execution to directly benefit pre-execution planning. Furthermore, we design a plug-and-play planner extension built upon the extensibility interfaces of Spark SQL, enabling online plan modification. Experiments on Spark SQL demonstrate that LQRS reduces end-to-end execution time by up to 90% compared to other learned query optimizers and query re-optimization methods.
Cite
@article{arxiv.2510.10580,
title = {LQRS: Learned Query Re-optimization Framework for Spark SQL},
author = {Jiahao He and Yutao Cui and Cuiping Li and Jikang Jiang and Yuheng Hou and Hong Chen},
journal= {arXiv preprint arXiv:2510.10580},
year = {2026}
}
Comments
Submitted to VLDB2026