Related papers: Distributed Query Processing Plans generation usin…

Lero: A Learning-to-Rank Query Optimizer

A recent line of work applies machine learning techniques to assist or rebuild cost-based query optimizers in DBMS. While exhibiting superiority in some benchmarks, their deficiencies, e.g., unstable performance, high training cost, and slow…

Databases · Computer Science 2023-02-21 Rong Zhu, Wei Chen, Bolin Ding, Xingguang Chen, Andreas Pfadler, Ziniu Wu, Jingren Zhou

Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing

Large Language Models (LLMs) have demonstrated significant potential in handling complex reasoning tasks through step-by-step rationale generation. However, recent studies have raised concerns regarding the hallucination and flaws in their…

Artificial Intelligence · Computer Science 2024-10-16 Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty

Can Large Language Models Be Query Optimizer for Relational Databases?

Query optimization, which finds the optimized execution plan for a given query, is a complex planning and decision-making problem within the exponentially growing plan space in database management systems (DBMS). Traditional optimizers…

Databases · Computer Science 2025-02-11 Jie Tan, Kangfei Zhao, Rui Li, Jeffrey Xu Yu, Chengzhi Piao, Hong Cheng, Helen Meng, Deli Zhao, Yu Rong

Token-level Proximal Policy Optimization for Query Generation

Query generation is a critical task for web search engines (e.g. Google, Bing) and recommendation systems. Recently, state-of-the-art query generation methods leverage Large Language Models (LLMs) for their strong capabilities in context…

Machine Learning · Computer Science 2024-11-04 Yichen Ouyang, Lu Wang, Fangkai Yang, Pu Zhao, Chenghua Huang, Jianfeng Liu, Bochen Pang, Yaming Yang, Yuefeng Zhan, Hao Sun, Qingwei Lin, Saravan Rajmohan, Weiwei Deng, Dongmei Zhang, Feng Sun, Qi Zhang

Learned Query Superoptimization

Traditional query optimizers are designed to be fast and stateless: each query is quickly optimized using approximate statistics, sent off to the execution engine, and promptly forgotten. Recent work on learned query optimization has shown…

Databases · Computer Science 2023-07-12 Ryan Marcus

Guided Automated Learning for query workload re-Optimization

Query optimization is a hallmark of database systems enabling complex SQL queries of today's applications to be run efficiently. The query optimizer often fails to find the best plan, when logical subtleties in business queries and schemas…

Databases · Computer Science 2019-05-23 Guilherme Damasio, Vincent Corvinelli, Parke Godfrey, Piotr Mierzejewski, Alexandar Mihaylov, Jaroslaw Szlichta, Calisto Zuzarte

Plan-Structured Deep Neural Network Models for Query Performance Prediction

Query performance prediction, the task of predicting the latency of a query, is one of the most challenging problems in database management systems. Existing approaches rely on features and performance models engineered by human experts, but…

Databases · Computer Science 2020-04-09 Ryan Marcus, Olga Papaemmanouil

Towards a Hands-Free Query Optimizer through Deep Learning

Query optimization remains one of the most important and well-studied problems in database systems. However, traditional query optimizers are complex heuristically-driven systems, requiring large amounts of time to tune for a particular…

Databases · Computer Science 2018-12-19 Ryan Marcus, Olga Papaemmanouil

Toward Efficient Online Scheduling for Distributed Machine Learning Systems

Recent years have witnessed a rapid growth of distributed machine learning (ML) frameworks, which exploit the massive parallelism of computing clusters to expedite ML training. However, the proliferation of distributed ML frameworks also…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-05-16 Menglu Yu, Jia Liu, Chuan Wu, Bo Ji, Elizabeth S. Bentley

Distributed learning of deep neural network over multiple agents

In domains such as health care and finance, shortage of labeled data and computational resources is a critical issue while developing machine learning algorithms. To address the issue of labeled data scarcity in training and deployment of…

Machine Learning · Computer Science 2018-10-16 Otkrist Gupta, Ramesh Raskar

Neo: A Learned Query Optimizer

Query optimization is one of the most challenging problems in database systems. Despite the progress made over the past decades, query optimizers remain extremely complex components that require a great deal of hand-tuning for specific…

Databases · Computer Science 2020-04-09 Ryan Marcus, Parimarjan Negi, Hongzi Mao, Chi Zhang, Mohammad Alizadeh, Tim Kraska, Olga Papaemmanouil, Nesime Tatbul

Tight Bounds for Online Balanced Partitioning in the Generalized Learning Model

Resource allocation in distributed and networked systems such as the Cloud is becoming increasingly flexible, allowing these systems to dynamically adjust toward the workloads they serve, in a demand-aware manner. Online balanced…

Data Structures and Algorithms · Computer Science 2024-10-24 Harald Räcke, Stefan Schmid, Ruslan Zabrodin

CARPO: Leveraging Listwise Learning-to-Rank for Context-Aware Query Plan Optimization

Efficient data processing is increasingly vital, with query optimizers playing a fundamental role in translating SQL queries into optimal execution plans. Traditional cost-based optimizers, however, often generate suboptimal plans due to…

Databases · Computer Science 2025-10-22 Wenrui Zhou, Qiyu Liu, Jingshu Peng, Aoqian Zhang, Lei Chen

Learning Based Distributed Tracking

Inspired by the great success of machine learning in the past decade, people have been thinking about the possibility of improving the theoretical results by exploring data distribution. In this paper, we revisit a fundamental problem…

Data Structures and Algorithms · Computer Science 2020-06-24 Hao Wu, Junhao Gan, Rui Zhang

Software Module Clustering based on the Fuzzy Adaptive Teaching Learning based Optimization Algorithm

Although showing competitive performances in many real-world optimization problems, Teaching Learning based Optimization Algorithm (TLBO) has been criticized for having poor control on exploration and exploitation. Addressing these issues,…

Software Engineering · Computer Science 2019-03-01 Kamal Z. Zamli, Fakhrud Din, Nazirah Ramli, Bestoun S. Ahmed

Online Distributed Learning with Quantized Finite-Time Coordination

In this paper we consider online distributed learning problems. Online distributed learning refers to the process of training learning models on distributed data sources. In our setting a set of agents need to cooperatively train a learning…

Machine Learning · Computer Science 2024-05-07 Nicola Bastianello, Apostolos I. Rikos, Karl H. Johansson

Automated Training of Learned Database Components with Generative AI

The use of deep learning for database optimization has gained significant traction, offering improvements in indexing, cardinality estimation, and query optimization. However, acquiring high-quality training data remains a significant…

Databases · Computer Science 2025-12-24 Angjela Davitkova, Sebastian Michel

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu, Sinno Jialin Pan, Qirong Ho

Kepler: Robust Learning for Faster Parametric Query Optimization

Most existing parametric query optimization (PQO) techniques rely on traditional query optimizer cost models, which are often inaccurate and result in suboptimal query performance. We propose Kepler, an end-to-end learning-based approach to…

Databases · Computer Science 2023-10-20 Lyric Doshi, Vincent Zhuang, Gaurav Jain, Ryan Marcus, Haoyu Huang, Deniz Altinbüken, Eugene Brevdo, Campbell Fraser

Distributed Training and Optimization Of Neural Networks

Deep learning models are yielding increasingly better performance thanks to multiple factors. To be successful, a model may have a large number of parameters or a complex architecture and be trained on a large dataset. This leads to large…

Machine Learning · Computer Science 2022-12-20 Jean-Roch Vlimant, Junqi Yin