Related papers: Instance-Optimal Acyclic Join Processing Without R…

One Join Order Does Not Fit All: Reducing Intermediate Results with Per-Split Query Plans

Minimizing intermediate results is critical for efficient multi-join query processing. Although the seminal Yannakakis algorithm offers strong guarantees for acyclic queries, cyclic queries remain an open challenge. In this paper, we…

Databases · Computer Science 2025-10-30 Yujun He, Hangdong Zhao, Simon Frisk, Yifei Yang, Kevin Kristensen, Paraschos Koutris, Xiangyao Yu

Output-sensitive Conjunctive Query Evaluation

Join evaluation is one of the most fundamental operations performed by database systems and arguably the most well-studied problem in the Database community. A staggering number of join algorithms have been developed, and commercial…

Databases · Computer Science 2024-10-25 Shaleen Deep, Hangdong Zhao, Austen Z. Fan, Paraschos Koutris

Output-Optimal Algorithms for Join-Aggregate Queries

One of the most celebrated results of computing join-aggregate queries defined over commutative semi-rings is the classic Yannakakis algorithm proposed in 1981. It is known that the runtime of the Yannakakis algorithm is $O(N + \mathrm{OUT})$ for…

Databases · Computer Science 2025-03-13 Xiao Hu

Database Theory in Action: Yannakakis' Algorithm

Yannakakis' seminal algorithm is optimal for acyclic joins, yet it has not been widely adopted due to its poor performance in practice. This paper briefly surveys recent advancements in making Yannakakis' algorithm more practical, in terms…

Databases · Computer Science 2026-01-13 Paraschos Koutris, Stijn Vansummeren, Qichen Wang, Yisu Remy Wang, Xiangyao Yu

Algorithms for Optimizing Acyclic Queries

Most research on query optimization has centered on binary join algorithms like hash join and sort-merge join. However, recent years have seen growing interest in theoretically optimal algorithms, notably Yannakakis' algorithm. These…

Databases · Computer Science 2026-01-09 Zheng Luo, Wim Van den Broeck, Guy Van den Broeck, Yisu Remy Wang

Yannakakis+: Practical Acyclic Query Evaluation with Theoretical Guarantees

Acyclic conjunctive queries form the backbone of most analytical workloads, and have been extensively studied in the literature from both theoretical and practical angles. However, there is still a large divide between theory and practice.…

Databases · Computer Science 2025-04-07 Qichen Wang, Bingnan Chen, Binyang Dai, Ke Yi, Feifei Li, Liang Lin

Instance and Output Optimal Parallel Algorithms for Acyclic Joins

Massively parallel join algorithms have received much attention in recent years, while most prior work has focused on worst-optimal algorithms. However, the worst-case optimality of these join algorithms relies on hard instances having very…

Databases · Computer Science 2019-04-01 Xiao Hu, Ke Yi

SnAKe: Bayesian Optimization with Pathwise Exploration

Bayesian Optimization is a very effective tool for optimizing expensive black-box functions. Inspired by applications developing and characterizing reaction chemistry using droplet microfluidic reactors, we consider a novel setting where…

Machine Learning · Computer Science 2023-01-12 Jose Pablo Folch, Shiqiang Zhang, Robert M Lee, Behrang Shafei, David Walz, Calvin Tsay, Mark van der Wilk, Ruth Misener

Selective Use of Yannakakis' Algorithm to Improve Query Performance: Machine Learning to the Rescue

Query optimization has played a central role in database research for decades. However, more often than not, the proposed optimization techniques lead to a performance improvement in some, but not in all, situations. Therefore, we urgently…

Databases · Computer Science 2025-06-23 Daniela Böhm, Georg Gottlob, Matthias Lanzinger, Davide Longo, Cem Okulmus, Reinhard Pichler, Alexander Selzer

Succinct Structure Representations for Efficient Query Optimization

Structural decomposition methods offer powerful theoretical guarantees for join evaluation, yet they are rarely used in real-world query optimizers. A major reason is the difficulty of combining cost-based plan search and structure-based…

Databases · Computer Science 2026-03-17 Zhekai Jiang, Qichen Wang, Christoph Koch

Optimal Join Algorithms Meet Top-k

Top-k queries have been studied intensively in the database community and they are an important means to reduce query cost when only the "best" or "most interesting" results are needed instead of the full output. While some optimality…

Databases · Computer Science 2020-05-04 Nikolaos Tziavelis, Wolfgang Gatterbauer, Mirek Riedewald

Snoopy: Effective and Efficient Semantic Join Discovery via Proxy Columns

Semantic join discovery, which aims to find columns in a table repository with high semantic joinabilities to a query column, is crucial for dataset discovery. Existing methods can be divided into two categories: cell-level methods and…

Databases · Computer Science 2025-02-25 Yuxiang Guo, Yuren Mao, Zhonghao Hu, Lu Chen, Yunjun Gao

Structure-Guided Query Evaluation: Towards Bridging the Gap from Theory to Practice

Join queries involving many relations pose a severe challenge to today's query optimisation techniques. To some extent, this is due to the fact that these techniques do not pay sufficient attention to structural properties of the query. In…

Databases · Computer Science 2023-05-23 Georg Gottlob, Matthias Lanzinger, Davide Mario Longo, Cem Okulmus, Reinhard Pichler, Alexander Selzer

High-Dimensional Yield Estimation using Shrinkage Deep Features and Maximization of Integral Entropy Reduction

Despite the fast advances in high-sigma yield analysis with the help of machine learning techniques in the past decade, one of the main challenges, the curse of dimensionality, which is inevitable when dealing with modern large-scale…

Computational Engineering, Finance, and Science · Computer Science 2022-12-06 Shuo Yin, Guohao Dai, Wei W. Xing

Conjunctive Queries with Theta Joins Under Updates

Modern application domains such as Composite Event Recognition (CER) and real-time Analytics require the ability to dynamically refresh query results under high update rates. Traditional approaches to this problem are based either on the…

Databases · Computer Science 2019-05-27 Muhammad Idris, Martín Ugarte, Stijn Vansummeren, Hannes Voigt, Wolfgang Lehner

Optimizing Queries with Many-to-Many Joins

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin, Amol Deshpande

Skew Strikes Back: New Developments in the Theory of Join Algorithms

Evaluating the relational join is one of the central algorithmic and most well-studied problems in database systems. A staggering number of variants have been considered including Block-Nested loop join, Hash-Join, Grace, Sort-merge for…

Databases · Computer Science 2013-10-17 Hung Q. Ngo, Christopher Re, Atri Rudra

SYNTHESIS: A Semi-Asynchronous Path-Integrated Stochastic Gradient Method for Distributed Learning in Computing Clusters

To increase the training speed of distributed learning, recent years have witnessed a significant amount of interest in developing both synchronous and asynchronous distributed stochastic variance-reduced optimization methods. However, all…

Machine Learning · Computer Science 2022-08-30 Zhuqing Liu, Xin Zhang, Jia Liu

Fast Solving Complete 2000-Node Optimization Using Stochastic-Computing Simulated Annealing

In this paper, we evaluate stochastic-computing simulated annealing (SC-SA) for solving large-scale combinatorial optimization problems. SC-SA is designed using stochastic computing, where the computation is realized using random bitstream,…

Optimization and Control · Mathematics 2026-03-24 Kota Katsuki, Duckgyu Shin, Naoya Onizawa, Takahiro Hanyu

HoneyComb: A Parallel Worst-Case Optimal Join on Multicores

To achieve true scalability on massive datasets, a modern query engine needs to be able to take advantage of large, shared-memory, multicore systems. Binary joins are conceptually easy to parallelize on a multicore system; however, several…

Databases · Computer Science 2025-02-11 Jiacheng Wu, Dan Suciu