Related papers: Multi-Agent Join

Fast Join Project Query Evaluation using Matrix Multiplication

In the last few years, much effort has been devoted to developing join algorithms in order to achieve worst-case optimality for join queries over relational databases. Towards this end, the database community has had considerable success in…

Databases · Computer Science 2020-03-02 Shaleen Deep , Xiao Hu , Paraschos Koutris

Optimizing Queries with Many-to-Many Joins

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin , Amol Deshpande

Runtime-optimized Multi-way Stream Join Operator for Large-scale Streaming data

Streaming computing enables the real-time processing of large volumes of data and offers significant advantages for various applications, including real-time recommendations, anomaly detection, and monitoring. The multi-way stream join…

Databases · Computer Science 2024-11-26 Jinlong Hu , Tingfeng Qiu

Implementing Semantic Join Operators Efficiently

Semantic query processing engines often support semantic joins, enabling users to match rows that satisfy conditions specified in natural language. Such join conditions can be evaluated using large language models (LLMs) that solve novel…

Databases · Computer Science 2025-10-10 Immanuel Trummer

Efficient Join Processing Over Incomplete Data Streams (Technical Report)

For decades, the join operator over fast data streams has always drawn much attention from the database community, due to its wide spectrum of real-world applications, such as online clustering, intrusion detection, sensor data monitoring,…

Databases · Computer Science 2019-08-26 Weilong Ren , Xiang Lian , Kambiz Ghazinour

Fast Distributed Complex Join Processing

In this work, we study the problem of co-optimize communication, pre-computing, and computation cost in one-round multi-way join evaluation. We propose a multi-way join approach ADJ (Adaptive Distributed Join) for complex join which finds…

Databases · Computer Science 2021-03-01 Hao Zhang , Miao Qiao , Jeffrey Xu Yu , Hong Cheng

Three-Way Joins on MapReduce: An Experimental Study

We study three-way joins on MapReduce. Joins are very useful in a multitude of applications from data integration and traversing social networks, to mining graphs and automata-based constructions. However, joins are expensive, even for…

Databases · Computer Science 2014-05-19 Ben Kimmett , Alex Thomo , S. Venkatesh

Efficiently Transforming Tables for Joinability

Data from different sources rarely conform to a single formatting even if they describe the same set of entities, and this raises concerns when data from multiple sources must be joined or cross-referenced. Such a formatting mismatch is…

Databases · Computer Science 2022-03-08 Arash Dargahi Nobari , Davood Rafiei

Towards Efficient Random-Order Enumeration for Join Queries

In many data analysis pipelines, a basic and time-consuming process is to produce join results and feed them into downstream tasks. Numerous enumeration algorithms have been developed for this purpose. To be a statistically meaningful…

Databases · Computer Science 2025-07-02 Pengyu Chen , Zizheng Guo , Jianwei Yang , Dongjing Miao

Feature and Instance Joint Selection: A Reinforcement Learning Perspective

Feature selection and instance selection are two important techniques of data processing. However, such selections have mostly been studied separately, while existing work towards the joint selection conducts feature/instance selection…

Machine Learning · Computer Science 2022-05-18 Wei Fan , Kunpeng Liu , Hao Liu , Hengshu Zhu , Hui Xiong , Yanjie Fu

Parallel In-Memory Evaluation of Spatial Joins

The spatial join is a popular operation in spatial database systems and its evaluation is a well-studied problem. As main memories become bigger and faster and commodity hardware supports parallel processing, there is a need to revamp…

Databases · Computer Science 2020-05-25 Dimitrios Tsitsigkos , Panagiotis Bouros , Nikos Mamoulis , Manolis Terrovitis

PanJoin: A Partition-based Adaptive Stream Join

In stream processing, stream join is one of the critical sources of performance bottlenecks. The sliding-window-based stream join provides a precise result but consumes considerable computational resources. The current solutions lack…

Databases · Computer Science 2018-11-14 Fei Pan , Hans-Arno Jacobsen

Scalable and robust set similarity join

Set similarity join is a fundamental and well-studied database operator. It is usually studied in the exact setting where the goal is to compute all pairs of sets that exceed a given similarity threshold (measured e.g. as Jaccard…

Databases · Computer Science 2018-03-05 Tobias Christiani , Rasmus Pagh , Johan Sivertsen

Random Sampling over Spatial Range Joins

Spatial range joins have many applications, including geographic information systems, location-based social networking services, neuroscience, and visualization. However, joins incur not only expensive computational costs but also too large…

Databases · Computer Science 2025-08-22 Daichi Amagata

Work Sharing and Offloading for Efficient Approximate Threshold-based Vector Join

Vector joins - finding all vector pairs between a set of query and data vectors whose distances are below a given threshold - are fundamental to modern vector and vector-relational database systems that power multimodal retrieval and…

Databases · Computer Science 2026-03-18 Kyoungmin Kim , Lennart Roth , Liang Liang , Anastasia Ailamaki

Joining relations under discrete uncertainty

In this paper we introduce and experimentally compare alternative algorithms to join uncertain relations. Different algorithms are based on specific principles, e.g., sorting, indexing, or building intermediate relational tables to apply…

Databases · Computer Science 2012-11-02 Matteo Magnani , Danilo Montesi

Weighted Random Sampling over Joins

Joining records with all other records that meet a linkage condition can result in an astronomically large number of combinations due to many-to-many relationships. For such challenging (acyclic) joins, a random sample over the join result…

Databases · Computer Science 2022-01-11 Michael Shekelyan , Graham Cormode , Peter Triantafillou , Ali Shanghooshabad , Qingzhi Ma

SieveJoin: Boosting Multi-Way Joins with Reusable Bloom Filters

Improving data systems' performance for join operations has long been an issue of great importance. More recently, a lot of focus has been devoted to multi-way join performance and especially on reducing the negative impact of producing…

Databases · Computer Science 2023-09-01 Qingzhi Ma

A Unified Architecture for Efficient Binary and Worst-Case Optimal Join Processing

Join processing is a fundamental operation in database management systems; however, traditional join algorithms often encounter efficiency challenges when dealing with complex queries that produce intermediate results much larger than the…

Databases · Computer Science 2025-05-27 Amirali Kaboli , Alex Mascolo , Amir Shaikhha

Memory-Efficient Group-by Aggregates over Multi-Way Joins

Aggregate computation in relational databases has long been done using the standard unary aggregation and binary join operators. These implement the classical model of computing joins between relations two at a time, materializing the…

Databases · Computer Science 2019-06-18 Konstantinos Xirogiannopoulos , Amol Deshpande