English
Related papers

Related papers: Fast Distributed Complex Join Processing

200 papers

As database query processing techniques are being used to handle diverse workloads, a key emerging challenge is how to efficiently handle multi-way join queries containing multiple many-to-many joins. While uncommon in traditional…

Databases · Computer Science 2025-05-20 Hasara Kalumin , Amol Deshpande

Selecting appropriate distributed join methods for logical join operations in a query plan is crucial for the performance of data-intensive scalable computing (DISC). Different network communication patterns in the data exchange phase…

Databases · Computer Science 2023-12-29 F. Liang , F. C. M. Lau , H. Cui , Y. Li , B. Lin , C. Li , X. Hu

We study three-way joins on MapReduce. Joins are very useful in a multitude of applications from data integration and traversing social networks, to mining graphs and automata-based constructions. However, joins are expensive, even for…

Databases · Computer Science 2014-05-19 Ben Kimmett , Alex Thomo , S. Venkatesh

Multi-way Theta-join queries are powerful in describing complex relations and therefore widely employed in real practices. However, existing solutions from traditional distributed and parallel databases for multi-way Theta-join queries…

Databases · Computer Science 2012-08-02 Xiaofei Zhang , Lei Chen , Min Wang

It is crucial to provide real-time performance in many applications, such as interactive and exploratory data analysis. In these settings, users often need to view subsets of query results quickly. It is challenging to deliver such results…

The Join operator, as one of the most expensive and commonly used operators in database systems, plays a substantial role in Database Management System (DBMS) performance. Among the many different Join algorithms studied over the last…

Databases · Computer Science 2021-12-07 Shiva Jahangiri , Michael J. Carey , Johann-Christoph Freytag

This paper investigates distributed resource allocation optimization over directed graphs with limited communication bandwidth. We develop a novel distributed algorithm that integrates the centralized Proximal Jacobian Alternating Direction…

Optimization and Control · Mathematics 2026-04-17 Xu Du , Boyu Han , Ivano Notarnicola , Karl H. Johansson , Apostolos I. Rikos

In distributed optimization and federated learning, asynchronous alternating direction method of multipliers (ADMM) serves as an attractive option for large-scale optimization, data privacy, straggler nodes and variety of objective…

Machine Learning · Computer Science 2025-08-19 Sagar Shrestha

Joining trajectory datasets is a significant operation in mobility data analytics and the cornerstone of various methods that aim to extract knowledge out of them. In the era of Big Data, the production of mobility data has become massive…

Databases · Computer Science 2020-02-07 Panagiotis Tampakis , Christos Doulkeridis , Nikos Pelekis , Yannis Theodoridis

In this paper, we propose a novel distributed alternating direction method of multipliers (ADMM) algorithm with synergetic communication and computation, called SCCD-ADMM, to reduce the total communication and computation cost of the…

Signal Processing · Electrical Eng. & Systems 2020-09-30 Zhuojun Tian , Zhaoyang Zhang , Jue Wang , Xiaoming Chen , Wei Wang , Huaiyu Dai

Handling skew is one of the major challenges in query processing. In distributed computational environments such as MapReduce, uneven distribution of the data to the servers is not desired. One of the dominant measures that we want to…

Databases · Computer Science 2015-04-14 Foto N. Afrati , Jeffrey D. Ullman , Angelos Vasilakopoulos

We address distributed learning problems over undirected networks. Specifically, we focus on designing a novel ADMM-based algorithm that is jointly computation- and communication-efficient. Our design guarantees computational efficiency by…

Machine Learning · Computer Science 2026-01-21 Xiaoxing Ren , Nicola Bastianello , Karl H. Johansson , Thomas Parisini

Streaming computing enables the real-time processing of large volumes of data and offers significant advantages for various applications, including real-time recommendations, anomaly detection, and monitoring. The multi-way stream join…

Databases · Computer Science 2024-11-26 Jinlong Hu , Tingfeng Qiu

In the last few years, much effort has been devoted to developing join algorithms in order to achieve worst-case optimality for join queries over relational databases. Towards this end, the database community has had considerable success in…

Databases · Computer Science 2020-03-02 Shaleen Deep , Xiao Hu , Paraschos Koutris

We propose a distributed algorithm, named Distributed Alternating Direction Method of Multipliers (D-ADMM), for solving separable optimization problems in networks of interconnected nodes or agents. In a separable optimization problem there…

Optimization and Control · Mathematics 2013-04-26 João F. C. Mota , João M. F. Xavier , Pedro M. Q. Aguiar , Markus Püschel

Large Language Models (LLMs) are being increasingly used within data systems to process large datasets with text fields. A broad class of such tasks involves a semantic join-joining two tables based on a natural language predicate per pair…

Databases · Computer Science 2025-12-08 Sepanta Zeighami , Shreya Shankar , Aditya Parameswaran

Join processing is a fundamental operation in database management systems; however, traditional join algorithms often encounter efficiency challenges when dealing with complex queries that produce intermediate results much larger than the…

Databases · Computer Science 2025-05-27 Amirali Kaboli , Alex Mascolo , Amir Shaikhha

Methods for distributed optimization have received significant attention in recent years owing to their wide applicability in various domains. A distributed optimization method typically consists of two key components: communication and…

Optimization and Control · Mathematics 2018-06-04 Albert S. Berahas , Raghu Bollapragada , Nitish Shirish Keskar , Ermin Wei

Machine Learning has proven useful in the recent years as a way to achieve failure prediction for industrial systems. However, the high computational resources necessary to run learning algorithms are an obstacle to its widespread…

Artificial Intelligence · Computer Science 2020-01-22 Nicolas Aussel , Sophie Chabridon , Yohan Petetin

Multi-agent distributed consensus optimization problems arise in many signal processing applications. Recently, the alternating direction method of multipliers (ADMM) has been used for solving this family of problems. ADMM based distributed…

Systems and Control · Computer Science 2015-06-18 Tsung-Hui Chang , Mingyi Hong , Xiangfeng Wang
‹ Prev 1 2 3 10 Next ›