Related papers: GYM: A Multiround Join Algorithm In MapReduce

200 papers

Handling skew is one of the major challenges in query processing. In distributed computational environments such as MapReduce, uneven distribution of the data to the servers is not desired. One of the dominant measures that we want to…

Databases · Computer Science 2015-04-14 Foto N. Afrati, Jeffrey D. Ullman, Angelos Vasilakopoulos

In this paper, we investigate the problem of computing a multiway join in one round of MapReduce when the data may be skewed. We optimize on communication cost, i.e., the amount of data that is transferred from the mappers to the reducers.…

Databases · Computer Science 2020-01-14 Foto Afrati, Nikos Stasinopoulos, Jeffrey D. Ullman, Angelos Vassilakopoulos

We study the problem of computing conjunctive queries over large databases on parallel architectures without shared storage. Using the structure of such a query $q$ and the skew in the data, we study tradeoffs between the number of…

Databases · Computer Science 2016-02-22 Paul Beame, Paraschos Koutris, Dan Suciu

We optimize multiway equijoins on relational tables using degree information. We give a new bound that uses degree information to more tightly bound the maximum output size of a query. On real data, our bound on the number of triangles in a…

Databases · Computer Science 2015-12-23 Manas Joglekar, Christopher Ré

We are presented with a graph, $G$, on $n$ vertices with $m$ edges whose edge set is unknown. Our goal is to learn the edges of $G$ with as few queries to an oracle as possible. When we submit a set $S$ of vertices to the oracle, it tells…

Quantum Physics · Physics 2024-03-01 Asaf Ferber, Liam Hardiman

The high cost of communicating gradients is a major bottleneck for federated learning, as the bandwidth of the participating user devices is limited. Existing gradient compression algorithms are mainly designed for data centers with…

Machine Learning · Computer Science 2019-11-26 Xinyan Dai, Xiao Yan, Kaiwen Zhou, Han Yang, Kelvin K. W. Ng, James Cheng, Yu Fan

This paper investigates the energy complexity of distributed graph problems in multi-hop radio networks, where the energy cost of an algorithm is measured by the maximum number of awake rounds of a vertex. Recent works revealed that some…

Data Structures and Algorithms · Computer Science 2023-04-11 Yi-Jun Chang

We propose a new method for estimating the number of answers OUT of a small join query Q in a large database D, and for uniform sampling over joins. Our method is the first to satisfy all the following statements. - Support arbitrary Q,…

Databases · Computer Science 2023-04-11 Kyoungmin Kim, Jaehyun Ha, George Fletcher, Wook-Shin Han

We study three-way joins on MapReduce. Joins are very useful in a multitude of applications from data integration and traversing social networks, to mining graphs and automata-based constructions. However, joins are expensive, even for…

Databases · Computer Science 2014-05-19 Ben Kimmett, Alex Thomo, S. Venkatesh

Given a large graph G = (V,E) with millions of nodes and edges, how do we compute its connected components efficiently? Recent work addresses this problem in map-reduce, where a fundamental trade-off exists between the number of map-reduce…

Data Structures and Algorithms · Computer Science 2012-11-14 Vibhor Rastogi, Ashwin Machanavajjhala, Laukik Chitnis, Anish Das Sarma

The computation of the diameter is one of the most central problems in distributed computation. In the standard CONGEST model, in which two adjacent nodes can exchange $O(\log n)$ bits per round (here $n$ denotes the number of nodes of the…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-01-12 François Le Gall, Frédéric Magniez

In this paper, we study systems of distributed entities that can actively modify their communication network. This gives rise to distributed algorithms that apart from communication can also exploit network reconfiguration in order to carry…

Distributed, Parallel, and Cluster Computing · Computer Science 2020-03-09 Othon Michail, George Skretas, Paul G. Spirakis

Can Grover's algorithm speed up search of a physical region - for example a 2-D grid of size sqrt(n) by sqrt(n)? The problem is that sqrt(n) time seems to be needed for each query, just to move amplitude across the grid. Here we show that…

Quantum Physics · Physics 2007-05-23 Scott Aaronson, Andris Ambainis

We initiate the study of diameter computation in geometric intersection graphs from the fine-grained complexity perspective. A geometric intersection graph is a graph whose vertices correspond to some shapes in $d$-dimensional Euclidean…

Computational Geometry · Computer Science 2022-03-11 Karl Bringmann, Sándor Kisfaludi-Bak, Marvin Künnemann, André Nusser, Zahra Parsaeian

In many data analysis pipelines, a basic and time-consuming process is to produce join results and feed them into downstream tasks. Numerous enumeration algorithms have been developed for this purpose. To be a statistically meaningful…

Databases · Computer Science 2025-07-02 Pengyu Chen, Zizheng Guo, Jianwei Yang, Dongjing Miao

The congested clique model is a message-passing model of distributed computation where the underlying communication network is the complete graph of $n$ nodes. In this paper we consider the situation where the joint input to the nodes is an…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-06-13 Pedro Montealegre, Sebastian Perez-Salazar, Ivan Rapaport, Ioan Todinca

We consider the problem of finding a maximal independent set (MIS) in the shared blackboard communication model with vertex-partitioned inputs. There are $n$ players corresponding to vertices of an undirected graph, and each player sees the…

Data Structures and Algorithms · Computer Science 2022-09-20 Sepehr Assadi, Gillat Kol, Zhijun Zhang

We show how to construct an overlay network of constant degree and diameter $O(\log n)$ in time $O(\log n)$ starting from an arbitrary weakly connected graph. We assume a synchronous communication network in which nodes can send messages to…

Distributed, Parallel, and Cluster Computing · Computer Science 2021-07-28 Thorsten Götte, Kristian Hinnenthal, Christian Scheideler, Julian Werthmann

In the semi-streaming model for processing massive graphs, an algorithm makes multiple passes over the edges of a given $n$-vertex graph and is tasked with computing the solution to a problem using $O(n \cdot \text{polylog}(n))$ space.…

Data Structures and Algorithms · Computer Science 2023-12-21 Sepehr Assadi, Christian Konrad, Kheeran K. Naidu, Janani Sundaresan

The problems of computing eccentricity, radius, and diameter are fundamental to graph theory. These parameters are intrinsically defined based on the distance metric of the graph. In this work, we propose quantum algorithms for the diameter…

Quantum Physics · Physics 2025-02-28 Adam Wesołowski, Jinge Bao