Related papers: Distributed Graph Clustering by Load Balancing

Distributed Graph Clustering and Sparsification

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science 2017-11-06 He Sun , Luca Zanetti

Communication-Optimal Distributed Clustering

Clustering large datasets is a fundamental problem with a number of applications in machine learning. Data is often collected on different sites and clustering needs to be performed in a distributed manner with low communication. We would…

Data Structures and Algorithms · Computer Science 2017-02-02 Jiecao Chen , He Sun , David P. Woodruff , Qin Zhang

A Short Introduction to Local Graph Clustering Methods and Software

Graph clustering has many important applications in computing, but due to the increasing sizes of graphs, even traditionally fast clustering methods can be computationally expensive for real-world graphs of interest. Scalability problems…

Social and Information Networks · Computer Science 2018-10-18 Kimon Fountoulakis , David F. Gleich , Michael W. Mahoney

Clustering based on Random Graph Model embedding Vertex Features

Large datasets with interactions between objects are common to numerous scientific fields (i.e. social science, internet, biology...). The interactions naturally define a graph and a common way to explore or summarize such dataset is graph…

Applications · Statistics 2009-10-13 Hugo Zanghi , Stevenn Volant , Christophe Ambroise

Attributed Graph Clustering in Collaborative Settings

Graph clustering is an unsupervised machine learning method that partitions the nodes in a graph into different groups. Despite achieving significant progress in exploiting both attributed and structured data information, graph clustering…

Machine Learning · Computer Science 2025-01-03 Rui Zhang , Xiaoyang Hou , Zhihua Tian , Yan he , Enchao Gong , Jian Liu , Qingbiao Wu , Kui Ren

Distributed Graph Clustering using Modularity and Map Equation

We study large-scale, distributed graph clustering. Given an undirected graph, our objective is to partition the nodes into disjoint sets called clusters. A cluster should contain many internal edges while being sparsely connected to other…

Data Structures and Algorithms · Computer Science 2020-04-28 Michael Hamann , Ben Strasser , Dorothea Wagner , Tim Zeitz

Improved Graph Clustering

Graph clustering involves the task of dividing nodes into clusters, so that the edge density is higher within clusters as opposed to across clusters. A natural, classic and popular statistical setting for evaluating solutions to this…

Machine Learning · Statistics 2016-11-17 Yudong Chen , Sujay Sanghavi , Huan Xu

Clustering-aware Graph Construction: A Joint Learning Perspective

Graph-based clustering methods have demonstrated the effectiveness in various applications. Generally, existing graph-based clustering methods first construct a graph to represent the input data and then partition it to generate the…

Machine Learning · Computer Science 2019-12-17 Yuheng Jia , Hui Liu , Junhui Hou , Sam Kwong

A Survey of Distributed Graph Algorithms on Massive Graphs

Distributed processing of large-scale graph data has many practical applications and has been widely studied. In recent years, a lot of distributed graph processing frameworks and algorithms have been proposed. While many efforts have been…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-10-29 Lingkai Meng , Yu Shao , Long Yuan , Longbin Lai , Peng Cheng , Xue Li , Wenyuan Yu , Wenjie Zhang , Xuemin Lin , Jingren Zhou

A Polynomial Algorithm for Balanced Clustering via Graph Partitioning

The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as…

Computational Geometry · Computer Science 2018-01-26 Luis-Evaristo Caraballo , José-Miguel Díaz-Báñez , Nadine Kroher

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science 2016-10-03 Till Schäfer , Petra Mutzel

Decentralized Distributed Graph Coloring: Cluster Graphs

Graph coloring is fundamental to distributed computing. We give the first sub-logarithmic distributed algorithm for coloring cluster graphs. These graphs are obtained from the underlying communication network by contracting nodes and edges,…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-06-17 Maxime Flin , Magnus M. Halldorsson , Alexandre Nolin

Study on Resource Efficiency of Distributed Graph Processing

Graphs may be used to represent many different problem domains -- a concrete example is that of detecting communities in social networks, which are represented as graphs. With big data and more sophisticated applications becoming widespread…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-04-03 Miguel E. Coimbra , Alexandre P. Francisco , Luis Veiga

A Survey of Deep Graph Clustering: Taxonomy, Challenge, Application, and Open Resource

Graph clustering, which aims to divide nodes in the graph into several distinct clusters, is a fundamental yet challenging task. Benefiting from the powerful representation capability of deep learning, deep graph clustering methods have…

Machine Learning · Computer Science 2023-09-13 Yue Liu , Jun Xia , Sihang Zhou , Xihong Yang , Ke Liang , Chenchen Fan , Yan Zhuang , Stan Z. Li , Xinwang Liu , Kunlun He

Graph Clustering with Dynamic Embedding

Graph clustering (or community detection) has long drawn enormous attention from the research on web mining and information networks. Recent literature on this topic has reached a consensus that node contents and link structures should be…

Social and Information Networks · Computer Science 2017-12-25 Carl Yang , Mengxiong Liu , Zongyi Wang , Liyuan Liu , Jiawei Han

Generalizing Lloyd's algorithm for graph clustering

Clustering is a commonplace problem in many areas of data science, with applications in biology and bioinformatics, understanding chemical structure, image segmentation, building recommender systems, and many more fields. While there are…

Numerical Analysis · Mathematics 2023-12-25 Tareq Zaman , Nicolas Nytko , Ali Taghibakhshi , Scott MacLachlan , Luke Olson , Matthew West

Multi-Dimensional Balanced Graph Partitioning via Projected Gradient Descent

Motivated by performance optimization of large-scale graph processing systems that distribute the graph across multiple machines, we consider the balanced graph partitioning problem. Compared to the previous work, we study the…

Data Structures and Algorithms · Computer Science 2019-02-19 Dmitrii Avdiukhin , Sergey Pupyrev , Grigory Yaroslavtsev

Online Sparsification of Bipartite-Like Clusters in Graphs

Graph clustering is an important algorithmic technique for analysing massive graphs, and has been widely applied in many research fields of data science. While the objective of most graph clustering algorithms is to find a vertex set of low…

Data Structures and Algorithms · Computer Science 2025-08-08 Joyentanuj Das , Suranjan De , He Sun

Clustering-based Partitioning for Large Web Graphs

Graph partitioning plays a vital role in distributedlarge-scale web graph analytics, such as pagerank and labelpropagation. The quality and scalability of partitioning strategyhave a strong impact on such communication- and…

Distributed, Parallel, and Cluster Computing · Computer Science 2022-01-04 Deyu Kong , Xike Xie , Zhuoxu Zhang

Distributed Algorithms for Finding Local Clusters Using Heat Kernel Pagerank

A distributed algorithm performs local computations on pieces of input and communicates the results through given communication links. When processing a massive graph in a distributed algorithm, local outputs must be configured as a…

Distributed, Parallel, and Cluster Computing · Computer Science 2016-12-06 Fan Chung , Olivia Simpson