Related papers: An Incremental Reseeding Strategy for Clustering

Parallel Algorithms for Median Consensus Clustering in Complex Networks

We develop an algorithm that finds the consensus of many different clustering solutions of a graph. We formulate the problem as a median set partitioning problem and propose a greedy optimization technique. Unlike other approaches that find…

Information Retrieval · Computer Science · 2024-08-22 · Md Taufique Hussain, Mahantesh Halappanavar, Samrat Chatterjee, Filippo Radicchi, Santo Fortunato, Ariful Azad

Partitioning Complex Networks via Size-constrained Clustering

The most commonly used method to tackle the graph partitioning problem in practice is the multilevel approach. During a coarsening phase, a multilevel graph partitioning algorithm reduces the graph size by iteratively contracting nodes and…

Distributed, Parallel, and Cluster Computing · Computer Science · 2014-03-26 · Henning Meyerhenke, Peter Sanders, Christian Schulz

Distributed Graph Clustering and Sparsification

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science · 2017-11-06 · He Sun, Luca Zanetti

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science · 2016-10-03 · Till Schäfer, Petra Mutzel

Parallel Local Graph Clustering

Graph clustering has many important applications in computing, but due to growing sizes of graphs, even traditionally fast clustering methods such as spectral partitioning can be computationally expensive for real-world graphs of interest.…

Distributed, Parallel, and Cluster Computing · Computer Science · 2019-06-11 · Julian Shun, Farbod Roosta-Khorasani, Kimon Fountoulakis, Michael W. Mahoney

GANC: Greedy Agglomerative Normalized Cut

This paper describes a graph clustering algorithm that aims to minimize the normalized cut criterion and has a model order selection procedure. The performance of the proposed algorithm is comparable to spectral approaches in terms of…

Artificial Intelligence · Computer Science · 2011-05-06 · Seyed Salim Tabatabaei, Mark Coates, Michael Rabbat

Data Clustering and Graph Partitioning via Simulated Mixing

Spectral clustering approaches have led to well-accepted algorithms for finding accurate clusters in a given dataset. However, their application to large-scale datasets has been hindered by computational complexity of eigenvalue…

Machine Learning · Computer Science · 2016-03-17 · Shahzad Bhatti, Carolyn Beck, Angelia Nedic

Parallel Graph Partitioning for Complex Networks

Processing large complex networks like social networks or web graphs has recently attracted considerable interest. In order to do this in parallel, we need to partition them into pieces of about equal size. Unfortunately, previous parallel…

Distributed, Parallel, and Cluster Computing · Computer Science · 2015-01-27 · Henning Meyerhenke, Peter Sanders, Christian Schulz

A Greedy Strategy for Graph Cut

We propose a Greedy strategy to solve the problem of Graph Cut, called GGC. It starts from the state where each data sample is regarded as a cluster and dynamically merges the two clusters which reduces the value of the global objective…

Machine Learning · Computer Science · 2024-12-31 · Feiping Nie, Shenfei Pei, Zengwei Zheng, Rong Wang, Xuelong Li

Spectral Clustering for Divide-and-Conquer Graph Matching

We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures.…

Machine Learning · Statistics · 2015-03-13 · Vince Lyzinski, Daniel L. Sussman, Donniell E. Fishkind, Henry Pao, Li Chen, Joshua T. Vogelstein, Youngser Park, Carey E. Priebe

Distributed Graph Clustering by Load Balancing

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science · 2019-04-12 · He Sun, Luca Zanetti

Distributed Graph Clustering using Modularity and Map Equation

We study large-scale, distributed graph clustering. Given an undirected graph, our objective is to partition the nodes into disjoint sets called clusters. A cluster should contain many internal edges while being sparsely connected to other…

Data Structures and Algorithms · Computer Science · 2020-04-28 · Michael Hamann, Ben Strasser, Dorothea Wagner, Tim Zeitz

Improved Graph Clustering

Graph clustering involves the task of dividing nodes into clusters, so that the edge density is higher within clusters as opposed to across clusters. A natural, classic and popular statistical setting for evaluating solutions to this…

Machine Learning · Statistics · 2016-11-17 · Yudong Chen, Sujay Sanghavi, Huan Xu

A Serial Multilevel Hypergraph Partitioning Algorithm

The graph partitioning problem has many applications in scientific computing such as computer aided design, data mining, image compression and other applications with sparse-matrix vector multiplications as a kernel operation. In many cases…

Data Structures and Algorithms · Computer Science · 2016-01-08 · Foad Lotfifar, Matthew Johnson

Limited Random Walk Algorithm for Big Graph Data Clustering

Graph clustering is an important technique to understand the relationships between the vertices in a big graph. In this paper, we propose a novel random-walk-based graph clustering method. The proposed method restricts the reach of the…

Social and Information Networks · Computer Science · 2016-06-22 · Honglei Zhang, Jenni Raitoharju, Serkan Kiranyaz, Moncef Gabbouj

Scalable Edge Partitioning

Edge-centric distributed computations have appeared as a recent technique to improve the shortcomings of think-like-a-vertex algorithms on large scale-free networks. In order to increase parallelism on this model, edge partitioning -…

Data Structures and Algorithms · Computer Science · 2018-10-12 · Sebastian Schlag, Christian Schulz, Daniel Seemaier, Darren Strash

Multi-level algorithms for modularity clustering

Modularity is one of the most widely used quality measures for graph clusterings. Maximizing modularity is NP-hard, and the runtime of exact algorithms is prohibitive for large graphs. A simple and effective class of heuristics coarsens the…

Data Structures and Algorithms · Computer Science · 2009-09-22 · Andreas Noack, Randolf Rotta

Flow-Based Local Graph Clustering with Better Seed Set Inclusion

Flow-based methods for local graph clustering have received significant recent attention for their theoretical cut improvement and runtime guarantees. In this work we present two improvements for using flow-based methods in real-world…

Social and Information Networks · Computer Science · 2019-03-26 · Nate Veldt, Christine Klymko, David Gleich

A fast multilevel algorithm for graph clustering and community detection

One of the most useful measures of cluster quality is the modularity of a partition, which measures the difference between the number of the edges joining vertices from the same cluster and the expected number of such edges in a random…

Data Analysis, Statistics and Probability · Physics · 2009-09-29 · Hristo Djidjev

An Efficient Parallel Data Clustering Algorithm Using Isoperimetric Number of Trees

We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criterion. We also provide a comparative performance analysis of our…

Distributed, Parallel, and Cluster Computing · Computer Science · 2017-02-17 · Ramin Javadi, Saleh Ashkboos