English
Related papers

Related papers: An Incremental Reseeding Strategy for Clustering

200 papers

We develop an algorithm that finds the consensus of many different clustering solutions of a graph. We formulate the problem as a median set partitioning problem and propose a greedy optimization technique. Unlike other approaches that find…

Information Retrieval · Computer Science 2024-08-22 Md Taufique Hussain , Mahantesh Halappanavar , Samrat Chatterjee , Filippo Radicchi , Santo Fortunato , Ariful Azad

The most commonly used method to tackle the graph partitioning problem in practice is the multilevel approach. During a coarsening phase, a multilevel graph partitioning algorithm reduces the graph size by iteratively contracting nodes and…

Distributed, Parallel, and Cluster Computing · Computer Science 2014-03-26 Henning Meyerhenke , Peter Sanders , Christian Schulz

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science 2017-11-06 He Sun , Luca Zanetti

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science 2016-10-03 Till Schäfer , Petra Mutzel

Graph clustering has many important applications in computing, but due to growing sizes of graphs, even traditionally fast clustering methods such as spectral partitioning can be computationally expensive for real-world graphs of interest.…

Distributed, Parallel, and Cluster Computing · Computer Science 2019-06-11 Julian Shun , Farbod Roosta-Khorasani , Kimon Fountoulakis , Michael W. Mahoney

This paper describes a graph clustering algorithm that aims to minimize the normalized cut criterion and has a model order selection procedure. The performance of the proposed algorithm is comparable to spectral approaches in terms of…

Artificial Intelligence · Computer Science 2011-05-06 Seyed Salim Tabatabaei , Mark Coates , Michael Rabbat

Spectral clustering approaches have led to well-accepted algorithms for finding accurate clusters in a given dataset. However, their application to large-scale datasets has been hindered by computational complexity of eigenvalue…

Machine Learning · Computer Science 2016-03-17 Shahzad Bhatti , Carolyn Beck , Angelia Nedic

Processing large complex networks like social networks or web graphs has recently attracted considerable interest. In order to do this in parallel, we need to partition them into pieces of about equal size. Unfortunately, previous parallel…

Distributed, Parallel, and Cluster Computing · Computer Science 2015-01-27 Henning Meyerhenke , Peter Sanders , Christian Schulz

We propose a Greedy strategy to solve the problem of Graph Cut, called GGC. It starts from the state where each data sample is regarded as a cluster and dynamically merges the two clusters which reduces the value of the global objective…

Machine Learning · Computer Science 2024-12-31 Feiping Nie , Shenfei Pei , Zengwei Zheng , Rong Wang , Xuelong Li

We present a parallelized bijective graph matching algorithm that leverages seeds and is designed to match very large graphs. Our algorithm combines spectral graph embedding with existing state-of-the-art seeded graph matching procedures.…

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science 2019-04-12 He Sun , Luca Zanetti

We study large-scale, distributed graph clustering. Given an undirected graph, our objective is to partition the nodes into disjoint sets called clusters. A cluster should contain many internal edges while being sparsely connected to other…

Data Structures and Algorithms · Computer Science 2020-04-28 Michael Hamann , Ben Strasser , Dorothea Wagner , Tim Zeitz

Graph clustering involves the task of dividing nodes into clusters, so that the edge density is higher within clusters as opposed to across clusters. A natural, classic and popular statistical setting for evaluating solutions to this…

Machine Learning · Statistics 2016-11-17 Yudong Chen , Sujay Sanghavi , Huan Xu

The graph partitioning problem has many applications in scientific computing such as computer aided design, data mining, image compression and other applications with sparse-matrix vector multiplications as a kernel operation. In many cases…

Data Structures and Algorithms · Computer Science 2016-01-08 Foad Lotfifar , Matthew Johnson

Graph clustering is an important technique to understand the relationships between the vertices in a big graph. In this paper, we propose a novel random-walk-based graph clustering method. The proposed method restricts the reach of the…

Social and Information Networks · Computer Science 2016-06-22 Honglei Zhang , Jenni Raitoharju , Serkan Kiranyaz , Moncef Gabbouj

Edge-centric distributed computations have appeared as a recent technique to improve the shortcomings of think-like-a-vertex algorithms on large scale-free networks. In order to increase parallelism on this model, edge partitioning -…

Data Structures and Algorithms · Computer Science 2018-10-12 Sebastian Schlag , Christian Schulz , Daniel Seemaier , Darren Strash

Modularity is one of the most widely used quality measures for graph clusterings. Maximizing modularity is NP-hard, and the runtime of exact algorithms is prohibitive for large graphs. A simple and effective class of heuristics coarsens the…

Data Structures and Algorithms · Computer Science 2009-09-22 Andreas Noack , Randolf Rotta

Flow-based methods for local graph clustering have received significant recent attention for their theoretical cut improvement and runtime guarantees. In this work we present two improvements for using flow-based methods in real-world…

Social and Information Networks · Computer Science 2019-03-26 Nate Veldt , Christine Klymko , David Gleich

One of the most useful measures of cluster quality is the modularity of a partition, which measures the difference between the number of the edges joining vertices from the same cluster and the expected number of such edges in a random…

Data Analysis, Statistics and Probability · Physics 2009-09-29 Hristo Djidjev

We propose a parallel graph-based data clustering algorithm using CUDA GPU, based on exact clustering of the minimum spanning tree in terms of a minimum isoperimetric criteria. We also provide a comparative performance analysis of our…

Distributed, Parallel, and Cluster Computing · Computer Science 2017-02-17 Ramin Javadi , Saleh Ashkboos
‹ Prev 1 2 3 10 Next ›