Related papers: Testing Graph Clusterability: Algorithms and Lower…

Approximating Dasgupta Cost in Sublinear Time from a Few Random Seeds

Testing graph cluster structure has been a central object of study in property testing since the foundational work of Goldreich and Ron [STOC'96] on expansion testing, i.e. the problem of distinguishing between a single cluster (an…

Data Structures and Algorithms · Computer Science 2025-08-12 Michael Kapralov , Akash Kumar , Silvio Lattanzi , Aida Mousavifar , Weronika Wrzos-Kaminska

Testing Higher-order Clusterability on graphs

Analysis of higher-order organizations, usually small connected subgraphs called motifs, is a fundamental task on complex networks. This paper studies a new problem of testing higher-order clusterability: given query access to an undirected…

Data Structures and Algorithms · Computer Science 2023-10-09 Yifei Li , Donghua Yang , Jianzhong Li

Testing Cluster Structure of Graphs

We study the problem of recognizing the cluster structure of a graph in the framework of property testing in the bounded degree model. Given a parameter $\varepsilon$, a $d$-bounded degree graph is defined to be $(k, \phi)$-clusterable, if…

Data Structures and Algorithms · Computer Science 2015-04-14 Artur Czumaj , Pan Peng , Christian Sohler

Spectral methods for testing cluster structure of graphs

In the framework of graph property testing, we study the problem of determining if a graph admits a cluster structure. We say that a graph is $(k, \phi)$-clusterable if it can be partitioned into at most $k$ parts such that each part has…

Data Structures and Algorithms · Computer Science 2019-01-01 Sandeep Silwal , Jonathan Tidor

Spectral Clustering with Side Information

In the graph clustering problem with a planted solution, the input is a graph on $n$ vertices partitioned into $k$ clusters, and the task is to infer the clusters from graph structure. A standard assumption is that clusters induce…

Data Structures and Algorithms · Computer Science 2025-11-24 Hendrik Fichtenberger , Michael Kapralov , Ekaterina Kochetkova , Silvio Lattanzi , Davide Mazzali , Weronika Wrzos-Kaminska

Robust Clustering Oracle and Local Reconstructor of Cluster Structure of Graphs

Due to the massive size of modern network data, local algorithms that run in sublinear time for analyzing the cluster structure of the graph are receiving growing interest. Two typical examples are local graph clustering algorithms that…

Data Structures and Algorithms · Computer Science 2019-04-23 Pan Peng

A Sublinear-Time Spectral Clustering Oracle with Improved Preprocessing Time

We address the problem of designing a sublinear-time spectral clustering oracle for graphs that exhibit strong clusterability. Such graphs contain $k$ latent clusters, each characterized by a large inner conductance (at least $\varphi$) and…

Data Structures and Algorithms · Computer Science 2024-01-01 Ranran Shen , Pan Peng

Graph Clustering using Effective Resistance

$ \def\vecc#1{\boldsymbol{#1}} $We design a polynomial time algorithm that for any weighted undirected graph $G = (V, E,\vecc w)$ and sufficiently large $\delta > 1$, partitions $V$ into subsets $V_1, \ldots, V_h$ for some $h\geq 1$, such…

Data Structures and Algorithms · Computer Science 2017-11-20 Vedat Levi Alev , Nima Anari , Lap Chi Lau , Shayan Oveis Gharan

A Statistical Density-Based Analysis of Graph Clustering Algorithm Performance

Measuring graph clustering quality remains an open problem. To address it, we introduce quality measures based on comparisons of intra- and inter-cluster densities, an accompanying statistical test of the significance of their differences…

Social and Information Networks · Computer Science 2020-03-20 Pierre Miasnikof , Alexander Y. Shestopaloff , Anthony J. Bonner , Yuri Lawryshyn , Panos M. Pardalos

Sublinear Time and Space Algorithms for Correlation Clustering via Sparse-Dense Decompositions

We present a new approach for solving (minimum disagreement) correlation clustering that results in sublinear algorithms with highly efficient time and space complexity for this problem. In particular, we obtain the following algorithms for…

Data Structures and Algorithms · Computer Science 2021-09-30 Sepehr Assadi , Chen Wang

A fast multilevel algorithm for graph clustering and community detection

One of the most useful measures of cluster quality is the modularity of a partition, which measures the difference between the number of the edges joining vertices from the same cluster and the expected number of such edges in a random…

Data Analysis, Statistics and Probability · Physics 2009-09-29 Hristo Djidjev

Testing properties of signed graphs

In graph property testing the task is to distinguish whether a graph satisfies a given property or is "far" from having that property, preferably with a sublinear query and time complexity. In this work we initiate the study of property…

Data Structures and Algorithms · Computer Science 2021-02-16 Florian Adriaens , Simon Apers

A clusterability test for directed graphs

In this article, we extend a statistical test of graph clusterability, the $\delta$ test, to directed graphs with no self loops. The $\delta$ test, originally designed for undirected graphs, is based on the premise that graphs with a…

Networking and Internet Architecture · Computer Science 2025-06-26 Mario R. Guarracino , Pierre Miasnikof , Alexander Y. Shestopaloff , Houyem Demni , Cristián Bravo , Yuri Lawryshyn

Enhancing Graph Topology and Clustering Quality: A Modularity-Guided Approach

Current modularity-based community detection algorithms attempt to find cluster memberships that maximize modularity within a fixed graph topology. Diverging from this conventional approach, our work introduces a novel strategy that employs…

Data Analysis, Statistics and Probability · Physics 2024-02-27 Yongyu Wang , Shiqi Hao , Xiaoyang Wang , Xiaotian Zhuang

Expander Decomposition and Pruning: Faster, Stronger, and Simpler

We study the problem of graph clustering where the goal is to partition a graph into clusters, i.e. disjoint subsets of vertices, such that each cluster is well connected internally while sparsely connected to the rest of the graph. In…

Data Structures and Algorithms · Computer Science 2021-12-17 Thatchaphol Saranurak , Di Wang

A Polynomial Algorithm for Balanced Clustering via Graph Partitioning

The objective of clustering is to discover natural groups in datasets and to identify geometrical structures which might reside there, without assuming any prior knowledge on the characteristics of the data. The problem can be seen as…

Computational Geometry · Computer Science 2018-01-26 Luis-Evaristo Caraballo , José-Miguel Díaz-Báñez , Nadine Kroher

Structured Graph Learning for Clustering and Semi-supervised Classification

Graphs have become increasingly popular in modeling structures and interactions in a wide variety of problems during the last decade. Graph-based clustering and semi-supervised classification techniques have shown impressive performance.…

Machine Learning · Computer Science 2020-09-01 Zhao Kang , Chong Peng , Qiang Cheng , Xinwang Liu , Xi Peng , Zenglin Xu , Ling Tian

Clustering with Similarity Preserving

Graph-based clustering has shown promising performance in many tasks. A key step of graph-based approach is the similarity graph construction. In general, learning graph in kernel space can enhance clustering accuracy due to the…

Machine Learning · Computer Science 2019-05-22 Zhao Kang , Honghui Xu , Boyu Wang , Hongyuan Zhu , Zenglin Xu

Distributed Graph Clustering by Load Balancing

Graph clustering is a fundamental computational problem with a number of applications in algorithm design, machine learning, data mining, and analysis of social networks. Over the past decades, researchers have proposed a number of…

Data Structures and Algorithms · Computer Science 2019-04-12 He Sun , Luca Zanetti

Experiments on Density-Constrained Graph Clustering

Clustering a graph means identifying internally dense subgraphs which are only sparsely interconnected. Formalizations of this notion lead to measures that quantify the quality of a clustering and to algorithms that actually find…

Data Structures and Algorithms · Computer Science 2011-12-12 Robert Görke , Andrea Schumm , Dorothea Wagner