Related papers: A clusterability test for directed graphs

200 papers

Measuring graph clustering quality remains an open problem. To address it, we introduce quality measures based on comparisons of intra- and inter-cluster densities, an accompanying statistical test of the significance of their differences…

Social and Information Networks · Computer Science 2020-03-20 Pierre Miasnikof, Alexander Y. Shestopaloff, Anthony J. Bonner, Yuri Lawryshyn, Panos M. Pardalos

Analysis of higher-order organizations, usually small connected subgraphs called motifs, is a fundamental task on complex networks. This paper studies a new problem of testing higher-order clusterability: given query access to an undirected…

Data Structures and Algorithms · Computer Science 2023-10-09 Yifei Li, Donghua Yang, Jianzhong Li

Graph clustering is a fundamental task in unsupervised learning with broad real-world applications. While spectral clustering methods for undirected graphs are well-established and guided by a minimum cut optimization consensus, their…

Machine Learning · Statistics 2025-06-04 Ning Zhang, Xiaowen Dong, Mihai Cucuringu

Clustering a graph means identifying internally dense subgraphs which are only sparsely interconnected. Formalizations of this notion lead to measures that quantify the quality of a clustering and to algorithms that actually find…

Data Structures and Algorithms · Computer Science 2011-12-12 Robert Görke, Andrea Schumm, Dorothea Wagner

Graph clustering is the problem of identifying sparsely connected dense subgraphs (clusters) in a given graph. Proposed clustering algorithms usually optimize various fitness functions that measure the quality of a cluster within the graph.…

Computational Complexity · Computer Science 2007-05-23 Jiri Sima, Satu Elisa Schaeffer

Networks (or graphs) appear as dominant structures in diverse domains, including sociology, biology, neuroscience and computer science. In most of the aforementioned cases graphs are directed - in the sense that there is directionality on…

Social and Information Networks · Computer Science 2015-06-16 Fragkiskos D. Malliaros, Michalis Vazirgiannis

We consider the problem of testing graph cluster structure: given access to a graph $G=(V, E)$, can we quickly determine whether the graph can be partitioned into a few clusters with good inner conductance, or is far from any such graph?…

Data Structures and Algorithms · Computer Science 2018-09-19 Ashish Chiplunkar, Michael Kapralov, Sanjeev Khanna, Aida Mousavifar, Yuval Peres

This article explores and analyzes the unsupervised clustering of large partially observed graphs. We propose a scalable and provable randomized framework for clustering graphs generated from the stochastic block model. The clustering is…

Social and Information Networks · Computer Science 2022-12-06 Mostafa Rahmani, Andre Beckus, Adel Karimian, George Atia

Local clustering aims to find a compact cluster near the given starting instances. This work focuses on graph local clustering, which has broad applications beyond graphs because of the internal connectivities within various modalities.…

Social and Information Networks · Computer Science 2024-12-05 Zihao Li, Dongqi Fu, Hengyu Liu, Jingrui He

Unsupervised node clustering (or community detection) is a classical graph learning task. In this paper, we study algorithms, which exploit the geometry of the graph to identify densely connected substructures, which form clusters or…

Social and Information Networks · Computer Science 2023-07-20 Yu Tian, Zachary Lubberts, Melanie Weber

Graph clustering is an unsupervised machine learning method that partitions the nodes in a graph into different groups. Despite achieving significant progress in exploiting both attributed and structured data information, graph clustering…

Machine Learning · Computer Science 2025-01-03 Rui Zhang, Xiaoyang Hou, Zhihua Tian, Yan he, Enchao Gong, Jian Liu, Qingbiao Wu, Kui Ren

Traditionally, graph quality metrics focus on readability, but recent studies show the need for metrics which are more specific to the discovery of patterns in graphs. Cluster analysis is a popular task within graph analysis, yet there is…

Data Structures and Algorithms · Computer Science 2019-08-22 Amyra Meidiana, Seok-Hee Hong, Peter Eades, Daniel Keim

We present a graph-theoretical approach to data clustering, which combines the creation of a graph from the data with Markov Stability, a multiscale community detection framework. We show how the multiscale capabilities of the method allow…

Information Retrieval · Computer Science 2020-01-14 Zijing Liu, Mauricio Barahona

Data clustering with uneven distribution in high level noise is challenging. Currently, HDBSCAN is considered as the SOTA algorithm for this problem. In this paper, we propose a novel clustering algorithm based on what we call graph of…

Machine Learning · Computer Science 2020-09-25 Zhangyang Gao, Haitao Lin, Stan. Z Li

How can we find a good graph clustering of a real-world network, that allows insight into its underlying structure and also potential functions? In this paper, we introduce a new graph clustering algorithm Dcut from a density point of view.…

Social and Information Networks · Computer Science 2016-06-06 Junming Shao, Qinli Yang, Jinhu Liu, Stefan Kramer

We propose a novel perspective on varied-density clustering for high-dimensional data by framing it as a label propagation process in neighborhood graphs that adapt to local density variations. Our method formally connects density-based…

Machine Learning · Computer Science 2025-08-06 Ninh Pham, Yingtao Zheng, Hugo Phibbs

Graph clustering is a fundamental technique in data analysis with applications in many different fields. While there is a large body of work on clustering undirected graphs, the problem of clustering directed graphs is much less understood.…

Physics and Society · Physics 2025-01-31 James Martin, Tim Rogers, Luca Zanetti

Clustering is a widely used unsupervised learning method for finding structure in the data. However, the resulting clusters are typically presented without any guarantees on their robustness; slightly changing the used data sample or…

Machine Learning · Statistics 2017-01-02 Andreas Henelius, Kai Puolamäki, Henrik Boström, Panagiotis Papapetrou

Two-sample tests utilizing a similarity graph on observations are useful for high-dimensional and non-Euclidean data due to their flexibility and good performance under a wide range of alternatives. Existing works mainly focused on sparse…

Statistics Theory · Mathematics 2023-11-14 Yejiong Zhu, Hao Chen

Graph-based clustering has shown promising performance in many tasks. A key step of graph-based approach is the similarity graph construction. In general, learning graph in kernel space can enhance clustering accuracy due to the…

Machine Learning · Computer Science 2019-05-22 Zhao Kang, Honghui Xu, Boyu Wang, Hongyuan Zhu, Zenglin Xu