Related papers: Network Cluster-Robust Inference

Learning Coherent Clusters in Weakly-Connected Network Systems

We propose a structure-preserving model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix…

Systems and Control · Electrical Eng. & Systems 2023-05-15 Hancheng Min , Enrique Mallada

When Can We Trust Cluster-Robust Inference?

It is common when using cross-section or panel data to assign each observation to a cluster and allow for arbitrary patterns of heteroskedasticity and correlation within clusters. For regression models, there are many ways to make…

Econometrics · Economics 2026-04-03 James G. MacKinnon

Inference for Dependent Data with Learned Clusters

This paper presents and analyzes an approach to cluster-based inference for dependent data. The primary setting considered here is with spatially indexed data in which the dependence structure of observed random variables is characterized…

Statistics Theory · Mathematics 2022-11-16 Jianfei Cao , Christian Hansen , Damian Kozbur , Lucciano Villacorta

Spectral clustering and model reduction for weakly-connected coherent network systems

We propose a novel model-reduction methodology for large-scale dynamic networks with tightly-connected components. First, the coherent groups are identified by a spectral clustering algorithm on the graph Laplacian matrix that models the…

Systems and Control · Electrical Eng. & Systems 2022-10-04 Hancheng Min , Enrique Mallada

A Statistical Density-Based Analysis of Graph Clustering Algorithm Performance

Measuring graph clustering quality remains an open problem. To address it, we introduce quality measures based on comparisons of intra- and inter-cluster densities, an accompanying statistical test of the significance of their differences…

Social and Information Networks · Computer Science 2020-03-20 Pierre Miasnikof , Alexander Y. Shestopaloff , Anthony J. Bonner , Yuri Lawryshyn , Panos M. Pardalos

Cluster-robust inference with a single treated cluster using the t-test

This paper considers inference when there is a single treated cluster and a fixed number of control clusters, a setting that is common in empirical work, especially in difference-in-differences designs. We use the t-statistic and develop…

Econometrics · Economics 2025-11-11 Chun Pong Lau , Xinran Li

Advancing Local Clustering on Graphs via Compressive Sensing: Semi-supervised and Unsupervised Methods

Local clustering aims to identify specific substructures within a large graph without any additional structural information of the graph. These substructures are typically small compared to the overall graph, enabling the problem to be…

Machine Learning · Computer Science 2025-10-31 Zhaiming Shen , Sung Ha Kang

Local dominance unveils clusters in networks

Clusters or communities can provide a coarse-grained description of complex systems at multiple scales, but their detection remains challenging in practice. Community detection methods often define communities as dense subgraphs, or…

Physics and Society · Physics 2024-06-11 Dingyi Shi , Fan Shang , Bingsheng Chen , Paul Expert , Linyuan Lü , H. Eugene Stanley , Renaud Lambiotte , Tim S. Evans , Ruiqi Li

Robust Clustering Oracle and Local Reconstructor of Cluster Structure of Graphs

Due to the massive size of modern network data, local algorithms that run in sublinear time for analyzing the cluster structure of the graph are receiving growing interest. Two typical examples are local graph clustering algorithms that…

Data Structures and Algorithms · Computer Science 2019-04-23 Pan Peng

Clustering and Structural Robustness in Causal Diagrams

Graphs are commonly used to represent and visualize causal relations. For a small number of variables, this approach provides a succinct and clear view of the scenario at hand. As the number of variables under study increases, the graphical…

Machine Learning · Statistics 2023-08-16 Santtu Tikka , Jouni Helske , Juha Karvanen

On clustering network-valued data

Community detection, which focuses on clustering nodes or detecting communities in (mostly) a single network, is a problem of considerable practical interest and has received a great deal of attention in the research community. While being…

Machine Learning · Statistics 2017-11-07 Soumendu Sundar Mukherjee , Purnamrita Sarkar , Lizhen Lin

Genuinely Robust Inference for Clustered Data

Conventional cluster-robust inference can be invalid when data contain clusters of unignorably large size. We formalize this issue by deriving a necessary and sufficient condition for its validity, and show that this condition is frequently…

Econometrics · Economics 2025-10-07 Harold D. Chiang , Yuya Sasaki , Yulong Wang

Modularity of complex networks models

Modularity is designed to measure the strength of division of a network into clusters (known also as communities). Networks with high modularity have dense connections between the vertices within clusters but sparse connections between…

Probability · Mathematics 2017-07-18 Liudmila Ostroumova Prokhorenkova , Pawel Pralat , Andrei Raigorodskii

Studying Cross-cluster Modularity in Neural Networks

An approach to improve neural network interpretability is via clusterability, i.e., splitting a model into disjoint clusters that can be studied independently. We define a measure for clusterability and show that pre-trained models form…

Machine Learning · Computer Science 2025-07-28 Satvik Golechha , Maheep Chaudhary , Joan Velja , Alessandro Abate , Nandi Schoots

A few-shot graph Laplacian-based approach for improving the accuracy of low-fidelity data

Low-fidelity data is typically inexpensive to generate but inaccurate. On the other hand, high-fidelity data is accurate but expensive to obtain. Multi-fidelity methods use a small set of high-fidelity data to enhance the accuracy of a…

Machine Learning · Computer Science 2023-04-12 Orazio Pinti , Assad A. Oberai

Testing Higher-order Clusterability on graphs

Analysis of higher-order organizations, usually small connected subgraphs called motifs, is a fundamental task on complex networks. This paper studies a new problem of testing higher-order clusterability: given query access to an undirected…

Data Structures and Algorithms · Computer Science 2023-10-09 Yifei Li , Donghua Yang , Jianzhong Li

An Infinite Latent Attribute Model for Network Data

Latent variable models for network data extract a summary of the relational structure underlying an observed network. The simplest possible models subdivide nodes of the network into clusters; the probability of a link between any two nodes…

Machine Learning · Computer Science 2012-07-03 Konstantina Palla , David Knowles , Zoubin Ghahramani

Enhancing Graph Topology and Clustering Quality: A Modularity-Guided Approach

Current modularity-based community detection algorithms attempt to find cluster memberships that maximize modularity within a fixed graph topology. Diverging from this conventional approach, our work introduces a novel strategy that employs…

Data Analysis, Statistics and Probability · Physics 2024-02-27 Yongyu Wang , Shiqi Hao , Xiaoyang Wang , Xiaotian Zhuang

StruClus: Structural Clustering of Large-Scale Graph Databases

We present a structural clustering algorithm for large-scale datasets of small labeled graphs, utilizing a frequent subgraph sampling strategy. A set of representatives provides an intuitive description of each cluster, supports the…

Databases · Computer Science 2016-10-03 Till Schäfer , Petra Mutzel

Average Sensitivity of Spectral Clustering

Spectral clustering is one of the most popular clustering methods for finding clusters in a graph, which has found many applications in data mining. However, the input graph in those applications may have many missing edges due to error in…

Data Structures and Algorithms · Computer Science 2020-06-09 Pan Peng , Yuichi Yoshida