Related papers: Clustering-Based Inter-Regional Correlation Estima…

Clustering coefficients for correlation networks

The clustering coefficient quantifies the abundance of connected triangles in a network and is a major descriptive statistics of networks. For example, it finds an application in the assessment of small-worldness of brain networks, which is…

Physics and Society · Physics 2018-06-28 Naoki Masuda , Michiko Sakaki , Takahiro Ezaki , Takamitsu Watanabe

Hierarchical Clustering Given Confidence Intervals of Metric Distances

This paper considers metric spaces where distances between a pair of nodes are represented by distance intervals. The goal is to study methods for the determination of hierarchical clusters, i.e., a family of nested partitions indexed by a…

Social and Information Networks · Computer Science 2016-10-17 Weiyu Huang , Alejandro Ribeiro

Generalization of Clustering Agreements and Distances for Overlapping Clusters and Network Communities

A measure of distance between two clusterings has important applications, including clustering validation and ensemble clustering. Generally, such distance measure provides navigation through the space of possible clusterings. Mostly used…

Social and Information Networks · Computer Science 2015-09-01 Reihaneh Rabbany , Osmar R. Zaïane

On Soft Clustering For Correlation Estimators

Properly estimating correlations between objects at different spatial scales necessitates $\mathcal{O}(n^2)$ distance calculations. For this reason, most widely adopted packages for estimating correlations use clustering algorithms to…

Instrumentation and Methods for Astrophysics · Physics 2025-09-15 Edward Berman , Sneh Pandya , Jacqueline McCleary , Marko Shuntov , Caitlin Casey , Nicole Drakos , Andreas Faisst , Steven Gillman , Ghassem Gozaliasl , Natalie Hogg , Jeyhan Kartaltepe , Anton Koekemoer , Wilfried Mercier , Diana Scognamiglio , COSMOS-Web , : , The JWST Cosmic Origins Survey

Normalised clustering accuracy: An asymmetric external cluster validity measure

There is no, nor will there ever be, single best clustering algorithm. Nevertheless, we would still like to be able to distinguish between methods that work well on certain task types and those that systematically underperform. Clustering…

Machine Learning · Computer Science 2025-10-16 Marek Gagolewski

A supervised clustering approach for fMRI-based inference of brain states

We propose a method that combines signals from many brain regions observed in functional Magnetic Resonance Imaging (fMRI) to predict the subject's behavior during a scanning session. Such predictions suffer from the huge number of brain…

Computer Vision and Pattern Recognition · Computer Science 2011-04-29 Vincent Michel , Alexandre Gramfort , Gaël Varoquaux , Evelyn Eger , Christine Keribin , Bertrand Thirion

Algorithms for Internal Validation Clustering Measures in the Post Genomic Era

Inferring cluster structure in microarray datasets is a fundamental task for the -omic sciences. A fundamental question in Statistics, Data Analysis and Classification, is the prediction of the number of clusters in a dataset, usually…

Data Structures and Algorithms · Computer Science 2011-02-16 Filippo Utro

Inter-regional correlation estimators for functional magnetic resonance imaging

Functional magnetic resonance imaging (fMRI) functional connectivity between brain regions is often computed using parcellations defined by functional or structural atlases. Typically, some kind of voxel averaging is performed to obtain a…

Methodology · Statistics 2023-03-10 Sophie Achard , Jean-Francois Coeurjolly , Pierre Lafaye de Micheaux , Jonas Richiardi

A new nonparametric interpoint distance-based measure for assessment of clustering

A new interpoint distance-based measure is proposed to identify the optimal number of clusters present in a data set. Designed in nonparametric approach, it is independent of the distribution of given data. Interpoint distances between the…

Machine Learning · Computer Science 2022-10-18 Soumita Modak

When Should You Adjust Standard Errors for Clustering?

In empirical work it is common to estimate parameters of models and report associated standard errors that account for "clustering" of units, where clusters are defined by factors such as geography. Clustering adjustments are typically…

Statistics Theory · Mathematics 2022-09-21 Alberto Abadie , Susan Athey , Guido Imbens , Jeffrey Wooldridge

Between- and Within-Cluster Spearman Rank Correlations

Clustered data are common in practice. Clustering arises when subjects are measured repeatedly, or subjects are nested in groups (e.g., households, schools). It is often of interest to evaluate the correlation between two variables with…

Methodology · Statistics 2025-01-16 Shengxin Tu , Chun Li , Bryan E. Shepherd

A new interpoint distance-based clustering algorithm using kernel density estimation

A novel nonparametric clustering algorithm is proposed using the interpoint distances between the members of the data to reveal the inherent clustering structure existing in the given set of data, where we apply the classical nonparametric…

Methodology · Statistics 2024-09-02 Soumita Modak

Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis

Cognitive diagnosis models have been popularly used in fields such as education, psychology, and social sciences. While parametric likelihood estimation is a prevailing method for fitting cognitive diagnosis models, nonparametric…

Statistics Theory · Mathematics 2025-10-01 Chengyu Cui , Yanlong Liu , Gongjun Xu

On Hyperparameter Search in Cluster Ensembles

Quality assessments of models in unsupervised learning and clustering verification in particular have been a long-standing problem in the machine learning research. The lack of robust and universally applicable cluster validity scores often…

Machine Learning · Statistics 2018-03-30 Luzie Helfmann , Johannes von Lindheim , Mattes Mollenhauer , Ralf Banisch

Clustering functional data with measurement errors: a simulation-based approach

Clustering analysis of functional data, which comprises observations that evolve continuously over time or space, has gained increasing attention across various scientific disciplines. Practical applications often involve functional data…

Methodology · Statistics 2024-06-19 Tingyu Zhu , Lan Xue , Carmen Tekwe , Keith Diaz , Mark Benden , Roger Zoh

A semiparametric model for cluster data

In the analysis of cluster data, the regression coefficients are frequently assumed to be the same across all clusters. This hampers the ability to study the varying impacts of factors on each cluster. In this paper, a semiparametric model…

Statistics Theory · Mathematics 2009-08-25 Wenyang Zhang , Jianqing Fan , Yan Sun

Nonparametric Hierarchical Clustering of Functional Data

In this paper, we deal with the problem of curves clustering. We propose a nonparametric method which partitions the curves into clusters and discretizes the dimensions of the curve points into intervals. The cross-product of these…

Machine Learning · Statistics 2014-07-03 Marc Boullé , Romain Guigourès , Fabrice Rossi

Clustering by Nonparametric Smoothing

A novel formulation of the clustering problem is introduced in which the task is expressed as an estimation problem, where the object to be estimated is a function which maps a point to its distribution of cluster membership. Unlike…

Machine Learning · Computer Science 2025-10-14 David P. Hofmeyr

Spatially-Aware Comparison and Consensus for Clusterings

This paper proposes a new distance metric between clusterings that incorporates information about the spatial distribution of points and clusters. Our approach builds on the idea of a Hilbert space-based representation of clusters as a…

Machine Learning · Computer Science 2015-03-18 Parasaran Raman , Jeff M. Phillips , Suresh Venkatasubramanian

Granger Causality Based Hierarchical Time Series Clustering for State Estimation

Clustering is an unsupervised learning technique that is useful when working with a large volume of unlabeled data. Complex dynamical systems in real life often entail data streaming from a large number of sources. Although it is desirable…

Machine Learning · Computer Science 2021-05-20 Sin Yong Tan , Homagni Saha , Margarite Jacoby , Gregor P. Henze , Soumik Sarkar