Related papers: Hausdorff clustering

Hausdorff clustering of financial time series

A clustering procedure, based on the Hausdorff distance, is introduced and tested on the financial time series of the Dow Jones Industrial Average (DJIA) index.

Physics and Society · Physics 2008-12-02 Nicolas Basalto , Roberto Bellotti , Francesco De Carlo , Paolo Facchi , Saverio Pascazio

Structural patterns in complex systems using multidendrograms

Complex systems are usually represented as an intricate set of relations between their components forming a complex graph or network. The understanding of their functioning and emergent properties are strongly related to their structural…

Data Analysis, Statistics and Probability · Physics 2014-01-08 Sergio Gomez , Alberto Fernandez , Clara Granell , Alex Arenas

Modern hierarchical, agglomerative clustering algorithms

This paper presents algorithms for hierarchical, agglomerative clustering which perform most efficiently in the general-purpose setup that is given in modern standard software. Requirements are: (1) the input data is given by pairwise…

Machine Learning · Statistics 2011-09-13 Daniel Müllner

A comparative study of divisive hierarchical clustering algorithms

A general scheme for divisive hierarchical clustering algorithms is proposed. It is made of three main steps : first a splitting procedure for the subdivision of clusters into two subclusters, second a local evaluation of the bipartitions…

Data Structures and Algorithms · Computer Science 2018-09-07 Maurice Roux

A Short Survey on Data Clustering Algorithms

With rapidly increasing data, clustering algorithms are important tools for data analytics in modern research. They have been successfully applied to a wide range of domains; for instance, bioinformatics, speech recognition, and financial…

Data Structures and Algorithms · Computer Science 2015-12-01 Ka-Chun Wong

Neural Network Clustering Based on Distances Between Objects

We present an algorithm of clustering of many-dimensional objects, where only the distances between objects are used. Centers of classes are found with the aid of neuron-like procedure with lateral inhibition. The result of clustering does…

Computer Vision and Pattern Recognition · Computer Science 2007-05-23 Leonid B. Litinskii , Dmitry E. Romanov

DECWA : Density-Based Clustering using Wasserstein Distance

Clustering is a data analysis method for extracting knowledge by discovering groups of data called clusters. Among these methods, state-of-the-art density-based clustering methods have proven to be effective for arbitrary-shaped clusters.…

Machine Learning · Computer Science 2023-10-26 Nabil El Malki , Robin Cugny , Olivier Teste , Franck Ravat

Clustering Multivariate Time Series using Energy Distance

A novel methodology is proposed for clustering multivariate time series data using energy distance defined in Sz\'ekely and Rizzo (2013). Specifically, a dissimilarity matrix is formed using the energy distance statistic to measure…

Methodology · Statistics 2024-03-13 Richard A. Davis , Leon Fernandes , Konstantinos Fokianos

Hierarchical Clustering with Prior Knowledge

Hierarchical clustering is a class of algorithms that seeks to build a hierarchy of clusters. It has been the dominant approach to constructing embedded classification schemes since it outputs dendrograms, which capture the hierarchical…

Machine Learning · Statistics 2018-08-28 Xiaofei Ma , Satya Dhavala

Solving non-uniqueness in agglomerative hierarchical clustering using multidendrograms

In agglomerative hierarchical clustering, pair-group methods suffer from a problem of non-uniqueness when two or more distances between different clusters coincide during the amalgamation process. The traditional approach for solving this…

Information Retrieval · Computer Science 2009-06-10 Alberto Fernandez , Sergio Gomez

Hierarchical clustering of bipartite data sets based on the statistical significance of coincidences

When some 'entities' are related by the 'features' they share they are amenable to a bipartite network representation. Plant-pollinator ecological communities, co-authorship of scientific papers, customers and purchases, or answers in a…

Social and Information Networks · Computer Science 2020-10-14 Ignacio Tamarit , María Pereda , José A. Cuesta

An algorithm for computing the centered Hausdorff measure of self-similar sets

We provide an algorithm for computing the centered Hausdorff measure of self-similar sets satisfying the strong separation condition. We prove the convergence of the algorithm and test its utility on some examples.

Metric Geometry · Mathematics 2015-05-28 Marta Llorente , Manuel Morán

Learning to Link

Clustering is an important part of many modern data analysis pipelines, including network analysis and data retrieval. There are many different clustering algorithms developed by various communities, and it is often not clear which…

Machine Learning · Computer Science 2019-10-04 Maria-Florina Balcan , Travis Dick , Manuel Lang

Determining the Hausdorff Distance Between Trees in Polynomial Time

The Hausdorff distance is a relatively new measure of similarity of graphs. The notion of the Hausdorff distance considers a special kind of a common subgraph of the compared graphs and depends on the structural properties outside of the…

Combinatorics · Mathematics 2023-06-22 Aleksander Kelenc

High Dimensional Cluster Analysis Using Path Lengths

A hierarchical scheme for clustering data is presented which applies to spaces with a high number of dimension ($N_{_{D}}>3$). The data set is first reduced to a smaller set of partitions (multi-dimensional bins). Multiple clustering…

Data Analysis, Statistics and Probability · Physics 2017-10-16 Kevin McIlhany , Stephen Wiggins

Hierarchical clustering: visualization, feature importance and model selection

We propose methods for the analysis of hierarchical clustering that fully use the multi-resolution structure provided by a dendrogram. Specifically, we propose a loss for choosing between clustering methods, a feature importance score and a…

Methodology · Statistics 2023-01-31 Luben M. C. Cabezas , Rafael Izbicki , Rafael B. Stern

Dynamic Clustering of Histogram Data Based on Adaptive Squared Wasserstein Distances

This paper deals with clustering methods based on adaptive distances for histogram data using a dynamic clustering algorithm. Histogram data describes individuals in terms of empirical distributions. These kind of data can be considered as…

Statistics Theory · Mathematics 2016-05-03 Antonio Irpino , Rosanna Verde , Francisco de AT De Carvalho

Clustering by Constructing Hyper-Planes

As a kind of basic machine learning method, clustering algorithms group data points into different categories based on their similarity or distribution. We present a clustering algorithm by finding hyper-planes to distinguish the data…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Luhong Diao , Jinying Gao1 , Manman Deng

Hausdorff Distance-Based Record Linkage for Improved Matching of Households and Individuals in Different Databases

Matching households and individuals across different databases poses challenges due to the lack of unique identifiers, typographical errors, and changes in attributes over time. Record linkage tools play a crucial role in overcoming these…

Applications · Statistics 2024-04-09 Thais Pacheco Menezes , Thomas Brendan Murphy , Michael Fop

Hierarchical Clustering in Astronomy

Hierarchical clustering is a common algorithm in data analysis. It is unique among many clustering algorithms in that it draws dendrograms based on the distance of data under a certain metric, and group them. It is widely used in all areas…

Instrumentation and Methods for Astrophysics · Physics 2022-11-14 Heng Yu , Xiaolan Hou