English
Related papers

Related papers: The Remarkable Simplicity of Very High Dimensional…

200 papers

An ultrametric topology formalizes the notion of hierarchical structure. An ultrametric embedding, referred to here as ultrametricity, is implied by a natural hierarchical embedding. Such hierarchical structure can be global in the data…

Data Analysis, Statistics and Probability · Physics 2007-05-23 Fionn Murtagh

Data analysis and data mining are concerned with unsupervised pattern finding and structure determination in data sets. "Structure" can be understood as symmetry and a range of symmetries are expressed by hierarchy. Such symmetries directly…

Machine Learning · Statistics 2015-03-17 Fionn Murtagh , Pedro Contreras

We begin with pervasive ultrametricity due to high dimensionality and/or spatial sparsity. How extent or degree of ultrametricity can be quantified leads us to the discussion of varied practical cases when ultrametricity can be partially or…

Statistics Theory · Mathematics 2011-01-11 Fionn Murtagh

High dimensional, sparsely populated data spaces have been characterized in terms of ultrametric topology. This implies that there are natural, not necessarily unique, tree or hierarchy structures defined by the ultrametric topology. In…

Computation and Language · Computer Science 2007-05-23 Fionn Murtagh

We study hierarchical clusterings of metric spaces that change over time. This is a natural geometric primitive for the analysis of dynamic data sets. Specifically, we introduce and study the problem of finding a temporally coherent…

Data Structures and Algorithms · Computer Science 2017-10-23 Tamal K. Dey , Alfred Rossi , Anastasios Sidiropoulos

Data analysis and data mining are concerned with unsupervised pattern finding and structure determination in data sets. The data sets themselves are explicitly linked as a form of representation to an observational or otherwise empirical…

Machine Learning · Statistics 2011-01-11 Fionn Murtagh

A novel method to obtain hierarchical and overlapping clusters from network data -i.e., a set of nodes endowed with pairwise dissimilarities- is presented. The introduced method is hierarchical in the sense that it outputs a nested…

Social and Information Networks · Computer Science 2017-12-13 Fernando Gama , Santiago Segarra , Alejandro Ribeiro

In a world abundant with diverse data arising from complex acquisition techniques, there is a growing need for new data analysis methods. In this paper we focus on high-dimensional data that are organized into several hierarchical datasets.…

Machine Learning · Computer Science 2021-04-06 Lior Aloni , Omer Bobrowski , Ronen Talmon

Hierarchical data representations in the context of classi cation and data clustering were put forward during the fties. Recently, hierarchical image representations have gained renewed interest for segmentation purposes. In this paper, we…

Discrete Mathematics · Computer Science 2012-09-19 Pierre Soille , Laurent Najman

The increasing needs of clustering massive datasets and the high cost of running clustering algorithms poses difficult problems for users. In this context it is important to determine if a data set is clusterable, that is, it may be…

Machine Learning · Computer Science 2020-01-08 Dan Simovici , Kaixun Hua

Hierarchical clustering is a powerful tool for exploratory data analysis, organizing data into a tree of clusterings from which a partition can be chosen. This paper generalizes these ideas by proving that, for any reasonable hierarchy, one…

Machine Learning · Computer Science 2025-11-13 Andrew Draganov , Pascal Weber , Rasmus Skibdahl Melanchton Jørgensen , Anna Beer , Claudia Plant , Ira Assent

We define a class of probability distributions that we call simplicial mixture models, inspired by simplicial complexes from algebraic topology. The parameters of these distributions represent their topology and we show that it is possible…

Statistics Theory · Mathematics 2019-09-24 James T. Griffin

High-dimensional images, or images with a high-dimensional attribute vector per pixel, are commonly explored with coordinated views of a low-dimensional embedding of the attribute space and a conventional image representation. Nowadays,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-02 Alexander Vieth , Boudewijn Lelieveldt , Elmar Eisemann , Anna Vilanova , Thomas Höllt

Learning the representation of data with hierarchical structures in the hyperbolic space attracts increasing attention in recent years. Due to the constant negative curvature, the hyperbolic space resembles tree metrics and captures the…

Machine Learning · Computer Science 2022-02-21 Huiru Xiao , Caigao Jiang , Yangqiu Song , James Zhang , Junwu Xiong

Consider observation data, comprised of n observation vectors with values on a set of attributes. This gives us n points in attribute space. Having data structured as a tree, implied by having our observations embedded in an ultrametric…

Information Retrieval · Computer Science 2012-02-17 Fionn Murtagh , Pedro Contreras

Datasets with non-trivial large scale topology can be hard to embed in low-dimensional Euclidean space with existing dimensionality reduction algorithms. We propose to model topologically complex datasets using vector bundles, in such a way…

Computational Geometry · Computer Science 2023-03-17 Luis Scoccola , Jose A. Perea

Topological data analysis is an emerging field that applies the study of topological invariants to data. Perhaps the simplest of these invariants is the number of connected components or clusters. In this work, we explore a topological…

Computational Geometry · Computer Science 2023-12-19 Ian Stewart Joyce , Grant Erdmann , Kirk P. Gardner , Ryan Kramer , Kyle Siegrist

Graph embeddings have emerged as a powerful tool for representing complex network structures in a low-dimensional space, enabling the use of efficient methods that employ the metric structure in the embedding space as a proxy for the…

Social and Information Networks · Computer Science 2024-04-18 Radosław Nowak , Adam Małkowski , Daniel Cieślak , Piotr Sokół , Paweł Wawrzyński

Different cell types aggregate and sort into hierarchical architectures during the formation of animal tissues. The resulting spatial organization depends (in part) on the strength of adhesion of one cell type to itself relative to other…

Quantitative Methods · Quantitative Biology 2023-08-02 Dhananjay Bhaskar , William Y. Zhang , Alexandria Volkening , Björn Sandstede , Ian Y. Wong

Scientific data has been growing in both size and complexity across the modern physical, engineering, life and social sciences. Spatial structure, for example, is a hallmark of many of the most important real-world complex systems, but its…

‹ Prev 1 2 3 10 Next ›