English

Geometric anomaly detection in data

Algebraic Topology 2022-05-25 v1 Algebraic Geometry

Abstract

This paper describes the systematic application of local topological methods for detecting interfaces and related anomalies in complicated high-dimensional data. By examining the topology of small regions around each point, one can optimally stratify a given dataset into clusters, each of which is in turn well-approximable by a suitable submanifold of the ambient space. Since these approximating submanifolds might have different dimensions, we are able to detect non-manifold like singular regions in data even when none of the data points have been sampled from those singularities. We showcase this method by identifying the intersection of two surfaces in the 24-dimensional space of cyclo-octane conformations, and by locating all the self-intersections of a Henneberg minimal surface immersed in 3-dimensional space. Due to the local nature of the required topological computations, the algorithmic burden of performing such data stratification is readily distributable across several processors.

Keywords

Cite

@article{arxiv.1908.09397,
  title  = {Geometric anomaly detection in data},
  author = {Bernadette J Stolz and Jared Tanner and Heather A Harrington and Vidit Nanda},
  journal= {arXiv preprint arXiv:1908.09397},
  year   = {2022}
}