Related papers: Geometric anomaly detection in data

200 papers

The manifold hypothesis, which assumes that data lies on or close to an unknown manifold of low intrinsic dimension, is a staple of modern machine learning research. However, recent work has shown that real-world data exhibits distinct…

Machine Learning · Computer Science 2023-06-16 Julius von Rohrscheidt, Bastian Rieck

Timely detection of abrupt anomalies is crucial for real-time monitoring and security of modern systems producing high-dimensional data. With this goal, we propose effective and scalable algorithms. Proposed algorithms are nonparametric as…

Machine Learning · Computer Science 2020-02-19 Mehmet Necip Kurt, Yasin Yilmaz, Xiaodong Wang

We focus on the problem of identifying samples in a set that do not conform to structured patterns represented by low-dimensional manifolds. An effective way to solve this problem is to embed data in a high dimensional space, called…

Machine Learning · Computer Science 2025-05-19 Filippo Leveni, Luca Magri, Cesare Alippi, Giacomo Boracchi

Geometric alignment appears in a variety of applications, ranging from domain adaptation, optimal transport, and normalizing flows in machine learning; optical flow and learned augmentation in computer vision and deformable registration…

Computer Vision and Pattern Recognition · Computer Science 2021-10-27 Steffen Czolbe, Aasa Feragen, Oswin Krause

In a world abundant with diverse data arising from complex acquisition techniques, there is a growing need for new data analysis methods. In this paper we focus on high-dimensional data that are organized into several hierarchical datasets.…

Machine Learning · Computer Science 2021-04-06 Lior Aloni, Omer Bobrowski, Ronen Talmon

A clustering algorithm partitions a set of data points into smaller sets (clusters) such that each subset is more tightly packed than the whole. Many approaches to clustering translate the vector data into a graph with edges reflecting a…

Geometric Topology · Mathematics 2012-06-06 Jesse Johnson

Topological data analysis is becoming increasingly relevant to support the analysis of unstructured data sets. A common assumption in data analysis is that the data set is a sample---not necessarily a uniform one---of some high-dimensional…

Algebraic Topology · Mathematics 2021-01-20 Bastian Rieck, Markus Banagl, Filip Sadlo, Heike Leitte

Detecting the dimension of a hidden manifold from a point sample has become an important problem in the current data-driven era. Indeed, estimating the shape dimension is often the first step in studying the processes or phenomena…

Computational Geometry · Computer Science 2014-05-15 Tamal K. Dey, Fengtao Fan, Yusu Wang

Real data is often given as a point cloud, i.e. a finite set of points with pairwise distances between them. An important problem is to detect the topological shape of data --- for example, to approximate a point cloud by a low-dimensional…

Algebraic Topology · Mathematics 2018-10-09 Sara Kalisnik Verovsek, Vitaliy Kurlin, Davorin Lesnik

Given i.i.d. sample from a stratified mixture of immersed manifolds of different dimensions, we study the minimax estimation of the underlying stratified structure. We provide a constructive algorithm allowing to estimate each mixture…

Statistics Theory · Mathematics 2024-05-31 Eddie Aamari, Clément Berenfeld

Outlier, or anomaly, detection is essential for optimal performance of machine learning methods and statistical predictive models. It is not just a technical step in a data cleaning process but a key topic in many fields such as fraudulent…

Machine Learning · Computer Science 2020-02-19 O. Ramos Terrades, A. Berenguel, D. Gil

This paper introduces advanced techniques of Topological Data Analysis (TDA) for unsupervised anomaly detection and customer segmentation in banking data. Using the Mapper algorithm and persistent homology, we develop unsupervised…

Machine Learning · Computer Science 2025-08-21 Leonardo Aldo Alejandro Barberi, Linda Maria De Cave

In a variety of applications, one desires to detect groups of anomalous data samples, with a group potentially manifesting its atypicality (relative to a reference model) on a low-dimensional subset of the full measured set of features.…

Networking and Internet Architecture · Computer Science 2015-11-04 Zhicong Qiu, David J. Miller, George Kesidis

Mapping complex input data into suitable lower dimensional manifolds is a common procedure in machine learning. This step is beneficial mainly for two reasons: (1) it reduces the data dimensionality and (2) it provides a new data…

Machine Learning · Computer Science 2018-11-28 Daniele Zambon, Lorenzo Livi, Cesare Alippi

Topological data analysis (TDA) is a rising branch in modern applied mathematics. It extracts topological structures as features of a given space and uses these features to analyze digital data. Persistent homology, one of the central tools…

Algebraic Topology · Mathematics 2025-05-26 Chuan-Shen Hu

In the machine learning field, dimensionality reduction is an important task. It mitigates the undesired properties of high-dimensional spaces to facilitate classification, compression, and visualization of high-dimensional data. During the…

Machine Learning · Computer Science 2019-11-19 Mohammed Elhenawy, Mahmoud Masoud, Sebastian Glaser, Andry Rakotonirainy

Learning a latent embedding to understand the underlying nature of data distribution is often formulated in Euclidean spaces with zero curvature. However, the success of the geometry constraints, posed in the embedding space, indicates that…

Computer Vision and Pattern Recognition · Computer Science 2022-08-03 Jie Hong, Pengfei Fang, Weihao Li, Junlin Han, Lars Petersson, Mehrtash Harandi

We consider the problem of detecting anomalies in the directional distribution of fibre materials observed in 3D images. We divide the image into a set of scanning windows and classify them into two clusters: homogeneous material and…

Understanding the topological characteristics of data is important to many areas of research. Recent work has demonstrated that synthetic 4D image-type data can be useful to train 4D convolutional neural network models to see topological…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Khalil Mathieu Hannouch, Stephan Chalup

Anomalies are samples that significantly deviate from the rest of the data and their detection plays a major role in building machine learning models that can be reliably used in applications such as data-driven design and novelty…

Machine Learning · Statistics 2023-06-19 Amin Yousefpour, Mehdi Shishehbor, Zahra Zanjani Foumani, Ramin Bostanabad