Related papers: An outlier map for Support Vector Machine classifi…

LOSDD: Leave-Out Support Vector Data Description for Outlier Detection

Support Vector Machines have been successfully used for one-class classification (OCSVM, SVDD) when trained on clean data, but they work much worse on dirty data: outliers present in the training data tend to become support vectors, and are…

Machine Learning · Computer Science 2022-12-29 Daniel Boiar, Thomas Liebig, Erich Schubert

A Mathematical Optimization Approach to Multisphere Support Vector Data Description

We present a novel mathematical optimization framework for outlier detection in multimodal datasets, extending Support Vector Data Description approaches. We provide a primal formulation, in the shape of a Mixed Integer Second Order Cone…

Optimization and Control · Mathematics 2025-07-16 Víctor Blanco, Inmaculada Espejo, Raúl Páez, Antonio M. Rodríguez-Chía

Comparison of Outlier Detection Techniques for Structured Data

An outlier is an observation or data point that lies far from the rest of the data points in a given dataset; put differently, an outlier lies far from the center of mass of the observations. The presence of outliers can skew statistical measures…

Machine Learning · Computer Science 2021-06-17 Amulya Agarwal, Nitin Gupta

Outlier Detection in High Dimensional Data

High-dimensional data poses unique challenges in the outlier detection process. Most of the existing algorithms fail to properly address the issues stemming from a large number of features. In particular, outlier detection algorithms perform…

Machine Learning · Computer Science 2020-09-22 Firuz Kamalov, Ho Hon Leung

DOD: Detection of outliers in high dimensional data with distance of distances

Reliable outlier detection in high-dimensional data is crucial in modern science, yet it remains a challenging task. Traditional methods often break down in these settings due to their reliance on asymptotic behaviors with respect to sample…

Methodology · Statistics 2025-11-05 Seong-ho Lee, Yongho Jeon

Outlier Detection in Plantar Pressure: Human-Centered Comparison of Statistical Parametric Mapping and Explainable Machine Learning

Plantar pressure mapping is essential in clinical diagnostics and sports science, yet large heterogeneous datasets often contain outliers from technical errors or procedural inconsistencies. Statistical Parametric Mapping (SPM) provides…

Artificial Intelligence · Computer Science 2025-09-30 Carlo Dindorf, Jonas Dully, Steven Simon, Dennis Perchthaler, Stephan Becker, Hannah Ehmann, Kjell Heitmann, Bernd Stetter, Christian Diers, Michael Fröhlich

Detecting Outliers in High-dimensional Data with Mixed Variable Types using Conditional Gaussian Regression Models

Outlier detection has gained increasing interest in recent years, due to newly emerging technologies and the huge amount of high-dimensional data that are now available. Outlier detection can help practitioners to identify unwanted noise…

Statistics Theory · Mathematics 2021-05-20 Mads Lindskou, Torben Tvedebrink, Poul Svante Eriksen, Niels Morling

Toward Scalable and Unified Example-based Explanation and Outlier Detection

When neural networks are employed for high-stakes decision-making, it is desirable that they provide explanations for their prediction in order for us to understand the features that have contributed to the decision. At the same time, it is…

Machine Learning · Computer Science 2022-05-10 Penny Chong, Ngai-Man Cheung, Yuval Elovici, Alexander Binder

Anomaly Detection by Robust Statistics

Real data often contain anomalous cases, also known as outliers. These may spoil the resulting analysis but they may also contain valuable information. In either case, the ability to detect such anomalies is essential. A useful tool for…

Machine Learning · Statistics 2021-01-13 Peter J. Rousseeuw, Mia Hubert

A method for outlier detection based on cluster analysis and visual expert criteria

Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier…

Machine Learning · Computer Science 2025-10-28 Juan A. Lara, David Lizcano, Víctor Rampérez, Javier Soriano

Support vector machine for functional data classification

In many applications, input data are sampled functions taking their values in infinite dimensional spaces rather than standard vectors. This fact has complex consequences on data analysis algorithms that motivate modifications of them. In…

Statistics Theory · Mathematics 2007-05-23 Fabrice Rossi, Nathalie Villa

Outlier Detection and Spatial Analysis Algorithms

Outlier detection is a significant area in data mining. It can be either used to pre-process the data prior to an analysis or post the processing phase (before visualization) depending on the effectiveness of the outlier and its importance.…

Machine Learning · Statistics 2021-06-22 Jacob John

Outliers resistant image classification by anomaly detection

Various technologies, including computer vision models, are employed for the automatic monitoring of manual assembly processes in production. These models detect and classify events such as the presence of components in an assembly area or…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Anton Sergeev, Victor Minchenkov, Aleksei Soldatov, Vasiliy Kakurin, Yaroslav Mazikov

An Outlyingness Matrix for Multivariate Functional Data Classification

The classification of multivariate functional data is an important task in scientific research. Unlike point-wise data, functional data are usually classified by their shapes rather than by their scales. We define an outlyingness matrix by…

Methodology · Statistics 2018-04-24 Wenlin Dai, Marc G. Genton

Outlier Explanation via Sum-Product Networks

Outlier explanation is the task of identifying a set of features that distinguish a sample from normal data, which is important for downstream (human) decision-making. Existing methods are based on beam search in the space of feature…

Machine Learning · Computer Science 2022-07-19 Stefan Lüdtke, Christian Bartelt, Heiner Stuckenschmidt

Gaining Outlier Resistance with Progressive Quantiles: Fast Algorithms and Theoretical Studies

Outliers widely occur in big-data applications and may severely affect statistical estimation and inference. In this paper, a framework of outlier-resistant estimation is introduced to robustify an arbitrarily given loss function. It has a…

Methodology · Statistics 2023-04-20 Yiyuan She, Zhifeng Wang, Jiahui Shen

A geometric framework for outlier detection in high-dimensional data

Outlier or anomaly detection is an important task in data analysis. We discuss the problem from a geometrical perspective and provide a framework that exploits the metric structure of a data set. Our approach rests on the manifold…

Machine Learning · Statistics 2022-08-01 Moritz Herrmann, Florian Pfisterer, Fabian Scheipl

Fast Incremental SVDD Learning Algorithm with the Gaussian Kernel

Support vector data description (SVDD) is a machine learning technique that is used for single-class classification and outlier detection. The idea of SVDD is to find a set of support vectors that defines a boundary around data. When…

Machine Learning · Statistics 2018-11-05 Hansi Jiang, Haoyu Wang, Wenhao Hu, Deovrat Kakde, Arin Chaudhuri

Robust Statistical Scaling of Outlier Scores: Improving the Quality of Outlier Probabilities for Outliers (Extended Version)

Outlier detection algorithms typically assign an outlier score to each observation in a dataset, indicating the degree to which an observation is an outlier. However, these scores are often not comparable across algorithms and can be…

Machine Learning · Computer Science 2024-10-31 Philipp Röchner, Henrique O. Marques, Ricardo J. G. B. Campello, Arthur Zimek, Franz Rothlauf

Outlier Detection on Mixed-Type Data: An Energy-based Approach

Outlier detection amounts to finding data points that differ significantly from the norm. Classic outlier detection methods are largely designed for a single data type, such as continuous or discrete. However, real-world data is increasingly…

Machine Learning · Statistics 2016-08-18 Kien Do, Truyen Tran, Dinh Phung, Svetha Venkatesh