Related papers: Outlier Detection by Consistent Data Selection Met…

A method for outlier detection based on cluster analysis and visual expert criteria

Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier…

Machine Learning · Computer Science 2025-10-28 Juan A. Lara , David Lizcano , Víctor Rampérez , Javier Soriano

Comparison of Outlier Detection Techniques for Structured Data

An outlier is an observation or a data point that is far from rest of the data points in a given dataset or we can be said that an outlier is away from the center of mass of observations. Presence of outliers can skew statistical measures…

Machine Learning · Computer Science 2021-06-17 Amulya Agarwal , Nitin Gupta

Credit Card Fraud Detection in e-Commerce: An Outlier Detection Approach

Often the challenge associated with tasks like fraud and spam detection is the lack of all likely patterns needed to train suitable supervised learning models. This problem accentuates when the fraudulent patterns are not only scarce, they…

Machine Learning · Computer Science 2019-05-08 Utkarsh Porwal , Smruthi Mukund

Robust Outlier Detection Technique in Data Mining: A Univariate Approach

Outliers are the points which are different from or inconsistent with the rest of the data. They can be novel, new, abnormal, unusual or noisy information. Outliers are sometimes more interesting than the majority of the data. The main…

Computer Vision and Pattern Recognition · Computer Science 2014-06-20 Singh Vijendra , Pathak Shivani

Outlier detection for mixed-type data: A novel approach

Outlier detection can serve as an extremely important tool for researchers from a wide range of fields. From the sectors of banking and marketing to the social sciences and healthcare sectors, outlier detection techniques are very useful…

Methodology · Statistics 2023-12-12 Efthymios Costa , Ioanna Papatsouma

Are Outlier Detection Methods Resilient to Sampling?

Outlier detection is a fundamental task in data mining and has many applications including detecting errors in databases. While there has been extensive prior work on methods for outlier detection, modern datasets often have sizes that are…

Machine Learning · Computer Science 2019-08-01 Laure Berti-Equille , Ji Meng Loh , Saravanan Thirumuruganathan

Outlier Detection for Improved Data Quality and Diversity in Dialog Systems

In a corpus of data, outliers are either errors: mistakes in the data that are counterproductive, or are unique: informative samples that improve model robustness. Identifying outliers can lead to better datasets by (1) removing noise in…

Computation and Language · Computer Science 2019-04-08 Stefan Larson , Anish Mahendran , Andrew Lee , Jonathan K. Kummerfeld , Parker Hill , Michael A. Laurenzano , Johann Hauswald , Lingjia Tang , Jason Mars

Detecting outliers by clustering algorithms

Clustering and outlier detection are two important tasks in data mining. Outliers frequently interfere with clustering algorithms to determine the similarity between objects, resulting in unreliable clustering results. Currently, only a few…

Machine Learning · Computer Science 2024-12-10 Qi Li , Shuliang Wang

A Practical Algorithm for Distributed Clustering and Outlier Detection

We study the classic $k$-means/median clustering, which are fundamental problems in unsupervised learning, in the setting where data are partitioned across multiple sites, and where we are allowed to discard a small portion of the data by…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-12 Jiecao Chen , Erfan Sadeqi Azer , Qin Zhang

An empirical comparison of some outlier detection methods with longitudinal data

This note investigates the problem of detecting outliers in longitudinal data. It compares well-known methods used in official statistics with proposals from the fields of data mining and machine learning that are based on the distance…

Methodology · Statistics 2025-07-30 Marcello D'Orazio

Robust Statistical Scaling of Outlier Scores: Improving the Quality of Outlier Probabilities for Outliers (Extended Version)

Outlier detection algorithms typically assign an outlier score to each observation in a dataset, indicating the degree to which an observation is an outlier. However, these scores are often not comparable across algorithms and can be…

Machine Learning · Computer Science 2024-10-31 Philipp Röchner , Henrique O. Marques , Ricardo J. G. B. Campello , Arthur Zimek , Franz Rothlauf

Linear-time Outlier Detection via Sensitivity

Outliers are ubiquitous in modern data sets. Distance-based techniques are a popular non-parametric approach to outlier detection as they require no prior assumptions on the data generating distribution and are simple to implement. Scaling…

Machine Learning · Statistics 2016-05-04 Mario Lucic , Olivier Bachem , Andreas Krause

An Optimization Model for Outlier Detection in Categorical Data

The task of outlier detection is to find small groups of data objects that are exceptional when compared with rest large amount of data. Detection of such outliers is important for many applications such as fraud detection and customer…

Databases · Computer Science 2007-05-23 Zengyou He , Xiaofei Xu , Shengchun Deng

A Cluster-based Approach for Outlier Detection in Dynamic Data Streams (KORM: k-median OutlieR Miner)

Outlier detection in data streams has gained wide importance presently due to the increasing cases of fraud in various applications of data streams. The techniques for outlier detection have been divided into either statistics based,…

Distributed, Parallel, and Cluster Computing · Computer Science 2010-03-25 Parneeta Dhaliwal , M. P. S. Bhatia , Priti Bansal

Outlier Detection Using a Novel method: Quantum Clustering

We propose a new assumption in outlier detection: Normal data instances are commonly located in the area that there is hardly any fluctuation on data density, while outliers are often appeared in the area that there is violent fluctuation…

Machine Learning · Computer Science 2020-06-09 Ding Liu , Hui Li

Augment to Detect Anomalies with Continuous Labelling

Anomaly detection is to recognize samples that differ in some respect from the training observations. These samples which do not conform to the distribution of normal data are called outliers or anomalies. In real-world anomaly detection…

Computer Vision and Pattern Recognition · Computer Science 2022-07-05 Vahid Reza Khazaie , Anthony Wong , Yalda Mohsenzadeh

Outlier absorbing based on a Bayesian approach

The presence of outliers is prevalent in machine learning applications and may produce misleading results. In this paper a new method for dealing with outliers and anomal samples is proposed. To overcome the outlier issue, the proposed…

Machine Learning · Computer Science 2016-07-05 Parsa Bagherzadeh , Hadi Sadoghi Yazdi

Detecting Outliers in Data with Correlated Measures

Advances in sensor technology have enabled the collection of large-scale datasets. Such datasets can be extremely noisy and often contain a significant amount of outliers that result from sensor malfunction or human operation faults. In…

Machine Learning · Computer Science 2018-08-28 Yu-Hsuan Kuo , Zhenhui Li , Daniel Kifer

Long-Term Outlier Prediction Through Outlier Score Modeling

This study addresses an important gap in time series outlier detection by proposing a novel problem setting: long-term outlier prediction. Conventional methods primarily focus on immediate detection by identifying deviations from normal…

Machine Learning · Computer Science 2026-03-24 Yuma Aoki , Joon Park , Koh Takeuchi , Hisashi Kashima , Shinya Akimoto , Ryuichi Hashimoto , Takahiro Adachi , Takeshi Kishikawa , Takamitsu Sasaki

WePaMaDM-Outlier Detection: Weighted Outlier Detection using Pattern Approaches for Mass Data Mining

Weighted Outlier Detection is a method for identifying unusual or anomalous data points in a dataset, which can be caused by various factors like human error, fraud, or equipment malfunctions. Detecting outliers can reveal vital information…

Machine Learning · Computer Science 2023-06-13 Ravindrakumar Purohit , Jai Prakash Verma , Rachna Jain , Madhuri Bhavsar