Related papers: A Fast Greedy Algorithm for Outlier Mining

An Optimization Model for Outlier Detection in Categorical Data

The task of outlier detection is to find small groups of data objects that are exceptional when compared with rest large amount of data. Detection of such outliers is important for many applications such as fraud detection and customer…

Databases · Computer Science 2007-05-23 Zengyou He , Xiaofei Xu , Shengchun Deng

Detecting outliers by clustering algorithms

Clustering and outlier detection are two important tasks in data mining. Outliers frequently interfere with clustering algorithms to determine the similarity between objects, resulting in unreliable clustering results. Currently, only a few…

Machine Learning · Computer Science 2024-12-10 Qi Li , Shuliang Wang

An Evaluation of Classification and Outlier Detection Algorithms

This paper evaluates algorithms for classification and outlier detection accuracies in temporal data. We focus on algorithms that train and classify rapidly and can be used for systems that need to incorporate new data regularly. Hence, we…

Machine Learning · Statistics 2018-05-03 Victoria J. Hodge , Jim Austin

Coverage-based Outlier Explanation

Outlier detection is a core task in data mining with a plethora of algorithms that have enjoyed wide scale usage. Existing algorithms are primarily focused on detection, that is the identification of outliers in a given dataset. In this…

Machine Learning · Computer Science 2019-11-11 Yue Wu , Leman Akoglu , Ian Davidson

Outlier Detection and Spatial Analysis Algorithms

Outlier detection is a significant area in data mining. It can be either used to pre-process the data prior to an analysis or post the processing phase (before visualization) depending on the effectiveness of the outlier and its importance.…

Machine Learning · Statistics 2021-06-22 Jacob John

A Practical Algorithm for Distributed Clustering and Outlier Detection

We study the classic $k$-means/median clustering, which are fundamental problems in unsupervised learning, in the setting where data are partitioned across multiple sites, and where we are allowed to discard a small portion of the data by…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-12 Jiecao Chen , Erfan Sadeqi Azer , Qin Zhang

Outlier Detection by Consistent Data Selection Method

Often the challenge associated with tasks like fraud and spam detection[1] is the lack of all likely patterns needed to train suitable supervised learning models. In order to overcome this limitation, such tasks are attempted as outlier or…

Machine Learning · Computer Science 2018-08-22 Utkarsh Porwal , Smruthi Mukund

Robust Outlier Detection Technique in Data Mining: A Univariate Approach

Outliers are the points which are different from or inconsistent with the rest of the data. They can be novel, new, abnormal, unusual or noisy information. Outliers are sometimes more interesting than the majority of the data. The main…

Computer Vision and Pattern Recognition · Computer Science 2014-06-20 Singh Vijendra , Pathak Shivani

Interpreting Outliers: Localized Logistic Regression for Density Ratio Estimation

We propose an inlier-based outlier detection method capable of both identifying the outliers and explaining why they are outliers, by identifying the outlier-specific features. Specifically, we employ an inlier-based outlier detection…

Machine Learning · Statistics 2017-02-22 Makoto Yamada , Song Liu , Samuel Kaski

Outlier Detection in High Dimensional Data

High-dimensional data poses unique challenges in outlier detection process. Most of the existing algorithms fail to properly address the issues stemming from a large number of features. In particular, outlier detection algorithms perform…

Machine Learning · Computer Science 2020-09-22 Firuz Kamalov , Ho Hon Leung

Linear-time Outlier Detection via Sensitivity

Outliers are ubiquitous in modern data sets. Distance-based techniques are a popular non-parametric approach to outlier detection as they require no prior assumptions on the data generating distribution and are simple to implement. Scaling…

Machine Learning · Statistics 2016-05-04 Mario Lucic , Olivier Bachem , Andreas Krause

MSD-Kmeans: A Novel Algorithm for Efficient Detection of Global and Local Outliers

Outlier detection is a technique in data mining that aims to detect unusual or unexpected records in the dataset. Existing outlier detection algorithms have different pros and cons and exhibit different sensitivity to noisy data such as…

Machine Learning · Computer Science 2023-12-22 Yuanyuan Wei , Julian Jang-Jaccard , Fariza Sabrina , Timothy McIntosh

A Novel Geometric Approach for Outlier Recognition in High Dimension

Outlier recognition is a fundamental problem in data analysis and has attracted a great deal of attention in the past decades. However, most existing methods still suffer from several issues such as high time and space complexities or…

Computational Geometry · Computer Science 2019-04-09 Hu Ding , Mingquan Ye

A method for outlier detection based on cluster analysis and visual expert criteria

Outlier detection is an important problem occurring in a wide range of areas. Outliers are the outcome of fraudulent behaviour, mechanical faults, human error, or simply natural deviations. Many data mining applications perform outlier…

Machine Learning · Computer Science 2025-10-28 Juan A. Lara , David Lizcano , Víctor Rampérez , Javier Soriano

Robust Linear Regression Analysis - A Greedy Approach

The task of robust linear estimation in the presence of outliers is of particular importance in signal processing, statistics and machine learning. Although the problem has been stated a few decades ago and solved using classical…

Information Theory · Computer Science 2023-07-19 George Papageorgiou , Pantelis Bouboulis , Sergios Theodoridis , Kostantinos Themelis

Comparison of Outlier Detection Algorithms on String Data

Outlier detection is a well-researched and crucial problem in machine learning. However, there is little research on string data outlier detection, as most literature focuses on outlier detection of numerical data. A robust string data…

Machine Learning · Computer Science 2026-03-13 Philip Maus

Finding Outliers in Gaussian Model-Based Clustering

Clustering, or unsupervised classification, is a task often plagued by outliers. Yet there is a paucity of work on handling outliers in clustering. Outlier identification algorithms tend to fall into three broad categories: outlier…

Methodology · Statistics 2024-05-31 Katharine M. Clark , Paul D. McNicholas

Outlier Detection using Improved Genetic K-means

The outlier detection problem in some cases is similar to the classification problem. For example, the main concern of clustering-based outlier detection algorithms is to find clusters and outliers, which are often regarded as noise that…

Machine Learning · Computer Science 2014-05-25 M. H. Marghny , Ahmed I. Taloba

Practical Bayesian optimization in the presence of outliers

Inference in the presence of outliers is an important field of research as outliers are ubiquitous and may arise across a variety of problems and domains. Bayesian optimization is method that heavily relies on probabilistic inference. This…

Machine Learning · Computer Science 2017-12-14 Ruben Martinez-Cantin , Kevin Tee , Michael McCourt

Outlier Detection from Network Data with Subnetwork Interpretation

Detecting a small number of outliers from a set of data observations is always challenging. This problem is more difficult in the setting of multiple network samples, where computing the anomalous degree of a network sample is generally not…

Artificial Intelligence · Computer Science 2016-10-04 Xuan-Hong Dang , Arlei Silva , Ambuj Singh , Ananthram Swami , Prithwish Basu