Anomaly Detection by Robust Statistics

Peter J. Rousseeuw; Mia Hubert

doi:10.1002/widm.1236

Anomaly Detection by Robust Statistics

Machine Learning 2021-01-13 v2

Authors: Peter J. Rousseeuw , Mia Hubert

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

Real data often contain anomalous cases, also known as outliers. These may spoil the resulting analysis but they may also contain valuable information. In either case, the ability to detect such anomalies is essential. A useful tool for this purpose is robust statistics, which aims to detect the outliers by first fitting the majority of the data and then flagging data points that deviate from it. We present an overview of several robust methods and the resulting graphical outlier detection tools. We discuss robust procedures for univariate, low-dimensional, and high-dimensional data, such as estimating location and scatter, linear regression, principal component analysis, classification, clustering, and functional data analysis. Also the challenging new topic of cellwise outliers is introduced.

Keywords

robust outlier detection statistical data analysis statistical inference and model selection

Cite

@article{arxiv.1707.09752,
  title  = {Anomaly Detection by Robust Statistics},
  author = {Peter J. Rousseeuw and Mia Hubert},
  journal= {arXiv preprint arXiv:1707.09752},
  year   = {2021}
}

Comments

To appear in WIREs Data Mining and Knowledge Discovery

Anomaly Detection by Robust Statistics

Abstract

Keywords

Cite

Comments

Related papers