English

Robust Principal Component Analysis Using Statistical Estimators

Artificial Intelligence 2012-07-03 v1

Abstract

Principal Component Analysis (PCA) finds a linear mapping and maximizes the variance of the data which makes PCA sensitive to outliers and may cause wrong eigendirection. In this paper, we propose techniques to solve this problem; we use the data-centering method and reestimate the covariance matrix using robust statistic techniques such as median, robust scaling which is a booster to data-centering and Huber M-estimator which measures the presentation of outliers and reweight them with small values. The results on several real world data sets show that our proposed method handles outliers and gains better results than the original PCA and provides the same accuracy with lower computation cost than the Kernel PCA using the polynomial kernel in classification tasks.

Keywords

Cite

@article{arxiv.1207.0403,
  title  = {Robust Principal Component Analysis Using Statistical Estimators},
  author = {Peratham Wiriyathammabhum and Boonserm Kijsirikul},
  journal= {arXiv preprint arXiv:1207.0403},
  year   = {2012}
}

Comments

In Proc. of the International Joint Conference on Computer Science and Software Engineering (JCSSE) 2009

R2 v1 2026-06-21T21:29:10.786Z