Distributed Robust Principal Component Analysis
Distributed, Parallel, and Cluster Computing
2022-08-16 v2 Machine Learning
Abstract
We study the robust principal component analysis (RPCA) problem in a distributed setting. The goal of RPCA is to find an underlying low-rank estimation for a raw data matrix when the data matrix is subject to the corruption of gross sparse errors. Previous studies have developed RPCA algorithms that provide stable solutions with fast convergence. However, these algorithms are typically hard to scale and cannot be implemented distributedly, due to the use of either SVD or large matrix multiplication. In this paper, we propose the first distributed robust principal analysis algorithm based on consensus factorization, dubbed DCF-PCA. We prove the convergence of DCF-PCA and evaluate DCF-PCA on various problem setting
Cite
@article{arxiv.2207.11669,
title = {Distributed Robust Principal Component Analysis},
author = {Wenda Chu},
journal= {arXiv preprint arXiv:2207.11669},
year = {2022}
}
Comments
13 pages