English

Statistical inference based on robust low-rank data matrix approximation

Statistics Theory 2014-02-28 v1 Statistics Theory

Abstract

The singular value decomposition is widely used to approximate data matrices with lower rank matrices. Feng and He [Ann. Appl. Stat. 3 (2009) 1634-1654] developed tests on dimensionality of the mean structure of a data matrix based on the singular value decomposition. However, the first singular values and vectors can be driven by a small number of outlying measurements. In this paper, we consider a robust alternative that moderates the effect of outliers in low-rank approximations. Under the assumption of random row effects, we provide the asymptotic representations of the robust low-rank approximation. These representations may be used in testing the adequacy of a low-rank approximation. We use oligonucleotide gene microarray data to demonstrate how robust singular value decomposition compares with the its traditional counterparts. Examples show that the robust methods often lead to a more meaningful assessment of the dimensionality of gene intensity data matrices.

Keywords

Cite

@article{arxiv.1402.6806,
  title  = {Statistical inference based on robust low-rank data matrix approximation},
  author = {Xingdong Feng and Xuming He},
  journal= {arXiv preprint arXiv:1402.6806},
  year   = {2014}
}

Comments

Published in at http://dx.doi.org/10.1214/13-AOS1186 the Annals of Statistics (http://www.imstat.org/aos/) by the Institute of Mathematical Statistics (http://www.imstat.org)

R2 v1 2026-06-22T03:16:52.304Z