English

Approximation Algorithms for Orthogonal Non-negative Matrix Factorization

Data Structures and Algorithms 2021-03-09 v2

Abstract

In the non-negative matrix factorization (NMF) problem, the input is an m×nm\times n matrix MM with non-negative entries and the goal is to factorize it as MAWM\approx AW. The m×km\times k matrix AA and the k×nk\times n matrix WW are both constrained to have non-negative entries. This is in contrast to singular value decomposition, where the matrices AA and WW can have negative entries but must satisfy the orthogonality constraint: the columns of AA are orthogonal and the rows of WW are also orthogonal. The orthogonal non-negative matrix factorization (ONMF) problem imposes both the non-negativity and the orthogonality constraints, and previous work showed that it leads to better performances than NMF on many clustering tasks. We give the first constant-factor approximation algorithm for ONMF when one or both of AA and WW are subject to the orthogonality constraint. We also show an interesting connection to the correlation clustering problem on bipartite graphs. Our experiments on synthetic and real-world data show that our algorithm achieves similar or smaller errors compared to previous ONMF algorithms while ensuring perfect orthogonality (many previous algorithms do not satisfy the hard orthogonality constraint).

Keywords

Cite

@article{arxiv.2103.01398,
  title  = {Approximation Algorithms for Orthogonal Non-negative Matrix Factorization},
  author = {Moses Charikar and Lunjia Hu},
  journal= {arXiv preprint arXiv:2103.01398},
  year   = {2021}
}

Comments

26 pages, 5 figures. To be published in AISTATS 2021. Font size increased