English

Principal Graphs and Manifolds

Machine Learning 2011-05-10 v2 Neural and Evolutionary Computing Machine Learning

Abstract

In many physical, statistical, biological and other investigations it is desirable to approximate a system of points by objects of lower dimension and/or complexity. For this purpose, Karl Pearson invented principal component analysis in 1901 and found 'lines and planes of closest fit to system of points'. The famous k-means algorithm solves the approximation problem too, but by finite sets instead of lines and planes. This chapter gives a brief practical introduction into the methods of construction of general principal objects, i.e. objects embedded in the 'middle' of the multidimensional data set. As a basis, the unifying framework of mean squared distance approximation of finite datasets is selected. Principal graphs and manifolds are constructed as generalisations of principal components and k-means principal points. For this purpose, the family of expectation/maximisation algorithms with nearest generalisations is presented. Construction of principal graphs with controlled complexity is based on the graph grammar approach.

Keywords

Cite

@article{arxiv.0809.0490,
  title  = {Principal Graphs and Manifolds},
  author = {A. N. Gorban and A. Y. Zinovyev},
  journal= {arXiv preprint arXiv:0809.0490},
  year   = {2011}
}

Comments

36 pages, 6 figures, minor corrections

R2 v1 2026-06-21T11:16:13.892Z