English

A Fast Incremental Gaussian Mixture Model

Machine Learning 2017-02-08 v2

Abstract

This work builds upon previous efforts in online incremental learning, namely the Incremental Gaussian Mixture Network (IGMN). The IGMN is capable of learning from data streams in a single-pass by improving its model after analyzing each data point and discarding it thereafter. Nevertheless, it suffers from the scalability point-of-view, due to its asymptotic time complexity of O(NKD3)\operatorname{O}\bigl(NKD^3\bigr) for NN data points, KK Gaussian components and DD dimensions, rendering it inadequate for high-dimensional data. In this paper, we manage to reduce this complexity to O(NKD2)\operatorname{O}\bigl(NKD^2\bigr) by deriving formulas for working directly with precision matrices instead of covariance matrices. The final result is a much faster and scalable algorithm which can be applied to high dimensional tasks. This is confirmed by applying the modified algorithm to high-dimensional classification datasets.

Keywords

Cite

@article{arxiv.1506.04422,
  title  = {A Fast Incremental Gaussian Mixture Model},
  author = {Rafael Pinto and Paulo Engel},
  journal= {arXiv preprint arXiv:1506.04422},
  year   = {2017}
}

Comments

10 pages, no figures, draft submission to Plos One

R2 v1 2026-06-22T09:53:24.489Z