English

A new heuristic algorithm for fast k-segmentation

Computer Vision and Pattern Recognition 2020-09-14 v1 Machine Learning Machine Learning

Abstract

The kk-segmentation of a video stream is used to partition it into kk piecewise-linear segments, so that each linear segment has a meaningful interpretation. Such segmentation may be used to summarize large videos using a small set of images, to identify anomalies within segments and change points between segments, and to select critical subsets for training machine learning models. Exact and approximate segmentation methods for kk-segmentation exist in the literature. Each of these algorithms occupies a different spot in the trade-off between computational complexity and accuracy. A novel heuristic algorithm is proposed in this paper to improve upon existing methods. It is empirically found to provide accuracies competitive with exact methods at a fraction of the computational expense. The new algorithm is inspired by Lloyd's algorithm for K-Means and Lloyd-Max algorithm for scalar quantization, and is called the LM algorithm for convenience. It works by iteratively minimizing a cost function from any given initialisation; the commonly used L2L_2 cost is chosen in this paper. While the greedy minimization makes the algorithm sensitive to initialisation, the ability to converge from any initial guess to a local optimum allows the algorithm to be integrated into other existing algorithms. Three variants of the algorithm are tested over a large number of synthetic datasets, one being a standalone LM implementation, and two others that combine with existing algorithms. One of the latter two -- LM-enhanced-Bottom-Up segmentation -- is found to have the best accuracy and the lowest computational complexity among all algorithms. This variant of LM can provide kk-segmentations over data sets with up to a million image frames within several seconds.

Keywords

Cite

@article{arxiv.2009.05148,
  title  = {A new heuristic algorithm for fast k-segmentation},
  author = {Sabarish Vadarevu and Vijay Karamcheti},
  journal= {arXiv preprint arXiv:2009.05148},
  year   = {2020}
}

Comments

10 pages, 10 figures, 5 tables, and 1 pseudo-code. Submitted to IEEE BigData 2020. Supplementary material (200 segmented videos) at https://figshare.com/articles/media/7-segmentation of 200 scenes from BDD100k/12859493/1