English

Multimodal Sparse Coding for Event Detection

Machine Learning 2016-05-18 v1 Computer Vision and Pattern Recognition

Abstract

Unsupervised feature learning methods have proven effective for classification tasks based on a single modality. We present multimodal sparse coding for learning feature representations shared across multiple modalities. The shared representations are applied to multimedia event detection (MED) and evaluated in comparison to unimodal counterparts, as well as other feature learning methods such as GMM supervectors and sparse RBM. We report the cross-validated classification accuracy and mean average precision of the MED system trained on features learned from our unimodal and multimodal settings for a subset of the TRECVID MED 2014 dataset.

Keywords

Cite

@article{arxiv.1605.05212,
  title  = {Multimodal Sparse Coding for Event Detection},
  author = {Youngjune Gwon and William Campbell and Kevin Brady and Douglas Sturim and Miriam Cha and H. T. Kung},
  journal= {arXiv preprint arXiv:1605.05212},
  year   = {2016}
}

Comments

Multimodal Machine Learning Workshop at NIPS 2015