On Deep Multi-View Representation Learning: Objectives and Optimization

Weiran Wang; Raman Arora; Karen Livescu; Jeff Bilmes

Machine Learning 2016-02-03 v1

Authors: Weiran Wang, Raman Arora, Karen Livescu, Jeff Bilmes

Abstract

We consider learning representations (features) in the setting in which we have access to multiple unlabeled views of the data for learning while only one view is available for downstream tasks. Previous work on this problem has proposed several techniques based on deep neural networks, typically involving either autoencoder-like networks with a reconstruction objective or paired feedforward networks with a batch-style correlation-based objective. We analyze several techniques based on prior work, as well as new variants, and compare them empirically on image, speech, and text tasks. We find an advantage for correlation-based representation learning, while the best results on most tasks are obtained with our new variant, deep canonically correlated autoencoders (DCCAE). We also explore a stochastic optimization procedure for minibatch correlation-based objectives and discuss the time/performance trade-offs for kernel-based and neural network-based implementations.

Keywords

representation learning, deep learning, neural network

Cite

@article{arxiv.1602.01024,
  title  = {On Deep Multi-View Representation Learning: Objectives and Optimization},
  author = {Weiran Wang and Raman Arora and Karen Livescu and Jeff Bilmes},
  journal = {arXiv preprint arXiv:1602.01024},
  year   = {2016}
}
