English

Learning a Predictable and Generative Vector Representation for Objects

Computer Vision and Pattern Recognition 2016-09-01 v2

Abstract

What is a good vector representation of an object? We believe that it should be generative in 3D, in the sense that it can produce new 3D objects; as well as be predictable from 2D, in the sense that it can be perceived from 2D images. We propose a novel architecture, called the TL-embedding network, to learn an embedding space with these properties. The network consists of two components: (a) an autoencoder that ensures the representation is generative; and (b) a convolutional network that ensures the representation is predictable. This enables tackling a number of tasks including voxel prediction from 2D images and 3D model retrieval. Extensive experimental analysis demonstrates the usefulness and versatility of this embedding.

Keywords

Cite

@article{arxiv.1603.08637,
  title  = {Learning a Predictable and Generative Vector Representation for Objects},
  author = {Rohit Girdhar and David F. Fouhey and Mikel Rodriguez and Abhinav Gupta},
  journal= {arXiv preprint arXiv:1603.08637},
  year   = {2016}
}

Comments

To appear in ECCV 2016. Project webpage: rohitgirdhar.github.io/GenerativePredictableVoxels/

R2 v1 2026-06-22T13:20:11.638Z