English

Inverting Visual Representations with Convolutional Networks

Neural and Evolutionary Computing 2016-04-28 v4 Computer Vision and Pattern Recognition Machine Learning

Abstract

Feature representations, both hand-designed and learned ones, are often hard to analyze and interpret, even when they are extracted from visual data. We propose a new approach to study image representations by inverting them with an up-convolutional neural network. We apply the method to shallow representations (HOG, SIFT, LBP), as well as to deep networks. For shallow representations our approach provides significantly better reconstructions than existing methods, revealing that there is surprisingly rich information contained in these features. Inverting a deep network trained on ImageNet provides several insights into the properties of the feature representation learned by the network. Most strikingly, the colors and the rough contours of an image can be reconstructed from activations in higher network layers and even from the predicted class probabilities.

Keywords

Cite

@article{arxiv.1506.02753,
  title  = {Inverting Visual Representations with Convolutional Networks},
  author = {Alexey Dosovitskiy and Thomas Brox},
  journal= {arXiv preprint arXiv:1506.02753},
  year   = {2016}
}

Comments

Version 4 - final version to appear in CVPR-2016. Visually better results obtained with feature similarity and adversarial training are in a different paper - arXiv:1602.02644

R2 v1 2026-06-22T09:49:48.327Z