English

Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer

Computer Vision and Pattern Recognition 2019-11-22 v2

Abstract

Many machine learning models operate on images, but ignore the fact that images are 2D projections formed by 3D geometry interacting with light, in a process called rendering. Enabling ML models to understand image formation might be key for generalization. However, due to an essential rasterization step involving discrete assignment operations, rendering pipelines are non-differentiable and thus largely inaccessible to gradient-based ML techniques. In this paper, we present {\emph DIB-R}, a differentiable rendering framework which allows gradients to be analytically computed for all pixels in an image. Key to our approach is to view foreground rasterization as a weighted interpolation of local properties and background rasterization as a distance-based aggregation of global geometry. Our approach allows for accurate optimization over vertex positions, colors, normals, light directions and texture coordinates through a variety of lighting models. We showcase our approach in two ML applications: single-image 3D object prediction, and 3D textured object generation, both trained using exclusively using 2D supervision. Our project website is: https://nv-tlabs.github.io/DIB-R/

Keywords

Cite

@article{arxiv.1908.01210,
  title  = {Learning to Predict 3D Objects with an Interpolation-based Differentiable Renderer},
  author = {Wenzheng Chen and Jun Gao and Huan Ling and Edward J. Smith and Jaakko Lehtinen and Alec Jacobson and Sanja Fidler},
  journal= {arXiv preprint arXiv:1908.01210},
  year   = {2019}
}

Comments

Accepted to NeurIPS 2019