Related papers: Perspective Plane Program Induction from a Single …

Inverse Graphics with Probabilistic CAD Models

Recently, multiple formulations of vision problems as probabilistic inversions of generative models based on computer graphics have been proposed. However, applications to 3D perception from natural images have focused on low-dimensional…

Computer Vision and Pattern Recognition · Computer Science 2014-07-08 Tejas D. Kulkarni , Vikash K. Mansinghka , Pushmeet Kohli , Joshua B. Tenenbaum

Multi-Plane Program Induction with 3D Box Priors

We consider two important aspects in understanding and editing images: modeling regular, program-like texture or patterns in 2D planes, and 3D posing of these planes in the scene. Unlike prior work on image-based program synthesis, which…

Computer Vision and Pattern Recognition · Computer Science 2020-11-24 Yikai Li , Jiayuan Mao , Xiuming Zhang , William T. Freeman , Joshua B. Tenenbaum , Noah Snavely , Jiajun Wu

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Cheng Zhang , Zhaopeng Cui , Yinda Zhang , Bing Zeng , Marc Pollefeys , Shuaicheng Liu

3DP3: 3D Scene Perception via Probabilistic Programming

We present 3DP3, a framework for inverse graphics that uses inference in a structured generative model of objects, scenes, and images. 3DP3 uses (i) voxel models to represent the 3D shape of objects, (ii) hierarchical scene graphs to…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Nishad Gothoskar , Marco Cusumano-Towner , Ben Zinberg , Matin Ghavamizadeh , Falk Pollok , Austin Garrett , Joshua B. Tenenbaum , Dan Gutfreund , Vikash K. Mansinghka

PlaneRecTR++: Unified Query Learning for Joint 3D Planar Reconstruction and Pose Estimation

The challenging task of 3D planar reconstruction from images involves several sub-tasks including frame-wise plane detection, segmentation, parameter regression and possibly depth prediction, along with cross-frame plane correspondence and…

Computer Vision and Pattern Recognition · Computer Science 2025-09-18 Jingjia Shi , Shuaifeng Zhi , Kai Xu

Probabilistic Reconstruction Networks for 3D Shape Inference from a Single Image

We study end-to-end learning strategies for 3D shape inference from images, in particular from a single image. Several approaches in this direction have been investigated that explore different shape representations and suitable learning…

Computer Vision and Pattern Recognition · Computer Science 2019-08-21 Roman Klokov , Jakob Verbeek , Edmond Boyer

Approximate Bayesian Image Interpretation using Generative Probabilistic Graphics Programs

The idea of computer vision as the Bayesian inverse problem to computer graphics has a long history and an appealing elegance, but it has proved difficult to directly implement. Instead, most vision tasks are approached via complex…

Artificial Intelligence · Computer Science 2013-07-02 Vikash K. Mansinghka , Tejas D. Kulkarni , Yura N. Perov , Joshua B. Tenenbaum

Shape, Illumination, and Reflectance from Shading

A fundamental problem in computer vision is that of inferring the intrinsic, 3D structure of the world from flat, 2D images of that world. Traditional methods for recovering scene properties such as shape, reflectance, or illumination rely…

Computer Vision and Pattern Recognition · Computer Science 2020-10-09 Jonathan T. Barron , Jitendra Malik

Inverse Rendering Techniques for Physically Grounded Image Editing

From a single picture of a scene, people can typically grasp the spatial layout immediately and even make good guesses at materials properties and where light is coming from to illuminate the scene. For example, we can reliably tell which…

Computer Vision and Pattern Recognition · Computer Science 2020-01-07 Kevin Karsch

Deep Proximal Learning for High-Resolution Plane Wave Compounding

Plane Wave imaging enables many applications that require high frame rates, including localisation microscopy, shear wave elastography, and ultra-sensitive Doppler. To alleviate the degradation of image quality with respect to conventional…

Signal Processing · Electrical Eng. & Systems 2021-12-24 Nishith Chennakeshava , Ben Luijten , Massimo Mischi , Yonina C. Eldar , Ruud J. G. van Sloun

Planar Surface Reconstruction from Sparse Views

The paper studies planar surface reconstruction of indoor scenes from two views with unknown camera poses. While prior approaches have successfully created object-centric reconstructions of many scenes, they fail to exploit other…

Computer Vision and Pattern Recognition · Computer Science 2021-08-23 Linyi Jin , Shengyi Qian , Andrew Owens , David F. Fouhey

Towards a Sampling Theory for Implicit Neural Representations

Implicit neural representations (INRs) have emerged as a powerful tool for solving inverse problems in computer vision and computational imaging. INRs represent images as continuous domain functions realized by a neural network taking…

Image and Video Processing · Electrical Eng. & Systems 2025-06-12 Mahrokh Najaf , Gregory Ongie

3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation

The ability to perceive and understand 3D scenes is crucial for many applications in computer vision and robotics. Inverse graphics is an appealing approach to 3D scene understanding that aims to infer the 3D scene structure from 2D images.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Guangyao Zhou , Nishad Gothoskar , Lirui Wang , Joshua B. Tenenbaum , Dan Gutfreund , Miguel Lázaro-Gredilla , Dileep George , Vikash K. Mansinghka

Holistic++ Scene Understanding: Single-view 3D Holistic Scene Parsing and Human Pose Estimation with Human-Object Interaction and Physical Commonsense

We propose a new 3D holistic++ scene understanding problem, which jointly tackles two tasks from a single-view image: (i) holistic scene parsing and reconstruction---3D estimations of object bounding boxes, camera pose, and room layout, and…

Computer Vision and Pattern Recognition · Computer Science 2019-09-05 Yixin Chen , Siyuan Huang , Tao Yuan , Siyuan Qi , Yixin Zhu , Song-Chun Zhu

Planar Geometry and Image Recovery from Motion-Blur

Existing works on motion deblurring either ignore the effects of depth-dependent blur or work with the assumption of a multi-layered scene wherein each layer is modeled in the form of fronto-parallel plane. In this work, we consider the…

Computer Vision and Pattern Recognition · Computer Science 2022-02-08 Kuldeep Purohit , Subeesh Vasu , M. Purnachandra Rao , A. N. Rajagopalan

AirPlanes: Accurate Plane Estimation via 3D-Consistent Embeddings

Extracting planes from a 3D scene is useful for downstream tasks in robotics and augmented reality. In this paper we tackle the problem of estimating the planar surfaces in a scene from posed images. Our first finding is that a surprisingly…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jamie Watson , Filippo Aleotti , Mohamed Sayed , Zawar Qureshi , Oisin Mac Aodha , Gabriel Brostow , Michael Firman , Sara Vicente

Physically-Based Editing of Indoor Scene Lighting from a Single Image

We present a method to edit complex indoor lighting from a single image with its predicted depth and light source segmentation masks. This is an extremely challenging problem that requires modeling complex light transport, and disentangling…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Zhengqin Li , Jia Shi , Sai Bi , Rui Zhu , Kalyan Sunkavalli , Miloš Hašan , Zexiang Xu , Ravi Ramamoorthi , Manmohan Chandraker

Equivariant Single View Pose Prediction Via Induced and Restricted Representations

Learning about the three-dimensional world from two-dimensional images is a fundamental problem in computer vision. An ideal neural network architecture for such tasks would leverage the fact that objects can be rotated and translated in…

Computer Vision and Pattern Recognition · Computer Science 2023-07-10 Owen Howell , David Klee , Ondrej Biza , Linfeng Zhao , Robin Walters

PNeRF: Probabilistic Neural Scene Representations for Uncertain 3D Visual Mapping

Recently neural scene representations have provided very impressive results for representing 3D scenes visually, however, their study and progress have mainly been limited to visualization of virtual models in computer graphics or scene…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Yassine Ahmine , Arnab Dey , Andrew I. Comport

Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes

Dense scene reconstruction for photo-realistic view synthesis has various applications, such as VR/AR, autonomous vehicles. However, most existing methods have difficulties in large-scale scenes due to three core challenges: \textit{(a)…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Tianchen Deng , Nailin Wang , Chongdi Wang , Shenghai Yuan , Jingchuan Wang , Hesheng Wang , Danwei Wang , Weidong Chen