Related papers: Object-Driven Multi-Layer Scene Decomposition From…

Peeking Behind Objects: Layered Depth Prediction from a Single Image

While conventional depth estimation can infer the geometry of a scene from a single RGB image, it fails to estimate scene regions that are occluded by foreground objects. This limits the use of depth prediction in augmented and virtual…

Computer Vision and Pattern Recognition · Computer Science 2019-05-09 Helisa Dhamo, Keisuke Tateno, Iro Laina, Nassir Navab, Federico Tombari

Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition

Existing scene understanding systems mainly focus on recognizing the visible parts of a scene, ignoring the intact appearance of physical objects in the real world. Concurrently, image completion has aimed to create plausible appearance for…

Computer Vision and Pattern Recognition · Computer Science 2021-04-13 Chuanxia Zheng, Duy-Son Dao, Guoxian Song, Tat-Jen Cham, Jianfei Cai

3D Photography using Context-aware Layered Depth Inpainting

We propose a method for converting a single RGB-D input image into a 3D photo - a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view. We use a…

Computer Vision and Pattern Recognition · Computer Science 2020-06-11 Meng-Li Shih, Shih-Yang Su, Johannes Kopf, Jia-Bin Huang

Counterfactual Depth from a Single RGB Image

We describe a method that predicts, from a single RGB image, a depth map that describes the scene when a masked object is removed - we call this "counterfactual depth" that models hidden scene geometry together with the observations. Our…

Computer Vision and Pattern Recognition · Computer Science 2019-09-04 Theerasit Issaranon, Chuhang Zou, David Forsyth

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

Learning to Predict Scene-Level Implicit 3D from Posed RGBD Data

We introduce a method that can learn to predict scene-level implicit functions for 3D reconstruction from posed RGBD data. At test time, our system maps a previously unseen RGB image to a 3D reconstruction of a scene via implicit functions.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-16 Nilesh Kulkarni, Linyi Jin, Justin Johnson, David F. Fouhey

Layer-structured 3D Scene Inference via View Synthesis

We present an approach to infer a layer-structured 3D representation of a scene from a single input image. This allows us to infer not only the depth of the visible pixels, but also to capture the texture and depth for content in the scene…

Computer Vision and Pattern Recognition · Computer Science 2018-07-27 Shubham Tulsiani, Richard Tucker, Noah Snavely

3D Scene Reconstruction with Multi-layer Depth and Epipolar Transformers

We tackle the problem of automatically reconstructing a complete 3D model of a scene from a single RGB image. This challenging task requires inferring the shape of both visible and occluded surfaces. Our approach utilizes viewer-centered,…

Computer Vision and Pattern Recognition · Computer Science 2019-08-28 Daeyun Shin, Zhile Ren, Erik B. Sudderth, Charless C. Fowlkes

Complete 3D Scene Parsing from an RGBD Image

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors. In this paper, we aim to interpret indoor scenes from one RGBD image. Our representation encodes the layout of orthogonal walls and…

Computer Vision and Pattern Recognition · Computer Science 2018-11-15 Chuhang Zou, Ruiqi Guo, Zhizhong Li, Derek Hoiem

Single Image Depth Estimation Trained via Depth from Defocus Cues

Estimating depth from a single RGB image is a fundamental task in computer vision, which is most directly solved using supervised deep learning. In the field of unsupervised learning of depth from a single RGB image, depth is not given…

Computer Vision and Pattern Recognition · Computer Science 2020-01-16 Shir Gur, Lior Wolf

Predicting Complete 3D Models of Indoor Scenes

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors. In this paper, we aim to interpret indoor scenes from one RGBD image. Our representation encodes the layout of walls, which must…

Computer Vision and Pattern Recognition · Computer Science 2017-08-21 Ruiqi Guo, Chuhang Zou, Derek Hoiem

Unsupervised Layered Image Decomposition into Object Prototypes

We present an unsupervised learning framework for decomposing images into layers of automatically discovered object models. Contrary to recent approaches that model image layers with autoencoder networks, we represent them as explicit…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Tom Monnier, Elliot Vincent, Jean Ponce, Mathieu Aubry

Referring Layer Decomposition

Precise, object-aware control over visual content is essential for advanced image editing and compositional generation. Yet, most existing approaches operate on entire images holistically, limiting the ability to isolate and manipulate…

Computer Vision and Pattern Recognition · Computer Science 2026-02-24 Fangyi Chen, Yaojie Shen, Lu Xu, Ye Yuan, Shu Zhang, Yulei Niu, Longyin Wen

Learning to See Through Obstructions with Layered Decomposition

We present a learning-based approach for removing unwanted obstructions, such as window reflections, fence occlusions, or adherent raindrops, from a short sequence of images captured by a moving camera. Our method leverages motion…

Computer Vision and Pattern Recognition · Computer Science 2021-07-27 Yu-Lun Liu, Wei-Sheng Lai, Ming-Hsuan Yang, Yung-Yu Chuang, Jia-Bin Huang

Unsupervised Learning for Intrinsic Image Decomposition from a Single Image

Intrinsic image decomposition, which is an essential task in computer vision, aims to infer the reflectance and shading of the scene. It is challenging since it needs to separate one image into two components. To tackle this, conventional…

Computer Vision and Pattern Recognition · Computer Science 2020-05-28 Yunfei Liu, Yu Li, Shaodi You, Feng Lu

Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors

Representing scenes at the granularity of objects is a prerequisite for scene understanding and decision making. We propose PriSMONet, a novel approach based on Prior Shape knowledge for learning Multi-Object 3D scene decomposition and…

Computer Vision and Pattern Recognition · Computer Science 2022-05-04 Cathrin Elich, Martin R. Oswald, Marc Pollefeys, Joerg Stueckler

Learning to Predict Indoor Illumination from a Single Image

We propose an automatic method to infer high dynamic range illumination from a single, limited field-of-view, low dynamic range photograph of an indoor scene. In contrast to previous work that relies on specialized image capture, user…

Computer Vision and Pattern Recognition · Computer Science 2017-11-22 Marc-André Gardner, Kalyan Sunkavalli, Ersin Yumer, Xiaohui Shen, Emiliano Gambaretto, Christian Gagné, Jean-François Lalonde

Physically-Based Editing of Indoor Scene Lighting from a Single Image

We present a method to edit complex indoor lighting from a single image with its predicted depth and light source segmentation masks. This is an extremely challenging problem that requires modeling complex light transport, and disentangling…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Zhengqin Li, Jia Shi, Sai Bi, Rui Zhu, Kalyan Sunkavalli, Miloš Hašan, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker

Implicit Mesh Reconstruction from Unannotated Image Collections

We present an approach to infer the 3D shape, texture, and camera pose for an object from a single RGB image, using only category-level image collections with foreground masks as supervision. We represent the shape as an image-conditioned…

Computer Vision and Pattern Recognition · Computer Science 2020-07-17 Shubham Tulsiani, Nilesh Kulkarni, Abhinav Gupta

Object-Centric Image Generation with Factored Depths, Locations, and Appearances

We present a generative model of images that explicitly reasons over the set of objects they show. Our model learns a structured latent representation that separates objects from each other and from the background; unlike prior works, it…

Machine Learning · Computer Science 2020-04-03 Titas Anciukevicius, Christoph H. Lampert, Paul Henderson