Related papers: Learning to Reconstruct and Segment 3D Objects

Language-Assisted 3D Feature Learning for Semantic Scene Understanding

Learning descriptive 3D features is crucial for understanding 3D scenes with diverse objects and complex structures. However, it is usually unknown whether important geometric attributes and scene context obtain enough emphasis in an…

Computer Vision and Pattern Recognition · Computer Science 2022-12-13 Junbo Zhang , Guofan Fan , Guanghan Wang , Zhengyuan Su , Kaisheng Ma , Li Yi

Towards Part-Based Understanding of RGB-D Scans

Recent advances in 3D semantic scene understanding have shown impressive progress in 3D instance segmentation, enabling object-level reasoning about 3D scenes; however, a finer-grained understanding is required to enable interactions with…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Alexey Bokhovkin , Vladislav Ishimtsev , Emil Bogomolov , Denis Zorin , Alexey Artemov , Evgeny Burnaev , Angela Dai

Fully Understanding Generic Objects: Modeling, Segmentation, and Reconstruction

Inferring 3D structure of a generic object from a 2D image is a long-standing objective of computer vision. Conventional approaches either learn completely from CAD-generated synthetic data, which have difficulty in inference from real…

Computer Vision and Pattern Recognition · Computer Science 2021-04-05 Feng Liu , Luan Tran , Xiaoming Liu

Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions

Scene understanding has been of high interest in computer vision. It encompasses not only identifying objects in a scene, but also their relationships within the given context. With this goal, a recent line of works tackles 3D semantic…

Computer Vision and Pattern Recognition · Computer Science 2020-04-09 Johanna Wald , Helisa Dhamo , Nassir Navab , Federico Tombari

Complete 3D Scene Parsing from an RGBD Image

One major goal of vision is to infer physical models of objects, surfaces, and their layout from sensors. In this paper, we aim to interpret indoor scenes from one RGBD image. Our representation encodes the layout of orthogonal walls and…

Computer Vision and Pattern Recognition · Computer Science 2018-11-15 Chuhang Zou , Ruiqi Guo , Zhizhong Li , Derek Hoiem

Towards Scene Understanding with Detailed 3D Object Representations

Current approaches to semantic image and scene understanding typically employ rather simple object representations such as 2D or 3D bounding boxes. While such coarse models are robust and allow for reliable object detection, they discard…

Computer Vision and Pattern Recognition · Computer Science 2014-11-24 M. Zeeshan Zia , Michael Stark , Konrad Schindler

Recognizing Scenes from Novel Viewpoints

Humans can perceive scenes in 3D from a handful of 2D views. For AI agents, the ability to recognize a scene from any viewpoint given only a few images enables them to efficiently interact with the scene and its objects. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Shengyi Qian , Alexander Kirillov , Nikhila Ravi , Devendra Singh Chaplot , Justin Johnson , David F. Fouhey , Georgia Gkioxari

DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding

While deep neural networks have led to human-level performance on computer vision tasks, they have yet to demonstrate similar gains for holistic scene understanding. In particular, 3D context has been shown to be an extremely important cue…

Computer Vision and Pattern Recognition · Computer Science 2017-08-17 Yinda Zhang , Mingru Bai , Pushmeet Kohli , Shahram Izadi , Jianxiong Xiao

Learning 3D object-centric representation through prediction

As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability of perceiving objects situated in 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-03-07 John Day , Tushar Arora , Jirui Liu , Li Erran Li , Ming Bo Cai

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Cheng Zhang , Zhaopeng Cui , Yinda Zhang , Bing Zeng , Marc Pollefeys , Shuaicheng Liu

Deep Learning Perspective of Scene Understanding in Autonomous Robots

This paper provides a review of deep learning applications in scene understanding in autonomous robots, including innovations in object detection, semantic and instance segmentation, depth estimation, 3D reconstruction, and visual SLAM. It…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Afia Maham , Dur E Nayab Tashfa

Mask2CAD: 3D Shape Prediction by Learning to Segment and Retrieve

Object recognition has seen significant progress in the image domain, with focus primarily on 2D perception. We propose to leverage existing large-scale datasets of 3D models to understand the underlying 3D structure of objects seen in an…

Computer Vision and Pattern Recognition · Computer Science 2020-07-28 Weicheng Kuo , Anelia Angelova , Tsung-Yi Lin , Angela Dai

Multi Modal Semantic Segmentation using Synthetic Data

Semantic understanding of scenes in three-dimensional space (3D) is a quintessential part of robotics oriented applications such as autonomous driving as it provides geometric cues such as size, orientation and true distance of separation…

Computer Vision and Pattern Recognition · Computer Science 2019-11-01 Kartik Srivastava , Akash Kumar Singh , Guruprasad M. Hegde

Volumetric Instance-Aware Semantic Mapping and 3D Object Discovery

To autonomously navigate and plan interactions in real-world environments, robots require the ability to robustly perceive and map complex, unstructured surrounding scenes. Besides building an internal representation of the observed scene…

Robotics · Computer Science 2021-05-18 Margarita Grinvald , Fadri Furrer , Tonci Novkovic , Jen Jen Chung , Cesar Cadena , Roland Siegwart , Juan Nieto

Multiview Based 3D Scene Understanding On Partial Point Sets

Deep learning within the context of point clouds has gained much research interest in recent years mostly due to the promising results that have been achieved on a number of challenging benchmarks, such as 3D shape recognition and scene…

Computer Vision and Pattern Recognition · Computer Science 2018-12-06 Ye Zhu , Sven Ewan Shepstone , Pablo Martínez-Nuevo , Miklas Strøm Kristoffersen , Fabien Moutarde , Zhuang Fu

Aligning Text, Images, and 3D Structure Token-by-Token

Creating machines capable of understanding the world in 3D is essential in assisting designers that build and edit 3D environments and robots navigating and interacting within a three-dimensional space. Inspired by advances in language and…

Computer Vision and Pattern Recognition · Computer Science 2026-01-07 Aadarsh Sahoo , Vansh Tibrewal , Georgia Gkioxari

Panoptic 3D Scene Reconstruction From a Single RGB Image

Understanding 3D scenes from a single image is fundamental to a wide variety of tasks, such as for robotics, motion planning, or augmented reality. Existing works in 3D perception from a single RGB image tend to focus on geometric…

Computer Vision and Pattern Recognition · Computer Science 2022-05-17 Manuel Dahnert , Ji Hou , Matthias Nießner , Angela Dai

Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding

General scene understanding for robotics requires flexible semantic representation, so that novel objects and structures which may not have been known at training time can be identified, segmented and grouped. We present an algorithm which…

Computer Vision and Pattern Recognition · Computer Science 2022-10-07 Kirill Mazur , Edgar Sucar , Andrew J. Davison

Open-Ended Fine-Grained 3D Object Categorization by Combining Shape and Texture Features in Multiple Colorspaces

As a consequence of an ever-increasing number of service robots, there is a growing demand for highly accurate real-time 3D object recognition. Considering the expansion of robot applications in more complex and dynamic environments,it is…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Nils Keunecke , S. Hamidreza Kasaei

Review on 6D Object Pose Estimation with the focus on Indoor Scene Understanding

6D object pose estimation problem has been extensively studied in the field of Computer Vision and Robotics. It has wide range of applications such as robot manipulation, augmented reality, and 3D scene understanding. With the advent of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-13 Negar Nejatishahidin , Pooya Fayyazsanavi