Related papers: Zero-Shot Multi-Object Scene Completion

Automatic Objects Removal for Scene Completion

With the explosive growth of web-based cameras and mobile devices, billions of photographs are uploaded to the internet. We can trivially collect a huge number of photo streams for various goals, such as 3D scene reconstruction and other…

Computer Vision and Pattern Recognition · Computer Science 2016-11-18 Jianjun Yang , Yin Wang , Honggang Wang , Kun Hua , Wei Wang , Ju Shen

Diorama: Unleashing Zero-shot Single-view 3D Indoor Scene Modeling

Reconstructing structured 3D scenes from RGB images using CAD objects unlocks efficient and compact scene representations that maintain compositionality and interactability. Existing works propose training-heavy methods relying on either…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Qirui Wu , Denys Iliash , Daniel Ritchie , Manolis Savva , Angel X. Chang

Learning to Complete Object Shapes for Object-level Mapping in Dynamic Scenes

In this paper, we propose a novel object-level mapping system that can simultaneously segment, track, and reconstruct objects in dynamic scenes. It can further predict and complete their full geometries by conditioning on reconstructions…

Computer Vision and Pattern Recognition · Computer Science 2022-08-11 Binbin Xu , Andrew J. Davison , Stefan Leutenegger

OMEGAS: Object Mesh Extraction from Large Scenes Guided by Gaussian Segmentation

Recent advancements in 3D reconstruction technologies have paved the way for high-quality and real-time rendering of complex 3D scenes. Despite these achievements, a notable challenge persists: it is difficult to precisely reconstruct…

Computer Vision and Pattern Recognition · Computer Science 2024-08-29 Lizhi Wang , Feng Zhou , Bo yu , Pu Cao , Jianqin Yin

Tracking objects using 3D object proposals

3D object proposals, quickly detected regions in a 3D scene that likely contain an object of interest, are an effective approach to improve the computational efficiency and accuracy of the object detection framework. In this work, we…

Robotics · Computer Science 2018-06-27 Ramanpreet Singh Pahwa , Tian Tsong Ng , Minh N. Do

O$^2$-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model

Occlusion is a common issue in 3D reconstruction from RGB-D videos, often blocking the complete reconstruction of objects and presenting an ongoing problem. In this paper, we propose a novel framework, empowered by a 2D diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yubin Hu , Sheng Ye , Wang Zhao , Matthieu Lin , Yuze He , Yu-Hui Wen , Ying He , Yong-Jin Liu

ZeroScene: A Zero-Shot Framework for 3D Scene Generation from a Single Image and Controllable Texture Editing

In the field of 3D content generation, single image scene reconstruction methods still struggle to simultaneously ensure the quality of individual assets and the coherence of the overall scene in complex environments, while texture editing…

Graphics · Computer Science 2026-02-18 Xiang Tang , Ruotong Li , Xiaopeng Fan

SceneComplete: Open-World 3D Scene Completion in Cluttered Real World Environments for Robot Manipulation

Careful robot manipulation in every-day cluttered environments requires an accurate understanding of the 3D scene, in order to grasp and place objects stably and reliably and to avoid colliding with other objects. In general, we must…

Robotics · Computer Science 2025-11-11 Aditya Agarwal , Gaurav Singh , Bipasha Sen , Tomás Lozano-Pérez , Leslie Pack Kaelbling

Robust 3D Shape Reconstruction in Zero-Shot from a Single Image in the Wild

Recent monocular 3D shape reconstruction methods have shown promising zero-shot results on object-segmented images without any occlusions. However, their effectiveness is significantly compromised in real-world conditions, due to imperfect…

Computer Vision and Pattern Recognition · Computer Science 2025-06-10 Junhyeong Cho , Kim Youwang , Hunmin Yang , Tae-Hyun Oh

SAM3D-Phys: Towards Multi-Object Interactive Simulation in Real World

This work addresses the problem of recovering complete, simulatable object geometry from reconstructed real-world scenes, enabling physics-based interaction with objects embedded in the scene. While modern multi-view reconstruction methods…

Computer Vision and Pattern Recognition · Computer Science 2026-05-29 Xin Dong , Weijian Deng , Lihan Zhang , Tianru Dai , Wenfeng Deng , Yansong Tang

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Cheng Zhang , Zhaopeng Cui , Yinda Zhang , Bing Zeng , Marc Pollefeys , Shuaicheng Liu

Octree Diffusion for Semantic Scene Generation and Completion

The completion, extension, and generation of 3D semantic scenes are an interrelated set of capabilities that are useful for robotic navigation and exploration. Existing approaches seek to decouple these problems and solve them one-off.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Xujia Zhang , Brendan Crowe , Christoffer Heckman

Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture

Reconstructing detailed 3D scenes from single-view images remains a challenging task due to limitations in existing approaches, which primarily focus on geometric shape recovery, overlooking object appearances and fine shape details. To…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Yixin Chen , Junfeng Ni , Nan Jiang , Yaowei Zhang , Yixin Zhu , Siyuan Huang

Learning Human Mesh Recovery in 3D Scenes

We present a novel method for recovering the absolute pose and shape of a human in a pre-scanned scene given a single image. Unlike previous methods that perform sceneaware mesh optimization, we propose to first estimate absolute position…

Computer Vision and Pattern Recognition · Computer Science 2023-06-07 Zehong Shen , Zhi Cen , Sida Peng , Qing Shuai , Hujun Bao , Xiaowei Zhou

MID-Fusion: Octree-based Object-Level Multi-Instance Dynamic SLAM

We propose a new multi-instance dynamic RGB-D SLAM system using an object-level octree-based volumetric representation. It can provide robust camera tracking in dynamic environments and at the same time, continuously estimate geometric,…

Robotics · Computer Science 2019-03-25 Binbin Xu , Wenbin Li , Dimos Tzoumanikas , Michael Bloesch , Andrew Davison , Stefan Leutenegger

MonoMAE: Enhancing Monocular 3D Detection through Depth-Aware Masked Autoencoders

Monocular 3D object detection aims for precise 3D localization and identification of objects from a single-view image. Despite its recent progress, it often struggles while handling pervasive object occlusions that tend to complicate and…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Xueying Jiang , Sheng Jin , Xiaoqin Zhang , Ling Shao , Shijian Lu

CoReNet: Coherent 3D scene reconstruction from a single RGB image

Advances in deep learning techniques have allowed recent work to reconstruct the shape of a single object given only one RBG image as input. Building on common encoder-decoder architectures for this task, we propose three extensions: (1)…

Computer Vision and Pattern Recognition · Computer Science 2020-08-06 Stefan Popov , Pablo Bauszat , Vittorio Ferrari

Object-Scene-Camera Decomposition and Recomposition for Data-Efficient Monocular 3D Object Detection

Monocular 3D object detection (M3OD) is intrinsically ill-posed, hence training a high-performance deep learning based M3OD model requires a humongous amount of labeled data with complicated visual variation from diverse scenes, variety of…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Zhaonian Kuang , Rui Ding , Meng Yang , Xinhu Zheng , Gang Hua

IMFine: 3D Inpainting via Geometry-guided Multi-view Refinement

Current 3D inpainting and object removal methods are largely limited to front-facing scenes, facing substantial challenges when applied to diverse, "unconstrained" scenes where the camera orientation and trajectory are unrestricted. To…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Zhihao Shi , Dong Huo , Yuhongze Zhou , Kejia Yin , Yan Min , Juwei Lu , Xinxin Zuo

OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation

3D reconstruction has been widely used in autonomous navigation fields of mobile robotics. However, the former research can only provide the basic geometry structure without the capability of open-world scene understanding, limiting…

Computer Vision and Pattern Recognition · Computer Science 2024-08-12 Haochen Jiang , Yueming Xu , Yihan Zeng , Hang Xu , Wei Zhang , Jianfeng Feng , Li Zhang