Related papers: OccuSeg: Occupancy-aware 3D Instance Segmentation

Scene as Occupancy

Human driver can easily describe the complex traffic scene by visual system. Such an ability of precise perception is essential for driver's planning. To achieve this, a geometry-aware representation that quantizes the physical 3D scene…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Chonghao Sima , Wenwen Tong , Tai Wang , Li Chen , Silei Wu , Hanming Deng , Yi Gu , Lewei Lu , Ping Luo , Dahua Lin , Hongyang Li

Occ3D: A Large-Scale 3D Occupancy Prediction Benchmark for Autonomous Driving

Robotic perception requires the modeling of both 3D geometry and semantics. Existing methods typically focus on estimating 3D bounding boxes, neglecting finer geometric details and struggling to handle general, out-of-vocabulary objects. 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-12-14 Xiaoyu Tian , Tao Jiang , Longfei Yun , Yucheng Mao , Huitong Yang , Yue Wang , Yilun Wang , Hang Zhao

OccFusion: Depth Estimation Free Multi-sensor Fusion for 3D Occupancy Prediction

3D occupancy prediction based on multi-sensor fusion,crucial for a reliable autonomous driving system, enables fine-grained understanding of 3D scenes. Previous fusion-based 3D occupancy predictions relied on depth estimation for processing…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Ji Zhang , Yiran Ding , Zixin Liu

OccGen: Generative Multi-modal 3D Occupancy Prediction for Autonomous Driving

Existing solutions for 3D semantic occupancy prediction typically treat the task as a one-shot 3D voxel-wise segmentation perception problem. These discriminative methods focus on learning the mapping between the inputs and occupancy map in…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Guoqing Wang , Zhongdao Wang , Pin Tang , Jilai Zheng , Xiangxuan Ren , Bailan Feng , Chao Ma

3D Instance Segmentation via Multi-Task Metric Learning

We propose a novel method for instance label segmentation of dense 3D voxel grids. We target volumetric scene representations, which have been acquired with depth sensors or multi-view stereo methods and which have been processed with…

Computer Vision and Pattern Recognition · Computer Science 2019-11-04 Jean Lahoud , Bernard Ghanem , Marc Pollefeys , Martin R. Oswald

VoxDet: Rethinking 3D Semantic Occupancy Prediction as Dense Object Detection

3D semantic occupancy prediction aims to reconstruct the 3D geometry and semantics of the surrounding environment. With dense voxel labels, prior works typically formulate it as a dense segmentation task, independently classifying each…

Graphics · Computer Science 2025-06-06 Wuyang Li , Zhu Yu , Alexandre Alahi

OPUS: Occupancy Prediction Using a Sparse Set

Occupancy prediction, aiming at predicting the occupancy status within voxelized 3D environment, is quickly gaining momentum within the autonomous driving community. Mainstream occupancy prediction works first discretize the 3D environment…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Jiabao Wang , Zhaojiang Liu , Qiang Meng , Liujiang Yan , Ke Wang , Jie Yang , Wei Liu , Qibin Hou , Ming-Ming Cheng

SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving

3D scene understanding plays a vital role in vision-based autonomous driving. While most existing methods focus on 3D object detection, they have difficulty describing real-world objects of arbitrary shapes and infinite classes. Towards a…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Yi Wei , Linqing Zhao , Wenzhao Zheng , Zheng Zhu , Jie Zhou , Jiwen Lu

OctreeOcc: Efficient and Multi-Granularity Occupancy Prediction Using Octree Queries

Occupancy prediction has increasingly garnered attention in recent years for its fine-grained understanding of 3D scenes. Traditional approaches typically rely on dense, regular grid representations, which often leads to excessive…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Yuhang Lu , Xinge Zhu , Tai Wang , Yuexin Ma

Occlusion-Ordered Semantic Instance Segmentation

Standard semantic instance segmentation provides useful, but inherently 2D information from a single image. To enable 3D analysis, one usually integrates absolute monocular depth estimation with instance segmentation. However, monocular…

Computer Vision and Pattern Recognition · Computer Science 2025-04-22 Soroosh Baselizadeh , Cheuk-To Yu , Olga Veksler , Yuri Boykov

OccLE: Label-Efficient 3D Semantic Occupancy Prediction

3D semantic occupancy prediction offers an intuitive and efficient scene understanding and has attracted significant interest in autonomous driving perception. Existing approaches either rely on full supervision, which demands costly…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Naiyu Fang , Zheyuan Zhou , Fayao Liu , Xulei Yang , Jiacheng Wei , Lemiao Qiu , Hongsheng Li , Guosheng Lin

SceneSense: Diffusion Models for 3D Occupancy Synthesis from Partial Observation

When exploring new areas, robotic systems generally exclusively plan and execute controls over geometry that has been directly measured. When entering space that was previously obstructed from view such as turning corners in hallways or…

Robotics · Computer Science 2024-03-19 Alec Reed , Brendan Crowe , Doncey Albin , Lorin Achey , Bradley Hayes , Christoffer Heckman

Occupancy Learning with Spatiotemporal Memory

3D occupancy becomes a promising perception representation for autonomous driving to model the surrounding environment at a fine-grained scale. However, it remains challenging to efficiently aggregate 3D occupancy over time across multiple…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Ziyang Leng , Jiawei Yang , Wenlong Yi , Bolei Zhou

SliceSemOcc: Vertical Slice Based Multimodal 3D Semantic Occupancy Representation

Driven by autonomous driving's demands for precise 3D perception, 3D semantic occupancy prediction has become a pivotal research topic. Unlike bird's-eye-view (BEV) methods, which restrict scene representation to a 2D plane, occupancy…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Han Huang , Han Sun , Ningzhong Liu , Huiyu Zhou , Jiaquan Shen

OccFusion: Multi-Sensor Fusion Framework for 3D Semantic Occupancy Prediction

A comprehensive understanding of 3D scenes is crucial in autonomous vehicles (AVs), and recent models for 3D semantic occupancy prediction have successfully addressed the challenge of describing real-world objects with varied shapes and…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Zhenxing Ming , Julie Stephany Berrio , Mao Shan , Stewart Worrall

Occupancy as Set of Points

In this paper, we explore a novel point representation for 3D occupancy prediction from multi-view images, which is named Occupancy as Set of Points. Existing camera-based methods tend to exploit dense volume-based representation to predict…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Yiang Shi , Tianheng Cheng , Qian Zhang , Wenyu Liu , Xinggang Wang

ODG: Occupancy Prediction Using Dual Gaussians

Occupancy prediction infers fine-grained 3D geometry and semantics from camera images of the surrounding environment, making it a critical perception task for autonomous driving. Existing methods either adopt dense grids as scene…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Yunxiao Shi , Yinhao Zhu , Shizhong Han , Jisoo Jeong , Amin Ansari , Hong Cai , Fatih Porikli

ClusterNet: 3D Instance Segmentation in RGB-D Images

We propose a method for instance-level segmentation that uses RGB-D data as input and provides detailed information about the location, geometry and number of individual objects in the scene. This level of understanding is fundamental for…

Robotics · Computer Science 2018-09-20 Lin Shao , Ye Tian , Jeannette Bohg

Improving 3D Occupancy Prediction through Class-balancing Loss and Multi-scale Representation

3D environment recognition is essential for autonomous driving systems, as autonomous vehicles require a comprehensive understanding of surrounding scenes. Recently, the predominant approach to define this real-life problem is through 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Huizhou Chen , Jiangyi Wang , Yuxin Li , Na Zhao , Jun Cheng , Xulei Yang

EFFOcc: Learning Efficient Occupancy Networks from Minimal Labels for Autonomous Driving

3D occupancy prediction (3DOcc) is a rapidly rising and challenging perception task in the field of autonomous driving. Existing 3D occupancy networks (OccNets) are both computationally heavy and label-hungry. In terms of model complexity,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Yining Shi , Kun Jiang , Jinyu Miao , Ke Wang , Kangan Qian , Yunlong Wang , Jiusi Li , Tuopu Wen , Mengmeng Yang , Yiliang Xu , Diange Yang