Related papers: 3D-Aware Object Localization using Gaussian Implic…

3D-Aware Ellipse Prediction for Object-Based Camera Pose Estimation

In this paper, we propose a method for coarse camera pose computation which is robust to viewing conditions and does not require a detailed model of the scene. This method meets the growing need of easy deployment of robotics or augmented…

Computer Vision and Pattern Recognition · Computer Science 2021-05-26 Matthieu Zins , Gilles Simon , Marie-Odile Berger

Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction

In this paper, we propose a method for initial camera pose estimation from just a single image which is robust to viewing conditions and does not require a detailed model of the scene. This method meets the growing need of easy deployment…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Matthieu Zins , Gilles Simon , Marie-Odile Berger

LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation

3D Gaussian Splatting (3DGS) has emerged as a novel explicit representation for 3D scenes, offering both high-fidelity reconstruction and efficient rendering. However, 3DGS lacks 3D segmentation ability, which limits its applicability in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Yupeng Zhang , Dezhi Zheng , Ping Lu , Han Zhang , Lei Wang , Liping xiang , Cheng Luo , Kaijun Deng , Xiaowen Fu , Linlin Shen , Jinbao Wang

GaussRender: Learning 3D Occupancy with Gaussian Rendering

Understanding the 3D geometry and semantics of driving scenes is critical for safe autonomous driving. Recent advances in 3D occupancy prediction have improved scene representation but often suffer from visual inconsistencies, leading to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Loïck Chambon , Eloi Zablocki , Alexandre Boulch , Mickaël Chen , Matthieu Cord

3D Pose from Detections

We present a novel method to infer, in closed-form, a general 3D spatial occupancy and orientation of a collection of rigid objects given 2D image detections from a sequence of images. In particular, starting from 2D ellipses fitted to…

Computer Vision and Pattern Recognition · Computer Science 2015-07-21 Cosimo Rubino , Marco Crocco , Alessandro Perina , Vittorio Murino , Alessio Del Bue

EmbodiedOcc: Embodied 3D Occupancy Prediction for Vision-based Online Scene Understanding

3D occupancy prediction provides a comprehensive description of the surrounding scenes and has become an essential task for 3D perception. Most existing methods focus on offline perception from one or a few views and cannot be applied to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Yuqi Wu , Wenzhao Zheng , Sicheng Zuo , Yuanhui Huang , Jie Zhou , Jiwen Lu

Occlusion-Aware Object Localization, Segmentation and Pose Estimation

We present a learning approach for localization and segmentation of objects in an image in a manner that is robust to partial occlusion. Our algorithm produces a bounding box around the full extent of the object and labels pixels in the…

Computer Vision and Pattern Recognition · Computer Science 2015-07-29 Samarth Brahmbhatt , Heni Ben Amor , Henrik Christensen

GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting

We introduce GaussianOcc, a systematic method that investigates the two usages of Gaussian splatting for fully self-supervised and efficient 3D occupancy estimation in surround views. First, traditional methods for self-supervised 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Wanshui Gan , Fang Liu , Hongbin Xu , Ningkai Mo , Naoto Yokoya

OP2GS: Object-Aware 3D Gaussian Splatting with Dual-Opacity Primitives

3D Gaussian Splatting (3DGS) provides an explicit and efficient scene representation, but its primitives lack inherent object-level identity, hindering downstream tasks such as open-vocabulary scene understanding. Existing methods typically…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Guiyu Liu , Niklas Vaara , Janne Mustaniemi , Juho Kannala , Janne Heikkilä

Towards Efficient Occupancy Mapping via Gaussian Process Latent Field Shaping

Occupancy mapping has been a key enabler of mobile robotics. Originally based on a discrete grid representation, occupancy mapping has evolved towards continuous representations that can predict the occupancy status at any location and…

Robotics · Computer Science 2025-06-17 Cedric Le Gentil , Cedric Pradalier , Timothy D. Barfoot

Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection

This paper addresses the challenge of robotic grasping of general objects. Similar to prior research, the task reads a single-view 3D observation (i.e., point clouds) captured by a depth camera as input. Crucially, the success of object…

Robotics · Computer Science 2024-07-23 Kangqi Ma , Hao Dong , Yadong Mu

Learning Implicit Functions for Dense 3D Shape Correspondence of Generic Objects

The objective of this paper is to learn dense 3D shape correspondence for topology-varying generic objects in an unsupervised manner. Conventional implicit functions estimate the occupancy of a 3D point given a shape latent code. Instead,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Feng Liu , Xiaoming Liu

RoboOcc: Enhancing the Geometric and Semantic Scene Understanding for Robots

3D occupancy prediction enables the robots to obtain spatial fine-grained geometry and semantics of the surrounding scene, and has become an essential task for embodied perception. Existing methods based on 3D Gaussians instead of dense…

Robotics · Computer Science 2025-04-22 Zhang Zhang , Qiang Zhang , Wei Cui , Shuai Shi , Yijie Guo , Gang Han , Wen Zhao , Hengle Ren , Renjing Xu , Jian Tang

OccLE: Label-Efficient 3D Semantic Occupancy Prediction

3D semantic occupancy prediction offers an intuitive and efficient scene understanding and has attracted significant interest in autonomous driving perception. Existing approaches either rely on full supervision, which demands costly…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Naiyu Fang , Zheyuan Zhou , Fayao Liu , Xulei Yang , Jiacheng Wei , Lemiao Qiu , Hongsheng Li , Guosheng Lin

Towards Flexible 3D Perception: Object-Centric Occupancy Completion Augments 3D Object Detection

While 3D object bounding box (bbox) representation has been widely used in autonomous driving perception, it lacks the ability to capture the precise details of an object's intrinsic geometry. Recently, occupancy has emerged as a promising…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Chaoda Zheng , Feng Wang , Naiyan Wang , Shuguang Cui , Zhen Li

GaussianFormer3D: Multi-Modal Gaussian-based Semantic Occupancy Prediction with 3D Deformable Attention

3D semantic occupancy prediction is essential for achieving safe, reliable autonomous driving and robotic navigation. Compared to camera-only perception systems, multi-modal pipelines, especially LiDAR-camera fusion methods, can produce…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Lingjun Zhao , Sizhe Wei , James Hays , Lu Gan

Elliptical Ordinal Embedding

Ordinal embedding aims at finding a low dimensional representation of objects from a set of constraints of the form "item $j$ is closer to item $i$ than item $k$". Typically, each object is mapped onto a point vector in a low dimensional…

Machine Learning · Computer Science 2021-05-26 Aïssatou Diallo , Johannes Fürnkranz

GaussianFormer: Scene as Gaussians for Vision-Based 3D Semantic Occupancy Prediction

3D semantic occupancy prediction aims to obtain 3D fine-grained geometry and semantics of the surrounding scene and is an important task for the robustness of vision-centric autonomous driving. Most existing methods employ dense grids such…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Yuanhui Huang , Wenzhao Zheng , Yunpeng Zhang , Jie Zhou , Jiwen Lu

Object-centric Reconstruction and Tracking of Dynamic Unknown Objects using 3D Gaussian Splatting

Generalizable perception is one of the pillars of high-level autonomy in space robotics. Estimating the structure and motion of unknown objects in dynamic environments is fundamental for such autonomous systems. Traditionally, the solutions…

Robotics · Computer Science 2024-11-26 Kuldeep R Barad , Antoine Richard , Jan Dentler , Miguel Olivares-Mendez , Carol Martinez

Detecting Rotated Objects as Gaussian Distributions and Its 3-D Generalization

Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects. We argue that such a mechanism has fundamental…

Computer Vision and Pattern Recognition · Computer Science 2022-09-23 Xue Yang , Gefan Zhang , Xiaojiang Yang , Yue Zhou , Wentao Wang , Jin Tang , Tao He , Junchi Yan