English
Related papers

Related papers: 3D-Aware Object Localization using Gaussian Implic…

200 papers

In this paper, we propose a method for coarse camera pose computation which is robust to viewing conditions and does not require a detailed model of the scene. This method meets the growing need of easy deployment of robotics or augmented…

Computer Vision and Pattern Recognition · Computer Science 2021-05-26 Matthieu Zins , Gilles Simon , Marie-Odile Berger

In this paper, we propose a method for initial camera pose estimation from just a single image which is robust to viewing conditions and does not require a detailed model of the scene. This method meets the growing need of easy deployment…

Computer Vision and Pattern Recognition · Computer Science 2022-03-10 Matthieu Zins , Gilles Simon , Marie-Odile Berger

3D Gaussian Splatting (3DGS) has emerged as a novel explicit representation for 3D scenes, offering both high-fidelity reconstruction and efficient rendering. However, 3DGS lacks 3D segmentation ability, which limits its applicability in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Yupeng Zhang , Dezhi Zheng , Ping Lu , Han Zhang , Lei Wang , Liping xiang , Cheng Luo , Kaijun Deng , Xiaowen Fu , Linlin Shen , Jinbao Wang

Understanding the 3D geometry and semantics of driving scenes is critical for safe autonomous driving. Recent advances in 3D occupancy prediction have improved scene representation but often suffer from visual inconsistencies, leading to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Loïck Chambon , Eloi Zablocki , Alexandre Boulch , Mickaël Chen , Matthieu Cord

We present a novel method to infer, in closed-form, a general 3D spatial occupancy and orientation of a collection of rigid objects given 2D image detections from a sequence of images. In particular, starting from 2D ellipses fitted to…

Computer Vision and Pattern Recognition · Computer Science 2015-07-21 Cosimo Rubino , Marco Crocco , Alessandro Perina , Vittorio Murino , Alessio Del Bue

3D occupancy prediction provides a comprehensive description of the surrounding scenes and has become an essential task for 3D perception. Most existing methods focus on offline perception from one or a few views and cannot be applied to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Yuqi Wu , Wenzhao Zheng , Sicheng Zuo , Yuanhui Huang , Jie Zhou , Jiwen Lu

We present a learning approach for localization and segmentation of objects in an image in a manner that is robust to partial occlusion. Our algorithm produces a bounding box around the full extent of the object and labels pixels in the…

Computer Vision and Pattern Recognition · Computer Science 2015-07-29 Samarth Brahmbhatt , Heni Ben Amor , Henrik Christensen

We introduce GaussianOcc, a systematic method that investigates the two usages of Gaussian splatting for fully self-supervised and efficient 3D occupancy estimation in surround views. First, traditional methods for self-supervised 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Wanshui Gan , Fang Liu , Hongbin Xu , Ningkai Mo , Naoto Yokoya

3D Gaussian Splatting (3DGS) provides an explicit and efficient scene representation, but its primitives lack inherent object-level identity, hindering downstream tasks such as open-vocabulary scene understanding. Existing methods typically…

Computer Vision and Pattern Recognition · Computer Science 2026-05-20 Guiyu Liu , Niklas Vaara , Janne Mustaniemi , Juho Kannala , Janne Heikkilä

Occupancy mapping has been a key enabler of mobile robotics. Originally based on a discrete grid representation, occupancy mapping has evolved towards continuous representations that can predict the occupancy status at any location and…

Robotics · Computer Science 2025-06-17 Cedric Le Gentil , Cedric Pradalier , Timothy D. Barfoot

This paper addresses the challenge of robotic grasping of general objects. Similar to prior research, the task reads a single-view 3D observation (i.e., point clouds) captured by a depth camera as input. Crucially, the success of object…

Robotics · Computer Science 2024-07-23 Kangqi Ma , Hao Dong , Yadong Mu

The objective of this paper is to learn dense 3D shape correspondence for topology-varying generic objects in an unsupervised manner. Conventional implicit functions estimate the occupancy of a 3D point given a shape latent code. Instead,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Feng Liu , Xiaoming Liu

3D occupancy prediction enables the robots to obtain spatial fine-grained geometry and semantics of the surrounding scene, and has become an essential task for embodied perception. Existing methods based on 3D Gaussians instead of dense…

Robotics · Computer Science 2025-04-22 Zhang Zhang , Qiang Zhang , Wei Cui , Shuai Shi , Yijie Guo , Gang Han , Wen Zhao , Hengle Ren , Renjing Xu , Jian Tang

3D semantic occupancy prediction offers an intuitive and efficient scene understanding and has attracted significant interest in autonomous driving perception. Existing approaches either rely on full supervision, which demands costly…

Computer Vision and Pattern Recognition · Computer Science 2026-01-23 Naiyu Fang , Zheyuan Zhou , Fayao Liu , Xulei Yang , Jiacheng Wei , Lemiao Qiu , Hongsheng Li , Guosheng Lin

While 3D object bounding box (bbox) representation has been widely used in autonomous driving perception, it lacks the ability to capture the precise details of an object's intrinsic geometry. Recently, occupancy has emerged as a promising…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Chaoda Zheng , Feng Wang , Naiyan Wang , Shuguang Cui , Zhen Li

3D semantic occupancy prediction is essential for achieving safe, reliable autonomous driving and robotic navigation. Compared to camera-only perception systems, multi-modal pipelines, especially LiDAR-camera fusion methods, can produce…

Computer Vision and Pattern Recognition · Computer Science 2026-02-17 Lingjun Zhao , Sizhe Wei , James Hays , Lu Gan

Ordinal embedding aims at finding a low dimensional representation of objects from a set of constraints of the form "item $j$ is closer to item $i$ than item $k$". Typically, each object is mapped onto a point vector in a low dimensional…

Machine Learning · Computer Science 2021-05-26 Aïssatou Diallo , Johannes Fürnkranz

3D semantic occupancy prediction aims to obtain 3D fine-grained geometry and semantics of the surrounding scene and is an important task for the robustness of vision-centric autonomous driving. Most existing methods employ dense grids such…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Yuanhui Huang , Wenzhao Zheng , Yunpeng Zhang , Jie Zhou , Jiwen Lu

Generalizable perception is one of the pillars of high-level autonomy in space robotics. Estimating the structure and motion of unknown objects in dynamic environments is fundamental for such autonomous systems. Traditionally, the solutions…

Robotics · Computer Science 2024-11-26 Kuldeep R Barad , Antoine Richard , Jan Dentler , Miguel Olivares-Mendez , Carol Martinez

Existing detection methods commonly use a parameterized bounding box (BBox) to model and detect (horizontal) objects and an additional rotation angle parameter is used for rotated objects. We argue that such a mechanism has fundamental…

Computer Vision and Pattern Recognition · Computer Science 2022-09-23 Xue Yang , Gefan Zhang , Xiaojiang Yang , Yue Zhou , Wentao Wang , Jin Tang , Tao He , Junchi Yan
‹ Prev 1 2 3 10 Next ›