Related papers: Efficient Scene Compression for Visual-based Local…

Hybrid Scene Compression for Visual Localization

Localizing an image wrt. a 3D scene model represents a core task for many computer vision applications. An increasing number of real-world applications of visual localization on mobile devices, e.g., Augmented Reality or autonomous robots…

Computer Vision and Pattern Recognition · Computer Science 2019-04-23 Federico Camposeco, Andrea Cohen, Marc Pollefeys, Torsten Sattler

3D Scene Compression through Entropy Penalized Neural Representation Functions

Some forms of novel visual media enable the viewer to explore a 3D scene from arbitrary viewpoints, by interpolating between a discrete set of original views. Compared to 2D imagery, these types of applications require much larger amounts…

Computer Vision and Pattern Recognition · Computer Science 2021-04-27 Thomas Bird, Johannes Ballé, Saurabh Singh, Philip A. Chou

Fast and Lightweight Scene Regressor for Camera Relocalization

Camera relocalization involving a prior 3D reconstruction plays a crucial role in many mixed reality and robotics applications. Estimating the camera pose directly with respect to pre-built 3D models can be prohibitively expensive for…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Thuan B. Bui, Dinh-Tuan Tran, Joo-Ho Lee

Visual Localization via Few-Shot Scene Region Classification

Visual (re)localization addresses the problem of estimating the 6-DoF (Degree of Freedom) camera pose of a query image captured in a known scene, which is a key building block of many computer vision and robotics applications. Recent…

Computer Vision and Pattern Recognition · Computer Science 2022-08-16 Siyan Dong, Shuzhe Wang, Yixin Zhuang, Juho Kannala, Marc Pollefeys, Baoquan Chen

Perceive-Sample-Compress: Towards Real-Time 3D Gaussian Splatting

Recent advances in 3D Gaussian Splatting (3DGS) have demonstrated remarkable capabilities in real-time and photorealistic novel view synthesis. However, traditional 3DGS representations often struggle with large-scale scene management and…

Graphics · Computer Science 2025-08-08 Zijian Wang, Beizhen Zhao, Hao Wang

Neural 3D Scene Compression via Model Compression

Rendering 3D scenes requires access to arbitrary viewpoints from the scene. Storage of such a 3D scene can be done in two ways; (1) storing 2D images taken from the 3D scene that can reconstruct the scene back through interpolations, or (2)…

Computer Vision and Pattern Recognition · Computer Science 2021-05-10 Berivan Isik

A PnP Algorithm for Two-Dimensional Pose Estimation

We propose a PnP algorithm for a camera constrained to two-dimensional motion (applicable, for instance, to many wheeled robotics platforms). Leveraging this assumption allows accuracy and performance improvements over 3D PnP algorithms due…

Robotics · Computer Science 2024-03-11 Joshua Wang

Nerfels: Renderable Neural Codes for Improved Camera Pose Estimation

This paper presents a framework that combines traditional keypoint-based camera pose optimization with an invertible neural rendering mechanism. Our proposed 3D scene representation, Nerfels, is locally dense yet globally sparse. As opposed…

Computer Vision and Pattern Recognition · Computer Science 2022-06-07 Gil Avraham, Julian Straub, Tianwei Shen, Tsun-Yi Yang, Hugo Germain, Chris Sweeney, Vasileios Balntas, David Novotny, Daniel DeTone, Richard Newcombe

Differentiable Product Quantization for Memory Efficient Camera Relocalization

Camera relocalization relies on 3D models of the scene with a large memory footprint that is incompatible with the memory budget of several applications. One solution to reduce the scene memory size is map compression by removing certain 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-07-25 Zakaria Laskar, Iaroslav Melekhov, Assia Benbihi, Shuzhe Wang, Juho Kannala

Learning to Localize Through Compressed Binary Maps

One of the main difficulties of scaling current localization systems to large environments is the on-board storage required for the maps. In this paper we propose to learn to compress the map representation such that it is optimal for the…

Computer Vision and Pattern Recognition · Computer Science 2020-12-22 Xinkai Wei, Ioan Andrei Bârsan, Shenlong Wang, Julieta Martinez, Raquel Urtasun

Robust 6D Object Pose Estimation in Cluttered Scenes using Semantic Segmentation and Pose Regression Networks

Object pose estimation is a crucial prerequisite for robots to perform autonomous manipulation in clutter. Real-world bin-picking settings such as warehouses present additional challenges, e.g., new objects are added constantly. Most of the…

Computer Vision and Pattern Recognition · Computer Science 2018-10-09 Arul Selvam Periyasamy, Max Schwarz, Sven Behnke

Learning Object Localization and 6D Pose Estimation from Simulation and Weakly Labeled Real Images

This work proposes a process for efficiently training a point-wise object detector that enables localizing objects and computing their 6D poses in cluttered and occluded scenes. Accurate pose estimation is typically a requirement for robust…

Computer Vision and Pattern Recognition · Computer Science 2019-02-22 Jean-Philippe Mercier, Chaitanya Mitash, Philippe Giguère, Abdeslam Boularias

COMPOSE: Hypergraph Cover Optimization for Multi-view 3D Human Pose Estimation

3D pose estimation from sparse multi-views is a critical task for numerous applications, including action recognition, sports analysis, and human-robot interaction. Optimization-based methods typically follow a two-stage pipeline, first…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Tony Danjun Wang, Tolga Birdal, Nassir Navab, Lennart Bastian

3D Scene Geometry-Aware Constraint for Camera Localization with Deep Learning

Camera localization is a fundamental and key component of autonomous driving vehicles and mobile robots to localize themselves globally for further environment perception, path planning and motion control. Recently end-to-end approaches…

Computer Vision and Pattern Recognition · Computer Science 2020-05-14 Mi Tian, Qiong Nie, Hao Shen

Efficient in-situ image and video compression through probabilistic image representation

Fast and effective image compression for multi-dimensional images has become increasingly important for efficient storage and transfer of massive amounts of high-resolution images and videos. Desirable properties in compression methods…

Image and Video Processing · Electrical Eng. & Systems 2020-11-13 Rongjie Liu, Meng Li, Li Ma

Compressed Volumetric Heatmaps for Multi-Person 3D Pose Estimation

In this paper we present a novel approach for bottom-up multi-person 3D human pose estimation from monocular RGB images. We propose to use high resolution volumetric heatmaps to model joint locations, devising a simple and effective…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Matteo Fabbri, Fabio Lanzi, Simone Calderara, Stefano Alletto, Rita Cucchiara

Quadric Representations for LiDAR Odometry, Mapping and Localization

Current LiDAR odometry, mapping and localization methods leverage point-wise representations of 3D scenes and achieve high accuracy in autonomous driving tasks. However, the space-inefficiency of methods that use point-wise representations…

Computer Vision and Pattern Recognition · Computer Science 2023-04-28 Chao Xia, Chenfeng Xu, Patrick Rim, Mingyu Ding, Nanning Zheng, Kurt Keutzer, Masayoshi Tomizuka, Wei Zhan

ImPosing: Implicit Pose Encoding for Efficient Visual Localization

We propose a novel learning-based formulation for visual localization of vehicles that can operate in real-time in city-scale environments. Visual localization algorithms determine the position and orientation from which an image has been…

Computer Vision and Pattern Recognition · Computer Science 2022-10-31 Arthur Moreau, Thomas Gilles, Nathan Piasco, Dzmitry Tsishkou, Bogdan Stanciulescu, Arnaud de La Fortelle

Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

Recently, high-fidelity scene reconstruction with an optimized 3D Gaussian splat representation has been introduced for novel view synthesis from sparse image sets. Making such representations suitable for applications like network…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Simon Niedermayr, Josef Stumpfegger, Rüdiger Westermann

A Pose-only Geometric Constraint for Multi-Camera Pose Adjustment

Multi-camera systems offer rich observation capabilities for visual navigation and 3D scene reconstruction; however, the resulting feature redundancy often compromises computational efficiency. This challenge is particularly pronounced…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Shunkun Liang, Banglei Guan, Bin Li, Qifeng Yu, Yang Shang