Related papers: Learning models for visual 3D localization with im…

Neural Multisensory Scene Inference

For embodied agents to infer representations of the underlying 3D physical world they inhabit, they should efficiently combine multisensory cues from numerous trials, e.g., by looking at and touching objects. Despite its importance,…

Machine Learning · Computer Science 2019-11-11 Jae Hyun Lim, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher Pal, Sungjin Ahn

Self-supervised Learning of Neural Implicit Feature Fields for Camera Pose Refinement

Visual localization techniques rely upon some underlying scene representation to localize against. These representations can be explicit, such as a 3D SFM map, or implicit, such as a neural network that learns to encode the scene. The former…

Computer Vision and Pattern Recognition · Computer Science 2024-06-13 Maxime Pietrantoni, Gabriela Csurka, Martin Humenberger, Torsten Sattler

Semantic Implicit Neural Scene Representations With Semi-Supervised Training

The recent success of implicit neural scene representations has presented a viable new method for how we capture and store 3D scenes. Unlike conventional 3D representations, such as point clouds, which explicitly store scene properties in…

Computer Vision and Pattern Recognition · Computer Science 2021-01-19 Amit Kohli, Vincent Sitzmann, Gordon Wetzstein

I-Scene: 3D Instance Models are Implicit Generalizable Spatial Learners

Generalization remains the central challenge for interactive 3D scene generation. Existing learning-based approaches ground spatial understanding in limited scene datasets, restricting generalization to new layouts. We instead reprogram a…

Computer Vision and Pattern Recognition · Computer Science 2026-01-08 Lu Ling, Yunhao Ge, Yichen Sheng, Aniket Bera

Convolutional Occupancy Networks

Recently, implicit neural representations have gained popularity for learning-based 3D reconstruction. While demonstrating promising results, most implicit approaches are limited to comparably simple geometry of single objects and do not…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Songyou Peng, Michael Niemeyer, Lars Mescheder, Marc Pollefeys, Andreas Geiger

Holistic 3D Scene Understanding from a Single Image with Implicit Representation

We present a new pipeline for holistic 3D scene understanding from a single image, which could predict object shapes, object poses, and scene layout. As it is a highly ill-posed problem, existing methods usually suffer from inaccurate…

Computer Vision and Pattern Recognition · Computer Science 2021-08-24 Cheng Zhang, Zhaopeng Cui, Yinda Zhang, Bing Zeng, Marc Pollefeys, Shuaicheng Liu

LatentGNN: Learning Efficient Non-local Relations for Visual Recognition

Capturing long-range dependencies in feature representations is crucial for many visual recognition tasks. Despite recent successes of deep convolutional networks, it remains challenging to model non-local context relations between visual…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Songyang Zhang, Shipeng Yan, Xuming He

Unsupervised Generative 3D Shape Learning from Natural Images

In this paper we present, to the best of our knowledge, the first method to learn a generative model of 3D shapes from natural images in a fully unsupervised way. For example, we do not use any ground truth 3D or 2D annotations, stereo…

Computer Vision and Pattern Recognition · Computer Science 2019-10-02 Attila Szabó, Givi Meishvili, Paolo Favaro

Learning to Localize in New Environments from Synthetic Training Data

Most existing approaches for visual localization either need a detailed 3D model of the environment or, in the case of learning-based methods, must be retrained for each new scene. This can either be very expensive or simply impossible for…

Robotics · Computer Science 2021-06-22 Dominik Winkelbauer, Maximilian Denninger, Rudolph Triebel

Imagining the Unseen: Generative Location Modeling for Object Placement

Location modeling, or determining where non-existing objects could feasibly appear in a scene, has the potential to benefit numerous computer vision tasks, from automatic object insertion to scene creation in virtual reality. Yet, this…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Jooyeol Yun, Davide Abati, Mohamed Omran, Jaegul Choo, Amirhossein Habibian, Auke Wiggers

GNeSF: Generalizable Neural Semantic Fields

3D scene segmentation based on neural implicit representation has emerged recently with the advantage of training only on 2D supervision. However, existing approaches still require expensive per-scene optimization that prohibits…

Computer Vision and Pattern Recognition · Computer Science 2023-10-27 Hanlin Chen, Chen Li, Mengqi Guo, Zhiwen Yan, Gim Hee Lee

Unconstrained Scene Generation with Locally Conditioned Radiance Fields

We tackle the challenge of learning a distribution over complex, realistic, indoor scenes. In this paper, we introduce Generative Scene Networks (GSN), which learns to decompose scenes into a collection of many local radiance fields that…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Terrance DeVries, Miguel Angel Bautista, Nitish Srivastava, Graham W. Taylor, Joshua M. Susskind

Differentiable Volumetric Rendering: Learning Implicit 3D Representations without 3D Supervision

Learning-based 3D reconstruction methods have shown impressive results. However, most methods require 3D supervision which is often hard to obtain for real-world datasets. Recently, several works have proposed differentiable rendering…

Computer Vision and Pattern Recognition · Computer Science 2020-03-24 Michael Niemeyer, Lars Mescheder, Michael Oechsle, Andreas Geiger

Learning 3D Segment Descriptors for Place Recognition

In the absence of global positioning information, place recognition is a key capability for enabling localization, mapping and navigation in any environment. Most place recognition methods rely on images, point clouds, or a combination of…

Robotics · Computer Science 2018-04-26 Andrei Cramariuc, Renaud Dubé, Hannes Sommer, Roland Siegwart, Igor Gilitschenski

GINA-3D: Learning to Generate Implicit Neural Assets in the Wild

Modeling the 3D world from sensor data for simulation is a scalable way of developing testing and validation environments for robotic learning problems such as autonomous driving. However, manually creating or re-creating real-world-like…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Bokui Shen, Xinchen Yan, Charles R. Qi, Mahyar Najibi, Boyang Deng, Leonidas Guibas, Yin Zhou, Dragomir Anguelov

Cycle-Consistent Generative Rendering for 2D-3D Modality Translation

For humans, visual understanding is inherently generative: given a 3D shape, we can postulate how it would look in the world; given a 2D image, we can infer the 3D structure that likely gave rise to it. We can thus translate between the 2D…

Computer Vision and Pattern Recognition · Computer Science 2020-11-17 Tristan Aumentado-Armstrong, Alex Levinshtein, Stavros Tsogkas, Konstantinos G. Derpanis, Allan D. Jepson

Semantic Visual Localization

Robust visual localization under a wide range of viewing conditions is a fundamental problem in computer vision. Handling the difficult cases of this problem is not only very challenging but also of high practical relevance, e.g., in the…

Computer Vision and Pattern Recognition · Computer Science 2018-04-17 Johannes L. Schönberger, Marc Pollefeys, Andreas Geiger, Torsten Sattler

$L^3$: Scene-agnostic Visual Localization in the Wild

Standard visual localization methods typically require offline pre-processing of scenes to obtain 3D structural information for better performance. This inevitably introduces additional computational and time costs, as well as the overhead…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Yu Zhang, Muhua Zhu, Yifei Xue, Tie Ji, Yizhen Lao

Deep Generative Modeling for Scene Synthesis via Hybrid Representations

We present a deep generative scene modeling technique for indoor environments. Our goal is to train a generative model using a feed-forward neural network that maps a prior distribution (e.g., a normal distribution) to the distribution of…

Computer Vision and Pattern Recognition · Computer Science 2018-08-08 Zaiwei Zhang, Zhenpei Yang, Chongyang Ma, Linjie Luo, Alexander Huth, Etienne Vouga, Qixing Huang

Global visual localization in LiDAR-maps through shared 2D-3D embedding space

Global localization is an important and widely studied problem for many robotic applications. Place recognition approaches can be exploited to solve this task, e.g., in the autonomous driving field. While most vision-based approaches match…

Computer Vision and Pattern Recognition · Computer Science 2020-03-11 Daniele Cattaneo, Matteo Vaghi, Simone Fontana, Augusto Luis Ballardini, Domenico Giorgio Sorrenti