Related papers: Memory Over Maps: 3D Object Localization Without R…

Visible Structure Retrieval for Lightweight Image-Based Relocalisation

Accurate camera pose estimation from an image observation in a previously mapped environment is commonly done through structure-based methods: by finding correspondences between 2D keypoints on the image and 3D structure points in the map.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Fereidoon Zangeneh , Leonard Bruns , Amit Dekel , Alessandro Pieropan , Patric Jensfelt

A survey on real-time 3D scene reconstruction with SLAM methods in embedded systems

The 3D reconstruction of simultaneous localization and mapping (SLAM) is an important topic in the field for transport systems such as drones, service robots and mobile AR/VR devices. Compared to a point cloud representation, the 3D…

Robotics · Computer Science 2023-09-12 Quentin Picard , Stephane Chevobbe , Mehdi Darouich , Jean-Yves Didier

Scene Reconstruction as Mapping Priors for 3D Detection

In autonomous driving, mapping is critical for motion planning but remains an under-utilized resource for perception tasks such as 3D object detection. Maps can provide robust structural priors of the static environment, helping resolve…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Yang Fu , Yuliang Zou , Hao Xiang , Xin Huang , Yijing Bai , Chen Song , Weijing Shi , Govind Thattai , Dragomir Anguelov , Mingxing Tan , Yingwei Li

3D object reconstruction and 6D-pose estimation from 2D shape for robotic grasping of objects

We propose a method for 3D object reconstruction and 6D-pose estimation from 2D images that uses knowledge about object shape as the primary key. In the proposed pipeline, recognition and labeling of objects in 2D images deliver 2D segment…

Computer Vision and Pattern Recognition · Computer Science 2022-03-03 Marcell Wolnitza , Osman Kaya , Tomas Kulvicius , Florentin Wörgötter , Babette Dellen

Map-free Visual Relocalization: Metric Pose Relative to a Single Image

Can we relocalize in a scene represented by a single reference image? Standard visual relocalization requires hundreds of images and scale calibration to build a scene-specific 3D map. In contrast, we propose Map-free Relocalization, i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-12 Eduardo Arnold , Jamie Wynn , Sara Vicente , Guillermo Garcia-Hernando , Áron Monszpart , Victor Adrian Prisacariu , Daniyar Turmukhambetov , Eric Brachmann

Mapping, Localization and Path Planning for Image-based Navigation using Visual Features and Map

Building on progress in feature representations for image retrieval, image-based localization has seen a surge of research interest. Image-based localization has the advantage of being inexpensive and efficient, often avoiding the use of 3D…

Computer Vision and Pattern Recognition · Computer Science 2019-07-12 Janine Thoma , Danda Pani Paudel , Ajad Chhatkuli , Thomas Probst , Luc Van Gool

Recovering 3D Planar Arrangements from Videos

Acquiring 3D geometry of real world objects has various applications in 3D digitization, such as navigation and content generation in virtual environments. Image remains one of the most popular media for such visual tasks due to its…

Computer Vision and Pattern Recognition · Computer Science 2017-01-26 Shuai Du , Youyi Zheng

Lightweight Object-level Topological Semantic Mapping and Long-term Global Localization based on Graph Matching

Mapping and localization are two essential tasks for mobile robots in real-world applications. However, largescale and dynamic scenes challenge the accuracy and robustness of most current mature solutions. This situation becomes even worse…

Robotics · Computer Science 2022-01-19 Fan Wang , Chaofan Zhang , Fulin Tang , Hongkui Jiang , Yihong Wu , Yong Liu

Object Learning and Robust 3D Reconstruction

In this thesis we discuss architectural designs and training methods for a neural network to have the ability of dissecting an image into objects of interest without supervision. The main challenge in 2D unsupervised object segmentation is…

Computer Vision and Pattern Recognition · Computer Science 2025-04-28 Sara Sabour

From Monocular Vision to Autonomous Action: Guiding Tumor Resection via 3D Reconstruction

Surgical automation requires precise guidance and understanding of the scene. Current methods in the literature rely on bulky depth cameras to create maps of the anatomy, however this does not translate well to space-limited clinical…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Ayberk Acar , Mariana Smith , Lidia Al-Zogbi , Tanner Watts , Fangjie Li , Hao Li , Nural Yilmaz , Paul Maria Scheikl , Jesse F. d'Almeida , Susheela Sharma , Lauren Branscombe , Tayfun Efe Ertop , Robert J. Webster , Ipek Oguz , Alan Kuntz , Axel Krieger , Jie Ying Wu

Improving Map Re-localization with Deep 'Movable' Objects Segmentation on 3D LiDAR Point Clouds

Localization and Mapping is an essential component to enable Autonomous Vehicles navigation, and requires an accuracy exceeding that of commercial GPS-based systems. Current odometry and mapping algorithms are able to provide this accurate…

Computer Vision and Pattern Recognition · Computer Science 2019-10-09 Victor Vaquero , Kai Fischer , Francesc Moreno-Noguer , Alberto Sanfeliu , Stefan Milz

Global Localization in Unstructured Environments using Semantic Object Maps Built from Various Viewpoints

We present a novel framework for global localization and guided relocalization of a vehicle in an unstructured environment. Compared to existing methods, our pipeline does not rely on cues from urban fixtures (e.g., lane markings,…

Robotics · Computer Science 2023-10-27 Jacqueline Ankenbauer , Parker C. Lusk , Annika Thomas , Jonathan P. How

SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation

In this paper, we propose a new framework for zero-shot object navigation. Existing zero-shot object navigation methods prompt LLM with the text of spatially closed objects, which lacks enough scene context for in-depth reasoning. To better…

Computer Vision and Pattern Recognition · Computer Science 2024-10-11 Hang Yin , Xiuwei Xu , Zhenyu Wu , Jie Zhou , Jiwen Lu

ReMemNav: A Rethinking and Memory-Augmented Framework for Zero-Shot Object Navigation

Zero-shot object navigation requires agents to locate unseen target objects in unfamiliar environments without prior maps or task-specific training which remains a significant challenge. Although recent advancements in vision-language…

Robotics · Computer Science 2026-04-08 Feng Wu , Wei Zuo , Wenliang Yang , Jun Xiao , Yang Liu , Xinhua Zeng

Automatic 3D Reconstruction for Symmetric Shapes

Generic 3D reconstruction from a single image is a difficult problem. A lot of data loss occurs in the projection. A domain based approach to reconstruction where we solve a smaller set of problems for a particular use case lead to greater…

Computer Vision and Pattern Recognition · Computer Science 2016-06-21 Atishay Jain

Enhancing MLLM Spatial Understanding via Active 3D Scene Exploration for Multi-Perspective Reasoning

Although Multimodal Large Language Models have achieved remarkable progress, they still struggle with complex 3D spatial reasoning due to the reliance on 2D visual priors. Existing approaches typically mitigate this limitation either…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Jiahua Chen , Qihong Tang , Weinong Wang , Qi Fan

Learning Less is More - 6D Camera Localization via 3D Surface Regression

Popular research areas like autonomous driving and augmented reality have renewed the interest in image-based camera localization. In this work, we address the task of predicting the 6D camera pose from a single RGB image in a given 3D…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Eric Brachmann , Carsten Rother

SSR-2D: Semantic 3D Scene Reconstruction from 2D Images

Most deep learning approaches to comprehensive semantic modeling of 3D indoor spaces require costly dense annotations in the 3D domain. In this work, we explore a central 3D scene modeling task, namely, semantic scene reconstruction without…

Computer Vision and Pattern Recognition · Computer Science 2024-06-06 Junwen Huang , Alexey Artemov , Yujin Chen , Shuaifeng Zhi , Kai Xu , Matthias Nießner

Seeing Through Clutter: Structured 3D Scene Reconstruction via Iterative Object Removal

We present SeeingThroughClutter, a method for reconstructing structured 3D representations from single images by segmenting and modeling objects individually. Prior approaches rely on intermediate tasks such as semantic segmentation and…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Rio Aguina-Kang , Kevin James Blackburn-Matzen , Thibault Groueix , Vladimir Kim , Matheus Gadelha

Reconstructing Interactive 3D Scenes by Panoptic Mapping and CAD Model Alignments

In this paper, we rethink the problem of scene reconstruction from an embodied agent's perspective: While the classic view focuses on the reconstruction accuracy, our new perspective emphasizes the underlying functions and constraints such…

Robotics · Computer Science 2021-03-31 Muzhi Han , Zeyu Zhang , Ziyuan Jiao , Xu Xie , Yixin Zhu , Song-Chun Zhu , Hangxin Liu