Related papers: Deep Patch Visual SLAM

DROID-SLAM: Deep Visual SLAM for Monocular, Stereo, and RGB-D Cameras

We introduce DROID-SLAM, a new deep learning based SLAM system. DROID-SLAM consists of recurrent iterative updates of camera pose and pixelwise depth through a Dense Bundle Adjustment layer. DROID-SLAM is accurate, achieving large…

Computer Vision and Pattern Recognition · Computer Science 2022-02-04 Zachary Teed , Jia Deng

Deep Patch Visual Odometry

We propose Deep Patch Visual Odometry (DPVO), a new deep learning system for monocular Visual Odometry (VO). DPVO uses a novel recurrent network architecture designed for tracking image patches across time. Recent approaches to VO have…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Zachary Teed , Lahav Lipson , Jia Deng

NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU

Many existing visual SLAM methods can achieve high localization accuracy in dynamic environments by leveraging deep learning to mask moving objects. However, these methods incur significant computational overhead as the camera tracking…

Robotics · Computer Science 2025-06-18 Yuhao Zhang , Mihai Bujanca , Mikel Luján

VOLDOR-SLAM: For the Times When Feature-Based or Direct Methods Are Not Good Enough

We present a dense-indirect SLAM system using external dense optical flows as input. We extend the recent probabilistic visual odometry model VOLDOR [Min et al. CVPR'20], by incorporating the use of geometric priors to 1) robustly bootstrap…

Computer Vision and Pattern Recognition · Computer Science 2021-04-15 Zhixiang Min , Enrique Dunn

Visual Odometry Revisited: What Should Be Learnt?

In this work we present a monocular visual odometry (VO) algorithm which leverages geometry-based methods and deep learning. Most existing VO/SLAM systems with superior performance are based on geometry and have to be carefully designed for…

Computer Vision and Pattern Recognition · Computer Science 2020-02-19 Huangying Zhan , Chamara Saroj Weerasekera , Jiawang Bian , Ian Reid

Dropping the D: RGB-D SLAM Without the Depth Sensor

We present DropD-SLAM, a real-time monocular SLAM system that achieves RGB-D-level accuracy without relying on depth sensors. The system replaces active depth input with three pretrained vision modules: a monocular metric depth estimator, a…

Computer Vision and Pattern Recognition · Computer Science 2025-11-04 Mert Kiray , Alican Karaomer , Benjamin Busam

DPVO-QAT++: Heterogeneous QAT and CUDA Kernel Fusion for High-Performance Deep Patch Visual Odometry

Deep learning-based Visual SLAM (vSLAM) systems exhibit exceptional geometric reasoning capabilities, yet their prohibitive computational overhead severely restricts deployment on resource-constrained autonomous platforms. This paper…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Cheng Liao

DVN-SLAM: Dynamic Visual Neural SLAM Based on Local-Global Encoding

Recent research on Simultaneous Localization and Mapping (SLAM) based on implicit representation has shown promising results in indoor environments. However, there are still some challenges: the limited scene representation capability of…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Wenhua Wu , Guangming Wang , Ting Deng , Sebastian Aegidius , Stuart Shanks , Valerio Modugno , Dimitrios Kanoulas , Hesheng Wang

ViSTA-SLAM: Visual SLAM with Symmetric Two-view Association

We present ViSTA-SLAM as a real-time monocular visual SLAM system that operates without requiring camera intrinsics, making it broadly applicable across diverse camera setups. At its core, the system employs a lightweight symmetric two-view…

Computer Vision and Pattern Recognition · Computer Science 2026-01-07 Ganlin Zhang , Shenhan Qian , Xi Wang , Daniel Cremers

DF-SLAM: A Deep-Learning Enhanced Visual SLAM System based on Deep Local Features

As the foundation of driverless vehicle and intelligent robots, Simultaneous Localization and Mapping(SLAM) has attracted much attention these days. However, non-geometric modules of traditional SLAM algorithms are limited by data…

Computer Vision and Pattern Recognition · Computer Science 2019-01-25 Rong Kang , Jieqi Shi , Xueming Li , Yang Liu , Xiao Liu

DynaSLAM: Tracking, Mapping and Inpainting in Dynamic Scenes

The assumption of scene rigidity is typical in SLAM algorithms. Such a strong assumption limits the use of most visual SLAM systems in populated real-world environments, which are the target of several relevant applications like service…

Computer Vision and Pattern Recognition · Computer Science 2018-08-16 Berta Bescos , José M. Fácil , Javier Civera , José Neira

Light-SLAM: A Robust Deep-Learning Visual SLAM System Based on LightGlue under Challenging Lighting Conditions

Simultaneous Localization and Mapping (SLAM) has become a critical technology for intelligent transportation systems and autonomous robots and is widely used in autonomous driving. However, traditional manual feature-based methods in…

Computer Vision and Pattern Recognition · Computer Science 2024-07-03 Zhiqi Zhao , Chang Wu , Xiaotong Kong , Zejie Lv , Xiaoqi Du , Qiyan Li

A real-time, robust and versatile visual-SLAM framework based on deep learning networks

This paper explores how deep learning techniques can improve visual-based SLAM performance in challenging environments. By combining deep feature extraction and deep matching methods, we introduce a versatile hybrid visual SLAM system…

Robotics · Computer Science 2024-06-05 Zhang Xiao , Shuaixin Li

DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features

A robust and efficient Simultaneous Localization and Mapping (SLAM) system is essential for robot autonomy. For visual SLAM algorithms, though the theoretical framework has been well established for most aspects, feature extraction and…

Computer Vision and Pattern Recognition · Computer Science 2020-08-13 Dongjiang Li , Xuesong Shi , Qiwei Long , Shenghui Liu , Wei Yang , Fangshi Wang , Qi Wei , Fei Qiao

DeepVO: A Deep Learning approach for Monocular Visual Odometry

Deep Learning based techniques have been adopted with precision to solve a lot of standard computer vision problems, some of which are image classification, object detection and segmentation. Despite the widespread success of these…

Computer Vision and Pattern Recognition · Computer Science 2016-11-21 Vikram Mohanty , Shubh Agrawal , Shaswat Datta , Arna Ghosh , Vishnu Dutt Sharma , Debashish Chakravarty

coVoxSLAM: GPU Accelerated Globally Consistent Dense SLAM

A dense SLAM system is essential for mobile robots, as it provides localization and allows navigation, path planning, obstacle avoidance, and decision-making in unstructured environments. Due to increasing computational demands the use of…

Robotics · Computer Science 2024-10-29 Emiliano Höss , Pablo De Cristóforis

DVI-SLAM: A Dual Visual Inertial SLAM Network

Recent deep learning based visual simultaneous localization and mapping (SLAM) methods have made significant progress. However, how to make full use of visual information as well as better integrate with inertial measurement unit (IMU) in…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Xiongfeng Peng , Zhihua Liu , Weiming Li , Ping Tan , SoonYong Cho , Qiang Wang

OV$^{2}$SLAM : A Fully Online and Versatile Visual SLAM for Real-Time Applications

Many applications of Visual SLAM, such as augmented reality, virtual reality, robotics or autonomous driving, require versatile, robust and precise solutions, most often with real-time capability. In this work, we describe OV$^{2}$SLAM, a…

Computer Vision and Pattern Recognition · Computer Science 2021-02-09 Maxime Ferrera , Alexandre Eudes , Julien Moras , Martial Sanfourche , Guy Le Besnerais

WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments

We present WildGS-SLAM, a robust and efficient monocular RGB SLAM system designed to handle dynamic environments by leveraging uncertainty-aware geometric mapping. Unlike traditional SLAM systems, which assume static scenes, our approach…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Jianhao Zheng , Zihan Zhu , Valentin Bieri , Marc Pollefeys , Songyou Peng , Iro Armeni

VGGT-SLAM++

We introduce VGGT-SLAM++, a complete visual SLAM system that leverages the geometry-rich outputs of the Visual Geometry Grounded Transformer (VGGT). The system comprises a visual odometry (front-end) fusing the VGGT feed-forward transformer…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Avilasha Mandal , Rajesh Kumar , Sudarshan Sunil Harithas , Chetan Arora