Related papers: Point3R: Streaming 3D Reconstruction with Explicit…

3D Reconstruction with Spatial Memory

We present Spann3R, a novel approach for dense 3D reconstruction from ordered or unordered image collections. Built on the DUSt3R paradigm, Spann3R uses a transformer-based architecture to directly regress pointmaps from images without any…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Hengyi Wang , Lourdes Agapito

DePT3R: Joint Dense Point Tracking and 3D Reconstruction of Dynamic Scenes in a Single Forward Pass

Current methods for dense 3D point tracking in dynamic scenes typically rely on pairwise processing, require known camera poses, or assume temporal ordering of input frames, thereby constraining their flexibility and applicability.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-06 Vivek Alumootil , Tuan-Anh Vu

Ray-Aware Pointer Memory with Adaptive Updates for Streaming 3D Reconstruction

Dense 3D reconstruction from continuous image streams requires both accurate geometric aggregation and stable long-term memory management. Recent feed-forward reconstruction frameworks integrate observations through persistent memory…

Computer Vision and Pattern Recognition · Computer Science 2026-05-22 Feifei Li , Qi Song , Chi Zhang , Rui Huang

LONG3R: Long Sequence Streaming 3D Reconstruction

Recent advancements in multi-view scene reconstruction have been significant, yet existing methods face limitations when processing streams of input images. These methods either rely on time-consuming offline optimization or are restricted…

Computer Vision and Pattern Recognition · Computer Science 2025-07-25 Zhuoguang Chen , Minghui Qin , Tianyuan Yuan , Zhe Liu , Hang Zhao

Driv3R: Learning Dense 4D Reconstruction for Autonomous Driving

Realtime 4D reconstruction for dynamic scenes remains a crucial challenge for autonomous driving perception. Most existing methods rely on depth estimation through self-supervision or multi-modality sensor fusion. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Xin Fei , Wenzhao Zheng , Yueqi Duan , Wei Zhan , Masayoshi Tomizuka , Kurt Keutzer , Jiwen Lu

STream3R: Scalable Sequential 3D Reconstruction with Causal Transformer

We present STream3R, a novel approach to 3D reconstruction that reformulates pointmap prediction as a decoder-only Transformer problem. Existing state-of-the-art methods for multi-view reconstruction either depend on expensive global…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Yushi Lan , Yihang Luo , Fangzhou Hong , Shangchen Zhou , Honghua Chen , Zhaoyang Lyu , Shuai Yang , Bo Dai , Chen Change Loy , Xingang Pan

Mem3R: Streaming 3D Reconstruction with Hybrid Memory via Test-Time Training

Streaming 3D perception is well suited to robotics and augmented reality, where long visual streams must be processed efficiently and consistently. Recent recurrent models offer a promising solution by maintaining fixed-size states and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-09 Changkun Liu , Jiezhi Yang , Zeman Li , Yuan Deng , Jiancong Guo , Luca Ballan

PAS3R: Pose-Adaptive Streaming 3D Reconstruction for Long Video Sequences

Online monocular 3D reconstruction enables dense scene recovery from streaming video but remains fundamentally limited by the stability-adaptation dilemma: the reconstruction model must rapidly incorporate novel viewpoints while preserving…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Lanbo Xu , Liang Guo , Caigui Jiang , Cheng Wang

Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction

DUSt3R has recently shown that one can reduce many tasks in multi-view geometry, including estimating camera intrinsics and extrinsics, reconstructing the scene in 3D, and establishing image correspondences, to the prediction of a pair of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Edgar Sucar , Zihang Lai , Eldar Insafutdinov , Andrea Vedaldi

D$^2$USt3R: Enhancing 3D Reconstruction for Dynamic Scenes

In this work, we address the task of 3D reconstruction in dynamic scenes, where object motions frequently degrade the quality of previous 3D pointmap regression methods, such as DUSt3R, that are originally designed for static 3D scene…

Computer Vision and Pattern Recognition · Computer Science 2025-11-03 Jisang Han , Honggyu An , Jaewoo Jung , Takuya Narihira , Junyoung Seo , Kazumi Fukuda , Chaehyun Kim , Sunghwan Hong , Yuki Mitsufuji , Seungryong Kim

GRS-SLAM3R: Real-Time Dense SLAM with Gated Recurrent State

DUSt3R-based end-to-end scene reconstruction has recently shown promising results in dense visual SLAM. However, most existing methods only use image pairs to estimate pointmaps, overlooking spatial memory and global consistency.To this…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Guole Shen , Tianchen Deng , Yanbo Wang , Yongtao Chen , Yilin Shen , Jiuming Liu , Jingchuan Wang

Continuous 3D Perception Model with Persistent State

We present a unified framework capable of solving a broad range of 3D tasks. Our approach features a stateful recurrent model that continuously updates its state representation with each new observation. Given a stream of images, this…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Qianqian Wang , Yifei Zhang , Aleksander Holynski , Alexei A. Efros , Angjoo Kanazawa

MUSt3R: Multi-view Network for Stereo 3D Reconstruction

DUSt3R introduced a novel paradigm in geometric computer vision by proposing a model that can provide dense and unconstrained Stereo 3D Reconstruction of arbitrary image collections with no prior information about camera calibration nor…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Yohann Cabon , Lucas Stoffl , Leonid Antsfeld , Gabriela Csurka , Boris Chidlovskii , Jerome Revaud , Vincent Leroy

Dens3R: A Foundation Model for 3D Geometry Prediction

Recent advances in dense 3D reconstruction have led to significant progress, yet achieving accurate unified geometric prediction remains a major challenge. Most existing methods are limited to predicting a single geometry quantity from…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Xianze Fang , Jingnan Gao , Zhe Wang , Zhuo Chen , Xingyu Ren , Jiangjing Lyu , Qiaomu Ren , Zhonglei Yang , Xiaokang Yang , Yichao Yan , Chengfei Lyu

Easi3R: Estimating Disentangled Motion from DUSt3R Without Training

Recent advances in DUSt3R have enabled robust estimation of dense point clouds and camera parameters of static scenes, leveraging Transformer network architectures and direct supervision on large-scale 3D datasets. In contrast, the limited…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Xingyu Chen , Yue Chen , Yuliang Xiu , Andreas Geiger , Anpei Chen

PointRecon: Online Point-based 3D Reconstruction via Ray-based 2D-3D Matching

We propose a novel online, point-based 3D reconstruction method from posed monocular RGB videos. Our model maintains a global point cloud representation of the scene, continuously updating the features and 3D locations of points as new…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Chen Ziwen , Zexiang Xu , Li Fuxin

SLAM3R: Real-Time Dense Scene Reconstruction from Monocular RGB Videos

In this paper, we introduce SLAM3R, a novel and effective system for real-time, high-quality, dense 3D reconstruction using RGB videos. SLAM3R provides an end-to-end solution by seamlessly integrating local 3D reconstruction and global…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Yuzheng Liu , Siyan Dong , Shuzhe Wang , Yingda Yin , Yanchao Yang , Qingnan Fan , Baoquan Chen

Dense 3D Point Cloud Reconstruction Using a Deep Pyramid Network

Reconstructing a high-resolution 3D model of an object is a challenging task in computer vision. Designing scalable and light-weight architectures is crucial while addressing this problem. Existing point-cloud based reconstruction…

Computer Vision and Pattern Recognition · Computer Science 2019-01-28 Priyanka Mandikal , R. Venkatesh Babu

Interp3R: Continuous-time 3D Geometry Estimation with Frames and Events

In recent years, 3D visual foundation models pioneered by pointmap-based approaches such as DUSt3R have attracted a lot of interest, achieving impressive accuracy and strong generalization across diverse scenes. However, these methods are…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Shuang Guo , Filbert Febryanto , Lei Sun , Guillermo Gallego

V-DPM: 4D Video Reconstruction with Dynamic Point Maps

Powerful 3D representations such as DUSt3R invariant point maps, which encode 3D shape and camera parameters, have significantly advanced feed forward 3D reconstruction. While point maps assume static scenes, Dynamic Point Maps (DPMs)…

Computer Vision and Pattern Recognition · Computer Science 2026-01-15 Edgar Sucar , Eldar Insafutdinov , Zihang Lai , Andrea Vedaldi