Related papers: Feed-Forward 3D Scene Modeling: A Problem-Driven P…

Advances in Feed-Forward 3D Reconstruction and View Synthesis: A Survey

3D reconstruction and view synthesis are foundational problems in computer vision, graphics, and immersive technologies such as augmented reality (AR), virtual reality (VR), and digital twins. Traditional methods rely on computationally…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Jiahui Zhang, Yuelei Li, Anpei Chen, Muyu Xu, Kunhao Liu, Jianyuan Wang, Xiao-Xiao Long, Hanxue Liang, Zexiang Xu, Hao Su, Christian Theobalt, Christian Rupprecht, Andrea Vedaldi, Kaichen Zhou, Hanspeter Pfister, Paul Pu Liang, Shijian Lu, Fangneng Zhan

Review of Feed-forward 3D Reconstruction: From DUSt3R to VGGT

3D reconstruction, which aims to recover the dense three-dimensional structure of a scene, is a cornerstone technology for numerous applications, including augmented/virtual reality, autonomous driving, and robotics. While traditional…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Wei Zhang, Yihang Wu, Songhua Li, Wenjie Ma, Xin Ma, Qiang Li, Qi Wang

MapAnything: Universal Feed-Forward Metric 3D Reconstruction

We introduce MapAnything, a unified transformer-based feed-forward model that ingests one or more images along with optional geometric inputs such as camera intrinsics, poses, depth, or partial reconstructions, and then directly regresses…

Computer Vision and Pattern Recognition · Computer Science 2026-01-26 Nikhil Keetha, Norman Müller, Johannes Schönberger, Lorenzo Porzi, Yuchen Zhang, Tobias Fischer, Arno Knapitsch, Duncan Zauss, Ethan Weber, Nelson Antunes, Jonathon Luiten, Manuel Lopez-Antequera, Samuel Rota Bulò, Christian Richardt, Deva Ramanan, Sebastian Scherer, Peter Kontschieder

Taming Feed-forward Reconstruction Models as Latent Encoders for 3D Generative Models

Recent AI-based 3D content creation has largely evolved along two paths: feed-forward image-to-3D reconstruction approaches and 3D generative models trained with 2D or 3D supervision. In this work, we show that existing feed-forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Suttisak Wizadwongsa, Jinfan Zhou, Edward Li, Jeong Joon Park

Global Structure-from-Motion Meets Feedforward Reconstruction

Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge in computer vision, with many open problems yet to be solved. Recent advances…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Linfei Pan, Johannes Schönberger, Marc Pollefeys

Free Geometry: Refining 3D Reconstruction from Longer Versions of Itself

Feed-forward 3D reconstruction models are efficient but rigid: once trained, they perform inference in a zero-shot manner and cannot adapt to the test scene. As a result, visually plausible reconstructions often contain errors, particularly…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Yuhang Dai, Xingyi Yang

TriSplat: Simulation-Ready Feed-Forward 3D Scene Reconstruction

Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most existing methods remain centered on Gaussian primitives and expose surfaces only…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Weijie Wang, Zimu Li, Jinchuan Shi, Zeyu Zhang, Botao Ye, Marc Pollefeys, Donny Y. Chen, Bohan Zhuang

PointForward: Feedforward Driving Reconstruction through Point-Aligned Representations

High-fidelity reconstruction of driving scenes is crucial for autonomous driving. While recent feedforward 3D Gaussian Splatting (3DGS) methods enable fast reconstruction, their per-pixel Gaussian prediction paradigm often suffers from…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Cheng Chi, Xianqi Wang, Hongcheng Luo, Mingfei Tu, Gangwei Xu, Zehan Zhang, Bing Wang, Guang Chen, Hangjun Ye, Sida Peng, Xin Yang, Haiyang Sun

Scalable Scene Modeling from Perspective Imaging: Physics-based Appearance and Geometry Inference

3D scene modeling techniques serve as the bedrock of geospatial engineering and computer science, driving many applications ranging from automated driving, terrain mapping, navigation, virtual, augmented, mixed, and extended…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Shuang Song

UniForward: Unified 3D Scene and Semantic Field Reconstruction via Feed-Forward Gaussian Splatting from Only Sparse-View Images

We propose a feed-forward Gaussian Splatting model that unifies 3D scene and semantic field reconstruction. Combining 3D scenes with semantic fields facilitates the perception and understanding of the surrounding environment. However, key…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Qijian Tian, Xin Tan, Jingyu Gong, Yuan Xie, Lizhuang Ma

Geometry-guided Feature Learning and Fusion for Indoor Scene Reconstruction

In addition to color and textural information, geometry provides important cues for 3D scene reconstruction. However, current reconstruction methods only include geometry at the feature level thus not fully exploiting the geometric…

Computer Vision and Pattern Recognition · Computer Science 2024-08-29 Ruihong Yin, Sezer Karaoglu, Theo Gevers

Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image

We propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Stanislaw Szymanowicz, Eldar Insafutdinov, Chuanxia Zheng, Dylan Campbell, João F. Henriques, Christian Rupprecht, Andrea Vedaldi

360-GeoGS: Geometrically Consistent Feed-Forward 3D Gaussian Splatting Reconstruction for 360 Images

3D scene reconstruction is fundamental for spatial intelligence applications such as AR, robotics, and digital twins. Traditional multi-view stereo struggles with sparse viewpoints or low-texture regions, while neural rendering approaches,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Jiaqi Yao, Zhongmiao Yan, Jingyi Xu, Songpengcheng Xia, Yan Xiang, Ling Pei

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

We present 4DNeX, the first feed-forward framework for generating 4D (i.e., dynamic 3D) scene representations from a single image. In contrast to existing methods that rely on computationally intensive optimization or require multi-frame…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Zhaoxi Chen, Tianqi Liu, Long Zhuo, Jiawei Ren, Zeng Tao, He Zhu, Fangzhou Hong, Liang Pan, Ziwei Liu

Diffusion Models in 3D Vision: A Survey

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang, Dongyuan Li, Yaozu Wu, Tianyu He, Jiang Bian, Renhe Jiang

Recent Advances in 3D Object and Scene Generation: A Survey

In recent years, the demand for 3D content has grown exponentially with the intelligent upgrade of interactive media, extended reality (XR), and Metaverse industries. In order to overcome the limitations of traditional manual modeling…

Graphics · Computer Science 2025-12-23 Xiang Tang, Ruotong Li, Xiaopeng Fan

Feedforward 3D Editing Learns from Semantic-Part Transformation

3D editing is a fundamental capability for scalable 3D content creation. While image editing has rapidly evolved toward large-scale feedforward generative paradigms, 3D AI generation remains dominated by training-free editing pipelines. A…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Jiawei Weng, Saining Zhang, Zhenxin Diao, Peishuo Li, Henghaofan Zhang, Junhao Chen, Hao Zhao

Generative Densification: Learning to Densify Gaussians for High-Fidelity Generalizable 3D Reconstruction

Generalized feed-forward Gaussian models have achieved significant progress in sparse-view 3D reconstruction by leveraging prior knowledge from large multi-view datasets. However, these models often struggle to represent high-frequency…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Seungtae Nam, Xiangyu Sun, Gyeongjin Kang, Younggeun Lee, Seungjun Oh, Eunbyung Park

3D Scene Generation: A Survey

3D scene generation seeks to synthesize spatially structured, semantically meaningful, and photorealistic environments for applications such as immersive media, robotics, autonomous driving, and embodied AI. Early methods based on…

Computer Vision and Pattern Recognition · Computer Science 2025-05-09 Beichen Wen, Haozhe Xie, Zhaoxi Chen, Fangzhou Hong, Ziwei Liu

PlückeRF: A Line-based 3D Representation for Few-view Reconstruction

Feed-forward 3D reconstruction methods aim to predict the 3D structure of a scene directly from input images, providing a faster alternative to per-scene optimization approaches. Significant progress has been made in single-view and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Sam Bahrami, Dylan Campbell