English
Related papers

Related papers: Feed-Forward 3D Scene Modeling: A Problem-Driven P…

200 papers

3D reconstruction and view synthesis are foundational problems in computer vision, graphics, and immersive technologies such as augmented reality (AR), virtual reality (VR), and digital twins. Traditional methods rely on computationally…

3D reconstruction, which aims to recover the dense three-dimensional structure of a scene, is a cornerstone technology for numerous applications, including augmented/virtual reality, autonomous driving, and robotics. While traditional…

Computer Vision and Pattern Recognition · Computer Science 2025-07-14 Wei Zhang , Yihang Wu , Songhua Li , Wenjie Ma , Xin Ma , Qiang Li , Qi Wang

We introduce MapAnything, a unified transformer-based feed-forward model that ingests one or more images along with optional geometric inputs such as camera intrinsics, poses, depth, or partial reconstructions, and then directly regresses…

Recent AI-based 3D content creation has largely evolved along two paths: feed-forward image-to-3D reconstruction approaches and 3D generative models trained with 2D or 3D supervision. In this work, we show that existing feed-forward…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Suttisak Wizadwongsa , Jinfan Zhou , Edward Li , Jeong Joon Park

Structure-from-Motion -- the process of simultaneously estimating camera poses and 3D scene structure from a collection of images -- remains a central challenge in computer vision, with many open problems yet to be solved. Recent advances…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Linfei Pan , Johannes Schönberger , Marc Pollefeys

Feed-forward 3D reconstruction models are efficient but rigid: once trained, they perform inference in a zero-shot manner and cannot adapt to the test scene. As a result, visually plausible reconstructions often contain errors, particularly…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Yuhang Dai , Xingyi Yang

Sparse-view 3D reconstruction is increasingly addressed with feed-forward splatting networks that predict explicit primitives directly from images. Yet most existing methods remain centered on Gaussian primitives and expose surfaces only…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Weijie Wang , Zimu Li , Jinchuan Shi , Zeyu Zhang , Botao Ye , Marc Pollefeys , Donny Y. Chen , Bohan Zhuang

High-fidelity reconstruction of driving scenes is crucial for autonomous driving. While recent feedforward 3D Gaussian Splatting (3DGS) methods enable fast reconstruction, their per-pixel Gaussian prediction paradigm often suffers from…

Computer Vision and Pattern Recognition · Computer Science 2026-05-13 Cheng Chi , Xianqi Wang , Hongcheng Luo , Mingfei Tu , Gangwei Xu , Zehan Zhang , Bing Wang , Guang Chen , Hangjun Ye , Sida Peng , Xin Yang , Haiyang Sun

3D scene modeling techniques serve as the bedrocks in the geospatial engineering and computer science, which drives many applications ranging from automated driving, terrain mapping, navigation, virtual, augmented, mixed, and extended…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Shuang Song

We propose a feed-forward Gaussian Splatting model that unifies 3D scene and semantic field reconstruction. Combining 3D scenes with semantic fields facilitates the perception and understanding of the surrounding environment. However, key…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Qijian Tian , Xin Tan , Jingyu Gong , Yuan Xie , Lizhuang Ma

In addition to color and textural information, geometry provides important cues for 3D scene reconstruction. However, current reconstruction methods only include geometry at the feature level thus not fully exploiting the geometric…

Computer Vision and Pattern Recognition · Computer Science 2024-08-29 Ruihong Yin , Sezer Karaoglu , Theo Gevers

We propose Flash3D, a method for scene reconstruction and novel view synthesis from a single image which is both very generalisable and efficient. For generalisability, we start from a "foundation" model for monocular depth estimation and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Stanislaw Szymanowicz , Eldar Insafutdinov , Chuanxia Zheng , Dylan Campbell , João F. Henriques , Christian Rupprecht , Andrea Vedaldi

3D scene reconstruction is fundamental for spatial intelligence applications such as AR, robotics, and digital twins. Traditional multi-view stereo struggles with sparse viewpoints or low-texture regions, while neural rendering approaches,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Jiaqi Yao , Zhongmiao Yan , Jingyi Xu , Songpengcheng Xia , Yan Xiang , Ling Pei

We present 4DNeX, the first feed-forward framework for generating 4D (i.e., dynamic 3D) scene representations from a single image. In contrast to existing methods that rely on computationally intensive optimization or require multi-frame…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Zhaoxi Chen , Tianqi Liu , Long Zhuo , Jiawei Ren , Zeng Tao , He Zhu , Fangzhou Hong , Liang Pan , Ziwei Liu

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang , Dongyuan Li , Yaozu Wu , Tianyu He , Jiang Bian , Renhe Jiang

In recent years, the demand for 3D content has grown exponentially with the intelligent upgrade of interactive media, extended reality (XR), and Metaverse industries. In order to overcome the limitations of traditional manual modeling…

Graphics · Computer Science 2025-12-23 Xiang Tang , Ruotong Li , Xiaopeng Fan

3D editing is a fundamental capability for scalable 3D content creation. While image editing has rapidly evolved toward large-scale feedforward generative paradigms, 3D AI generation remains dominated by training-free editing pipelines. A…

Computer Vision and Pattern Recognition · Computer Science 2026-05-28 Jiawei Weng , Saining Zhang , Zhenxin Diao , Peishuo Li , Henghaofan Zhang , Junhao Chen , Hao Zhao

Generalized feed-forward Gaussian models have achieved significant progress in sparse-view 3D reconstruction by leveraging prior knowledge from large multi-view datasets. However, these models often struggle to represent high-frequency…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Seungtae Nam , Xiangyu Sun , Gyeongjin Kang , Younggeun Lee , Seungjun Oh , Eunbyung Park

3D scene generation seeks to synthesize spatially structured, semantically meaningful, and photorealistic environments for applications such as immersive media, robotics, autonomous driving, and embodied AI. Early methods based on…

Computer Vision and Pattern Recognition · Computer Science 2025-05-09 Beichen Wen , Haozhe Xie , Zhaoxi Chen , Fangzhou Hong , Ziwei Liu

Feed-forward 3D reconstruction methods aim to predict the 3D structure of a scene directly from input images, providing a faster alternative to per-scene optimization approaches. Significant progress has been made in single-view and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Sam Bahrami , Dylan Campbell
‹ Prev 1 2 3 10 Next ›