Related papers: T-Code: Simple Temporal Latent Code for Efficient …

CD-NGP: A Fast Scalable Continual Representation for Dynamic Scenes

Novel view synthesis (NVS) in dynamic scenes faces persistent challenges in memory consumption, model complexity, training efficiency, and rendering quality. Offline methods offer high fidelity but suffer from high memory usage and limited…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Zhenhuan Liu , Shuai Liu , Zhiwei Ning , Jie Yang , Yifan Zuo , Yuming Fang , Wei Liu

Temporal-MPI: Enabling Multi-Plane Images for Dynamic Scene Modelling via Temporal Basis Learning

Novel view synthesis of static scenes has achieved remarkable advancements in producing photo-realistic results. However, key challenges remain for immersive rendering of dynamic scenes. One of the seminal image-based rendering method, the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-09 Wenpeng Xing , Jie Chen

Fast Non-Rigid Radiance Fields from Monocularized Data

The reconstruction and novel view synthesis of dynamic scenes recently gained increased attention. As reconstruction from large-scale multi-view data involves immense memory and computational requirements, recent benchmark datasets provide…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Moritz Kappel , Vladislav Golyanik , Susana Castillo , Christian Theobalt , Marcus Magnor

Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression

For neural video codec, it is critical, yet challenging, to design an efficient entropy model which can accurately predict the probability distribution of the quantized latent representation. However, most existing video codecs directly use…

Image and Video Processing · Electrical Eng. & Systems 2022-07-14 Jiahao Li , Bin Li , Yan Lu

D-NPC: Dynamic Neural Point Clouds for Non-Rigid View Synthesis from Monocular Video

Dynamic reconstruction and spatiotemporal novel-view synthesis of non-rigidly deforming scenes recently gained increased attention. While existing work achieves impressive quality and performance on multi-view or teleporting camera setups,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-03 Moritz Kappel , Florian Hahlbohm , Timon Scholz , Susana Castillo , Christian Theobalt , Martin Eisemann , Vladislav Golyanik , Marcus Magnor

Real-time High-resolution View Synthesis of Complex Scenes with Explicit 3D Visibility Reasoning

Rendering photo-realistic novel-view images of complex scenes has been a long-standing challenge in computer graphics. In recent years, great research progress has been made on enhancing rendering quality and accelerating rendering speed in…

Graphics · Computer Science 2024-02-21 Tiansong Zhou , Yebin Liu , Xuangeng Chu , Chengkun Cao , Changyin Zhou , Fei Yu , Yu Li

HGS: Hybrid Gaussian Splatting with Static-Dynamic Decomposition for Compact Dynamic View Synthesis

Dynamic novel view synthesis (NVS) is essential for creating immersive experiences. Existing approaches have advanced dynamic NVS by introducing 3D Gaussian Splatting (3DGS) with implicit deformation fields or indiscriminately assigned…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Kaizhe Zhang , Yijie Zhou , Weizhan Zhang , Caixia Yan , Haipeng Du , yugui xie , Yu-Hui Wen , Yong-Jin Liu

Memorize When Needed: Decoupled Memory Control for Spatially Consistent Long-Horizon Video Generation

Spatially consistent long-horizon video generation aims to maintain temporal and spatial consistency along predefined camera trajectories. Existing methods mostly entangle memory modeling with video generation, leading to inconsistent…

Computer Vision and Pattern Recognition · Computer Science 2026-04-22 Yanjun Guo , Zhengqiang Zhang , Pengfei Wang , Xinyue Liang , Zhiyuan Ma , Lei Zhang

You Can Ground Earlier than See: An Effective and Efficient Pipeline for Temporal Sentence Grounding in Compressed Videos

Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target moment semantically according to a sentence query. Although previous respectable works have made decent success, they only focus on high-level visual…

Computer Vision and Pattern Recognition · Computer Science 2026-05-26 Xiang Fang , Daizong Liu , Pan Zhou , Guoshun Nan

Fast View Synthesis of Casual Videos with Soup-of-Planes

Novel view synthesis from an in-the-wild video is difficult due to challenges like scene dynamics and lack of parallax. While existing methods have shown promising results with implicit neural radiance fields, they are slow to train and…

Computer Vision and Pattern Recognition · Computer Science 2024-07-22 Yao-Chih Lee , Zhoutong Zhang , Kevin Blackburn-Matzen , Simon Niklaus , Jianming Zhang , Jia-Bin Huang , Feng Liu

Compound Text-Guided Prompt Tuning via Image-Adaptive Cues

Vision-Language Models (VLMs) such as CLIP have demonstrated remarkable generalization capabilities to downstream tasks. However, existing prompt tuning based frameworks need to parallelize learnable textual inputs for all categories,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Hao Tan , Jun Li , Yizhuang Zhou , Jun Wan , Zhen Lei , Xiangyu Zhang

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans

This paper addresses the challenge of novel view synthesis for a human performer from a very sparse set of camera views. Some recent works have shown that learning implicit neural representations of 3D scenes achieves remarkable view…

Computer Vision and Pattern Recognition · Computer Science 2021-03-30 Sida Peng , Yuanqing Zhang , Yinghao Xu , Qianqian Wang , Qing Shuai , Hujun Bao , Xiaowei Zhou

Tensor4D : Efficient Neural 4D Decomposition for High-fidelity Dynamic Reconstruction and Rendering

We present Tensor4D, an efficient yet effective approach to dynamic scene modeling. The key of our solution is an efficient 4D tensor decomposition method so that the dynamic scene can be directly represented as a 4D spatio-temporal tensor.…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Ruizhi Shao , Zerong Zheng , Hanzhang Tu , Boning Liu , Hongwen Zhang , Yebin Liu

Conceptual Compression via Deep Structure and Texture Synthesis

Existing compression methods typically focus on the removal of signal-level redundancies, while the potential and versatility of decomposing visual data into compact conceptual components still lack further study. To this end, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2022-03-11 Jianhui Chang , Zhenghui Zhao , Chuanmin Jia , Shiqi Wang , Lingbo Yang , Qi Mao , Jian Zhang , Siwei Ma

VNVC: A Versatile Neural Video Coding Framework for Efficient Human-Machine Vision

Almost all digital videos are coded into compact representations before being transmitted. Such compact representations need to be decoded back to pixels before being displayed to humans and - as usual - before being enhanced/analyzed by…

Image and Video Processing · Electrical Eng. & Systems 2023-11-03 Xihua Sheng , Li Li , Dong Liu , Houqiang Li

Efficient training for future video generation based on hierarchical disentangled representation of latent variables

Generating videos predicting the future of a given sequence has been an area of active research in recent years. However, an essential problem remains unsolved: most of the methods require large computational cost and memory usage for…

Computer Vision and Pattern Recognition · Computer Science 2021-06-09 Naoya Fushishita , Antonio Tejero-de-Pablos , Yusuke Mukuta , Tatsuya Harada

Hybrid Local-Global Context Learning for Neural Video Compression

In neural video codecs, current state-of-the-art methods typically adopt multi-scale motion compensation to handle diverse motions. These methods estimate and compress either optical flow or deformable offsets to reduce inter-frame…

Multimedia · Computer Science 2024-12-03 Yongqi Zhai , Jiayu Yang , Wei Jiang , Chunhui Yang , Luyang Tang , Ronggang Wang

Neural Deformable Voxel Grid for Fast Optimization of Dynamic View Synthesis

Recently, Neural Radiance Fields (NeRF) is revolutionizing the task of novel view synthesis (NVS) for its superior performance. In this paper, we propose to synthesize dynamic scenes. Extending the methods for static scenes to dynamic…

Computer Vision and Pattern Recognition · Computer Science 2022-10-12 Xiang Guo , Guanying Chen , Yuchao Dai , Xiaoqing Ye , Jiadai Sun , Xiao Tan , Errui Ding

Neural 3D Video Synthesis from Multi-view Video

We propose a novel approach for 3D video synthesis that is able to represent multi-view video recordings of a dynamic real-world scene in a compact, yet expressive representation that enables high-quality view synthesis and motion…

Computer Vision and Pattern Recognition · Computer Science 2022-05-04 Tianye Li , Mira Slavcheva , Michael Zollhoefer , Simon Green , Christoph Lassner , Changil Kim , Tanner Schmidt , Steven Lovegrove , Michael Goesele , Richard Newcombe , Zhaoyang Lv

Text-Visual Prompting for Efficient 2D Temporal Video Grounding

In this paper, we study the problem of temporal video grounding (TVG), which aims to predict the starting/ending time points of moments described by a text sentence within a long untrimmed video. Benefiting from fine-grained 3D visual…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Yimeng Zhang , Xin Chen , Jinghan Jia , Sijia Liu , Ke Ding