Related papers: Rethinking Video Super-Resolution: Towards Diffusi…

VIDM: Video Implicit Diffusion Models

Diffusion models have emerged as a powerful generative method for synthesizing high-quality and diverse sets of images. In this paper, we propose a video generation method based on diffusion models, where the effects of motion are modeled in…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Kangfu Mei, Vishal M. Patel

Video Diffusion Models

Generating temporally coherent high fidelity video is an important milestone in generative modeling research. We make progress towards this milestone by proposing a diffusion model for video generation that shows very promising initial…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J. Fleet

Dreamix: Video Diffusion Models are General Video Editors

Text-driven image and video diffusion models have recently achieved unprecedented generation realism. While diffusion models have been successfully applied for image editing, very few works have done so for video editing. We present the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-03 Eyal Molad, Eliahu Horwitz, Dani Valevski, Alex Rav Acha, Yossi Matias, Yael Pritch, Yaniv Leviathan, Yedid Hoshen

Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution

Text-based diffusion models have exhibited remarkable success in generation and editing, showing great promise for enhancing visual content with their generative prior. However, applying these models to video super-resolution remains…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Shangchen Zhou, Peiqing Yang, Jianyi Wang, Yihang Luo, Chen Change Loy

From Image to Video: An Empirical Study of Diffusion Representations

Diffusion models have revolutionized generative modeling, enabling unprecedented realism in image and video synthesis. This success has sparked interest in leveraging their representations for visual understanding tasks. While recent works…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Pedro Vélez, Luisa F. Polanía, Yi Yang, Chuhan Zhang, Rishabh Kabra, Anurag Arnab, Mehdi S. M. Sajjadi

An Efficient Algorithm for Video Super-Resolution Based On a Sequential Model

In this work, we propose a novel procedure for video super-resolution, that is, the recovery of a sequence of high-resolution images from its low-resolution counterpart. Our approach is based on a "sequential" model (i.e., each…

Computer Vision and Pattern Recognition · Computer Science 2016-02-16 Patrick Héas, Angélique Drémeau, Cédric Herzet

VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models

In this paper, we propose a novel framework for solving high-definition video inverse problems using latent image diffusion models. Building on recent advancements in spatio-temporal optimization for video inverse problems using image…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Taesung Kwon, Jong Chul Ye

Align your Latents: High-Resolution Video Synthesis with Latent Diffusion Models

Latent Diffusion Models (LDMs) enable high-quality image synthesis while avoiding excessive compute demands by training a diffusion model in a compressed lower-dimensional latent space. Here, we apply the LDM paradigm to high-resolution…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Andreas Blattmann, Robin Rombach, Huan Ling, Tim Dockhorn, Seung Wook Kim, Sanja Fidler, Karsten Kreis

Latent-Reframe: Enabling Camera Control for Video Diffusion Model without Training

Precise camera pose control is crucial for video generation with diffusion models. Existing methods require fine-tuning with additional datasets containing paired videos and camera pose annotations, which are both data-intensive and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Zhenghong Zhou, Jie An, Jiebo Luo

Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction

Diffusion models have made significant strides in image generation, mastering tasks such as unconditional image synthesis, text-image translation, and image-to-image conversions. However, their capability falls short in the realm of video…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Gaurav Shrivastava, Abhinav Shrivastava

Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models

Most motion deblurring algorithms rely on spatial-domain convolution models, which struggle with the complex, non-linear blur arising from camera shake and object motion. In contrast, we propose a novel single-image deblurring approach that…

Image and Video Processing · Electrical Eng. & Systems 2025-01-23 Wang Pang, Zhihao Zhan, Xiang Zhu, Yechao Bai

Local-Global Context-Aware and Structure-Preserving Image Super-Resolution

Diffusion models have recently achieved significant success in various image manipulation tasks, including image super-resolution and perceptual quality enhancement. Pretrained text-to-image models, such as Stable Diffusion, have exhibited…

Computer Vision and Pattern Recognition · Computer Science 2025-10-16 Sanchar Palit, Subhasis Chaudhuri, Biplab Banerjee

Diffusion Posterior Proximal Sampling for Image Restoration

Diffusion models have demonstrated remarkable efficacy in generating high-quality samples. Existing diffusion-based image restoration algorithms exploit pre-trained diffusion models to leverage data priors, yet they still preserve elements…

Image and Video Processing · Electrical Eng. & Systems 2024-08-07 Hongjie Wu, Linchao He, Mingqin Zhang, Dongdong Chen, Kunming Luo, Mengting Luo, Ji-Zhe Zhou, Hu Chen, Jiancheng Lv

Imagen Video: High Definition Video Generation with Diffusion Models

We present Imagen Video, a text-conditional video generation system based on a cascade of video diffusion models. Given a text prompt, Imagen Video generates high definition videos using a base video generation model and a sequence of…

Computer Vision and Pattern Recognition · Computer Science 2022-10-06 Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans

DiffVSR: Revealing an Effective Recipe for Taming Robust Video Super-Resolution Against Complex Degradations

Diffusion models have demonstrated exceptional capabilities in image restoration, yet their application to video super-resolution (VSR) faces significant challenges in balancing fidelity with temporal consistency. Our evaluation reveals a…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Xiaohui Li, Yihao Liu, Shuo Cao, Ziyan Chen, Shaobin Zhuang, Xiangyu Chen, Yinan He, Yi Wang, Yu Qiao

Survey of Video Diffusion Models: Foundations, Implementations, and Applications

Recent advances in diffusion models have revolutionized video generation, offering superior temporal consistency and visual quality compared to traditional generative adversarial networks-based approaches. While this emerging field shows…

Computer Vision and Pattern Recognition · Computer Science 2026-02-11 Yimu Wang, Xuye Liu, Wei Pang, Li Ma, Shuai Yuan, Paul Debevec, Ning Yu

OS-DiffVSR: Towards One-step Latent Diffusion Model for High-detailed Real-world Video Super-Resolution

Recently, latent diffusion models have demonstrated promising performance in the real-world video super-resolution (VSR) task, which can reconstruct high-quality videos from distorted low-resolution input through multiple diffusion steps.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Hanting Li, Huaao Tang, Jianhong Han, Tianxiong Zhou, Jiulong Cui, Haizhen Xie, Yan Chen, Jie Hu

VideoMerge: Towards Training-free Long Video Generation

Long video generation remains a challenging and compelling topic in computer vision. Diffusion-based models, among the various approaches to video generation, have achieved state-of-the-art quality with their iterative denoising procedures.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-14 Siyang Zhang, Harry Yang, Ser-Nam Lim

DiffuseSlide: Training-Free High Frame Rate Video Generation Diffusion

Recent advancements in diffusion models have revolutionized video generation, enabling the creation of high-quality, temporally consistent videos. However, generating high frame-rate (FPS) videos remains a significant challenge due to…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Geunmin Hwang, Hyun-kyu Ko, Younghyun Kim, Seungryong Lee, Eunbyung Park

Diffusion-aided Extreme Video Compression with Lightweight Semantics Guidance

Modern video codecs and learning-based approaches struggle for semantic reconstruction at extremely low bit-rates due to reliance on low-level spatiotemporal redundancies. Generative models, especially diffusion models, offer a new paradigm…

Image and Video Processing · Electrical Eng. & Systems 2026-02-06 Maojun Zhang, Haotian Wu, Richeng Jin, Deniz Gunduz, Krystian Mikolajczyk