Related papers: Video Interpolation with Diffusion Models

VIDM: Video Implicit Diffusion Models

Diffusion models have emerged as a powerful generative method for synthesizing high-quality and diverse set of images. In this paper, we propose a video generation method based on diffusion models, where the effects of motion are modeled in…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Kangfu Mei , Vishal M. Patel

MiVID: Multi-Strategic Self-Supervision for Video Frame Interpolation using Diffusion Model

Video Frame Interpolation (VFI) remains a cornerstone in video enhancement, enabling temporal upscaling for tasks like slow-motion rendering, frame rate conversion, and video restoration. While classical methods rely on optical flow and…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Priyansh Srivastava , Romit Chatterjee , Abir Sen , Aradhana Behura , Ratnakar Dash

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

We present a method for generating video sequences with coherent motion between a pair of input key frames. We adapt a pretrained large-scale image-to-video diffusion model (originally trained to generate videos moving forward in time from…

Computer Vision and Pattern Recognition · Computer Science 2025-02-13 Xiaojuan Wang , Boyang Zhou , Brian Curless , Ira Kemelmacher-Shlizerman , Aleksander Holynski , Steven M. Seitz

ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler

Recent progress in large-scale text-to-video (T2V) and image-to-video (I2V) diffusion models has greatly enhanced video generation, especially in terms of keyframe interpolation. However, current image-to-video diffusion models, while…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Serin Yang , Taesung Kwon , Jong Chul Ye

GD-VDM: Generated Depth for better Diffusion-based Video Generation

The field of generative models has recently witnessed significant progress, with diffusion models showing remarkable performance in image generation. In light of this success, there is a growing interest in exploring the application of…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Ariel Lapid , Idan Achituve , Lior Bracha , Ethan Fetaya

ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation

Video generation has made remarkable progress in recent years, especially since the advent of the video diffusion models. Many video generation models can produce plausible synthetic videos, e.g., Stable Video Diffusion (SVD). However, most…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Shaoshu Yang , Yong Zhang , Xiaodong Cun , Ying Shan , Ran He

VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models

Diffusion models have achieved significant success in image and video generation. This motivates a growing interest in video editing tasks, where videos are edited according to provided text descriptions. However, most existing approaches…

Computer Vision and Pattern Recognition · Computer Science 2023-12-01 Zhen Xing , Qi Dai , Zihao Zhang , Hui Zhang , Han Hu , Zuxuan Wu , Yu-Gang Jiang

Video Diffusion Models

Generating temporally coherent high fidelity video is an important milestone in generative modeling research. We make progress towards this milestone by proposing a diffusion model for video generation that shows very promising initial…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Jonathan Ho , Tim Salimans , Alexey Gritsenko , William Chan , Mohammad Norouzi , David J. Fleet

VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

Recent video inpainting methods have achieved encouraging improvements by leveraging optical flow to guide pixel propagation from reference frames either in the image space or feature space. However, they would produce severe artifacts in…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Chaohao Xie , Kai Han , Kwan-Yee K. Wong

JVID: Joint Video-Image Diffusion for Visual-Quality and Temporal-Consistency in Video Generation

We introduce the Joint Video-Image Diffusion model (JVID), a novel approach to generating high-quality and temporally coherent videos. We achieve this by integrating two diffusion models: a Latent Image Diffusion Model (LIDM) trained on…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Hadrien Reynaud , Matthew Baugh , Mischa Dombrowski , Sarah Cechnicka , Qingjie Meng , Bernhard Kainz

VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation

A diffusion probabilistic model (DPM), which constructs a forward diffusion process by gradually adding noise to data points and learns the reverse denoising process to generate new samples, has been shown to handle complex data…

Computer Vision and Pattern Recognition · Computer Science 2023-10-16 Zhengxiong Luo , Dayou Chen , Yingya Zhang , Yan Huang , Liang Wang , Yujun Shen , Deli Zhao , Jingren Zhou , Tieniu Tan

GVD: Guiding Video Diffusion Model for Scalable Video Distillation

To address the larger computation and storage requirements associated with large video datasets, video dataset distillation aims to capture spatial and temporal information in a significantly smaller dataset, such that training on the…

Computer Vision and Pattern Recognition · Computer Science 2025-07-31 Kunyang Li , Jeffrey A Chan Santiago , Sarinda Dhanesh Samarasinghe , Gaowen Liu , Mubarak Shah

IDO-VFI: Identifying Dynamics via Optical Flow Guidance for Video Frame Interpolation with Events

Video frame interpolation aims to generate high-quality intermediate frames from boundary frames and increase frame rate. While existing linear, symmetric and nonlinear models are used to bridge the gap from the lack of inter-frame motion,…

Computer Vision and Pattern Recognition · Computer Science 2023-05-19 Chenyang Shi , Hanxiao Liu , Jing Jin , Wenzhuo Li , Yuzhen Li , Boyi Wei , Yibo Zhang

Interpolating between Images with Diffusion Models

One little-explored frontier of image generation and editing is the task of interpolating between two input images, a feature missing from all currently deployed image generation pipelines. We argue that such a feature can expand the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Clinton J. Wang , Polina Golland

LDMVFI: Video Frame Interpolation with Latent Diffusion Models

Existing works on video frame interpolation (VFI) mostly employ deep neural networks that are trained by minimizing the L1, L2, or deep feature space distance (e.g. VGG loss) between their outputs and ground-truth frames. However, recent…

Image and Video Processing · Electrical Eng. & Systems 2024-06-11 Duolikun Danier , Fan Zhang , David Bull

Accelerating Video Diffusion Models via Distribution Matching

Generative models, particularly diffusion models, have made significant success in data synthesis across various modalities, including images, videos, and 3D assets. However, current diffusion models are computationally intensive, often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Yuanzhi Zhu , Hanshu Yan , Huan Yang , Kai Zhang , Junnan Li

VMDiff: Visual Mixing Diffusion for Limitless Cross-Object Synthesis

Creating novel images by fusing visual cues from multiple sources is a fundamental yet underexplored problem in image-to-image generation, with broad applications in artistic creation, virtual reality and visual media. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zeren Xiong , Yue Yu , Zedong Zhang , Shuo Chen , Jian Yang , Jun Li

High-Resolution Frame Interpolation with Patch-based Cascaded Diffusion

Despite the recent progress, existing frame interpolation methods still struggle with processing extremely high resolution input and handling challenging cases such as repetitive textures, thin objects, and large motion. To address these…

Computer Vision and Pattern Recognition · Computer Science 2025-04-30 Junhwa Hur , Charles Herrmann , Saurabh Saxena , Janne Kontkanen , Wei-Sheng Lai , Yichang Shih , Michael Rubinstein , David J. Fleet , Deqing Sun

EventDiff: A Unified and Efficient Diffusion Model Framework for Event-based Video Frame Interpolation

Video Frame Interpolation (VFI) is a fundamental yet challenging task in computer vision, particularly under conditions involving large motion, occlusion, and lighting variation. Recent advancements in event cameras have opened up new…

Computer Vision and Pattern Recognition · Computer Science 2025-05-14 Hanle Zheng , Xujie Han , Zegang Peng , Shangbin Zhang , Guangxun Du , Zhuo Zou , Xilin Wang , Jibin Wu , Hao Guo , Lei Deng

Adapting Image-to-Video Diffusion Models for Large-Motion Frame Interpolation

With the development of video generation models has advanced significantly in recent years, we adopt large-scale image-to-video diffusion models for video frame interpolation. We present a conditional encoder designed to adapt an…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Luoxu Jin , Hiroshi Watanabe