Related papers: Warped Diffusion: Solving Video Inverse Problems w…

Solving Video Inverse Problems Using Image Diffusion Models

Recently, diffusion model-based inverse problem solvers (DIS) have emerged as state-of-the-art approaches for addressing inverse problems, including image super-resolution, deblurring, inpainting, etc. However, their application to video…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Taesung Kwon, Jong Chul Ye

VISION-XL: High Definition Video Inverse Problem Solver using Latent Image Diffusion Models

In this paper, we propose a novel framework for solving high-definition video inverse problems using latent image diffusion models. Building on recent advancements in spatio-temporal optimization for video inverse problems using image…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Taesung Kwon, Jong Chul Ye

StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Diffusion-based methods can generate realistic images and videos, but they struggle to edit existing objects in a video while preserving their appearance over time. This prevents diffusion models from being applied to natural video editing…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Wenhao Chai, Xun Guo, Gaoang Wang, Yan Lu

Infusion: internal diffusion for inpainting of dynamic textures and complex motion

Video inpainting is the task of filling a region in a video in a visually convincing manner. It is very challenging due to the high dimensionality of the data and the temporal consistency required for obtaining convincing results. Recently,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-29 Nicolas Cherel, Andrés Almansa, Yann Gousseau, Alasdair Newson

On Equivariance and Fast Sampling in Video Diffusion Models Trained with Warped Noise

Temporally consistent video-to-video generation is critical for applications such as style transfer and upsampling. In this paper, we provide a theoretical analysis of warped noise - a recently proposed technique for training video…

Computer Vision and Pattern Recognition · Computer Science 2025-10-17 Chao Liu, Arash Vahdat

Solving 3D Inverse Problems using Pre-trained 2D Diffusion Models

Diffusion models have emerged as the new state-of-the-art generative model with high quality samples, with intriguing properties such as mode coverage and high flexibility. They have also been shown to be effective inverse problem solvers,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-06 Hyungjin Chung, Dohoon Ryu, Michael T. McCann, Marc L. Klasky, Jong Chul Ye

Solving Inverse Problems with Latent Diffusion Models via Hard Data Consistency

Diffusion models have recently emerged as powerful generative priors for solving inverse problems. However, training diffusion models in the pixel space is both data-intensive and computationally demanding, which restricts their…

Computer Vision and Pattern Recognition · Computer Science 2024-04-17 Bowen Song, Soo Min Kwon, Zecheng Zhang, Xinyu Hu, Qing Qu, Liyue Shen

A Survey on Diffusion Models for Inverse Problems

Diffusion models have become increasingly popular for generative modeling due to their ability to generate high-quality samples. This has unlocked exciting new possibilities for solving inverse problems, especially in image restoration and…

Machine Learning · Computer Science 2024-10-02 Giannis Daras, Hyungjin Chung, Chieh-Hsin Lai, Yuki Mitsufuji, Jong Chul Ye, Peyman Milanfar, Alexandros G. Dimakis, Mauricio Delbracio

G2D2: Gradient-Guided Discrete Diffusion for Inverse Problem Solving

Recent literature has effectively leveraged diffusion models trained on continuous variables as priors for solving inverse problems. Notably, discrete diffusion models with discrete latent codes have shown strong performance, particularly…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Naoki Murata, Chieh-Hsin Lai, Yuhta Takida, Toshimitsu Uesaka, Bac Nguyen, Stefano Ermon, Yuki Mitsufuji

Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction

Diffusion models have made significant strides in image generation, mastering tasks such as unconditional image synthesis, text-image translation, and image-to-image conversions. However, their capability falls short in the realm of video…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Gaurav Shrivastava, Abhinav Shrivastava

How I Warped Your Noise: a Temporally-Correlated Noise Prior for Diffusion Models

Video editing and generation methods often rely on pre-trained image-based diffusion models. During the diffusion process, however, the reliance on rudimentary noise sampling techniques that do not preserve correlations present in…

Computer Vision and Pattern Recognition · Computer Science 2025-04-07 Pascal Chang, Jingwei Tang, Markus Gross, Vinicius C. Azevedo

World-consistent Video Diffusion with Explicit 3D Modeling

Recent advancements in diffusion models have set new benchmarks in image and video generation, enabling realistic visual synthesis across single- and multi-frame contexts. However, these models still struggle with efficiently and explicitly…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Qihang Zhang, Shuangfei Zhai, Miguel Angel Bautista, Kevin Miao, Alexander Toshev, Joshua Susskind, Jiatao Gu

Fitting Image Diffusion Models on Video Datasets

Image diffusion models are trained on independently sampled static images. While this is the bedrock task protocol in generative modeling, capturing the temporal world through the lens of static snapshots is information-deficient by design.…

Computer Vision and Pattern Recognition · Computer Science 2025-09-05 Juhun Lee, Simon S. Woo

LatentColorization: Latent Diffusion-Based Speaker Video Colorization

While current research predominantly focuses on image-based colorization, the domain of video-based colorization remains relatively unexplored. Most existing video colorization techniques operate on a frame-by-frame basis, often overlooking…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Rory Ward, Dan Bigioi, Shubhajit Basak, John G. Breslin, Peter Corcoran

Image Motion Blur Removal in the Temporal Dimension with Video Diffusion Models

Most motion deblurring algorithms rely on spatial-domain convolution models, which struggle with the complex, non-linear blur arising from camera shake and object motion. In contrast, we propose a novel single-image deblurring approach that…

Image and Video Processing · Electrical Eng. & Systems 2025-01-23 Wang Pang, Zhihao Zhan, Xiang Zhu, Yechao Bai

VipDiff: Towards Coherent and Diverse Video Inpainting via Training-free Denoising Diffusion Models

Recent video inpainting methods have achieved encouraging improvements by leveraging optical flow to guide pixel propagation from reference frames either in the image space or feature space. However, they would produce severe artifacts in…

Computer Vision and Pattern Recognition · Computer Science 2025-01-22 Chaohao Xie, Kai Han, Kwan-Yee K. Wong

Video Diffusion Models

Generating temporally coherent high fidelity video is an important milestone in generative modeling research. We make progress towards this milestone by proposing a diffusion model for video generation that shows very promising initial…

Computer Vision and Pattern Recognition · Computer Science 2022-06-24 Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J. Fleet

SwiftTry: Fast and Consistent Video Virtual Try-On with Diffusion Models

Given an input video of a person and a new garment, the objective of this paper is to synthesize a new video where the person is wearing the specified garment while maintaining spatiotemporal consistency. Although significant advances have…

Computer Vision and Pattern Recognition · Computer Science 2024-12-19 Hung Nguyen, Quang Qui-Vinh Nguyen, Khoi Nguyen, Rang Nguyen

Improving 3D Imaging with Pre-Trained Perpendicular 2D Diffusion Models

Diffusion models have become a popular approach for image generation and reconstruction due to their numerous advantages. However, most diffusion-based inverse problem-solving methods only deal with 2D images, and even recently published 3D…

Image and Video Processing · Electrical Eng. & Systems 2023-09-04 Suhyeon Lee, Hyungjin Chung, Minyoung Park, Jonghyuk Park, Wi-Sun Ryu, Jong Chul Ye

DiffuEraser: A Diffusion Model for Video Inpainting

Recent video inpainting algorithms integrate flow-based pixel propagation with transformer-based generation to leverage optical flow for restoring textures and objects using information from neighboring frames, while completing masked…

Computer Vision and Pattern Recognition · Computer Science 2025-01-20 Xiaowen Li, Haolan Xue, Peiran Ren, Liefeng Bo