Related papers: IMPUS: Image Morphing with Perceptually-Uniform Sa…

DiffMorpher: Unleashing the Capability of Diffusion Models for Image Morphing

Diffusion models have achieved remarkable image generation quality surpassing previous generative models. However, a notable limitation of diffusion models, in comparison to GANs, is their difficulty in smoothly interpolating between two…

Computer Vision and Pattern Recognition · Computer Science 2023-12-13 Kaiwen Zhang , Yifan Zhou , Xudong Xu , Xingang Pan , Bo Dai

Interpolating between Images with Diffusion Models

One little-explored frontier of image generation and editing is the task of interpolating between two input images, a feature missing from all currently deployed image generation pipelines. We argue that such a feature can expand the…

Computer Vision and Pattern Recognition · Computer Science 2023-07-25 Clinton J. Wang , Polina Golland

Diffusion-Based Image-to-Image Translation by Noise Correction via Prompt Interpolation

We propose a simple but effective training-free approach tailored to diffusion-based image-to-image translation. Our approach revises the original noise prediction network of a pretrained diffusion model by introducing a noise correction…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Junsung Lee , Minsoo Kang , Bohyung Han

Which Way from B to A: The role of embedding geometry in image interpolation for Stable Diffusion

It can be shown that Stable Diffusion has a permutation-invariance property with respect to the rows of Contrastive Language-Image Pretraining (CLIP) embedding matrices. This inspired the novel observation that these embeddings can…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Nicholas Karris , Luke Durell , Javier Flores , Tegan Emerson

Coupled Diffusion Sampling for Training-Free Multi-View Image Editing

We present an inference-time diffusion sampling method to perform multi-view consistent image editing using pre-trained 2D image editing models. These models can independently produce high-quality edits for each image in a set of multi-view…

Computer Vision and Pattern Recognition · Computer Science 2025-10-17 Hadi Alzayer , Yunzhi Zhang , Chen Geng , Jia-Bin Huang , Jiajun Wu

Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models

Recently large-scale language-image models (e.g., text-guided diffusion models) have considerably improved the image generation capabilities to generate photorealistic images in various domains. Based on this success, current image editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Wenkai Dong , Song Xue , Xiaoyue Duan , Shumin Han

DreamMover: Leveraging the Prior of Diffusion Models for Image Interpolation with Large Motion

We study the problem of generating intermediate images from image pairs with large motion while maintaining semantic consistency. Due to the large motion, the intermediate semantic information may be absent in input images. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2024-09-19 Liao Shen , Tianqi Liu , Huiqiang Sun , Xinyi Ye , Baopu Li , Jianming Zhang , Zhiguo Cao

MIDMs: Matching Interleaved Diffusion Models for Exemplar-based Image Translation

We present a novel method for exemplar-based image translation, called matching interleaved diffusion models (MIDMs). Most existing methods for this task were formulated as GAN-based matching-then-generation framework. However, in this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Junyoung Seo , Gyuseong Lee , Seokju Cho , Jiyoung Lee , Seungryong Kim

Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models

Despite promising progress in face swapping task, realistic swapped images remain elusive, often marred by artifacts, particularly in scenarios involving high pose variation, color differences, and occlusion. To address these issues, we…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Sanoojan Baliah , Qinliang Lin , Shengcai Liao , Xiaodan Liang , Muhammad Haris Khan

NoiseDiffusion: Correcting Noise for Image Interpolation with Diffusion Models beyond Spherical Linear Interpolation

Image interpolation based on diffusion models is promising in creating fresh and interesting images. Advanced interpolation methods mainly focus on spherical linear interpolation, where images are encoded into the noise space and then…

Computer Vision and Pattern Recognition · Computer Science 2024-03-15 PengFei Zheng , Yonggang Zhang , Zhen Fang , Tongliang Liu , Defu Lian , Bo Han

DiffCap: Diffusion-based Real-time Human Motion Capture using Sparse IMUs and a Monocular Camera

Combining sparse IMUs and a monocular camera is a new promising setting to perform real-time human motion capture. This paper proposes a diffusion-based solution to learn human motion priors and fuse the two modalities of signals together…

Computer Vision and Pattern Recognition · Computer Science 2025-08-11 Shaohua Pan , Xinyu Yi , Yan Zhou , Weihua Jian , Yuan Zhang , Pengfei Wan , Feng Xu

Implicit and Explicit Language Guidance for Diffusion-based Visual Perception

Text-to-image diffusion models have shown powerful ability on conditional image synthesis. With large-scale vision-language pre-training, diffusion models are able to generate high-quality images with rich texture and reasonable structure…

Computer Vision and Pattern Recognition · Computer Science 2024-08-16 Hefeng Wang , Jiale Cao , Jin Xie , Aiping Yang , Yanwei Pang

IRAD: Implicit Representation-driven Image Resampling against Adversarial Attacks

We introduce a novel approach to counter adversarial attacks, namely, image resampling. Image resampling transforms a discrete image into a new one, simulating the process of scene recapturing or rerendering as specified by a geometrical…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yue Cao , Tianlin Li , Xiaofeng Cao , Ivor Tsang , Yang Liu , Qing Guo

Multi-Prompt Style Interpolation for Fine-Grained Artistic Control

Text-driven image style transfer has seen remarkable progress with methods leveraging cross-modal embeddings for fast, high-quality stylization. However, most existing pipelines assume a \emph{single} textual style prompt, limiting the…

Graphics · Computer Science 2025-07-31 Lei Chen , Hao Li , Yuxin Zhang , Chao Li , Kai Wen

Paint by Example: Exemplar-based Image Editing with Diffusion Models

Language-guided image editing has achieved great success recently. In this paper, for the first time, we investigate exemplar-guided image editing for more precise control. We achieve this goal by leveraging self-supervised training to…

Computer Vision and Pattern Recognition · Computer Science 2022-11-24 Binxin Yang , Shuyang Gu , Bo Zhang , Ting Zhang , Xuejin Chen , Xiaoyan Sun , Dong Chen , Fang Wen

LD-RPS: Zero-Shot Unified Image Restoration via Latent Diffusion Recurrent Posterior Sampling

Unified image restoration is a significantly challenging task in low-level vision. Existing methods either make tailored designs for specific tasks, limiting their generalizability across various types of degradation, or rely on training…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Huaqiu Li , Yong Wang , Tongwen Huang , Hailang Huang , Haoqian Wang , Xiangxiang Chu

Diffusion-Based Conditional Image Editing through Optimized Inference with Guidance

We present a simple but effective training-free approach for text-driven image-to-image translation based on a pretrained text-to-image diffusion model. Our goal is to generate an image that aligns with the target task while preserving the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Hyunsoo Lee , Minsoo Kang , Bohyung Han

AID: Attention Interpolation of Text-to-Image Diffusion

Conditional diffusion models can create unseen images in various settings, aiding image interpolation. Interpolation in latent spaces is well-studied, but interpolation with specific conditions like text or poses is less understood. Simple…

Computer Vision and Pattern Recognition · Computer Science 2024-10-07 Qiyuan He , Jinghao Wang , Ziwei Liu , Angela Yao

When using a diffusion model for image editing, there are times when the modified image can differ greatly from the source. To address this, we apply a dual-guidance approach to maintain high fidelity to the original in areas that are not…

Computer Vision and Pattern Recognition · Computer Science 2023-12-13 Ruichen Zhang

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images.They either finetune the model, or invert the image in the latent space of the pretrained model. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Senmao Li , Joost van de Weijer , Taihang Hu , Fahad Shahbaz Khan , Qibin Hou , Yaxing Wang , Jian Yang , Ming-Ming Cheng