Related papers: EditInfinity: Image Editing with Binary-Quantized …

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Text-guided diffusion models have revolutionized image generation and editing, offering exceptional realism and diversity. Specifically, in the context of diffusion-based editing, where a source image is edited according to a target prompt,…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Xuan Ju , Ailing Zeng , Yuxuan Bian , Shaoteng Liu , Qiang Xu

Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models

With the rise of large, publicly-available text-to-image diffusion models, text-guided real image editing has garnered much research attention recently. Existing methods tend to either rely on some form of per-instance or per-task…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Adham Elarabawy , Harish Kamath , Samuel Denton

Inverse-and-Edit: Effective and Fast Image Editing by Cycle Consistency Models

Recent advances in image editing with diffusion models have achieved impressive results, offering fine-grained control over the generation process. However, these methods are computationally intensive because of their iterative nature.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Ilia Beletskii , Andrey Kuznetsov , Aibek Alanov

Null-text Inversion for Editing Real Images using Guided Diffusion Models

Recent text-guided diffusion models provide powerful image generation capabilities. Currently, a massive effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. To edit a…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Ron Mokady , Amir Hertz , Kfir Aberman , Yael Pritch , Daniel Cohen-Or

ReGeneration Learning of Diffusion Models with Rich Prompts for Zero-Shot Image Translation

Large-scale text-to-image models have demonstrated amazing ability to synthesize diverse and high-fidelity images. However, these models are often violated by several limitations. Firstly, they require the user to provide precise and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Yupei Lin , Sen Zhang , Xiaojun Yang , Xiao Wang , Yukai Shi

Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models

Recently large-scale language-image models (e.g., text-guided diffusion models) have considerably improved the image generation capabilities to generate photorealistic images in various domains. Based on this success, current image editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Wenkai Dong , Song Xue , Xiaoyue Duan , Shumin Han

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images.They either finetune the model, or invert the image in the latent space of the pretrained model. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Senmao Li , Joost van de Weijer , Taihang Hu , Fahad Shahbaz Khan , Qibin Hou , Yaxing Wang , Jian Yang , Ming-Ming Cheng

Pix2Video: Video Editing using Image Diffusion

Image diffusion models, trained on massive image collections, have emerged as the most versatile image generator model in terms of quality and diversity. They support inverting real images and conditional (e.g., text) generation, making…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Duygu Ceylan , Chun-Hao Paul Huang , Niloy J. Mitra

SimInversion: A Simple Framework for Inversion-Based Text-to-Image Editing

Diffusion models demonstrate impressive image generation performance with text guidance. Inspired by the learning process of diffusion, existing images can be edited according to text by DDIM inversion. However, the vanilla DDIM inversion…

Computer Vision and Pattern Recognition · Computer Science 2024-09-17 Qi Qian , Haiyang Xu , Ming Yan , Juhua Hu

Eta Inversion: Designing an Optimal Eta Function for Diffusion-based Real Image Editing

Diffusion models have achieved remarkable success in the domain of text-guided image generation and, more recently, in text-guided image editing. A commonly adopted strategy for editing real images involves inverting the diffusion process…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Wonjun Kang , Kevin Galim , Hyung Il Koo

IntrinsicEdit: Precise generative image manipulation in intrinsic space

Generative diffusion models have advanced image editing with high-quality results and intuitive interfaces such as prompts and semantic drawing. However, these interfaces lack precise control, and the associated methods typically specialize…

Graphics · Computer Science 2025-05-16 Linjie Lyu , Valentin Deschaintre , Yannick Hold-Geoffroy , Miloš Hašan , Jae Shin Yoon , Thomas Leimkühler , Christian Theobalt , Iliyan Georgiev

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Generative models have been widely studied in computer vision. Recently, diffusion models have drawn substantial attention due to the high quality of their generated images. A key desired property of image generative models is the ability…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Qiucheng Wu , Yujian Liu , Handong Zhao , Ajinkya Kale , Trung Bui , Tong Yu , Zhe Lin , Yang Zhang , Shiyu Chang

ResetEdit: Precise Text-guided Editing of Generated Image via Resettable Starting Latent

Recent advances in diffusion models have enabled high-quality image generation, leading to increasing demand for post-generation editing that modifies local regions while preserving global structure. Achieving such flexible and precise…

Computer Vision and Pattern Recognition · Computer Science 2026-04-29 Hanyi Wang , Han Fang , Zheng Wang , Shilin Wang , Ee-Chien Chang

Inversion-Free Image Editing with Natural Language

Despite recent advances in inversion-based editing, text-guided image manipulation remains challenging for diffusion models. The primary bottlenecks include 1) the time-consuming nature of the inversion process; 2) the struggle to balance…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Sihan Xu , Yidong Huang , Jiayi Pan , Ziqiao Ma , Joyce Chai

ReNoise: Real Image Inversion Through Iterative Noising

Recent advancements in text-guided diffusion models have unlocked powerful image manipulation capabilities. However, applying these methods to real images necessitates the inversion of the images into the domain of the pretrained diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-03-22 Daniel Garibi , Or Patashnik , Andrey Voynov , Hadar Averbuch-Elor , Daniel Cohen-Or

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

We propose EditCrafter, a high-resolution image editing method that operates without tuning, leveraging pretrained text-to-image (T2I) diffusion models to process images at resolutions significantly exceeding those used during training.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Kunho Kim , Sumin Seo , Yongjun Cho , Hyungjin Chung

Image Inversion: A Survey from GANs to Diffusion and Beyond

Image inversion is a fundamental task in generative models, aiming to map images back to their latent representations to enable downstream applications such as editing, restoration, and style transfer. This paper provides a comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yinan Chen , Jiangning Zhang , Yali Bi , Xiaobin Hu , Teng Hu , Zhucun Xue , Ran Yi , Yong Liu , Ying Tai

LatentEdit: Adaptive Latent Control for Consistent Semantic Editing

Diffusion-based Image Editing has achieved significant success in recent years. However, it remains challenging to achieve high-quality image editing while maintaining the background similarity without sacrificing speed or memory…

Graphics · Computer Science 2025-09-03 Siyi Liu , Weiming Chen , Yushun Tang , Zhihai He

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Omer Bar-Tal , Lior Yariv , Yaron Lipman , Tali Dekel

DiffUTE: Universal Text Editing Diffusion Model

Diffusion model based language-guided image editing has achieved great success recently. However, existing state-of-the-art diffusion models struggle with rendering correct text and text style during generation. To tackle this problem, we…

Computer Vision and Pattern Recognition · Computer Science 2023-10-19 Haoxing Chen , Zhuoer Xu , Zhangxuan Gu , Jun Lan , Xing Zheng , Yaohui Li , Changhua Meng , Huijia Zhu , Weiqiang Wang