English
Related papers

Related papers: SimInversion: A Simple Framework for Inversion-Bas…

200 papers

Text-guided diffusion models have revolutionized image generation and editing, offering exceptional realism and diversity. Specifically, in the context of diffusion-based editing, where a source image is edited according to a target prompt,…

Computer Vision and Pattern Recognition · Computer Science 2023-10-20 Xuan Ju , Ailing Zeng , Yuxuan Bian , Shaoteng Liu , Qiang Xu

Despite recent advances in inversion-based editing, text-guided image manipulation remains challenging for diffusion models. The primary bottlenecks include 1) the time-consuming nature of the inversion process; 2) the struggle to balance…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Sihan Xu , Yidong Huang , Jiayi Pan , Ziqiao Ma , Joyce Chai

Text-conditional image editing is a practical AIGC task that has recently emerged with great commercial and academic value. For real image editing, most diffusion model-based methods use DDIM Inversion as the first stage before editing.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Jiancheng Huang , Yi Huang , Jianzhuang Liu , Donghao Zhou , Yifan Liu , Shifeng Chen

Diffusion models have opened the path to a wide range of text-based image editing frameworks. However, these typically build on the multi-step nature of the diffusion backwards process, and adapting them to distilled, fast-sampling methods…

Computer Vision and Pattern Recognition · Computer Science 2024-08-02 Gilad Deutch , Rinon Gal , Daniel Garibi , Or Patashnik , Daniel Cohen-Or

Diffusion models have achieved remarkable success in the domain of text-guided image generation and, more recently, in text-guided image editing. A commonly adopted strategy for editing real images involves inverting the diffusion process…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Wonjun Kang , Kevin Galim , Hyung Il Koo

Recent text-guided diffusion models provide powerful image generation capabilities. Currently, a massive effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. To edit a…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Ron Mokady , Amir Hertz , Kfir Aberman , Yael Pritch , Daniel Cohen-Or

Despite recent advances in UNet-based image editing, methods for shape-aware object editing in high-resolution images are still lacking. Compared to UNet, Diffusion Transformers (DiT) demonstrate superior capabilities to effectively capture…

Computer Vision and Pattern Recognition · Computer Science 2024-11-08 Kunyu Feng , Yue Ma , Bingyuan Wang , Chenyang Qi , Haozhe Chen , Qifeng Chen , Zeyu Wang

We present a simple but effective training-free approach for text-driven image-to-image translation based on a pretrained text-to-image diffusion model. Our goal is to generate an image that aligns with the target task while preserving the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-23 Hyunsoo Lee , Minsoo Kang , Bohyung Han

With the rise of large, publicly-available text-to-image diffusion models, text-guided real image editing has garnered much research attention recently. Existing methods tend to either rely on some form of per-instance or per-task…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Adham Elarabawy , Harish Kamath , Samuel Denton

Despite all recent progress, it is still challenging to edit and manipulate natural images with modern generative models. When using Generative Adversarial Network (GAN), one major hurdle is in the inversion process mapping a real image to…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Zhihong Pan , Riccardo Gherardi , Xiufeng Xie , Stephen Huang

Recently, text-to-image (T2I) editing has been greatly pushed forward by applying diffusion models. Despite the visual promise of the generated images, inconsistencies with the expected textual prompt remain prevalent. This paper aims to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Aoxue Li , Mingyang Yi , Zhenguo Li

Leveraging the large generative prior of the flow transformer for tuning-free image editing requires authentic inversion to project the image into the model's domain and a flexible invariance control mechanism to preserve non-target…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Pengcheng Xu , Boyuan Jiang , Xiaobin Hu , Donghao Luo , Qingdong He , Jiangning Zhang , Chengjie Wang , Yunsheng Wu , Charles Ling , Boyu Wang

Recently large-scale language-image models (e.g., text-guided diffusion models) have considerably improved the image generation capabilities to generate photorealistic images in various domains. Based on this success, current image editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Wenkai Dong , Song Xue , Xiaoyue Duan , Shumin Han

As one of the most successful generative models, diffusion models have demonstrated remarkable efficacy in synthesizing high-quality images. These models learn the underlying high-dimensional data distribution in an unsupervised manner.…

Computer Vision and Pattern Recognition · Computer Science 2024-12-12 Min Hou , Yueying Wu , Chang Xu , Yu-Hao Huang , Chenxi Bai , Le Wu , Jiang Bian

Diffusion distillation represents a highly promising direction for achieving faithful text-to-image generation in a few sampling steps. However, despite recent successes, existing distilled models still do not provide the full spectrum of…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Nikita Starodubcev , Mikhail Khoroshikh , Artem Babenko , Dmitry Baranchuk

Diffusion models have achieved remarkable success in image generation and editing tasks. Inversion within these models aims to recover the latent noise representation for a real or generated image, enabling reconstruction, editing, and…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Zixiang Li , Haoyu Wang , Wei Wang , Chuangchuang Tan , Yunchao Wei , Yao Zhao

Text-guided diffusion models have significantly advanced image editing, enabling high-quality and diverse modifications driven by text prompts. However, effective editing requires inverting the source image into a latent space, a process…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Haonan Lin , Mengmeng Wang , Jiahao Wang , Wenbin An , Yan Chen , Yong Liu , Feng Tian , Guang Dai , Jingdong Wang , Qianying Wang

Diffusion models have shown significant progress in image translation tasks recently. However, due to their stochastic nature, there's often a trade-off between style transformation and content preservation. Current strategies aim to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Gihyun Kwon , Jong Chul Ye

Finding an initial noise vector that produces an input image when fed into the diffusion process (known as inversion) is an important problem in denoising diffusion models (DDMs), with applications for real image editing. The…

Computer Vision and Pattern Recognition · Computer Science 2022-12-23 Bram Wallace , Akash Gokul , Nikhil Naik

Diffusion Models achieve state-of-the-art performance in generating new samples but lack a low-dimensional latent space that encodes the data into editable features. Inversion-based methods address this by reversing the denoising…

Computer Vision and Pattern Recognition · Computer Science 2026-01-28 Łukasz Staniszewski , Łukasz Kuciński , Kamil Deja
‹ Prev 1 2 3 10 Next ›