English
Related papers

Related papers: DragDiffusion: Harnessing Diffusion Models for Int…

200 papers

DragDiffusion is a diffusion-based method for interactive point-based image editing that enables users to manipulate images by directly dragging selected points. The method claims that accurate spatial control can be achieved by optimizing…

Computer Vision and Pattern Recognition · Computer Science 2026-02-16 Ali Subhan , Ashir Raza

Point-based image editing has attracted remarkable attention since the emergence of DragGAN. Recently, DragDiffusion further pushes forward the generative quality via adapting this dragging technique to diffusion models. Despite these great…

Computer Vision and Pattern Recognition · Computer Science 2024-03-08 Yutao Cui , Xiaotong Zhao , Guozhen Zhang , Shengming Cao , Kai Ma , Limin Wang

A precise and user-friendly manipulation of image content while preserving image fidelity has always been crucial to the field of image editing. Thanks to the power of generative models, recent point-based image editing methods allow users…

Computer Vision and Pattern Recognition · Computer Science 2024-01-15 Minxing Luo , Wentao Cheng , Jian Yang

Recently, several point-based image editing methods (e.g., DragDiffusion, FreeDrag, DragNoise) have emerged, yielding precise and high-quality results based on user instructions. However, these methods often make insufficient use of…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 DuoSheng Chen , Binghui Chen , Yifeng Geng , Liefeng Bo

Despite the ability of existing large-scale text-to-image (T2I) models to generate high-quality images from detailed textual descriptions, they often lack the ability to precisely edit the generated or real images. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

The rapid advancement in image generation models has predominantly been driven by diffusion models, which have demonstrated unparalleled success in generating high-fidelity, diverse images from textual prompts. Despite their success,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Yusuf Dalva , Hidir Yesiltepe , Pinar Yanardag

Point-based interactive editing serves as an essential tool to complement the controllability of existing generative models. A concurrent work, DragDiffusion, updates the diffusion latent map in response to user inputs, causing global…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Haofeng Liu , Chenshu Xu , Yifei Yang , Lihua Zeng , Shengfeng He

Traditional point-based image editing methods rely on iterative latent optimization or geometric transformations, which are either inefficient in their processing or fail to capture the semantic relationships within the image. These methods…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Biao Yang , Muqi Huang , Yuhui Zhang , Yun Xiong , Kun Zhou , Xi Chen , Shiyang Zhou , Huishuai Bao , Chuan Li , Feng Shi , Hualei Liu

Facial attribute editing aims to modify target attributes while preserving attribute-irrelevant content and overall image fidelity. Existing GAN-based methods provide favorable controllability, but often suffer from weak alignment between…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Wenmin Huang , Weiqi Luo , Xiaochun Cao , Jiwu Huang

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Point-drag-based image editing methods, like DragDiffusion, have attracted significant attention. However, point-drag-based approaches suffer from computational overhead and misinterpretation of user intentions due to the sparsity of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-26 Jingyi Lu , Xinghui Li , Kai Han

This paper explores image editing under the joint control of text and drag interactions. While recent advances in text-driven and drag-driven editing have achieved remarkable progress, they suffer from complementary limitations: text-driven…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Qihang Wang , Yaxiong Wang , Lechao Cheng , Zhun Zhong

Point-based image editing enables accurate and flexible control through content dragging. However, the role of text embedding during the editing process has not been thoroughly investigated. A significant aspect that remains unexplored is…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Gayoon Choi , Taejin Jeong , Sujung Hong , Seong Jae Hwang

One highly promising direction for enabling flexible real-time on-device image editing is utilizing data distillation by leveraging large-scale text-to-image diffusion models to generate paired datasets used for training generative…

Computer Vision and Pattern Recognition · Computer Science 2024-06-04 Yifan Gong , Zheng Zhan , Qing Jin , Yanyu Li , Yerlan Idelbayev , Xian Liu , Andrey Zharkov , Kfir Aberman , Sergey Tulyakov , Yanzhi Wang , Jian Ren

Synthesizing visual content that meets users' needs often requires flexible and precise controllability of the pose, shape, expression, and layout of the generated objects. Existing approaches gain controllability of generative adversarial…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Xingang Pan , Ayush Tewari , Thomas Leimkühler , Lingjie Liu , Abhimitra Meka , Christian Theobalt

Drag-based editing within pretrained diffusion model provides a precise and flexible way to manipulate foreground objects. Traditional methods optimize the input feature obtained from DDIM inversion directly, adjusting them iteratively to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Siwei Xia , Li Sun , Tiantian Sun , Qingli Li

Image inversion is a fundamental task in generative models, aiming to map images back to their latent representations to enable downstream applications such as editing, restoration, and style transfer. This paper provides a comprehensive…

Computer Vision and Pattern Recognition · Computer Science 2025-02-18 Yinan Chen , Jiangning Zhang , Yali Bi , Xiaobin Hu , Teng Hu , Zhucun Xue , Ran Yi , Yong Liu , Ying Tai

Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have showed potential in manipulating images especially when combined with CLIP, these…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Dongxu Yue , Qin Guo , Munan Ning , Jiaxi Cui , Yuesheng Zhu , Li Yuan

Large-scale Text-to-Image (T2I) diffusion models have revolutionized image generation over the last few years. Although owning diverse and high-quality generation capabilities, translating these abilities to fine-grained image editing…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

Our goal is to develop fine-grained real-image editing methods suitable for real-world applications. In this paper, we first summarize four requirements for these methods and propose a novel diffusion-based image editing framework with…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Naoki Matsunaga , Masato Ishii , Akio Hayakawa , Kenji Suzuki , Takuya Narihira
‹ Prev 1 2 3 10 Next ›