Related papers: Reproducing DragDiffusion: Interactive Point-Based…

DragDiffusion: Harnessing Diffusion Models for Interactive Point-based Image Editing

Accurate and controllable image editing is a challenging task that has attracted significant attention recently. Notably, DragGAN is an interactive point-based image editing framework that achieves impressive editing results with…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Yujun Shi, Chuhui Xue, Jun Hao Liew, Jiachun Pan, Hanshu Yan, Wenqing Zhang, Vincent Y. F. Tan, Song Bai

StableDrag: Stable Dragging for Point-based Image Editing

Point-based image editing has attracted remarkable attention since the emergence of DragGAN. Recently, DragDiffusion further pushes forward the generative quality via adapting this dragging technique to diffusion models. Despite these great…

Computer Vision and Pattern Recognition · Computer Science 2024-03-08 Yutao Cui, Xiaotong Zhao, Guozhen Zhang, Shengming Cao, Kai Ma, Limin Wang

RotationDrag: Point-based Image Editing with Rotated Diffusion Features

A precise and user-friendly manipulation of image content while preserving image fidelity has always been crucial to the field of image editing. Thanks to the power of generative models, recent point-based image editing methods allow users…

Computer Vision and Pattern Recognition · Computer Science 2024-01-15 Minxing Luo, Wentao Cheng, Jian Yang

AdaptiveDrag: Semantic-Driven Dragging on Diffusion-Based Image Editing

Recently, several point-based image editing methods (e.g., DragDiffusion, FreeDrag, DragNoise) have emerged, yielding precise and high-quality results based on user instructions. However, these methods often make insufficient use of…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 DuoSheng Chen, Binghui Chen, Yifeng Geng, Liefeng Bo

RegionDrag: Fast Region-Based Image Editing with Diffusion Models

Point-drag-based image editing methods, like DragDiffusion, have attracted significant attention. However, point-drag-based approaches suffer from computational overhead and misinterpretation of user intentions due to the sparsity of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-26 Jingyi Lu, Xinghui Li, Kai Han

DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models

Despite the ability of existing large-scale text-to-image (T2I) models to generate high-quality images from detailed textual descriptions, they often lack the ability to precisely edit the generated or real images. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Chong Mou, Xintao Wang, Jiechong Song, Ying Shan, Jian Zhang

Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation

Point-based interactive editing serves as an essential tool to complement the controllability of existing generative models. A concurrent work, DragDiffusion, updates the diffusion latent map in response to user inputs, causing global…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Haofeng Liu, Chenshu Xu, Yifei Yang, Lihua Zeng, Shengfeng He

DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion Model

Drag-based editing within a pretrained diffusion model provides a precise and flexible way to manipulate foreground objects. Traditional methods optimize the input feature obtained from DDIM inversion directly, adjusting them iteratively to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Siwei Xia, Li Sun, Tiantian Sun, Qingli Li

DragText: Rethinking Text Embedding in Point-based Image Editing

Point-based image editing enables accurate and flexible control through content dragging. However, the role of text embedding during the editing process has not been thoroughly investigated. A significant aspect that remains unexplored is…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Gayoon Choi, Taejin Jeong, Sujung Hong, Seong Jae Hwang

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li, Qinxuan Huang, Yikang Ding, Zhiheng Li

AttentionDrag: Exploiting Latent Correlation Knowledge in Pre-trained Diffusion Models for Image Editing

Traditional point-based image editing methods rely on iterative latent optimization or geometric transformations, which are either inefficient in their processing or fail to capture the semantic relationships within the image. These methods…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Biao Yang, Muqi Huang, Yuhui Zhang, Yun Xiong, Kun Zhou, Xi Chen, Shiyang Zhou, Huishuai Bao, Chuan Li, Feng Shi, Hualei Liu

TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation

This paper explores image editing under the joint control of text and drag interactions. While recent advances in text-driven and drag-driven editing have achieved remarkable progress, they suffer from complementary limitations: text-driven…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Qihang Wang, Yaxiong Wang, Lechao Cheng, Zhun Zhong

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Image retouching aims to enhance the visual quality of photos. Considering the different aesthetic preferences of users, the target of retouching is subjective. However, current retouching methods mostly adopt deterministic models, which…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Zheng-Peng Duan, Jiawei Zhang, Zheng Lin, Xin Jin, Dongqing Zou, Chunle Guo, Chongyi Li

InstructUDrag: Joint Text Instructions and Object Dragging for Interactive Image Editing

Text-to-image diffusion models have shown great potential for image editing, with techniques such as text-based and object-dragging methods emerging as key approaches. However, each of these methods has inherent limitations: text-based…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Haoran Yu, Yi Shi

DragTex: Generative Point-Based Texture Editing on 3D Mesh

Creating 3D textured meshes using generative artificial intelligence has garnered significant attention recently. While existing methods support text-based generative texture generation or editing on 3D meshes, they often struggle to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Yudi Zhang, Qi Xu, Lei Zhang

Drag within Prior Distribution: Text-Conditioned Point-Based Image Editing within Distribution Constraints

Diffusion-based point editing methods have gained significant traction in image editing tasks due to their ability to manipulate image semantics and fine details by applying localized perturbations on the manifold of noise latent. However,…

Computer Vision and Pattern Recognition · Computer Science 2026-05-14 Haoyang Hu, Masataka Seo, Yen-Wei Chen

Diffusion-Based Attention Warping for Consistent 3D Scene Editing

We present a novel method for 3D scene editing using diffusion models, designed to ensure view consistency and realism across perspectives. Our approach leverages attention features extracted from a single reference image to define the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-12 Eyal Gomel, Lior Wolf

FastDrag: Manipulate Anything in One Step

Drag-based image editing using generative models provides precise control over image contents, enabling users to manipulate anything in an image with a few clicks. However, prevailing methods typically adopt $n$-step iterations for latent…

Computer Vision and Pattern Recognition · Computer Science 2024-10-30 Xuanjia Zhao, Jian Guan, Congyi Fan, Dongli Xu, Youtian Lin, Haiwei Pan, Pengming Feng

StyleDiffusion: Prompt-Embedding Inversion for Text-Based Editing

A significant research effort is focused on exploiting the amazing capacities of pretrained diffusion models for the editing of images. They either finetune the model, or invert the image in the latent space of the pretrained model. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Senmao Li, Joost van de Weijer, Taihang Hu, Fahad Shahbaz Khan, Qibin Hou, Yaxing Wang, Jian Yang, Ming-Ming Cheng

DiffusionAct: Controllable Diffusion Autoencoder for One-shot Face Reenactment

Video-driven neural face reenactment aims to synthesize realistic facial images that successfully preserve the identity and appearance of a source face, while transferring the target head pose and facial expressions. Existing GAN-based…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Stella Bounareli, Christos Tzelepis, Vasileios Argyriou, Ioannis Patras, Georgios Tzimiropoulos