Related papers: Zero-shot Text-driven Physically Interpretable Fac…

Zero-shot Face Editing via ID-Attribute Decoupled Inversion

Recent advancements in text-guided diffusion models have shown promise for general image editing via inversion techniques, but often struggle to maintain ID and structural consistency in real face editing tasks. To address this limitation,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Yang Hou , Minggu Wang , Jianjun Zhao

Prompt Tuning Inversion for Text-Driven Image Editing Using Diffusion Models

Recently large-scale language-image models (e.g., text-guided diffusion models) have considerably improved the image generation capabilities to generate photorealistic images in various domains. Based on this success, current image editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-09 Wenkai Dong , Song Xue , Xiaoyue Duan , Shumin Han

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have showed potential in manipulating images especially when combined with CLIP, these…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Dongxu Yue , Qin Guo , Munan Ning , Jiaxi Cui , Yuesheng Zhu , Li Yuan

Direct Inversion: Optimization-Free Text-Driven Real Image Editing with Diffusion Models

With the rise of large, publicly-available text-to-image diffusion models, text-guided real image editing has garnered much research attention recently. Existing methods tend to either rely on some form of per-instance or per-task…

Computer Vision and Pattern Recognition · Computer Science 2022-11-16 Adham Elarabawy , Harish Kamath , Samuel Denton

EditTransfer++: Toward Faithful and Efficient Visual-Prompt-Guided Image Editing

Visual-prompt-guided edit transfer aims to learn image transformations directly from example pairs, offering more precise and controllable editing than purely text-driven approaches. However, existing diffusion transformer-based methods…

Computer Vision and Pattern Recognition · Computer Science 2026-05-11 Lan Chen , Qi Mao , Yiren Song , Yuchao Gu , Siwei Ma

Semantic Facial Expression Editing using Autoencoded Flow

High-level manipulation of facial expressions in images --- such as changing a smile to a neutral expression --- is challenging because facial expression changes are highly non-linear, and vary depending on the appearance of the face. We…

Computer Vision and Pattern Recognition · Computer Science 2016-12-01 Raymond Yeh , Ziwei Liu , Dan B Goldman , Aseem Agarwala

User-friendly Image Editing with Minimal Text Input: Leveraging Captioning and Injection Techniques

Recent text-driven image editing in diffusion models has shown remarkable success. However, the existing methods assume that the user's description sufficiently grounds the contexts in the source image, such as objects, background, style,…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Sunwoo Kim , Wooseok Jang , Hyunsu Kim , Junho Kim , Yunjey Choi , Seungryong Kim , Gayeong Lee

Visual Instruction Inversion: Image Editing via Visual Prompting

Text-conditioned image editing has emerged as a powerful tool for editing images. However, in many situations, language can be ambiguous and ineffective in describing specific image edits. When faced with such challenges, visual prompts can…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Thao Nguyen , Yuheng Li , Utkarsh Ojha , Yong Jae Lee

Text2LIVE: Text-Driven Layered Image and Video Editing

We present a method for zero-shot, text-driven appearance manipulation in natural images and videos. Given an input image or video and a target text prompt, our goal is to edit the appearance of existing objects (e.g., object's texture) or…

Computer Vision and Pattern Recognition · Computer Science 2022-05-26 Omer Bar-Tal , Dolev Ofri-Amar , Rafail Fridman , Yoni Kasten , Tali Dekel

Null-text Inversion for Editing Real Images using Guided Diffusion Models

Recent text-guided diffusion models provide powerful image generation capabilities. Currently, a massive effort is given to enable the modification of these images using text only as means to offer intuitive and versatile editing. To edit a…

Computer Vision and Pattern Recognition · Computer Science 2022-11-18 Ron Mokady , Amir Hertz , Kfir Aberman , Yael Pritch , Daniel Cohen-Or

LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusion Models

Research in vision-language models has seen rapid developments off-late, enabling natural language-based interfaces for image generation and manipulation. Many existing text guided manipulation techniques are restricted to specific classes…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Paramanand Chandramouli , Kanchana Vaishnavi Gandikota

FaceShop: Deep Sketch-based Face Image Editing

We present a novel system for sketch-based face image editing, enabling users to edit images intuitively by sketching a few strokes on a region of interest. Our interface features tools to express a desired image manipulation by providing…

Computer Vision and Pattern Recognition · Computer Science 2018-06-08 Tiziano Portenier , Qiyang Hu , Attila Szabó , Siavash Arjomand Bigdeli , Paolo Favaro , Matthias Zwicker

Image-to-Image Translation with Disentangled Latent Vectors for Face Editing

We propose an image-to-image translation framework for facial attribute editing with disentangled interpretable latent directions. Facial attribute editing task faces the challenges of targeted attribute editing with controllable strength…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Yusuf Dalva , Hamza Pehlivan , Cansu Moran , Öykü Irmak Hatipoğlu , Ayşegül Dündar

Towards Real-time Text-driven Image Manipulation with Unconditional Diffusion Models

Recent advances in diffusion models enable many powerful instruments for image editing. One of these instruments is text-driven image manipulations: editing semantic attributes of an image according to the provided text description. %…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Nikita Starodubcev , Dmitry Baranchuk , Valentin Khrulkov , Artem Babenko

DiffusionCLIP: Text-Guided Diffusion Models for Robust Image Manipulation

Recently, GAN inversion methods combined with Contrastive Language-Image Pretraining (CLIP) enables zero-shot image manipulation guided by text prompts. However, their applications to diverse real images are still difficult due to the…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Gwanghyun Kim , Taesung Kwon , Jong Chul Ye

Face Aging via Diffusion-based Editing

In this paper, we address the problem of face aging: generating past or future facial images by incorporating age-related changes to the given face. Previous aging methods rely solely on human facial image datasets and are thus constrained…

Computer Vision and Pattern Recognition · Computer Science 2023-09-21 Xiangyi Chen , Stéphane Lathuilière

Region-Aware Diffusion for Zero-shot Text-driven Image Editing

Image manipulation under the guidance of textual descriptions has recently received a broad range of attention. In this study, we focus on the regional editing of images with the guidance of given text prompts. Different from current…

Computer Vision and Pattern Recognition · Computer Science 2023-02-24 Nisha Huang , Fan Tang , Weiming Dong , Tong-Yee Lee , Changsheng Xu

Optimal Transport for Rectified Flow Image Editing: Unifying Inversion-Based and Direct Methods

Image editing in rectified flow models remains challenging due to the fundamental trade-off between reconstruction fidelity and editing flexibility. While inversion-based methods suffer from trajectory deviation, recent inversion-free…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Marian Lupascu , Mihai-Sorin Stupariu

Training-Free Text-Guided Image Editing with Visual Autoregressive Model

Text-guided image editing is an essential task that enables users to modify images through natural language descriptions. Recent advances in diffusion models and rectified flows have significantly improved editing quality, primarily relying…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Yufei Wang , Lanqing Guo , Zhihao Li , Jiaxing Huang , Pichao Wang , Bihan Wen , Jian Wang

InFusion: Inject and Attention Fusion for Multi Concept Zero-Shot Text-based Video Editing

Large text-to-image diffusion models have achieved remarkable success in generating diverse, high-quality images. Additionally, these models have been successfully leveraged to edit input images by just changing the text prompt. But when…

Computer Vision and Pattern Recognition · Computer Science 2023-08-11 Anant Khandelwal