Related papers: Exploring Attribute Variations in Style-based GANs…

Uncovering the Disentanglement Capability in Text-to-Image Diffusion Models

Generative models have been widely studied in computer vision. Recently, diffusion models have drawn substantial attention due to the high quality of their generated images. A key desired property of image generative models is the ability…

Computer Vision and Pattern Recognition · Computer Science 2022-12-20 Qiucheng Wu , Yujian Liu , Handong Zhao , Ajinkya Kale , Trung Bui , Tong Yu , Zhe Lin , Yang Zhang , Shiyu Chang

AttDiff-GAN: A Hybrid Diffusion-GAN Framework for Facial Attribute Editing

Facial attribute editing aims to modify target attributes while preserving attribute-irrelevant content and overall image fidelity. Existing GAN-based methods provide favorable controllability, but often suffer from weak alignment between…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Wenmin Huang , Weiqi Luo , Xiaochun Cao , Jiwu Huang

HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion

Hair editing is a critical image synthesis task that aims to edit hair color and hairstyle using text descriptions or reference images, while preserving irrelevant attributes (e.g., identity, background, cloth). Many existing methods are…

Computer Vision and Pattern Recognition · Computer Science 2024-11-14 Yu Zeng , Yang Zhang , Jiachen Liu , Linlin Shen , Kaijun Deng , Weizhao He , Jinbao Wang

LatRef-Diff: Latent and Reference-Guided Diffusion for Facial Attribute Editing and Style Manipulation

Facial attribute editing and style manipulation are crucial for applications like virtual avatars and photo editing. However, achieving precise control over facial attributes without altering unrelated features is challenging due to the…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Wenmin Huang , Weiqi Luo , Xiaochun Cao , Jiwu Huang

Stylistic Attribute Control in Latent Diffusion Models

Text-to-image diffusion models have revolutionized image synthesis and editing, but precise control over stylistic attributes remains a challenge, often causing unintended content modifications. We propose an approach for fine-grained…

Computer Vision and Pattern Recognition · Computer Science 2026-05-05 Max Reimann , Benito Buchheim , Jürgen Döllner

Analyzing Bias in Diffusion-based Face Generation Models

Diffusion models are becoming increasingly popular in synthetic data generation and image editing applications. However, these models can amplify existing biases and propagate them to downstream applications. Therefore, it is crucial to…

Computer Vision and Pattern Recognition · Computer Science 2023-05-12 Malsha V. Perera , Vishal M. Patel

Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

Fashion attribute editing is a task that aims to convert the semantic attributes of a given fashion image while preserving the irrelevant regions. Previous works typically employ conditional GANs where the generator explicitly learns the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-13 Chaerin Kong , DongHyeon Jeon , Ohjoon Kwon , Nojun Kwak

Revealing Directions for Text-guided 3D Face Editing

3D face editing is a significant task in multimedia, aimed at the manipulation of 3D face models across various control signals. The success of 3D-aware GAN provides expressive 3D models learned from 2D single-view images only, encouraging…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Zhuo Chen , Yichao Yan , Sehngqi Liu , Yuhao Cheng , Weiming Zhao , Lincheng Li , Mengxiao Bi , Xiaokang Yang

Supervised makeup transfer with a curated dataset: Decoupling identity and makeup features for enhanced transformation

Diffusion models have recently shown strong progress in generative tasks, offering a more stable alternative to GAN-based approaches for makeup transfer. Existing methods often suffer from limited datasets, poor disentanglement between…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Qihe Pan , Yiming Wu , Xing Zhao , Liang Xie , Guodao Sun , Ronghua Liang

GANTASTIC: GAN-based Transfer of Interpretable Directions for Disentangled Image Editing in Text-to-Image Diffusion Models

The rapid advancement in image generation models has predominantly been driven by diffusion models, which have demonstrated unparalleled success in generating high-fidelity, diverse images from textual prompts. Despite their success,…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Yusuf Dalva , Hidir Yesiltepe , Pinar Yanardag

Rethinking the editing of generative adversarial networks: a method to estimate editing vectors based on dimension reduction

While Generative Adversarial Networks (GANs) have recently found applications in image editing, most previous GAN-based image editing methods require largescale datasets with semantic segmentation annotations for training, only provide high…

Computer Vision and Pattern Recognition · Computer Science 2023-05-17 Yuhan Cao , Haoran Jiang , Zhenghong Yu , Qi Li , Xuyang Li

Multi-Directional Subspace Editing in Style-Space

This paper describes a new technique for finding disentangled semantic directions in the latent space of StyleGAN. Our method identifies meaningful orthogonal subspaces that allow editing of one human face attribute, while minimizing…

Computer Vision and Pattern Recognition · Computer Science 2024-07-12 Chen Naveh , Yacov Hel-Or

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

The recent advancements in image-text diffusion models have stimulated research interest in large-scale 3D generative models. Nevertheless, the limited availability of diverse 3D resources presents significant challenges to learning. In…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Chi Zhang , Yiwen Chen , Yijun Fu , Zhenglin Zhou , Gang YU , Billzb Wang , Bin Fu , Tao Chen , Guosheng Lin , Chunhua Shen

SMILE: Semantically-guided Multi-attribute Image and Layout Editing

Attribute image manipulation has been a very active topic since the introduction of Generative Adversarial Networks (GANs). Exploring the disentangled attribute space within a transformation is a very challenging task due to the multiple…

Computer Vision and Pattern Recognition · Computer Science 2020-10-07 Andrés Romero , Luc Van Gool , Radu Timofte

DiffuseGAE: Controllable and High-fidelity Image Manipulation from Disentangled Representation

Diffusion probabilistic models (DPMs) have shown remarkable results on various image synthesis tasks such as text-to-image generation and image inpainting. However, compared to other generative methods like VAEs and GANs, DPMs lack a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Yipeng Leng , Qiangjuan Huang , Zhiyuan Wang , Yangyang Liu , Haoyu Zhang

Unsupervised Discovery of Semantic Latent Directions in Diffusion Models

Despite the success of diffusion models (DMs), we still lack a thorough understanding of their latent space. While image editing with GANs builds upon latent space, DMs rely on editing the conditions such as text prompts. We present an…

Computer Vision and Pattern Recognition · Computer Science 2023-02-27 Yong-Hyun Park , Mingi Kwon , Junghyo Jo , Youngjung Uh

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Blended Latent Diffusion

The tremendous progress in neural image generation, coupled with the emergence of seemingly omnipotent vision-language models has finally enabled text-based interfaces for creating and editing images. Handling generic images requires a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Omri Avrahami , Ohad Fried , Dani Lischinski

Face Animation with an Attribute-Guided Diffusion Model

Face animation has achieved much progress in computer vision. However, prevailing GAN-based methods suffer from unnatural distortions and artifacts due to sophisticated motion deformation. In this paper, we propose a Face Animation…

Computer Vision and Pattern Recognition · Computer Science 2023-04-07 Bohan Zeng , Xuhui Liu , Sicheng Gao , Boyu Liu , Hong Li , Jianzhuang Liu , Baochang Zhang

ChatFace: Chat-Guided Real Face Editing via Diffusion Latent Space Manipulation

Editing real facial images is a crucial task in computer vision with significant demand in various real-world applications. While GAN-based methods have showed potential in manipulating images especially when combined with CLIP, these…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Dongxu Yue , Qin Guo , Munan Ning , Jiaxi Cui , Yuesheng Zhu , Li Yuan