Related papers: DreamSteerer: Enhancing Source Image Conditioned E…

DiffEditor: Boosting Accuracy and Flexibility on Diffusion-based Image Editing

Large-scale Text-to-Image (T2I) diffusion models have revolutionized image generation over the last few years. Although owning diverse and high-quality generation capabilities, translating these abilities to fine-grained image editing…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. Recent research has extended these models to support text-guided image editing. While text guidance is an intuitive editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Jooyoung Choi , Yunjey Choi , Yunji Kim , Junho Kim , Sungroh Yoon

DreamEdit3D: Personalization of Multi-View Diffusion Models for 3D Editing

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai , Matthias Nießner , Ziya Erkoç

DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization

The objective of text-to-image (T2I) personalization is to customize a diffusion model to a user-provided reference concept, generating diverse images of the concept aligned with the target prompts. Conventional methods representing the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Jisu Nam , Heesu Kim , DongJae Lee , Siyoon Jin , Seungryong Kim , Seunggyu Chang

DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models

Recent progresses in large-scale text-to-image models have yielded remarkable accomplishments, finding various applications in art domain. However, expressing unique characteristics of an artwork (e.g. brushwork, colortone, or composition)…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Namhyuk Ahn , Junsoo Lee , Chunggi Lee , Kunhee Kim , Daesik Kim , Seung-Hun Nam , Kibeom Hong

EditCrafter: Tuning-free High-Resolution Image Editing via Pretrained Diffusion Model

We propose EditCrafter, a high-resolution image editing method that operates without tuning, leveraging pretrained text-to-image (T2I) diffusion models to process images at resolutions significantly exceeding those used during training.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Kunho Kim , Sumin Seo , Yongjun Cho , Hyungjin Chung

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images). While remarkable results have been achieved using diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Fanyue Wei , Wei Zeng , Zhenyang Li , Dawei Yin , Lixin Duan , Wen Li

Energy-Guided Optimization for Personalized Image Editing with Pretrained Text-to-Image Diffusion Models

The rapid advancement of pretrained text-driven diffusion models has significantly enriched applications in image generation and editing. However, as the demand for personalized content editing increases, new challenges emerge especially…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Rui Jiang , Xinghe Fu , Guangcong Zheng , Teng Li , Taiping Yao , Xi Li

DragonDiffusion: Enabling Drag-style Manipulation on Diffusion Models

Despite the ability of existing large-scale text-to-image (T2I) models to generate high-quality images from detailed textual descriptions, they often lack the ability to precisely edit the generated or real images. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

EditShield: Protecting Unauthorized Image Editing by Instruction-guided Diffusion Models

Text-to-image diffusion models have emerged as an evolutionary for producing creative content in image synthesis. Based on the impressive generation abilities of these models, instruction-guided diffusion models can edit images with simple…

Cryptography and Security · Computer Science 2024-08-21 Ruoxi Chen , Haibo Jin , Yixin Liu , Jinyin Chen , Haohan Wang , Lichao Sun

An Improved Method for Personalizing Diffusion Models

Diffusion models have demonstrated impressive image generation capabilities. Personalized approaches, such as textual inversion and Dreambooth, enhance model individualization using specific images. These methods enable generating images of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Yan Zeng , Masanori Suganuma , Takayuki Okatani

Personalized Image Editing in Text-to-Image Diffusion Models via Collaborative Direct Preference Optimization

Text-to-image (T2I) diffusion models have made remarkable strides in generating and editing high-fidelity images from text. Yet, these models remain fundamentally generic, failing to adapt to the nuanced aesthetic preferences of individual…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Connor Dunlop , Matthew Zheng , Kavana Venkatesh , Pinar Yanardag

A Survey of Multimodal-Guided Image Editing with Text-to-Image Diffusion Models

Image editing aims to edit the given synthetic or real image to meet the specific requirements from users. It is widely studied in recent years as a promising and challenging field of Artificial Intelligence Generative Content (AIGC).…

Computer Vision and Pattern Recognition · Computer Science 2024-06-21 Xincheng Shuai , Henghui Ding , Xingjun Ma , Rongcheng Tu , Yu-Gang Jiang , Dacheng Tao

Direct Consistency Optimization for Robust Customization of Text-to-Image Diffusion Models

Text-to-image (T2I) diffusion models, when fine-tuned on a few personal images, can generate visuals with a high degree of consistency. However, such fine-tuned models are not robust; they often fail to compose with concepts of pretrained…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Kyungmin Lee , Sangkyung Kwak , Kihyuk Sohn , Jinwoo Shin

DreamReader: An Interpretability Toolkit for Text-to-Image Models

Despite the rapid adoption of text-to-image (T2I) diffusion models, causal and representation-level analysis remains fragmented and largely limited to isolated probing techniques. To address this gap, we introduce DreamReader: a unified…

Machine Learning · Computer Science 2026-03-17 Nirmalendu Prakash , Narmeen Oozeer , Michael Lan , Luka Samkharadze , Phillip Howard , Roy Ka-Wei Lee , Dhruv Nathawani , Shivam Raval , Amirali Abdullah

Diffusion Self-Distillation for Zero-Shot Customized Image Generation

Text-to-image diffusion models produce impressive results but are frustrating tools for artists who desire fine-grained control. For example, a common use case is to create images of a specific instance in novel contexts, i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Shengqu Cai , Eric Chan , Yunzhi Zhang , Leonidas Guibas , Jiajun Wu , Gordon Wetzstein

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Nataniel Ruiz , Yuanzhen Li , Varun Jampani , Yael Pritch , Michael Rubinstein , Kfir Aberman

TexSliders: Diffusion-Based Texture Editing in CLIP Space

Generative models have enabled intuitive image creation and manipulation using natural language. In particular, diffusion models have recently shown remarkable results for natural image editing. In this work, we propose to apply diffusion…

Graphics · Computer Science 2024-05-02 Julia Guerrero-Viu , Milos Hasan , Arthur Roullier , Midhun Harikumar , Yiwei Hu , Paul Guerrero , Diego Gutierrez , Belen Masia , Valentin Deschaintre

S$^2$Edit: Text-Guided Image Editing with Precise Semantic and Spatial Control

Recent advances in diffusion models have enabled high-quality generation and manipulation of images guided by texts, as well as concept learning from images. However, naive applications of existing methods to editing tasks that require…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Xudong Liu , Zikun Chen , Ruowei Jiang , Ziyi Wu , Kejia Yin , Han Zhao , Parham Aarabi , Igor Gilitschenski

Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion

Recently, text-to-image (T2I) editing has been greatly pushed forward by applying diffusion models. Despite the visual promise of the generated images, inconsistencies with the expected textual prompt remain prevalent. This paper aims to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Aoxue Li , Mingyang Yi , Zhenguo Li