English
Related papers

Related papers: DreamSteerer: Enhancing Source Image Conditioned E…

200 papers

Large-scale Text-to-Image (T2I) diffusion models have revolutionized image generation over the last few years. Although owning diverse and high-quality generation capabilities, translating these abilities to fine-grained image editing…

Computer Vision and Pattern Recognition · Computer Science 2024-02-06 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. Recent research has extended these models to support text-guided image editing. While text guidance is an intuitive editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Jooyoung Choi , Yunjey Choi , Yunji Kim , Junho Kim , Sungroh Yoon

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai , Matthias Nießner , Ziya Erkoç

The objective of text-to-image (T2I) personalization is to customize a diffusion model to a user-provided reference concept, generating diverse images of the concept aligned with the target prompts. Conventional methods representing the…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Jisu Nam , Heesu Kim , DongJae Lee , Siyoon Jin , Seungryong Kim , Seunggyu Chang

Recent progresses in large-scale text-to-image models have yielded remarkable accomplishments, finding various applications in art domain. However, expressing unique characteristics of an artwork (e.g. brushwork, colortone, or composition)…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Namhyuk Ahn , Junsoo Lee , Chunggi Lee , Kunhee Kim , Daesik Kim , Seung-Hun Nam , Kibeom Hong

We propose EditCrafter, a high-resolution image editing method that operates without tuning, leveraging pretrained text-to-image (T2I) diffusion models to process images at resolutions significantly exceeding those used during training.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Kunho Kim , Sumin Seo , Yongjun Cho , Hyungjin Chung

Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images). While remarkable results have been achieved using diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Fanyue Wei , Wei Zeng , Zhenyang Li , Dawei Yin , Lixin Duan , Wen Li

The rapid advancement of pretrained text-driven diffusion models has significantly enriched applications in image generation and editing. However, as the demand for personalized content editing increases, new challenges emerge especially…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Rui Jiang , Xinghe Fu , Guangcong Zheng , Teng Li , Taiping Yao , Xi Li

Despite the ability of existing large-scale text-to-image (T2I) models to generate high-quality images from detailed textual descriptions, they often lack the ability to precisely edit the generated or real images. In this paper, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-11-21 Chong Mou , Xintao Wang , Jiechong Song , Ying Shan , Jian Zhang

Text-to-image diffusion models have emerged as an evolutionary for producing creative content in image synthesis. Based on the impressive generation abilities of these models, instruction-guided diffusion models can edit images with simple…

Cryptography and Security · Computer Science 2024-08-21 Ruoxi Chen , Haibo Jin , Yixin Liu , Jinyin Chen , Haohan Wang , Lichao Sun

Diffusion models have demonstrated impressive image generation capabilities. Personalized approaches, such as textual inversion and Dreambooth, enhance model individualization using specific images. These methods enable generating images of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Yan Zeng , Masanori Suganuma , Takayuki Okatani

Text-to-image (T2I) diffusion models have made remarkable strides in generating and editing high-fidelity images from text. Yet, these models remain fundamentally generic, failing to adapt to the nuanced aesthetic preferences of individual…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Connor Dunlop , Matthew Zheng , Kavana Venkatesh , Pinar Yanardag

Image editing aims to edit the given synthetic or real image to meet the specific requirements from users. It is widely studied in recent years as a promising and challenging field of Artificial Intelligence Generative Content (AIGC).…

Computer Vision and Pattern Recognition · Computer Science 2024-06-21 Xincheng Shuai , Henghui Ding , Xingjun Ma , Rongcheng Tu , Yu-Gang Jiang , Dacheng Tao

Text-to-image (T2I) diffusion models, when fine-tuned on a few personal images, can generate visuals with a high degree of consistency. However, such fine-tuned models are not robust; they often fail to compose with concepts of pretrained…

Computer Vision and Pattern Recognition · Computer Science 2024-12-13 Kyungmin Lee , Sangkyung Kwak , Kihyuk Sohn , Jinwoo Shin

Despite the rapid adoption of text-to-image (T2I) diffusion models, causal and representation-level analysis remains fragmented and largely limited to isolated probing techniques. To address this gap, we introduce DreamReader: a unified…

Text-to-image diffusion models produce impressive results but are frustrating tools for artists who desire fine-grained control. For example, a common use case is to create images of a specific instance in novel contexts, i.e.,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Shengqu Cai , Eric Chan , Yunzhi Zhang , Leonidas Guibas , Jiajun Wu , Gordon Wetzstein

Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Nataniel Ruiz , Yuanzhen Li , Varun Jampani , Yael Pritch , Michael Rubinstein , Kfir Aberman

Generative models have enabled intuitive image creation and manipulation using natural language. In particular, diffusion models have recently shown remarkable results for natural image editing. In this work, we propose to apply diffusion…

Recent advances in diffusion models have enabled high-quality generation and manipulation of images guided by texts, as well as concept learning from images. However, naive applications of existing methods to editing tasks that require…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Xudong Liu , Zikun Chen , Ruowei Jiang , Ziyi Wu , Kejia Yin , Han Zhao , Parham Aarabi , Igor Gilitschenski

Recently, text-to-image (T2I) editing has been greatly pushed forward by applying diffusion models. Despite the visual promise of the generated images, inconsistencies with the expected textual prompt remain prevalent. This paper aims to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Aoxue Li , Mingyang Yi , Zhenguo Li
‹ Prev 1 2 3 10 Next ›