English
Related papers

Related papers: T-LoRA: Single Image Diffusion Model Customization…

200 papers

Personalized text-to-image generation has gained significant attention for its capability to generate high-fidelity portraits of specific identities conditioned on user-defined prompts. Existing methods typically involve test-time…

Computer Vision and Pattern Recognition · Computer Science 2024-11-18 Yujia Wu , Yiming Shi , Jiwei Wei , Chengwei Sun , Yang Yang , Heng Tao Shen

Recent works demonstrate a remarkable ability to customize text-to-image diffusion models while only providing a few example images. What happens if you try to customize such models using multiple, fine-grained concepts in a sequential…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 James Seale Smith , Yen-Chang Hsu , Lingyu Zhang , Ting Hua , Zsolt Kira , Yilin Shen , Hongxia Jin

We introduce ProLoRA, enabling zero-shot adaptation of parameter-efficient fine-tuning in text-to-image diffusion models. ProLoRA transfers pre-trained low-rank adjustments (e.g., LoRA) from a source to a target model without additional…

Artificial Intelligence · Computer Science 2025-06-06 Farzad Farhadzadeh , Debasmit Das , Shubhankar Borse , Fatih Porikli

Low-rank adaptation (LoRA) is a fine-tuning technique that can be applied to conditional generative diffusion models. LoRA utilizes a small number of context examples to adapt the model to a specific domain, character, style, or concept.…

Computer Vision and Pattern Recognition · Computer Science 2024-10-08 Artur Kasymov , Marcin Sendera , Michał Stypułkowski , Maciej Zięba , Przemysław Spurek

Personalized image generation requires effectively balancing content fidelity with stylistic consistency when synthesizing images based on text and reference examples. Low-Rank Adaptation (LoRA) offers an efficient personalization approach,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-04 Yu Li , Yujun Cai , Chi Zhang

Diffusion-based image super-resolution (SR) methods have achieved remarkable success by leveraging large pre-trained text-to-image diffusion models as priors. However, these methods still face two challenges: the requirement for dozens of…

Computer Vision and Pattern Recognition · Computer Science 2024-09-26 Aiping Zhang , Zongsheng Yue , Renjing Pei , Wenqi Ren , Xiaochun Cao

Drag-based editing within pretrained diffusion model provides a precise and flexible way to manipulate foreground objects. Traditional methods optimize the input feature obtained from DDIM inversion directly, adjusting them iteratively to…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Siwei Xia , Li Sun , Tiantian Sun , Qingli Li

The objective of personalization and stylization in text-to-image is to instruct a pre-trained diffusion model to analyze new concepts introduced by users and incorporate them into expected styles. Recently, parameter-efficient fine-tuning…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Likun Li , Haoqi Zeng , Changpeng Yang , Haozhe Jia , Di Xu

In recent years, image editing has garnered growing attention. However, general image editing models often fail to produce satisfactory results when confronted with new styles. The challenge lies in how to effectively fine-tune general…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Cong Cao , Huanjing Yue , Yujie Xu , Xiaodong Xu

In the realm of subject-driven text-to-image (T2I) generative models, recent developments like DreamBooth and BLIP-Diffusion have led to impressive results yet encounter limitations due to their intensive fine-tuning demands and substantial…

Computer Vision and Pattern Recognition · Computer Science 2024-02-29 Shyam Marjit , Harshit Singh , Nityanand Mathur , Sayak Paul , Chia-Mu Yu , Pin-Yu Chen

Recent personalization methods for diffusion models, such as Dreambooth and LoRA, allow fine-tuning pre-trained models to generate new concepts. However, applying these techniques across consecutive tasks in order to include, e.g., new…

Machine Learning · Computer Science 2025-03-14 Łukasz Staniszewski , Katarzyna Zaleska , Kamil Deja

Low-rank Adaptation (LoRA) models have revolutionized the personalization of pre-trained diffusion models by enabling fine-tuning through low-rank, factorized weight matrices specifically optimized for attention layers. These models…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Mert Sonmezer , Matthew Zheng , Pinar Yanardag

Fine-tuning large diffusion models for custom applications demands substantial power and time, which poses significant challenges for efficient implementation on mobile devices. In this paper, we develop a novel training accelerator…

Graphics · Computer Science 2025-04-14 Jinming Lu , Minghao She , Wendong Mao , Zhongfeng Wang

The persistent challenge of medical image synthesis posed by the scarcity of annotated data and the need to synthesize `missing modalities' for multi-modal analysis, underscored the imperative development of effective synthesis methods.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-15 Yuchen Mao , Hongwei Li , Wei Pang , Giorgos Papanastasiou , Guang Yang , Chengjia Wang

Low-Rank Adaptation (LoRA) and other parameter-efficient fine-tuning (PEFT) methods provide low-memory, storage-efficient solutions for personalizing text-to-image models. However, these methods offer little to no improvement in wall-clock…

Machine Learning · Computer Science 2024-12-04 Ethan Smith , Rami Seid , Alberto Hojel , Paramita Mishra , Jianbo Wu

While Low-Rank Adaptation (LoRA) has proven beneficial for efficiently fine-tuning large models, LoRA fine-tuned text-to-image diffusion models lack diversity in the generated images, as the model tends to copy data from the observed…

With the advance of text-to-image (T2I) diffusion models (e.g., Stable Diffusion) and corresponding personalization techniques such as DreamBooth and LoRA, everyone can manifest their imagination into high-quality images at an affordable…

Computer Vision and Pattern Recognition · Computer Science 2024-02-09 Yuwei Guo , Ceyuan Yang , Anyi Rao , Zhengyang Liang , Yaohui Wang , Yu Qiao , Maneesh Agrawala , Dahua Lin , Bo Dai

As deep learning technology continues to advance, image generation models, especially models like Stable Diffusion, are finding increasingly widespread application in visual arts creation. However, these models often face challenges such as…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Chengcheng Feng , Mu He , Qiuyu Tian , Haojie Yin , Xiaofang Zhao , Hongwei Tang , Xingqiang Wei

Diffusion models have revolutionized text-to-image (T2I) synthesis, producing high-quality, photorealistic images. However, they still struggle to properly render the spatial relationships described in text prompts. To address the lack of…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Andrea Rigo , Luca Stornaiuolo , Mauro Martino , Bruno Lepri , Nicu Sebe

Public large-scale text-to-image diffusion models, such as Stable Diffusion, have gained significant attention from the community. These models can be easily customized for new concepts using low-rank adaptations (LoRAs). However, the…

Computer Vision and Pattern Recognition · Computer Science 2023-11-13 Yuchao Gu , Xintao Wang , Jay Zhangjie Wu , Yujun Shi , Yunpeng Chen , Zihan Fan , Wuyou Xiao , Rui Zhao , Shuning Chang , Weijia Wu , Yixiao Ge , Ying Shan , Mike Zheng Shou
‹ Prev 1 2 3 10 Next ›