Related papers: An Improved Method for Personalizing Diffusion Mod…

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images). While remarkable results have been achieved using diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Fanyue Wei , Wei Zeng , Zhenyang Li , Dawei Yin , Lixin Duan , Wen Li

Zero-Shot Personalization of Objects via Textual Inversion

Recent advances in text-to-image diffusion models have substantially improved the quality of image customization, enabling the synthesis of highly realistic images. Despite this progress, achieving fast and efficient personalization remains…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Aniket Roy , Maitreya Suin , Rama Chellappa

Text-image guided Diffusion Model for generating Deepfake celebrity interactions

Deepfake images are fast becoming a serious concern due to their realism. Diffusion models have recently demonstrated highly realistic visual content generation, which makes them an excellent potential tool for Deepfake generation. To curb…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Yunzhuo Chen , Nur Al Hasan Haldar , Naveed Akhtar , Ajmal Mian

Multi-Concept Customization of Text-to-Image Diffusion

While generative models produce high-quality images of concepts learned from a large-scale database, a user often wishes to synthesize instantiations of their own concepts (for example, their family, pets, or items). Can we teach a model to…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Nupur Kumari , Bingliang Zhang , Richard Zhang , Eli Shechtman , Jun-Yan Zhu

DreamBooth: Fine Tuning Text-to-Image Diffusion Models for Subject-Driven Generation

Large text-to-image models achieved a remarkable leap in the evolution of AI, enabling high-quality and diverse synthesis of images from a given text prompt. However, these models lack the ability to mimic the appearance of subjects in a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Nataniel Ruiz , Yuanzhen Li , Varun Jampani , Yael Pritch , Michael Rubinstein , Kfir Aberman

Personalized Text-to-Image Generation with Auto-Regressive Models

Personalized image synthesis has emerged as a pivotal application in text-to-image generation, enabling the creation of images featuring specific subjects in diverse contexts. While diffusion models have dominated this domain,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-18 Kaiyue Sun , Xian Liu , Yao Teng , Xihui Liu

DreamBooth3D: Subject-Driven Text-to-3D Generation

We present DreamBooth3D, an approach to personalize text-to-3D generative models from as few as 3-6 casually captured images of a subject. Our approach combines recent advances in personalizing text-to-image models (DreamBooth) with…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Amit Raj , Srinivas Kaza , Ben Poole , Michael Niemeyer , Nataniel Ruiz , Ben Mildenhall , Shiran Zada , Kfir Aberman , Michael Rubinstein , Jonathan Barron , Yuanzhen Li , Varun Jampani

Generate Anything Anywhere in Any Scene

Text-to-image diffusion models have attracted considerable interest due to their wide applicability across diverse fields. However, challenges persist in creating controllable models for personalized object generation. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2023-06-30 Yuheng Li , Haotian Liu , Yangming Wen , Yong Jae Lee

InstructBooth: Instruction-following Personalized Text-to-Image Generation

Personalizing text-to-image models using a limited set of images for a specific object has been explored in subject-specific image generation. However, existing methods often face challenges in aligning with text prompts due to overfitting…

Computer Vision and Pattern Recognition · Computer Science 2024-02-16 Daewon Chae , Nokyung Park , Jinkyu Kim , Kimin Lee

TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation

Despite significant advancements in customizing text-to-image and video generation models, generating images and videos that effectively integrate multiple personalized concepts remains a challenging task. To address this, we present…

Computer Vision and Pattern Recognition · Computer Science 2025-03-05 Gihyun Kwon , Jong Chul Ye

DreamEdit3D: Personalization of Multi-View Diffusion Models for 3D Editing

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai , Matthias Nießner , Ziya Erkoç

Customizing Text-to-Image Diffusion with Object Viewpoint Control

Model customization introduces new concepts to existing text-to-image models, enabling the generation of these new concepts/objects in novel contexts. However, such methods lack accurate camera view control with respect to the new object,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-04 Nupur Kumari , Grace Su , Richard Zhang , Taesung Park , Eli Shechtman , Jun-Yan Zhu

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. Recent research has extended these models to support text-guided image editing. While text guidance is an intuitive editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Jooyoung Choi , Yunjey Choi , Yunji Kim , Junho Kim , Sungroh Yoon

Fast Personalized Text-to-Image Syntheses With Attention Injection

Currently, personalized image generation methods mostly require considerable time to finetune and often overfit the concept resulting in generated images that are similar to custom concepts but difficult to edit by prompts. We propose an…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Yuxuan Zhang , Yiren Song , Jinpeng Yu , Han Pan , Zhongliang Jing

Personalized Interiors at Scale: Leveraging AI for Efficient and Customizable Design Solutions

In this paper, we introduce an innovative application of artificial intelligence in the realm of interior design through the integration of Stable Diffusion and Dreambooth models. This paper explores the potential of these advanced…

Human-Computer Interaction · Computer Science 2024-05-30 Kaiwen Zhou , Tianyu Wang

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion

Diffusion models have shown superior performance in image generation and manipulation, but the inherent stochasticity presents challenges in preserving and manipulating image content and identity. While previous approaches like DreamBooth…

Computer Vision and Pattern Recognition · Computer Science 2023-04-20 Inhwa Han , Serin Yang , Taesung Kwon , Jong Chul Ye

DreamVideo: Composing Your Dream Videos with Customized Subject and Motion

Customized generation using diffusion models has made impressive progress in image generation, but remains unsatisfactory in the challenging video generation task, as it requires the controllability of both subjects and motions. To that…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Yujie Wei , Shiwei Zhang , Zhiwu Qing , Hangjie Yuan , Zhiheng Liu , Yu Liu , Yingya Zhang , Jingren Zhou , Hongming Shan

PortraitBooth: A Versatile Portrait Model for Fast Identity-preserved Personalization

Recent advancements in personalized image generation using diffusion models have been noteworthy. However, existing methods suffer from inefficiencies due to the requirement for subject-specific fine-tuning. This computationally intensive…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Xu Peng , Junwei Zhu , Boyuan Jiang , Ying Tai , Donghao Luo , Jiangning Zhang , Wei Lin , Taisong Jin , Chengjie Wang , Rongrong Ji

Object-Driven One-Shot Fine-tuning of Text-to-Image Diffusion with Prototypical Embedding

As large-scale text-to-image generation models have made remarkable progress in the field of text-to-image generation, many fine-tuning methods have been proposed. However, these models often struggle with novel objects, especially with…

Computer Vision and Pattern Recognition · Computer Science 2024-01-30 Jianxiang Lu , Cong Xie , Hui Guo

DreamSteerer: Enhancing Source Image Conditioned Editability using Personalized Diffusion Models

Recent text-to-image personalization methods have shown great promise in teaching a diffusion model user-specified concepts given a few images for reusing the acquired concepts in a novel context. With massive efforts being dedicated to…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 Zhengyang Yu , Zhaoyuan Yang , Jing Zhang