Related papers: Identity Encoder for Personalized Diffusion

A Method for Training-free Person Image Picture Generation

The current state-of-the-art Diffusion model has demonstrated excellent results in generating images. However, the images are monotonous and are mostly the result of the distribution of images of people in the training set, making it…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Tianyu Chen

Encoder-based Domain Tuning for Fast Personalization of Text-to-Image Models

Text-to-image personalization aims to teach a pre-trained diffusion model to reason about novel, user provided concepts, embedding them into new scenes guided by natural language prompts. However, current personalization approaches struggle…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Rinon Gal , Moab Arar , Yuval Atzmon , Amit H. Bermano , Gal Chechik , Daniel Cohen-Or

Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models

This paper proposes a method for generating images of customized objects specified by users. The method is based on a general framework that bypasses the lengthy optimization required by previous approaches, which often employ a per-object…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Xuhui Jia , Yang Zhao , Kelvin C. K. Chan , Yandong Li , Han Zhang , Boqing Gong , Tingbo Hou , Huisheng Wang , Yu-Chuan Su

LCM-Lookahead for Encoder-based Text-to-Image Personalization

Recent advancements in diffusion models have introduced fast sampling methods that can effectively produce high-quality images in just one or a few denoising steps. Interestingly, when these are distilled from existing diffusion models,…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Rinon Gal , Or Lichter , Elad Richardson , Or Patashnik , Amit H. Bermano , Gal Chechik , Daniel Cohen-Or

DreamIdentity: Improved Editability for Efficient Face-identity Preserved Image Generation

While large-scale pre-trained text-to-image models can synthesize diverse and high-quality human-centric images, an intractable problem is how to preserve the face identity for conditioned face images. Existing methods either require…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Zhuowei Chen , Shancheng Fang , Wei Liu , Qian He , Mengqi Huang , Yongdong Zhang , Zhendong Mao

FaceMe: Robust Blind Face Restoration with Personal Identification

Blind face restoration is a highly ill-posed problem due to the lack of necessary context. Although existing methods produce high-quality outputs, they often fail to faithfully preserve the individual's identity. In this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Siyu Liu , Zheng-Peng Duan , Jia OuYang , Jiayi Fu , Hyunhee Park , Zikun Liu , Chun-Le Guo , Chongyi Li

PFStorer: Personalized Face Restoration and Super-Resolution

Recent developments in face restoration have achieved remarkable results in producing high-quality and lifelike outputs. The stunning results however often fail to be faithful with respect to the identity of the person as the models lack…

Computer Vision and Pattern Recognition · Computer Science 2024-03-14 Tuomas Varanka , Tapani Toivonen , Soumya Tripathy , Guoying Zhao , Erman Acar

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

One of the main drawback of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Senmao Li , Taihang Hu , Joost van de Weijer , Fahad Shahbaz Khan , Tao Liu , Linxuan Li , Shiqi Yang , Yaxing Wang , Ming-Ming Cheng , Jian Yang

Powerful and Flexible: Personalized Text-to-Image Generation via Reinforcement Learning

Personalized text-to-image models allow users to generate varied styles of images (specified with a sentence) for an object (specified with a set of reference images). While remarkable results have been achieved using diffusion-based…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Fanyue Wei , Wei Zeng , Zhenyang Li , Dawei Yin , Lixin Duan , Wen Li

Inserting Anybody in Diffusion Models via Celeb Basis

Exquisite demand exists for customizing the pretrained large text-to-image model, $\textit{e.g.}$, Stable Diffusion, to generate innovative concepts, such as the users themselves. However, the newly-added concept from previous customization…

Computer Vision and Pattern Recognition · Computer Science 2023-06-02 Ge Yuan , Xiaodong Cun , Yong Zhang , Maomao Li , Chenyang Qi , Xintao Wang , Ying Shan , Huicheng Zheng

Imagine yourself: Tuning-Free Personalized Image Generation

Diffusion models have demonstrated remarkable efficacy across various image-to-image tasks. In this research, we introduce Imagine yourself, a state-of-the-art model designed for personalized image generation. Unlike conventional…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zecheng He , Bo Sun , Felix Juefei-Xu , Haoyu Ma , Ankit Ramchandani , Vincent Cheung , Siddharth Shah , Anmol Kalia , Harihar Subramanyam , Alireza Zareian , Li Chen , Ankit Jain , Ning Zhang , Peizhao Zhang , Roshan Sumbaly , Peter Vajda , Animesh Sinha

IC-Portrait: In-Context Matching for View-Consistent Personalized Portrait

Existing diffusion models show great potential for identity-preserving generation. However, personalized portrait generation remains challenging due to the diversity in user profiles, including variations in appearance and lighting…

Computer Vision and Pattern Recognition · Computer Science 2025-02-03 Han Yang , Enis Simsar , Sotiris Anagnostidis , Yanlong Zang , Thomas Hofmann , Ziwei Liu

IDAdapter: Learning Mixed Features for Tuning-Free Personalization of Text-to-Image Models

Leveraging Stable Diffusion for the generation of personalized portraits has emerged as a powerful and noteworthy tool, enabling users to create high-fidelity, custom character avatars based on their specific prompts. However, existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-22 Siying Cui , Jia Guo , Xiang An , Jiankang Deng , Yongle Zhao , Xinyu Wei , Ziyong Feng

Correcting Diffusion-Based Perceptual Image Compression with Privileged End-to-End Decoder

The images produced by diffusion models can attain excellent perceptual quality. However, it is challenging for diffusion models to guarantee distortion, hence the integration of diffusion models and image compression models still needs…

Image and Video Processing · Electrical Eng. & Systems 2024-05-03 Yiyang Ma , Wenhan Yang , Jiaying Liu

Generating Multi-Image Synthetic Data for Text-to-Image Customization

Customization of text-to-image models enables users to insert new concepts or objects and generate them in unseen settings. Existing methods either rely on comparatively expensive test-time optimization or train encoders on single-image…

Computer Vision and Pattern Recognition · Computer Science 2025-10-14 Nupur Kumari , Xi Yin , Jun-Yan Zhu , Ishan Misra , Samaneh Azadi

The Diffusion Encoder

We construct a new kind of encoder, leveraging the expressive power of diffusion models. In a traditional variational autoencoder, the encoder and decoder jointly negotiate a latent representation of the input. This is made possible by the…

Machine Learning · Computer Science 2026-05-14 Akhil Premkumar , Sarah Lucioni

Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation

In human-centric content generation, the pre-trained text-to-image models struggle to produce user-wanted portrait images, which retain the identity of individuals while exhibiting diverse expressions. This paper introduces our efforts…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Renshuai Liu , Bowen Ma , Wei Zhang , Zhipeng Hu , Changjie Fan , Tangjie Lv , Yu Ding , Xuan Cheng

FaceCrafter: Identity-Conditional Diffusion with Disentangled Control over Facial Pose, Expression, and Emotion

Human facial images encode a rich spectrum of information, encompassing both stable identity-related traits and mutable attributes such as pose, expression, and emotion. While recent advances in image generation have enabled high-quality…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Kazuaki Mishima , Antoni Bigata Casademunt , Stavros Petridis , Maja Pantic , Kenji Suzuki

Face2Diffusion for Fast and Editable Face Personalization

Face personalization aims to insert specific faces, taken from images, into pretrained text-to-image diffusion models. However, it is still challenging for previous methods to preserve both the identity similarity and editability due to…

Computer Vision and Pattern Recognition · Computer Science 2024-03-11 Kaede Shiohara , Toshihiko Yamasaki

Identity-guided Face Generation with Multi-modal Contour Conditions

Recent face generation methods have tried to synthesize faces based on the given contour condition, like a low-resolution image or sketch. However, the problem of identity ambiguity remains unsolved, which usually occurs when the contour is…

Computer Vision and Pattern Recognition · Computer Science 2022-08-03 Qingyan Bai , Weihao Xia , Fei Yin , Yujiu Yang