Related papers: Event-Customized Image Generation

Customized Image Narrative Generation via Interactive Visual Question Generation and Answering

Image description task has been invariably examined in a static manner with qualitative presumptions held to be universally applicable, regardless of the scope or target of the description. In practice, however, different viewers may pay…

Computation and Language · Computer Science 2018-05-02 Andrew Shin, Yoshitaka Ushiku, Tatsuya Harada

User-Friendly Customized Generation with Multi-Modal Prompts

Text-to-image generation models have seen considerable advancement, catering to the increasing interest in personalized image creation. Current customization techniques often necessitate users to provide multiple images (typically 3-5) for…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Linhao Zhong, Yan Hong, Wentao Chen, Binglin Zhou, Yiyi Zhang, Jianfu Zhang, Liqing Zhang

Fast Personalized Text-to-Image Syntheses With Attention Injection

Currently, personalized image generation methods mostly require considerable time to finetune and often overfit the concept resulting in generated images that are similar to custom concepts but difficult to edit by prompts. We propose an…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Yuxuan Zhang, Yiren Song, Jinpeng Yu, Han Pan, Zhongliang Jing

Customization Assistant for Text-to-image Generation

Customizing pre-trained text-to-image generation model has attracted massive research interest recently, due to its huge potential in real-world applications. Although existing methods are able to generate creative content for a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-05-10 Yufan Zhou, Ruiyi Zhang, Jiuxiang Gu, Tong Sun

CustomVideo: Customizing Text-to-Video Generation with Multiple Subjects

Customized text-to-video generation aims to generate high-quality videos guided by text prompts and subject references. Current approaches for personalizing text-to-video generation suffer from tackling multiple subjects, which is a more…

Computer Vision and Pattern Recognition · Computer Science 2025-10-29 Zhao Wang, Aoxue Li, Lingting Zhu, Yong Guo, Qi Dou, Zhenguo Li

CustomText: Customized Textual Image Generation using Diffusion Models

Textual image generation spans diverse fields like advertising, education, product packaging, social media, information visualization, and branding. Despite recent strides in language-guided image synthesis using diffusion models, current…

Computer Vision and Pattern Recognition · Computer Science 2024-05-22 Shubham Paliwal, Arushi Jain, Monika Sharma, Vikram Jamwal, Lovekesh Vig

Sketch-Guided Scene Image Generation

Text-to-image models are showcasing the impressive ability to create high-quality and diverse generative images. Nevertheless, the transition from freehand sketches to complex scene images remains challenging using diffusion models. In this…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Tianyu Zhang, Xiaoxuan Xie, Xusheng Du, Haoran Xie

Interact-Custom: Customized Human Object Interaction Image Generation

Compositional Customized Image Generation aims to customize multiple target concepts within generation content, which has gained attention for its wide application. Existing approaches mainly concentrate on the target entity's appearance…

Computer Vision and Pattern Recognition · Computer Science 2025-08-29 Zhu Xu, Zhaowen Wang, Yuxin Peng, Yang Liu

CustomNet: Zero-shot Object Customization with Variable-Viewpoints in Text-to-Image Diffusion Models

Incorporating a customized object into image generation presents an attractive feature in text-to-image generation. However, existing optimization-based and encoder-based methods are hindered by drawbacks such as time-consuming…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Ziyang Yuan, Mingdeng Cao, Xintao Wang, Zhongang Qi, Chun Yuan, Ying Shan

DreamRelation: Bridging Customization and Relation Generation

Customized image generation is essential for creating personalized content based on user prompts, allowing large-scale text-to-image diffusion models to more effectively meet individual needs. However, existing models often neglect the…

Computer Vision and Pattern Recognition · Computer Science 2025-04-08 Qingyu Shi, Lu Qi, Jianzong Wu, Jinbin Bai, Jingbo Wang, Yunhai Tong, Xiangtai Li

AlignGen: Boosting Personalized Image Generation with Cross-Modality Prior Alignment

Personalized image generation aims to integrate user-provided concepts into text-to-image models, enabling the generation of customized content based on a given prompt. Recent zero-shot approaches, particularly those leveraging diffusion…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Yiheng Lin, Shifang Zhao, Ting Liu, Xiaochao Qu, Luoqi Liu, Yao Zhao, Yunchao Wei

ControlEvents: Controllable Synthesis of Event Camera Data with Foundational Prior from Image Diffusion Models

In recent years, event cameras have gained significant attention due to their bio-inspired properties, such as high temporal resolution and high dynamic range. However, obtaining large-scale labeled ground-truth data for event-based vision…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Yixuan Hu, Yuxuan Xue, Simon Klenk, Daniel Cremers, Gerard Pons-Moll

Modeling Complex Event Scenarios via Simple Entity-focused Questions

Event scenarios are often complex and involve multiple event sequences connected through different entity participants. Exploring such complex scenarios requires an ability to branch through different sequences, something that is difficult…

Computation and Language · Computer Science 2023-02-15 Mahnaz Koupaee, Greg Durrett, Nathanael Chambers, Niranjan Balasubramanian

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Recent text-to-image generation methods provide a simple yet exciting conversion capability between text and image domains. While these methods have incrementally improved the generated image fidelity and text relevancy, several pivotal…

Computer Vision and Pattern Recognition · Computer Science 2022-03-25 Oran Gafni, Adam Polyak, Oron Ashual, Shelly Sheynin, Devi Parikh, Yaniv Taigman

FreeEnhance: Tuning-Free Image Enhancement via Content-Consistent Noising-and-Denoising Process

The emergence of text-to-image generation models has led to the recognition that image enhancement, performed as post-processing, would significantly improve the visual quality of the generated images. Exploring diffusion models to enhance…

Computer Vision and Pattern Recognition · Computer Science 2024-09-12 Yang Luo, Yiheng Zhang, Zhaofan Qiu, Ting Yao, Zhineng Chen, Yu-Gang Jiang, Tao Mei

Custom-Edit: Text-Guided Image Editing with Customized Diffusion Models

Text-to-image diffusion models can generate diverse, high-fidelity images based on user-provided text prompts. Recent research has extended these models to support text-guided image editing. While text guidance is an intuitive editing…

Computer Vision and Pattern Recognition · Computer Science 2023-05-26 Jooyoung Choi, Yunjey Choi, Yunji Kim, Junho Kim, Sungroh Yoon

Personalized Image Generation with Deep Generative Models: A Decade Survey

Recent advancements in generative models have significantly facilitated the development of personalized content creation. Given a small set of images with user-specific concept, personalized image generation allows to create images that…

Computer Vision and Pattern Recognition · Computer Science 2025-02-19 Yuxiang Wei, Yiheng Zheng, Yabo Zhang, Ming Liu, Zhilong Ji, Lei Zhang, Wangmeng Zuo

FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition

Benefiting from large-scale pre-trained text-to-image (T2I) generative models, impressive progress has been achieved in customized image generation, which aims to generate user-specified concepts. Existing approaches have extensively…

Computer Vision and Pattern Recognition · Computer Science 2024-05-24 Ganggui Ding, Canyu Zhao, Wen Wang, Zhen Yang, Zide Liu, Hao Chen, Chunhua Shen

Text-to-Events: Synthetic Event Camera Streams from Conditional Text Input

Event cameras are advantageous for tasks that require vision sensors with low-latency and sparse output responses. However, the development of deep network algorithms using event cameras has been slow because of the lack of large labelled…

Computer Vision and Pattern Recognition · Computer Science 2024-06-06 Joachim Ott, Zuowen Wang, Shih-Chii Liu

Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach

Recent text-to-image generation models have demonstrated impressive capability of generating text-aligned images with high fidelity. However, generating images of novel concept provided by the user input image is still a challenging task.…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Yufan Zhou, Ruiyi Zhang, Tong Sun, Jinhui Xu