Computer Vision and Pattern Recognition · Computer Science
LLM Blueprint: Enabling Text-to-Image Generation with Complex and Detailed Prompts
Hanan Gani, Shariq Farooq Bhat, Muzammal Naseer, Salman Khan +1
2024-02-27
Computer Vision and Pattern Recognition · Computer Science
Instruct-Imagen: Image Generation with Multi-modal Instruction
Hexiang Hu, Kelvin C. K. Chan, Yu-Chuan Su, Wenhu Chen +8
2024-01-05
Computer Vision and Pattern Recognition · Computer Science
LayoutLLM-T2I: Eliciting Layout Guidance from LLM for Text-to-Image Generation
Leigang Qu, Shengqiong Wu, Hao Fei, Liqiang Nie +1
2023-08-15
Computer Vision and Pattern Recognition · Computer Science
Images in Sentences: Scaling Interleaved Instructions for Unified Visual Generation
Yabo Zhang, Kunchang Li, Dewei Zhou, Xinyu Huang +1
2026-05-13
Computer Vision and Pattern Recognition · Computer Science
Generating Compositional Scenes via Text-to-image RGBA Instance Generation
Alessandro Fontanella, Petru-Daniel Tudosiu, Yongxin Yang, Shifeng Zhang +1
2024-11-19
Computation and Language · Computer Science
$I^2G$: Generating Instructional Illustrations via Text-Conditioned Diffusion
Jing Bi, Pinxin Liu, Ali Vosoughi, Jiarui Wu +2
2025-05-23
Computer Vision and Pattern Recognition · Computer Science
SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis
Azade Farshad, Yousef Yeganeh, Yu Chi, Chengzhi Shen +2
2023-05-01
Computer Vision and Pattern Recognition · Computer Science
LLM-grounded Diffusion: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models
Long Lian, Boyi Li, Adam Yala, Trevor Darrell
2024-03-05
Computer Vision and Pattern Recognition · Computer Science
Sketch-Guided Scene Image Generation
Tianyu Zhang, Xiaoxuan Xie, Xusheng Du, Haoran Xie
2024-07-10
Computer Vision and Pattern Recognition · Computer Science
Instruction-based Image Editing with Planning, Reasoning, and Generation
Liya Ji, Chenyang Qi, Qifeng Chen
2026-02-27
Computer Vision and Pattern Recognition · Computer Science
Generating Coherent Sequences of Visual Illustrations for Real-World Manual Tasks
João Bordalo, Vasco Ramos, Rodrigo Valério, Diogo Glória-Silva +4
2024-05-17
Computer Vision and Pattern Recognition · Computer Science
Multi-modal Generation via Cross-Modal In-Context Learning
Amandeep Kumar, Muzammal Naseer, Sanath Narayan, Rao Muhammad Anwer +2
2024-05-29
Computer Vision and Pattern Recognition · Computer Science
PrefGen: Multimodal Preference Learning for Preference-Conditioned Image Generation
Wenyi Mo, Tianyu Zhang, Yalong Bai, Ligong Han +2
2025-12-09
Computer Vision and Pattern Recognition · Computer Science
Conditional Text-to-Image Generation with Reference Guidance
Taewook Kim, Ze Wang, Zhengyuan Yang, Jiang Wang +3
2025-12-15
Computer Vision and Pattern Recognition · Computer Science
Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction
Alaaeldin El-Nouby, Shikhar Sharma, Hannes Schulz, Devon Hjelm +4
2019-09-24
Computer Vision and Pattern Recognition · Computer Science
InstructBooth: Instruction-following Personalized Text-to-Image Generation
Daewon Chae, Nokyung Park, Jinkyu Kim, Kimin Lee
2024-02-16
Computer Vision and Pattern Recognition · Computer Science
Instance-Level Generation for Representation Learning
Yankun Wu, Zakaria Laskar, Giorgos Kordopatis-Zilos, Noa Garcia +1
2025-10-13
Computer Vision and Pattern Recognition · Computer Science
ComfyGen: Prompt-Adaptive Workflows for Text-to-Image Generation
Rinon Gal, Adi Haviv, Yuval Alaluf, Amit H. Bermano +2
2024-10-03
Computer Vision and Pattern Recognition · Computer Science
InstanceV: Instance-Level Video Generation
Yuheng Chen, Teng Hu, Jiangning Zhang, Zhucun Xue +2
2025-12-01