Computer Vision and Pattern Recognition · Computer Science
Exploring Discrete Diffusion Models for Image Captioning
Zixin Zhu, Yixuan Wei, Jianfeng Wang, Zhe Gan +6
2022-12-12
Computer Vision and Pattern Recognition · Computer Science
Prefix-diffusion: A Lightweight Diffusion Model for Diverse Image Captioning
Guisheng Liu, Yi Li, Zhengcong Fei, Haiyan Fu +2
2023-10-18
Computer Vision and Pattern Recognition · Computer Science
DiffCap: Exploring Continuous Diffusion on Image Captioning
Yufeng He, Zefan Cai, Xu Gan, Baobao Chang
2023-05-23
Computer Vision and Pattern Recognition · Computer Science
Hierarchical Text-Conditional Image Generation with CLIP Latents
Aditya Ramesh, Prafulla Dhariwal, Alex Nichol, Casey Chu +1
2022-04-14
Computer Vision and Pattern Recognition · Computer Science
Dense Text-to-Image Generation with Attention Modulation
Yunji Kim, Jiyoung Lee, Jin-Hwa Kim, Jung-Woo Ha +1
2023-08-25
Computer Vision and Pattern Recognition · Computer Science
Guiding Image Captioning Models Toward More Specific Captions
Simon Kornblith, Lala Li, Zirui Wang, Thao Nguyen
2023-08-01
Computer Vision and Pattern Recognition · Computer Science
Controlling Latent Diffusion Using Latent CLIP
Jason Becker, Chris Wendler, Peter Baylies, Robert West +1
2025-03-12
Computer Vision and Pattern Recognition · Computer Science
Diffusion Model-Based Image Editing: A Survey
Yi Huang, Jiancheng Huang, Yifan Liu, Mingfu Yan +6
2025-03-12
Computer Vision and Pattern Recognition · Computer Science
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Eyal Segalis, Dani Valevski, Danny Lumen, Yossi Matias +1
2023-10-26
Computer Vision and Pattern Recognition · Computer Science
DiffVC: A Non-autoregressive Framework Based on Diffusion Model for Video Captioning
Junbo Wang, Liangyu Fu, Yuke Li, Yining Zhu +3
2026-04-10
Computer Vision and Pattern Recognition · Computer Science
Variational Distribution Learning for Unsupervised Text-to-Image Generation
Minsoo Kang, Doyup Lee, Jiseob Kim, Saehoon Kim +1
2023-03-29
Computer Vision and Pattern Recognition · Computer Science
From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Changming Xiao, Qi Yang, Feng Zhou, Changshui Zhang
2024-10-02
Image and Video Processing · Electrical Eng. & Systems
Lossy Image Compression with Foundation Diffusion Models
Lucas Relic, Roberto Azevedo, Markus Gross, Christopher Schroers
2024-10-10
Computer Vision and Pattern Recognition · Computer Science
Debiasing Classifiers by Amplifying Bias with Latent Diffusion and Large Language Models
Donggeun Ko, Dongjun Lee, Namjun Park, Wonkyeong Shim +1
2024-11-26
Computer Vision and Pattern Recognition · Computer Science
Cap2Aug: Caption guided Image to Image data Augmentation
Aniket Roy, Anshul Shah, Ketul Shah, Anirban Roy +1
2023-11-08
Computer Vision and Pattern Recognition · Computer Science
Conditional Generation from Unconditional Diffusion Models using Denoiser Representations
Alexandros Graikos, Srikar Yellapragada, Dimitris Samaras
2023-06-06