Related papers: A Diffusion-based Method for Multi-turn Compositio…

Text-guided Controllable Diffusion for Realistic Camouflage Images Generation

Camouflage Images Generation (CIG) is an emerging research area that focuses on synthesizing images in which objects are harmoniously blended and exhibit high visual consistency with their surroundings. Existing methods perform CIG by…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Yuhang Qian , Haiyan Chen , Wentong Li , Ningzhong Liu , Jie Qin

Conditional Text Image Generation with Diffusion Models

Current text recognition systems, including those for handwritten scripts and scene text, have relied heavily on image synthesis and augmentation, since it is difficult to realize real-world complexity and diversity through collecting and…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Yuanzhi Zhu , Zhaohai Li , Tianwei Wang , Mengchao He , Cong Yao

Person Image Synthesis via Denoising Diffusion Model

The pose-guided person image generation task requires synthesizing photorealistic images of humans in arbitrary poses. The existing approaches use generative adversarial networks that do not necessarily maintain realistic textures or need…

Computer Vision and Pattern Recognition · Computer Science 2023-03-02 Ankan Kumar Bhunia , Salman Khan , Hisham Cholakkal , Rao Muhammad Anwer , Jorma Laaksonen , Mubarak Shah , Fahad Shahbaz Khan

DDFM: Denoising Diffusion Model for Multi-Modality Image Fusion

Multi-modality image fusion aims to combine different modalities to produce fused images that retain the complementary features of each modality, such as functional highlights and texture details. To leverage strong generative priors and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Zixiang Zhao , Haowen Bai , Yuanzhi Zhu , Jiangshe Zhang , Shuang Xu , Yulun Zhang , Kai Zhang , Deyu Meng , Radu Timofte , Luc Van Gool

Enhancing Diffusion-Based Quantitatively Controllable Image Generation via Matrix-Form EDM and Adaptive Vicinal Training

Continuous Conditional Diffusion Model (CCDM) is a diffusion-based framework designed to generate high-quality images conditioned on continuous regression labels. Although CCDM has demonstrated clear advantages over prior approaches across…

Computer Vision and Pattern Recognition · Computer Science 2026-02-03 Xin Ding , Yun Chen , Sen Zhang , Kao Zhang , Nenglun Chen , Peibei Cao , Yongwei Wang , Fei Wu

Dig2DIG: Dig into Diffusion Information Gains for Image Fusion

Image fusion integrates complementary information from multi-source images to generate more informative results. Recently, the diffusion model, which demonstrates unprecedented generative potential, has been explored in image fusion.…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Bing Cao , Baoshuo Cai , Changqing Zhang , Qinghua Hu

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Cross-conditioned Diffusion Model for Medical Image to Image Translation

Multi-modal magnetic resonance imaging (MRI) provides rich, complementary information for analyzing diseases. However, the practical challenges of acquiring multiple MRI modalities, such as cost, scan time, and safety considerations, often…

Image and Video Processing · Electrical Eng. & Systems 2024-09-16 Zhaohu Xing , Sicheng Yang , Sixiang Chen , Tian Ye , Yijun Yang , Jing Qin , Lei Zhu

Conditional Controllable Image Fusion

Image fusion aims to integrate complementary information from multiple input images acquired through various sources to synthesize a new fused image. Existing methods usually employ distinct constraint designs tailored to specific scenes,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Bing Cao , Xingxin Xu , Pengfei Zhu , Qilong Wang , Qinghua Hu

CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation

Recently, 3D generation methods have shown their powerful ability to automate 3D model creation. However, most 3D generation methods only rely on an input image or a text prompt to generate a 3D model, which lacks the control of each…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Peng Li , Suizhi Ma , Jialiang Chen , Yuan Liu , Congyi Zhang , Wei Xue , Wenhan Luo , Alla Sheffer , Wenping Wang , Yike Guo

ControlCom: Controllable Image Composition using Diffusion Model

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

Discrete Contrastive Diffusion for Cross-Modal Music and Image Generation

Diffusion probabilistic models (DPMs) have become a popular approach to conditional generation, due to their promising results and support for cross-modal synthesis. A key desideratum in conditional synthesis is to achieve high…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Ye Zhu , Yu Wu , Kyle Olszewski , Jian Ren , Sergey Tulyakov , Yan Yan

Generating Intermediate Representations for Compositional Text-To-Image Generation

Text-to-image diffusion models have demonstrated an impressive ability to produce high-quality outputs. However, they often struggle to accurately follow fine-grained spatial information in an input text. To this end, we propose a…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Ran Galun , Sagie Benaim

CCDM: Continuous Conditional Diffusion Models for Image Generation

Continuous Conditional Generative Modeling (CCGM) estimates high-dimensional data distributions, such as images, conditioned on scalar continuous variables (aka regression labels). While Continuous Conditional Generative Adversarial…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Xin Ding , Yongwei Wang , Kao Zhang , Z. Jane Wang

Higher fidelity perceptual image and video compression with a latent conditioned residual denoising diffusion model

Denoising diffusion models achieved impressive results on several image generation tasks often outperforming GAN based models. Recently, the generative capabilities of diffusion models have been employed for perceptual image compression,…

Image and Video Processing · Electrical Eng. & Systems 2025-05-20 Jonas Brenig , Radu Timofte

Cycle Diffusion Model for Counterfactual Image Generation

Deep generative models have demonstrated remarkable success in medical image synthesis. However, ensuring conditioning faithfulness and high-quality synthetic images for direct or counterfactual generation remains a challenge. In this work,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-31 Fangrui Huang , Alan Wang , Binxu Li , Bailey Trang , Ridvan Yesiloglu , Tianyu Hua , Wei Peng , Ehsan Adeli

FFusionCGAN: An end-to-end fusion method for few-focus images using conditional GAN in cytopathological digital slides

Multi-focus image fusion technologies compress different focus depth images into an image in which most objects are in focus. However, although existing image fusion techniques, including traditional algorithms and deep learning-based…

Computer Vision and Pattern Recognition · Computer Science 2020-01-06 Xiebo Geng , Sibo Liua , Wei Han , Xu Li , Jiabo Ma , Jingya Yu , Xiuli Liu , Sahoqun Zeng , Li Chen , Shenghua Cheng

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Omer Bar-Tal , Lior Yariv , Yaron Lipman , Tali Dekel

MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation

Diffusion models have shown excellent performance in text-to-image generation. Nevertheless, existing methods often suffer from performance bottlenecks when handling complex prompts that involve multiple objects, characteristics, and…

Computer Vision and Pattern Recognition · Computer Science 2025-05-07 Mingcheng Li , Xiaolu Hou , Ziyang Liu , Dingkang Yang , Ziyun Qian , Jiawei Chen , Jinjie Wei , Yue Jiang , Qingyao Xu , Lihua Zhang

Insights into Closed-form IPM-GAN Discriminator Guidance for Diffusion Modeling

Diffusion models are a state-of-the-art generative modeling framework that transform noise to images via Langevin sampling, guided by the score, which is the gradient of the logarithm of the data distribution. Recent works have shown…

Machine Learning · Computer Science 2025-08-01 Aadithya Srikanth , Siddarth Asokan , Nishanth Shetty , Chandra Sekhar Seelamantula