English
Related papers

Related papers: A-SDM: Accelerating Stable Diffusion through Redun…

200 papers

The Stable Diffusion Model (SDM) is a prevalent and effective model for text-to-image (T2I) and image-to-image (I2I) generation. Despite various attempts at sampler optimization, model distillation, and network quantification, these…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Jinchao Zhu , Yuxuan Wang , Siyuan Pan , Pengfei Wan , Di Zhang , Gao Huang

Text-to-image (T2I) generation with Stable Diffusion models (SDMs) involves high computing demands due to billion-scale parameters. To enhance efficiency, recent studies have reduced sampling steps and applied network quantization while…

Machine Learning · Computer Science 2024-12-03 Bo-Kyeong Kim , Hyoung-Kyu Song , Thibault Castells , Shinkook Choi

The intensive computational burden of Stable Diffusion (SD) for text-to-image generation poses a significant hurdle for its practical application. To tackle this challenge, recent research focuses on methods to reduce sampling steps, such…

Diffusion models (DMs) have been adopted across diverse fields with its remarkable abilities in capturing intricate data distributions. In this paper, we propose a Fast Diffusion Model (FDM) to significantly speed up DMs from a stochastic…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Zike Wu , Pan Zhou , Kenji Kawaguchi , Hanwang Zhang

The emergence of diffusion models has significantly advanced generative AI, improving the quality, realism, and creativity of image and video generation. Among them, Stable Diffusion (StableDiff) stands out as a key model for text-to-image…

Hardware Architecture · Computer Science 2025-07-03 Zhican Wang , Guanghui He , Hongxiang Fan

Stable Diffusion Models (SDMs) have shown remarkable proficiency in image synthesis. However, their broad application is impeded by their large model sizes and intensive computational requirements, which typically require expensive cloud…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 Chenqian Yan , Songwei Liu , Hongjian Liu , Xurui Peng , Xiaojian Wang , Fangmin Chen , Lean Fu , Xing Mei

We present Stable Video Diffusion - a latent video diffusion model for high-resolution, state-of-the-art text-to-video and image-to-video generation. Recently, latent diffusion models trained for 2D image synthesis have been turned into…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Andreas Blattmann , Tim Dockhorn , Sumith Kulal , Daniel Mendelevitch , Maciej Kilian , Dominik Lorenz , Yam Levi , Zion English , Vikram Voleti , Adam Letts , Varun Jampani , Robin Rombach

Recent years have witnessed Spiking Neural Networks (SNNs) gaining attention for their ultra-low energy consumption and high biological plausibility compared with traditional Artificial Neural Networks (ANNs). Despite their distinguished…

Neural and Evolutionary Computing · Computer Science 2024-08-30 Jiahang Cao , Hanzhong Guo , Ziqing Wang , Deming Zhou , Hao Cheng , Qiang Zhang , Renjing Xu

Diffusion models (DMs) are a powerful generative framework that have attracted significant attention in recent years. However, the high computational cost of training DMs limits their practical applications. In this paper, we start with a…

Machine Learning · Computer Science 2024-04-12 Tianshuo Xu , Peng Mi , Ruilin Wang , Yingcong Chen

Diffusion models (DMs) have become the dominant paradigm of generative modeling in a variety of domains by learning stochastic processes from noise to data. Recently, diffusion denoising bridge models (DDBMs), a new formulation of…

Machine Learning · Computer Science 2024-11-01 Guande He , Kaiwen Zheng , Jianfei Chen , Fan Bao , Jun Zhu

Diffusion models achieve superior generation quality but suffer from slow generation speed due to the iterative nature of denoising. In contrast, consistency models, a new generative family, achieve competitive performance with…

Machine Learning · Computer Science 2024-12-05 Fu-Yun Wang , Zhengyang Geng , Hongsheng Li

Diffusion models, such as Stable Diffusion, have shown incredible performance on text-to-image generation. Since text-to-image generation often requires models to generate visual concepts with fine-grained details and attributes specified…

Computer Vision and Pattern Recognition · Computer Science 2024-04-26 Xuehai He , Weixi Feng , Tsu-Jui Fu , Varun Jampani , Arjun Akula , Pradyumna Narayana , Sugato Basu , William Yang Wang , Xin Eric Wang

Stable diffusion models have ushered in a new era of advancements in image generation, currently reigning as the state-of-the-art approach, exhibiting unparalleled performance. The process of diffusion, accompanied by denoising through…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Andras Horvath

As text-to-image models grow increasingly powerful and complex, their burgeoning size presents a significant obstacle to widespread adoption, especially on resource-constrained devices. This paper presents a pioneering study on…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Samarth N Ramesh , Zhixue Zhao

We introduce the Fixed Point Diffusion Model (FPDM), a novel approach to image generation that integrates the concept of fixed point solving into the framework of diffusion-based generative modeling. Our approach embeds an implicit fixed…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Xingjian Bai , Luke Melas-Kyriazi

Background: Text-to-image generation models are widely used across numerous domains. Among these models, Stable Diffusion (SD) - an open-source text-to-image generation model - has become the most popular, producing over 12 billion images…

Software Engineering · Computer Science 2025-12-08 Giordano d'Aloisio , Tosin Fadahunsi , Jay Choy , Rebecca Moussa , Federica Sarro

Diffusion Models (DMs) have demonstrated state-of-the-art performance in content generation without requiring adversarial training. These models are trained using a two-step process. First, a forward - diffusion - process gradually adds…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Anwaar Ulhaq , Naveed Akhtar

The Diffusion Model (DM) has emerged as the SOTA approach for image synthesis. However, the existing DM cannot perform well on some image-to-image translation (I2I) tasks. Different from image synthesis, some I2I tasks, such as…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Bin Xia , Yulun Zhang , Shiyin Wang , Yitong Wang , Xinglong Wu , Yapeng Tian , Wenming Yang , Radu Timotfe , Luc Van Gool

Diffusion Model (DM) based Semantic Image Communication (SIC) systems face significant challenges, such as slow inference speed and generation randomness, that limit their reliability and practicality. To overcome these issues, we propose a…

Image and Video Processing · Electrical Eng. & Systems 2025-07-24 Bilal Khalid , Pedro Freire , Sergei K. Turitsyn , Jaroslaw E. Prilepsky

This paper presents SANA-Sprint, an efficient diffusion model for ultra-fast text-to-image (T2I) generation. SANA-Sprint is built on a pre-trained foundation model and augmented with hybrid distillation, dramatically reducing inference…

Graphics · Computer Science 2025-09-30 Junsong Chen , Shuchen Xue , Yuyang Zhao , Jincheng Yu , Sayak Paul , Junyu Chen , Han Cai , Song Han , Enze Xie
‹ Prev 1 2 3 10 Next ›