English
Related papers

Related papers: Temporal Feature Matters: A Framework for Diffusio…

200 papers

The Diffusion model, a prevalent framework for image generation, encounters significant challenges in terms of broad applicability due to its extended inference times and substantial memory requirements. Efficient Post-training Quantization…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Yushi Huang , Ruihao Gong , Jing Liu , Tianlong Chen , Xianglong Liu

The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Junhyuk So , Jungwon Lee , Daehyun Ahn , Hyungjun Kim , Eunhyeok Park

Diffusion models have achieved great success in image synthesis through iterative noise estimation using deep neural networks. However, the slow inference, high memory consumption, and computation intensity of the noise estimation model…

Computer Vision and Pattern Recognition · Computer Science 2023-06-09 Xiuyu Li , Yijiang Liu , Long Lian , Huanrui Yang , Zhen Dong , Daniel Kang , Shanghang Zhang , Kurt Keutzer

Diffusion models have recently dominated image synthesis tasks. However, the iterative denoising process is expensive in computations at inference time, making diffusion models less practical for low-latency and scalable real-world…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Yefei He , Luping Liu , Jing Liu , Weijia Wu , Hong Zhou , Bohan Zhuang

Diffusionmodels(DMs)havedemonstratedremarkableachievements in synthesizing images of high fidelity and diversity. However, the extensive computational requirements and slow generative speed of diffusion models have limited their widespread…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Jiaojiao Ye , Zhen Wang , Linnan Jiang

Diffusion models have shown remarkable performance in image synthesis by progressively estimating a smooth transition from a Gaussian distribution of noise to a real image. Unfortunately, their practical deployment is limited by slow…

Machine Learning · Computer Science 2026-03-03 Dung Anh Hoang , Cuong Pham anh Trung Le , Jianfei Cai , Thanh-Toan Do

Diffusion models have emerged as preeminent contenders in the realm of generative models. Distinguished by their distinctive sequential generative processes, characterized by hundreds or even thousands of timesteps, diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Haojun Sun , Chen Tang , Zhi Wang , Yuan Meng , Jingyan jiang , Xinzhu Ma , Wenwu Zhu

Recently, video diffusion models (VDMs) have garnered significant attention due to their notable advancements in generating coherent and realistic video content. However, processing multiple frame features concurrently, coupled with the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Shilong Tian , Hong Chen , Chengtao Lv , Yu Liu , Jinyang Guo , Xianglong Liu , Shengxi Li , Hao Yang , Tao Xie

Denoising diffusion (score-based) generative models have recently achieved significant accomplishments in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data into noise and a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Yuzhang Shang , Zhihang Yuan , Bin Xie , Bingzhe Wu , Yan Yan

Diffusion models have achieved remarkable success in the image and video generation tasks. Nevertheless, they often require a large amount of memory and time overhead during inference, due to the complex network architecture and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Haocheng Huang , Jiaxin Chen , Jinyang Guo , Ruiyi Zhan , Yunhong Wang

Diffusion models have marked a significant breakthrough in the synthesis of semantically coherent images. However, their extensive noise estimation networks and the iterative generation process limit their wider application, particularly on…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Yuzhe Yao , Feng Tian , Jun Chen , Haonan Lin , Guang Dai , Yong Liu , Jingdong Wang

Diffusion Transformer (DiT) has now become the preferred choice for building image generation models due to its great generation capability. Unlike previous convolution-based UNet models, DiT is purely composed of a stack of transformer…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Ning Ding , Jing Han , Yuchuan Tian , Chao Xu , Kai Han , Yehui Tang

Latent Diffusion Models (LDMs) capture the dynamic evolution of latent variables over time, blending patterns and multimodality in a generative system. Despite the proficiency of LDM in various applications, such as text-to-image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yuewei Yang , Xiaoliang Dai , Jialiang Wang , Peizhao Zhang , Hongbo Zhang

Diffusion transformers (DiTs) combine transformer architectures with diffusion models. However, their computational complexity imposes significant limitations on real-time applications and sustainability of AI systems. In this study, we aim…

Machine Learning · Computer Science 2025-02-07 Younghye Hwang , Hyojin Lee , Joonhyuk Kang

Diffusion Models (DMs) utilize an iterative denoising process to transform random noise into synthetic data. Initally proposed with a UNet structure, DMs excel at producing images that are virtually indistinguishable with or without…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Yuewei Yang , Jialiang Wang , Xiaoliang Dai , Peizhao Zhang , Hongbo Zhang

Diffusion models have been widely adopted in image and video generation. However, their complex network architecture leads to high inference overhead for its generation process. Existing diffusion quantization methods primarily focus on the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Yihua Shao , Deyang Lin , Fanhu Zeng , Minxi Yan , Muyang Zhang , Siyu Chen , Yuxuan Fan , Ziyang Yan , Haozhe Wang , Jingcai Guo , Yan Wang , Haotong Qin , Hao Tang

Transformer-based diffusion models, dubbed Diffusion Transformers (DiTs), have achieved state-of-the-art performance in image and video generation tasks. However, their large model size and slow inference speed limit their practical…

Image and Video Processing · Electrical Eng. & Systems 2026-01-26 Xinyan Liu , Huihong Shi , Yang Xu , Zhongfeng Wang

Diffusion models are the go-to method for Text-to-Image generation, but their iterative denoising processes has high inference latency. Quantization reduces compute time by using lower bitwidths, but applies a fixed precision across all…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Basile Lewandowski , Simon Kurz , Aditya Shankar , Robert Birke , Jian-Jia Chen , Lydia Y. Chen

Recent advancements in diffusion models, particularly the architectural transformation from UNet-based models to Diffusion Transformers (DiTs), significantly improve the quality and scalability of image and video generation. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Lei Chen , Yuan Meng , Chen Tang , Xinzhu Ma , Jingyan Jiang , Xin Wang , Zhi Wang , Wenwu Zhu

Diffusion transformers have demonstrated remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Tianchen Zhao , Tongcheng Fang , Haofeng Huang , Enshu Liu , Rui Wan , Widyadewi Soedarmadji , Shiyao Li , Zinan Lin , Guohao Dai , Shengen Yan , Huazhong Yang , Xuefei Ning , Yu Wang
‹ Prev 1 2 3 10 Next ›