Related papers: PQD: Post-training Quantization for Efficient Diff…

PTQD: Accurate Post-Training Quantization for Diffusion Models

Diffusion models have recently dominated image synthesis tasks. However, the iterative denoising process is expensive in computations at inference time, making diffusion models less practical for low-latency and scalable real-world…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Yefei He , Luping Liu , Jing Liu , Weijia Wu , Hong Zhou , Bohan Zhuang

Q-Diffusion: Quantizing Diffusion Models

Diffusion models have achieved great success in image synthesis through iterative noise estimation using deep neural networks. However, the slow inference, high memory consumption, and computation intensity of the noise estimation model…

Computer Vision and Pattern Recognition · Computer Science 2023-06-09 Xiuyu Li , Yijiang Liu , Long Lian , Huanrui Yang , Zhen Dong , Daniel Kang , Shanghang Zhang , Kurt Keutzer

QVD: Post-training Quantization for Video Diffusion Models

Recently, video diffusion models (VDMs) have garnered significant attention due to their notable advancements in generating coherent and realistic video content. However, processing multiple frame features concurrently, coupled with the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-18 Shilong Tian , Hong Chen , Chengtao Lv , Yu Liu , Jinyang Guo , Xianglong Liu , Shengxi Li , Hao Yang , Tao Xie

Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models

Diffusion models have shown remarkable performance in image synthesis by progressively estimating a smooth transition from a Gaussian distribution of noise to a real image. Unfortunately, their practical deployment is limited by slow…

Machine Learning · Computer Science 2026-03-03 Dung Anh Hoang , Cuong Pham , Trung Le , Jianfei Cai , Thanh-Toan Do

Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers

Recent advancements in diffusion models, particularly the architectural transformation from UNet-based models to Diffusion Transformers (DiTs), significantly improve the quality and scalability of image and video generation. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Lei Chen , Yuan Meng , Chen Tang , Xinzhu Ma , Jingyan Jiang , Xin Wang , Zhi Wang , Wenwu Zhu

Post-training Quantization on Diffusion Models

Denoising diffusion (score-based) generative models have recently achieved significant accomplishments in generating realistic and diverse data. These approaches define a forward diffusion process for transforming data into noise and a…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Yuzhang Shang , Zhihang Yuan , Bin Xie , Bingzhe Wu , Yan Yan

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

Diffusion models have demonstrated remarkable capabilities in image synthesis and related generative tasks. Nevertheless, their practicality for real-world applications is constrained by substantial computational costs and latency issues.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yefei He , Jing Liu , Weijia Wu , Hong Zhou , Bohan Zhuang

EDA-DM: Enhanced Distribution Alignment for Post-Training Quantization of Diffusion Models

Diffusion models have achieved great success in image generation tasks. However, the lengthy denoising process and complex neural networks hinder their low-latency applications in real-world scenarios. Quantization can effectively reduce…

Computer Vision and Pattern Recognition · Computer Science 2025-06-24 Xuewen Liu , Zhikai Li , Junrui Xiao , Mengjuan Chen , Jianquan Li , Qingyi Gu

Temporal Dynamic Quantization for Diffusion Models

The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Junhyuk So , Jungwon Lee , Daehyun Ahn , Hyungjun Kim , Eunhyeok Park

ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation

Diffusion transformers have demonstrated remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Tianchen Zhao , Tongcheng Fang , Haofeng Huang , Enshu Liu , Rui Wan , Widyadewi Soedarmadji , Shiyao Li , Zinan Lin , Guohao Dai , Shengen Yan , Huazhong Yang , Xuefei Ning , Yu Wang

An Analysis on Quantizing Diffusion Transformers

Diffusion Models (DMs) utilize an iterative denoising process to transform random noise into synthetic data. Initially proposed with a UNet structure, DMs excel at producing images that are virtually indistinguishable with or without…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Yuewei Yang , Jialiang Wang , Xiaoliang Dai , Peizhao Zhang , Hongbo Zhang

PQCAD-DM: Progressive Quantization and Calibration-Assisted Distillation for Extremely Efficient Diffusion Model

Diffusion models excel in image generation but are computational and resource-intensive due to their reliance on iterative Markov chain processes, leading to error accumulation and limiting the effectiveness of naive compression techniques.…

Computer Vision and Pattern Recognition · Computer Science 2025-06-23 Beomseok Ko , Hyeryung Jang

Efficient Quantization Strategies for Latent Diffusion Models

Latent Diffusion Models (LDMs) capture the dynamic evolution of latent variables over time, blending patterns and multimodality in a generative system. Despite the proficiency of LDM in various applications, such as text-to-image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Yuewei Yang , Xiaoliang Dai , Jialiang Wang , Peizhao Zhang , Hongbo Zhang

MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose challenges for their application on resource-constrained mobile devices or even desktop GPUs. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-05-31 Tianchen Zhao , Xuefei Ning , Tongcheng Fang , Enshu Liu , Guyue Huang , Zinan Lin , Shengen Yan , Guohao Dai , Yu Wang

QNCD: Quantization Noise Correction for Diffusion Models

Diffusion models have revolutionized image synthesis, setting new benchmarks in quality and creativity. However, their widespread adoption is hindered by the intensive computation required during the iterative denoising process.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-19 Huanpeng Chu , Wei Wu , Chengjie Zang , Kun Yuan

TR-DQ: Time-Rotation Diffusion Quantization

Diffusion models have been widely adopted in image and video generation. However, their complex network architecture leads to high inference overhead for its generation process. Existing diffusion quantization methods primarily focus on the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Yihua Shao , Deyang Lin , Fanhu Zeng , Minxi Yan , Muyang Zhang , Siyu Chen , Yuxuan Fan , Ziyang Yan , Haozhe Wang , Jingcai Guo , Yan Wang , Haotong Qin , Hao Tang

Effective Quantization for Diffusion Models on CPUs

Diffusion models have gained popularity for generating images from textual descriptions. Nonetheless, the substantial need for computational resources continues to present a noteworthy challenge, contributing to time-consuming processes.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Hanwen Chang , Haihao Shen , Yiyang Cai , Xinyu Ye , Zhenzhong Xu , Wenhua Cheng , Kaokao Lv , Weiwei Zhang , Yintong Lu , Heng Guo

DMQ: Dissecting Outliers of Diffusion Models for Post-Training Quantization

Diffusion models have achieved remarkable success in image generation but come with significant computational costs, posing challenges for deployment in resource-constrained environments. Recent post-training quantization (PTQ) methods have…

Computer Vision and Pattern Recognition · Computer Science 2025-07-18 Dongyeun Lee , Jiwan Hur , Hyounguk Shon , Jae Young Lee , Junmo Kim

Efficiency Meets Fidelity: A Novel Quantization Framework for Stable Diffusion

Text-to-image generation via Stable Diffusion models (SDM) has demonstrated remarkable capabilities. However, their computational intensity, particularly in the iterative denoising process, hinders real-time deployment in latency-sensitive…

Computer Vision and Pattern Recognition · Computer Science 2025-05-08 Shuaiting Li , Juncan Deng , Zeyu Wang , Kedong Xu , Rongtao Deng , Hong Gu , Haibin Shen , Kejie Huang

MPQ-Diff: Mixed Precision Quantization for Diffusion Models

Diffusion models (DMs) generate remarkably high-quality images via the stochastic denoising process, which unfortunately incurs high sampling time. Post-quantizing the trained diffusion models in fixed bit-widths, e.g., 4 bits on weights…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Rocco Manz Maruzzelli , Basile Lewandowski , Lydia Y. Chen