Related papers: Timestep-Aware Correction for Quantized Diffusion …

Error Propagation Mechanisms and Compensation Strategies for Quantized Diffusion

Diffusion models have transformed image synthesis by establishing unprecedented quality and creativity benchmarks. Nevertheless, their large-scale deployment faces challenges due to computationally intensive iterative denoising processes.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-02 Songwei Liu , Chao Zeng , Chenqian Yan , Xurui Peng , Xing Wang , Fangmin Chen , Xing Mei

Q-Diffusion: Quantizing Diffusion Models

Diffusion models have achieved great success in image synthesis through iterative noise estimation using deep neural networks. However, the slow inference, high memory consumption, and computation intensity of the noise estimation model…

Computer Vision and Pattern Recognition · Computer Science 2023-06-09 Xiuyu Li , Yijiang Liu , Long Lian , Huanrui Yang , Zhen Dong , Daniel Kang , Shanghang Zhang , Kurt Keutzer

Temporal Dynamic Quantization for Diffusion Models

The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Junhyuk So , Jungwon Lee , Daehyun Ahn , Hyungjun Kim , Eunhyeok Park

TCAQ-DM: Timestep-Channel Adaptive Quantization for Diffusion Models

Diffusion models have achieved remarkable success in the image and video generation tasks. Nevertheless, they often require a large amount of memory and time overhead during inference, due to the complex network architecture and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Haocheng Huang , Jiaxin Chen , Jinyang Guo , Ruiyi Zhan , Yunhong Wang

TMPQ-DM: Joint Timestep Reduction and Quantization Precision Selection for Efficient Diffusion Models

Diffusion models have emerged as preeminent contenders in the realm of generative models. Distinguished by their distinctive sequential generative processes, characterized by hundreds or even thousands of timesteps, diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Haojun Sun , Chen Tang , Zhi Wang , Yuan Meng , Jingyan jiang , Xinzhu Ma , Wenwu Zhu

Gradient-Aligned Calibration for Post-Training Quantization of Diffusion Models

Diffusion models have shown remarkable performance in image synthesis by progressively estimating a smooth transition from a Gaussian distribution of noise to a real image. Unfortunately, their practical deployment is limited by slow…

Machine Learning · Computer Science 2026-03-03 Dung Anh Hoang , Cuong Pham anh Trung Le , Jianfei Cai , Thanh-Toan Do

Sampling-Aware Quantization for Diffusion Models

Diffusion models have recently emerged as the dominant approach in visual generation tasks. However, the lengthy denoising chains and the computationally intensive noise estimation networks hinder their applicability in low-latency and…

Computer Vision and Pattern Recognition · Computer Science 2026-04-23 Qian Zeng , Jie Song , Yuanyu Wan , Huiqiong Wang , Mingli Song

PTQD: Accurate Post-Training Quantization for Diffusion Models

Diffusion models have recently dominated image synthesis tasks. However, the iterative denoising process is expensive in computations at inference time, making diffusion models less practical for low-latency and scalable real-world…

Computer Vision and Pattern Recognition · Computer Science 2023-11-02 Yefei He , Luping Liu , Jing Liu , Weijia Wu , Hong Zhou , Bohan Zhuang

TR-DQ: Time-Rotation Diffusion Quantization

Diffusion models have been widely adopted in image and video generation. However, their complex network architecture leads to high inference overhead for its generation process. Existing diffusion quantization methods primarily focus on the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Yihua Shao , Deyang Lin , Fanhu Zeng , Minxi Yan , Muyang Zhang , Siyu Chen , Yuxuan Fan , Ziyang Yan , Haozhe Wang , Jingcai Guo , Yan Wang , Haotong Qin , Hao Tang

MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization

Diffusion models have achieved significant visual generation quality. However, their significant computational and memory costs pose challenge for their application on resource-constrained mobile devices or even desktop GPUs. Recent…

Computer Vision and Pattern Recognition · Computer Science 2024-05-31 Tianchen Zhao , Xuefei Ning , Tongcheng Fang , Enshu Liu , Guyue Huang , Zinan Lin , Shengen Yan , Guohao Dai , Yu Wang

QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning

The practical deployment of diffusion models is still hindered by the high memory and computational overhead. Although quantization paves a way for model compression and acceleration, existing methods face challenges in achieving low-bit…

Computer Vision and Pattern Recognition · Computer Science 2025-07-16 Haoxuan Wang , Yuzhang Shang , Zhihang Yuan , Junyi Wu , Junchi Yan , Yan Yan

Temporal Feature Matters: A Framework for Diffusion Model Quantization

The Diffusion models, widely used for image generation, face significant challenges related to their broad applicability due to prolonged inference times and high memory demands. Efficient Post-Training Quantization (PTQ) is crucial to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Yushi Huang , Ruihao Gong , Xianglong Liu , Jing Liu , Yuhang Li , Jiwen Lu , Dacheng Tao

AccuQuant: Simulating Multiple Denoising Steps for Quantizing Diffusion Models

We present in this paper a novel post-training quantization (PTQ) method, dubbed AccuQuant, for diffusion models. We show analytically and empirically that quantization errors for diffusion models are accumulated over denoising steps in a…

Computer Vision and Pattern Recognition · Computer Science 2025-10-24 Seunghoon Lee , Jeongwoo Choi , Byunggwan Son , Jaehyeon Moon , Jeimin Jeon , Bumsub Ham

TMPDiff: Temporal Mixed-Precision for Diffusion Models

Diffusion models are the go-to method for Text-to-Image generation, but their iterative denoising processes has high inference latency. Quantization reduces compute time by using lower bitwidths, but applies a fixed precision across all…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Basile Lewandowski , Simon Kurz , Aditya Shankar , Robert Birke , Jian-Jia Chen , Lydia Y. Chen

PQD: Post-training Quantization for Efficient Diffusion Models

Diffusionmodels(DMs)havedemonstratedremarkableachievements in synthesizing images of high fidelity and diversity. However, the extensive computational requirements and slow generative speed of diffusion models have limited their widespread…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Jiaojiao Ye , Zhen Wang , Linnan Jiang

Q-Drift: Quantization-Aware Drift Correction for Diffusion Model Sampling

Post-training quantization (PTQ) is a practical path to deploy large diffusion models, but quantization noise can accumulate over the denoising trajectory and degrade generation quality. We propose Q-Drift, a principled sampler-side…

Computer Vision and Pattern Recognition · Computer Science 2026-03-20 Sooyoung Ryu , Mathieu Salzmann , Saqib Javed

Diffusion Model Quantization: A Review

Recent success of large text-to-image models has empirically underscored the exceptional performance of diffusion models in generative tasks. To facilitate their efficient deployment on resource-constrained edge devices, model quantization…

Computer Vision and Pattern Recognition · Computer Science 2025-05-09 Qian Zeng , Chenggong Hu , Mingli Song , Jie Song

EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Diffusion Models

Diffusion models have demonstrated remarkable capabilities in image synthesis and related generative tasks. Nevertheless, their practicality for real-world applications is constrained by substantial computational costs and latency issues.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yefei He , Jing Liu , Weijia Wu , Hong Zhou , Bohan Zhuang

Effective Quantization for Diffusion Models on CPUs

Diffusion models have gained popularity for generating images from textual descriptions. Nonetheless, the substantial need for computational resources continues to present a noteworthy challenge, contributing to time-consuming processes.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-30 Hanwen Chang , Haihao Shen , Yiyang Cai , Xinyu Ye , Zhenzhong Xu , Wenhua Cheng , Kaokao Lv , Weiwei Zhang , Yintong Lu , Heng Guo

Softmax Bias Correction for Quantized Generative Models

Post-training quantization (PTQ) is the go-to compression technique for large generative models, such as stable diffusion or large language models. PTQ methods commonly keep the softmax activation in higher precision as it has been shown to…

Machine Learning · Computer Science 2023-09-06 Nilesh Prasad Pandey , Marios Fournarakis , Chirag Patel , Markus Nagel