Related papers: TQ-DiT: Efficient Time-Aware Quantization for Diff…

200 papers

Transformer-based diffusion models, dubbed Diffusion Transformers (DiTs), have achieved state-of-the-art performance in image and video generation tasks. However, their large model size and slow inference speed limit their practical…

Image and Video Processing · Electrical Eng. & Systems 2026-01-26 Xinyan Liu , Huihong Shi , Yang Xu , Zhongfeng Wang

Recent advancements in diffusion models, particularly the architectural transformation from UNet-based models to Diffusion Transformers (DiTs), significantly improve the quality and scalability of image and video generation. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Lei Chen , Yuan Meng , Chen Tang , Xinzhu Ma , Jingyan Jiang , Xin Wang , Zhi Wang , Wenwu Zhu

Diffusion transformers have demonstrated remarkable performance in visual generation tasks, such as generating realistic images or videos based on textual instructions. However, larger model sizes and multi-frame processing for video…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Tianchen Zhao , Tongcheng Fang , Haofeng Huang , Enshu Liu , Rui Wan , Widyadewi Soedarmadji , Shiyao Li , Zinan Lin , Guohao Dai , Shengen Yan , Huazhong Yang , Xuefei Ning , Yu Wang

Diffusion models have been widely adopted in image and video generation. However, their complex network architecture leads to high inference overhead for their generation process. Existing diffusion quantization methods primarily focus on the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Yihua Shao , Deyang Lin , Fanhu Zeng , Minxi Yan , Muyang Zhang , Siyu Chen , Yuxuan Fan , Ziyang Yan , Haozhe Wang , Jingcai Guo , Yan Wang , Haotong Qin , Hao Tang

Diffusion transformers (DiT) have demonstrated exceptional performance in video generation. However, their large number of parameters and high computational complexity limit their deployment on edge devices. Quantization can reduce storage…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Weilun Feng , Chuanguang Yang , Haotong Qin , Xiangqi Li , Yu Wang , Zhulin An , Libo Huang , Boyu Diao , Zixiang Zhao , Yongjun Xu , Michele Magno

Diffusion Transformer (DiT) has now become the preferred choice for building image generation models due to its great generation capability. Unlike previous convolution-based UNet models, DiT is purely composed of a stack of transformer…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Ning Ding , Jing Han , Yuchuan Tian , Chao Xu , Kai Han , Yehui Tang

Diffusion models have emerged as preeminent contenders in the realm of generative models. Distinguished by their distinctive sequential generative processes, characterized by hundreds or even thousands of timesteps, diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Haojun Sun , Chen Tang , Zhi Wang , Yuan Meng , Jingyan Jiang , Xinzhu Ma , Wenwu Zhu

The recent introduction of Diffusion Transformers (DiTs) has demonstrated exceptional capabilities in image generation by using a different backbone architecture, departing from traditional U-Nets and embracing the scalable nature of…

Computer Vision and Pattern Recognition · Computer Science 2024-10-18 Junyi Wu , Haoxuan Wang , Yuzhang Shang , Mubarak Shah , Yan Yan

The diffusion model has gained popularity in vision applications due to its remarkable generative performance and versatility. However, high storage and computation demands, resulting from the model size and iterative generation, hinder its…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Junhyuk So , Jungwon Lee , Daehyun Ahn , Hyungjun Kim , Eunhyeok Park

Diffusion Transformers (DiTs) have emerged as a highly scalable and effective backbone for image generation, outperforming U-Net architectures in both scalability and performance. However, their real-world deployment remains challenging due…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Kaicheng Yang , Kaisen Yang , Baiting Wu , Xun Zhang , Qianrui Yang , Haotong Qin , He Zhang , Yulun Zhang

Diffusion Transformers (DiTs) have recently gained substantial attention in both industrial and academic fields for their superior visual generation capabilities, outperforming traditional diffusion models that use U-Net. However, the…

Computer Vision and Pattern Recognition · Computer Science 2024-06-03 Wenxuan Liu , Sai Qian Zhang

Diffusion Transformers (DiTs) have achieved impressive performance in text-to-image and text-to-video generation. However, their high computational cost and large parameter sizes pose significant challenges for usage in resource-constrained…

Computer Vision and Pattern Recognition · Computer Science 2025-09-24 Lianwei Yang , Haokun Lin , Tianchen Zhao , Yichen Wu , Hongyu Zhu , Ruiqi Xie , Zhenan Sun , Yu Wang , Qingyi Gu

The Diffusion model, a prevalent framework for image generation, encounters significant challenges in terms of broad applicability due to its extended inference times and substantial memory requirements. Efficient Post-training Quantization…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Yushi Huang , Ruihao Gong , Jing Liu , Tianlong Chen , Xianglong Liu

Diffusion Transformers (DiTs) achieve state-of-the-art image generation quality but incur substantial memory and computational costs at inference. While aggressive Post-Training Quantization (PTQ) to 4-bit precision offers significant…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Sayeh Sharify , Mahsa Salmani , Hesham Mostafa

Model quantization is a promising method for accelerating and compressing diffusion models. Nevertheless, since post-training quantization (PTQ) fails catastrophically at low-bit cases, quantization-aware training (QAT) is essential.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-10 Xuewen Liu , Zhikai Li , Minhao Jiang , Mengjuan Chen , Jianquan Li , Qingyi Gu

Diffusion models (DMs) have demonstrated remarkable achievements in synthesizing images of high fidelity and diversity. However, the extensive computational requirements and slow generative speed of diffusion models have limited their widespread…

Computer Vision and Pattern Recognition · Computer Science 2025-01-03 Jiaojiao Ye , Zhen Wang , Linnan Jiang

Diffusion Transformers (DiTs) have recently attracted significant interest from both industry and academia due to their enhanced capabilities in visual generation, surpassing the performance of traditional diffusion models that employ…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Zhenyuan Dong , Sai Qian Zhang

Visual generation quality has been greatly promoted with the rapid advances in diffusion transformers (DiTs), which is attributed to the scaling of model size and complexity. However, these attributions also hinder the practical deployment…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Kai Liu , Shaoqiu Zhang , Linghe Kong , Yulun Zhang

Diffusion Models (DMs) utilize an iterative denoising process to transform random noise into synthetic data. Initially proposed with a UNet structure, DMs excel at producing images that are virtually indistinguishable with or without…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Yuewei Yang , Jialiang Wang , Xiaoliang Dai , Peizhao Zhang , Hongbo Zhang

The Diffusion models, widely used for image generation, face significant challenges related to their broad applicability due to prolonged inference times and high memory demands. Efficient Post-Training Quantization (PTQ) is crucial to…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Yushi Huang , Ruihao Gong , Xianglong Liu , Jing Liu , Yuhang Li , Jiwen Lu , Dacheng Tao