Related papers: TinyFusion: Diffusion Transformers Learned Shallow

Structural Pruning for Diffusion Models

Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails…

Machine Learning · Computer Science 2023-10-03 Gongfan Fang, Xinyin Ma, Xinchao Wang

Learnable Sparsity for Vision Generative Models

Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Yang Zhang, Er Jin, Wenzhong Liang, Yanfei Dong, Ashkan Khakzar, Philip Torr, Johannes Stegmaier, Kenji Kawaguchi

TinySR: Pruning Diffusion for Real-World Image Super-Resolution

Real-world image super-resolution (Real-ISR) focuses on recovering high-quality images from low-resolution inputs that suffer from complex degradations like noise, blur, and compression. Recently, diffusion models (DMs) have shown great…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Linwei Dong, Qingnan Fan, Yuhang Yu, Qi Zhang, Jinwei Chen, Yawei Luo, Changqing Zou

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li

Sprint: Sparse-Dense Residual Fusion for Efficient Diffusion Transformers

Diffusion Transformers (DiTs) deliver state-of-the-art generative performance but their quadratic training cost with sequence length makes large-scale pretraining prohibitively expensive. Token dropping can reduce training cost, yet naïve…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Dogyun Park, Moayed Haji-Ali, Yanyu Li, Willi Menapace, Sergey Tulyakov, Hyunwoo J. Kim, Aliaksandr Siarohin, Anil Kag

Efficient Fine-Tuning and Concept Suppression for Pruned Diffusion Models

Recent advances in diffusion generative models have yielded remarkable progress. While the quality of generated content continues to improve, these models have grown considerably in size and complexity. This increasing computational burden…

Machine Learning · Computer Science 2025-03-13 Reza Shirkavand, Peiran Yu, Shangqian Gao, Gowthami Somepalli, Tom Goldstein, Heng Huang

DiP-GO: A Diffusion Pruner via Few-step Gradient Optimization

Diffusion models have achieved remarkable progress in the field of image generation due to their outstanding capabilities. However, these models require substantial computing resources because of the multi-step denoising process during…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haowei Zhu, Dehua Tang, Ji Liu, Mingjie Lu, Jintu Zheng, Jinzhang Peng, Dong Li, Yu Wang, Fan Jiang, Lu Tian, Spandan Tiwari, Ashish Sirasao, Jun-Hai Yong, Bin Wang, Emad Barsoum

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li, Yihua Zhang, Sijia Liu, Xue Lin

PGP-DiffSR: Phase-Guided Progressive Pruning for Efficient Diffusion-based Image Super-Resolution

Although diffusion-based models have achieved impressive results in image super-resolution, they often rely on large-scale backbones such as Stable Diffusion XL (SDXL) and Diffusion Transformers (DiT), which lead to excessive computational…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Zhongbao Yang, Jiangxin Dong, Yazhou Yao, Jinhui Tang, Jinshan Pan

Diffusion Tuning: Transferring Diffusion Models via Chain of Forgetting

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation…

Machine Learning · Computer Science 2024-06-07 Jincheng Zhong, Xingzhuo Guo, Jiaxiang Dong, Mingsheng Long

Efficient Pruning of Text-to-Image Models: Insights from Pruning Stable Diffusion

As text-to-image models grow increasingly powerful and complex, their burgeoning size presents a significant obstacle to widespread adoption, especially on resource-constrained devices. This paper presents a pioneering study on…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Samarth N Ramesh, Zhixue Zhao

Distributed Pruning Towards Tiny Neural Networks in Federated Learning

Neural network pruning is an essential technique for reducing the size and complexity of deep neural networks, enabling large-scale models on devices with limited resources. However, existing pruning approaches heavily rely on training data…

Machine Learning · Computer Science 2023-07-12 Hong Huang, Lan Zhang, Chaoyue Sun, Ruogu Fang, Xiaoyong Yuan, Dapeng Wu

MosaicDiff: Training-free Structural Pruning for Diffusion Model Acceleration Reflecting Pretraining Dynamics

Diffusion models are renowned for their generative capabilities, yet their pretraining processes exhibit distinct phases of learning speed that have been entirely overlooked in prior post-training acceleration efforts in the community. In…

Machine Learning · Computer Science 2025-10-15 Bowei Guo, Shengkun Tang, Cong Zeng, Zhiqiang Shen

CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method

Transforming large pre-trained low-resolution diffusion models to cater to higher-resolution demands, i.e., diffusion extrapolation, significantly improves diffusion adaptability. We propose tuning-free CutDiffusion, aimed at simplifying…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Mingbao Lin, Zhihang Lin, Wengyi Zhan, Liujuan Cao, Rongrong Ji

LD-Pruner: Efficient Pruning of Latent Diffusion Models using Task-Agnostic Insights

Latent Diffusion Models (LDMs) have emerged as powerful generative models, known for delivering remarkable results under constrained computational resources. However, deploying LDMs on resource-limited devices remains a complex issue,…

Machine Learning · Computer Science 2024-04-19 Thibault Castells, Hyoung-Kyu Song, Bo-Kyeong Kim, Shinkook Choi

RePack then Refine: Efficient Diffusion Transformer with Vision Foundation Model

Semantic-rich features from Vision Foundation Models (VFMs) have been leveraged to enhance Latent Diffusion Models (LDMs). However, raw VFM features are typically high-dimensional and redundant, increasing the difficulty of learning and…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Guanfang Dong, Luke Schultz, Negar Hassanpour, Chao Gao

Greedy-layer Pruning: Speeding up Transformer Models for Natural Language Processing

Fine-tuning transformer models after unsupervised pre-training reaches a very high performance on many different natural language processing tasks. Unfortunately, transformers suffer from long inference times which greatly increases costs…

Computation and Language · Computer Science 2022-03-30 David Peer, Sebastian Stabinger, Stefan Engl, Antonio Rodriguez-Sanchez

Pruning for Sparse Diffusion Models based on Gradient Flow

Diffusion Models (DMs) have impressive capabilities among generation models, but are limited to slower inference speeds and higher computational costs. Previous works utilize one-shot structure pruning to derive lightweight DMs from…

Machine Learning · Computer Science 2025-01-17 Ben Wan, Tianyi Zheng, Zhaoyu Chen, Yuxiao Wang, Jia Wang

Parameter-Efficient Transfer Learning with Diff Pruning

While task-specific finetuning of pretrained networks has led to significant empirical advances in NLP, the large size of networks makes finetuning difficult to deploy in multi-task, memory-constrained settings. We propose diff pruning as a…

Computation and Language · Computer Science 2021-06-10 Demi Guo, Alexander M. Rush, Yoon Kim

TT-MPD: Test Time Model Pruning and Distillation

Pruning can be an effective method of compressing large pre-trained models for inference speed acceleration. Previous pruning approaches rely on access to the original training dataset for both pruning and subsequent fine-tuning. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-11 Haihang Wu, Wei Wang, Tamasha Malepathirana, Sachith Seneviratne, Denny Oetomo, Saman Halgamuge