English
Related papers

Related papers: TinyFusion: Diffusion Transformers Learned Shallow

200 papers

Generative modeling has recently undergone remarkable advancements, primarily propelled by the transformative implications of Diffusion Probabilistic Models (DPMs). The impressive capability of these models, however, often entails…

Machine Learning · Computer Science 2023-10-03 Gongfan Fang , Xinyin Ma , Xinchao Wang

Diffusion models have achieved impressive advancements in various vision tasks. However, these gains often rely on increasing model size, which escalates computational complexity and memory demands, complicating deployment, raising…

Computer Vision and Pattern Recognition · Computer Science 2026-03-06 Yang Zhang , Er Jin , Wenzhong Liang , Yanfei Dong , Ashkan Khakzar , Philip Torr , Johannes Stegmaier , Kenji Kawaguchi

Real-world image super-resolution (Real-ISR) focuses on recovering high-quality images from low-resolution inputs that suffer from complex degradations like noise, blur, and compression. Recently, diffusion models (DMs) have shown great…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Linwei Dong , Qingnan Fan , Yuhang Yu , Qi Zhang , Jinwei Chen , Yawei Luo , Changqing Zou

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Enze Xie , Lewei Yao , Han Shi , Zhili Liu , Daquan Zhou , Zhaoqiang Liu , Jiawei Li , Zhenguo Li

Diffusion Transformers (DiTs) deliver state-of-the-art generative performance but their quadratic training cost with sequence length makes large-scale pretraining prohibitively expensive. Token dropping can reduce training cost, yet na\"ive…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Dogyun Park , Moayed Haji-Ali , Yanyu Li , Willi Menapace , Sergey Tulyakov , Hyunwoo J. Kim , Aliaksandr Siarohin , Anil Kag

Recent advances in diffusion generative models have yielded remarkable progress. While the quality of generated content continues to improve, these models have grown considerably in size and complexity. This increasing computational burden…

Machine Learning · Computer Science 2025-03-13 Reza Shirkavand , Peiran Yu , Shangqian Gao , Gowthami Somepalli , Tom Goldstein , Heng Huang

Diffusion models have achieved remarkable progress in the field of image generation due to their outstanding capabilities. However, these models require substantial computing resources because of the multi-step denoising process during…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haowei Zhu , Dehua Tang , Ji Liu , Mingjie Lu , Jintu Zheng , Jinzhang Peng , Dong Li , Yu Wang , Fan Jiang , Lu Tian , Spandan Tiwari , Ashish Sirasao , Jun-Hai Yong , Bin Wang , Emad Barsoum

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li , Yihua Zhang , Sijia Liu , Xue Lin

Although diffusion-based models have achieved impressive results in image super-resolution, they often rely on large-scale backbones such as Stable Diffusion XL (SDXL) and Diffusion Transformers (DiT), which lead to excessive computational…

Computer Vision and Pattern Recognition · Computer Science 2025-12-03 Zhongbao Yang , Jiangxin Dong , Yazhou Yao , Jinhui Tang , Jinshan Pan

Diffusion models have significantly advanced the field of generative modeling. However, training a diffusion model is computationally expensive, creating a pressing need to adapt off-the-shelf diffusion models for downstream generation…

Machine Learning · Computer Science 2024-06-07 Jincheng Zhong , Xingzhuo Guo , Jiaxiang Dong , Mingsheng Long

As text-to-image models grow increasingly powerful and complex, their burgeoning size presents a significant obstacle to widespread adoption, especially on resource-constrained devices. This paper presents a pioneering study on…

Computer Vision and Pattern Recognition · Computer Science 2024-11-25 Samarth N Ramesh , Zhixue Zhao

Neural network pruning is an essential technique for reducing the size and complexity of deep neural networks, enabling large-scale models on devices with limited resources. However, existing pruning approaches heavily rely on training data…

Machine Learning · Computer Science 2023-07-12 Hong Huang , Lan Zhang , Chaoyue Sun , Ruogu Fang , Xiaoyong Yuan , Dapeng Wu

Diffusion models are renowned for their generative capabilities, yet their pretraining processes exhibit distinct phases of learning speed that have been entirely overlooked in prior post-training acceleration efforts in the community. In…

Machine Learning · Computer Science 2025-10-15 Bowei Guo , Shengkun Tang , Cong Zeng , Zhiqiang Shen

Transforming large pre-trained low-resolution diffusion models to cater to higher-resolution demands, i.e., diffusion extrapolation, significantly improves diffusion adaptability. We propose tuning-free CutDiffusion, aimed at simplifying…

Computer Vision and Pattern Recognition · Computer Science 2024-04-24 Mingbao Lin , Zhihang Lin , Wengyi Zhan , Liujuan Cao , Rongrong Ji

Latent Diffusion Models (LDMs) have emerged as powerful generative models, known for delivering remarkable results under constrained computational resources. However, deploying LDMs on resource-limited devices remains a complex issue,…

Machine Learning · Computer Science 2024-04-19 Thibault Castells , Hyoung-Kyu Song , Bo-Kyeong Kim , Shinkook Choi

Semantic-rich features from Vision Foundation Models (VFMs) have been leveraged to enhance Latent Diffusion Models (LDMs). However, raw VFM features are typically high-dimensional and redundant, increasing the difficulty of learning and…

Computer Vision and Pattern Recognition · Computer Science 2026-05-15 Guanfang Dong , Luke Schultz , Negar Hassanpour , Chao Gao

Fine-tuning transformer models after unsupervised pre-training reaches a very high performance on many different natural language processing tasks. Unfortunately, transformers suffer from long inference times which greatly increases costs…

Computation and Language · Computer Science 2022-03-30 David Peer , Sebastian Stabinger , Stefan Engl , Antonio Rodriguez-Sanchez

Diffusion Models (DMs) have impressive capabilities among generation models, but are limited to slower inference speeds and higher computational costs. Previous works utilize one-shot structure pruning to derive lightweight DMs from…

Machine Learning · Computer Science 2025-01-17 Ben Wan , Tianyi Zheng , Zhaoyu Chen , Yuxiao Wang , Jia Wang

While task-specific finetuning of pretrained networks has led to significant empirical advances in NLP, the large size of networks makes finetuning difficult to deploy in multi-task, memory-constrained settings. We propose diff pruning as a…

Computation and Language · Computer Science 2021-06-10 Demi Guo , Alexander M. Rush , Yoon Kim

Pruning can be an effective method of compressing large pre-trained models for inference speed acceleration. Previous pruning approaches rely on access to the original training dataset for both pruning and subsequent fine-tuning. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-12-11 Haihang Wu , Wei Wang , Tamasha Malepathirana , Sachith Seneviratne , Denny Oetomo , Saman Halgamuge
‹ Prev 1 2 3 10 Next ›