Related papers: SparseDM: Toward Sparse Efficient Diffusion Models

DiffSparse: Accelerating Diffusion Transformers with Learned Token Sparsity

Diffusion models demonstrate outstanding performance in image generation, but their multi-step inference mechanism requires immense computational cost. Previous works accelerate inference by leveraging layer or token cache techniques to…

Computer Vision and Pattern Recognition · Computer Science 2026-04-07 Haowei Zhu, Ji Liu, Ziqiong Liu, Dong Li, Junhai Yong, Bin Wang, Emad Barsoum

Sparse-to-Sparse Training of Diffusion Models

Diffusion models (DMs) are a powerful type of generative model that has achieved state-of-the-art results in various image synthesis tasks and has shown potential in other domains, such as natural language processing and temporal data…

Machine Learning · Computer Science 2026-02-05 Inês Cardoso Oliveira, Decebal Constantin Mocanu, Luis A. Leiva

Learning Sparse Masks for Diffusion-based Image Inpainting

Diffusion-based inpainting is a powerful tool for the reconstruction of images from sparse data. Its quality strongly depends on the choice of known data. Optimising their spatial location -- the inpainting mask -- is challenging. A…

Image and Video Processing · Electrical Eng. & Systems 2022-05-17 Tobias Alt, Pascal Peter, Joachim Weickert

Sparse-LaViDa: Sparse Multimodal Discrete Diffusion Language Models

Masked Discrete Diffusion Models (MDMs) have achieved strong performance across a wide range of multimodal tasks, including image understanding, generation, and editing. However, their inference speed remains suboptimal due to the need to…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Shufan Li, Jiuxiang Gu, Kangning Liu, Zhe Lin, Zijun Wei, Aditya Grover, Jason Kuen

Sparsely Supervised Diffusion

Diffusion models have shown remarkable success across a wide range of generative tasks. However, they often suffer from spatially inconsistent generation, arguably due to the inherent locality of their denoising mechanisms. This can yield…

Machine Learning · Computer Science 2026-02-04 Wenshuai Zhao, Zhiyuan Li, Yi Zhao, Mohammad Hassan Vali, Martin Trapp, Joni Pajarinen, Juho Kannala, Arno Solin

Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation

We investigate methods to reduce inference time and memory footprint in stable diffusion models by introducing lightweight decoders for both image and video synthesis. Traditional latent diffusion pipelines rely on large Variational…

Computer Vision and Pattern Recognition · Computer Science 2025-03-10 Alexey Buzovkin, Evgeny Shilov

Faster Diffusion: Rethinking the Role of the Encoder for Diffusion Model Inference

One of the main drawbacks of diffusion models is the slow inference time for image generation. Among the most successful approaches to addressing this problem are distillation methods. However, these methods require considerable…

Computer Vision and Pattern Recognition · Computer Science 2024-10-16 Senmao Li, Taihang Hu, Joost van de Weijer, Fahad Shahbaz Khan, Tao Liu, Linxuan Li, Shiqi Yang, Yaxing Wang, Ming-Ming Cheng, Jian Yang

Fast Training of Diffusion Models with Masked Transformers

We propose an efficient approach to train large diffusion models with masked transformers. While masked transformers have been extensively explored for representation learning, their application to generative learning is less explored in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-06 Hongkai Zheng, Weili Nie, Arash Vahdat, Anima Anandkumar

Diffusion Buffer: Online Diffusion-based Speech Enhancement with Sub-Second Latency

Diffusion models are a class of generative models that have been recently used for speech enhancement with remarkable success but are computationally expensive at inference time. Therefore, these models are impractical for processing…

Audio and Speech Processing · Electrical Eng. & Systems 2025-09-15 Bunlong Lay, Rostislav Makarov, Timo Gerkmann

An Efficient Sparse Inference Software Accelerator for Transformer-based Language Models on CPUs

In recent years, Transformer-based language models have become the standard approach for natural language processing tasks. However, stringent throughput and latency requirements in industrial applications are limiting their adoption. To…

Machine Learning · Computer Science 2023-06-30 Haihao Shen, Hengyu Meng, Bo Dong, Zhe Wang, Ofir Zafrir, Yi Ding, Yu Luo, Hanwen Chang, Qun Gao, Ziheng Wang, Guy Boudoukh, Moshe Wasserblat

Guiding Token-Sparse Diffusion Models

Diffusion models deliver high quality in image synthesis but remain expensive during training and inference. Recent works have leveraged the inherent redundancy in visual content to make training more affordable by training only on a subset…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Felix Krause, Stefan Andreas Baumann, Johannes Schusterbauer, Olga Grebenkova, Ming Gui, Vincent Tao Hu, Björn Ommer

Accelerating Image Generation with Sub-path Linear Approximation Model

Diffusion models have significantly advanced the state of the art in image, audio, and video generation tasks. However, their applications in practical scenarios are hindered by slow inference speed. Drawing inspiration from the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Chen Xu, Tianhui Song, Weixin Feng, Xubin Li, Tiezheng Ge, Bo Zheng, Limin Wang

Optimizing Inference in Transformer-Based Models: A Multi-Method Benchmark

Efficient inference is a critical challenge in deep generative modeling, particularly as diffusion models grow in capacity and complexity. While increased complexity often improves accuracy, it raises compute costs, latency, and memory…

Machine Learning · Computer Science 2025-09-24 Siu Hang Ho, Prasad Ganesan, Nguyen Duong, Daniel Schlabig

SQ-DM: Accelerating Diffusion Models with Aggressive Quantization and Temporal Sparsity

Diffusion models have gained significant popularity in image generation tasks. However, generating high-quality content remains notably slow because it requires running model inference over many time steps. To accelerate these models, we…

Computer Vision and Pattern Recognition · Computer Science 2025-01-28 Zichen Fan, Steve Dai, Rangharajan Venkatesan, Dennis Sylvester, Brucek Khailany

Pruning for Sparse Diffusion Models based on Gradient Flow

Diffusion Models (DMs) have impressive capabilities among generative models, but are limited by slow inference speeds and high computational costs. Previous works utilize one-shot structure pruning to derive lightweight DMs from…

Machine Learning · Computer Science 2025-01-17 Ben Wan, Tianyi Zheng, Zhaoyu Chen, Yuxiao Wang, Jia Wang

A Simple Early Exiting Framework for Accelerated Sampling in Diffusion Models

Diffusion models have shown remarkable performance in generation problems over various domains including images, videos, text, and audio. A practical bottleneck of diffusion models is their sampling speed, due to the repeated evaluation of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-13 Taehong Moon, Moonseok Choi, EungGu Yun, Jongmin Yoon, Gayoung Lee, Jaewoong Cho, Juho Lee

ResPanDiff: Diffusion Model for Pansharpening by Inferring Residual Inference

The implementation of diffusion-based pansharpening is predominantly constrained by its slow inference speed, which results from numerous sampling steps. Despite the existing techniques aiming to accelerate sampling, they often…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Shiqi Cao, Liangjian Deng, Shangqi Deng

DiffFit: Unlocking Transferability of Large Diffusion Models via Simple Parameter-Efficient Fine-Tuning

Diffusion models have proven to be highly effective in generating high-quality images. However, adapting large pre-trained diffusion models to new domains remains an open challenge, which is critical for real-world applications. This paper…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Enze Xie, Lewei Yao, Han Shi, Zhili Liu, Daquan Zhou, Zhaoqiang Liu, Jiawei Li, Zhenguo Li

Mixture of Distributions Matters: Dynamic Sparse Attention for Efficient Video Diffusion Transformers

While Diffusion Transformers (DiTs) have achieved notable progress in video generation, this long-sequence generation task remains constrained by the quadratic complexity inherent to self-attention mechanisms, creating significant barriers…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Yuxi Liu, Yipeng Hu, Zekun Zhang, Kunze Jiang, Kun Yuan

Fixed Point Diffusion Models

We introduce the Fixed Point Diffusion Model (FPDM), a novel approach to image generation that integrates the concept of fixed point solving into the framework of diffusion-based generative modeling. Our approach embeds an implicit fixed…

Computer Vision and Pattern Recognition · Computer Science 2024-01-18 Xingjian Bai, Luke Melas-Kyriazi