Related papers: Efficient Spatially Sparse Inference for Condition…

InstGenIE: Generative Image Editing Made Efficient with Mask-aware Caching and Scheduling

Generative image editing using diffusion models has become a prevalent application in today's AI cloud services. In production environments, image editing typically involves a mask that specifies the regions of an image template to be…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-05-28 Xiaoxiao Jiang , Suyi Li , Lingyun Yang , Tianyu Feng , Zhipeng Di , Weiyi Lu , Guoxuan Zhu , Xiu Lin , Kan Liu , Yinghao Yu , Tao Lan , Guodong Yang , Lin Qu , Liping Zhang , Wei Wang

Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference

Due to the recent success of diffusion models, text-to-image generation is becoming increasingly popular and achieves a wide range of applications. Among them, text-to-image editing, or continuous text-to-image generation, attracts lots of…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Zihao Yu , Haoyang Li , Fangcheng Fu , Xupeng Miao , Bin Cui

Guiding Token-Sparse Diffusion Models

Diffusion models deliver high quality in image synthesis but remain expensive during training and inference. Recent works have leveraged the inherent redundancy in visual content to make training more affordable by training only on a subset…

Computer Vision and Pattern Recognition · Computer Science 2026-05-27 Felix Krause , Stefan Andreas Baumann , Johannes Schusterbauer , Olga Grebenkova , Ming Gui , Vincent Tao Hu , Björn Ommer

Improving Supervised Machine Learning Performance in Optical Quality Control via Generative AI for Dataset Expansion

Supervised machine learning algorithms play a crucial role in optical quality control within industrial production. These approaches require representative datasets for effective model training. However, while non-defective components are…

Computer Vision and Pattern Recognition · Computer Science 2026-02-02 Dennis Sprute , Hanna Senke , Holger Flatt

SparseDM: Toward Sparse Efficient Diffusion Models

Diffusion models represent a powerful family of generative models widely used for image and video generation. However, the time-consuming deployment, long inference time, and requirements on large memory hinder their applications on…

Machine Learning · Computer Science 2025-04-18 Kafeng Wang , Jianfei Chen , He Li , Zhenpeng Mi , Jun Zhu

A Diffusion-Based Generative Prior Approach to Sparse-view Computed Tomography

The reconstruction of X-rays CT images from sparse or limited-angle geometries is a highly challenging task. The lack of data typically results in artifacts in the reconstructed image and may even lead to object distortions. For this…

Computer Vision and Pattern Recognition · Computer Science 2026-02-12 Davide Evangelista , Pasquale Cascarano , Elena Loli Piccolomini

Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts

Image synthesis approaches, e.g., generative adversarial networks, have been popular as a form of data augmentation in medical image analysis tasks. It is primarily beneficial to overcome the shortage of publicly accessible data and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Shiyi Du , Xiaosong Wang , Yongyi Lu , Yuyin Zhou , Shaoting Zhang , Alan Yuille , Kang Li , Zongwei Zhou

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Sparse Generative Adversarial Network

We propose a new approach to Generative Adversarial Networks (GANs) to achieve an improved performance with additional robustness to its so-called and well recognized mode collapse. We first proceed by mapping the desired data onto a…

Computer Vision and Pattern Recognition · Computer Science 2019-08-26 Shahin Mahdizadehaghdam , Ashkan Panahi , Hamid Krim

SCoRe: Clean Image Generation from Diffusion Models Trained on Noisy Images

Diffusion models trained on noisy datasets often reproduce high-frequency training artifacts, significantly degrading generation quality. To address this, we propose SCoRe (Spectral Cutoff Regeneration), a training-free, generation-time…

Computer Vision and Pattern Recognition · Computer Science 2026-04-13 Yuta Matsuzaki , Seiichi Uchida , Shumpei Takezaki

Accelerating Score-based Generative Models with Preconditioned Diffusion Sampling

Score-based generative models (SGMs) have recently emerged as a promising class of generative models. However, a fundamental limitation is that their inference is very slow due to a need for many (e.g., 2000) iterations of sequential…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Hengyuan Ma , Li Zhang , Xiatian Zhu , Jianfeng Feng

Few-Shot Image Generation by Conditional Relaxing Diffusion Inversion

In the field of Few-Shot Image Generation (FSIG) using Deep Generative Models (DGMs), accurately estimating the distribution of target domain with minimal samples poses a significant challenge. This requires a method that can both capture…

Computer Vision and Pattern Recognition · Computer Science 2024-07-11 Yu Cao , Shaogang Gong

Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity

Diffusion Transformers (DiTs) dominate video generation but their high computational cost severely limits real-world applicability, usually requiring tens of minutes to generate a few seconds of video even on high-performance GPUs. This…

Computer Vision and Pattern Recognition · Computer Science 2025-04-29 Haocheng Xi , Shuo Yang , Yilong Zhao , Chenfeng Xu , Muyang Li , Xiuyu Li , Yujun Lin , Han Cai , Jintao Zhang , Dacheng Li , Jianfei Chen , Ion Stoica , Kurt Keutzer , Song Han

Sparse-to-Sparse Training of Diffusion Models

Diffusion models (DMs) are a powerful type of generative models that have achieved state-of-the-art results in various image synthesis tasks and have shown potential in other domains, such as natural language processing and temporal data…

Machine Learning · Computer Science 2026-02-05 Inês Cardoso Oliveira , Decebal Constantin Mocanu , Luis A. Leiva

Sparsely Supervised Diffusion

Diffusion models have shown remarkable success across a wide range of generative tasks. However, they often suffer from spatially inconsistent generation, arguably due to the inherent locality of their denoising mechanisms. This can yield…

Machine Learning · Computer Science 2026-02-04 Wenshuai Zhao , Zhiyuan Li , Yi Zhao , Mohammad Hassan Vali , Martin Trapp , Joni Pajarinen , Juho Kannala , Arno Solin

DiffAxE: Diffusion-driven Hardware Accelerator Generation and Design Space Exploration

Design space exploration (DSE) is critical for developing optimized hardware architectures, especially for AI workloads such as deep neural networks (DNNs) and large language models (LLMs), which require specialized acceleration. As model…

Hardware Architecture · Computer Science 2025-08-15 Arkapravo Ghosh , Abhishek Moitra , Abhiroop Bhattacharjee , Ruokai Yin , Priyadarshini Panda

Improved Image Generation via Sparse Modeling

The interest of the deep learning community in image synthesis has grown massively in recent years. Nowadays, deep generative methods, and especially Generative Adversarial Networks (GANs), are leading to state-of-the-art performance,…

Computer Vision and Pattern Recognition · Computer Science 2022-05-16 Roy Ganz , Michael Elad

Sparse Forcing: Native Trainable Sparse Attention for Real-time Autoregressive Diffusion Video Generation

We introduce Sparse Forcing, a training-and-inference paradigm for autoregressive video diffusion models that improves long-horizon generation quality while reducing decoding latency. Sparse Forcing is motivated by an empirical observation…

Computer Vision and Pattern Recognition · Computer Science 2026-04-24 Boxun Xu , Yuming Du , Zichang Liu , Siyu Yang , Ziyang Jiang , Siqi Yan , Rajasi Saha , Albert Pumarola , Wenchen Wang , Peng Li

Don't Forget your Inverse DDIM for Image Editing

The field of text-to-image generation has undergone significant advancements with the introduction of diffusion models. Nevertheless, the challenge of editing real images persists, as most methods are either computationally intensive or…

Computer Vision and Pattern Recognition · Computer Science 2025-07-16 Guillermo Gomez-Trenado , Pablo Mesejo , Oscar Cordón , Stéphane Lathuilière

G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models

This paper considers the problem of utilizing a large-scale text-to-image diffusion model to tackle the challenging Inexact Segmentation (IS) task. Unlike traditional approaches that rely heavily on discriminative-model-based paradigms or…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Tianjiao Zhang , Fei Zhang , Jiangchao Yao , Ya Zhang , Yanfeng Wang