Related papers: Distribution-Aware Data Expansion with Diffusion M…

Diffusion-Based Data Augmentation for Image Recognition: A Systematic Analysis and Evaluation

Diffusion-based data augmentation (DiffDA) has emerged as a promising approach to improving classification performance under data scarcity. However, existing works vary significantly in task configurations, model choices, and experimental…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Zekun Li , Yinghuan Shi , Yang Gao , Dong Xu

Towards Effective Data-Free Knowledge Distillation via Diverse Diffusion Augmentation

Data-free knowledge distillation (DFKD) has emerged as a pivotal technique in the domain of model compression, substantially reducing the dependency on the original training data. Nonetheless, conventional DFKD methods that employ…

Computer Vision and Pattern Recognition · Computer Science 2024-10-24 Muquan Li , Dongyang Zhang , Tao He , Xiurui Xie , Yuan-Fang Li , Ke Qin

Dynamical Diffusion: Learning Temporal Dynamics with Diffusion Models

Diffusion models have emerged as powerful generative frameworks by progressively adding noise to data through a forward process and then reversing this process to generate realistic samples. While these models have achieved strong…

Machine Learning · Computer Science 2025-03-04 Xingzhuo Guo , Yu Zhang , Baixu Chen , Haoran Xu , Jianmin Wang , Mingsheng Long

MedDiff-FT: Data-Efficient Diffusion Model Fine-tuning with Structural Guidance for Controllable Medical Image Synthesis

Recent advancements in deep learning for medical image segmentation are often limited by the scarcity of high-quality training data.While diffusion models provide a potential solution by generating synthetic images, their effectiveness in…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Jianhao Xie , Ziang Zhang , Zhenyu Weng , Yuesheng Zhu , Guibo Luo

SimDiff: Simpler Yet Better Diffusion Model for Time Series Point Forecasting

Diffusion models have recently shown promise in time series forecasting, particularly for probabilistic predictions. However, they often fail to achieve state-of-the-art point estimation performance compared to regression-based methods.…

Artificial Intelligence · Computer Science 2025-11-25 Hang Ding , Xue Wang , Tian Zhou , Tao Yao

Diffusion-Driven High-Dimensional Variable Selection

Variable selection for high-dimensional, highly correlated data has long been a challenging problem, often yielding unstable and unreliable models. We propose a resample-aggregate framework that exploits diffusion models' ability to…

Methodology · Statistics 2025-08-20 Minjie Wang , Xiaotong Shen , Wei Pan

A Time-Series Data Augmentation Model through Diffusion and Transformer Integration

With the development of Artificial Intelligence, numerous real-world tasks have been accomplished using technology integrated with deep learning. To achieve optimal performance, deep neural networks typically require large volumes of data…

Machine Learning · Computer Science 2025-05-09 Yuren Zhang , Zhongnan Pu , Lei Jing

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

Dataset expansion can effectively alleviate the problem of data scarcity for medical image segmentation, due to privacy concerns and labeling difficulties. However, existing expansion algorithms still face great challenges due to their…

Image and Video Processing · Electrical Eng. & Systems 2023-06-07 Shitong Shao , Xiaohan Yuan , Zhen Huang , Ziming Qiu , Shuai Wang , Kevin Zhou

Dynamic Diffusion Transformer

Diffusion Transformer (DiT), an emerging diffusion model for image generation, has demonstrated superior performance but suffers from substantial computational costs. Our investigations reveal that these costs stem from the static inference…

Computer Vision and Pattern Recognition · Computer Science 2024-10-10 Wangbo Zhao , Yizeng Han , Jiasheng Tang , Kai Wang , Yibing Song , Gao Huang , Fan Wang , Yang You

DiffuseMix: Label-Preserving Data Augmentation with Diffusion Models

Recently, a number of image-mixing-based augmentation techniques have been introduced to improve the generalization of deep neural networks. In these techniques, two or more randomly selected natural images are mixed together to generate an…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Khawar Islam , Muhammad Zaigham Zaheer , Arif Mahmood , Karthik Nandakumar

Data-free Knowledge Distillation with Diffusion Models

Recently Data-Free Knowledge Distillation (DFKD) has garnered attention and can transfer knowledge from a teacher neural network to a student neural network without requiring any access to training data. Although diffusion models are adept…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Xiaohua Qi , Renda Li , Long Peng , Qiang Ling , Jun Yu , Ziyi Chen , Peng Chang , Mei Han , Jing Xiao

Advances in Diffusion Models for Image Data Augmentation: A Review of Methods, Models, Evaluation Metrics and Future Research Directions

Image data augmentation constitutes a critical methodology in modern computer vision tasks, since it can facilitate towards enhancing the diversity and quality of training datasets; thereby, improving the performance and robustness of…

Computer Vision and Pattern Recognition · Computer Science 2025-01-13 Panagiotis Alimisis , Ioannis Mademlis , Panagiotis Radoglou-Grammatikis , Panagiotis Sarigiannidis , Georgios Th. Papadopoulos

DeepCache: Accelerating Diffusion Models for Free

Diffusion models have recently gained unprecedented attention in the field of image synthesis due to their remarkable generative capabilities. Notwithstanding their prowess, these models often incur substantial computational costs,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Xinyin Ma , Gongfan Fang , Xinchao Wang

PathDiff: Histopathology Image Synthesis with Unpaired Text and Mask Conditions

Diffusion-based generative models have shown promise in synthesizing histopathology images to address data scarcity caused by privacy constraints. Diagnostic text reports provide high-level semantic descriptions, and masks offer…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Mahesh Bhosale , Abdul Wasi , Yuanhao Zhai , Yunjie Tian , Samuel Border , Nan Xi , Pinaki Sarder , Junsong Yuan , David Doermann , Xuan Gong

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma , Yuhuan Yang , Chen Ju , Fei Zhang , Jinxiang Liu , Yu Wang , Ya Zhang , Yanfeng Wang

TabDiff: a Mixed-type Diffusion Model for Tabular Data Generation

Synthesizing high-quality tabular data is an important topic in many data science tasks, ranging from dataset augmentation to privacy protection. However, developing expressive generative models for tabular data is challenging due to its…

Machine Learning · Computer Science 2025-02-18 Juntong Shi , Minkai Xu , Harper Hua , Hengrui Zhang , Stefano Ermon , Jure Leskovec

AutoDiff: combining Auto-encoder and Diffusion model for tabular data synthesizing

Diffusion model has become a main paradigm for synthetic data generation in many subfields of modern machine learning, including computer vision, language model, or speech synthesis. In this paper, we leverage the power of diffusion model…

Machine Learning · Statistics 2023-11-20 Namjoon Suh , Xiaofeng Lin , Din-Yin Hsieh , Merhdad Honarkhah , Guang Cheng

Data Diffusion: Dynamic Resource Provision and Data-Aware Scheduling for Data Intensive Applications

Data intensive applications often involve the analysis of large datasets that require large amounts of compute and storage resources. While dedicated compute and/or storage farms offer good task/data throughput, they suffer low resource…

Distributed, Parallel, and Cluster Computing · Computer Science 2008-08-27 Ioan Raicu , Yong Zhao , Ian Foster , Alex Szalay

Diversity-Driven Synthesis: Enhancing Dataset Distillation through Directed Weight Adjustment

The sharp increase in data-related expenses has motivated research into condensing datasets while retaining the most informative features. Dataset distillation has thus recently come to the fore. This paradigm generates synthetic datasets…

Machine Learning · Computer Science 2024-11-20 Jiawei Du , Xin Zhang , Juncheng Hu , Wenxin Huang , Joey Tianyi Zhou

Towards Consistent and Efficient Dataset Distillation via Diffusion-Driven Selection

Dataset distillation provides an effective approach to reduce memory and computational costs by optimizing a compact dataset that achieves performance comparable to the full original. However, for large-scale datasets and complex deep…

Computer Vision and Pattern Recognition · Computer Science 2025-11-14 Xinhao Zhong , Shuoyang Sun , Xulin Gu , Zhaoyang Xu , Yaowei Wang , Min Zhang , Bin Chen