Related papers: Latent Conditional Diffusion-based Data Augmentati…

Contrastive Diffusion Alignment: Learning Structured Latents for Controllable Generation

Diffusion models excel at generation, but their latent spaces are high dimensional and not explicitly organized for interpretation or control. We introduce ConDA (Contrastive Diffusion Alignment), a plug-and-play geometry layer that applies…

Machine Learning · Computer Science 2026-02-20 Ruchi Sandilya , Sumaira Perez , Charles Lynch , Lindsay Victoria , Benjamin Zebley , Derrick Matthew Buchanan , Mahendra T. Bhati , Nolan Williams , Timothy J. Spellman , Faith M. Gunning , Conor Liston , Logan Grosenick

Diffusion on Graph: Augmentation of Graph Structure for Node Classification

Graph diffusion models have recently been proposed to synthesize entire graphs, such as molecule graphs. Although existing methods have shown great performance in generating entire graphs for graph-level learning tasks, no graph diffusion…

Machine Learning · Computer Science 2025-03-18 Yancheng Wang , Changyu Liu , Yingzhen Yang

Generative Data Augmentation for Object Point Cloud Segmentation

Data augmentation is widely used to train deep learning models to address data scarcity. However, traditional data augmentation (TDA) typically relies on simple geometric transformation, such as random rotation and rescaling, resulting in…

Computer Vision and Pattern Recognition · Computer Science 2025-08-27 Dekai Zhu , Stefan Gavranovic , Flavien Boussuge , Benjamin Busam , Slobodan Ilic

Conditional Data Synthesis Augmentation

Reliable machine learning and statistical analysis rely on diverse, well-distributed training data. However, real-world datasets are often limited in size and exhibit underrepresentation across key subpopulations, leading to biased…

Methodology · Statistics 2025-07-15 Xinyu Tian , Xiaotong Shen

Goal-Conditioned Data Augmentation for Offline Reinforcement Learning

Offline reinforcement learning (RL) enables policy learning from pre-collected offline datasets, relaxing the need to interact directly with the environment. However, limited by the quality of offline datasets, it generally fails to learn…

Machine Learning · Computer Science 2025-09-03 Xingshuai Huang , Di Wu , Benoit Boulet

Context-Guided Diffusion for Out-of-Distribution Molecular and Protein Design

Generative models have the potential to accelerate key steps in the discovery of novel molecular therapeutics and materials. Diffusion models have recently emerged as a powerful approach, excelling at unconditional sample generation and,…

Biomolecules · Quantitative Biology 2024-07-17 Leo Klarner , Tim G. J. Rudner , Garrett M. Morris , Charlotte M. Deane , Yee Whye Teh

Do We Need All the Synthetic Data? Targeted Image Augmentation via Diffusion Models

Synthetically augmenting training datasets with diffusion models has become an effective strategy for improving the generalization of image classifiers. However, existing approaches typically increase dataset size by 10-30x and struggle to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Dang Nguyen , Jiping Li , Jinghao Zheng , Baharan Mirzasoleiman

GDA: Generalized Diffusion for Robust Test-time Adaptation

Machine learning models struggle with generalization when encountering out-of-distribution (OOD) samples with unexpected distribution shifts. For vision tasks, recent studies have shown that test-time adaptation employing diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-03 Yun-Yun Tsai , Fu-Chen Chen , Albert Y. C. Chen , Junfeng Yang , Che-Chun Su , Min Sun , Cheng-Hao Kuo

GAUDA: Generative Adaptive Uncertainty-guided Diffusion-based Augmentation for Surgical Segmentation

Augmentation by generative modelling yields a promising alternative to the accumulation of surgical data, where ethical, organisational and regulatory aspects must be considered. Yet, the joint synthesis of (image, mask) pairs for…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Yannik Frisch , Christina Bornberg , Moritz Fuchs , Anirban Mukhopadhyay

Learning Structure-Semantic Evolution Trajectories for Graph Domain Adaptation

Graph Domain Adaptation (GDA) aims to bridge distribution shifts between domains by transferring knowledge from well-labeled source graphs to given unlabeled target graphs. One promising recent approach addresses graph transfer by…

Machine Learning · Computer Science 2026-02-12 Wei Chen , Xingyu Guo , Shuang Li , Yan Zhong , Zhao Zhang , Fuzhen Zhuang , Hongrui Liu , Libang Zhang , Guo Ye , Huimei He

Everything to the Synthetic: Diffusion-driven Test-time Adaptation via Synthetic-Domain Alignment

Test-time adaptation (TTA) aims to improve the performance of source-domain pre-trained models on previously unseen, shifted target domains. Traditional TTA methods primarily adapt model weights based on target data streams, making model…

Computer Vision and Pattern Recognition · Computer Science 2024-12-17 Jiayi Guo , Junhao Zhao , Chaoqun Du , Yulin Wang , Chunjiang Ge , Zanlin Ni , Shiji Song , Humphrey Shi , Gao Huang

TTIDA: Controllable Generative Data Augmentation via Text-to-Text and Text-to-Image Models

Data augmentation has been established as an efficacious approach to supplement useful information for low-resource datasets. Traditional augmentation techniques such as noise injection and image transformations have been widely used. In…

Computer Vision and Pattern Recognition · Computer Science 2023-04-19 Yuwei Yin , Jean Kaddour , Xiang Zhang , Yixin Nie , Zhenguang Liu , Lingpeng Kong , Qi Liu

GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain Adaptation

Source-free domain adaptation is a crucial machine learning topic, as it contains numerous applications in the real world, particularly with respect to data privacy. Existing approaches predominantly focus on Euclidean data, such as images…

Machine Learning · Computer Science 2024-10-23 Junyu Luo , Yiyang Gu , Xiao Luo , Wei Ju , Zhiping Xiao , Yusheng Zhao , Jingyang Yuan , Ming Zhang

Time-aware Random Walk Diffusion to Improve Dynamic Graph Learning

How can we augment a dynamic graph for improving the performance of dynamic graph neural networks? Graph augmentation has been widely utilized to boost the learning performance of GNN-based models. However, most existing approaches only…

Machine Learning · Computer Science 2023-02-15 Jong-whi Lee , Jinhong Jung

Efficient Topology-aware Data Augmentation for High-Degree Graph Neural Networks

In recent years, graph neural networks (GNNs) have emerged as a potent tool for learning on graph-structured data and won fruitful successes in varied fields. The majority of GNNs follow the message-passing paradigm, where representations…

Machine Learning · Computer Science 2024-08-30 Yurui Lai , Xiaoyang Lin , Renchi Yang , Hongtao Wang

Latent Code Augmentation Based on Stable Diffusion for Data-free Substitute Attacks

Since the training data of the target model is not available in the black-box substitute attack, most recent schemes utilize GANs to generate data for training the substitute model. However, these GANs-based schemes suffer from low training…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Mingwen Shao , Lingzhuang Meng , Yuanjian Qiao , Lixu Zhang , Wangmeng Zuo

Towards Synthesizing High-Dimensional Tabular Data with Limited Samples

Diffusion-based tabular data synthesis models have yielded promising results. However, when the data dimensionality increases, existing models tend to degenerate and may perform even worse than simpler, non-diffusion-based models. This is…

Machine Learning · Computer Science 2025-11-12 Zuqing Li , Junhao Gan , Jianzhong Qi

Controllable Data Augmentation for Context-Dependent Text-to-SQL

The limited scale of annotated data constraints existing context-dependent text-to-SQL models because of the complexity of labeling. The data augmentation method is a commonly used method to solve this problem. However, the data generated…

Computation and Language · Computer Science 2023-05-01 Dingzirui Wang , Longxu Dou , Wanxiang Che

Advancing Graph Generation through Beta Diffusion

Diffusion models have excelled in generating natural images and are now being adapted to a variety of data types, including graphs. However, conventional models often rely on Gaussian or categorical diffusion processes, which can struggle…

Machine Learning · Computer Science 2024-10-08 Xinyang Liu , Yilin He , Bo Chen , Mingyuan Zhou

Deep-Graph-Sprints: Accelerated Representation Learning in Continuous-Time Dynamic Graphs

Continuous-time dynamic graphs (CTDGs) are essential for modeling interconnected, evolving systems. Traditional methods for extracting knowledge from these graphs often depend on feature engineering or deep learning. Feature engineering is…

Machine Learning · Computer Science 2024-11-08 Ahmad Naser Eddin , Jacopo Bono , David Aparício , Hugo Ferreira , Pedro Ribeiro , Pedro Bizarro