Related papers: Diffusion Augmented Agents: A Framework for Effici…

DAug: Diffusion-based Channel Augmentation for Radiology Image Retrieval and Classification

Medical image understanding requires meticulous examination of fine visual details, with particular regions requiring additional attention. While radiologists build such expertise over years of experience, it is challenging for AI models to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Ying Jin, Zhuoran Zhou, Haoquan Fang, Jenq-Neng Hwang

Underlying Semantic Diffusion for Effective and Efficient In-Context Learning

Diffusion models have emerged as a powerful framework for tasks like controllable image generation and dense prediction. However, existing models often struggle to capture underlying semantics (e.g., edges, textures, shapes) and effectively…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Zhong Ji, Weilong Cao, Yan Zhang, Yanwei Pang, Jungong Han, Xuelong Li

Diffusion Domain Teacher: Diffusion Guided Domain Adaptive Object Detector

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He, Yuxiang Ji, Zhuoyue Tan, Liaoni Wu

Learning Graph Representation of Agent Diffusers

Diffusion-based generative models have significantly advanced text-to-image synthesis, demonstrating impressive text comprehension and zero-shot generalization. These models refine images from random noise based on textual prompts, with…

Machine Learning · Computer Science 2025-05-16 Youcef Djenouri, Nassim Belmecheri, Tomasz Michalak, Jan Dubiński, Ahmed Nabil Belbachir, Anis Yazidi

DiffusionAgent: Navigating Expert Models for Agentic Image Generation

In the accelerating era of human-instructed visual content creation, diffusion models have demonstrated remarkable generative potential. Yet their deployment is constrained by a dual bottleneck: semantic ambiguity in diverse prompts and the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Jie Qin, Jie Wu, Weifeng Chen, Yueming Lyu

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Learning from large corpora of data, pre-trained models have achieved impressive progress. As a popular form of generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma, Yuhuan Yang, Chen Ju, Fei Zhang, Jinxiang Liu, Yu Wang, Ya Zhang, Yanfeng Wang

On Designing Diffusion Autoencoders for Efficient Generation and Representation Learning

Diffusion autoencoders (DAs) are variants of diffusion generative models that use an input-dependent latent variable to capture representations alongside the diffusion process. These representations, to varying extents, can be used for…

Machine Learning · Computer Science 2025-06-03 Magdalena Proszewska, Nikolay Malkin, N. Siddharth

On Data Augmentation for GAN Training

Recent successes in Generative Adversarial Networks (GAN) have affirmed the importance of using more data in GAN training. Yet it is expensive to collect data in many domains such as medical applications. Data Augmentation (DA) has been…

Computer Vision and Pattern Recognition · Computer Science 2021-02-24 Ngoc-Trung Tran, Viet-Hung Tran, Ngoc-Bao Nguyen, Trung-Kien Nguyen, Ngai-Man Cheung

AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation

Convolutional neural networks have been widely applied to medical image segmentation and have achieved considerable performance. However, the performance may be significantly affected by the domain gap between training data (source domain)…

Image and Video Processing · Electrical Eng. & Systems 2022-07-28 Junyan Lyu, Yiqi Zhang, Yijin Huang, Li Lin, Pujin Cheng, Xiaoying Tang

Agents of Diffusion: Enhancing Diffusion Language Models with Multi-Agent Reinforcement Learning for Structured Data Generation (Extended Version)

Generating high-quality structured data, such as JSON records, remains a fundamental challenge for large language models (LLMs), particularly when semantic richness must coexist with strict schema adherence. While autoregressive LLMs offer…

Multiagent Systems · Computer Science 2026-01-13 Aja Khanal, Kaushik T. Ranade, Rishabh Agrawal, Kalyan S. Basu, Apurva Narayan

DADP: Domain Adaptive Diffusion Policy

Learning domain-adaptive policies that can generalize to unseen transition dynamics remains a fundamental challenge in learning-based control. Substantial progress has been made through domain representation learning to capture…

Machine Learning · Computer Science 2026-03-31 Pengcheng Wang, Qinghang Liu, Haotian Lin, Yiheng Li, Guojian Zhan, Masayoshi Tomizuka, Yixiao Wang

DLLM Agent: See Farther, Run Faster

Diffusion large language models (DLLMs) have emerged as an alternative to autoregressive (AR) decoding with appealing efficiency and modeling properties, yet their implications for agentic multi-step decision making remain underexplored. We…

Computation and Language · Computer Science 2026-03-23 Huiling Zhen, Weizhe Lin, Renxi Liu, Kai Han, Yiming Li, Yuchuan Tian, Hanting Chen, Xiaoguang Li, Xiaosong Li, Chen Chen, Xianzhi Yu, Mingxuan Yuan, Youliang Yan, Peifeng Qin, Jun Wang, Yu Wang, Dacheng Tao, Yunhe Wang

DiffAug: Enhance Unsupervised Contrastive Learning with Domain-Knowledge-Free Diffusion-based Data Augmentation

Unsupervised contrastive learning has gained prominence in fields such as vision and biology, leveraging predefined positive/negative samples for representation learning. Data augmentation, categorized into hand-designed and model-based…

Machine Learning · Computer Science 2024-05-28 Zelin Zang, Hao Luo, Kai Wang, Panpan Zhang, Fan Wang, Stan Z. Li, Yang You

DiffAug: A Diffuse-and-Denoise Augmentation for Training Robust Classifiers

We introduce DiffAug, a simple and efficient diffusion-based augmentation technique to train image classifiers for the crucial yet challenging goal of improved classifier robustness. Applying DiffAug to a given example consists of one…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Chandramouli Sastry, Sri Harsha Dumpala, Sageev Oore

The Bitter Lesson of Diffusion Language Models for Agentic Workflows: A Comprehensive Reality Check

The pursuit of real-time agentic interaction has driven interest in Diffusion-based Large Language Models (dLLMs) as alternatives to auto-regressive backbones, promising to break the sequential latency bottleneck. However, does such…

Computation and Language · Computer Science 2026-04-27 Qingyu Lu, Liang Ding, Kanjian Zhang, Jinxia Zhang, Dacheng Tao

Agentic Retrieval-Augmented Generation for Time Series Analysis

Time series modeling is crucial for many applications; however, it faces challenges such as complex spatio-temporal dependencies and distribution shifts in learning from historical context to predict task-specific outcomes. To address these…

Artificial Intelligence · Computer Science 2024-08-28 Chidaksh Ravuru, Sagar Srinivas Sakhinana, Venkataramana Runkana

A Novel Task-Driven Diffusion-Based Policy with Affordance Learning for Generalizable Manipulation of Articulated Objects

Despite recent advances in dexterous manipulations, the manipulation of articulated objects and generalization across different categories remain significant challenges. To address these issues, we introduce DART, a novel framework that…

Robotics · Computer Science 2025-09-19 Hao Zhang, Zhen Kan, Weiwei Shang, Yongduan Song

Detecting Diffusion-generated Images via Dynamic Assembly Forests

Diffusion models are known for generating high-quality images, causing serious security concerns. To combat this, most efforts rely on deep neural networks (e.g., CNNs and Transformers), while largely overlooking the potential of…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Mengxin Fu, Yuezun Li

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models

Recent advancements in visual generative models have enabled high-quality image and video generation, opening diverse applications. However, evaluating these models often demands sampling hundreds or thousands of images or videos, making…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Fan Zhang, Shulin Tian, Ziqi Huang, Yu Qiao, Ziwei Liu

DDM-Lag: A Diffusion-based Decision-making Model for Autonomous Vehicles with Lagrangian Safety Enhancement

Decision-making stands as a pivotal component in the realm of autonomous vehicles (AVs), playing a crucial role in navigating the intricacies of autonomous driving. Amidst the evolving landscape of data-driven methodologies, enhancing…

Robotics · Computer Science 2024-04-08 Jiaqi Liu, Peng Hang, Xiaocong Zhao, Jianqiang Wang, Jian Sun