English
Related papers

Related papers: Diffusion Augmented Agents: A Framework for Effici…

200 papers

Medical image understanding requires meticulous examination of fine visual details, with particular regions requiring additional attention. While radiologists build such expertise over years of experience, it is challenging for AI models to…

Computer Vision and Pattern Recognition · Computer Science 2024-12-09 Ying Jin , Zhuoran Zhou , Haoquan Fang , Jenq-Neng Hwang

Diffusion models has emerged as a powerful framework for tasks like image controllable generation and dense prediction. However, existing models often struggle to capture underlying semantics (e.g., edges, textures, shapes) and effectively…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Zhong Ji , Weilong Cao , Yan Zhang , Yanwei Pang , Jungong Han , Xuelong Li

Object detectors often suffer a decrease in performance due to the large domain gap between the training data (source domain) and real-world data (target domain). Diffusion-based generative models have shown remarkable abilities in…

Computer Vision and Pattern Recognition · Computer Science 2025-06-05 Boyong He , Yuxiang Ji , Zhuoyue Tan , Liaoni Wu

Diffusion-based generative models have significantly advanced text-to-image synthesis, demonstrating impressive text comprehension and zero-shot generalization. These models refine images from random noise based on textual prompts, with…

Machine Learning · Computer Science 2025-05-16 Youcef Djenouri , Nassim Belmecheri , Tomasz Michalak , Jan Dubiński , Ahmed Nabil Belbachir , Anis Yazidi

In the accelerating era of human-instructed visual content creation, diffusion models have demonstrated remarkable generative potential. Yet their deployment is constrained by a dual bottleneck: semantic ambiguity in diverse prompts and the…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Jie Qin , Jie Wu , Weifeng Chen , Yueming Lyu

Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma , Yuhuan Yang , Chen Ju , Fei Zhang , Jinxiang Liu , Yu Wang , Ya Zhang , Yanfeng Wang

Diffusion autoencoders (DAs) are variants of diffusion generative models that use an input-dependent latent variable to capture representations alongside the diffusion process. These representations, to varying extents, can be used for…

Machine Learning · Computer Science 2025-06-03 Magdalena Proszewska , Nikolay Malkin , N. Siddharth

Recent successes in Generative Adversarial Networks (GAN) have affirmed the importance of using more data in GAN training. Yet it is expensive to collect data in many domains such as medical applications. Data Augmentation (DA) has been…

Computer Vision and Pattern Recognition · Computer Science 2021-02-24 Ngoc-Trung Tran , Viet-Hung Tran , Ngoc-Bao Nguyen , Trung-Kien Nguyen , Ngai-Man Cheung

Convolutional neural networks have been widely applied to medical image segmentation and have achieved considerable performance. However, the performance may be significantly affected by the domain gap between training data (source domain)…

Image and Video Processing · Electrical Eng. & Systems 2022-07-28 Junyan Lyu , Yiqi Zhang , Yijin Huang , Li Lin , Pujin Cheng , Xiaoying Tang

Generating high-quality structured data such as JSON records, remains a fundamental challenge for large language models (LLMs), particularly when semantic richness must coexist with strict schema adherence. While autoregressive LLMs offer…

Multiagent Systems · Computer Science 2026-01-13 Aja Khanal , Kaushik T. Ranade , Rishabh Agrawal , Kalyan S. Basu , Apurva Narayan

Learning domain adaptive policies that can generalize to unseen transition dynamics, remains a fundamental challenge in learning-based control. Substantial progress has been made through domain representation learning to capture…

Machine Learning · Computer Science 2026-03-31 Pengcheng Wang , Qinghang Liu , Haotian Lin , Yiheng Li , Guojian Zhan , Masayoshi Tomizuka , Yixiao Wang

Diffusion large language models (DLLMs) have emerged as an alternative to autoregressive (AR) decoding with appealing efficiency and modeling properties, yet their implications for agentic multi-step decision making remain underexplored. We…

Unsupervised Contrastive learning has gained prominence in fields such as vision, and biology, leveraging predefined positive/negative samples for representation learning. Data augmentation, categorized into hand-designed and model-based…

Machine Learning · Computer Science 2024-05-28 Zelin Zang , Hao Luo , Kai Wang , Panpan Zhang , Fan Wang , Stan. Z Li , Yang You

We introduce DiffAug, a simple and efficient diffusion-based augmentation technique to train image classifiers for the crucial yet challenging goal of improved classifier robustness. Applying DiffAug to a given example consists of one…

Computer Vision and Pattern Recognition · Computer Science 2024-05-30 Chandramouli Sastry , Sri Harsha Dumpala , Sageev Oore

The pursuit of real-time agentic interaction has driven interest in Diffusion-based Large Language Models (dLLMs) as alternatives to auto-regressive backbones, promising to break the sequential latency bottleneck. However, does such…

Computation and Language · Computer Science 2026-04-27 Qingyu Lu , Liang Ding , Kanjian Zhang , Jinxia Zhang , Dacheng Tao

Time series modeling is crucial for many applications, however, it faces challenges such as complex spatio-temporal dependencies and distribution shifts in learning from historical context to predict task-specific outcomes. To address these…

Artificial Intelligence · Computer Science 2024-08-28 Chidaksh Ravuru , Sagar Srinivas Sakhinana , Venkataramana Runkana

Despite recent advances in dexterous manipulations, the manipulation of articulated objects and generalization across different categories remain significant challenges. To address these issues, we introduce DART, a novel framework that…

Robotics · Computer Science 2025-09-19 Hao Zhang , Zhen Kan , Weiwei Shang , Yongduan Song

Diffusion models are known for generating high-quality images, causing serious security concerns. To combat this, most efforts rely on deep neural networks (e.g., CNNs and Transformers), while largely overlooking the potential of…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Mengxin Fu , Yuezun Li

Recent advancements in visual generative models have enabled high-quality image and video generation, opening diverse applications. However, evaluating these models often demands sampling hundreds or thousands of images or videos, making…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Fan Zhang , Shulin Tian , Ziqi Huang , Yu Qiao , Ziwei Liu

Decision-making stands as a pivotal component in the realm of autonomous vehicles (AVs), playing a crucial role in navigating the intricacies of autonomous driving. Amidst the evolving landscape of data-driven methodologies, enhancing…

Robotics · Computer Science 2024-04-08 Jiaqi Liu , Peng Hang , Xiaocong Zhao , Jianqiang Wang , Jian Sun
‹ Prev 1 2 3 10 Next ›