DiffusionAgent: Navigating Expert Models for Agentic Image Generation

Jie Qin; Jie Wu; Weifeng Chen; Yueming Lyu

DiffusionAgent: Navigating Expert Models for Agentic Image Generation

Computer Vision and Pattern Recognition 2026-01-21 v2 Artificial Intelligence

Authors: Jie Qin , Jie Wu , Weifeng Chen , Yueming Lyu

Abstract

In the accelerating era of human-instructed visual content creation, diffusion models have demonstrated remarkable generative potential. Yet their deployment is constrained by a dual bottleneck: semantic ambiguity in diverse prompts and the narrow specialization of individual models. A single diffusion architecture struggles to maintain optimal performance across heterogeneous prompts, while conventional "parse-then-call" pipelines artificially separate semantic understanding from generative execution. To bridge this gap, we introduce DiffusionAgent, a unified, language-model-driven agent that casts the entire "prompt comprehension-expert routing-image synthesis" loop into a agentic framework. Our contributions are three-fold: (1) a tree-of-thought-powered expert navigator that performs fine-grained semantic parsing and zero-shot matching to the most suitable diffusion model via an extensible prior-knowledge tree; (2) an advantage database updated with human-in-the-loop feedback, continually aligning model-selection policy with human aesthetic and semantic preferences; and (3) a fully decoupled agent architecture that activates the optimal generative path for open-domain prompts without retraining or fine-tuning any expert. Extensive experiments show that DiffusionAgent retains high generation quality while significantly broadening prompt coverage, establishing a new performance and generality benchmark for multi-domain image synthesis. The code is available at https://github.com/DiffusionAgent/DiffusionAgent

Keywords

diffusion model multi-agent reasoning multi-agent systems

Cite

@article{arxiv.2401.10061,
  title  = {DiffusionAgent: Navigating Expert Models for Agentic Image Generation},
  author = {Jie Qin and Jie Wu and Weifeng Chen and Yueming Lyu},
  journal= {arXiv preprint arXiv:2401.10061},
  year   = {2026}
}

DiffusionAgent: Navigating Expert Models for Agentic Image Generation

Abstract

Keywords

Cite

Related papers