English

Reimagining Parameter Space Exploration with Diffusion Models

Machine Learning 2025-06-24 v1 Artificial Intelligence

Abstract

Adapting neural networks to new tasks typically requires task-specific fine-tuning, which is time-consuming and reliant on labeled data. We explore a generative alternative that produces task-specific parameters directly from task identity, eliminating the need for task-specific training. To this end, we propose using diffusion models to learn the underlying structure of effective task-specific parameter space and synthesize parameters on demand. Once trained, the task-conditioned diffusion model can generate specialized weights directly from task identifiers. We evaluate this approach across three scenarios: generating parameters for a single seen task, for multiple seen tasks, and for entirely unseen tasks. Experiments show that diffusion models can generate accurate task-specific parameters and support multi-task interpolation when parameter subspaces are well-structured, but fail to generalize to unseen tasks, highlighting both the potential and limitations of this generative solution.

Keywords

Cite

@article{arxiv.2506.17807,
  title  = {Reimagining Parameter Space Exploration with Diffusion Models},
  author = {Lijun Zhang and Xiao Liu and Hui Guan},
  journal= {arXiv preprint arXiv:2506.17807},
  year   = {2025}
}

Comments

Accepted at ICML 2025 EXAIT Workshop