Neural Network Diffusion

Kai Wang; Dongwen Tang; Boya Zeng; Yida Yin; Zhaopan Xu; Yukun Zhou; Zelin Zang; Trevor Darrell; Zhuang Liu; Yang You

Neural Network Diffusion

Machine Learning 2025-01-03 v3 Computer Vision and Pattern Recognition

Authors: Kai Wang , Dongwen Tang , Boya Zeng , Yida Yin , Zhaopan Xu , Yukun Zhou , Zelin Zang , Trevor Darrell , Zhuang Liu , Yang You

View on arXiv ↗ PDF ↗

Abstract

Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also \textit{generate high-performing neural network parameters}. Our approach is simple, utilizing an autoencoder and a diffusion model. The autoencoder extracts latent representations of a subset of the trained neural network parameters. Next, a diffusion model is trained to synthesize these latent representations from random noise. This model then generates new representations, which are passed through the autoencoder's decoder to produce new subsets of high-performing network parameters. Across various architectures and datasets, our approach consistently generates models with comparable or improved performance over trained networks, with minimal additional cost. Notably, we empirically find that the generated models are not memorizing the trained ones. Our results encourage more exploration into the versatile use of diffusion models. Our code is available \href{https://github.com/NUS-HPC-AI-Lab/Neural-Network-Diffusion}{here}.

Keywords

diffusion model deep neural networks image generation

Cite

@article{arxiv.2402.13144,
  title  = {Neural Network Diffusion},
  author = {Kai Wang and Dongwen Tang and Boya Zeng and Yida Yin and Zhaopan Xu and Yukun Zhou and Zelin Zang and Trevor Darrell and Zhuang Liu and Yang You},
  journal= {arXiv preprint arXiv:2402.13144},
  year   = {2025}
}

Comments

We introduce a novel approach for parameter generation, named neural network parameter diffusion (\textbf{p-diff}), which employs a standard latent diffusion model to synthesize a new set of parameters

Neural Network Diffusion

Abstract

Keywords

Cite

Comments

Related papers