Related papers: Diffusion-Based Neural Network Weights Generation

Text2Weight: Bridging Natural Language and Neural Network Weight Spaces

How far are we really from automatically generating neural networks? While neural network weight generation shows promise, current approaches struggle with generalization to unseen tasks and practical application exploration. To address…

Machine Learning · Computer Science 2025-08-20 Bowen Tian , Wenshuo Chen , Zexi Li , Songning Lai , Jiemin Wu , Yutao Yue

Reimagining Parameter Space Exploration with Diffusion Models

Adapting neural networks to new tasks typically requires task-specific fine-tuning, which is time-consuming and reliant on labeled data. We explore a generative alternative that produces task-specific parameters directly from task identity,…

Machine Learning · Computer Science 2025-06-24 Lijun Zhang , Xiao Liu , Hui Guan

Feedforward Neural Networks with Diffused Nonlinear Weight Functions

In this paper, feedforward neural networks are presented that have nonlinear weight functions based on look--up tables, that are specially smoothed in a regularization called the diffusion. The idea of such a type of networks is based on…

Neural and Evolutionary Computing · Computer Science 2007-05-23 Artur Rataj

DeepWeightFlow: Re-Basined Flow Matching for Generating Neural Network Weights

Building efficient and effective generative models for neural network weights has been a research focus of significant interest that faces challenges posed by the high-dimensional weight spaces of modern neural networks and their…

Machine Learning · Computer Science 2026-05-01 Saumya Gupta , Scott Biggs , Moritz Laber , Zohair Shafi , Robin Walters , Ayan Paul

Pruning then Reweighting: Towards Data-Efficient Training of Diffusion Models

Despite the remarkable generation capabilities of Diffusion Models (DMs), conducting training and inference remains computationally expensive. Previous works have been devoted to accelerating diffusion sampling, but achieving data-efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-03 Yize Li , Yihua Zhang , Sijia Liu , Xue Lin

Learning to Learn Weight Generation via Local Consistency Diffusion

Diffusion-based algorithms have emerged as promising techniques for weight generation. However, existing solutions are limited by two challenges: generalizability and local target assignment. The former arises from the inherent lack of…

Machine Learning · Computer Science 2025-05-20 Yunchuan Guan , Yu Liu , Ke Zhou , Zhiqi Shen , Jenq-Neng Hwang , Lei Li

Efficient Training with Denoised Neural Weights

Good weight initialization serves as an effective measure to reduce the training cost of a deep neural network (DNN) model. The choice of how to initialize parameters is challenging and may require manual tuning, which can be time-consuming…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Yifan Gong , Zheng Zhan , Yanyu Li , Yerlan Idelbayev , Andrey Zharkov , Kfir Aberman , Sergey Tulyakov , Yanzhi Wang , Jian Ren

Neural Network Diffusion

Diffusion models have achieved remarkable success in image and video generation. In this work, we demonstrate that diffusion models can also \textit{generate high-performing neural network parameters}. Our approach is simple, utilizing an…

Machine Learning · Computer Science 2025-01-03 Kai Wang , Dongwen Tang , Boya Zeng , Yida Yin , Zhaopan Xu , Yukun Zhou , Zelin Zang , Trevor Darrell , Zhuang Liu , Yang You

Fast and Stable Diffusion Planning through Variational Adaptive Weighting

Diffusion models have recently shown promise in offline RL. However, these methods often suffer from high training costs and slow convergence, particularly when using transformer-based denoising backbones. While several optimization…

Machine Learning · Computer Science 2025-06-23 Zhiying Qiu , Tao Lin

Neural Diffusion Models

Diffusion models have shown remarkable performance on many generative tasks. Despite recent success, most diffusion models are restricted in that they only allow linear transformation of the data distribution. In contrast, broader family of…

Machine Learning · Computer Science 2024-06-04 Grigory Bartosh , Dmitry Vetrov , Christian A. Naesseth

Make (Nearly) Every Neural Network Better: Generating Neural Network Ensembles by Weight Parameter Resampling

Deep Neural Networks (DNNs) have become increasingly popular in computer vision, natural language processing, and other areas. However, training and fine-tuning a deep learning model is computationally intensive and time-consuming. We…

Machine Learning · Computer Science 2018-07-04 Jiayi Liu , Samarth Tripathi , Unmesh Kurup , Mohak Shah

Flows and Diffusions on the Neural Manifold

Diffusion and flow-based generative models have achieved remarkable success in domains such as image synthesis, video generation, and natural language modeling. In this work, we extend these advances to weight space learning by leveraging…

Machine Learning · Computer Science 2025-10-17 Daniel Saragih , Deyu Cao , Tejas Balaji

Differentiable Weightless Neural Networks

We introduce the Differentiable Weightless Neural Network (DWN), a model based on interconnected lookup tables. Training of DWNs is enabled by a novel Extended Finite Difference technique for approximate differentiation of binary values. We…

Machine Learning · Computer Science 2025-03-04 Alan T. L. Bacellar , Zachary Susskind , Mauricio Breternitz , Eugene John , Lizy K. John , Priscila M. V. Lima , Felipe M. G. França

Hyper Diffusion Avatars: Dynamic Human Avatar Generation using Network Weight Space Diffusion

Creating human avatars is a highly desirable yet challenging task. Recent advancements in radiance field rendering have achieved unprecedented photorealism and real-time performance for personalized dynamic human avatars. However, these…

Graphics · Computer Science 2025-09-09 Dongliang Cao , Guoxing Sun , Marc Habermann , Florian Bernard

Diffusion Model as Representation Learner

Diffusion Probabilistic Models (DPMs) have recently demonstrated impressive results on various generative tasks.Despite its promises, the learned representations of pre-trained DPMs, however, have not been fully understood. In this paper,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Xingyi Yang , Xinchao Wang

DPBridge: Latent Diffusion Bridge for Dense Prediction

Diffusion models demonstrate remarkable capabilities in capturing complex data distributions and have achieved compelling results in many generative tasks. While they have recently been extended to dense prediction tasks such as depth…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Haorui Ji , Taojun Lin , Hongdong Li

Deep Generative Modeling on Limited Data with Regularization by Nontransferable Pre-trained Models

Deep generative models (DGMs) are data-eager because learning a complex model on limited data suffers from a large variance and easily overfits. Inspired by the classical perspective of the bias-variance tradeoff, we propose regularized…

Machine Learning · Computer Science 2023-04-11 Yong Zhong , Hongtao Liu , Xiaodong Liu , Fan Bao , Weiran Shen , Chongxuan Li

Metasurface-empowered freely-arrangeable multi-task diffractive neural networks with weighted training

Recent advancements in optical computing have garnered considerable research interests owing to its ener-gy-efficient operation and ultralow latency characteristics. As an emerging framework in this domain, dif-fractive deep neural networks…

Applied Physics · Physics 2025-06-24 Yudong Tian , Haifeng Xu , Yuqing Liu , Xiangyu Zhao , Jingzhu Shao , Jierong Cheng , Chongzhao Wu

DiffusedWrinkles: A Diffusion-Based Model for Data-Driven Garment Animation

We present a data-driven method for learning to generate animations of 3D garments using a 2D image diffusion model. In contrast to existing methods, typically based on fully connected networks, graph neural networks, or generative…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Raquel Vidaurre , Elena Garces , Dan Casas

Neural Flow Diffusion Models: Learnable Forward Process for Improved Diffusion Modelling

Conventional diffusion models typically relies on a fixed forward process, which implicitly defines complex marginal distributions over latent variables. This can often complicate the reverse process' task in learning generative…

Machine Learning · Statistics 2025-06-10 Grigory Bartosh , Dmitry Vetrov , Christian A. Naesseth