Related papers: HyperNetworks

A Brief Review of Hypernetworks in Deep Learning

Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility,…

Machine Learning · Computer Science 2025-01-03 Vinod Kumar Chauhan , Jiandong Zhou , Ping Lu , Soheila Molaei , David A. Clifton

Generating Neural Networks with Neural Networks

Hypernetworks are neural networks that generate weights for another neural network. We formulate the hypernetwork training objective as a compromise between accuracy and diversity, where the diversity takes into account trivial symmetry…

Machine Learning · Statistics 2018-04-10 Lior Deutsch

Weighted Hypernetworks

Complex network theory has been used to study complex systems. However, many real-life systems involve multiple kinds of objects . They can't be described by simple graphs. In order to provide complete information of these systems, we…

Physics and Society · Physics 2015-11-10 Jin-Li Guo , Xin-Yun Zhu

Continual Learning with Dependency Preserving Hypernetworks

Humans learn continually throughout their lifespan by accumulating diverse knowledge and fine-tuning it for future tasks. When presented with a similar goal, neural networks suffer from catastrophic forgetting if data distributions across…

Machine Learning · Computer Science 2022-09-19 Dupati Srikar Chandra , Sakshi Varshney , P. K. Srijith , Sunil Gupta

A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks

Recent work on mode connectivity in the loss landscape of deep neural networks has demonstrated that the locus of (sub-)optimal weight vectors lies on continuous paths. In this work, we train a neural network that serves as a hypernetwork,…

Machine Learning · Statistics 2019-05-09 Lior Deutsch , Erik Nijkamp , Yu Yang

Principled Weight Initialization for Hypernetworks

Hypernetworks are meta neural networks that generate weights for a main neural network in an end-to-end differentiable manner. Despite extensive applications ranging from multi-task learning to Bayesian deep learning, the problem of…

Machine Learning · Computer Science 2023-12-15 Oscar Chang , Lampros Flokas , Hod Lipson

Network with Sub-Networks

We introduce network with sub-networks, a neural network which its weight layers could be detached into sub-neural networks during inference. To develop weights and biases which could be inserted in both base and sub-neural networks,…

Machine Learning · Computer Science 2021-10-20 Ninnart Fuengfusin , Hakaru Tamukoh

Weight Sharing is Crucial to Succesful Optimization

Exploiting the great expressive power of Deep Neural Network architectures, relies on the ability to train them. While current theoretical work provides, mostly, results showing the hardness of this task, empirical evidence usually differs…

Machine Learning · Computer Science 2017-06-05 Shai Shalev-Shwartz , Ohad Shamir , Shaked Shammah

HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories

To efficiently adapt large models or to train generative models of neural representations, Hypernetworks have drawn interest. While hypernetworks work well, training them is cumbersome, and often requires ground truth optimized weights for…

Machine Learning · Computer Science 2025-05-21 Eric Hedlin , Munawar Hayat , Fatih Porikli , Kwang Moo Yi , Shweta Mahajan

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning

In this work we propose a HyperTransformer, a Transformer-based model for supervised and semi-supervised few-shot learning that generates weights of a convolutional neural network (CNN) directly from support samples. Since the dependence of…

Machine Learning · Computer Science 2022-07-15 Andrey Zhmoginov , Mark Sandler , Max Vladymyrov

Foundation Models Secretly Understand Neural Network Weights: Enhancing Hypernetwork Architectures with Foundation Models

Large pre-trained models, or foundation models, have shown impressive performance when adapted to a variety of downstream tasks, often out-performing specialized models. Hypernetworks, neural networks that generate some or all of the…

Machine Learning · Computer Science 2025-03-04 Jeffrey Gu , Serena Yeung-Levy

DHP: Differentiable Meta Pruning via HyperNetworks

Network pruning has been the driving force for the acceleration of neural networks and the alleviation of model storage/transmission burden. With the advent of AutoML and neural architecture search (NAS), pruning has become topical with…

Computer Vision and Pattern Recognition · Computer Science 2020-08-04 Yawei Li , Shuhang Gu , Kai Zhang , Luc Van Gool , Radu Timofte

Graph Metanetworks for Processing Diverse Neural Architectures

Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of…

Machine Learning · Computer Science 2024-01-02 Derek Lim , Haggai Maron , Marc T. Law , Jonathan Lorraine , James Lucas

A Powerful Generative Model Using Random Weights for the Deep Image Representation

To what extent is the success of deep visualization due to the training? Could we do deep visualization using untrained, random weight networks? To address this issue, we explore new and powerful generative models for three popular deep…

Computer Vision and Pattern Recognition · Computer Science 2016-06-17 Kun He , Yan Wang , John Hopcroft

Multi-Scale Weight Sharing Network for Image Recognition

In this paper, we explore the idea of weight sharing over multiple scales in convolutional networks. Inspired by traditional computer vision approaches, we share the weights of convolution kernels over different scales in the same layers of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-10 Shubhra Aich , Ian Stavness , Yasuhiro Taniguchi , Masaki Yamazaki

Towards Biologically Plausible Convolutional Networks

Convolutional networks are ubiquitous in deep learning. They are particularly useful for images, as they reduce the number of parameters, reduce training time, and increase accuracy. However, as a model of the brain they are seriously…

Machine Learning · Computer Science 2022-01-19 Roman Pogodin , Yash Mehta , Timothy P. Lillicrap , Peter E. Latham

Generating Interpretable Networks using Hypernetworks

An essential goal in mechanistic interpretability to decode a network, i.e., to convert a neural network's raw weights to an interpretable algorithm. Given the difficulty of the decoding problem, progress has been made to understand the…

Machine Learning · Computer Science 2023-12-07 Isaac Liao , Ziming Liu , Max Tegmark

On Infinite-Width Hypernetworks

{\em Hypernetworks} are architectures that produce the weights of a task-specific {\em primary network}. A notable application of hypernetworks in the recent literature involves learning to output functional representations. In these…

Machine Learning · Computer Science 2021-02-24 Etai Littwin , Tomer Galanti , Lior Wolf , Greg Yang

Learning in the Machine: To Share or Not to Share?

Weight-sharing is one of the pillars behind Convolutional Neural Networks and their successes. However, in physical neural systems such as the brain, weight-sharing is implausible. This discrepancy raises the fundamental question of whether…

Machine Learning · Computer Science 2019-10-08 Jordan Ott , Erik Linstead , Nicholas LaHaye , Pierre Baldi

Computing Multiple Image Reconstructions with a Single Hypernetwork

Deep learning based techniques achieve state-of-the-art results in a wide range of image reconstruction tasks like compressed sensing. These methods almost always have hyperparameters, such as the weight coefficients that balance the…

Computer Vision and Pattern Recognition · Computer Science 2022-06-10 Alan Q. Wang , Adrian V. Dalca , Mert R. Sabuncu