Related papers: Network with Sub-Networks

HyperNetworks

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the…

Machine Learning · Computer Science 2016-12-02 David Ha , Andrew Dai , Quoc V. Le

A Brief Review of Hypernetworks in Deep Learning

Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility,…

Machine Learning · Computer Science 2025-01-03 Vinod Kumar Chauhan , Jiandong Zhou , Ping Lu , Soheila Molaei , David A. Clifton

Neural networks with differentiable structure

While gradient descent has proven highly successful in learning connection weights for neural networks, the actual structure of these networks is usually determined by hand, or by other optimization algorithms. Here we describe a simple…

Neural and Evolutionary Computing · Computer Science 2016-08-09 Thomas Miconi

Unifying and Merging Well-trained Deep Neural Networks for Inference Stage

We propose a novel method to merge convolutional neural-nets for the inference stage. Given two well-trained networks that may have different architectures that handle different tasks, our method aligns the layers of the original networks…

Computer Vision and Pattern Recognition · Computer Science 2018-05-15 Yi-Min Chou , Yi-Ming Chan , Jia-Hong Lee , Chih-Yi Chiu , Chu-Song Chen

Generating Neural Networks with Neural Networks

Hypernetworks are neural networks that generate weights for another neural network. We formulate the hypernetwork training objective as a compromise between accuracy and diversity, where the diversity takes into account trivial symmetry…

Machine Learning · Statistics 2018-04-10 Lior Deutsch

Neural networks with dynamical coefficients and adjustable connections on the basis of integrated backpropagation

We consider artificial neurons which will update their weight coefficients with an internal rule based on backpropagation, rather than using it as an external training procedure. To achieve this we include the backpropagation error estimate…

Neural and Evolutionary Computing · Computer Science 2018-08-07 M. N. Nazarov

Subnetwork-to-go: Elastic Neural Network with Dynamic Training and Customizable Inference

Deploying neural networks to different devices or platforms is in general challenging, especially when the model size is large or model complexity is high. Although there exist ways for model pruning or distillation, it is typically…

Machine Learning · Computer Science 2023-12-07 Kai Li , Yi Luo

The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks

Neural networks have seen an explosion of usage and research in the past decade, particularly within the domains of computer vision and natural language processing. However, only recently have advancements in neural networks yielded…

Machine Learning · Computer Science 2022-07-20 Jacob Renn , Ian Sotnek , Benjamin Harvey , Brian Caffo

Making Sense of Hidden Layer Information in Deep Networks by Learning Hierarchical Targets

This paper proposes an architecture for deep neural networks with hidden layer branches that learn targets of lower hierarchy than final layer targets. The branches provide a channel for enforcing useful information in hidden layer which…

Neural and Evolutionary Computing · Computer Science 2016-09-27 Abhinav Tushar

Disentangling Neural Architectures and Weights: A Case Study in Supervised Classification

The history of deep learning has shown that human-designed problem-specific networks can greatly improve the classification performance of general neural models. In most practical cases, however, choosing the optimal architecture for a…

Machine Learning · Computer Science 2020-09-14 Nicolo Colombo , Yang Gao

Modeling Uncertainty by Learning a Hierarchy of Deep Neural Connections

Modeling uncertainty in deep neural networks, despite recent important advances, is still an open problem. Bayesian neural networks are a powerful solution, where the prior over network weights is a design choice, often a normal…

Machine Learning · Statistics 2019-10-29 Raanan Y. Rohekar , Yaniv Gurwicz , Shami Nisimov , Gal Novik

WeightScale: Interpreting Weight Change in Neural Networks

Interpreting the learning dynamics of neural networks can provide useful insights into how networks learn and the development of better training and design approaches. We present an approach to interpret learning in neural networks by…

Machine Learning · Computer Science 2022-03-29 Ayush Manish Agrawal , Atharva Tendle , Harshvardhan Sikka , Sahib Singh

Less is More: Selective Layer Finetuning with SubTuning

Finetuning a pretrained model has become a standard approach for training neural networks on novel tasks, resulting in fast convergence and improved performance. In this work, we study an alternative finetuning method, where instead of…

Machine Learning · Computer Science 2023-07-04 Gal Kaplun , Andrey Gurevich , Tal Swisa , Mazor David , Shai Shalev-Shwartz , Eran Malach

On the Transferability of Representations in Neural Networks Between Datasets and Tasks

Deep networks, composed of multiple layers of hierarchical distributed representations, tend to learn low-level features in initial layers and transition to high-level features towards final layers. Paradigms such as transfer learning,…

Machine Learning · Computer Science 2018-11-30 Haytham M. Fayek , Lawrence Cavedon , Hong Ren Wu

A Global Algorithm for Training Multilayer Neural Networks

We present a global algorithm for training multilayer neural networks in this Letter. The algorithm is focused on controlling the local fields of neurons induced by the input of samples by random adaptations of the synaptic weights. Unlike…

Biological Physics · Physics 2007-05-23 Hong Zhao , Tao Jin

With Greater Distance Comes Worse Performance: On the Perspective of Layer Utilization and Model Generalization

Generalization of deep neural networks remains one of the main open problems in machine learning. Previous theoretical works focused on deriving tight bounds of model complexity, while empirical works revealed that neural networks exhibit…

Machine Learning · Computer Science 2022-01-31 James Wang , Cheng-Lin Yang

Some Insights into the Geometry and Training of Neural Networks

Neural networks have been successfully used for classification tasks in a rapidly growing number of practical applications. Despite their popularity and widespread use, there are still many aspects of training and classification that are…

Machine Learning · Computer Science 2016-05-03 Ewout van den Berg

A simple approach for quantizing neural networks

In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving…

Machine Learning · Computer Science 2023-04-06 Johannes Maly , Rayan Saab

Half-Layered Neural Networks

We propose a ``half'' layer of hidden units that has some of its weights randomly set and some of them trained. A half unit is composed of two stages: First, it takes a weighted sum of its inputs with fixed random weights, and second, the…

Machine Learning · Computer Science 2025-06-06 Ethem Alpaydin

Measuring Uncertainty through Bayesian Learning of Deep Neural Network Structure

Bayesian neural networks (BNNs) augment deep networks with uncertainty quantification by Bayesian treatment of the network weights. However, such models face the challenge of Bayesian inference in a high-dimensional and usually…

Machine Learning · Computer Science 2021-03-30 Zhijie Deng , Yucen Luo , Jun Zhu , Bo Zhang