English
Related papers

Related papers: Universal Hypernetworks for Arbitrary Models

200 papers

Neural networks are powerful function estimators, leading to their status as a paradigm of choice for modeling structured data. However, unlike other structured representations that emphasize the modularity of the problem -- e.g., factor…

Machine Learning · Computer Science 2022-06-20 Tsvetomila Mihaylova , Vlad Niculae , André F. T. Martins

Hypernetworks are neural networks that generate weights for another neural network. We formulate the hypernetwork training objective as a compromise between accuracy and diversity, where the diversity takes into account trivial symmetry…

Machine Learning · Statistics 2018-04-10 Lior Deutsch

Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility,…

Machine Learning · Computer Science 2025-01-03 Vinod Kumar Chauhan , Jiandong Zhou , Ping Lu , Soheila Molaei , David A. Clifton

The present paper provides a generalized model of network, namely, Hybrid Layered Network (HLN). We proved that the sets of all homogeneous, heterogeneous and multi-layered networks are subsets of the set of all HLNs depicting the model's…

Social and Information Networks · Computer Science 2025-03-03 Shraban Kumar Chatterjee , Suman Kundu

Neural architecture search (NAS) automatically finds the best task-specific neural network topology, outperforming many manual architecture designs. However, it can be prohibitively expensive as the search requires training thousands of…

Machine Learning · Computer Science 2020-12-21 Chris Zhang , Mengye Ren , Raquel Urtasun

Standard neural networks are often overconfident when presented with data outside the training distribution. We introduce HyperGAN, a new generative model for learning a distribution of neural network parameters. HyperGAN does not require…

Machine Learning · Computer Science 2020-07-16 Neale Ratzlaff , Li Fuxin

The recurrent geometric network (RGN), the first end-to-end differentiable neural architecture for protein structure prediction, is a competitive alternative to existing models. However, the RGN's use of recurrent neural networks (RNNs) as…

Biomolecules · Quantitative Biology 2019-08-05 Jin Li

Deep neural networks have enabled researchers to create powerful generalized frameworks, such as transformers, that can be used to solve well-studied problems in various application domains, such as text and image. However, such generalized…

Machine Learning · Computer Science 2025-02-12 Rudrajit Dawn , Madhusudan Ghosh , Partha Basuchowdhuri , Sudip Kumar Naskar

Large pre-trained models, or foundation models, have shown impressive performance when adapted to a variety of downstream tasks, often out-performing specialized models. Hypernetworks, neural networks that generate some or all of the…

Machine Learning · Computer Science 2025-03-04 Jeffrey Gu , Serena Yeung-Levy

Since real-world objects and their interactions are often multi-modal and multi-typed, heterogeneous networks have been widely used as a more powerful, realistic, and generic superclass of traditional homogeneous networks (graphs).…

Social and Information Networks · Computer Science 2020-12-18 Carl Yang , Yuxin Xiao , Yu Zhang , Yizhou Sun , Jiawei Han

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the…

Machine Learning · Computer Science 2016-12-02 David Ha , Andrew Dai , Quoc V. Le

We investigate the universal approximation property (UAP) of transformer-type architectures, providing a unified theoretical framework that extends prior results on residual networks to models incorporating attention mechanisms. Our work…

Machine Learning · Computer Science 2025-10-22 Jingpu Cheng , Ting Lin , Zuowei Shen , Qianxiao Li

Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and…

Machine Learning · Computer Science 2016-10-13 Martin Arjovsky , Amar Shah , Yoshua Bengio

A practical limitation of deep neural networks is their high degree of specialization to a single task and visual domain. Recently, inspired by the successes of transfer learning, several authors have proposed to learn instead universal,…

Computer Vision and Pattern Recognition · Computer Science 2018-03-28 Sylvestre-Alvise Rebuffi , Hakan Bilen , Andrea Vedaldi

In this work we introduce a convolutional neural network (CNN) that jointly handles low-, mid-, and high-level vision tasks in a unified architecture that is trained end-to-end. Such a universal network can act like a `swiss knife' for…

Computer Vision and Pattern Recognition · Computer Science 2016-09-08 Iasonas Kokkinos

Monotonic neural networks have recently been proposed as a way to define invertible transformations. These transformations can be combined into powerful autoregressive flows that have been shown to be universal approximators of continuous…

Machine Learning · Computer Science 2021-04-01 Antoine Wehenkel , Gilles Louppe

Training large neural networks is possible by training a smaller hypernetwork that predicts parameters for the large ones. A recently released Graph HyperNetwork (GHN) trained this way on one million smaller ImageNet architectures is able…

Computer Vision and Pattern Recognition · Computer Science 2022-07-21 Boris Knyazev

Deep learning models are increasingly deployed in safety-critical tasks where predictions must satisfy hard constraints, such as physical laws, fairness requirements, or safety limits. However, standard architectures lack built-in…

Machine Learning · Computer Science 2025-11-26 Gonzalo E. Constante-Flores , Hao Chen , Can Li

Recent research on graph neural networks (GNNs) has explored mechanisms for capturing local uncertainty and exploiting graph hierarchies to mitigate data sparsity and leverage structural properties. However, the synergistic integration of…

Machine Learning · Computer Science 2025-05-06 Yoonhyuk Choi , Jiho Choi , Taewook Ko , Chong-Kwon Kim

Due to the exponentially increased demands of mobile data traffic, e.g., a 1000-fold increase in traffic demand from 4G to 5G, network densification is considered as a key mechanism in the evolution of cellular networks, and ultra-dense…

Networking and Internet Architecture · Computer Science 2017-11-15 Jianping An , Kai Yang , Jinsong Wu , Neng Ye , Song Guo , Zhifang Liao
‹ Prev 1 2 3 10 Next ›