Related papers: Universal Neural Functionals

Permutation Equivariant Neural Functionals

This work studies the design of neural networks that can process the weights or gradients of other neural networks, which we refer to as neural functional networks (NFNs). Despite a wide range of potential applications, including learned…

Machine Learning · Computer Science 2023-09-27 Allan Zhou , Kaien Yang , Kaylee Burns , Adriano Cardace , Yiding Jiang , Samuel Sokota , J. Zico Kolter , Chelsea Finn

On the universality of neural encodings in CNNs

We explore the universality of neural encodings in convolutional neural networks trained on image classification tasks. We develop a procedure to directly compare the learned weights rather than their representations. It is based on a…

Machine Learning · Computer Science 2024-10-01 Florentin Guth , Brice Ménard

Neural Networks Trained by Weight Permutation are Universal Approximators

The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a…

Machine Learning · Computer Science 2025-03-21 Yongqiang Cai , Gaohang Chen , Zhonghua Qiao

Equivariant Architectures for Learning in Deep Weight Spaces

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very…

Machine Learning · Computer Science 2023-06-02 Aviv Navon , Aviv Shamsian , Idan Achituve , Ethan Fetaya , Gal Chechik , Haggai Maron

Graph Neural Networks for Learning Equivariant Representations of Neural Networks

Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However,…

Machine Learning · Computer Science 2024-07-24 Miltiadis Kofinas , Boris Knyazev , Yan Zhang , Yunlu Chen , Gertjan J. Burghouts , Efstratios Gavves , Cees G. M. Snoek , David W. Zhang

Neural Functional Transformers

The recent success of neural networks as implicit representation of data has driven growing interest in neural functionals: models that can process other neural networks as input by operating directly over their weight spaces. Nevertheless,…

Machine Learning · Computer Science 2023-05-24 Allan Zhou , Kaien Yang , Yiding Jiang , Kaylee Burns , Winnie Xu , Samuel Sokota , J. Zico Kolter , Chelsea Finn

On the Expressive Power of Permutation-Equivariant Weight-Space Networks

Weight-space learning studies neural architectures that operate directly on the parameters of other neural networks. Motivated by the growing availability of pretrained models, recent work has demonstrated the effectiveness of weight-space…

Machine Learning · Computer Science 2026-02-03 Adir Dayan , Yam Eitan , Haggai Maron

A Functional Perspective on Learning Symmetric Functions with Neural Networks

Symmetric functions, which take as input an unordered, fixed-size set, are known to be universally representable by neural networks that enforce permutation invariance. These architectures only give guarantees for fixed input sizes, yet in…

Machine Learning · Computer Science 2022-10-11 Aaron Zweig , Joan Bruna

Universally Invariant Learning in Equivariant GNNs

Equivariant Graph Neural Networks (GNNs) have demonstrated significant success across various applications. To achieve completeness -- that is, the universal approximation property over the space of equivariant functions -- the network must…

Machine Learning · Computer Science 2025-10-16 Jiacheng Cen , Anyi Li , Ning Lin , Tingyang Xu , Yu Rong , Deli Zhao , Zihe Wang , Wenbing Huang

Permutation Equivariant Neural Networks for Symmetric Tensors

Incorporating permutation equivariance into neural networks has proven to be useful in ensuring that models respect symmetries that exist in data. Symmetric tensors, which naturally appear in statistics, machine learning, and graph theory,…

Machine Learning · Computer Science 2025-05-26 Edward Pearce-Crump

Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters

Equivariant machine learning is an approach for designing deep learning models that respect the symmetries of the problem, with the aim of reducing model complexity and improving generalization. In this paper, we focus on an extension of…

Machine Learning · Computer Science 2024-12-10 Ya-Wei Eileen Lin , Ronen Talmon , Ron Levie

Unconstrained Monotonic Neural Networks

Monotonic neural networks have recently been proposed as a way to define invertible transformations. These transformations can be combined into powerful autoregressive flows that have been shown to be universal approximators of continuous…

Machine Learning · Computer Science 2021-04-01 Antoine Wehenkel , Gilles Louppe

On the Universality of Rotation Equivariant Point Cloud Networks

Learning functions on point clouds has applications in many fields, including computer vision, computer graphics, physics, and chemistry. Recently, there has been a growing interest in neural architectures that are invariant or equivariant…

Machine Learning · Computer Science 2020-10-07 Nadav Dym , Haggai Maron

Modeling Structure with Undirected Neural Networks

Neural networks are powerful function estimators, leading to their status as a paradigm of choice for modeling structured data. However, unlike other structured representations that emphasize the modularity of the problem -- e.g., factor…

Machine Learning · Computer Science 2022-06-20 Tsvetomila Mihaylova , Vlad Niculae , André F. T. Martins

Deep Neural Network Approximation of Invariant Functions through Dynamical Systems

We study the approximation of functions which are invariant with respect to certain permutations of the input indices using flow maps of dynamical systems. Such invariant functions includes the much studied translation-invariant ones…

Machine Learning · Computer Science 2022-08-19 Qianxiao Li , Ting Lin , Zuowei Shen

The Universal Weight Subspace Hypothesis

We show that deep neural networks trained across diverse tasks exhibit remarkably similar low-dimensional parametric subspaces. We provide the first large-scale empirical evidence that demonstrates that neural networks systematically…

Machine Learning · Computer Science 2025-12-09 Prakhar Kaushik , Shravan Chaudhari , Ankit Vaidya , Rama Chellappa , Alan Yuille

Tunable Efficient Unitary Neural Networks (EUNN) and their application to RNNs

Using unitary (instead of general) matrices in artificial neural networks (ANNs) is a promising way to solve the gradient explosion/vanishing problem, as well as to enable ANNs to learn long-term correlations in the data. This approach…

Machine Learning · Computer Science 2017-04-04 Li Jing , Yichen Shen , Tena Dubček , John Peurifoy , Scott Skirlo , Yann LeCun , Max Tegmark , Marin Soljačić

Efficient Learning for Deep Quantum Neural Networks

Neural networks enjoy widespread success in both research and industry and, with the imminent advent of quantum technology, it is now a crucial challenge to design quantum neural networks for fully quantum learning tasks. Here we propose…

Quantum Physics · Physics 2020-04-30 Kerstin Beer , Dmytro Bondarenko , Terry Farrelly , Tobias J. Osborne , Robert Salzmann , Ramona Wolf

Expressivity of Neural Networks with Random Weights and Learned Biases

Landmark universal function approximation results for neural networks with trained weights and biases provided the impetus for the ubiquitous use of neural networks as learning models in neuroscience and Artificial Intelligence (AI). Recent…

Neural and Evolutionary Computing · Computer Science 2025-03-25 Ezekiel Williams , Alexandre Payeur , Avery Hee-Woon Ryoo , Thomas Jiralerspong , Matthew G. Perich , Luca Mazzucato , Guillaume Lajoie

Unitary Evolution Recurrent Neural Networks

Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and…

Machine Learning · Computer Science 2016-10-13 Martin Arjovsky , Amar Shah , Yoshua Bengio