Related papers: Permutation Equivariant Neural Functionals

Equivariant Architectures for Learning in Deep Weight Spaces

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very…

Machine Learning · Computer Science 2023-06-02 Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik, Haggai Maron

Graph Neural Networks for Learning Equivariant Representations of Neural Networks

Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However,…

Machine Learning · Computer Science 2024-07-24 Miltiadis Kofinas, Boris Knyazev, Yan Zhang, Yunlu Chen, Gertjan J. Burghouts, Efstratios Gavves, Cees G. M. Snoek, David W. Zhang

Monomial Matrix Group Equivariant Neural Functional Networks

Neural functional networks (NFNs) have recently gained significant attention due to their diverse applications, ranging from predicting network generalization and network editing to classifying implicit neural representation. Previous NFN…

Machine Learning · Computer Science 2025-03-14 Viet-Hoang Tran, Thieu N. Vo, Tho H. Tran, An T. Nguyen, Tan M. Nguyen

Universal Neural Functionals

A challenging problem in many modern machine learning tasks is to process weight-space features, i.e., to transform or extract information from the weights and gradients of a neural network. Recent works have developed promising…

Machine Learning · Computer Science 2024-02-09 Allan Zhou, Chelsea Finn, James Harrison

Equivariant Neural Functional Networks for Transformers

This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and…

Machine Learning · Computer Science 2025-03-10 Viet-Hoang Tran, Thieu N. Vo, An Nguyen The, Tho Tran Huu, Minh-Khoi Nguyen-Nhat, Thanh Tran, Duy-Tung Pham, Tan Minh Nguyen

Neural Functional Transformers

The recent success of neural networks as implicit representation of data has driven growing interest in neural functionals: models that can process other neural networks as input by operating directly over their weight spaces. Nevertheless,…

Machine Learning · Computer Science 2023-05-24 Allan Zhou, Kaien Yang, Yiding Jiang, Kaylee Burns, Winnie Xu, Samuel Sokota, J. Zico Kolter, Chelsea Finn

Equivariant Polynomial Functional Networks

Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key…

Machine Learning · Computer Science 2025-12-23 Thieu N. Vo, Viet-Hoang Tran, Tho Tran Huu, An Nguyen The, Thanh Tran, Minh-Khoi Nguyen-Nhat, Duy-Tung Pham, Tan Minh Nguyen

Equivariant Machine Learning on Graphs with Nonlinear Spectral Filters

Equivariant machine learning is an approach for designing deep learning models that respect the symmetries of the problem, with the aim of reducing model complexity and improving generalization. In this paper, we focus on an extension of…

Machine Learning · Computer Science 2024-12-10 Ya-Wei Eileen Lin, Ronen Talmon, Ron Levie

Graph Metanetworks for Processing Diverse Neural Architectures

Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of…

Machine Learning · Computer Science 2024-01-02 Derek Lim, Haggai Maron, Marc T. Law, Jonathan Lorraine, James Lucas

Equivariant Matrix Function Neural Networks

Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in…

Machine Learning · Statistics 2024-01-31 Ilyes Batatia, Lars L. Schaaf, Huajie Chen, Gábor Csányi, Christoph Ortner, Felix A. Faber

Learning Linear Groups in Neural Networks

Employing equivariance in neural networks leads to greater parameter efficiency and improved generalization performance through the encoding of domain knowledge in the architecture; however, the majority of existing approaches require an a…

Machine Learning · Computer Science 2023-05-31 Emmanouil Theodosis, Karim Helwani, Demba Ba

Theory for Equivariant Quantum Neural Networks

Quantum neural network architectures that have little-to-no inductive biases are known to face trainability and generalization issues. Inspired by a similar problem, recent breakthroughs in machine learning address this challenge by…

Quantum Physics · Physics 2024-05-14 Quynh T. Nguyen, Louis Schatzki, Paolo Braccia, Michael Ragone, Patrick J. Coles, Frederic Sauvage, Martin Larocca, M. Cerezo

Permutation Equivariant Neural Networks for Symmetric Tensors

Incorporating permutation equivariance into neural networks has proven to be useful in ensuring that models respect symmetries that exist in data. Symmetric tensors, which naturally appear in statistics, machine learning, and graph theory,…

Machine Learning · Computer Science 2025-05-26 Edward Pearce-Crump

Equivariant Deep Weight Space Alignment

Permutation symmetries of deep networks make basic operations like model merging and similarity estimation challenging. In many cases, aligning the weights of the networks, i.e., finding optimal permutations between their weights, is…

Machine Learning · Computer Science 2024-11-12 Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Nadav Dym, Haggai Maron

Scale Equivariant Graph Metanetworks

This paper pertains to an emerging machine learning paradigm: learning higher-order functions, i.e. functions whose inputs are functions themselves, particularly when these inputs are Neural Networks (NNs). With the growing…

Machine Learning · Computer Science 2024-10-31 Ioannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis

Relaxing Equivariance Constraints with Non-stationary Continuous Filters

Equivariances provide useful inductive biases in neural network modeling, with the translation equivariance of convolutional neural networks being a canonical example. Equivariances can be embedded in architectures through weight-sharing…

Machine Learning · Computer Science 2022-11-15 Tycho F. A. van der Ouderaa, David W. Romero, Mark van der Wilk

Geometry of Linear Neural Networks: Equivariance and Invariance under Permutation Groups

The set of functions parameterized by a linear fully-connected neural network is a determinantal variety. We investigate the subvariety of functions that are equivariant or invariant under the action of a permutation group. Examples of such…

Machine Learning · Computer Science 2025-01-13 Kathlén Kohn, Anna-Laura Sattelberger, Vahid Shahverdi

Implicit Bias of Linear Equivariant Networks

Group equivariant convolutional neural networks (G-CNNs) are generalizations of convolutional neural networks (CNNs) which excel in a wide range of technical applications by explicitly encoding symmetries, such as rotations and…

Machine Learning · Computer Science 2022-09-14 Hannah Lawrence, Kristian Georgiev, Andrew Dienes, Bobak T. Kiani

Quasi-Equivariant Metanetworks

Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping…

Machine Learning · Computer Science 2026-04-28 Viet-Hoang Tran, An Nguyen, Benoît Guérand, Thieu N. Vo, Tan M. Nguyen

Permutation-equivariant neural networks applied to dynamics prediction

The introduction of convolutional layers greatly advanced the performance of neural networks on image tasks due to innately capturing a way of encoding and learning translation-invariant operations, matching one of the underlying symmetries…

Computer Vision and Pattern Recognition · Computer Science 2016-12-15 Nicholas Guttenberg, Nathaniel Virgo, Olaf Witkowski, Hidetoshi Aoki, Ryota Kanai