Related papers: Permutation Equivariant Neural Functionals

200 papers

Designing machine learning architectures for processing neural networks in their raw weight matrix form is a newly introduced research direction. Unfortunately, the unique symmetry structure of deep weight spaces makes this design very…

Machine Learning · Computer Science 2023-06-02 Aviv Navon, Aviv Shamsian, Idan Achituve, Ethan Fetaya, Gal Chechik, Haggai Maron

Neural networks that process the parameters of other neural networks find applications in domains as diverse as classifying implicit neural representations, generating neural network weights, and predicting generalization errors. However,…

Neural functional networks (NFNs) have recently gained significant attention due to their diverse applications, ranging from predicting network generalization and network editing to classifying implicit neural representation. Previous NFN…

Machine Learning · Computer Science 2025-03-14 Viet-Hoang Tran, Thieu N. Vo, Tho H. Tran, An T. Nguyen, Tan M. Nguyen

A challenging problem in many modern machine learning tasks is to process weight-space features, i.e., to transform or extract information from the weights and gradients of a neural network. Recent works have developed promising…

Machine Learning · Computer Science 2024-02-09 Allan Zhou, Chelsea Finn, James Harrison

This paper systematically explores neural functional networks (NFN) for transformer architectures. NFN are specialized neural networks that treat the weights, gradients, or sparsity patterns of a deep neural network (DNN) as input data and…

The recent success of neural networks as implicit representation of data has driven growing interest in neural functionals: models that can process other neural networks as input by operating directly over their weight spaces. Nevertheless,…

Machine Learning · Computer Science 2023-05-24 Allan Zhou, Kaien Yang, Yiding Jiang, Kaylee Burns, Winnie Xu, Samuel Sokota, J. Zico Kolter, Chelsea Finn

Neural Functional Networks (NFNs) have gained increasing interest due to their wide range of applications, including extracting information from implicit representations of data, editing network weights, and evaluating policies. A key…

Equivariant machine learning is an approach for designing deep learning models that respect the symmetries of the problem, with the aim of reducing model complexity and improving generalization. In this paper, we focus on an extension of…

Machine Learning · Computer Science 2024-12-10 Ya-Wei Eileen Lin, Ronen Talmon, Ron Levie

Neural networks efficiently encode learned information within their parameters. Consequently, many tasks can be unified by treating neural networks themselves as input data. When doing so, recent studies demonstrated the importance of…

Machine Learning · Computer Science 2024-01-02 Derek Lim, Haggai Maron, Marc T. Law, Jonathan Lorraine, James Lucas

Graph Neural Networks (GNNs), especially message-passing neural networks (MPNNs), have emerged as powerful architectures for learning on graphs in diverse applications. However, MPNNs face challenges when modeling non-local interactions in…

Machine Learning · Statistics 2024-01-31 Ilyes Batatia, Lars L. Schaaf, Huajie Chen, Gábor Csányi, Christoph Ortner, Felix A. Faber

Employing equivariance in neural networks leads to greater parameter efficiency and improved generalization performance through the encoding of domain knowledge in the architecture; however, the majority of existing approaches require an a…

Machine Learning · Computer Science 2023-05-31 Emmanouil Theodosis, Karim Helwani, Demba Ba

Quantum neural network architectures that have little-to-no inductive biases are known to face trainability and generalization issues. Inspired by a similar problem, recent breakthroughs in machine learning address this challenge by…

Incorporating permutation equivariance into neural networks has proven to be useful in ensuring that models respect symmetries that exist in data. Symmetric tensors, which naturally appear in statistics, machine learning, and graph theory,…

Machine Learning · Computer Science 2025-05-26 Edward Pearce-Crump

Permutation symmetries of deep networks make basic operations like model merging and similarity estimation challenging. In many cases, aligning the weights of the networks, i.e., finding optimal permutations between their weights, is…

Machine Learning · Computer Science 2024-11-12 Aviv Navon, Aviv Shamsian, Ethan Fetaya, Gal Chechik, Nadav Dym, Haggai Maron

This paper pertains to an emerging machine learning paradigm: learning higher-order functions, i.e. functions whose inputs are functions themselves, particularly when these inputs are Neural Networks (NNs). With the growing…

Machine Learning · Computer Science 2024-10-31 Ioannis Kalogeropoulos, Giorgos Bouritsas, Yannis Panagakis

Equivariances provide useful inductive biases in neural network modeling, with the translation equivariance of convolutional neural networks being a canonical example. Equivariances can be embedded in architectures through weight-sharing…

Machine Learning · Computer Science 2022-11-15 Tycho F. A. van der Ouderaa, David W. Romero, Mark van der Wilk

The set of functions parameterized by a linear fully-connected neural network is a determinantal variety. We investigate the subvariety of functions that are equivariant or invariant under the action of a permutation group. Examples of such…

Machine Learning · Computer Science 2025-01-13 Kathlén Kohn, Anna-Laura Sattelberger, Vahid Shahverdi

Group equivariant convolutional neural networks (G-CNNs) are generalizations of convolutional neural networks (CNNs) which excel in a wide range of technical applications by explicitly encoding symmetries, such as rotations and…

Machine Learning · Computer Science 2022-09-14 Hannah Lawrence, Kristian Georgiev, Andrew Dienes, Bobak T. Kiani

Metanetworks are neural architectures designed to operate directly on pretrained weights to perform downstream tasks. However, the parameter space serves only as a proxy for the underlying function class, and the parameter-function mapping…

Machine Learning · Computer Science 2026-04-28 Viet-Hoang Tran, An Nguyen, Benoît Guérand, Thieu N. Vo, Tan M. Nguyen

The introduction of convolutional layers greatly advanced the performance of neural networks on image tasks due to innately capturing a way of encoding and learning translation-invariant operations, matching one of the underlying symmetries…

Computer Vision and Pattern Recognition · Computer Science 2016-12-15 Nicholas Guttenberg, Nathaniel Virgo, Olaf Witkowski, Hidetoshi Aoki, Ryota Kanai