Related papers: Optimizing Mode Connectivity via Neuron Alignment

Input Space Mode Connectivity in Deep Neural Networks

We extend the concept of loss landscape mode connectivity to the input space of deep neural networks. Mode connectivity was originally studied within parameter space, where it describes the existence of low-loss paths between different…

Machine Learning · Computer Science 2024-09-10 Jakub Vrabel , Ori Shem-Ur , Yaron Oz , David Krueger

Generalized Linear Mode Connectivity for Transformers

Understanding the geometry of neural network loss landscapes is a central question in deep learning, with implications for generalization and optimization. A striking phenomenon is linear mode connectivity (LMC), where independently trained…

Machine Learning · Computer Science 2025-11-14 Alexander Theus , Alessandro Cabodi , Sotiris Anagnostidis , Antonio Orvieto , Sidak Pal Singh , Valentina Boeva

Understanding Mode Connectivity via Parameter Space Symmetry

Neural network minima are often connected by curves along which train and test loss remain nearly constant, a phenomenon known as mode connectivity. While this property has enabled applications such as model merging and fine-tuning, its…

Machine Learning · Computer Science 2025-05-30 Bo Zhao , Nima Dehmamy , Robin Walters , Rose Yu

Proving Linear Mode Connectivity of Neural Networks via Optimal Transport

The energy landscape of high-dimensional non-convex optimization problems is crucial to understanding the effectiveness of modern deep neural network architectures. Recent works have experimentally shown that two different solutions found…

Machine Learning · Computer Science 2024-03-04 Damien Ferbach , Baptiste Goujaud , Gauthier Gidel , Aymeric Dieuleveut

Git Re-Basin: Merging Models modulo Permutation Symmetries

The success of deep learning is due in large part to our ability to solve certain massive non-convex optimization problems with relative ease. Though non-convex optimization is NP-hard, simple algorithms -- often variants of stochastic…

Machine Learning · Computer Science 2023-03-03 Samuel K. Ainsworth , Jonathan Hayase , Siddhartha Srinivasa

Geodesic Mode Connectivity

Mode connectivity is a phenomenon where trained models are connected by a path of low loss. We reframe this in the context of Information Geometry, where neural networks are studied as spaces of parameterized distributions with curved…

Machine Learning · Computer Science 2023-08-25 Charlie Tan , Theodore Long , Sarah Zhao , Rudolf Laine

Large Scale Structure of Neural Network Loss Landscapes

There are many surprising and perhaps counter-intuitive properties of optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High…

Machine Learning · Computer Science 2019-06-12 Stanislav Fort , Stanislaw Jastrzebski

The Role of Symmetry in Optimizing Overparameterized Networks

Overparameterization is central to the success of deep learning, yet the mechanisms by which it improves optimization remain incompletely understood. We analyze weight-space symmetries in neural networks and show that overparameterization…

Machine Learning · Computer Science 2026-05-11 Kusha Sareen , Mohammad Pedramfar , Sékou-Oumar Kaba , Mehran Shakerinava , Siamak Ravanbakhsh

Simultaneous linear connectivity of neural networks modulo permutation

Neural networks typically exhibit permutation symmetries which contribute to the non-convexity of the networks' loss landscapes, since linearly interpolating between two permuted versions of a trained network tends to encounter a high loss…

Machine Learning · Computer Science 2024-04-10 Ekansh Sharma , Devin Kwok , Tom Denton , Daniel M. Roy , David Rolnick , Gintare Karolina Dziugaite

Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

The permutation symmetry of neurons in each layer of a deep neural network gives rise not only to multiple equivalent global minima of the loss function, but also to first-order saddle points located on the path between the global minima.…

Machine Learning · Computer Science 2019-07-08 Johanni Brea , Berfin Simsek , Bernd Illing , Wulfram Gerstner

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

Mode connectivity provides novel geometric insights on analyzing loss landscapes and enables building high-accuracy pathways between well-trained neural networks. In this work, we propose to employ mode connectivity in loss landscapes to…

Machine Learning · Computer Science 2020-07-06 Pu Zhao , Pin-Yu Chen , Payel Das , Karthikeyan Natesan Ramamurthy , Xue Lin

Unveiling Mode Connectivity in Graph Neural Networks

A fundamental challenge in understanding graph neural networks (GNNs) lies in characterizing their optimization dynamics and loss landscape geometry, critical for improving interpretability and robustness. While mode connectivity, a lens…

Machine Learning · Computer Science 2025-02-19 Bingheng Li , Zhikai Chen , Haoyu Han , Shenglai Zeng , Jingzhe Liu , Jiliang Tang

A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks

Recent work on mode connectivity in the loss landscape of deep neural networks has demonstrated that the locus of (sub-)optimal weight vectors lies on continuous paths. In this work, we train a neural network that serves as a hypernetwork,…

Machine Learning · Statistics 2019-05-09 Lior Deutsch , Erik Nijkamp , Yu Yang

Combining Optimal Path Search With Task-Dependent Learning in a Neural Network

Finding optimal paths in connected graphs requires determining the smallest total cost for traveling along the graph's edges. This problem can be solved by several classical algorithms where, usually, costs are predefined for all edges.…

Machine Learning · Computer Science 2023-11-03 Tomas Kulvicius , Minija Tamosiunaite , Florentin Wörgötter

Mechanistic Mode Connectivity

We study neural network loss landscapes through the lens of mode connectivity, the observation that minimizers of neural networks retrieved via training on a dataset are connected via simple paths of low loss. Specifically, we ask the…

Machine Learning · Computer Science 2023-06-02 Ekdeep Singh Lubana , Eric J. Bigelow , Robert P. Dick , David Krueger , Hidenori Tanaka

Topology and Geometry of Half-Rectified Network Optimization

The loss surface of deep neural networks has recently attracted interest in the optimization and machine learning communities as a prime example of high-dimensional non-convex problem. Some insights were recently gained using spin glass…

Machine Learning · Statistics 2017-06-05 C. Daniel Freeman , Joan Bruna

Explaining Landscape Connectivity of Low-cost Solutions for Multilayer Nets

Mode connectivity is a surprising phenomenon in the loss landscape of deep nets. Optima -- at least those discovered by gradient-based optimization -- turn out to be connected by simple paths on which the loss function is almost constant.…

Machine Learning · Computer Science 2020-01-07 Rohith Kuditipudi , Xiang Wang , Holden Lee , Yi Zhang , Zhiyuan Li , Wei Hu , Sanjeev Arora , Rong Ge

On permutation symmetries in Bayesian neural network posteriors: a variational perspective

The elusive nature of gradient-based optimization in neural networks is tied to their loss landscape geometry, which is poorly understood. However recent work has brought solid evidence that there is essentially no loss barrier between the…

Machine Learning · Statistics 2023-10-17 Simone Rossi , Ankit Singh , Thomas Hannagan

A Tale of Two Symmetries: Exploring the Loss Landscape of Equivariant Models

Equivariant neural networks have proven to be effective for tasks with known underlying symmetries. However, optimizing equivariant networks can be tricky and best training practices are less established than for standard networks. In…

Machine Learning · Computer Science 2025-11-04 YuQing Xie , Tess Smidt

Deep Model Merging: The Sister of Neural Network Interpretability -- A Survey

We survey the model merging literature through the lens of loss landscape geometry to connect observations from empirical studies on model merging and loss landscape analysis to phenomena that govern neural network training and the…

Machine Learning · Computer Science 2025-03-25 Arham Khan , Todd Nief , Nathaniel Hudson , Mansi Sakarvadia , Daniel Grzenda , Aswathy Ajith , Jordan Pettyjohn , Kyle Chard , Ian Foster