English
Related papers

Related papers: Optimizing Mode Connectivity via Neuron Alignment

200 papers

We extend the concept of loss landscape mode connectivity to the input space of deep neural networks. Mode connectivity was originally studied within parameter space, where it describes the existence of low-loss paths between different…

Machine Learning · Computer Science 2024-09-10 Jakub Vrabel , Ori Shem-Ur , Yaron Oz , David Krueger

Understanding the geometry of neural network loss landscapes is a central question in deep learning, with implications for generalization and optimization. A striking phenomenon is linear mode connectivity (LMC), where independently trained…

Machine Learning · Computer Science 2025-11-14 Alexander Theus , Alessandro Cabodi , Sotiris Anagnostidis , Antonio Orvieto , Sidak Pal Singh , Valentina Boeva

Neural network minima are often connected by curves along which train and test loss remain nearly constant, a phenomenon known as mode connectivity. While this property has enabled applications such as model merging and fine-tuning, its…

Machine Learning · Computer Science 2025-05-30 Bo Zhao , Nima Dehmamy , Robin Walters , Rose Yu

The energy landscape of high-dimensional non-convex optimization problems is crucial to understanding the effectiveness of modern deep neural network architectures. Recent works have experimentally shown that two different solutions found…

Machine Learning · Computer Science 2024-03-04 Damien Ferbach , Baptiste Goujaud , Gauthier Gidel , Aymeric Dieuleveut

The success of deep learning is due in large part to our ability to solve certain massive non-convex optimization problems with relative ease. Though non-convex optimization is NP-hard, simple algorithms -- often variants of stochastic…

Machine Learning · Computer Science 2023-03-03 Samuel K. Ainsworth , Jonathan Hayase , Siddhartha Srinivasa

Mode connectivity is a phenomenon where trained models are connected by a path of low loss. We reframe this in the context of Information Geometry, where neural networks are studied as spaces of parameterized distributions with curved…

Machine Learning · Computer Science 2023-08-25 Charlie Tan , Theodore Long , Sarah Zhao , Rudolf Laine

There are many surprising and perhaps counter-intuitive properties of optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High…

Machine Learning · Computer Science 2019-06-12 Stanislav Fort , Stanislaw Jastrzebski

Overparameterization is central to the success of deep learning, yet the mechanisms by which it improves optimization remain incompletely understood. We analyze weight-space symmetries in neural networks and show that overparameterization…

Machine Learning · Computer Science 2026-05-11 Kusha Sareen , Mohammad Pedramfar , Sékou-Oumar Kaba , Mehran Shakerinava , Siamak Ravanbakhsh

Neural networks typically exhibit permutation symmetries which contribute to the non-convexity of the networks' loss landscapes, since linearly interpolating between two permuted versions of a trained network tends to encounter a high loss…

Machine Learning · Computer Science 2024-04-10 Ekansh Sharma , Devin Kwok , Tom Denton , Daniel M. Roy , David Rolnick , Gintare Karolina Dziugaite

The permutation symmetry of neurons in each layer of a deep neural network gives rise not only to multiple equivalent global minima of the loss function, but also to first-order saddle points located on the path between the global minima.…

Machine Learning · Computer Science 2019-07-08 Johanni Brea , Berfin Simsek , Bernd Illing , Wulfram Gerstner

Mode connectivity provides novel geometric insights on analyzing loss landscapes and enables building high-accuracy pathways between well-trained neural networks. In this work, we propose to employ mode connectivity in loss landscapes to…

Machine Learning · Computer Science 2020-07-06 Pu Zhao , Pin-Yu Chen , Payel Das , Karthikeyan Natesan Ramamurthy , Xue Lin

A fundamental challenge in understanding graph neural networks (GNNs) lies in characterizing their optimization dynamics and loss landscape geometry, critical for improving interpretability and robustness. While mode connectivity, a lens…

Machine Learning · Computer Science 2025-02-19 Bingheng Li , Zhikai Chen , Haoyu Han , Shenglai Zeng , Jingzhe Liu , Jiliang Tang

Recent work on mode connectivity in the loss landscape of deep neural networks has demonstrated that the locus of (sub-)optimal weight vectors lies on continuous paths. In this work, we train a neural network that serves as a hypernetwork,…

Machine Learning · Statistics 2019-05-09 Lior Deutsch , Erik Nijkamp , Yu Yang

Finding optimal paths in connected graphs requires determining the smallest total cost for traveling along the graph's edges. This problem can be solved by several classical algorithms where, usually, costs are predefined for all edges.…

Machine Learning · Computer Science 2023-11-03 Tomas Kulvicius , Minija Tamosiunaite , Florentin Wörgötter

We study neural network loss landscapes through the lens of mode connectivity, the observation that minimizers of neural networks retrieved via training on a dataset are connected via simple paths of low loss. Specifically, we ask the…

Machine Learning · Computer Science 2023-06-02 Ekdeep Singh Lubana , Eric J. Bigelow , Robert P. Dick , David Krueger , Hidenori Tanaka

The loss surface of deep neural networks has recently attracted interest in the optimization and machine learning communities as a prime example of high-dimensional non-convex problem. Some insights were recently gained using spin glass…

Machine Learning · Statistics 2017-06-05 C. Daniel Freeman , Joan Bruna

Mode connectivity is a surprising phenomenon in the loss landscape of deep nets. Optima -- at least those discovered by gradient-based optimization -- turn out to be connected by simple paths on which the loss function is almost constant.…

Machine Learning · Computer Science 2020-01-07 Rohith Kuditipudi , Xiang Wang , Holden Lee , Yi Zhang , Zhiyuan Li , Wei Hu , Sanjeev Arora , Rong Ge

The elusive nature of gradient-based optimization in neural networks is tied to their loss landscape geometry, which is poorly understood. However recent work has brought solid evidence that there is essentially no loss barrier between the…

Machine Learning · Statistics 2023-10-17 Simone Rossi , Ankit Singh , Thomas Hannagan

Equivariant neural networks have proven to be effective for tasks with known underlying symmetries. However, optimizing equivariant networks can be tricky and best training practices are less established than for standard networks. In…

Machine Learning · Computer Science 2025-11-04 YuQing Xie , Tess Smidt

We survey the model merging literature through the lens of loss landscape geometry to connect observations from empirical studies on model merging and loss landscape analysis to phenomena that govern neural network training and the…

‹ Prev 1 2 3 10 Next ›