Related papers: Data-driven effective model shows a liquid-like de…

Geometry of energy landscapes and the optimizability of deep neural networks

Deep neural networks are workhorse models in machine learning with multiple layers of non-linear functions composed in series. Their loss function is highly non-convex, yet empirically even gradient descent minimisation is sufficient to…

Disordered Systems and Neural Networks · Physics 2020-03-18 Simon Becker , Yao Zhang , Alpha A. Lee

Geometric Flow Models over Neural Network Weights

Deep generative models such as flow and diffusion models have proven to be effective in modeling high-dimensional and complex data types such as videos or proteins, and this has motivated their use in different data modalities, such as…

Machine Learning · Computer Science 2025-04-08 Ege Erdogan

Large Scale Structure of Neural Network Loss Landscapes

There are many surprising and perhaps counter-intuitive properties of optimization of deep neural networks. We propose and experimentally verify a unified phenomenological model of the loss landscape that incorporates many of them. High…

Machine Learning · Computer Science 2019-06-12 Stanislav Fort , Stanislaw Jastrzebski

From complex to simple : hierarchical free-energy landscape renormalized in deep neural networks

We develop a statistical mechanical approach based on the replica method to study the design space of deep and wide neural networks constrained to meet a large number of training data. Specifically, we analyze the configuration space of the…

Disordered Systems and Neural Networks · Physics 2020-04-17 Hajime Yoshino

Geometric Properties and Graph-Based Optimization of Neural Networks: Addressing Non-Linearity, Dimensionality, and Scalability

Deep learning models are often considered black boxes due to their complex hierarchical transformations. Identifying suitable architectures is crucial for maximizing predictive performance with limited data. Understanding the geometric…

Machine Learning · Computer Science 2025-03-11 Michael Wienczkowski , Addisu Desta , Paschal Ugochukwu

On the Modeling of Error Functions as High Dimensional Landscapes for Weight Initialization in Learning Networks

Next generation deep neural networks for classification hosted on embedded platforms will rely on fast, efficient, and accurate learning algorithms. Initialization of weights in learning networks has a great impact on the classification…

Machine Learning · Computer Science 2016-07-21 Julius , Gopinath Mahale , Sumana T. , C. S. Adityakrishna

MLDS: A Dataset for Weight-Space Analysis of Neural Networks

Neural networks are powerful models that solve a variety of complex real-world problems. However, the stochastic nature of training and large number of parameters in a typical neural model makes them difficult to evaluate via inspection.…

Machine Learning · Computer Science 2021-04-22 John Clemens

Geometric Understanding of Deep Learning

Deep learning is the mainstream technique for many machine learning tasks, including image recognition, machine translation, speech recognition, and so on. It has outperformed conventional methods in various fields and achieved great…

Machine Learning · Computer Science 2018-06-01 Na Lei , Zhongxuan Luo , Shing-Tung Yau , David Xianfeng Gu

Hyperbolic Geometry of Complex Networks

We develop a geometric framework to study the structure and function of complex networks. We assume that hyperbolic geometry underlies these networks, and we show that with this assumption, heterogeneous degree distributions and strong…

Statistical Mechanics · Physics 2010-09-14 Dmitri Krioukov , Fragkiskos Papadopoulos , Maksim Kitsak , Amin Vahdat , Marian Boguna

On Learnable Parameters of Optimal and Suboptimal Deep Learning Models

We scrutinize the structural and operational aspects of deep learning models, particularly focusing on the nuances of learnable parameters (weight) statistics, distribution, node interaction, and visualization. By establishing correlations…

Machine Learning · Computer Science 2024-08-22 Ziwei Zheng , Huizhi Liang , Vaclav Snasel , Vito Latora , Panos Pardalos , Giuseppe Nicosia , Varun Ojha

Solving hybrid machine learning tasks by traversing weight space geodesics

Machine learning problems have an intrinsic geometric structure as central objects including a neural network's weight space and the loss function associated with a particular task can be viewed as encoding the intrinsic geometry of a given…

Machine Learning · Computer Science 2021-06-08 Guruprasad Raghavan , Matt Thomson

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

Geometric deep learning: going beyond Euclidean data

Many scientific fields study data with an underlying structure that is a non-Euclidean space. Some examples include social networks in computational social sciences, sensor networks in communications, functional networks in brain imaging,…

Computer Vision and Pattern Recognition · Computer Science 2017-08-02 Michael M. Bronstein , Joan Bruna , Yann LeCun , Arthur Szlam , Pierre Vandergheynst

Doing the impossible: Why neural networks can be trained at all

As deep neural networks grow in size, from thousands to millions to billions of weights, the performance of those networks becomes limited by our ability to accurately train them. A common naive question arises: if we have a system with…

Machine Learning · Computer Science 2018-05-29 Nathan O. Hodas , Panos Stinis

Classifying the classifier: dissecting the weight space of neural networks

This paper presents an empirical study on the weights of neural networks, where we interpret each model as a point in a high-dimensional space -- the neural weight space. To explore the complex structure of this space, we sample from a…

Computer Vision and Pattern Recognition · Computer Science 2020-02-14 Gabriel Eilertsen , Daniel Jönsson , Timo Ropinski , Jonas Unger , Anders Ynnerman

Topology and Geometry of Half-Rectified Network Optimization

The loss surface of deep neural networks has recently attracted interest in the optimization and machine learning communities as a prime example of high-dimensional non-convex problem. Some insights were recently gained using spin glass…

Machine Learning · Statistics 2017-06-05 C. Daniel Freeman , Joan Bruna

A Geometric Modeling of Occam's Razor in Deep Learning

Why do deep neural networks (DNNs) benefit from very high dimensional parameter spaces? Their huge parameter complexities vs stunning performance in practice is all the more intriguing and not explainable using the standard theory of model…

Machine Learning · Computer Science 2025-06-12 Ke Sun , Frank Nielsen

A Generative Model for Sampling High-Performance and Diverse Weights for Neural Networks

Recent work on mode connectivity in the loss landscape of deep neural networks has demonstrated that the locus of (sub-)optimal weight vectors lies on continuous paths. In this work, we train a neural network that serves as a hypernetwork,…

Machine Learning · Statistics 2019-05-09 Lior Deutsch , Erik Nijkamp , Yu Yang

Modelling the influence of data structure on learning in neural networks: the hidden manifold model

Understanding the reasons for the success of deep neural networks trained using stochastic gradient-based methods is a key open problem for the nascent theory of deep learning. The types of data where these networks are most successful,…

Machine Learning · Statistics 2020-12-04 Sebastian Goldt , Marc Mézard , Florent Krzakala , Lenka Zdeborová

Theoretical insights into the optimization landscape of over-parameterized shallow neural networks

In this paper we study the problem of learning a shallow artificial neural network that best fits a training data set. We study this problem in the over-parameterized regime where the number of observations are fewer than the number of…

Machine Learning · Computer Science 2022-08-25 Mahdi Soltanolkotabi , Adel Javanmard , Jason D. Lee