English
Related papers

Related papers: Predicting Neural Network Accuracy from Weights

200 papers

We demonstrate that there is significant redundancy in the parameterization of several deep learning models. Given only a few weight values for each feature it is possible to accurately predict the remaining values. Moreover, we show that…

Machine Learning · Computer Science 2014-10-28 Misha Denil , Babak Shakibi , Laurent Dinh , Marc'Aurelio Ranzato , Nando de Freitas

Not all neural network architectures are created equal, some perform much better than others for certain tasks. But how important are the weight parameters of a neural network compared to its architecture? In this work, we question to what…

Machine Learning · Computer Science 2019-09-06 Adam Gaier , David Ha

We introduce an algorithm where the individual bits representing the weights of a neural network are learned. This method allows training weights with integer values on arbitrary bit-depths and naturally uncovers sparse networks, without…

Machine Learning · Computer Science 2022-02-22 Cristian Ivan

Recent results in nonparametric regression show that for deep learning, i.e., for neural network estimates with many hidden layers, we are able to achieve good rates of convergence even in case of high-dimensional predictor variables,…

Statistics Theory · Mathematics 2019-12-12 Alina Braun , Michael Kohler , Adam Krzyzak

Training a neural network is synonymous with learning the values of the weights. By contrast, we demonstrate that randomly weighted neural networks contain subnetworks which achieve impressive performance without ever training the weight…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Vivek Ramanujan , Mitchell Wortsman , Aniruddha Kembhavi , Ali Farhadi , Mohammad Rastegari

Neural networks based on metric recognition methods have a strictly determined architecture. Number of neurons, connections, as well as weights and thresholds values are calculated analytically, based on the initial conditions of tasks:…

Neural and Evolutionary Computing · Computer Science 2025-06-10 Polad Geidarov

Neural Networks are function approximators that have achieved state-of-the-art accuracy in numerous machine learning tasks. In spite of their great success in terms of accuracy, their large training time makes it difficult to use them for…

Machine Learning · Computer Science 2017-04-18 Abhishek Sinha , Mausoom Sarkar , Aahitagni Mukherjee , Balaji Krishnamurthy

In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving…

Machine Learning · Computer Science 2023-04-06 Johannes Maly , Rayan Saab

Training deep neural networks results in strong learned representations that show good generalization capabilities. In most cases, training involves iterative modification of all weights inside the network via back-propagation. In Extreme…

Machine Learning · Computer Science 2018-02-06 Amir Rosenfeld , John K. Tsotsos

Neural networks are nowadays highly successful despite strong hardness results. The existing hardness results focus on the network architecture, and assume that the network's weights are arbitrary. A natural approach to settle the…

Machine Learning · Computer Science 2020-10-15 Amit Daniely , Gal Vardi

This paper presents an algorithm for analytically calculating the weights and thresholds of convolutional neural networks (CNNs) without using standard training procedures. The algorithm enables the determination of CNN parameters based on…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Polad Geidarov

This paper presents an empirical study on the weights of neural networks, where we interpret each model as a point in a high-dimensional space -- the neural weight space. To explore the complex structure of this space, we sample from a…

Computer Vision and Pattern Recognition · Computer Science 2020-02-14 Gabriel Eilertsen , Daniel Jönsson , Timo Ropinski , Jonas Unger , Anders Ynnerman

The paper discusses the capabilities of multilayer perceptron neural networks implementing metric recognition methods, for which the values of the weights are calculated analytically by formulas. Comparative experiments in training a neural…

Machine Learning · Computer Science 2025-06-02 Polad Geidarov

We present some novel, straightforward methods for training the connection graph of a randomly initialized neural network without training the weights. These methods do not use hyperparameters defining cutoff thresholds and therefore remove…

Machine Learning · Computer Science 2020-11-18 Cristian Ivan , Razvan Florian

A recent work by Ramanujan et al. (2020) provides significant empirical evidence that sufficiently overparameterized, random neural networks contain untrained subnetworks that achieve state-of-the-art accuracy on several predictive tasks. A…

Machine Learning · Computer Science 2021-10-26 Kartik Sreenivasan , Shashank Rajput , Jy-yong Sohn , Dimitris Papailiopoulos

Recent findings have shown that highly over-parameterized Neural Networks generalize without pretraining or explicit regularization. It is achieved with zero training error, i.e., complete over-fitting by memorizing the training data. This…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Christoph Linse , Thomas Martinetz

The universal approximation property is fundamental to the success of neural networks, and has traditionally been achieved by training networks without any constraints on their parameters. However, recent experimental research proposed a…

Machine Learning · Computer Science 2025-03-21 Yongqiang Cai , Gaohang Chen , Zhonghua Qiao

Convolutional Neural Networks spread through computer vision like a wildfire, impacting almost all visual tasks imaginable. Despite this, few researchers dare to train their models from scratch. Most work builds on one of a handful of…

Computer Vision and Pattern Recognition · Computer Science 2016-09-26 Philipp Krähenbühl , Carl Doersch , Jeff Donahue , Trevor Darrell

Given that conventional recommenders, while deeply effective, rely on large distributed systems pre-trained on aggregate user data, incorporating new data necessitates large training cycles, making them slow to adapt to real-time user…

Information Retrieval · Computer Science 2025-11-11 Rafayel Latif , Satwik Behera , Ali Al-Ebrahim

Knowledge embedded in the weights of the artificial neural network can be used to improve the network structure, such as in network compression. However, the knowledge is set up by hand, which may not be very accurate, and relevant…

Neural and Evolutionary Computing · Computer Science 2021-10-13 Mengqiao Han , Xiabi Liu , Zhaoyang Hai , Xin Duan
‹ Prev 1 2 3 10 Next ›