Related papers: Computed Decision Weights and a New Learning Algor…

A comparative analysis of a neural network with calculated weights and a neural network with random generation of weights based on the training dataset size

The paper discusses the capabilities of multilayer perceptron neural networks implementing metric recognition methods, for which the values of the weights are calculated analytically by formulas. Comparative experiments in training a neural…

Machine Learning · Computer Science 2025-06-02 Polad Geidarov

A simple approach for quantizing neural networks

In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving…

Machine Learning · Computer Science 2023-04-06 Johannes Maly, Rayan Saab

Classifying the classifier: dissecting the weight space of neural networks

This paper presents an empirical study on the weights of neural networks, where we interpret each model as a point in a high-dimensional space -- the neural weight space. To explore the complex structure of this space, we sample from a…

Computer Vision and Pattern Recognition · Computer Science 2020-02-14 Gabriel Eilertsen, Daniel Jönsson, Timo Ropinski, Jonas Unger, Anders Ynnerman

Training without Gradients -- A Filtering Approach

A particle filtering approach is suggested for the training of multi-layer neural networks without utilizing gradients calculation. The network weights are considered to be the components of the estimated state-vector of a noise driven…

Optimization and Control · Mathematics 2020-10-13 Isaac Yaesh, Natan Grinfeld

A comprehensive theoretical framework for the optimization of neural networks classification performance with respect to weighted metrics

In many contexts, customized and weighted classification scores are designed in order to evaluate the goodness of the predictions carried out by neural networks. However, there exists a discrepancy between the maximization of such scores…

Machine Learning · Computer Science 2023-05-24 Francesco Marchetti, Sabrina Guastavino, Cristina Campi, Federico Benvenuto, Michele Piana

Model compression as constrained optimization, with application to neural nets. Part II: quantization

We consider the problem of deep neural net compression by quantization: given a large, reference net, we want to quantize its real-valued weights using a codebook with $K$ entries so that the training loss of the quantized net is minimal.…

Machine Learning · Computer Science 2017-07-17 Miguel Á. Carreira-Perpiñán, Yerlan Idelbayev

Training Decision Trees as Replacement for Convolution Layers

We present an alternative layer to convolution layers in convolutional neural networks (CNNs). Our approach reduces the complexity of convolutions by replacing them with binary decisions. Those binary decisions are used as indexes to…

Machine Learning · Computer Science 2020-02-12 Wolfgang Fuhl, Gjergji Kasneci, Wolfgang Rosenstiel, Enkelejda Kasneci

Deep Learning with a Classifier System: Initial Results

This article presents the first results from using a learning classifier system capable of performing adaptive computation with deep neural networks. Individual classifiers within the population are composed of two neural networks. The…

Neural and Evolutionary Computing · Computer Science 2021-03-02 Richard J. Preen, Larry Bull

Investigating Learning in Deep Neural Networks using Layer-Wise Weight Change

Understanding the per-layer learning dynamics of deep neural networks is of significant interest as it may provide insights into how neural networks learn and the potential for better training regimens. We investigate learning in Deep…

Machine Learning · Computer Science 2020-12-02 Ayush Manish Agrawal, Atharva Tendle, Harshvardhan Sikka, Sahib Singh, Amr Kayid

Deep Weighted Averaging Classifiers

Recent advances in deep learning have achieved impressive gains in classification accuracy on a variety of types of data, including images and text. Despite these gains, however, concerns have been raised about the calibration, robustness,…

Machine Learning · Computer Science 2018-11-20 Dallas Card, Michael Zhang, Noah A. Smith

Some Insights into the Geometry and Training of Neural Networks

Neural networks have been successfully used for classification tasks in a rapidly growing number of practical applications. Despite their popularity and widespread use, there are still many aspects of training and classification that are…

Machine Learning · Computer Science 2016-05-03 Ewout van den Berg

Predicting Neural Network Accuracy from Weights

We show experimentally that the accuracy of a trained neural network can be predicted surprisingly well by looking only at its weights, without evaluating it on input data. We motivate this task and introduce a formal setting for it. Even…

Machine Learning · Statistics 2021-04-12 Thomas Unterthiner, Daniel Keysers, Sylvain Gelly, Olivier Bousquet, Ilya Tolstikhin

Bit-wise Training of Neural Network Weights

We introduce an algorithm where the individual bits representing the weights of a neural network are learned. This method allows training weights with integer values on arbitrary bit-depths and naturally uncovers sparse networks, without…

Machine Learning · Computer Science 2022-02-22 Cristian Ivan

Increasing biases can be more efficient than increasing weights

We introduce a novel computational unit for neural networks that features multiple biases, challenging the traditional perceptron structure. This unit emphasizes the importance of preserving uncorrupted information as it is passed from one…

Neural and Evolutionary Computing · Computer Science 2024-01-19 Carlo Metta, Marco Fantozzi, Andrea Papini, Gianluca Amato, Matteo Bergamaschi, Silvia Giulia Galfrè, Alessandro Marchetti, Michelangelo Vegliò, Maurizio Parton, Francesco Morandin

Neural Networks with Few Multiplications

For most deep learning algorithms training is notoriously time consuming. Since most of the computation in training neural networks is typically spent on floating point multiplications, we investigate an approach to training that eliminates…

Machine Learning · Computer Science 2016-02-29 Zhouhan Lin, Matthieu Courbariaux, Roland Memisevic, Yoshua Bengio

Decision-Focused Learning: Through the Lens of Learning to Rank

In recent years, the decision-focused learning framework, also known as predict-and-optimize, has received increasing attention. In this setting, the predictions of a machine learning model are used as estimated cost coefficients in the…

Machine Learning · Computer Science 2022-06-20 Jayanta Mandi, Víctor Bucarey, Maxime Mulamba, Tias Guns

A Good Start Matters: Enhancing Continual Learning with Data-Driven Weight Initialization

To adapt to real-world data streams, continual learning (CL) systems must rapidly learn new concepts while preserving and utilizing prior knowledge. When it comes to adding new information to continually-trained deep neural networks (DNNs),…

Machine Learning · Computer Science 2025-07-02 Md Yousuf Harun, Christopher Kanan

Training Quantized Nets: A Deeper Understanding

Currently, deep neural networks are deployed on low-power portable devices by first training a full-precision model using powerful hardware, and then deriving a corresponding low-precision model for efficient inference on such systems.…

Machine Learning · Computer Science 2017-11-15 Hao Li, Soham De, Zheng Xu, Christoph Studer, Hanan Samet, Tom Goldstein

WeightScale: Interpreting Weight Change in Neural Networks

Interpreting the learning dynamics of neural networks can provide useful insights into how networks learn and the development of better training and design approaches. We present an approach to interpret learning in neural networks by…

Machine Learning · Computer Science 2022-03-29 Ayush Manish Agrawal, Atharva Tendle, Harshvardhan Sikka, Sahib Singh

Multi-Loss Weighting with Coefficient of Variations

Many interesting tasks in machine learning and computer vision are learned by optimising an objective function defined as a weighted linear combination of multiple losses. The final performance is sensitive to choosing the correct…

Computer Vision and Pattern Recognition · Computer Science 2020-11-11 Rick Groenendijk, Sezer Karaoglu, Theo Gevers, Thomas Mensink