English
Related papers

Related papers: Regularization of linear machine learning problems

200 papers

We present a simple linear regression based approach for learning the weights and biases of a neural network, as an alternative to standard gradient based backpropagation. The present work is exploratory in nature, and we restrict the…

Machine Learning · Computer Science 2023-07-17 Harshad Khadilkar

Operating deep neural networks (DNNs) on devices with limited resources requires the reduction of their memory as well as computational footprint. Popular reduction methods are network quantization or pruning, which either reduce the word…

Based on the property that solving the system of linear matrix equations via the column space and the row space projections boils down to an approximation in the least squares error sense, a formulation for learning the weight matrices of…

Machine Learning · Computer Science 2018-11-21 Kar-Ann Toh

Many scientific and geometric problems exhibit general linear symmetries, yet most equivariant neural networks are built for compact groups or simple vector features, limiting their reuse on matrix-valued data such as covariances, inertias,…

Machine Learning · Computer Science 2026-02-02 Chankyo Kim , Sicheng Zhao , Minghan Zhu , Tzu-Yuan Lin , Maani Ghaffari

Single-hidden layer feed forward neural networks (SLFNs) are widely used in pattern classification problems, but a huge bottleneck encountered is the slow speed and poor performance of the traditional iterative gradient-based learning…

Machine Learning · Computer Science 2019-10-28 Jie He , Tao Chen , Zhijun Zhang

Linear regression is a widely used technique to fit linear models and finds widespread applications across different areas such as machine learning and statistics. In most real-world scenarios, however, linear regression problems are often…

Quantum Physics · Physics 2023-05-02 Shantanav Chakraborty , Aditya Morolia , Anurudh Peduri

Employing equivariance in neural networks leads to greater parameter efficiency and improved generalization performance through the encoding of domain knowledge in the architecture; however, the majority of existing approaches require an a…

Machine Learning · Computer Science 2023-05-31 Emmanouil Theodosis , Karim Helwani , Demba Ba

Diagonal linear networks are neural networks with linear activation and diagonal weight matrices. Their theoretical interest is that their implicit regularization can be rigorously analyzed: from a small initialization, the training of…

Machine Learning · Computer Science 2026-03-19 Raphaël Berthier

In recent years, Orthogonal Recurrent Neural Networks (ORNNs) have gained popularity due to their ability to manage tasks involving long-term dependencies, such as the copy-task, and their linear complexity. However, existing ORNNs utilize…

Neural and Evolutionary Computing · Computer Science 2024-06-11 Armand Foucault , Franck Mamalet , François Malgouyres

We investigate the computational complexity of various problems for simple recurrent neural networks (RNNs) as formal models for recognizing weighted languages. We focus on the single-layer, ReLU-activation, rational-weight RNNs with…

Formal Languages and Automata Theory · Computer Science 2018-03-06 Yining Chen , Sorcha Gilroy , Andreas Maletti , Jonathan May , Kevin Knight

Neural networks are a powerful class of nonlinear functions that can be trained end-to-end on various applications. While the over-parametrization nature in many neural networks renders the ability to fit complex functions and the strong…

Machine Learning · Computer Science 2020-07-23 Weiyang Liu , Rongmei Lin , Zhen Liu , Lixin Liu , Zhiding Yu , Bo Dai , Le Song

Sparse neural networks are important for achieving better generalization and enhancing computation efficiency. This paper proposes a novel learning approach to obtain sparse fully connected layers in neural networks (NNs) automatically. We…

Machine Learning · Computer Science 2021-04-28 Mengqiao Han , Xiabi Liu , Zhaoyang Hai , Zhengwen Li

In this short note, we propose a new method for quantizing the weights of a fully trained neural network. A simple deterministic pre-processing step allows us to quantize network layers via memoryless scalar quantization while preserving…

Machine Learning · Computer Science 2023-04-06 Johannes Maly , Rayan Saab

In recent years there has been a collective research effort to find new formulations of reinforcement learning that are simultaneously more efficient and more amenable to analysis. This paper concerns one approach that builds on the linear…

Optimization and Control · Mathematics 2022-10-19 Fan Lu , Prashant Mehta , Sean Meyn , Gergely Neu

Due to the strong correlations present in quantum systems, classical machine learning algorithms like stochastic gradient descent are often insufficient for the training of neural network quantum states (NQSs). These difficulties can be…

Quantum Physics · Physics 2021-04-23 J. Thorben Frank , Michael J. Kastoryano

Recently, the interpretability of deep learning has attracted a lot of attention. A plethora of methods have attempted to explain neural networks by feature visualization, saliency maps, model distillation, and so on. However, it is hard…

Machine Learning · Computer Science 2021-10-19 Hangcheng Dong , Jingxiao Liao , Yan Wang , Yixin Chen , Bingguo Liu , Dong Ye , Guodong Liu

In this article, we introduce a novel normalization technique for neural network weight matrices, which we term weight conditioning. This approach aims to narrow the gap between the smallest and largest singular values of the weight…

Computer Vision and Pattern Recognition · Computer Science 2026-03-16 Hemanth Saratchandran , Thomas X. Wang , Simon Lucey

Recurrent neural networks (RNNs) are notoriously difficult to train. When the eigenvalues of the hidden to hidden weight matrix deviate from absolute value 1, optimization becomes difficult due to the well studied issue of vanishing and…

Machine Learning · Computer Science 2016-10-13 Martin Arjovsky , Amar Shah , Yoshua Bengio

Despite their impressive performance, Deep Neural Networks (DNNs) typically underperform Gradient Boosting Trees (GBTs) on many tabular-dataset learning tasks. We propose that applying a different regularization coefficient to each weight…

Machine Learning · Statistics 2018-10-25 Ira Shavitt , Eran Segal

A single layer neural network for the solution of linear equations is presented. The proposed circuit is based on the standard Hopfield model albeit with the added flexibility that the interconnection weight matrix need not be symmetric.…

Neural and Evolutionary Computing · Computer Science 2021-05-05 Mohammad Samar Ansari
‹ Prev 1 2 3 10 Next ›