Related papers: Sparse Deep Learning Models with the $\ell_1$ Regu…

Sparse deep neural networks for modeling aluminum electrolysis dynamics

Deep neural networks have become very popular in modeling complex nonlinear processes due to their extraordinary ability to fit arbitrary nonlinear functions from data with minimal expert intervention. However, they are almost always…

Chemical Physics · Physics 2023-01-16 Erlend Torje Berg Lundby , Adil Rasheed , Ivar Johan Halvorsen , Jan Tommy Gravdahl

Learning Sparse Neural Networks via Sensitivity-Driven Regularization

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify…

Machine Learning · Computer Science 2018-10-30 Enzo Tartaglione , Skjalg Lepsøy , Attilio Fiandrotti , Gianluca Francini

Learning Sparse Filters in Deep Convolutional Neural Networks with a l1/l2 Pseudo-Norm

While deep neural networks (DNNs) have proven to be efficient for numerous tasks, they come at a high memory and computation cost, thus making them impractical on resource-limited devices. However, these networks are known to contain a…

Neural and Evolutionary Computing · Computer Science 2020-07-21 Anthony Berthelier , Yongzhe Yan , Thierry Chateau , Christophe Blanc , Stefan Duffner , Christophe Garcia

Parameter Choices for Sparse Regularization with the $\ell_1$ Norm

We consider a regularization problem whose objective function consists of a convex fidelity term and a regularization term determined by the $\ell_1$ norm composed with a linear transform. Empirical results show that the regularization with…

Numerical Analysis · Mathematics 2023-01-18 Qianru Liu , Rui Wang , Yuesheng Xu , Mingsong Yan

Regularization-based Pruning of Irrelevant Weights in Deep Neural Architectures

Deep neural networks exploiting millions of parameters are nowadays the norm in deep learning applications. This is a potential issue because of the great amount of computational resources needed for training, and of the possible loss of…

Computation and Language · Computer Science 2022-10-31 Giovanni Bonetta , Matteo Ribero , Rossella Cancelliere

Neural Network Training Using $\ell_1$-Regularization and Bi-fidelity Data

With the capability of accurately representing a functional relationship between the inputs of a physical system's model and output quantities of interest, neural networks have become popular for surrogate modeling in scientific…

Machine Learning · Statistics 2022-04-06 Subhayan De , Alireza Doostan

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

Obtaining versions of deep neural networks that are both highly-accurate and highly-sparse is one of the main challenges in the area of model compression, and several high-performance pruning techniques have been investigated by the…

Machine Learning · Computer Science 2023-09-11 Denis Kuznedelev , Eldar Kurtic , Eugenia Iofinova , Elias Frantar , Alexandra Peste , Dan Alistarh

Deep Learning Meets Sparse Regularization: A Signal Processing Perspective

Deep learning has been wildly successful in practice and most state-of-the-art machine learning methods are based on neural networks. Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of…

Machine Learning · Statistics 2023-10-03 Rahul Parhi , Robert D. Nowak

Learning the Number of Neurons in Deep Networks

Nowadays, the number of layers and of neurons in each layer of a deep network are typically set manually. While very deep and wide networks have proven effective in general, they come at a high memory and computation cost, thus making them…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Jose M Alvarez , Mathieu Salzmann

A multiobjective continuation method to compute the regularization path of deep neural networks

Sparsity is a highly desired feature in deep neural networks (DNNs) since it ensures numerical efficiency, improves the interpretability of models (due to the smaller number of relevant features), and robustness. For linear models, it is…

Machine Learning · Computer Science 2024-04-01 Augustina C. Amakor , Konstantin Sonntag , Sebastian Peitz

Transformed $\ell_1$ Regularization for Learning Sparse Deep Neural Networks

Deep neural networks (DNNs) have achieved extraordinary success in numerous areas. However, to attain this success, DNNs often carry a large number of weight parameters, leading to heavy costs of memory and computation resources.…

Computer Vision and Pattern Recognition · Computer Science 2019-01-07 Rongrong Ma , Jianyu Miao , Lingfeng Niu , Peng Zhang

On Sparsity in Overparametrised Shallow ReLU Networks

The analysis of neural network training beyond their linearization regime remains an outstanding open question, even in the simplest setup of a single hidden-layer. The limit of infinitely wide networks provides an appealing route forward…

Machine Learning · Computer Science 2020-06-19 Jaume de Dios , Joan Bruna

Layer Sparsity in Neural Networks

Sparsity has become popular in machine learning, because it can save computational resources, facilitate interpretations, and prevent overfitting. In this paper, we discuss sparsity in the framework of neural networks. In particular, we…

Machine Learning · Computer Science 2020-06-30 Mohamed Hebiri , Johannes Lederer

Sparse Regularization with the $\ell_0$ Norm

We consider a minimization problem whose objective function is the sum of a fidelity term, not necessarily convex, and a regularization term defined by a positive regularization parameter $\lambda$ multiple of the $\ell_0$ norm composed…

Optimization and Control · Mathematics 2021-11-17 Yuesheng Xu

Sparsity-Guided Multi-Parameter Selection in $\ell_1$-Regularized Models via a Fixed-Point Proximity Approach

We study a regularization framework that combines a convex fidelity term with multiple $\ell_1$-based regularizers, each linked to a distinct linear transform. This multi-penalty model enhances flexibility in promoting structured sparsity.…

Numerical Analysis · Mathematics 2026-02-02 Qianru Liu , Rui Wang , Yuesheng Xu

Exploiting the Full Capacity of Deep Neural Networks while Avoiding Overfitting by Targeted Sparsity Regularization

Overfitting is one of the most common problems when training deep neural networks on comparatively small datasets. Here, we demonstrate that neural network activation sparsity is a reliable indicator for overfitting which we utilize to…

Machine Learning · Computer Science 2020-02-24 Karim Huesmann , Soeren Klemm , Lars Linsen , Benjamin Risse

Sparse-Aware Neural Networks for Nonlinear Functionals: Mitigating the Exponential Dependence on Dimension

Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited…

Machine Learning · Computer Science 2026-05-12 Jianfei Li , Shuo Huang , Han Feng , Ding-Xuan Zhou , Gitta Kutyniok

Differentiable Sparsification for Deep Neural Networks

Deep neural networks have significantly alleviated the burden of feature engineering, but comparable efforts are now required to determine effective architectures for these networks. Furthermore, as network sizes have become excessively…

Machine Learning · Computer Science 2023-10-25 Yognjin Lee

Towards the interpretation of time-varying regularization parameters in streaming penalized regression models

High-dimensional, streaming datasets are ubiquitous in modern applications. Examples range from finance and e-commerce to the study of biomedical and neuroimaging data. As a result, many novel algorithms have been proposed to address…

Methodology · Statistics 2020-09-28 Lenka Zboňáková , Ricardo Pio Monti , Wolfgang Karl Härdle

Meta-Sparsity: Learning Optimal Sparse Structures in Multi-task Networks through Meta-learning

This paper presents meta-sparsity, a framework for learning model sparsity, basically learning the parameter that controls the degree of sparsity, that allows deep neural networks (DNNs) to inherently generate optimal sparse shared…

Machine Learning · Computer Science 2025-01-22 Richa Upadhyay , Ronald Phlypo , Rajkumar Saini , Marcus Liwicki