Related papers: Sparse Deep Learning Models with the $\ell_1$ Regu…

200 papers

Deep neural networks have become very popular in modeling complex nonlinear processes due to their extraordinary ability to fit arbitrary nonlinear functions from data with minimal expert intervention. However, they are almost always…

Chemical Physics · Physics 2023-01-16 Erlend Torje Berg Lundby, Adil Rasheed, Ivar Johan Halvorsen, Jan Tommy Gravdahl

The ever-increasing number of parameters in deep neural networks poses challenges for memory-limited applications. Regularize-and-prune methods aim at meeting these challenges by sparsifying the network weights. In this context we quantify…

Machine Learning · Computer Science 2018-10-30 Enzo Tartaglione, Skjalg Lepsøy, Attilio Fiandrotti, Gianluca Francini

While deep neural networks (DNNs) have proven to be efficient for numerous tasks, they come at a high memory and computation cost, thus making them impractical on resource-limited devices. However, these networks are known to contain a…

Neural and Evolutionary Computing · Computer Science 2020-07-21 Anthony Berthelier, Yongzhe Yan, Thierry Chateau, Christophe Blanc, Stefan Duffner, Christophe Garcia

We consider a regularization problem whose objective function consists of a convex fidelity term and a regularization term determined by the $\ell_1$ norm composed with a linear transform. Empirical results show that the regularization with…

Numerical Analysis · Mathematics 2023-01-18 Qianru Liu, Rui Wang, Yuesheng Xu, Mingsong Yan

Deep neural networks exploiting millions of parameters are nowadays the norm in deep learning applications. This is a potential issue because of the great amount of computational resources needed for training, and of the possible loss of…

Computation and Language · Computer Science 2022-10-31 Giovanni Bonetta, Matteo Ribero, Rossella Cancelliere

With the capability of accurately representing a functional relationship between the inputs of a physical system's model and output quantities of interest, neural networks have become popular for surrogate modeling in scientific…

Machine Learning · Statistics 2022-04-06 Subhayan De, Alireza Doostan

Obtaining versions of deep neural networks that are both highly-accurate and highly-sparse is one of the main challenges in the area of model compression, and several high-performance pruning techniques have been investigated by the…

Machine Learning · Computer Science 2023-09-11 Denis Kuznedelev, Eldar Kurtic, Eugenia Iofinova, Elias Frantar, Alexandra Peste, Dan Alistarh

Deep learning has been wildly successful in practice and most state-of-the-art machine learning methods are based on neural networks. Lacking, however, is a rigorous mathematical theory that adequately explains the amazing performance of…

Machine Learning · Statistics 2023-10-03 Rahul Parhi, Robert D. Nowak

Nowadays, the number of layers and of neurons in each layer of a deep network are typically set manually. While very deep and wide networks have proven effective in general, they come at a high memory and computation cost, thus making them…

Computer Vision and Pattern Recognition · Computer Science 2018-10-12 Jose M Alvarez, Mathieu Salzmann

Sparsity is a highly desired feature in deep neural networks (DNNs) since it ensures numerical efficiency, improves the interpretability of models (due to the smaller number of relevant features), and robustness. For linear models, it is…

Machine Learning · Computer Science 2024-04-01 Augustina C. Amakor, Konstantin Sonntag, Sebastian Peitz

Deep neural networks (DNNs) have achieved extraordinary success in numerous areas. However, to attain this success, DNNs often carry a large number of weight parameters, leading to heavy costs of memory and computation resources.…

Computer Vision and Pattern Recognition · Computer Science 2019-01-07 Rongrong Ma, Jianyu Miao, Lingfeng Niu, Peng Zhang

The analysis of neural network training beyond their linearization regime remains an outstanding open question, even in the simplest setup of a single hidden-layer. The limit of infinitely wide networks provides an appealing route forward…

Machine Learning · Computer Science 2020-06-19 Jaume de Dios, Joan Bruna

Sparsity has become popular in machine learning, because it can save computational resources, facilitate interpretations, and prevent overfitting. In this paper, we discuss sparsity in the framework of neural networks. In particular, we…

Machine Learning · Computer Science 2020-06-30 Mohamed Hebiri, Johannes Lederer

We consider a minimization problem whose objective function is the sum of a fidelity term, not necessarily convex, and a regularization term defined by a positive regularization parameter $\lambda$ multiple of the $\ell_0$ norm composed…

Optimization and Control · Mathematics 2021-11-17 Yuesheng Xu

We study a regularization framework that combines a convex fidelity term with multiple $\ell_1$-based regularizers, each linked to a distinct linear transform. This multi-penalty model enhances flexibility in promoting structured sparsity.…

Numerical Analysis · Mathematics 2026-02-02 Qianru Liu, Rui Wang, Yuesheng Xu

Overfitting is one of the most common problems when training deep neural networks on comparatively small datasets. Here, we demonstrate that neural network activation sparsity is a reliable indicator for overfitting which we utilize to…

Machine Learning · Computer Science 2020-02-24 Karim Huesmann, Soeren Klemm, Lars Linsen, Benjamin Risse

Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited…

Machine Learning · Computer Science 2026-05-12 Jianfei Li, Shuo Huang, Han Feng, Ding-Xuan Zhou, Gitta Kutyniok

Deep neural networks have significantly alleviated the burden of feature engineering, but comparable efforts are now required to determine effective architectures for these networks. Furthermore, as network sizes have become excessively…

Machine Learning · Computer Science 2023-10-25 Yognjin Lee

High-dimensional, streaming datasets are ubiquitous in modern applications. Examples range from finance and e-commerce to the study of biomedical and neuroimaging data. As a result, many novel algorithms have been proposed to address…

Methodology · Statistics 2020-09-28 Lenka Zboňáková, Ricardo Pio Monti, Wolfgang Karl Härdle

This paper presents meta-sparsity, a framework for learning model sparsity, basically learning the parameter that controls the degree of sparsity, that allows deep neural networks (DNNs) to inherently generate optimal sparse shared…

Machine Learning · Computer Science 2025-01-22 Richa Upadhyay, Ronald Phlypo, Rajkumar Saini, Marcus Liwicki