Related papers: Sparse Activity and Sparse Connectivity in Supervi…

Sparse learning enabled by constraints on connectivity and function

Sparse connectivity is a hallmark of the brain and a desired property of artificial neural networks. It promotes energy efficiency, simplifies training, and enhances the robustness of network function. Thus, a detailed understanding of how…

Disordered Systems and Neural Networks · Physics 2024-09-10 Mirza M. Junaid Baig , Armen Stepanyants

Sparse-Aware Neural Networks for Nonlinear Functionals: Mitigating the Exponential Dependence on Dimension

Deep neural networks have emerged as powerful tools for learning operators defined over infinite-dimensional function spaces. However, existing theories frequently encounter difficulties related to dimensionality and limited…

Machine Learning · Computer Science 2026-05-12 Jianfei Li , Shuo Huang , Han Feng , Ding-Xuan Zhou , Gitta Kutyniok

On improving deep learning generalization with adaptive sparse connectivity

Large neural networks are very successful in various tasks. However, with limited data, the generalization capabilities of deep neural networks are also very limited. In this paper, we empirically start showing that intrinsically sparse…

Neural and Evolutionary Computing · Computer Science 2019-06-28 Shiwei Liu , Decebal Constantin Mocanu , Mykola Pechenizkiy

Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction

Despite impressive performance, deep neural networks require significant memory and computation costs, prohibiting their application in resource-constrained scenarios. Sparse training is one of the most common techniques to reduce these…

Machine Learning · Computer Science 2023-12-06 Bowen Lei , Dongkuan Xu , Ruqi Zhang , Shuren He , Bani K. Mallick

Exploiting the Full Capacity of Deep Neural Networks while Avoiding Overfitting by Targeted Sparsity Regularization

Overfitting is one of the most common problems when training deep neural networks on comparatively small datasets. Here, we demonstrate that neural network activation sparsity is a reliable indicator for overfitting which we utilize to…

Machine Learning · Computer Science 2020-02-24 Karim Huesmann , Soeren Klemm , Lars Linsen , Benjamin Risse

Activation function impact on Sparse Neural Networks

While the concept of a Sparse Neural Network has been researched for some time, researchers have only recently made notable progress in the matter. Techniques like Sparse Evolutionary Training allow for significantly lower computational…

Neural and Evolutionary Computing · Computer Science 2020-10-14 Adam Dubowski

A Sparsity-Aware Adaptive Algorithm for Distributed Learning

In this paper, a sparsity-aware adaptive algorithm for distributed learning in diffusion networks is developed. The algorithm follows the set-theoretic estimation rationale. At each time instance and at each node of the network, a closed…

Information Theory · Computer Science 2015-06-03 Symeon Chouvardas , Konstantinos Slavakis , Yannis Kopsinis , Sergios Theodoridis

Efficient Neural Network Training via Forward and Backward Propagation Sparsification

Sparse training is a natural idea to accelerate the training speed of deep neural networks and save the memory usage, especially since large modern neural networks are significantly over-parameterized. However, most of the existing methods…

Machine Learning · Computer Science 2021-11-11 Xiao Zhou , Weizhong Zhang , Zonghao Chen , Shizhe Diao , Tong Zhang

Unsupervised Pretraining Encourages Moderate-Sparseness

It is well known that direct training of deep neural networks will generally lead to poor results. A major progress in recent years is the invention of various pretraining methods to initialize network parameters and it was shown that such…

Machine Learning · Computer Science 2014-06-10 Jun Li , Wei Luo , Jian Yang , Xiaotong Yuan

Sparsity in Deep Learning: Pruning and growth for efficient inference and training in neural networks

The growing energy and performance costs of deep learning have driven the community to reduce the size of neural networks by selectively pruning components. Similarly to their biological counterparts, sparse networks generalize just as…

Machine Learning · Computer Science 2021-02-02 Torsten Hoefler , Dan Alistarh , Tal Ben-Nun , Nikoli Dryden , Alexandra Peste

Representations learnt by SGD and Adaptive learning rules: Conditions that vary sparsity and selectivity in neural networks

From the point of view of the human brain, continual learning can perform various tasks without mutual interference. An effective way to reduce mutual interference can be found in sparsity and selectivity of neurons. According to Aljundi et…

Machine Learning · Computer Science 2024-10-04 Jin Hyun Park

The Utility of Sparse Representations for Control in Reinforcement Learning

We investigate sparse representations for control in reinforcement learning. While these representations are widely used in computer vision, their prevalence in reinforcement learning is limited to sparse coding where extracting…

Machine Learning · Computer Science 2018-11-19 Vincent Liu , Raksha Kumaraswamy , Lei Le , Martha White

Intrinsically Interpretable Attention via Sparse Post-Training

We introduce a simple post-training method that makes transformer attention sparse without sacrificing performance. Applying a flexible sparsity regularisation under a constrained-loss objective, we show on models up to 7B parameters that…

Machine Learning · Computer Science 2026-05-26 Florent Draye , Anson Lei , Hsiao-Ru Pan , Ingmar Posner , Bernhard Schölkopf

Sparseout: Controlling Sparsity in Deep Networks

Dropout is commonly used to help reduce overfitting in deep neural networks. Sparsity is a potentially important property of neural networks, but is not explicitly controlled by Dropout-based regularization. In this work, we propose…

Machine Learning · Computer Science 2019-04-18 Najeeb Khan , Ian Stavness

Sparse Models for Machine Learning

The sparse modeling is an evident manifestation capturing the parsimony principle just described, and sparse models are widespread in statistics, physics, information sciences, neuroscience, computational mathematics, and so on. In…

Machine Learning · Computer Science 2023-08-29 Jianyi Lin

Sparse Networks from Scratch: Faster Training without Losing Performance

We demonstrate the possibility of what we call sparse learning: accelerated training of deep neural networks that maintain sparse weights throughout training while achieving dense performance levels. We accomplish this by developing sparse…

Machine Learning · Computer Science 2019-08-27 Tim Dettmers , Luke Zettlemoyer

Learning with Structured Sparsity

This paper investigates a new learning formulation called structured sparsity, which is a natural extension of the standard sparsity concept in statistical learning and compressive sensing. By allowing arbitrary structures on the feature…

Methodology · Statistics 2009-05-05 Junzhou Huang , Tong Zhang , Dimitris Metaxas

Accurate Neural Network Pruning Requires Rethinking Sparse Optimization

Obtaining versions of deep neural networks that are both highly-accurate and highly-sparse is one of the main challenges in the area of model compression, and several high-performance pruning techniques have been investigated by the…

Machine Learning · Computer Science 2023-09-11 Denis Kuznedelev , Eldar Kurtic , Eugenia Iofinova , Elias Frantar , Alexandra Peste , Dan Alistarh

k-Sparse Autoencoders

Recently, it has been observed that when representations are learnt in a way that encourages sparsity, improved performance is obtained on classification tasks. These methods involve combinations of activation functions, sampling steps and…

Machine Learning · Computer Science 2014-03-25 Alireza Makhzani , Brendan Frey

Finding trainable sparse networks through Neural Tangent Transfer

Deep neural networks have dramatically transformed machine learning, but their memory and energy demands are substantial. The requirements of real biological neural networks are rather modest in comparison, and one feature that might…

Machine Learning · Computer Science 2020-07-27 Tianlin Liu , Friedemann Zenke