Related papers: Practical Network Acceleration with Tiny Sets

Practical Network Acceleration with Tiny Sets: Hypothesis, Theory, and Algorithm

Due to data privacy issues, accelerating networks with tiny training sets has become a critical need in practice. Previous methods achieved promising results empirically by filter-level pruning. In this paper, we both study this problem…

Computer Vision and Pattern Recognition · Computer Science 2023-03-13 Guo-Hua Wang, Jianxin Wu

Block-Wise Dynamic-Precision Neural Network Training Acceleration via Online Quantization Sensitivity Analytics

Data quantization is an effective method to accelerate neural network training and reduce power consumption. However, it is challenging to perform low-bit quantized training: the conventional equal-precision quantization will lead to either…

Machine Learning · Computer Science 2022-11-01 Ruoyang Liu, Chenhan Wei, Yixiong Yang, Wenxun Wang, Huazhong Yang, Yongpan Liu

Progressive Skeletonization: Trimming more fat from a network at initialization

Recent studies have shown that skeletonization (pruning parameters) of networks at initialization provides all the practical benefits of sparsity both at inference and training time, while only marginally degrading their…

Computer Vision and Pattern Recognition · Computer Science 2021-03-22 Pau de Jorge, Amartya Sanyal, Harkirat S. Behl, Philip H. S. Torr, Gregory Rogez, Puneet K. Dokania

Learning Sparse Networks Using Targeted Dropout

Neural networks are easier to optimise when they have many more weights than are required for modelling the mapping from inputs to outputs. This suggests a two-stage learning procedure that first learns a large net and then prunes away…

Machine Learning · Computer Science 2019-09-10 Aidan N. Gomez, Ivan Zhang, Siddhartha Rao Kamalakara, Divyam Madaan, Kevin Swersky, Yarin Gal, Geoffrey E. Hinton

Training Behavior of Sparse Neural Network Topologies

Improvements in the performance of deep neural networks have often come through the design of larger and more complex networks. As a result, fast memory is a significant limiting factor in our ability to improve network performance. One…

Machine Learning · Computer Science 2019-12-25 Simon Alford, Ryan Robinett, Lauren Milechin, Jeremy Kepner

On the Power of Simple Reductions for the Maximum Independent Set Problem

Reductions (rules that reduce input size while maintaining the ability to compute an optimal solution) are critical for developing efficient maximum independent set algorithms in both theory and practice. While several simple reductions…

Data Structures and Algorithms · Computer Science 2016-08-03 Darren Strash

Compacting Neural Network Classifiers via Dropout Training

We introduce dropout compaction, a novel method for training feed-forward neural networks which realizes the performance gains of training a large model with dropout regularization, yet extracts a compact neural network for run-time…

Machine Learning · Statistics 2017-05-25 Yotaro Kubo, George Tucker, Simon Wiesler

DropNet: Reducing Neural Network Complexity via Iterative Pruning

Modern deep neural networks require a significant amount of computing time and power to train and deploy, which limits their usage on edge devices. Inspired by the iterative weight pruning in the Lottery Ticket Hypothesis, we propose…

Machine Learning · Computer Science 2022-07-15 John Tan Chong Min, Mehul Motani

Minibatch training of neural network ensembles via trajectory sampling

Most iterative neural network training methods use estimates of the loss function over small random subsets (or minibatches) of the data to update the parameters, which aid in decoupling the training time from the (often very large) size of…

Statistical Mechanics · Physics 2023-06-28 Jamie F. Mair, Luke Causer, Juan P. Garrahan

Less Data, Faster Training: repeating smaller datasets speeds up learning via sampling biases

This work investigates the "small-vs-large gap", where repeating on fewer samples can lead to compute saving during training compared to using a larger dataset. This is observed across algorithmic tasks, architectures and optimizers and…

Machine Learning · Computer Science 2026-05-21 Jingwen Liu, Ezra Edelman, Surbhi Goel, Bingbin Liu

DRESS: Dynamic REal-time Sparse Subnets

The limited and dynamically varied resources on edge devices motivate us to deploy an optimized deep neural network that can adapt its sub-networks to fit in different resource constraints. However, existing works often build sub-networks…

Computer Vision and Pattern Recognition · Computer Science 2022-07-05 Zhongnan Qu, Syed Shakib Sarwar, Xin Dong, Yuecheng Li, Ekin Sumbul, Barbara De Salvo

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers

We present a novel network pruning algorithm called Dynamic Sparse Training that can jointly find the optimal network parameters and sparse network structure in a unified optimization process with trainable pruning thresholds. These…

Machine Learning · Computer Science 2020-05-15 Junjie Liu, Zhe Xu, Runbin Shi, Ray C. C. Cheung, Hayden K. H. So

Faster Convergence & Generalization in DNNs

Deep neural networks have gained tremendous popularity in the last few years. They have been applied to the task of classification in almost every domain. Despite the success, deep networks can be incredibly slow to train for even moderate…

Machine Learning · Computer Science 2018-10-11 Gaurav Singh, John Shawe-Taylor

Procrustes: a Dataflow and Accelerator for Sparse Deep Neural Network Training

The success of DNN pruning has led to the development of energy-efficient inference accelerators that support pruned models with sparse weight and activation tensors. Because the memory layouts and dataflows in these architectures are…

Neural and Evolutionary Computing · Computer Science 2020-09-24 Dingqing Yang, Amin Ghasemazar, Xiaowei Ren, Maximilian Golub, Guy Lemieux, Mieszko Lis

Network Recasting: A Universal Method for Network Architecture Transformation

This paper proposes network recasting as a general method for network architecture transformation. The primary goal of this method is to accelerate the inference process through the transformation, but there can be many other practical…

Machine Learning · Computer Science 2019-06-20 Joonsang Yu, Sungbum Kang, Kiyoung Choi

Activation Density driven Energy-Efficient Pruning in Training

Neural network pruning with suitable retraining can yield networks with considerably fewer parameters than the original with comparable degrees of accuracy. Typical pruning methods require large, fully trained networks as a starting point…

Machine Learning · Computer Science 2020-10-13 Timothy Foldy-Porto, Yeshwanth Venkatesha, Priyadarshini Panda

CompressNet: Generative Compression at Extremely Low Bitrates

Compressing images at extremely low bitrates (< 0.1 bpp) has always been a challenging task since the quality of reconstruction significantly reduces due to the strong imposed constraint on the number of bits allocated for the compressed…

Image and Video Processing · Electrical Eng. & Systems 2020-06-16 Suraj Kiran Raman, Aditya Ramesh, Vijayakrishna Naganoor, Shubham Dash, Giridharan Kumaravelu, Honglak Lee

Learning to Sample the Most Useful Training Patches from Images

Some image restoration tasks like demosaicing require difficult training samples to learn effective models. Existing methods attempt to address this data training problem by manually collecting a new training dataset that contains adequate…

Computer Vision and Pattern Recognition · Computer Science 2020-11-25 Shuyang Sun, Liang Chen, Gregory Slabaugh, Philip Torr

Efficient Stagewise Pretraining via Progressive Subnetworks

Recent developments in large language models have sparked interest in efficient pretraining methods. Stagewise training approaches to improve efficiency, like gradual stacking and layer dropping (Reddi et al., 2023; Zhang & He, 2020), have…

Computation and Language · Computer Science 2024-10-15 Abhishek Panigrahi, Nikunj Saunshi, Kaifeng Lyu, Sobhan Miryoosefi, Sashank Reddi, Satyen Kale, Sanjiv Kumar

Block Pruning for Enhanced Efficiency in Convolutional Neural Networks

This paper presents a novel approach to network pruning, targeting block pruning in deep neural networks for edge computing environments. Our method diverges from traditional techniques that utilize proxy metrics, instead employing a direct…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Cheng-En Wu, Azadeh Davoodi, Yu Hen Hu