Related papers: Towards Efficient Tensor Decomposition-Based DNN M…

ADMM-NN: An Algorithm-Hardware Co-Design Framework of DNNs Using Alternating Direction Method of Multipliers

To facilitate efficient embedded and hardware implementations of deep neural networks (DNNs), two important categories of DNN model compression techniques: weight pruning and weight quantization are investigated. The former leverages the…

Machine Learning · Computer Science 2019-01-03 Ao Ren , Tianyun Zhang , Shaokai Ye , Jiayu Li , Wenyao Xu , Xuehai Qian , Xue Lin , Yanzhi Wang

Hybrid Tensor Decomposition in Neural Network Compression

Deep neural networks (DNNs) have enabled impressive breakthroughs in various artificial intelligence (AI) applications recently due to its capability of learning high-level features from big data. However, the current demand of DNNs for…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Bijiao Wu , Dingheng Wang , Guangshe Zhao , Lei Deng , Guoqi Li

Compressing Neural Networks Using Tensor Networks with Exponentially Fewer Variational Parameters

Neural network (NN) designed for challenging machine learning tasks is in general a highly nonlinear mapping that contains massive variational parameters. High complexity of NN, if unbounded or unconstrained, might unpredictably cause…

Machine Learning · Computer Science 2025-05-23 Yong Qing , Ke Li , Peng-Fei Zhou , Shi-Ju Ran

Nonlinear Tensor Ring Network

The state-of-the-art deep neural networks (DNNs) have been widely applied for various real-world applications, and achieved significant performance for cognitive problems. However, the increment of DNNs' width and depth in architecture…

Machine Learning · Computer Science 2021-11-15 Xiao Peng Li , Qi Liu , Hing Cheung So

Compressing 3DCNNs Based on Tensor Train Decomposition

Three dimensional convolutional neural networks (3DCNNs) have been applied in many tasks, e.g., video and 3D point cloud recognition. However, due to the higher dimension of convolutional kernels, the space complexity of 3DCNNs is generally…

Computer Vision and Pattern Recognition · Computer Science 2020-08-12 Dingheng Wang , Guangshe Zhao , Guoqi Li , Lei Deng , Yang Wu

A Unified Approximation Framework for Compressing and Accelerating Deep Neural Networks

Deep neural networks (DNNs) have achieved significant success in a variety of real world applications, i.e., image classification. However, tons of parameters in the networks restrict the efficiency of neural networks due to the large model…

Machine Learning · Computer Science 2019-08-21 Yuzhe Ma , Ran Chen , Wei Li , Fanhua Shang , Wenjian Yu , Minsik Cho , Bei Yu

Adaptive Learning of Tensor Network Structures

Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models…

Machine Learning · Computer Science 2021-06-24 Meraj Hashemizadeh , Michelle Liu , Jacob Miller , Guillaume Rabusseau

A Systematic DNN Weight Pruning Framework using Alternating Direction Method of Multipliers

Weight pruning methods for deep neural networks (DNNs) have been investigated recently, but prior work in this area is mainly heuristic, iterative pruning, thereby lacking guarantees on the weight reduction ratio and convergence time. To…

Neural and Evolutionary Computing · Computer Science 2018-10-23 Tianyun Zhang , Shaokai Ye , Kaiqi Zhang , Jian Tang , Wujie Wen , Makan Fardad , Yanzhi Wang

Progressive DNN Compression: A Key to Achieve Ultra-High Weight Pruning and Quantization Rates using ADMM

Weight pruning and weight quantization are two important categories of DNN model compression. Prior work on these techniques are mainly based on heuristics. A recent work developed a systematic frame-work of DNN weight pruning using the…

Neural and Evolutionary Computing · Computer Science 2019-04-02 Shaokai Ye , Xiaoyu Feng , Tianyun Zhang , Xiaolong Ma , Sheng Lin , Zhengang Li , Kaidi Xu , Wujie Wen , Sijia Liu , Jian Tang , Makan Fardad , Xue Lin , Yongpan Liu , Yanzhi Wang

Quantization Aware Factorization for Deep Neural Network Compression

Tensor decomposition of convolutional and fully-connected layers is an effective way to reduce parameters and FLOP in neural networks. Due to memory and power consumption limitations of mobile or embedded devices, the quantization step is…

Machine Learning · Computer Science 2023-08-10 Daria Cherniuk , Stanislav Abukhovich , Anh-Huy Phan , Ivan Oseledets , Andrzej Cichocki , Julia Gusak

Reduced storage direct tensor ring decomposition for convolutional neural networks compression

Convolutional neural networks (CNNs) are among the most widely used machine learning models for computer vision tasks, such as image classification. To improve the efficiency of CNNs, many CNNs compressing approaches have been developed.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Mateusz Gabor , Rafał Zdunek

Generalized Higher-Order Tensor Decomposition via Parallel ADMM

Higher-order tensors are becoming prevalent in many scientific areas such as computer vision, social network analysis, data mining and neuroscience. Traditional tensor decomposition approaches face three major challenges: model selecting,…

Numerical Analysis · Computer Science 2014-07-08 Fanhua Shang , Yuanyuan Liu , James Cheng

Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization

Tensor decomposition is one of the fundamental technique for model compression of deep convolution neural networks owing to its ability to reveal the latent relations among complex structures. However, most existing methods compress the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Bo-Shiuan Chu , Che-Rung Lee

Toward Extremely Low Bit and Lossless Accuracy in DNNs with Progressive ADMM

Weight quantization is one of the most important techniques of Deep Neural Networks (DNNs) model compression method. A recent work using systematic framework of DNN weight quantization with the advanced optimization algorithm ADMM…

Machine Learning · Computer Science 2019-05-03 Sheng Lin , Xiaolong Ma , Shaokai Ye , Geng Yuan , Kaisheng Ma , Yanzhi Wang

Systematic Weight Pruning of DNNs using Alternating Direction Method of Multipliers

We present a systematic weight pruning framework of deep neural networks (DNNs) using the alternating direction method of multipliers (ADMM). We first formulate the weight pruning problem of DNNs as a constrained nonconvex optimization…

Machine Learning · Computer Science 2018-04-24 Tianyun Zhang , Shaokai Ye , Yipeng Zhang , Yanzhi Wang , Makan Fardad

Progressive Weight Pruning of Deep Neural Networks using ADMM

Deep neural networks (DNNs) although achieving human-level performance in many domains, have very large model size that hinders their broader applications on edge computing devices. Extensive research work have been conducted on DNN model…

Machine Learning · Computer Science 2018-11-06 Shaokai Ye , Tianyun Zhang , Kaiqi Zhang , Jiayu Li , Kaidi Xu , Yunfei Yang , Fuxun Yu , Jian Tang , Makan Fardad , Sijia Liu , Xiang Chen , Xue Lin , Yanzhi Wang

Tensor Ring Decomposition with Rank Minimization on Latent Space: An Efficient Approach for Tensor Completion

In tensor completion tasks, the traditional low-rank tensor decomposition models suffer from the laborious model selection problem due to their high model sensitivity. In particular, for tensor ring (TR) decomposition, the number of model…

Machine Learning · Computer Science 2018-12-03 Longhao Yuan , Chao Li , Danilo Mandic , Jianting Cao , Qibin Zhao

A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM

Many model compression techniques of Deep Neural Networks (DNNs) have been investigated, including weight pruning, weight clustering and quantization, etc. Weight pruning leverages the redundancy in the number of weights in DNNs, while…

Neural and Evolutionary Computing · Computer Science 2018-11-06 Shaokai Ye , Tianyun Zhang , Kaiqi Zhang , Jiayu Li , Jiaming Xie , Yun Liang , Sijia Liu , Xue Lin , Yanzhi Wang

Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges.…

Machine Learning · Computer Science 2020-05-12 Miao Yin , Siyu Liao , Xiao-Yang Liu , Xiaodong Wang , Bo Yuan

Compressing Recurrent Neural Network with Tensor Train

Recurrent Neural Network (RNN) are a popular choice for modeling temporal and sequential tasks and achieve many state-of-the-art performance on various complex problems. However, most of the state-of-the-art RNNs have millions of parameters…

Machine Learning · Computer Science 2017-10-31 Andros Tjandra , Sakriani Sakti , Satoshi Nakamura