Related papers: Wide Compression: Tensor Ring Nets

Nonlinear Tensor Ring Network

The state-of-the-art deep neural networks (DNNs) have been widely applied for various real-world applications, and achieved significant performance for cognitive problems. However, the increment of DNNs' width and depth in architecture…

Machine Learning · Computer Science 2021-11-15 Xiao Peng Li , Qi Liu , Hing Cheung So

Tensorizing Neural Networks

Deep neural networks currently demonstrate state-of-the-art performance in several domains. At the same time, models of this class are very demanding in terms of computational resources. In particular, a large amount of memory is required…

Machine Learning · Computer Science 2015-12-22 Alexander Novikov , Dmitry Podoprikhin , Anton Osokin , Dmitry Vetrov

Compressing Neural Networks Using Tensor Networks with Exponentially Fewer Variational Parameters

Neural network (NN) designed for challenging machine learning tasks is in general a highly nonlinear mapping that contains massive variational parameters. High complexity of NN, if unbounded or unconstrained, might unpredictably cause…

Machine Learning · Computer Science 2025-05-23 Yong Qing , Ke Li , Peng-Fei Zhou , Shi-Ju Ran

Cramnet: Layer-wise Deep Neural Network Compression with Knowledge Transfer from a Teacher Network

Neural Networks accomplish amazing things, but they suffer from computational and memory bottlenecks that restrict their usage. Nowhere can this be better seen than in the mobile space, where specialized hardware is being created just to…

Computer Vision and Pattern Recognition · Computer Science 2019-04-15 Jon Hoffman

Convolutional neural networks compression with low rank and sparse tensor decompositions

Convolutional neural networks show outstanding results in a variety of computer vision tasks. However, a neural network architecture design usually faces a trade-off between model performance and computational/memory complexity. For some…

Computer Vision and Pattern Recognition · Computer Science 2020-06-12 Pavel Kaloshin

Optimize Deep Convolutional Neural Network with Ternarized Weights and High Accuracy

Deep convolution neural network has achieved great success in many artificial intelligence applications. However, its enormous model size and massive computation cost have become the main obstacle for deployment of such powerful algorithm…

Computer Vision and Pattern Recognition · Computer Science 2018-07-23 Zhezhi He , Boqing Gong , Deliang Fan

Tensor Regression Networks with various Low-Rank Tensor Approximations

Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years,…

Machine Learning · Computer Science 2018-11-30 Xingwei Cao , Guillaume Rabusseau

Compressing Recurrent Neural Networks with Tensor Ring for Action Recognition

Recurrent Neural Networks (RNNs) and their variants, such as Long-Short Term Memory (LSTM) networks, and Gated Recurrent Unit (GRU) networks, have achieved promising performance in sequential data modeling. The hidden layers in RNNs can be…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Yu Pan , Jing Xu , Maolin Wang , Jinmian Ye , Fei Wang , Kun Bai , Zenglin Xu

Adaptive Learning of Tensor Network Structures

Tensor Networks (TN) offer a powerful framework to efficiently represent very high-dimensional objects. TN have recently shown their potential for machine learning applications and offer a unifying view of common tensor decomposition models…

Machine Learning · Computer Science 2021-06-24 Meraj Hashemizadeh , Michelle Liu , Jacob Miller , Guillaume Rabusseau

Pruning at a Glance: Global Neural Pruning for Model Compression

Deep Learning models have become the dominant approach in several areas due to their high performance. Unfortunately, the size and hence computational requirements of operating such models can be considerably high. Therefore, this…

Computer Vision and Pattern Recognition · Computer Science 2019-12-04 Abdullah Salama , Oleksiy Ostapenko , Tassilo Klein , Moin Nabi

Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Low-Rank+Sparse Tensor Compression for Neural Networks

Low-rank tensor compression has been proposed as a promising approach to reduce the memory and compute requirements of neural networks for their deployment on edge devices. Tensor compression reduces the number of parameters required to…

Machine Learning · Computer Science 2021-11-03 Cole Hawkins , Haichuan Yang , Meng Li , Liangzhen Lai , Vikas Chandra

"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to…

Machine Learning · Statistics 2024-03-04 Lingyu Gu , Yongqi Du , Yuan Zhang , Di Xie , Shiliang Pu , Robert C. Qiu , Zhenyu Liao

Compressing complex convolutional neural network based on an improved deep compression algorithm

Although convolutional neural network (CNN) has made great progress, large redundant parameters restrict its deployment on embedded devices, especially mobile devices. The recent compression works are focused on real-value convolutional…

Computer Vision and Pattern Recognition · Computer Science 2019-03-07 Jiasong Wu , Hongshan Ren , Youyong Kong , Chunfeng Yang , Lotfi Senhadji , Huazhong Shu

Tensor Ring Optimized Quantum-Enhanced Tensor Neural Networks

Quantum machine learning researchers often rely on incorporating Tensor Networks (TN) into Deep Neural Networks (DNN) and variational optimization. However, the standard optimization techniques used for training the contracted trainable…

Quantum Physics · Physics 2023-10-04 Debanjan Konar , Dheeraj Peddireddy , Vaneet Aggarwal , Bijaya K. Panigrahi

CompressNAS : A Fast and Efficient Technique for Model Compression using Decomposition

Deep Convolutional Neural Networks (CNNs) are increasingly difficult to deploy on microcontrollers (MCUs) and lightweight NPUs (Neural Processing Units) due to their growing size and compute demands. Low-rank tensor decomposition, such as…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Sudhakar Sah , Nikhil Chabbra , Matthieu Durnerin

Tensor Decomposition for Model Reduction in Neural Networks: A Review

Modern neural networks have revolutionized the fields of computer vision (CV) and Natural Language Processing (NLP). They are widely used for solving complex CV tasks and NLP tasks such as image classification, image generation, and machine…

Machine Learning · Computer Science 2023-07-25 Xingyi Liu , Keshab K. Parhi

Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization

Tensor decomposition is one of the fundamental technique for model compression of deep convolution neural networks owing to its ability to reveal the latent relations among complex structures. However, most existing methods compress the…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Bo-Shiuan Chu , Che-Rung Lee

Randomized Tensor Ring Decomposition and Its Application to Large-scale Data Reconstruction

Dimensionality reduction is an essential technique for multi-way large-scale data, i.e., tensor. Tensor ring (TR) decomposition has become popular due to its high representation ability and flexibility. However, the traditional TR…

Numerical Analysis · Mathematics 2024-12-20 Longhao Yuan , Chao Li , Jianting Cao , Qibin Zhao

Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks

Tensorizing a neural network involves reshaping some or all of its dense weight matrices into higher-order tensors and approximating them using low-rank tensor network decompositions. This technique has shown promise as a model compression…

Machine Learning · Computer Science 2025-05-27 Safa Hamreras , Sukhbinder Singh , Román Orús