Related papers: Tensorizing Neural Networks

Tensorized Embedding Layers for Efficient Model Compression

The embedding layers transforming input words into real vectors are the key components of deep neural networks used in natural language processing. However, when the vocabulary is large, the corresponding weight matrices can be enormous,…

Computation and Language · Computer Science 2020-02-20 Oleksii Hrinchuk , Valentin Khrulkov , Leyla Mirvakhabova , Elena Orlova , Ivan Oseledets

Wide Compression: Tensor Ring Nets

Deep neural networks have demonstrated state-of-the-art performance in a variety of real-world applications. In order to obtain performance gains, these networks have grown larger and deeper, containing millions or even billions of…

Machine Learning · Computer Science 2018-02-27 Wenqi Wang , Yifan Sun , Brian Eriksson , Wenlin Wang , Vaneet Aggarwal

Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks

Tensorizing a neural network involves reshaping some or all of its dense weight matrices into higher-order tensors and approximating them using low-rank tensor network decompositions. This technique has shown promise as a model compression…

Machine Learning · Computer Science 2025-05-27 Safa Hamreras , Sukhbinder Singh , Román Orús

Tensor-Train Recurrent Neural Networks for Video Classification

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

A Fully Tensorized Recurrent Neural Network

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

Nonlinear Tensor Ring Network

The state-of-the-art deep neural networks (DNNs) have been widely applied for various real-world applications, and achieved significant performance for cognitive problems. However, the increment of DNNs' width and depth in architecture…

Machine Learning · Computer Science 2021-11-15 Xiao Peng Li , Qi Liu , Hing Cheung So

Tensor train decompositions on recurrent networks

Recurrent neural networks (RNN) such as long-short-term memory (LSTM) networks are essential in a multitude of daily live tasks such as speech, language, video, and multimodal learning. The shift from cloud to edge computation intensifies…

Machine Learning · Computer Science 2020-06-11 Alejandro Murua , Ramchalam Ramakrishnan , Xinlin Li , Rui Heng Yang , Vahid Partovi Nia

Ultimate tensorization: compressing convolutional and FC layers alike

Convolutional neural networks excel in image recognition tasks, but this comes at the cost of high computational and memory complexity. To tackle this problem, [1] developed a tensor factorization framework to compress fully-connected…

Machine Learning · Computer Science 2016-11-11 Timur Garipov , Dmitry Podoprikhin , Alexander Novikov , Dmitry Vetrov

Convolutional Neural Network Compression Based on Low-Rank Decomposition

Deep neural networks typically impose significant computational loads and memory consumption. Moreover, the large parameters pose constraints on deploying the model on edge devices such as embedded systems. Tensor decomposition offers a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Yaping He , Linhao Jiang , Di Wu

Tensor Networks for Latent Variable Analysis. Part I: Algorithms for Tensor Train Decomposition

Decompositions of tensors into factor matrices, which interact through a core tensor, have found numerous applications in signal processing and machine learning. A more general tensor model which represents data as an ordered network of…

Numerical Analysis · Computer Science 2016-09-30 Anh-Huy Phan , Andrzej Cichocki , Andre Uschmajew , Petr Tichavsky , George Luta , Danilo Mandic

"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach

Modern deep neural networks (DNNs) are extremely powerful; however, this comes at the price of increased depth and having more parameters per layer, making their training and inference more computationally challenging. In an attempt to…

Machine Learning · Statistics 2024-03-04 Lingyu Gu , Yongqi Du , Yuan Zhang , Di Xie , Shiliang Pu , Robert C. Qiu , Zhenyu Liao

Tensor Decomposition for Model Reduction in Neural Networks: A Review

Modern neural networks have revolutionized the fields of computer vision (CV) and Natural Language Processing (NLP). They are widely used for solving complex CV tasks and NLP tasks such as image classification, image generation, and machine…

Machine Learning · Computer Science 2023-07-25 Xingyi Liu , Keshab K. Parhi

Block-term Tensor Neural Networks

Deep neural networks (DNNs) have achieved outstanding performance in a wide range of applications, e.g., image classification, natural language processing, etc. Despite the good performance, the huge number of parameters in DNNs brings…

Machine Learning · Computer Science 2020-12-21 Jinmian Ye , Guangxi Li , Di Chen , Haiqin Yang , Shandian Zhe , Zenglin Xu

Compression strategies and space-conscious representations for deep neural networks

Recent advances in deep learning have made available large, powerful convolutional neural networks (CNN) with state-of-the-art performance in several real-world applications. Unfortunately, these large-sized models have millions of…

Machine Learning · Computer Science 2020-07-17 Giosuè Cataldo Marinò , Gregorio Ghidoli , Marco Frasca , Dario Malchiodi

Tensor Networks for Big Data Analytics and Large-Scale Optimization Problems

In this paper we review basic and emerging models and associated algorithms for large-scale tensor networks, especially Tensor Train (TT) decompositions using novel mathematical and graphical representations. We discus the concept of…

Numerical Analysis · Computer Science 2014-08-25 Andrzej Cichocki

Compressing Deep Convolutional Networks using Vector Quantization

Deep convolutional neural networks (CNN) has become the most promising method for object recognition, repeatedly demonstrating record breaking results for image classification and object detection in recent years. However, a very deep CNN…

Computer Vision and Pattern Recognition · Computer Science 2014-12-22 Yunchao Gong , Liu Liu , Ming Yang , Lubomir Bourdev

Tensor Regression Networks with various Low-Rank Tensor Approximations

Tensor regression networks achieve high compression rate of neural networks while having slight impact on performances. They do so by imposing low tensor rank structure on the weight matrices of fully connected layers. In recent years,…

Machine Learning · Computer Science 2018-11-30 Xingwei Cao , Guillaume Rabusseau

Learning Robust and Lightweight Model through Separable Structured Transformations

With the proliferation of mobile devices and the Internet of Things, deep learning models are increasingly deployed on devices with limited computing resources and memory, and are exposed to the threat of adversarial noise. Learning deep…

Computer Vision and Pattern Recognition · Computer Science 2021-12-30 Xian Wei , Yanhui Huang , Yangyu Xu , Mingsong Chen , Hai Lan , Yuanxiang Li , Zhongfeng Wang , Xuan Tang

Variational Tensor Neural Networks for Deep Learning

Deep neural networks (NNs) encounter scalability limitations when confronted with a vast array of neurons, thereby constraining their achievable network depth. To address this challenge, we propose an integration of tensor networks (TN)…

Disordered Systems and Neural Networks · Physics 2024-08-20 Saeed S. Jahromi , Roman Orus

An Overview of Neural Network Compression

Overparameterized networks trained to convergence have shown impressive performance in domains such as computer vision and natural language processing. Pushing state of the art on salient tasks within these domains corresponds to these…

Machine Learning · Computer Science 2020-08-04 James O' Neill