English
Related papers

Related papers: Compressing Recurrent Neural Network with Tensor T…

200 papers

Recurrent neural networks (RNNs) are powerful tools for sequential modeling, but typically require significant overparameterization and regularization to achieve optimal performance. This leads to difficulties in the deployment of large…

Machine Learning · Computer Science 2021-11-11 Charles C. Onu , Jacob E. Miller , Doina Precup

In the machine learning fields, Recurrent Neural Network (RNN) has become a popular architecture for sequential data modeling. However, behind the impressive performance, RNNs require a large number of parameters for both training and…

Machine Learning · Computer Science 2018-05-09 Andros Tjandra , Sakriani Sakti , Satoshi Nakamura

Recurrent Neural Networks (RNNs) represent the de facto standard machine learning tool for sequence modelling, owing to their expressive power and memory. However, when dealing with large dimensional data, the corresponding exponential…

Machine Learning · Computer Science 2021-05-12 Yao Lei Xu , Giuseppe G. Calvi , Danilo P. Mandic

Recurrent Neural Network (RNN) and its variations such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), have become standard building blocks for learning online data of sequential nature in many research areas, including…

Computation and Language · Computer Science 2020-05-12 Enmao Diao , Jie Ding , Vahid Tarokh

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

Recurrent Neural Networks (RNNs) are among the most successful machine learning models for sequence modelling, but tend to suffer from an exponential increase in the number of parameters when dealing with large multidimensional data. To…

Machine Learning · Computer Science 2021-05-12 Yao Lei Xu , Danilo P. Mandic

We propose tensorial neural networks (TNNs), a generalization of existing neural networks by extending tensor operations on low order operands to those on high order ones. The problem of parameter learning is challenging, as it corresponds…

Machine Learning · Statistics 2018-12-11 Jiahao Su , Jingling Li , Bobby Bhattacharjee , Furong Huang

Recurrent Neural Networks (RNNs) and their variants, such as Long-Short Term Memory (LSTM) networks, and Gated Recurrent Unit (GRU) networks, have achieved promising performance in sequential data modeling. The hidden layers in RNNs can be…

Computer Vision and Pattern Recognition · Computer Science 2018-11-20 Yu Pan , Jing Xu , Maolin Wang , Jinmian Ye , Fei Wang , Kun Bai , Zenglin Xu

Recurrent Neural Networks (RNNs), which are a powerful scheme for modeling temporal and sequential data need to capture long-term dependencies on datasets and represent them in hidden layers with a powerful model to capture more information…

Machine Learning · Computer Science 2017-06-08 Andros Tjandra , Sakriani Sakti , Ruli Manurung , Mirna Adriani , Satoshi Nakamura

Recurrent Neural Networks (RNNs) have been widely used in sequence analysis and modeling. However, when processing high-dimensional data, RNNs typically require very large model sizes, thereby bringing a series of deployment challenges.…

Machine Learning · Computer Science 2020-05-12 Miao Yin , Siyu Liao , Xiao-Yang Liu , Xiaodong Wang , Bo Yuan

Recurrent neural networks (RNN) such as long-short-term memory (LSTM) networks are essential in a multitude of daily live tasks such as speech, language, video, and multimodal learning. The shift from cloud to edge computation intensifies…

Machine Learning · Computer Science 2020-06-11 Alejandro Murua , Ramchalam Ramakrishnan , Xinlin Li , Rui Heng Yang , Vahid Partovi Nia

In this work, we firstly apply the Train-Tensor (TT) networks to construct a compact representation of the classical Multilayer Perceptron, representing a reduction of up to 95% of the coefficients. A comparative analysis between tensor…

Machine Learning · Computer Science 2021-03-31 M. Nazareth da Costa , R. Attux , A. Cichocki , J. M. T. Romano

We study the problem of compressing recurrent neural networks (RNNs). In particular, we focus on the compression of RNN acoustic models, which are motivated by the goal of building compact and accurate speech recognition systems which can…

Computation and Language · Computer Science 2016-05-03 Rohit Prabhavalkar , Ouais Alsharif , Antoine Bruguier , Ian McGraw

The significant computational costs of deploying neural networks in large-scale or resource constrained environments, such as data centers and mobile devices, has spurred interest in model compression, which can achieve a reduction in both…

Computer Vision and Pattern Recognition · Computer Science 2017-02-27 Jonathan A. Cox

Three dimensional convolutional neural networks (3DCNNs) have been applied in many tasks, e.g., video and 3D point cloud recognition. However, due to the higher dimension of convolutional kernels, the space complexity of 3DCNNs is generally…

Computer Vision and Pattern Recognition · Computer Science 2020-08-12 Dingheng Wang , Guangshe Zhao , Guoqi Li , Lei Deng , Yang Wu

Advanced tensor decomposition, such as Tensor train (TT) and Tensor ring (TR), has been widely studied for deep neural network (DNN) model compression, especially for recurrent neural networks (RNNs). However, compressing convolutional…

Computer Vision and Pattern Recognition · Computer Science 2021-07-28 Miao Yin , Yang Sui , Siyu Liao , Bo Yuan

Recurrent Neural Networks (RNNs) are powerful sequence modeling tools. However, when dealing with high dimensional inputs, the training of RNNs becomes computational expensive due to the large number of model parameters. This hinders RNNs…

Machine Learning · Computer Science 2018-05-23 Jinmian Ye , Linnan Wang , Guangxi Li , Di Chen , Shandian Zhe , Xinqi Chu , Zenglin Xu

Modern sensing and metrology systems now stream terabytes of heterogeneous, high-dimensional (HD) data profiles, images, and dense point clouds, whose natural representation is multi-way tensors. Understanding such data requires regression…

Machine Learning · Computer Science 2025-10-08 Qian Wang , Mohammad N. Bisheh , Kamran Paynabar

Neural network (NN) designed for challenging machine learning tasks is in general a highly nonlinear mapping that contains massive variational parameters. High complexity of NN, if unbounded or unconstrained, might unpredictably cause…

Machine Learning · Computer Science 2025-05-23 Yong Qing , Ke Li , Peng-Fei Zhou , Shi-Ju Ran

Sentence compression is a Natural Language Processing (NLP) task aimed at shortening original sentences and preserving their key information. Its applications can benefit many fields e.g. one can build tools for language education. However,…

Computation and Language · Computer Science 2020-09-24 Weiwei Hou , Hanna Suominen , Piotr Koniusz , Sabrina Caldwell , Tom Gedeon
‹ Prev 1 2 3 10 Next ›