English
Related papers

Related papers: Efficient Beam Tree Recursion

200 papers

Binary Balanced Tree RvNNs (BBT-RvNNs) enforce sequence composition according to a preset balanced binary tree structure. Thus, their non-linear recursion depth is just $\log_2 n$ ($n$ being the sequence length). Such logarithmic scaling…

Machine Learning · Computer Science 2023-11-09 Jishnu Ray Chowdhury , Cornelia Caragea

We propose Beam Tree Recursive Cell (BT-Cell) - a backpropagation-friendly framework to extend Recursive Neural Networks (RvNNs) with beam search for latent structure induction. We further extend this framework by proposing a relaxation of…

Machine Learning · Computer Science 2023-06-21 Jishnu Ray Chowdhury , Cornelia Caragea

For years, recursive neural networks (RvNNs) have been shown to be suitable for representing text into fixed-length vectors and achieved good performance on several natural language processing tasks. However, the main drawback of RvNNs is…

Computation and Language · Computer Science 2017-11-22 Jihun Choi , Kang Min Yoo , Sang-goo Lee

We propose a novel approach to reduce memory consumption of the backpropagation through time (BPTT) algorithm when training recurrent neural networks (RNNs). Our approach uses dynamic programming to balance a trade-off between caching of…

Neural and Evolutionary Computing · Computer Science 2016-06-13 Audrūnas Gruslys , Remi Munos , Ivo Danihelka , Marc Lanctot , Alex Graves

Backpropagation through time (BPTT) is the standard algorithm for training recurrent neural networks (RNNs), which requires separate simulation phases for the forward and backward passes for inference and learning, respectively. Moreover,…

Machine Learning · Computer Science 2023-03-13 Anand Subramoney

We present Bifocal RNN-T, a new variant of the Recurrent Neural Network Transducer (RNN-T) architecture designed for improved inference time latency on speech recognition tasks. The architecture enables a dynamic pivot for its runtime…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-05 Jonathan Macoskey , Grant P. Strimel , Ariya Rastrow

Recurrent Neural Networks (RNNs) are a key technology for applications such as automatic speech recognition or machine translation. Unlike conventional feed-forward DNNs, RNNs remember past information to improve the accuracy of future…

Neural and Evolutionary Computing · Computer Science 2022-02-16 Franyell Silfa , Jose-Maria Arnau , Antonio González

Although deep learning has demonstrated remarkable capability in learning from unstructured data, modern tree-based ensemble models remain superior in extracting relevant information and learning from structured datasets. While several…

Machine Learning · Computer Science 2026-02-05 Yi-Chun Liao , Chieh-Lin Tsai , Yuan-Hao Chang , Camélia Slimani , Jalil Boukhobza , Tei-Wei Kuo

Recursive Neural Networks (RvNNs), which compose sequences according to their underlying hierarchical syntactic structure, have performed well in several natural language processing tasks compared to similar models without structural…

Computation and Language · Computer Science 2021-06-14 Jishnu Ray Chowdhury , Cornelia Caragea

Recursive neural networks (RNN) and their recently proposed extension recursive long short term memory networks (RLSTM) are models that compute representations for sentences, by recursively combining word embeddings according to an…

Artificial Intelligence · Computer Science 2016-03-02 Phong Le , Willem Zuidema

The concepts of unitary evolution matrices and associative memory have boosted the field of Recurrent Neural Networks (RNN) to state-of-the-art performance in a variety of sequential tasks. However, RNN still have a limited capacity to…

Machine Learning · Computer Science 2017-10-27 Rumen Dangovski , Li Jing , Marin Soljacic

Long short-term memory recurrent neural networks (LSTM-RNNs) are considered state-of-the art in many speech processing tasks. The recurrence in the network, in principle, allows any input to be remembered for an indefinite time, a feature…

Audio and Speech Processing · Electrical Eng. & Systems 2020-09-02 Jeroen Zegers , Hugo Van hamme

Recurrent neural networks (RNNs) have recently demonstrated strong performance and faster inference than Transformers at comparable parameter budgets. However, the recursive gradient computation with the backpropagation through time (or…

Machine Learning · Computer Science 2025-04-01 Paul Caillon , Erwan Fagnou , Alexandre Allauzen

In the domain of sequence modelling, Recurrent Neural Networks (RNN) have been capable of achieving impressive results in a variety of application areas including visual question answering, part-of-speech tagging and machine translation.…

Machine Learning · Computer Science 2018-05-22 Tharindu Fernando , Simon Denman , Aaron McFadyen , Sridha Sridharan , Clinton Fookes

Recurrent Neural Networks (RNNs) are widely used for sequential processing but face fundamental limitations with continual inference due to state saturation, requiring disruptive hidden state resets. However, reset-based methods impose…

Machine Learning · Computer Science 2024-12-23 Bojian Yin , Federico Corradi

Stack-augmented recurrent neural networks (RNNs) have been of interest to the deep learning community for some time. However, the difficulty of training memory models remains a problem obstructing the widespread use of such models. In this…

Machine Learning · Computer Science 2019-11-05 Yikang Shen , Shawn Tan , Arian Hosseini , Zhouhan Lin , Alessandro Sordoni , Aaron Courville

The RNN-Transducer (RNN-T) framework for speech recognition has been growing in popularity, particularly for deployed real-time ASR systems, because it combines high accuracy with naturally streaming recognition. One of the drawbacks of…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-28 Fangjun Kuang , Liyong Guo , Wei Kang , Long Lin , Mingshuang Luo , Zengwei Yao , Daniel Povey

Solving arithmetic word problems is a cornerstone task in assessing language understanding and reasoning capabilities in NLP systems. Recent works use automatic extraction and ranking of candidate solution equations providing the answer to…

Computation and Language · Computer Science 2021-03-10 Klim Zaporojets , Giannis Bekoulis , Johannes Deleu , Thomas Demeester , Chris Develder

Binary neural networks (BNNs) have been widely adopted to reduce the computational cost and memory storage on edge-computing devices by using one-bit representation for activations and weights. However, as neural networks become…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Quang Hieu Vo , Linh-Tam Tran , Sung-Ho Bae , Lok-Won Kim , Choong Seon Hong

Processing sequential data of variable length is a major challenge in a wide range of applications, such as speech recognition, language modeling, generative image modeling and machine translation. Here, we address this challenge by…

Neural and Evolutionary Computing · Computer Science 2017-06-13 Asier Mujika , Florian Meier , Angelika Steger
‹ Prev 1 2 3 10 Next ›