Related papers: Neural Attribute Machines for Program Generation

Inducing Regular Grammars Using Recurrent Neural Networks

Grammar induction is the task of learning a grammar from a set of examples. Recently, neural networks have been shown to be powerful learning machines that can identify patterns in streams of data. In this work we investigate their…

Computation and Language · Computer Science 2018-06-27 Mor Cohen , Avi Caciularu , Idan Rejwan , Jonathan Berant

Language Generation with Recurrent Generative Adversarial Networks without Pre-training

Generative Adversarial Networks (GANs) have shown great promise recently in image generation. Training GANs for language generation has proven to be more difficult, because of the non-differentiable nature of generating text with recurrent…

Computation and Language · Computer Science 2017-12-22 Ofir Press , Amir Bar , Ben Bogin , Jonathan Berant , Lior Wolf

Recurrent Aggregators in Neural Algorithmic Reasoning

Neural algorithmic reasoning (NAR) is an emerging field that seeks to design neural networks that mimic classical algorithmic computations. Today, graph neural networks (GNNs) are widely used in neural algorithmic reasoners due to their…

Machine Learning · Computer Science 2024-12-03 Kaijia Xu , Petar Veličković

Unsupervised Recurrent Neural Network Grammars

Recurrent neural network grammars (RNNG) are generative models of language which jointly model syntax and surface structure by incrementally generating a syntax tree and sentence in a top-down, left-to-right order. Supervised RNNGs achieve…

Computation and Language · Computer Science 2019-08-06 Yoon Kim , Alexander M. Rush , Lei Yu , Adhiguna Kuncoro , Chris Dyer , Gábor Melis

Recurrent Memory Networks for Language Modeling

Recurrent Neural Networks (RNN) have obtained excellent result in many natural language processing (NLP) tasks. However, understanding and interpreting the source of this success remains a challenge. In this paper, we propose Recurrent…

Computation and Language · Computer Science 2016-04-25 Ke Tran , Arianna Bisazza , Christof Monz

Recurrent Neural Network Grammars

We introduce recurrent neural network grammars, probabilistic models of sentences with explicit phrase structure. We explain efficient inference procedures that allow application to both parsing and language modeling. Experiments show that…

Computation and Language · Computer Science 2016-10-13 Chris Dyer , Adhiguna Kuncoro , Miguel Ballesteros , Noah A. Smith

Neural Programmer: Inducing Latent Programs with Gradient Descent

Deep neural networks have achieved impressive supervised classification performance in many tasks including image recognition, speech recognition, and sequence to sequence learning. However, this success has not been translated to…

Machine Learning · Computer Science 2016-08-05 Arvind Neelakantan , Quoc V. Le , Ilya Sutskever

Return of the RNN: Residual Recurrent Networks for Invertible Sentence Embeddings

This study presents a novel model for invertible sentence embeddings using a residual recurrent network trained on an unsupervised encoding task. Rather than the probabilistic outputs common to neural machine translation models, our…

Computation and Language · Computer Science 2023-04-07 Jeremy Wilkerson

What Do Recurrent Neural Network Grammars Learn About Syntax?

Recurrent neural network grammars (RNNG) are a recently proposed probabilistic generative modeling family for natural language. They show state-of-the-art language modeling and parsing performance. We investigate what information they…

Computation and Language · Computer Science 2017-01-12 Adhiguna Kuncoro , Miguel Ballesteros , Lingpeng Kong , Chris Dyer , Graham Neubig , Noah A. Smith

Which Neural Network Architecture matches Human Behavior in Artificial Grammar Learning?

In recent years artificial neural networks achieved performance close to or better than humans in several domains: tasks that were previously human prerogatives, such as language processing, have witnessed remarkable improvements in state…

Neurons and Cognition · Quantitative Biology 2019-02-14 Andrea Alamia , Victor Gauducheau , Dimitri Paisios , Rufin VanRullen

Learning Numeracy: Binary Arithmetic with Neural Turing Machines

One of the main problems encountered so far with recurrent neural networks is that they struggle to retain long-time information dependencies in their recurrent connections. Neural Turing Machines (NTMs) attempt to mitigate this issue by…

Neural and Evolutionary Computing · Computer Science 2024-12-20 Jacopo Castellini

Neural Text Generation: Past, Present and Beyond

This paper presents a systematic survey on recent development of neural text generation models. Specifically, we start from recurrent neural network language models with the traditional maximum likelihood estimation training scheme and…

Computation and Language · Computer Science 2018-03-21 Sidi Lu , Yaoming Zhu , Weinan Zhang , Jun Wang , Yong Yu

Learning to Execute

Recurrent Neural Networks (RNNs) with Long Short-Term Memory units (LSTM) are widely used because they are expressive and are easy to train. Our interest lies in empirically evaluating the expressiveness and the learnability of LSTMs in the…

Neural and Evolutionary Computing · Computer Science 2015-11-24 Wojciech Zaremba , Ilya Sutskever

Recurrent neural networks: vanishing and exploding gradients are not the end of the story

Recurrent neural networks (RNNs) notoriously struggle to learn long-term memories, primarily due to vanishing and exploding gradients. The recent success of state-space models (SSMs), a subclass of RNNs, to overcome such difficulties…

Machine Learning · Computer Science 2024-11-06 Nicolas Zucchet , Antonio Orvieto

Learning Operations on a Stack with Neural Turing Machines

Multiple extensions of Recurrent Neural Networks (RNNs) have been proposed recently to address the difficulty of storing information over long time periods. In this paper, we experiment with the capacity of Neural Turing Machines (NTMs) to…

Machine Learning · Computer Science 2016-12-05 Tristan Deleu , Joseph Dureau

Neural Additive Models: Interpretable Machine Learning with Neural Nets

Deep neural networks (DNNs) are powerful black-box predictors that have achieved impressive performance on a wide variety of tasks. However, their accuracy comes at the cost of intelligibility: it is usually unclear how they make their…

Machine Learning · Computer Science 2021-10-26 Rishabh Agarwal , Levi Melnick , Nicholas Frosst , Xuezhou Zhang , Ben Lengerich , Rich Caruana , Geoffrey Hinton

Recent Advances in Recurrent Neural Networks

Recurrent neural networks (RNNs) are capable of learning features and long term dependencies from sequential and time-series data. The RNNs have a stack of non-linear units where at least one connection between units forms a directed cycle.…

Neural and Evolutionary Computing · Computer Science 2018-02-26 Hojjat Salehinejad , Sharan Sankar , Joseph Barfett , Errol Colak , Shahrokh Valaee

A Critical Review of Recurrent Neural Networks for Sequence Learning

Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video…

Machine Learning · Computer Science 2015-10-20 Zachary C. Lipton , John Berkowitz , Charles Elkan

Recognizing Long Grammatical Sequences Using Recurrent Networks Augmented With An External Differentiable Stack

Recurrent neural networks (RNNs) are a widely used deep architecture for sequence modeling, generation, and prediction. Despite success in applications such as machine translation and voice recognition, these stateful models have several…

Computation and Language · Computer Science 2020-04-23 Ankur Mali , Alexander Ororbia , Daniel Kifer , Clyde Lee Giles

Advancing Regular Language Reasoning in Linear Recurrent Neural Networks

In recent studies, linear recurrent neural networks (LRNNs) have achieved Transformer-level performance in natural language and long-range modeling, while offering rapid parallel training and constant inference cost. With the resurgence of…

Computation and Language · Computer Science 2024-04-10 Ting-Han Fan , Ta-Chung Chi , Alexander I. Rudnicky