Related papers: RNNs Implicitly Implement Tensor Product Represent…

Question-Answering with Grammatically-Interpretable Representations

We introduce an architecture, the Tensor Product Recurrent Network (TPRN). In our application of TPRN, internal representations learned by end-to-end optimization in a deep neural network performing a textual question-answering (QA) task…

Computation and Language · Computer Science · 2017-09-27 · Hamid Palangi, Paul Smolensky, Xiaodong He, Li Deng

Tensor Product Generation Networks for Deep NLP Modeling

We present a new approach to the design of deep networks for natural language processing (NLP), based on the general technique of Tensor Product Representations (TPRs) for encoding and processing symbol structures in distributed neural…

Computer Vision and Pattern Recognition · Computer Science · 2017-12-19 · Qiuyuan Huang, Paul Smolensky, Xiaodong He, Li Deng, Dapeng Wu

Tensor-Train Recurrent Neural Networks for Interpretable Multi-Way Financial Forecasting

Recurrent Neural Networks (RNNs) represent the de facto standard machine learning tool for sequence modelling, owing to their expressive power and memory. However, when dealing with large dimensional data, the corresponding exponential…

Machine Learning · Computer Science · 2021-05-12 · Yao Lei Xu, Giuseppe G. Calvi, Danilo P. Mandic

Mapping Natural-language Problems to Formal-language Solutions Using Structured Neural Representations

Generating formal-language programs represented by relational tuples, such as Lisp programs or mathematical operations, to solve problems stated in natural language is a challenging task because it requires explicitly capturing discrete…

Computation and Language · Computer Science · 2020-08-04 · Kezhen Chen, Qiuyuan Huang, Hamid Palangi, Paul Smolensky, Kenneth D. Forbus, Jianfeng Gao

Recursive Neural Networks Can Learn Logical Semantics

Tree-structured recursive neural networks (TreeRNNs) for sentence meaning have been successful for many applications, but it remains an open question whether the fixed-length representations that they learn can support tasks as demanding as…

Computation and Language · Computer Science · 2015-05-15 · Samuel R. Bowman, Christopher Potts, Christopher D. Manning

A Neural-Symbolic Approach to Design of CAPTCHA

CAPTCHAs based on reading text are susceptible to machine-learning-based attacks due to recent significant advances in deep learning (DL). To address this, this paper promotes CAPTCHAs based on image/visual captioning, which are robust against…

Computation and Language · Computer Science · 2018-09-27 · Qiuyuan Huang, Paul Smolensky, Xiaodong He, Li Deng, Dapeng Wu

Tensor-Train Recurrent Neural Networks for Video Classification

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science · 2017-07-07 · Yinchong Yang, Denis Krompass, Volker Tresp

A Simple Recurrent Unit with Reduced Tensor Product Representations

Widely used recurrent units, including Long Short-Term Memory (LSTM) and the Gated Recurrent Unit (GRU), perform well on natural language tasks, but their ability to learn structured representations is still questionable. Exploiting reduced…

Neural and Evolutionary Computing · Computer Science · 2019-11-06 · Shuai Tang, Paul Smolensky, Virginia R. de Sa

Predictive-State Decoders: Encoding the Future into Recurrent Networks

Recurrent neural networks (RNNs) are a vital modeling technique that rely on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are…

Machine Learning · Statistics · 2017-09-26 · Arun Venkatraman, Nicholas Rhinehart, Wen Sun, Lerrel Pinto, Martial Hebert, Byron Boots, Kris M. Kitani, J. Andrew Bagnell

Learning Robust, Transferable Sentence Representations for Text Classification

Although deep recurrent neural networks (RNNs) demonstrate strong performance in text classification, training RNN models is often expensive and requires an extensive collection of annotated data, which may not be available. To overcome the…

Computation and Language · Computer Science · 2018-10-02 · Wasi Uddin Ahmad, Xueying Bai, Nanyun Peng, Kai-Wei Chang

Sequence Transduction with Recurrent Neural Networks

Many machine learning tasks can be expressed as the transformation (or transduction) of input sequences into output sequences: speech recognition, machine translation, protein secondary structure prediction and text-to-speech to…

Neural and Evolutionary Computing · Computer Science · 2012-11-16 · Alex Graves

Compositional Structure Learning for Sequential Video Data

Conventional sequential learning methods such as Recurrent Neural Networks (RNNs) focus on interactions between consecutive inputs, i.e. first-order Markovian dependency. However, most sequential data, as seen with videos, have complex…

Machine Learning · Computer Science · 2019-07-04 · Kyoung-Woon On, Eun-Sol Kim, Yu-Jung Heo, Byoung-Tak Zhang

Tensorial Recurrent Neural Networks for Longitudinal Data Analysis

Traditional Recurrent Neural Networks assume vectorized data as inputs. However, many data from modern science and technology come in certain structures, such as tensorial time series data. To apply the recurrent neural networks for this type…

Machine Learning · Computer Science · 2017-08-02 · Mingyuan Bai, Boyan Zhang, Junbin Gao

Modeling Hierarchical Structures with Continuous Recursive Neural Networks

Recursive Neural Networks (RvNNs), which compose sequences according to their underlying hierarchical syntactic structure, have performed well in several natural language processing tasks compared to similar models without structural…

Computation and Language · Computer Science · 2021-06-14 · Jishnu Ray Chowdhury, Cornelia Caragea

Learning Phrase Representations using RNN Encoder-Decoder for Statistical Machine Translation

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes…

Computation and Language · Computer Science · 2014-09-04 · Kyunghyun Cho, Bart van Merrienboer, Caglar Gulcehre, Dzmitry Bahdanau, Fethi Bougares, Holger Schwenk, Yoshua Bengio

From Tensor Network Quantum States to Tensorial Recurrent Neural Networks

We show that any matrix product state (MPS) can be exactly represented by a recurrent neural network (RNN) with a linear memory update. We generalize this RNN architecture to 2D lattices using a multilinear memory update. It supports…

Quantum Physics · Physics · 2023-10-02 · Dian Wu, Riccardo Rossi, Filippo Vicentini, Giuseppe Carleo

Tensor Product Neural Networks for Functional ANOVA Model

Interpretability for machine learning models is becoming more and more important as machine learning models become more complex. The functional ANOVA model, which decomposes a high-dimensional function into a sum of lower dimensional…

Machine Learning · Statistics · 2025-08-01 · Seokhun Park, Insung Kong, Yongchan Choi, Chanmoo Park, Yongdai Kim

Toward Mention Detection Robustness with Recurrent Neural Networks

One of the key challenges in natural language processing (NLP) is to yield good performance across application domains and languages. In this work, we investigate the robustness of the mention detection systems, one of the fundamental tasks…

Computation and Language · Computer Science · 2016-02-26 · Thien Huu Nguyen, Avirup Sil, Georgiana Dinu, Radu Florian

Recurrent Neural Networks Learn to Store and Generate Sequences using Non-Linear Representations

The Linear Representation Hypothesis (LRH) states that neural networks learn to encode concepts as directions in activation space, and a strong version of the LRH states that models learn only such encodings. In this paper, we present a…

Machine Learning · Computer Science · 2024-08-21 · Róbert Csordás, Christopher Potts, Christopher D. Manning, Atticus Geiger

Multilingual, Multi-scale and Multi-layer Visualization of Intermediate Representations

The main alternatives nowadays for dealing with sequences are Recurrent Neural Network (RNN) architectures, Convolutional Neural Network (CNN) architectures and the Transformer. In this context, RNNs, CNNs and Transformers have most commonly been used as…

Computation and Language · Computer Science · 2019-07-02 · Carlos Escolano, Marta R. Costa-jussà, Elora Lacroux, Pere-Pau Vázquez