English
Related papers

Related papers: RNNs Implicitly Implement Tensor Product Represent…

200 papers

We introduce an architecture, the Tensor Product Recurrent Network (TPRN). In our application of TPRN, internal representations learned by end-to-end optimization in a deep neural network performing a textual question-answering (QA) task…

Computation and Language · Computer Science 2017-09-27 Hamid Palangi , Paul Smolensky , Xiaodong He , Li Deng

We present a new approach to the design of deep networks for natural language processing (NLP), based on the general technique of Tensor Product Representations (TPRs) for encoding and processing symbol structures in distributed neural…

Computer Vision and Pattern Recognition · Computer Science 2017-12-19 Qiuyuan Huang , Paul Smolensky , Xiaodong He , Li Deng , Dapeng Wu

Recurrent Neural Networks (RNNs) represent the de facto standard machine learning tool for sequence modelling, owing to their expressive power and memory. However, when dealing with large dimensional data, the corresponding exponential…

Machine Learning · Computer Science 2021-05-12 Yao Lei Xu , Giuseppe G. Calvi , Danilo P. Mandic

Generating formal-language programs represented by relational tuples, such as Lisp programs or mathematical operations, to solve problems stated in natural language is a challenging task because it requires explicitly capturing discrete…

Computation and Language · Computer Science 2020-08-04 Kezhen Chen , Qiuyuan Huang , Hamid Palangi , Paul Smolensky , Kenneth D. Forbus , Jianfeng Gao

Tree-structured recursive neural networks (TreeRNNs) for sentence meaning have been successful for many applications, but it remains an open question whether the fixed-length representations that they learn can support tasks as demanding as…

Computation and Language · Computer Science 2015-05-15 Samuel R. Bowman , Christopher Potts , Christopher D. Manning

CAPTCHAs based on reading text are susceptible to machine-learning-based attacks due to recent significant advances in deep learning (DL). To address this, this paper promotes image/visual captioning based CAPTCHAs, which is robust against…

Computation and Language · Computer Science 2018-09-27 Qiuyuan Huang , Paul Smolensky , Xiaodong He , Li Deng , Dapeng Wu

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

idely used recurrent units, including Long-short Term Memory (LSTM) and the Gated Recurrent Unit (GRU), perform well on natural language tasks, but their ability to learn structured representations is still questionable. Exploiting reduced…

Neural and Evolutionary Computing · Computer Science 2019-11-06 Shuai Tang , Paul Smolensky , Virginia R. de Sa

Recurrent neural networks (RNNs) are a vital modeling technique that rely on internal states learned indirectly by optimization of a supervised, unsupervised, or reinforcement training loss. RNNs are used to model dynamic processes that are…

Despite deep recurrent neural networks (RNNs) demonstrate strong performance in text classification, training RNN models are often expensive and requires an extensive collection of annotated data which may not be available. To overcome the…

Computation and Language · Computer Science 2018-10-02 Wasi Uddin Ahmad , Xueying Bai , Nanyun Peng , Kai-Wei Chang

Many machine learning tasks can be expressed as the transformation---or \emph{transduction}---of input sequences into output sequences: speech recognition, machine translation, protein secondary structure prediction and text-to-speech to…

Neural and Evolutionary Computing · Computer Science 2012-11-16 Alex Graves

Conventional sequential learning methods such as Recurrent Neural Networks (RNNs) focus on interactions between consecutive inputs, i.e. first-order Markovian dependency. However, most of sequential data, as seen with videos, have complex…

Machine Learning · Computer Science 2019-07-04 Kyoung-Woon On , Eun-Sol Kim , Yu-Jung Heo , Byoung-Tak Zhang

Traditional Recurrent Neural Networks assume vectorized data as inputs. However many data from modern science and technology come in certain structures such as tensorial time series data. To apply the recurrent neural networks for this type…

Machine Learning · Computer Science 2017-08-02 Mingyuan Bai , Boyan Zhang , Junbin Gao

Recursive Neural Networks (RvNNs), which compose sequences according to their underlying hierarchical syntactic structure, have performed well in several natural language processing tasks compared to similar models without structural…

Computation and Language · Computer Science 2021-06-14 Jishnu Ray Chowdhury , Cornelia Caragea

In this paper, we propose a novel neural network model called RNN Encoder-Decoder that consists of two recurrent neural networks (RNN). One RNN encodes a sequence of symbols into a fixed-length vector representation, and the other decodes…

Computation and Language · Computer Science 2014-09-04 Kyunghyun Cho , Bart van Merrienboer , Caglar Gulcehre , Dzmitry Bahdanau , Fethi Bougares , Holger Schwenk , Yoshua Bengio

We show that any matrix product state (MPS) can be exactly represented by a recurrent neural network (RNN) with a linear memory update. We generalize this RNN architecture to 2D lattices using a multilinear memory update. It supports…

Quantum Physics · Physics 2023-10-02 Dian Wu , Riccardo Rossi , Filippo Vicentini , Giuseppe Carleo

Interpretability for machine learning models is becoming more and more important as machine learning models become more complex. The functional ANOVA model, which decomposes a high-dimensional function into a sum of lower dimensional…

Machine Learning · Statistics 2025-08-01 Seokhun Park , Insung Kong , Yongchan Choi , Chanmoo Park , Yongdai Kim

One of the key challenges in natural language processing (NLP) is to yield good performance across application domains and languages. In this work, we investigate the robustness of the mention detection systems, one of the fundamental tasks…

Computation and Language · Computer Science 2016-02-26 Thien Huu Nguyen , Avirup Sil , Georgiana Dinu , Radu Florian

The Linear Representation Hypothesis (LRH) states that neural networks learn to encode concepts as directions in activation space, and a strong version of the LRH states that models learn only such encodings. In this paper, we present a…

Machine Learning · Computer Science 2024-08-21 Róbert Csordás , Christopher Potts , Christopher D. Manning , Atticus Geiger

The main alternatives nowadays to deal with sequences are Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN) architectures and the Transformer. In this context, RNN's, CNN's and Transformer have most commonly been used as…

Computation and Language · Computer Science 2019-07-02 Carlos Escolano , Marta R. Costa-jussà , Elora Lacroux , Pere-Pau Vázquez
‹ Prev 1 2 3 10 Next ›