Related papers: Attention-based Conditioning Methods for External …

ExGate: Externally Controlled Gating for Feature-based Attention in Artificial Neural Networks

Perceptual capabilities of artificial systems have come a long way since the advent of deep learning. These methods have proven to be effective, however they are not as efficient as their biological counterparts. Visual attention is a set…

Machine Learning · Computer Science 2018-11-09 Jarryd Son , Amit Mishra

Contextualized Knowledge-aware Attentive Neural Network: Enhancing Answer Selection with Knowledge

Answer selection, which is involved in many natural language processing applications such as dialog systems and question answering (QA), is an important yet challenging task in practice, since conventional methods typically suffer from the…

Computation and Language · Computer Science 2021-04-13 Yang Deng , Yuexiang Xie , Yaliang Li , Min Yang , Wai Lam , Ying Shen

Knowledge Graph Reasoning Based on Attention GCN

We propose a novel technique to enhance Knowledge Graph Reasoning by combining Graph Convolution Neural Network (GCN) with the Attention Mechanism. This approach utilizes the Attention Mechanism to examine the relationships between entities…

Information Retrieval · Computer Science 2025-03-24 Meera Gupta , Ravi Khanna , Divya Choudhary , Nandini Rao

Recursive Feature Generation for Knowledge-based Learning

When humans perform inductive learning, they often enhance the process with background knowledge. With the increasing availability of well-formed collaborative knowledge bases, the performance of learning algorithms could be significantly…

Artificial Intelligence · Computer Science 2018-02-02 Lior Friedman , Shaul Markovitch

Enhancing knowledge retention for continual learning with domain-specific adapters and features gating

Continual learning empowers models to learn from a continuous stream of data while preserving previously acquired knowledge, effectively addressing the challenge of catastrophic forgetting. In this study, we propose a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2025-04-14 Mohamed Abbas Hedjazi , Oussama Hadjerci , Adel Hafiane

Gated Orthogonal Recurrent Units: On Learning to Forget

We present a novel recurrent neural network (RNN) based model that combines the remembering ability of unitary RNNs with the ability of gated RNNs to effectively forget redundant/irrelevant information in its memory. We achieve this by…

Machine Learning · Computer Science 2017-10-26 Li Jing , Caglar Gulcehre , John Peurifoy , Yichen Shen , Max Tegmark , Marin Soljačić , Yoshua Bengio

Focused Hierarchical RNNs for Conditional Sequence Processing

Recurrent Neural Networks (RNNs) with attention mechanisms have obtained state-of-the-art results for many sequence processing tasks. Most of these models use a simple form of encoder with attention that looks over the entire sequence and…

Machine Learning · Statistics 2018-06-13 Nan Rosemary Ke , Konrad Zolna , Alessandro Sordoni , Zhouhan Lin , Adam Trischler , Yoshua Bengio , Joelle Pineau , Laurent Charlin , Chris Pal

Refined Gate: A Simple and Effective Gating Mechanism for Recurrent Units

Recurrent neural network (RNN) has been widely studied in sequence learning tasks, while the mainstream models (e.g., LSTM and GRU) rely on the gating mechanism (in control of how information flows between hidden states). However, the…

Computer Vision and Pattern Recognition · Computer Science 2020-05-27 Zhanzhan Cheng , Yunlu Xu , Mingjian Cheng , Yu Qiao , Shiliang Pu , Yi Niu , Fei Wu

Recurrent Attention Unit

Recurrent Neural Network (RNN) has been successfully applied in many sequence learning problems. Such as handwriting recognition, image description, natural language processing and video motion analysis. After years of development,…

Machine Learning · Computer Science 2018-11-01 Guoqiang Zhong , Guohua Yue , Xiao Ling

Introducing Self-Attention to Target Attentive Graph Neural Networks

Session-based recommendation systems suggest relevant items to users by modeling user behavior and preferences using short-term anonymous sessions. Existing methods leverage Graph Neural Networks (GNNs) that propagate and aggregate…

Information Retrieval · Computer Science 2022-01-10 Sai Mitheran , Abhinav Java , Surya Kant Sahu , Arshad Shaikh

Occam's Gates

We present a complimentary objective for training recurrent neural networks (RNN) with gating units that helps with regularization and interpretability of the trained model. Attention-based RNN models have shown success in many difficult…

Machine Learning · Computer Science 2015-06-30 Jonathan Raiman , Szymon Sidor

Gated Recurrent Neural Tensor Network

Recurrent Neural Networks (RNNs), which are a powerful scheme for modeling temporal and sequential data need to capture long-term dependencies on datasets and represent them in hidden layers with a powerful model to capture more information…

Machine Learning · Computer Science 2017-06-08 Andros Tjandra , Sakriani Sakti , Ruli Manurung , Mirna Adriani , Satoshi Nakamura

Knowledge Enhanced Attention for Robust Natural Language Inference

Neural network models have been very successful at achieving high accuracy on natural language inference (NLI) tasks. However, as demonstrated in recent literature, when tested on some simple adversarial examples, most of the models suffer…

Computation and Language · Computer Science 2019-09-04 Alexander Hanbo Li , Abhinav Sethy

Convolutional Gated Recurrent Units for Medical Relation Classification

Convolutional neural network (CNN) and recurrent neural network (RNN) models have become the mainstream methods for relation classification. We propose a unified architecture, which exploits the advantages of CNN and RNN simultaneously, to…

Computation and Language · Computer Science 2018-07-31 Bin He , Yi Guan , Rui Dai

Learning Conjoint Attentions for Graph Neural Nets

In this paper, we present Conjoint Attentions (CAs), a class of novel learning-to-attend strategies for graph neural networks (GNNs). Besides considering the layer-wise node features propagated within the GNN, CAs can additionally…

Machine Learning · Computer Science 2021-12-14 Tiantian He , Yew-Soon Ong , Lu Bai

Deep Neural Network Embeddings with Gating Mechanisms for Text-Independent Speaker Verification

In this paper, gating mechanisms are applied in deep neural network (DNN) training for x-vector-based text-independent speaker verification. First, a gated convolution neural network (GCNN) is employed for modeling the frame-level embedding…

Audio and Speech Processing · Electrical Eng. & Systems 2019-04-05 Lanhua You , Wu Guo , Lirong Dai , Jun Du

Feature-based Graph Attention Networks Improve Online Continual Learning

Online continual learning for image classification is crucial for models to adapt to new data while retaining knowledge of previously learned tasks. This capability is essential to address real-world challenges involving dynamic…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Adjovi Sim , Zhengkui Wang , Aik Beng Ng , Shalini De Mello , Simon See , Wonmin Byeon

Language Modeling with Gated Convolutional Networks

The pre-dominant approach to language modeling to date is based on recurrent neural networks. Their success on this task is often linked to their ability to capture unbounded context. In this paper we develop a finite context approach…

Computation and Language · Computer Science 2017-09-12 Yann N. Dauphin , Angela Fan , Michael Auli , David Grangier

Image Captioning and Visual Question Answering Based on Attributes and External Knowledge

Much recent progress in Vision-to-Language problems has been achieved through a combination of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). This approach does not explicitly represent high-level semantic…

Computer Vision and Pattern Recognition · Computer Science 2016-12-19 Qi Wu , Chunhua Shen , Anton van den Hengel , Peng Wang , Anthony Dick

An Exploration of Conditioning Methods in Graph Neural Networks

The flexibility and effectiveness of message passing based graph neural networks (GNNs) induced considerable advances in deep learning on graph-structured data. In such approaches, GNNs recursively update node representations based on their…

Machine Learning · Computer Science 2023-05-04 Yeskendir Koishekenov , Erik J. Bekkers