English
Related papers

Related papers: Can Active Memory Replace Attention?

200 papers

The key to a Transformer model is the self-attention mechanism, which allows the model to analyze an entire sequence in a computationally efficient manner. Recent work has suggested the possibility that general attention mechanisms used by…

Machine Learning · Computer Science 2020-01-01 Thomas Dowdell , Hongyu Zhang

Intelligence necessitates memory. Without memory, humans fail to perform various nontrivial tasks such as reading novels, playing games or solving maths. As the ultimate goal of machine learning is to derive intelligent systems that learn…

Machine Learning · Computer Science 2021-07-06 Hung Le

Attention-based Neural Machine Translation (NMT) models suffer from attention deficiency issues as has been observed in recent research. We propose a novel mechanism to address some of these limitations and improve the NMT attention.…

Computation and Language · Computer Science 2016-08-10 Baskaran Sankaran , Haitao Mi , Yaser Al-Onaizan , Abe Ittycheriah

Attention mechanisms represent a fundamental paradigm shift in neural network architectures, enabling models to selectively focus on relevant portions of input sequences through learned weighting functions. This monograph provides a…

Machine Learning · Computer Science 2026-01-08 Hasi Hays

Attention mechanism has enhanced state-of-the-art Neural Machine Translation (NMT) by jointly learning to align and translate. It tends to ignore past alignment information, however, which often leads to over-translation and…

Computation and Language · Computer Science 2016-08-09 Zhaopeng Tu , Zhengdong Lu , Yang Liu , Xiaohua Liu , Hang Li

Attention in neural machine translation provides the possibility to encode relevant parts of the source sentence at each translation step. As a result, attention is considered to be an alignment model as well. However, there is no work that…

Computation and Language · Computer Science 2017-10-11 Hamidreza Ghader , Christof Monz

A long time ago in the machine learning literature, the idea of incorporating a mechanism inspired by the human visual system into neural networks was introduced. This idea is named the attention mechanism, and it has gone through a long…

Machine Learning · Computer Science 2022-08-10 Derya Soydaner

Human attention modelling has proven, in recent years, to be particularly useful not only for understanding the cognitive processes underlying visual exploration, but also for providing support to artificial intelligence models that aim to…

Computer Vision and Pattern Recognition · Computer Science 2024-04-23 Giuseppe Cartella , Marcella Cornia , Vittorio Cuculo , Alessandro D'Amelio , Dario Zanca , Giuseppe Boccignone , Rita Cucchiara

We introduced a {\it working memory} augmented adaptive controller in our recent work. The controller uses attention to read from and write to the working memory. Attention allows the controller to read specific information that is relevant…

Systems and Control · Electrical Eng. & Systems 2020-03-23 Deepan Muthirayan , Scott Nivison , Pramod P. Khargonekar

Transformers have revolutionized deep learning in numerous fields, including natural language processing, computer vision, and audio processing. Their strength lies in their attention mechanism, which allows for the discovering of complex…

Machine Learning · Computer Science 2024-04-02 Uladzislau Yorsh , Martin Holeňa , Ondřej Bojar , David Herel

Attention is a powerful and ubiquitous mechanism for allowing neural models to focus on particular salient pieces of information by taking their weighted average when making predictions. In particular, multi-headed attention is a driving…

Computation and Language · Computer Science 2019-11-05 Paul Michel , Omer Levy , Graham Neubig

In humans, Attention is a core property of all perceptual and cognitive operations. Given our limited ability to process competing sources, attention mechanisms select, modulate, and focus on the information most relevant to behavior. For…

Machine Learning · Computer Science 2021-04-01 Alana de Santana Correia , Esther Luna Colombini

Attention models have become a crucial component in neural machine translation (NMT). They are often implicitly or explicitly used to justify the model's decision in generating a specific token but it has not yet been rigorously established…

Computation and Language · Computer Science 2019-10-02 Pooya Moradi , Nishant Kambhatla , Anoop Sarkar

In this paper we explore deep learning models with memory component or attention mechanism for question answering task. We combine and compare three models, Neural Machine Translation, Neural Turing Machine, and Memory Networks for a…

Computation and Language · Computer Science 2015-11-23 Yang Yu , Wei Zhang , Chung-Wei Hang , Bing Xiang , Bowen Zhou

Attention mechanism has become the dominant module in natural language processing models. It is computationally intensive and depends on massive power-hungry multiplications. In this paper, we rethink variants of attention mechanism from…

Computation and Language · Computer Science 2022-10-20 Yu Wan , Baosong Yang , Dayiheng Liu , Rong Xiao , Derek F. Wong , Haibo Zhang , Boxing Chen , Lidia S. Chao

We study the application of active learning techniques to the translation of unbounded data streams via interactive neural machine translation. The main idea is to select, from an unbounded stream of source sentences, those worth to be…

Computation and Language · Computer Science 2018-10-26 Álvaro Peris , Francisco Casacuberta

Attention is an important mechanism that can be employed for a variety of deep learning models across many different domains and tasks. This survey provides an overview of the most important attention mechanisms proposed in the literature.…

Machine Learning · Computer Science 2022-03-29 Gianni Brauwers , Flavius Frasincar

Attention is an increasingly popular mechanism used in a wide range of neural architectures. The mechanism itself has been realized in a variety of formats. However, because of the fast-paced advances in this domain, a systematic overview…

Computation and Language · Computer Science 2021-10-12 Andrea Galassi , Marco Lippi , Paolo Torroni

In state-of-the-art Neural Machine Translation (NMT), an attention mechanism is used during decoding to enhance the translation. At every step, the decoder uses this mechanism to focus on different parts of the source sentence to gather the…

Computation and Language · Computer Science 2018-05-31 Jean-Benoit Delbrouck , Stéphane Dupont

Conventional attention-based Neural Machine Translation (NMT) conducts dynamic alignment in generating the target sentence. By repeatedly reading the representation of source sentence, which keeps fixed after generated by the encoder…

Computation and Language · Computer Science 2016-10-18 Fandong Meng , Zhengdong Lu , Hang Li , Qun Liu
‹ Prev 1 2 3 10 Next ›