Related papers: Adaptive Memory Networks

200 papers

Most tasks in natural language processing can be cast into question answering (QA) problems over language input. We introduce the dynamic memory network (DMN), a neural network architecture which processes input sequences and questions,…

Computation and Language · Computer Science 2016-03-08 Ankit Kumar, Ozan Irsoy, Peter Ondruska, Mohit Iyyer, James Bradbury, Ishaan Gulrajani, Victor Zhong, Romain Paulus, Richard Socher

Recent advances in conversational systems have changed the search paradigm. Traditionally, a user poses a query to a search engine that returns an answer based on its index, possibly leveraging external knowledge bases and conditioning the…

Computation and Language · Computer Science 2017-12-21 Tom Kenter, Maarten de Rijke

This paper proposes the progressive attention memory network (PAMN) for movie story question answering (QA). Movie story QA is challenging compared to VQA in two aspects: (1) pinpointing the temporal parts relevant to answer the question is…

Computer Vision and Pattern Recognition · Computer Science 2019-04-19 Junyeong Kim, Minuk Ma, Kyungsu Kim, Sungjin Kim, Chang D. Yoo

Neural network architectures with memory and attention mechanisms exhibit certain reasoning capabilities required for question answering. One such architecture, the dynamic memory network (DMN), obtained high accuracy on a variety of…

Neural and Evolutionary Computing · Computer Science 2016-03-07 Caiming Xiong, Stephen Merity, Richard Socher

We examine Memory Networks for the task of question answering (QA), under a common real-world scenario where training examples are scarce and under a weakly supervised scenario, that is, only extrinsic labels are available for training. We…

Computation and Language · Computer Science 2017-03-14 Govardana Sachithanandam Ramachandran, Ajay Sohmshetty

Recurrent neural networks (RNNs) have achieved great success in language modeling. However, since RNNs have a fixed memory size, their memory cannot store all the information about the words they have seen before in the sentence, and…

Computation and Language · Computer Science 2016-11-29 Da-Rong Liu, Shun-Po Chuang, Hung-yi Lee

The Artificial Neural Networks (ANNs) have been originally designed to function like a biological neural network, but does an ANN really work in the same way as a biological neural network? As we know, the human brain holds information in…

Neural and Evolutionary Computing · Computer Science 2019-01-08 Usman Ahmad, Hong Song, Awais Bilal, Shahid Mahmood, Asad Ullah, Uzair Saeed

Conventional model quantization methods use a fixed quantization scheme to different data samples, which ignores the inherent "recognition difficulty" differences between various samples. We propose to feed different data samples with…

Computer Vision and Pattern Recognition · Computer Science 2022-04-22 Chen Tang, Haoyu Zhai, Kai Ouyang, Zhi Wang, Yifei Zhu, Wenwu Zhu

Visual question answering by using information from multiple modalities has attracted more and more attention in recent years. However, it is a very challenging task, as the visual content and natural language have quite different…

Computer Vision and Pattern Recognition · Computer Science 2020-03-13 Zhaoquan Yuan, Siyuan Sun, Lixin Duan, Xiao Wu, Changsheng Xu

Question answering is an important and difficult task in the natural language processing domain, because many basic natural language processing tasks can be cast into a question answering task. Several deep neural network architectures have…

Computation and Language · Computer Science 2017-07-10 Fenglong Ma, Radha Chitta, Saurabh Kataria, Jing Zhou, Palghat Ramesh, Tong Sun, Jing Gao

Transformers achieve strong language modeling accuracy, yet their position-wise feed-forward networks (FFNs) are dense, globally shared, and typically updated end to end. These properties create two practical tensions. First, dense FFNs…

Machine Learning · Computer Science 2026-02-10 Shashank

Neural networks excel in detecting regular patterns but are less successful in representing and manipulating complex data structures, possibly due to the lack of an external memory. This has led to the recent development of a new line of…

Artificial Intelligence · Computer Science 2018-11-29 Trang Pham, Truyen Tran, Svetha Venkatesh

Semantic matching is of central significance to the answer selection task which aims to select correct answers for a given question from a candidate answer pool. A useful method is to employ neural networks with attention to generate…

Computation and Language · Computer Science 2021-05-10 Jie Huang

Despite recent progress in memory augmented neural network (MANN) research, associative memory networks with a single external memory still show limited performance on complex relational reasoning tasks. Especially the content-based…

Machine Learning · Computer Science 2021-08-30 Taewon Park, Inchul Choi, Minho Lee

During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention…

Computation and Language · Computer Science 2018-05-25 Juan Pavez, Héctor Allende, Héctor Allende-Cid

While transformer models have been highly successful, they are computationally inefficient. We observe that for each layer, the full width of the layer may be needed only for a small subset of tokens inside a batch and that the "effective"…

Machine Learning · Computer Science 2024-12-19 Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane

This paper explores Memory-Augmented Neural Networks (MANNs), delving into how they blend human-like memory processes into AI. It covers different memory types, like sensory, short-term, and long-term memory, linking psychological theories…

Artificial Intelligence · Computer Science 2023-12-14 Savya Khosla, Zhen Zhu, Yifei He

Attentive Neural Process (ANP) improves the fitting ability of Neural Process (NP) and improves its prediction accuracy, but the higher time complexity of the model imposes a limitation on the length of the input sequence. Inspired by…

Computer Vision and Pattern Recognition · Computer Science 2022-02-07 Xiaohan Yu, Shaochen Mao

Current generation of memory-augmented neural networks has limited scalability as they cannot efficiently process data that are too large to fit in the external memory storage. One example of this is lifelong learning scenario where the…

Machine Learning · Computer Science 2018-12-12 Hyunwoo Jung, Moonsu Han, Minki Kang, Sungju Hwang

Many important NLP problems can be posed as dual-sequence or sequence-to-sequence modeling tasks. Recent advances in building end-to-end neural architectures have been highly successful in solving such tasks. In this work we propose a new…

Neural and Evolutionary Computing · Computer Science 2016-06-15 Dirk Weissenborn