English
Related papers

Related papers: Meta Module Network for Compositional Visual Reaso…

200 papers

Neural Module Networks (NMN) are a compelling method for visual question answering, enabling the translation of a question into a program consisting of a series of reasoning sub-tasks that are sequentially executed on the image to produce…

Computation and Language · Computer Science 2023-10-25 Wafa Aissa , Marin Ferecatu , Michel Crucianu

Neural module networks (NMNs) are a popular approach for modeling compositionality: they achieve high accuracy when applied to problems in language and vision, while reflecting the compositional structure of the problem in the network…

Computation and Language · Computer Science 2020-09-09 Sanjay Subramanian , Ben Bogin , Nitish Gupta , Tomer Wolfson , Sameer Singh , Jonathan Berant , Matt Gardner

A key aspect of human intelligence is the ability to imagine -- composing learned concepts in novel ways -- to make sense of new scenarios. Such capacity is not yet attained for machine learning systems. In this work, in the context of…

Artificial Intelligence · Computer Science 2023-10-31 Rim Assouel , Pau Rodriguez , Perouz Taslakian , David Vazquez , Yoshua Bengio

Answering compositional questions that require multiple steps of reasoning against text is challenging, especially when they involve discrete, symbolic operations. Neural module networks (NMNs) learn to parse such questions as executable…

Computation and Language · Computer Science 2020-02-18 Nitish Gupta , Kevin Lin , Dan Roth , Sameer Singh , Matt Gardner

Neural Module Networks (NMNs) aim at Visual Question Answering (VQA) via composition of modules that tackle a sub-task. NMNs are a promising strategy to achieve systematic generalization, i.e., overcoming biasing factors in the training…

Machine Learning · Computer Science 2022-01-19 Vanessa D'Amario , Tomotake Sasaki , Xavier Boix

Transformers achieve great performance on Visual Question Answering (VQA). However, their systematic generalization capabilities, i.e., handling novel combinations of known concepts, is unclear. We reveal that Neural Module Networks (NMNs),…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Moyuru Yamada , Vanessa D'Amario , Kentaro Takemoto , Xavier Boix , Tomotake Sasaki

In complex inferential tasks like question answering, machine learning models must confront two challenges: the need to implement a compositional reasoning process, and, in many applications, the need for this reasoning process to be…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Ronghang Hu , Jacob Andreas , Trevor Darrell , Kate Saenko

Natural language questions are inherently compositional, and many are most easily answered by reasoning about their decomposition into modular sub-problems. For example, to answer "is there an equal number of balls and boxes?" we can look…

Computer Vision and Pattern Recognition · Computer Science 2017-09-13 Ronghang Hu , Jacob Andreas , Marcus Rohrbach , Trevor Darrell , Kate Saenko

The collaborative reasoning for understanding each image-question pair is very critical but under-explored for an interpretable Visual Question Answering (VQA) system. Although very recent works also tried the explicit compositional…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Qingxing Cao , Xiaodan Liang , Bailing Li , Guanbin Li , Liang Lin

Neural Memory Networks (NMNs) have received increased attention in recent years compared to deep architectures that use a constrained memory. Despite their new appeal, the success of NMNs hinges on the ability of the gradient-based…

Computer Vision and Pattern Recognition · Computer Science 2020-11-12 Tharindu Fernando , Simon Denman , Sridha Sridharan , Clinton Fookes

Neural networks (NNs) whose subnetworks implement reusable functions are expected to offer numerous advantages, including compositionality through efficient recombination of functional building blocks, interpretability, preventing…

Neural and Evolutionary Computing · Computer Science 2021-03-09 Róbert Csordás , Sjoerd van Steenkiste , Jürgen Schmidhuber

Predicting multiple real-world tasks in a single model often requires a particularly diverse feature space. Multimodal (MM) models aim to extract the synergistic predictive potential of multiple data types to create a shared feature space…

Machine Learning · Computer Science 2023-11-07 Vinitra Swamy , Malika Satayeva , Jibril Frej , Thierry Bossy , Thijs Vogels , Martin Jaggi , Tanja Käser , Mary-Anne Hartley

Neural Module Networks (NMNs) have been quite successful in incorporating explicit reasoning as learnable modules in various question answering tasks, including the most generic form of numerical reasoning over text in Machine Reading…

Computation and Language · Computer Science 2021-01-29 Amrita Saha , Shafiq Joty , Steven C. H. Hoi

Answering complex questions that require multi-step multi-type reasoning over raw text is challenging, especially when conducting numerical reasoning. Neural Module Networks(NMNs), follow the programmer-interpreter framework and design…

Computation and Language · Computer Science 2022-10-07 Jiayi Chen , Xiao-Yu Guo , Yuan-Fang Li , Gholamreza Haffari

In order to achieve a general visual question answering (VQA) system, it is essential to learn to answer deeper questions that require compositional reasoning on the image and external knowledge. Meanwhile, the reasoning process should be…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Zihao Zhu

The layered structure of deep neural networks hinders the use of numerous analysis tools and thus the development of its interpretability. Inspired by the success of functional brain networks, we propose a novel framework for…

Machine Learning · Computer Science 2022-05-25 Ben Zhang , Zhetong Dong , Junsong Zhang , Hongwei Lin

During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention…

Computation and Language · Computer Science 2018-05-25 Juan Pavez , Héctor Allende , Héctor Allende-Cid

\textit{Graph neural networks} (GNNs) are effective models for many dynamical systems consisting of entities and relations. Although most GNN applications assume a single type of entity and relation, many situations involve multiple types…

Machine Learning · Computer Science 2023-10-12 Ferran Alet , Erica Weng , Tomás Lozano Pérez , Leslie Pack Kaelbling

Answering questions that require reading texts in an image is challenging for current models. One key difficulty of this task is that rare, polysemous, and ambiguous words frequently appear in images, e.g., names of places, products, and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Difei Gao , Ke Li , Ruiping Wang , Shiguang Shan , Xilin Chen

Neural module networks (NMN) have achieved success in image-grounded tasks such as Visual Question Answering (VQA) on synthetic images. However, very limited work on NMN has been studied in the video-grounded dialogue tasks. These tasks…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Hung Le , Nancy F. Chen , Steven C. H. Hoi
‹ Prev 1 2 3 10 Next ›