Related papers: Meta Module Network for Compositional Visual Reaso…

Multimodal Representations for Teacher-Guided Compositional Visual Reasoning

Neural Module Networks (NMN) are a compelling method for visual question answering, enabling the translation of a question into a program consisting of a series of reasoning sub-tasks that are sequentially executed on the image to produce…

Computation and Language · Computer Science 2023-10-25 Wafa Aissa , Marin Ferecatu , Michel Crucianu

Obtaining Faithful Interpretations from Compositional Neural Networks

Neural module networks (NMNs) are a popular approach for modeling compositionality: they achieve high accuracy when applied to problems in language and vision, while reflecting the compositional structure of the problem in the network…

Computation and Language · Computer Science 2020-09-09 Sanjay Subramanian , Ben Bogin , Nitish Gupta , Tomer Wolfson , Sameer Singh , Jonathan Berant , Matt Gardner

OC-NMN: Object-centric Compositional Neural Module Network for Generative Visual Analogical Reasoning

A key aspect of human intelligence is the ability to imagine -- composing learned concepts in novel ways -- to make sense of new scenarios. Such capacity is not yet attained for machine learning systems. In this work, in the context of…

Artificial Intelligence · Computer Science 2023-10-31 Rim Assouel , Pau Rodriguez , Perouz Taslakian , David Vazquez , Yoshua Bengio

Neural Module Networks for Reasoning over Text

Answering compositional questions that require multiple steps of reasoning against text is challenging, especially when they involve discrete, symbolic operations. Neural module networks (NMNs) learn to parse such questions as executable…

Computation and Language · Computer Science 2020-02-18 Nitish Gupta , Kevin Lin , Dan Roth , Sameer Singh , Matt Gardner

How Modular Should Neural Module Networks Be for Systematic Generalization?

Neural Module Networks (NMNs) aim at Visual Question Answering (VQA) via composition of modules that tackle a sub-task. NMNs are a promising strategy to achieve systematic generalization, i.e., overcoming biasing factors in the training…

Machine Learning · Computer Science 2022-01-19 Vanessa D'Amario , Tomotake Sasaki , Xavier Boix

Transformer Module Networks for Systematic Generalization in Visual Question Answering

Transformers achieve great performance on Visual Question Answering (VQA). However, their systematic generalization capabilities, i.e., handling novel combinations of known concepts, is unclear. We reveal that Neural Module Networks (NMNs),…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Moyuru Yamada , Vanessa D'Amario , Kentaro Takemoto , Xavier Boix , Tomotake Sasaki

Explainable Neural Computation via Stack Neural Module Networks

In complex inferential tasks like question answering, machine learning models must confront two challenges: the need to implement a compositional reasoning process, and, in many applications, the need for this reasoning process to be…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Ronghang Hu , Jacob Andreas , Trevor Darrell , Kate Saenko

Learning to Reason: End-to-End Module Networks for Visual Question Answering

Natural language questions are inherently compositional, and many are most easily answered by reasoning about their decomposition into modular sub-problems. For example, to answer "is there an equal number of balls and boxes?" we can look…

Computer Vision and Pattern Recognition · Computer Science 2017-09-13 Ronghang Hu , Jacob Andreas , Marcus Rohrbach , Trevor Darrell , Kate Saenko

Visual Question Reasoning on General Dependency Tree

The collaborative reasoning for understanding each image-question pair is very critical but under-explored for an interpretable Visual Question Answering (VQA) system. Although very recent works also tried the explicit compositional…

Computer Vision and Pattern Recognition · Computer Science 2018-04-03 Qingxing Cao , Xiaodan Liang , Bailing Li , Guanbin Li , Liang Lin

Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers

Neural Memory Networks (NMNs) have received increased attention in recent years compared to deep architectures that use a constrained memory. Despite their new appeal, the success of NMNs hinges on the ability of the gradient-based…

Computer Vision and Pattern Recognition · Computer Science 2020-11-12 Tharindu Fernando , Simon Denman , Sridha Sridharan , Clinton Fookes

Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks

Neural networks (NNs) whose subnetworks implement reusable functions are expected to offer numerous advantages, including compositionality through efficient recombination of functional building blocks, interpretability, preventing…

Neural and Evolutionary Computing · Computer Science 2021-03-09 Róbert Csordás , Sjoerd van Steenkiste , Jürgen Schmidhuber

MultiModN- Multimodal, Multi-Task, Interpretable Modular Networks

Predicting multiple real-world tasks in a single model often requires a particularly diverse feature space. Multimodal (MM) models aim to extract the synergistic predictive potential of multiple data types to create a shared feature space…

Machine Learning · Computer Science 2023-11-07 Vinitra Swamy , Malika Satayeva , Jibril Frej , Thierry Bossy , Thijs Vogels , Martin Jaggi , Tanja Käser , Mary-Anne Hartley

Weakly Supervised Neuro-Symbolic Module Networks for Numerical Reasoning

Neural Module Networks (NMNs) have been quite successful in incorporating explicit reasoning as learnable modules in various question answering tasks, including the most generic form of numerical reasoning over text in Machine Reading…

Computation and Language · Computer Science 2021-01-29 Amrita Saha , Shafiq Joty , Steven C. H. Hoi

Teaching Neural Module Networks to Do Arithmetic

Answering complex questions that require multi-step multi-type reasoning over raw text is challenging, especially when conducting numerical reasoning. Neural Module Networks(NMNs), follow the programmer-interpreter framework and design…

Computation and Language · Computer Science 2022-10-07 Jiayi Chen , Xiao-Yu Guo , Yuan-Fang Li , Gholamreza Haffari

From Shallow to Deep: Compositional Reasoning over Graphs for Visual Question Answering

In order to achieve a general visual question answering (VQA) system, it is essential to learn to answer deeper questions that require compositional reasoning on the image and external knowledge. Meanwhile, the reasoning process should be…

Computer Vision and Pattern Recognition · Computer Science 2022-06-28 Zihao Zhu

Functional Network: A Novel Framework for Interpretability of Deep Neural Networks

The layered structure of deep neural networks hinders the use of numerous analysis tools and thus the development of its interpretability. Inspired by the success of functional brain networks, we propose a novel framework for…

Machine Learning · Computer Science 2022-05-25 Ben Zhang , Zhetong Dong , Junsong Zhang , Hongwei Lin

Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention…

Computation and Language · Computer Science 2018-05-25 Juan Pavez , Héctor Allende , Héctor Allende-Cid

Neural Relational Inference with Fast Modular Meta-learning

\textit{Graph neural networks} (GNNs) are effective models for many dynamical systems consisting of entities and relations. Although most GNN applications assume a single type of entity and relation, many situations involve multiple types…

Machine Learning · Computer Science 2023-10-12 Ferran Alet , Erica Weng , Tomás Lozano Pérez , Leslie Pack Kaelbling

Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text

Answering questions that require reading texts in an image is challenging for current models. One key difficulty of this task is that rare, polysemous, and ambiguous words frequently appear in images, e.g., names of places, products, and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Difei Gao , Ke Li , Ruiping Wang , Shiguang Shan , Xilin Chen

VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks

Neural module networks (NMN) have achieved success in image-grounded tasks such as Visual Question Answering (VQA) on synthetic images. However, very limited work on NMN has been studied in the video-grounded dialogue tasks. These tasks…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Hung Le , Nancy F. Chen , Steven C. H. Hoi