Related papers: Learning Differentiable Logic Programs for Abstrac…

Contextual RNN-GANs for Abstract Reasoning Diagram Generation

Understanding, predicting, and generating object motions and transformations is a core problem in artificial intelligence. Modeling sequences of evolving images may provide better representations and models of motion and may ultimately be…

Computer Vision and Pattern Recognition · Computer Science 2016-12-07 Arnab Ghosh, Viveka Kulharia, Amitabha Mukerjee, Vinay Namboodiri, Mohit Bansal

GAMR: A Guided Attention Model for (visual) Reasoning

Humans continue to outperform modern AI systems in their ability to flexibly parse and understand complex visual scenes. Here, we present a novel module for visual reasoning, the Guided Attention Model for (visual) Reasoning (GAMR), which…

Artificial Intelligence · Computer Science 2023-03-22 Mohit Vaishnav, Thomas Serre

RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

Dramatic progress has been witnessed in basic vision tasks involving low-level perception, such as object recognition, detection, and tracking. Unfortunately, there is still an enormous performance gap between artificial vision systems and…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Chi Zhang, Feng Gao, Baoxiong Jia, Yixin Zhu, Song-Chun Zhu

Neuro-Symbolic Forward Reasoning

Reasoning is an essential part of human intelligence and thus has been a long-standing goal in artificial intelligence research. With the recent success of deep learning, incorporating reasoning with deep learning systems, i.e.,…

Artificial Intelligence · Computer Science 2021-10-19 Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

A Neurosymbolic Agent System for Compositional Visual Reasoning

The advancement in large language models (LLMs) and large vision models has fueled the rapid progress in multi-modal vision-language reasoning capabilities. However, existing vision-language models (VLMs) remain challenged by compositional…

Computer Vision and Pattern Recognition · Computer Science 2025-10-07 Yichang Xu, Gaowen Liu, Ramana Rao Kompella, Sihao Hu, Fatih Ilhan, Selim Furkan Tekin, Zachary Yahn, Ling Liu

Vision-Language Navigation with Self-Supervised Auxiliary Reasoning Tasks

Vision-Language Navigation (VLN) is a task where agents learn to navigate following natural language instructions. The key to this task is to perceive both the visual scene and natural language sequentially. Conventional approaches exploit…

Computer Vision and Pattern Recognition · Computer Science 2020-04-02 Fengda Zhu, Yi Zhu, Xiaojun Chang, Xiaodan Liang

DAReN: A Collaborative Approach Towards Reasoning And Disentangling

Computational learning approaches to solving visual reasoning tests, such as Raven's Progressive Matrices (RPM), critically depend on the ability to identify the visual concepts used in the test (i.e., the representation) as well as the…

Machine Learning · Computer Science 2022-07-01 Pritish Sahu, Kalliopi Basioti, Vladimir Pavlovic

A Challenge to Build Neuro-Symbolic Video Agents

Modern video understanding systems excel at tasks such as scene classification, object detection, and short video retrieval. However, as video analysis becomes increasingly central to real-world applications, there is a growing need for…

Artificial Intelligence · Computer Science 2025-05-21 Sahil Shah, Harsh Goel, Sai Shankar Narasimhan, Minkyu Choi, S P Sharan, Oguzhan Akcin, Sandeep Chinchali

Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

Visual reasoning tasks such as visual question answering (VQA) require an interplay of visual perception with reasoning about the question semantics grounded in perception. However, recent advances in this area are still primarily driven by…

Machine Learning · Computer Science 2020-08-27 Saeed Amizadeh, Hamid Palangi, Oleksandr Polozov, Yichen Huang, Kazuhito Koishida

Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding

We marry two powerful ideas: deep representation learning for visual recognition and language understanding, and symbolic program execution for reasoning. Our neural-symbolic visual question answering (NS-VQA) system first recovers a…

Artificial Intelligence · Computer Science 2019-01-16 Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, Joshua B. Tenenbaum

Neural Meta-Symbolic Reasoning and Learning

Deep neural learning uses an increasing amount of computation and data to solve very specific problems. By stark contrast, human minds solve a wide range of problems using a fixed amount of computation and limited experience. One ability…

Artificial Intelligence · Computer Science 2023-12-19 Zihan Ye, Hikaru Shindo, Devendra Singh Dhami, Kristian Kersting

Neural-Symbolic Computing: An Effective Methodology for Principled Integration of Machine Learning and Reasoning

Current advances in Artificial Intelligence and machine learning in general, and deep learning in particular, have reached unprecedented impact not only across research communities, but also over popular media channels. However, concerns…

Artificial Intelligence · Computer Science 2019-05-16 Artur d'Avila Garcez, Marco Gori, Luis C. Lamb, Luciano Serafini, Michael Spranger, Son N. Tran

Explainable and Explicit Visual Reasoning over Scene Graphs

We aim to dismantle the prevalent black-box neural architectures used in complex visual reasoning tasks, into the proposed eXplainable and eXplicit Neural Modules (XNMs), which advance beyond existing neural module networks towards using…

Computer Vision and Pattern Recognition · Computer Science 2019-03-20 Jiaxin Shi, Hanwang Zhang, Juanzi Li

Differentiable Scene Graphs

Reasoning about complex visual scenes involves perception of entities and their relations. Scene graphs provide a natural representation for reasoning tasks, by assigning labels to both entities (nodes) and relations (edges). Unfortunately,…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Moshiko Raboh, Roei Herzig, Gal Chechik, Jonathan Berant, Amir Globerson

Deep Neural Networks for Visual Reasoning

Visual perception and language understanding are fundamental components of human intelligence, enabling them to understand and reason about objects and their interactions. It is crucial for machines to have this capacity to reason using…

Computer Vision and Pattern Recognition · Computer Science 2022-09-27 Thao Minh Le

Differentiable Fuzzy Neural Networks for Recommender Systems

As recommender systems become increasingly complex, transparency is essential to increase user trust, accountability, and regulatory compliance. Neuro-symbolic approaches that integrate symbolic reasoning with sub-symbolic learning offer a…

Machine Learning · Computer Science 2025-05-12 Stephan Bartl, Kevin Innerebner, Elisabeth Lex

A Multi-Modal Neuro-Symbolic Approach for Spatial Reasoning-Based Visual Grounding in Robotics

Visual reasoning, particularly spatial reasoning, is a challenging cognitive task that requires understanding object relationships and their interactions within complex environments, especially in the robotics domain. Existing vision-language…

Robotics · Computer Science 2025-11-03 Simindokht Jahangard, Mehrzad Mohammadi, Abhinav Dhall, Hamid Rezatofighi

Augmented Vision-Language Models: A Systematic Review

Recent advances in visual-language machine learning models have demonstrated exceptional ability to use natural language and understand visual scenes by training on large, unstructured datasets. However, this training paradigm cannot…

Computation and Language · Computer Science 2025-08-01 Anthony C Davis, Burhan Sadiq, Tianmin Shu, Chien-Ming Huang

Learning to reason over visual objects

A core component of human intelligence is the ability to identify abstract patterns inherent in complex, high-dimensional perceptual data, as exemplified by visual reasoning tasks such as Raven's Progressive Matrices (RPM). Motivated by the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-30 Shanka Subhra Mondal, Taylor Webb, Jonathan D. Cohen

NePTune: A Neuro-Pythonic Framework for Tunable Compositional Reasoning on Vision-Language

Modern Vision-Language Models (VLMs) have achieved impressive performance in various tasks, yet they often struggle with compositional reasoning, the ability to decompose and recombine concepts to solve novel problems. While neuro-symbolic…

Artificial Intelligence · Computer Science 2025-10-01 Danial Kamali, Parisa Kordjamshidi