Related papers: Visual Abductive Reasoning

AbductiveMLLM: Boosting Visual Abductive Reasoning Within MLLMs

Visual abductive reasoning (VAR) is a challenging task that requires AI systems to infer the most likely explanation for incomplete visual observations. While recent MLLMs develop strong general-purpose multimodal reasoning capabilities,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-07 Boyu Chang , Qi Wang , Xi Guo , Zhixiong Nan , Yazhou Yao , Tianfei Zhou

ACRE: Abstract Causal REasoning Beyond Covariation

Causal induction, i.e., identifying unobservable mechanisms that lead to the observable relations among variables, has played a pivotal role in modern scientific discovery, especially in scenarios with only sparse and limited data. Humans,…

Computer Vision and Pattern Recognition · Computer Science 2021-03-29 Chi Zhang , Baoxiong Jia , Mark Edmonds , Song-Chun Zhu , Yixin Zhu

The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning

Humans have remarkable capacity to reason abductively and hypothesize about what lies beyond the literal content of an image. By identifying concrete visual clues scattered throughout a scene, we almost can't help but draw probable…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Jack Hessel , Jena D. Hwang , Jae Sung Park , Rowan Zellers , Chandra Bhagavatula , Anna Rohrbach , Kate Saenko , Yejin Choi

Causal Reasoning Meets Visual Representation Learning: A Prospective Study

Visual representation learning is ubiquitous in various real-world applications, including visual comprehension, video understanding, multi-modal analysis, human-computer interaction, and urban computing. Due to the emergence of huge…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Yang Liu , Yushen Wei , Hong Yan , Guanbin Li , Liang Lin

Abductive Symbolic Solver on Abstraction and Reasoning Corpus

This paper addresses the challenge of enhancing artificial intelligence reasoning capabilities, focusing on logicality within the Abstraction and Reasoning Corpus (ARC). Humans solve such visual reasoning tasks based on their observations…

Artificial Intelligence · Computer Science 2024-11-28 Mintaek Lim , Seokki Lee , Liyew Woletemaryam Abitew , Sundong Kim

Reasoning is a Modality

The Abstraction and Reasoning Corpus (ARC) provides a compact laboratory for studying abstract reasoning, an ability central to human intelligence. Modern AI systems, including LLMs and ViTs, largely operate as sequence-of-behavior…

Artificial Intelligence · Computer Science 2026-01-21 Zhiguang Liu , Yi Shang

Reasoning in Computer Vision: Taxonomy, Models, Tasks, and Methodologies

Visual reasoning is critical for a wide range of computer vision tasks that go beyond surface-level object detection and classification. Despite notable advances in relational, symbolic, temporal, causal, and commonsense reasoning, existing…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Ayushman Sarkar , Mohd Yamani Idna Idris , Zhenyu Yu

A Benchmark for Compositional Visual Reasoning

A fundamental component of human vision is our ability to parse complex visual scenes and judge the relations between their constituent objects. AI benchmarks for visual reasoning have driven rapid progress in recent years with…

Computer Vision and Pattern Recognition · Computer Science 2022-06-14 Aimen Zerroug , Mohit Vaishnav , Julien Colin , Sebastian Musslick , Thomas Serre

Abductive Computational Systems: Creative Abduction and Future Directions

Abductive reasoning, reasoning for inferring explanations for observations, is often mentioned in scientific, design-related and artistic contexts, but its understanding varies across these domains. This paper reviews how abductive…

Artificial Intelligence · Computer Science 2025-07-14 Abhinav Sood , Kazjon Grace , Stephen Wan , Cecile Paris

Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding

Visual arguments, often used in advertising or social causes, rely on images to persuade viewers to do or believe something. Understanding these arguments requires selective vision: only specific visual stimuli within an image are relevant…

Computation and Language · Computer Science 2024-10-24 Jiwan Chung , Sungjae Lee , Minseo Kim , Seungju Han , Ashkan Yousefpour , Jack Hessel , Youngjae Yu

Visual Attention Reasoning via Hierarchical Search and Self-Verification

Multimodal Large Language Models (MLLMs) frequently hallucinate due to their reliance on fragile, linear reasoning and weak visual grounding. We propose Visual Attention Reasoning (VAR), a reinforcement learning framework that reformulates…

Artificial Intelligence · Computer Science 2026-01-27 Wei Cai , Jian Zhao , Yuchen Yuan , Tianle Zhang , Ming Zhu , Haichuan Tang , Xuelong Li

Rational Inverse Reasoning

Humans can observe a single, imperfect demonstration and immediately generalize to very different problem settings. Robots, in contrast, often require hundreds of examples and still struggle to generalize beyond the training conditions. We…

Robotics · Computer Science 2025-08-13 Ben Zandonati , Tomás Lozano-Pérez , Leslie Pack Kaelbling

Complexity of Faceted Explanations in Propositional Abduction

Abductive reasoning is a popular non-monotonic paradigm that aims to explain observed symptoms and manifestations. It has many applications, such as diagnosis and planning in artificial intelligence and database updates. In propositional…

Artificial Intelligence · Computer Science 2026-01-14 Johannes Schmidt , Mohamed Maizia , Victor Lagerkvist , Johannes K. Fichte

GAMR: A Guided Attention Model for (visual) Reasoning

Humans continue to outperform modern AI systems in their ability to flexibly parse and understand complex visual scenes. Here, we present a novel module for visual reasoning, the Guided Attention Model for (visual) Reasoning (GAMR), which…

Artificial Intelligence · Computer Science 2023-03-22 Mohit Vaishnav , Thomas Serre

Abduction and Argumentation for Explainable Machine Learning: A Position Survey

This paper presents Abduction and Argumentation as two principled forms for reasoning, and fleshes out the fundamental role that they can play within Machine Learning. It reviews the state-of-the-art work over the past few decades on the…

Artificial Intelligence · Computer Science 2020-10-27 Antonis Kakas , Loizos Michael

Abductive Commonsense Reasoning

Abductive reasoning is inference to the most plausible explanation. For example, if Jenny finds her house in a mess when she returns from work, and remembers that she left a window open, she can hypothesize that a thief broke into her house…

Computation and Language · Computer Science 2020-02-17 Chandra Bhagavatula , Ronan Le Bras , Chaitanya Malaviya , Keisuke Sakaguchi , Ari Holtzman , Hannah Rashkin , Doug Downey , Scott Wen-tau Yih , Yejin Choi

Visual Explanation by High-Level Abduction: On Answer-Set Programming Driven Reasoning about Moving Objects

We propose a hybrid architecture for systematically computing robust visual explanation(s) encompassing hypothesis formation, belief revision, and default reasoning with video data. The architecture consists of two tightly integrated…

Artificial Intelligence · Computer Science 2017-12-05 Jakob Suchan , Mehul Bhatt , Przemysław Wałęga , Carl Schultz

Abstract Visual Reasoning Enabled by Language

While artificial intelligence (AI) models have achieved human or even superhuman performance in many well-defined applications, they still struggle to show signs of broad and flexible intelligence. The Abstraction and Reasoning Corpus…

Artificial Intelligence · Computer Science 2023-06-23 Giacomo Camposampiero , Loic Houmard , Benjamin Estermann , Joël Mathys , Roger Wattenhofer

RAVEN: A Dataset for Relational and Analogical Visual rEasoNing

Dramatic progress has been witnessed in basic vision tasks involving low-level perception, such as object recognition, detection, and tracking. Unfortunately, there is still an enormous performance gap between artificial vision systems and…

Computer Vision and Pattern Recognition · Computer Science 2019-03-08 Chi Zhang , Feng Gao , Baoxiong Jia , Yixin Zhu , Song-Chun Zhu

When Thinking Drifts: Evidential Grounding for Robust Video Reasoning

Video reasoning, the task of enabling machines to infer from dynamic visual content through multi-step logic, is crucial for advanced AI. While the Chain-of-Thought (CoT) mechanism has enhanced reasoning in text-based tasks, its application…

Computer Vision and Pattern Recognition · Computer Science 2025-10-08 Mi Luo , Zihui Xue , Alex Dimakis , Kristen Grauman