Related papers: Object-Centric Scene Representations using Active …

Disentangling What and Where for 3D Object-Centric Representations Through Active Inference

Although modern object detection and classification models achieve high accuracy, these are typically constrained in advance on a fixed train set and are therefore not flexible to deal with novel, unseen object categories. Moreover, these…

Artificial Intelligence · Computer Science 2021-08-27 Toon Van de Maele, Tim Verbelen, Ozan Catal, Bart Dhoedt

Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection

Given the complexities inherent in visual scenes, such as object occlusion, a comprehensive understanding often requires observation from multiple viewpoints. Existing multi-viewpoint object-centric learning methods typically employ random…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Yinxuan Huang, Chengmin Gao, Bin Li, Xiangyang Xue

Towards Embodied Scene Description

Embodiment is an important characteristic for all intelligent agents (creatures and robots), while existing scene description tasks mainly focus on analyzing images passively and the semantic understanding of the scenario is separated from…

Robotics · Computer Science 2020-05-08 Sinan Tan, Huaping Liu, Di Guo, Xinyu Zhang, Fuchun Sun

Active Perception and Representation for Robotic Manipulation

The vast majority of visual animals actively control their eyes, heads, and/or bodies to direct their gaze toward different parts of their environment. In contrast, recent applications of reinforcement learning in robotic manipulation…

Computer Vision and Pattern Recognition · Computer Science 2020-03-17 Youssef Zaky, Gaurav Paruthi, Bryan Tripp, James Bergstra

Object Finding in Cluttered Scenes Using Interactive Perception

Object finding in clutter is a skill that requires perception of the environment and in many cases physical interaction. In robotics, interactive perception defines a set of algorithms that leverage actions to improve the perception of the…

Robotics · Computer Science 2020-06-02 Tonci Novkovic, Remi Pautrat, Fadri Furrer, Michel Breyer, Roland Siegwart, Juan Nieto

A Dynamic Data Driven Approach for Explainable Scene Understanding

Scene-understanding is an important topic in the area of Computer Vision, and illustrates computational challenges with applications to a wide range of domains including remote sensing, surveillance, smart agriculture, robotics, autonomous…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Zachary A Daniels, Dimitris Metaxas

Self-supervised Visual Reinforcement Learning with Object-centric Representations

Autonomous agents need large repertoires of skills to act reasonably on new tasks that they have not seen before. However, acquiring these skills using only a stream of high-dimensional, unstructured, and unlabeled observations is a tricky…

Machine Learning · Computer Science 2021-02-09 Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius

Learning Geometric Representations of Objects via Interaction

We address the problem of learning representations from observations of a scene involving an agent and an external object the agent interacts with. To this end, we propose a representation learning framework extracting the location in…

Machine Learning · Computer Science 2023-09-12 Alfredo Reichlin, Giovanni Luca Marchetti, Hang Yin, Anastasiia Varava, Danica Kragic

Recognizing Scenes from Novel Viewpoints

Humans can perceive scenes in 3D from a handful of 2D views. For AI agents, the ability to recognize a scene from any viewpoint given only a few images enables them to efficiently interact with the scene and its objects. In this work, we…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Shengyi Qian, Alexander Kirillov, Nikhila Ravi, Devendra Singh Chaplot, Justin Johnson, David F. Fouhey, Georgia Gkioxari

Symmetry and Complexity in Object-Centric Deep Active Inference Models

Humans perceive and interact with hundreds of objects every day. In doing so, they need to employ mental models of these objects and often exploit symmetries in the object's shape and appearance in order to learn generalizable and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-01 Stefano Ferraro, Toon Van de Maele, Tim Verbelen, Bart Dhoedt

Deep Active Inference for Autonomous Robot Navigation

Active inference is a theory that underpins the way biological agents perceive and act in the real world. At its core, active inference is based on the principle that the brain is an approximate Bayesian inference engine, building an…

Artificial Intelligence · Computer Science 2020-03-09 Ozan Çatal, Samuel Wauthier, Tim Verbelen, Cedric De Boom, Bart Dhoedt

Learning An Active Inference Model of Driver Perception and Control: Application to Vehicle Car-Following

In this paper we introduce a general estimation methodology for learning a model of human perception and control in a sensorimotor control task based upon a finite set of demonstrations. The model's structure consists of (i) the agent's…

Machine Learning · Computer Science 2025-05-02 Ran Wei, Anthony D. McDonald, Alfredo Garcia, Gustav Markkula, Johan Engstrom, Matthew O'Kelly

Object-centric proto-symbolic behavioural reasoning from pixels

Autonomous intelligent agents must bridge computational challenges at disparate levels of abstraction, from the low-level spaces of sensory input and motor commands to the high-level domain of abstract reasoning and planning. A key question…

Artificial Intelligence · Computer Science 2025-12-12 Ruben van Bergen, Justus Hübotter, Alma Lago, Pablo Lanillos

Active Inference for Robotic Manipulation

Robotic manipulation stands as a largely unsolved problem despite significant advances in robotics and machine learning in the last decades. One of the central challenges of manipulation is partial observability, as the agent usually does…

Robotics · Computer Science 2022-06-22 Tim Schneider, Boris Belousov, Hany Abdulsamad, Jan Peters

Hierarchical Representations and Explicit Memory: Learning Effective Navigation Policies on 3D Scene Graphs using Graph Neural Networks

Representations are crucial for a robot to learn effective navigation policies. Recent work has shown that mid-level perceptual abstractions, such as depth estimates or 2D semantic segmentation, lead to more effective policies when provided…

Robotics · Computer Science 2022-05-09 Zachary Ravichandran, Lisa Peng, Nathan Hughes, J. Daniel Griffith, Luca Carlone

Disentangling Shape and Pose for Object-Centric Deep Active Inference Models

Active inference is a first principles approach for understanding the brain in particular, and sentient agents in general, with the single imperative of minimizing free energy. As such, it provides a computational account for modelling…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Stefano Ferraro, Toon Van de Maele, Pietro Mazzaglia, Tim Verbelen, Bart Dhoedt

Attend, Infer, Repeat: Fast Scene Understanding with Generative Models

We present a framework for efficient inference in structured image models that explicitly reason about objects. We achieve this by performing probabilistic inference using a recurrent neural network that attends to scene elements and…

Computer Vision and Pattern Recognition · Computer Science 2016-08-15 S. M. Ali Eslami, Nicolas Heess, Theophane Weber, Yuval Tassa, David Szepesvari, Koray Kavukcuoglu, Geoffrey E. Hinton

Towards Scene Understanding with Detailed 3D Object Representations

Current approaches to semantic image and scene understanding typically employ rather simple object representations such as 2D or 3D bounding boxes. While such coarse models are robust and allow for reliable object detection, they discard…

Computer Vision and Pattern Recognition · Computer Science 2014-11-24 M. Zeeshan Zia, Michael Stark, Konrad Schindler

Active Inference in Robotics and Artificial Agents: Survey and Challenges

Active inference is a mathematical framework which originated in computational neuroscience as a theory of how the brain implements action, perception and learning. Recently, it has been shown to be a promising approach to the problems of…

Robotics · Computer Science 2021-12-06 Pablo Lanillos, Cristian Meo, Corrado Pezzato, Ajith Anil Meera, Mohamed Baioumy, Wataru Ohata, Alexander Tschantz, Beren Millidge, Martijn Wisse, Christopher L. Buckley, Jun Tani

Bootstrapping Robotic Ecological Perception from a Limited Set of Hypotheses Through Interactive Perception

To solve its task, a robot needs to have the ability to interpret its perceptions. In vision, this interpretation is particularly difficult and relies on the understanding of the structure of the scene, at least to the extent of its task…

Robotics · Computer Science 2019-01-31 Léni K. Le Goff, Ghanim Mukhtar, Alexandre Coninx, Stéphane Doncieux