Related papers: Rotating Features for Object Discovery

Binding Dynamics in Rotating Features

In human cognition, the binding problem describes the open question of how the brain flexibly integrates diverse information into cohesive object representations. Analogously, in machine learning, there is a pursuit for models capable of…

Machine Learning · Computer Science 2024-02-09 Sindy Löwe , Francesco Locatello , Max Welling

Binding via Reconstruction Clustering

Disentangled distributed representations of data are desirable for machine learning, since they are more expressive and can generalize from fewer examples. However, for complex data, the distributed representations of multiple objects…

Machine Learning · Computer Science 2016-01-21 Klaus Greff , Rupesh Kumar Srivastava , Jürgen Schmidhuber

Discovering Object-Centric Generalized Value Functions From Pixels

Deep Reinforcement Learning has shown significant progress in extracting useful representations from high-dimensional inputs albeit using hand-crafted auxiliary tasks and pseudo rewards. Automatically learning such representations in an…

Machine Learning · Computer Science 2023-06-28 Somjit Nath , Gopeshh Raaj Subbaraj , Khimya Khetarpal , Samira Ebrahimi Kahou

Complex-Valued Autoencoders for Object Discovery

Object-centric representations form the basis of human perception, and enable us to reason about the world and to systematically generalize to new settings. Currently, most works on unsupervised object discovery focus on slot-based…

Machine Learning · Computer Science 2022-11-21 Sindy Löwe , Phillip Lippe , Maja Rudolph , Max Welling

Cycle Consistency Driven Object Discovery

Developing deep learning models that effectively learn object-centric representations, akin to human cognition, remains a challenging task. Existing approaches facilitate object discovery by representing objects as fixed-size vectors,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Aniket Didolkar , Anirudh Goyal , Yoshua Bengio

On the Binding Problem in Artificial Neural Networks

Contemporary neural networks still fall short of human-level generalization, which extends far beyond our direct experiences. In this paper, we argue that the underlying cause for this shortcoming is their inability to dynamically and…

Neural and Evolutionary Computing · Computer Science 2020-12-10 Klaus Greff , Sjoerd van Steenkiste , Jürgen Schmidhuber

Does Object Binding Naturally Emerge in Large Pretrained Vision Transformers?

Object binding, the brain's ability to bind the many features that collectively represent an object into a coherent whole, is central to human cognition. It groups low-level perceptual features into high-level object representations, stores…

Computer Vision and Pattern Recognition · Computer Science 2026-01-22 Yihao Li , Saeed Salehi , Lyle Ungar , Konrad P. Kording

Contrastive Training of Complex-Valued Autoencoders for Object Discovery

Current state-of-the-art object-centric models use slots and attention-based routing for binding. However, this class of models has several conceptual limitations: the number of slots is hardwired; all slots have equal capacity; training…

Machine Learning · Computer Science 2023-11-10 Aleksandar Stanić , Anand Gopalakrishnan , Kazuki Irie , Jürgen Schmidhuber

Human Centred Object Co-Segmentation

Co-segmentation is the automatic extraction of the common semantic regions given a set of images. Different from previous approaches mainly based on object visuals, in this paper, we propose a human centred object co-segmentation approach,…

Computer Vision and Pattern Recognition · Computer Science 2016-06-14 Chenxia Wu , Jiemi Zhang , Ashutosh Saxena , Silvio Savarese

Binding and Perspective Taking as Inference in a Generative Neural Network Model

The ability to flexibly bind features into coherent wholes from different perspectives is a hallmark of cognition and intelligence. Importantly, the binding problem is not only relevant for vision but also for general intelligence,…

Machine Learning · Computer Science 2021-10-19 Mahdi Sadeghi , Fabian Schrodt , Sebastian Otte , Martin V. Butz

CTRL-O: Language-Controllable Object-Centric Visual Representation Learning

Object-centric representation learning aims to decompose visual scenes into fixed-size vectors called "slots" or "object files", where each slot captures a distinct object. Current state-of-the-art object-centric models have shown…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Aniket Didolkar , Andrii Zadaianchuk , Rabiul Awal , Maximilian Seitzer , Efstratios Gavves , Aishwarya Agrawal

Deep Object-Centric Representations for Generalizable Robot Learning

Robotic manipulation in complex open-world scenarios requires both reliable physical manipulation skills and effective and generalizable perception. In this paper, we propose a method where general purpose pretrained visual models serve as…

Robotics · Computer Science 2017-09-27 Coline Devin , Pieter Abbeel , Trevor Darrell , Sergey Levine

SlotLifter: Slot-guided Feature Lifting for Learning Object-centric Radiance Fields

The ability to distill object-centric abstractions from intricate visual scenes underpins human-level generalization. Despite the significant progress in object-centric learning methods, learning object-centric representations in the 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-08-14 Yu Liu , Baoxiong Jia , Yixin Chen , Siyuan Huang

Motion Representations for Articulated Animation

We propose novel motion representations for animating articulated objects consisting of distinct parts. In a completely unsupervised manner, our method identifies object parts, tracks them in a driving video, and infers their motions by…

Computer Vision and Pattern Recognition · Computer Science 2021-04-26 Aliaksandr Siarohin , Oliver J. Woodford , Jian Ren , Menglei Chai , Sergey Tulyakov

Neural Concept Binder

The challenge in object-based visual reasoning lies in generating concept representations that are both descriptive and distinct. Achieving this in an unsupervised manner requires human users to understand the model's learned concepts and,…

Artificial Intelligence · Computer Science 2024-10-25 Wolfgang Stammer , Antonia Wüst , David Steinmann , Kristian Kersting

Efficient Object-centric Representation Learning with Pre-trained Geometric Prior

This paper addresses key challenges in object-centric representation learning of video. While existing approaches struggle with complex scenes, we propose a novel weakly-supervised framework that emphasises geometric understanding and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Phúc H. Le Khac , Graham Healy , Alan F. Smeaton

Neural Systematic Binder

The key to high-level cognition is believed to be the ability to systematically manipulate and compose knowledge pieces. While token-like structured knowledge representations are naturally provided in text, it is elusive how to obtain them…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Gautam Singh , Yeongbin Kim , Sungjin Ahn

Object-Centric Action-Enhanced Representations for Robot Visuo-Motor Policy Learning

Learning visual representations from observing actions to benefit robot visuo-motor policy generation is a promising direction that closely resembles human cognitive function and perception. Motivated by this, and further inspired by…

Robotics · Computer Science 2025-05-28 Nikos Giannakakis , Argyris Manetas , Panagiotis P. Filntisis , Petros Maragos , George Retsinas

Simultaneous Multi-View Object Recognition and Grasping in Open-Ended Domains

To aid humans in everyday tasks, robots need to know which objects exist in the scene, where they are, and how to grasp and manipulate them in different situations. Therefore, object recognition and grasping are two key functionalities for…

Robotics · Computer Science 2022-12-07 Hamidreza Kasaei , Sha Luo , Remo Sasso , Mohammadreza Kasaei

Binding Dancers Into Attractors

To effectively perceive and process observations in our environment, feature binding and perspective taking are crucial cognitive abilities. Feature binding combines observed features into one entity, called a Gestalt. Perspective taking…

Neurons and Cognition · Quantitative Biology 2022-06-07 Franziska Kaltenberger , Sebastian Otte , Martin V. Butz