Related papers: Object-Centric Representation Learning with Genera…

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Learning object-centric representations of multi-object scenes is a promising approach towards machine intelligence, facilitating high-level reasoning and control from visual sensory data. However, current approaches for unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Li Nanbo , Cian Eastwood , Robert B. Fisher

Unsupervised Object-Centric Learning from Multiple Unspecified Viewpoints

Visual scenes are extremely diverse, not only because there are infinite possible combinations of objects and backgrounds but also because the observations of the same scene may vary greatly with the change of viewpoints. When observing a…

Computer Vision and Pattern Recognition · Computer Science 2024-01-05 Jinyang Yuan , Tonglin Chen , Zhimeng Shen , Bin Li , Xiangyang Xue

DynaVol: Unsupervised Learning for Dynamic Scenes through Object-Centric Voxelization

Unsupervised learning of object-centric representations in dynamic visual scenes is challenging. Unlike most previous approaches that learn to decompose 2D images, we present DynaVol, a 3D scene generative model that unifies geometric…

Computer Vision and Pattern Recognition · Computer Science 2024-01-29 Yanpeng Zhao , Siyu Gao , Yunbo Wang , Xiaokang Yang

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff , Raphaël Lopez Kaufman , Rishabh Kabra , Nick Watters , Chris Burgess , Daniel Zoran , Loic Matthey , Matthew Botvinick , Alexander Lerchner

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Perceptual understanding of the scene and the relationship between its different components is important for successful completion of robotic tasks. Representation learning has been shown to be a powerful technique for this, but most of the…

Robotics · Computer Science 2023-03-14 Negin Heravi , Ayzaan Wahid , Corey Lynch , Pete Florence , Travis Armstrong , Jonathan Tompson , Pierre Sermanet , Jeannette Bohg , Debidatta Dwibedi

Benchmarking Unsupervised Object Representations for Video Sequences

Perceiving the world in terms of objects and tracking them through time is a crucial prerequisite for reasoning and scene understanding. Recently, several methods have been proposed for unsupervised learning of object-centric…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Marissa A. Weis , Kashyap Chitta , Yash Sharma , Wieland Brendel , Matthias Bethge , Andreas Geiger , Alexander S. Ecker

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints

Visual scenes are extremely rich in diversity, not only because there are infinite combinations of objects and background, but also because the observations of the same scene may vary greatly with the change of viewpoints. When observing a…

Computer Vision and Pattern Recognition · Computer Science 2021-12-14 Jinyang Yuan , Bin Li , Xiangyang Xue

Dynamic Scene Understanding through Object-Centric Voxelization and Neural Rendering

Learning object-centric representations from unsupervised videos is challenging. Unlike most previous approaches that focus on decomposing 2D images, we present a 3D generative model named DynaVol-S for dynamic scenes that enables…

Computer Vision and Pattern Recognition · Computer Science 2025-02-17 Yanpeng Zhao , Yiwei Hao , Siyu Gao , Yunbo Wang , Xiaokang Yang

Variational Inference for Scalable 3D Object-centric Learning

We tackle the task of scalable unsupervised object-centric representation learning on 3D scenes. Existing approaches to object-centric representation learning show limitations in generalizing to larger scenes as their learning processes…

Computer Vision and Pattern Recognition · Computer Science 2023-09-26 Tianyu Wang , Kee Siong Ng , Miaomiao Liu

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization

Single-domain generalization aims to learn a model from single source domain data to achieve generalized performance on other unseen target domains. Existing works primarily focus on improving the generalization ability of static networks.…

Computer Vision and Pattern Recognition · Computer Science 2024-02-29 Deng Li , Aming Wu , Yaowei Wang , Yahong Han

Visuomotor Understanding for Representation Learning of Driving Scenes

Dashboard cameras capture a tremendous amount of driving scene video each day. These videos are purposefully coupled with vehicle sensing data, such as from the speedometer and inertial sensors, providing an additional sensing modality for…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Seokju Lee , Junsik Kim , Tae-Hyun Oh , Yongseop Jeong , Donggeun Yoo , Stephen Lin , In So Kweon

Learning Scene-specific Object Detectors Based on a Generative-Discriminative Model with Minimal Supervision

One object class may show large variations due to diverse illuminations, backgrounds and camera viewpoints. Traditional object detection methods often perform worse under unconstrained video environments. To address this problem, many…

Computer Vision and Pattern Recognition · Computer Science 2018-03-14 Dapeng Luo , Zhipeng Zeng , Nong Sang , Xiang Wu , Longsheng Wei , Quanzheng Mou , Jun Cheng , Chen Luo

Improving Viewpoint-Independent Object-Centric Representations through Active Viewpoint Selection

Given the complexities inherent in visual scenes, such as object occlusion, a comprehensive understanding often requires observation from multiple viewpoints. Existing multi-viewpoint object-centric learning methods typically employ random…

Computer Vision and Pattern Recognition · Computer Science 2024-11-04 Yinxuan Huang , Chengmin Gao , Bin Li , Xiangyang Xue

Robust and Controllable Object-Centric Learning through Energy-based Models

Humans are remarkably good at understanding and reasoning about complex visual scenes. The capability to decompose low-level observations into discrete objects allows us to build a grounded abstract representation and identify the…

Machine Learning · Computer Science 2022-10-12 Ruixiang Zhang , Tong Che , Boris Ivanovic , Renhao Wang , Marco Pavone , Yoshua Bengio , Liam Paull

SIMONe: View-Invariant, Temporally-Abstracted Object Representations via Unsupervised Video Decomposition

To help agents reason about scenes in terms of their building blocks, we wish to extract the compositional structure of any given scene (in particular, the configuration and characteristics of objects comprising the scene). This problem is…

Computer Vision and Pattern Recognition · Computer Science 2021-12-07 Rishabh Kabra , Daniel Zoran , Goker Erdogan , Loic Matthey , Antonia Creswell , Matthew Botvinick , Alexander Lerchner , Christopher P. Burgess

SPACE: Unsupervised Object-Oriented Scene Representation via Spatial Attention and Decomposition

The ability to decompose complex multi-object scenes into meaningful abstractions like objects is fundamental to achieve higher-level cognition. Previous approaches for unsupervised object-oriented scene representation learning are either…

Machine Learning · Computer Science 2020-03-17 Zhixuan Lin , Yi-Fu Wu , Skand Vishwanath Peri , Weihao Sun , Gautam Singh , Fei Deng , Jindong Jiang , Sungjin Ahn

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

Well structured visual representations can make robot learning faster and can improve generalization. In this paper, we study how we can acquire effective object-centric representations for robotic manipulation tasks without human labeling…

Robotics · Computer Science 2018-11-20 Eric Jang , Coline Devin , Vincent Vanhoucke , Sergey Levine

Learning Physical Dynamics for Object-centric Visual Prediction

The ability to model the underlying dynamics of visual scenes and reason about the future is central to human intelligence. Many attempts have been made to empower intelligent systems with such physical understanding and prediction…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Huilin Xu , Tao Chen , Feng Xu

Learning Global Object-Centric Representations via Disentangled Slot Attention

Humans can discern scene-independent features of objects across various environments, allowing them to swiftly identify objects amidst changing factors such as lighting, perspective, size, and position and imagine the complete images of the…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Tonglin Chen , Yinxuan Huang , Zhimeng Shen , Jinghao Huang , Bin Li , Xiangyang Xue

Learning 3D object-centric representation through prediction

As part of human core knowledge, the representation of objects is the building block of mental representation that supports high-level concepts and symbolic reasoning. While humans develop the ability of perceiving objects situated in 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-03-07 John Day , Tushar Arora , Jirui Liu , Li Erran Li , Ming Bo Cai