Related papers: Matching Multiple Perspectives for Efficient Repre…

On the robustness of self-supervised representations for multi-view object classification

It is known that representations from self-supervised pre-training can perform on par, and often better, on various downstream tasks than representations from fully-supervised pre-training. This has been shown in a host of settings such as…

Computer Vision and Pattern Recognition · Computer Science 2022-08-02 David Torpey , Richard Klein

A Computational Account Of Self-Supervised Visual Learning From Egocentric Object Play

Research in child development has shown that embodied experience handling physical objects contributes to many cognitive abilities, including visual learning. One characteristic of such experience is that the learner sees the same object…

Computer Vision and Pattern Recognition · Computer Science 2023-06-01 Deepayan Sanyal , Joel Michelson , Yuan Yang , James Ainooson , Maithilee Kunda

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang , Simon Kornblith , Huy Thong Nguyen , Peter Chin , Maryam Khademi

Self-supervised Learning from a Multi-view Perspective

As a subset of unsupervised representation learning, self-supervised representation learning adopts self-defined signals as supervision and uses the learned representation for downstream tasks, such as object detection and image captioning.…

Machine Learning · Computer Science 2021-03-24 Yao-Hung Hubert Tsai , Yue Wu , Ruslan Salakhutdinov , Louis-Philippe Morency

Learning Representations by Maximizing Mutual Information Across Views

We propose an approach to self-supervised representation learning based on maximizing mutual information between features extracted from multiple views of a shared context. For example, one could produce multiple views of a local…

Machine Learning · Computer Science 2019-07-09 Philip Bachman , R Devon Hjelm , William Buchwalter

Visuomotor Control in Multi-Object Scenes Using Object-Aware Representations

Perceptual understanding of the scene and the relationship between its different components is important for successful completion of robotic tasks. Representation learning has been shown to be a powerful technique for this, but most of the…

Robotics · Computer Science 2023-03-14 Negin Heravi , Ayzaan Wahid , Corey Lynch , Pete Florence , Travis Armstrong , Jonathan Tompson , Pierre Sermanet , Jeannette Bohg , Debidatta Dwibedi

Towards Self-Supervised Learning of Global and Object-Centric Representations

Self-supervision allows learning meaningful representations of natural images, which usually contain one central object. How well does it transfer to multi-entity scenes? We discuss key aspects of learning structured object-centric…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Federico Baldassarre , Hossein Azizpour

Grasp2Vec: Learning Object Representations from Self-Supervised Grasping

Well structured visual representations can make robot learning faster and can improve generalization. In this paper, we study how we can acquire effective object-centric representations for robotic manipulation tasks without human labeling…

Robotics · Computer Science 2018-11-20 Eric Jang , Coline Devin , Vincent Vanhoucke , Sergey Levine

Self-Supervised Image Representation Learning: Transcending Masking with Paired Image Overlay

Self-supervised learning has become a popular approach in recent years for its ability to learn meaningful representations without the need for data annotation. This paper proposes a novel image augmentation technique, overlaying images,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-25 Yinheng Li , Han Ding , Shaofei Wang

Towards Efficient and Effective Self-Supervised Learning of Visual Representations

Self-supervision has emerged as a propitious method for visual representation learning after the recent paradigm shift from handcrafted pretext tasks to instance-similarity based approaches. Most state-of-the-art methods enforce similarity…

Computer Vision and Pattern Recognition · Computer Science 2022-10-19 Sravanti Addepalli , Kaushal Bhogale , Priyam Dey , R. Venkatesh Babu

Self-supervisory Signals for Object Discovery and Detection

In robotic applications, we often face the challenge of discovering new objects while having very little or no labelled training data. In this paper we explore the use of self-supervision provided by a robot traversing an environment to…

Computer Vision and Pattern Recognition · Computer Science 2018-06-12 Etienne Pot , Alexander Toshev , Jana Kosecka

Self-Supervised Learning for Place Representation Generalization across Appearance Changes

Visual place recognition is a key to unlocking spatial navigation for animals, humans and robots. While state-of-the-art approaches are trained in a supervised manner and therefore hardly capture the information needed for generalizing to…

Computer Vision and Pattern Recognition · Computer Science 2023-12-25 Mohamed Adel Musallam , Vincent Gaudillière , Djamila Aouada

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff , Raphaël Lopez Kaufman , Rishabh Kabra , Nick Watters , Chris Burgess , Daniel Zoran , Loic Matthey , Matthew Botvinick , Alexander Lerchner

Self-Supervised Learning of Object Parts for Semantic Segmentation

Progress in self-supervised learning has brought strong general image representation learning methods. Yet so far, it has mostly focused on image-level learning. In turn, tasks such as unsupervised image segmentation have not benefited from…

Computer Vision and Pattern Recognition · Computer Science 2022-06-22 Adrian Ziegler , Yuki M. Asano

Self-Supervision by Prediction for Object Discovery in Videos

Despite their irresistible success, deep learning algorithms still heavily rely on annotated data. On the other hand, unsupervised settings pose many challenges, especially about determining the right inductive bias in diverse scenarios.…

Computer Vision and Pattern Recognition · Computer Science 2021-03-11 Beril Besbinar , Pascal Frossard

A Survey on Self-Supervised Representation Learning

Learning meaningful representations is at the heart of many tasks in the field of modern machine learning. Recently, a lot of methods were introduced that allow learning of image representations without supervision. These representations…

Machine Learning · Computer Science 2023-08-23 Tobias Uelwer , Jan Robine , Stefan Sylvius Wagner , Marc Höftmann , Eric Upschulte , Sebastian Konietzny , Maike Behrendt , Stefan Harmeling

Unsupervised Learning of View-invariant Action Representations

The recent success in human action recognition with deep learning methods mostly adopt the supervised learning paradigm, which requires significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Junnan Li , Yongkang Wong , Qi Zhao , Mohan S. Kankanhalli

Multi-Camera View Scaling for Data-Efficient Robot Imitation Learning

The generalization ability of imitation learning policies for robotic manipulation is fundamentally constrained by the diversity of expert demonstrations, while collecting demonstrations across varied environments is costly and difficult in…

Robotics · Computer Science 2026-04-02 Yichen Xie , Yixiao Wang , Shuqi Zhao , Cheng-En Wu , Masayoshi Tomizuka , Jianwen Xie , Hao-Shu Fang

Self-Supervised Representation Learning from Flow Equivariance

Self-supervised representation learning is able to learn semantically meaningful features; however, much of its recent success relies on multiple crops of an image with very few objects. Instead of learning view-invariant representation…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Yuwen Xiong , Mengye Ren , Wenyuan Zeng , Raquel Urtasun

Self-Supervised Multi-View Learning via Auto-Encoding 3D Transformations

3D object representation learning is a fundamental challenge in computer vision to infer about the 3D world. Recent advances in deep learning have shown their efficiency in 3D object recognition, among which view-based methods have…

Computer Vision and Pattern Recognition · Computer Science 2021-03-02 Xiang Gao , Wei Hu , Guo-Jun Qi