Related papers: Learning Features by Watching Objects Move

Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion

Motion, measured via optical flow, provides a powerful cue to discover and learn objects in images and videos. However, compared to using appearance, it has some blind spots, such as the fact that objects become invisible if they do not…

Computer Vision and Pattern Recognition · Computer Science 2022-10-17 Subhabrata Choudhury, Laurynas Karazija, Iro Laina, Andrea Vedaldi, Christian Rupprecht

Unsupervised learning from video to detect foreground objects in single images

Unsupervised learning from visual data is one of the most difficult challenges in computer vision, being a fundamental task for understanding how visual recognition works. From a practical point of view, learning from unsupervised visual…

Computer Vision and Pattern Recognition · Computer Science 2017-04-03 Ioana Croitoru, Simion-Vlad Bogolin, Marius Leordeanu

Multi-Object Representation Learning with Iterative Variational Inference

Human perception is structured around objects which form the basis for our higher-level cognition and impressive systematic generalization abilities. Yet most work on representation learning focuses on feature learning without even…

Machine Learning · Computer Science 2020-07-29 Klaus Greff, Raphaël Lopez Kaufman, Rishabh Kabra, Nick Watters, Chris Burgess, Daniel Zoran, Loic Matthey, Matthew Botvinick, Alexander Lerchner

Self-supervised Video Object Segmentation by Motion Grouping

Animals have evolved highly functional visual systems to understand motion, assisting perception even under complex environments. In this paper, we work towards developing a computer vision system able to segment objects by exploiting…

Computer Vision and Pattern Recognition · Computer Science 2021-08-12 Charig Yang, Hala Lamdouar, Erika Lu, Andrew Zisserman, Weidi Xie

Learning To Segment Dominant Object Motion From Watching Videos

Existing deep learning based unsupervised video object segmentation methods still rely on ground-truth segmentation masks to train. Unsupervised in this context only means that no annotated frames are used during inference. As obtaining…

Computer Vision and Pattern Recognition · Computer Science 2021-11-30 Sahir Shrestha, Mohammad Ali Armin, Hongdong Li, Nick Barnes

Unsupervised Multi-object Segmentation by Predicting Probable Motion Patterns

We propose a new approach to learn to segment multiple image objects without manual supervision. The method can extract objects from still images, but uses videos for supervision. While prior works have considered motion for segmentation, a…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Laurynas Karazija, Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

A Self-supervised Learning System for Object Detection in Videos Using Random Walks on Graphs

This paper presents a new self-supervised system for learning to detect novel and previously unseen categories of objects in images. The proposed system receives as input several unlabeled videos of scenes containing various objects. The…

Computer Vision and Pattern Recognition · Computer Science 2021-08-25 Juntao Tan, Changkyu Song, Abdeslam Boularias

Disentangling Motion, Foreground and Background Features in Videos

This paper introduces an unsupervised framework to extract semantically rich features for video representation. Inspired by how the human visual system groups objects based on motion cues, we propose a deep convolutional neural network that…

Computer Vision and Pattern Recognition · Computer Science 2017-07-18 Xunyu Lin, Victor Campos, Xavier Giro-i-Nieto, Jordi Torres, Cristian Canton Ferrer

On guiding video object segmentation

This paper presents a novel approach for segmenting moving objects in unconstrained environments using guided convolutional neural networks. This guiding process relies on foreground masks from independent algorithms (i.e. state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2019-04-26 Diego Ortego, Kevin McGuinness, Juan C. SanMiguel, Eric Arazo, José M. Martínez, Noel E. O'Connor

Unsupervised Video Representation Learning by Bidirectional Feature Prediction

This paper introduces a novel method for self-supervised video representation learning via feature prediction. In contrast to the previous methods that focus on future feature prediction, we argue that a supervisory signal arising from…

Computer Vision and Pattern Recognition · Computer Science 2020-11-13 Nadine Behrmann, Juergen Gall, Mehdi Noroozi

Unsupervised Learning of View-invariant Action Representations

The recent success in human action recognition with deep learning methods mostly adopts the supervised learning paradigm, which requires a significant amount of manually labeled data to achieve good performance. However, label collection is an…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Junnan Li, Yongkang Wong, Qi Zhao, Mohan S. Kankanhalli

Pose from Action: Unsupervised Learning of Pose Features based on Motion

Human actions are comprised of a sequence of poses. This makes videos of humans a rich and dense source of human poses. We propose an unsupervised method to learn pose features from videos that exploits a signal which is complementary to…

Computer Vision and Pattern Recognition · Computer Science 2016-09-20 Senthil Purushwalkam, Abhinav Gupta

Using Motion and Internal Supervision in Object Recognition

In this thesis we address two related aspects of visual object recognition: the use of motion information, and the use of internal supervision, to help unsupervised learning. These two aspects are inter-related in the current study, since…

Computer Vision and Pattern Recognition · Computer Science 2018-12-14 Daniel Harari

Unsupervised Learning of Object Structure and Dynamics from Videos

Extracting and predicting object structure and dynamics from videos without supervision is a major challenge in machine learning. To address this challenge, we adopt a keypoint-based image representation and learn a stochastic dynamics…

Computer Vision and Pattern Recognition · Computer Science 2020-03-03 Matthias Minderer, Chen Sun, Ruben Villegas, Forrester Cole, Kevin Murphy, Honglak Lee

Unsupervised Learning of Object Keypoints for Perception and Control

The study of object representations in computer vision has primarily focused on developing representations that are useful for image classification, object detection, or semantic segmentation as downstream tasks. In this work we aim to…

Computer Vision and Pattern Recognition · Computer Science 2019-11-21 Tejas Kulkarni, Ankush Gupta, Catalin Ionescu, Sebastian Borgeaud, Malcolm Reynolds, Andrew Zisserman, Volodymyr Mnih

Self-supervised Object-Centric Learning for Videos

Unsupervised multi-object segmentation has shown impressive results on images by utilizing powerful semantics learned from self-supervised pretraining. An additional modality such as depth or motion is often used to facilitate the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Görkay Aydemir, Weidi Xie, Fatma Güney

Object-Centric Learning for Real-World Videos by Predicting Temporal Feature Similarities

Unsupervised video-based object-centric learning is a promising avenue to learn structured representations from large, unlabeled video collections, but previous approaches have only managed to scale to real-world datasets in restricted…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Andrii Zadaianchuk, Maximilian Seitzer, Georg Martius

Unseen Object Segmentation in Videos via Transferable Representations

In order to learn object segmentation models in videos, conventional methods require a large amount of pixel-wise ground truth annotations. However, collecting such supervised data is time-consuming and labor-intensive. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2019-01-09 Yi-Wen Chen, Yi-Hsuan Tsai, Chu-Ya Yang, Yen-Yu Lin, Ming-Hsuan Yang

Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation

One major technical debt in video object segmentation is to label the object masks for training instances. As a result, we propose to prepare inexpensive, yet high quality pseudo ground truth corrected with motion cue for video object…

Computer Vision and Pattern Recognition · Computer Science 2018-12-14 Ye Wang, Jongmoo Choi, Yueru Chen, Qin Huang, Siyang Li, Ming-Sui Lee, C.-C. Jay Kuo

Video Representation Learning by Recognizing Temporal Transformations

We introduce a novel self-supervised learning approach to learn representations of videos that are responsive to changes in the motion dynamics. Our representations can be learned from data without human annotation and provide a substantial…

Computer Vision and Pattern Recognition · Computer Science 2020-07-22 Simon Jenni, Givi Meishvili, Paolo Favaro