English
Related papers

Related papers: Unsupervised Keypoint Learning for Guiding Class-C…

200 papers

In this paper, we consider the task of unsupervised object discovery in videos. Previous works have shown promising results via processing optical flows to segment objects. However, taking flow as input brings about two drawbacks. First,…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Shuangrui Ding , Weidi Xie , Yabo Chen , Rui Qian , Xiaopeng Zhang , Hongkai Xiong , Qi Tian

Video prediction aims to generate realistic future frames by learning dynamic visual patterns. One fundamental challenge is to deal with future uncertainty: How should a model behave when there are multiple correct, equally probable future?…

Computer Vision and Pattern Recognition · Computer Science 2018-07-10 Yunseok Jang , Gunhee Kim , Yale Song

The task of temporally detecting and segmenting actions in untrimmed videos has seen an increased attention recently. One problem in this context arises from the need to define and label action boundaries to create annotations for training…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Anna Kukleva , Hilde Kuehne , Fadime Sener , Juergen Gall

We propose a general purpose active learning algorithm for structured prediction, gathering labeled data for training a model that outputs a set of related labels for an image or video. Active learning starts with a limited initial training…

Computer Vision and Pattern Recognition · Computer Science 2017-06-16 Mehran Khodabandeh , Zhiwei Deng , Mostafa S. Ibrahim , Shinichi Satoh , Greg Mori

We propose a semantics-driven unsupervised learning approach for monocular depth and ego-motion estimation from videos in this paper. Recent unsupervised learning methods employ photometric errors between synthetic view and actual image as…

Computer Vision and Pattern Recognition · Computer Science 2020-06-09 Xiaobin Wei , Jianjiang Feng , Jie Zhou

Semi-supervised and unsupervised systems provide operators with invaluable support and can tremendously reduce the operators load. In the light of the necessity to process large volumes of video data and provide autonomous decisions, this…

Machine Learning · Statistics 2017-09-20 Olga Isupova , Danil Kuzin , Lyudmila Mihaylova

Understanding and predicting video content is essential for planning and reasoning in dynamic environments. Despite advancements, unsupervised learning of object representations and dynamics remains challenging. We present VideoPCDNet, an…

Computer Vision and Pattern Recognition · Computer Science 2025-06-25 Noel José Rodrigues Vicente , Enrique Lehner , Angel Villar-Corrales , Jan Nogga , Sven Behnke

This paper shows how an uncertainty-aware, deep neural network can be trained to detect, recognise and localise objects in 2D RGB images, in applications lacking annotated train-ng datasets. We propose a self-supervising teacher-student…

Computer Vision and Pattern Recognition · Computer Science 2024-11-06 Irum Mehboob , Li Sun , Alireza Astegarpanah , Rustam Stolkin

Observable motion in videos can give rise to the definition of objects moving with respect to the scene. The task of segmenting such moving objects is referred to as motion segmentation and is usually tackled either by aggregating motion…

Computer Vision and Pattern Recognition · Computer Science 2020-08-19 Amirhossein Kardoost , Kalun Ho , Peter Ochs , Margret Keuper

Extracting physical dynamical system parameters from recorded observations is key in natural science. Current methods for automatic parameter estimation from video train supervised deep networks on large datasets. Such datasets require…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Alejandro Castañeda Garcia , Jan van Gemert , Daan Brinks , Nergis Tömen

We propose a method to train deep networks to decompose videos into 3D geometry (camera and depth), moving objects, and their motions, with no supervision. We build on the idea of view synthesis, which uses classical camera geometry to…

Computer Vision and Pattern Recognition · Computer Science 2021-06-02 Dan Xu , Andrea Vedaldi , Joao F. Henriques

In this work, we address the challenging video scene parsing problem by developing effective representation learning methods given limited parsing annotations. In particular, we contribute two novel methods that constitute a unified parsing…

Computer Vision and Pattern Recognition · Computer Science 2016-12-14 Xiaojie Jin , Xin Li , Huaxin Xiao , Xiaohui Shen , Zhe Lin , Jimei Yang , Yunpeng Chen , Jian Dong , Luoqi Liu , Zequn Jie , Jiashi Feng , Shuicheng Yan

Deep neural networks can model images with rich latent representations, but they cannot naturally conceptualize structures of object categories in a human-perceptible way. This paper addresses the problem of learning object structures in an…

Computer Vision and Pattern Recognition · Computer Science 2018-04-13 Yuting Zhang , Yijie Guo , Yixin Jin , Yijun Luo , Zhiyuan He , Honglak Lee

Traffic videos inherently differ from generic videos in their stationary camera setup, thus providing a strong motion prior where objects often move in a specific direction over a short time interval. Existing works predominantly employ…

Computer Vision and Pattern Recognition · Computer Science 2023-11-17 Lihao Liu , Yanqi Cheng , Dongdong Chen , Jing He , Pietro Liò , Carola-Bibiane Schönlieb , Angelica I Aviles-Rivero

In recent years, dynamic vision sensors (DVS), also known as event-based cameras or neuromorphic sensors, have seen increased use due to various advantages over conventional frame-based cameras. Using principles inspired by the retina, its…

Computer Vision and Pattern Recognition · Computer Science 2018-03-15 Nicholas F. Y. Chen

Representation learning approaches typically rely on images of objects captured from a single perspective that are transformed using affine transformations. Additionally, self-supervised learning, a successful paradigm of representation…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Omiros Pantazis , Mathew Salvaris

The success of deep learning in computer vision is rooted in the ability of deep networks to scale up model complexity as demanded by challenging visual tasks. As complexity is increased, so is the need for large amounts of labeled data to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Gustav Larsson

Embodied agents must detect and localize objects of interest, e.g. traffic participants for self-driving cars. Supervision in the form of bounding boxes for this task is extremely expensive. As such, prior work has looked at unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Yihong Sun , Bharath Hariharan

We address the problem of inferring self-supervised dense semantic correspondences between objects in multi-object scenes. The method introduces learning of class-aware dense object descriptors by providing either unsupervised discrete…

Robotics · Computer Science 2021-10-06 Denis Hadjivelichkov , Dimitrios Kanoulas

How can unlabeled video augment visual learning? Existing methods perform "slow" feature analysis, encouraging the representations of temporally close frames to exhibit only small differences. While this standard approach captures the fact…

Computer Vision and Pattern Recognition · Computer Science 2016-04-15 Dinesh Jayaraman , Kristen Grauman