Related papers: Self-supervised Video Representation Learning Usin…

Self-Supervised Video Representation Learning by Video Incoherence Detection

This paper introduces a novel self-supervised method that leverages incoherence detection for video representation learning. It roots from the observation that visual systems of human beings can easily identify video incoherence based on…

Computer Vision and Pattern Recognition · Computer Science 2021-09-28 Haozhi Cao, Yuecong Xu, Jianfei Yang, Kezhi Mao, Lihua Xie, Jianxiong Yin, Simon See

Inter-intra Variant Dual Representations for Self-supervised Video Recognition

Contrastive learning applied to self-supervised representation learning has seen a resurgence in deep models. In this paper, we find that existing contrastive learning based solutions for self-supervised video recognition focus on…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Lin Zhang, Qi She, Zhengyang Shen, Changhu Wang

Contrastive Learning of Image Representations with Cross-Video Cycle-Consistency

Recent works have advanced the performance of self-supervised representation learning by a large margin. The core among these methods is intra-image invariance learning. Two different transformations of one image instance are considered as…

Computer Vision and Pattern Recognition · Computer Science 2021-05-14 Haiping Wu, Xiaolong Wang

Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos

Contrastive learning has recently narrowed the gap between self-supervised and supervised methods in image and video domain. State-of-the-art video contrastive learning methods such as CVRL and $\rho$-MoCo spatiotemporally augment two clips…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 David Fan, Deyu Yang, Xinyu Li, Vimal Bhat, Rohith MV

Time-Equivariant Contrastive Video Representation Learning

We introduce a novel self-supervised contrastive learning method to learn representations from unlabelled videos. Existing approaches ignore the specifics of input distortions, e.g., by learning invariance to temporal transformations.…

Computer Vision and Pattern Recognition · Computer Science 2021-12-08 Simon Jenni, Hailin Jin

Self-supervised Co-training for Video Representation Learning

The objective of this paper is visual-only self-supervised video representation learning. We make the following contributions: (i) we investigate the benefit of adding semantic-class positives to instance-based Info Noise Contrastive…

Computer Vision and Pattern Recognition · Computer Science 2021-01-13 Tengda Han, Weidi Xie, Andrew Zisserman

Self-Supervised Video Representation Learning with Motion-Contrastive Perception

Visual-only self-supervised learning has achieved significant improvement in video representation learning. Existing related methods encourage models to learn video representations by utilizing contrastive learning or designing specific…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Jinyu Liu, Ying Cheng, Yuejie Zhang, Rui-Wei Zhao, Rui Feng

Contrastive Transformation for Self-supervised Correspondence Learning

In this paper, we focus on the self-supervised learning of visual correspondence using unlabeled videos in the wild. Our method simultaneously considers intra- and inter-video representation associations for reliable correspondence…

Computer Vision and Pattern Recognition · Computer Science 2020-12-10 Ning Wang, Wengang Zhou, Houqiang Li

Self-Supervised Contrastive Learning for Videos using Differentiable Local Alignment

Robust frame-wise embeddings are essential to perform video analysis and understanding tasks. We present a self-supervised method for representation learning based on aligning temporal video sequences. Our framework uses a transformer-based…

Computer Vision and Pattern Recognition · Computer Science 2025-03-04 Keyne Oei, Amr Gomaa, Anna Maria Feit, João Belo

Supervised Contrastive Frame Aggregation for Video Representation Learning

We propose a supervised contrastive learning framework for video representation learning that leverages temporally global context. We introduce a video to image aggregation strategy that spatially arranges multiple frames from each video…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Shaif Chowdhury, Mushfika Rahman, Greg Hamerly

Continual Contrastive Learning for Image Classification

Recently, self-supervised representation learning gives further development in multimedia technology. Most existing self-supervised learning methods are applicable to packaged data. However, when it comes to streamed data, they are…

Computer Vision and Pattern Recognition · Computer Science 2022-11-03 Zhiwei Lin, Yongtao Wang, Hongxiang Lin

Cycle-Contrast for Self-Supervised Video Representation Learning

We present Cycle-Contrastive Learning (CCL), a novel self-supervised method for learning video representation. Following a nature that there is a belong and inclusion relation of video and its frames, CCL is designed to find correspondences…

Computer Vision and Pattern Recognition · Computer Science 2020-10-29 Quan Kong, Wenpeng Wei, Ziwei Deng, Tomoaki Yoshinaga, Tomokazu Murakami

Audio-Visual Contrastive Learning with Temporal Self-Supervision

We propose a self-supervised learning approach for videos that learns representations of both the RGB frames and the accompanying audio without human supervision. In contrast to images that capture the static scene appearance, videos also…

Computer Vision and Pattern Recognition · Computer Science 2023-02-16 Simon Jenni, Alexander Black, John Collomosse

Unsupervised Video Representation Learning by Bidirectional Feature Prediction

This paper introduces a novel method for self-supervised video representation learning via feature prediction. In contrast to the previous methods that focus on future feature prediction, we argue that a supervisory signal arising from…

Computer Vision and Pattern Recognition · Computer Science 2020-11-13 Nadine Behrmann, Juergen Gall, Mehdi Noroozi

Self-Supervised Representation Learning for Visual Anomaly Detection

Self-supervised learning allows for better utilization of unlabelled data. The feature representation obtained by self-supervision can be used in downstream tasks such as classification, object detection, segmentation, and anomaly…

Computer Vision and Pattern Recognition · Computer Science 2020-06-18 Rabia Ali, Muhammad Umar Karim Khan, Chong Min Kyung

Time-Contrastive Networks: Self-Supervised Learning from Video

We propose a self-supervised approach for learning representations and robotic behaviors entirely from unlabeled videos recorded from multiple viewpoints, and study how this representation can be used in two robotic imitation settings:…

Computer Vision and Pattern Recognition · Computer Science 2018-03-21 Pierre Sermanet, Corey Lynch, Yevgen Chebotar, Jasmine Hsu, Eric Jang, Stefan Schaal, Sergey Levine

Contrastive representation learning has proven to be an effective self-supervised learning method for images and videos. Most successful approaches are based on Noise Contrastive Estimation (NCE) and use different views of an instance as…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Julien Denize, Jaonary Rabarisoa, Astrid Orcesi, Romain Hérault

Self-Supervised Visual Learning by Variable Playback Speeds Prediction of a Video

We propose a self-supervised visual learning method by predicting the variable playback speeds of a video. Without semantic labels, we learn the spatio-temporal visual representation of the video by leveraging the variations in the visual…

Computer Vision and Pattern Recognition · Computer Science 2021-06-02 Hyeon Cho, Taehoon Kim, Hyung Jin Chang, Wonjun Hwang

Revisiting Contrastive Methods for Unsupervised Learning of Visual Representations

Contrastive self-supervised learning has outperformed supervised pretraining on many downstream tasks like segmentation and object detection. However, current methods are still primarily applied to curated datasets like ImageNet. In this…

Computer Vision and Pattern Recognition · Computer Science 2021-12-15 Wouter Van Gansbeke, Simon Vandenhende, Stamatios Georgoulis, Luc Van Gool

Cross-View-Prediction: Exploring Contrastive Feature for Hyperspectral Image Classification

This paper presents a self-supervised feature learning method for hyperspectral image classification. Our method tries to construct two different views of the raw hyperspectral image through a cross-representation learning method. And then…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 Anyu Zhang, Haotian Wu, Zeyu Cao