English
Related papers

Related papers: Self-Supervised Learning for Semi-Supervised Tempo…

200 papers

Temporal action proposals are a common module in action detection pipelines today. Most current methods for training action proposal modules rely on fully supervised approaches that require large amounts of annotated temporal action…

Computer Vision and Pattern Recognition · Computer Science 2019-10-04 Jingwei Ji , Kaidi Cao , Juan Carlos Niebles

Temporal action segmentation is a task to classify each frame in the video with an action label. However, it is quite expensive to annotate every frame in a large corpus of videos to construct a comprehensive supervised training dataset.…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Zhe Wang , Hao Chen , Xinyu Li , Chunhui Liu , Yuanjun Xiong , Joseph Tighe , Charless Fowlkes

Unsupervised video representation learning has made remarkable achievements in recent years. However, most existing methods are designed and optimized for video classification. These pre-trained models can be sub-optimal for temporal…

Computer Vision and Pattern Recognition · Computer Science 2022-03-28 Can Zhang , Tianyu Yang , Junwu Weng , Meng Cao , Jue Wang , Yuexian Zou

Existing temporal action detection (TAD) methods rely on a large number of training data with segment-level annotations. Collecting and annotating such a training set is thus highly expensive and unscalable. Semi-supervised TAD (SS-TAD)…

Computer Vision and Pattern Recognition · Computer Science 2022-07-15 Sauradip Nag , Xiatian Zhu , Yi-Zhe Song , Tao Xiang

Self-supervised tasks have been utilized to build useful representations that can be used in downstream tasks when the annotation is unavailable. In this paper, we introduce a self-supervised video representation learning method based on…

Computer Vision and Pattern Recognition · Computer Science 2021-02-23 Duc Quang Vu , Ngan T. H. Le , Jia-Ching Wang

Semi-Supervised Learning can be more beneficial for the video domain compared to images because of its higher annotation cost and dimensionality. Besides, any video understanding task requires reasoning over both spatial and temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Ishan Rajendrakumar Dave , Mamshad Nayeem Rizve , Chen Chen , Mubarak Shah

Recently, temporal action localization (TAL) has garnered significant interest in information retrieval community. However, existing supervised/weakly supervised methods are heavily dependent on extensive labeled temporal boundaries and…

Computer Vision and Pattern Recognition · Computer Science 2025-04-01 Yupeng Hu , Han Jiang , Hao Liu , Kun Wang , Haoyu Tang , Liqiang Nie

The crux of semi-supervised temporal action localization (SS-TAL) lies in excavating valuable information from abundant unlabeled videos. However, current approaches predominantly focus on building models that are robust to the error-prone…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Kun Xia , Le Wang , Sanping Zhou , Gang Hua , Wei Tang

Temporal action segmentation in videos has drawn much attention recently. Timestamp supervision is a cost-effective way for this task. To obtain more information to optimize the model, the existing method generated pseudo frame-wise labels…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Yang Zhao , Yan Song

In low-level video analyses, effective representations are important to derive the correspondences between video frames. These representations have been learned in a self-supervised fashion from unlabeled images or videos, using carefully…

Computer Vision and Pattern Recognition · Computer Science 2023-06-23 Rui Li , Dong Liu

Semi-supervised learning (SSL) is a popular setting aiming to effectively utilize unlabelled data to improve model performance in downstream natural language processing (NLP) tasks. Currently, there are two popular approaches to make use of…

Computation and Language · Computer Science 2023-05-23 Zhengxiang Shi , Francesco Tonolini , Nikolaos Aletras , Emine Yilmaz , Gabriella Kazai , Yunlong Jiao

Despite the recent progress of fully-supervised action segmentation techniques, the performance is still not fully satisfactory. One main challenge is the problem of spatiotemporal variations (e.g. different people may perform the same…

Computer Vision and Pattern Recognition · Computer Science 2020-03-20 Min-Hung Chen , Baopu Li , Yingze Bao , Ghassan AlRegib , Zsolt Kira

Self-supervised approaches for video have shown impressive results in video understanding tasks. However, unlike early works that leverage temporal self-supervision, current state-of-the-art methods primarily rely on tasks from the image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Ishan Rajendrakumar Dave , Simon Jenni , Mubarak Shah

We address the problem of video representation learning without human-annotated labels. While previous efforts address the problem by designing novel self-supervised tasks using video data, the learned features are merely on a…

Computer Vision and Pattern Recognition · Computer Science 2019-04-09 Jiangliu Wang , Jianbo Jiao , Linchao Bao , Shengfeng He , Yunhui Liu , Wei Liu

Self-supervised learning is an effective way for label-free model pre-training, especially in the video domain where labeling is expensive. Existing self-supervised works in the video domain use varying experimental setups to demonstrate…

Computer Vision and Pattern Recognition · Computer Science 2023-11-22 Akash Kumar , Ashlesha Kumar , Vibhav Vineet , Yogesh Singh Rawat

Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Mohammadreza Salehi , Efstratios Gavves , Cees G. M. Snoek , Yuki M. Asano

Temporal Action Localization (TAL) aims to predict both action category and temporal boundary of action instances in untrimmed videos, i.e., start and end time. Fully-supervised solutions are usually adopted in most existing works, and…

Computer Vision and Pattern Recognition · Computer Science 2022-12-02 Ding Li , Xuebing Yang , Yongqiang Tang , Chenyang Zhang , Wensheng Zhang

We present a novel approach for unsupervised activity segmentation which uses video frame clustering as a pretext task and simultaneously performs representation learning and online clustering. This is in contrast with prior works where…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Sateesh Kumar , Sanjay Haresh , Awais Ahmed , Andrey Konin , M. Zeeshan Zia , Quoc-Huy Tran

Deep learning methods are successfully used in applications pertaining to ubiquitous computing, health, and well-being. Specifically, the area of human activity recognition (HAR) is primarily transformed by the convolutional and recurrent…

Machine Learning · Computer Science 2019-07-30 Aaqib Saeed , Tanir Ozcelebi , Johan Lukkien

Self-training has shown great potential in semi-supervised learning. Its core idea is to use the model learned on labeled data to generate pseudo-labels for unlabeled samples, and in turn teach itself. To obtain valid supervision, active…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Ye Du , Yujun Shen , Haochen Wang , Jingjing Fei , Wei Li , Liwei Wu , Rui Zhao , Zehua Fu , Qingjie Liu
‹ Prev 1 2 3 10 Next ›