Related papers: Bootstrapped Representation Learning for Skeleton-…

Bootstrap your own latent: A new approach to self-supervised Learning

We introduce Bootstrap Your Own Latent (BYOL), a new approach to self-supervised image representation learning. BYOL relies on two neural networks, referred to as online and target networks, that interact and learn from each other. From an…

Machine Learning · Computer Science 2020-09-11 Jean-Bastien Grill , Florian Strub , Florent Altché , Corentin Tallec , Pierre H. Richemond , Elena Buchatskaya , Carl Doersch , Bernardo Avila Pires , Zhaohan Daniel Guo , Mohammad Gheshlaghi Azar , Bilal Piot , Koray Kavukcuoglu , Rémi Munos , Michal Valko

ReL-SAR: Representation Learning for Skeleton Action Recognition with Convolutional Transformers and BYOL

To extract robust and generalizable skeleton action recognition features, large amounts of well-curated data are typically required, which is a challenging task hindered by annotation and computation costs. Therefore, unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Safwen Naimi , Wassim Bouachir , Guillaume-Alexandre Bilodeau

Self-Supervised 3D Action Representation Learning with Skeleton Cloud Colorization

3D Skeleton-based human action recognition has attracted increasing attention in recent years. Most of the existing work focuses on supervised learning which requires a large number of labeled action sequences that are often expensive and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-17 Siyuan Yang , Jun Liu , Shijian Lu , Er Meng Hwa , Yongjian Hu , Alex C. Kot

Skeleton-Contrastive 3D Action Representation Learning

This paper strives for self-supervised learning of a feature space suitable for skeleton-based action recognition. Our proposal is built upon learning invariances to input skeleton representations and various skeleton augmentations via a…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Fida Mohammad Thoker , Hazel Doughty , Cees G. M. Snoek

Self-Supervised Skeleton-Based Action Representation Learning: A Benchmark and Beyond

Self-supervised learning (SSL), which aims to learn meaningful prior representations from unlabeled data, has been proven effective for skeleton-based action understanding. Different from the image domain, skeleton data possesses sparser…

Computer Vision and Pattern Recognition · Computer Science 2025-12-29 Jiahang Zhang , Lilang Lin , Shuai Yang , Jiaying Liu

Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences

Self-supervised learning has demonstrated remarkable capability in representation learning for skeleton-based action recognition. Existing methods mainly focus on applying global data augmentation to generate different views of the skeleton…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Yujie Zhou , Haodong Duan , Anyi Rao , Bing Su , Jiaqi Wang

MS$^2$L: Multi-Task Self-Supervised Learning for Skeleton Based Action Recognition

In this paper, we address self-supervised representation learning from human skeletons for action recognition. Previous methods, which usually learn feature representations from a single reconstruction task, may come across the overfitting…

Computer Vision and Pattern Recognition · Computer Science 2020-10-15 Lilang Lin , Sijie Song , Wenhan Yan , Jiaying Liu

BYOLMed3D: Self-Supervised Representation Learning of Medical Videos using Gradient Accumulation Assisted 3D BYOL Framework

Applications in medical image analysis suffer from an acute shortage of large volumes of data properly annotated by medical experts. Supervised learning algorithms require large volumes of balanced data to learn robust representations. Often…

Computer Vision and Pattern Recognition · Computer Science 2022-11-15 Siladittya Manna , Rakesh Dey , Souvik Chakraborty

HYperbolic Self-Paced Learning for Self-Supervised Skeleton-based Action Representations

Self-paced learning has been beneficial for tasks where some initial knowledge is available, such as weakly supervised learning and domain adaptation, to select and order the training sample sequence from easy to complex. However, its…

Computer Vision and Pattern Recognition · Computer Science 2023-03-14 Luca Franco , Paolo Mandica , Bharti Munjal , Fabio Galasso

Consensus Clustering With Unsupervised Representation Learning

Recent advances in deep clustering and unsupervised representation learning are based on the idea that different views of an input image (generated through data augmentation techniques) must either be closer in the representation space, or…

Computer Vision and Pattern Recognition · Computer Science 2021-07-09 Jayanth Reddy Regatti , Aniket Anand Deshmukh , Eren Manavoglu , Urun Dogan

BYOL works even without batch statistics

Bootstrap Your Own Latent (BYOL) is a self-supervised learning approach for image representation. From an augmented view of an image, BYOL trains an online network to predict a target network representation of a different augmented view of…

Machine Learning · Statistics 2020-10-21 Pierre H. Richemond , Jean-Bastien Grill , Florent Altché , Corentin Tallec , Florian Strub , Andrew Brock , Samuel Smith , Soham De , Razvan Pascanu , Bilal Piot , Michal Valko

Skeleton Cloud Colorization for Unsupervised 3D Action Representation Learning

Skeleton-based human action recognition has attracted increasing attention in recent years. However, most existing work focuses on supervised learning, which requires a large number of annotated action sequences that are often…

Computer Vision and Pattern Recognition · Computer Science 2021-08-10 Siyuan Yang , Jun Liu , Shijian Lu , Meng Hwa Er , Alex C. Kot

Balanced Representation Learning for Long-tailed Skeleton-based Action Recognition

Skeleton-based action recognition has recently made significant progress. However, data imbalance is still a great challenge in real-world scenarios. The performance of current action recognition algorithms declines sharply when training…

Computer Vision and Pattern Recognition · Computer Science 2025-02-24 Hongda Liu , Yunlong Wang , Min Ren , Junxing Hu , Zhengquan Luo , Guangqi Hou , Zhenan Sun

Skeleton-Snippet Contrastive Learning with Multiscale Feature Fusion for Action Localization

The self-supervised pretraining paradigm has achieved great success in learning 3D action representations for skeleton-based action recognition using contrastive learning. However, learning effective representations for skeleton-based…

Computer Vision and Pattern Recognition · Computer Science 2026-05-06 Qiushuo Cheng , Jingjing Liu , Catherine Morgan , Alan Whone , Majid Mirmehdi

Variational Contrastive Learning for Skeleton-based Action Recognition

In recent years, self-supervised representation learning for skeleton-based action recognition has advanced with the development of contrastive learning methods. However, most contrastive paradigms are inherently discriminative and often…

Computer Vision and Pattern Recognition · Computer Science 2026-01-13 Dang Dinh Nguyen , Decky Aspandi Latif , Titus Zaharia

View-Invariant Skeleton-based Action Recognition via Global-Local Contrastive Learning

Skeleton-based human action recognition has been drawing more interest recently due to its low sensitivity to appearance changes and the accessibility of more skeleton data. However, even the 3D skeletons captured in practice are still…

Computer Vision and Pattern Recognition · Computer Science 2022-09-26 Cunling Bian , Wei Feng , Fanbo Meng , Song Wang

Informative Sample Selection Model for Skeleton-based Action Recognition with Limited Training Samples

Skeleton-based human action recognition aims to classify human skeletal sequences, which are spatiotemporal representations of actions, into predefined categories. To reduce the reliance on costly annotations of skeletal sequences while…

Computer Vision and Pattern Recognition · Computer Science 2025-10-30 Zhigang Tu , Zhengbo Zhang , Jia Gong , Junsong Yuan , Bo Du

Self-supervised learning for robust voice cloning

Voice cloning is a difficult task which requires robust and informative features incorporated in a high quality TTS system in order to effectively copy an unseen speaker's voice. In our work, we utilize features learned in a self-supervised…

Sound · Computer Science 2022-11-04 Konstantinos Klapsas , Nikolaos Ellinas , Karolos Nikitaras , Georgios Vamvoukakis , Panos Kakoulidis , Konstantinos Markopoulos , Spyros Raptis , June Sig Sung , Gunu Jho , Aimilios Chalamandaris , Pirros Tsiakoulis

Self-Labeling Refinement for Robust Representation Learning with Bootstrap Your Own Latent

In this work, we have worked towards two major goals. Firstly, we have investigated the importance of Batch Normalisation (BN) layers in a non-contrastive representation learning framework called Bootstrap Your Own Latent (BYOL). We…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Siddhant Garg , Dhruval Jain

Contrastive Self-Supervised Learning for Skeleton Representations

Human skeleton point clouds are commonly used to automatically classify and predict the behaviour of others. In this paper, we use a contrastive self-supervised learning method, SimCLR, to learn representations that capture the semantics of…

Computer Vision and Pattern Recognition · Computer Science 2022-11-11 Nico Lingg , Miguel Sarabia , Luca Zappella , Barry-John Theobald