English

Self-supervised Representation Learning From Random Data Projectors

Machine Learning 2024-03-22 v2

Abstract

Self-supervised representation learning~(SSRL) has advanced considerably by exploiting the transformation invariance assumption under artificially designed data augmentations. While augmentation-based SSRL algorithms push the boundaries of performance in computer vision and natural language processing, they are often not directly applicable to other data modalities, and can conflict with application-specific data augmentation constraints. This paper presents an SSRL approach that can be applied to any data modality and network architecture because it does not rely on augmentations or masking. Specifically, we show that high-quality data representations can be learned by reconstructing random data projections. We evaluate the proposed approach on a wide range of representation learning tasks that span diverse modalities and real-world applications. We show that it outperforms multiple state-of-the-art SSRL baselines. Due to its wide applicability and strong empirical results, we argue that learning from randomness is a fruitful research direction worthy of attention and further study.

Keywords

Cite

@article{arxiv.2310.07756,
  title  = {Self-supervised Representation Learning From Random Data Projectors},
  author = {Yi Sui and Tongzi Wu and Jesse C. Cresswell and Ga Wu and George Stein and Xiao Shi Huang and Xiaochen Zhang and Maksims Volkovs},
  journal= {arXiv preprint arXiv:2310.07756},
  year   = {2024}
}

Comments

Published as a conference paper of ICLR 2024. https://openreview.net/pdf?id=EpYnZpDpsQ