Related papers: Self-supervised Pre-training with Hard Examples Im…

Beyond Random Augmentations: Pretraining with Hard Views

Self-Supervised Learning (SSL) methods typically rely on random image augmentations, or views, to make models invariant to different transformations. We hypothesize that the efficacy of pretraining pipelines based on conventional random…

Computer Vision and Pattern Recognition · Computer Science 2025-02-07 Fabio Ferreira , Ivo Rapant , Jörg K. H. Franke , Frank Hutter

Discovery of Visual Semantics by Unsupervised and Self-Supervised Representation Learning

The success of deep learning in computer vision is rooted in the ability of deep networks to scale up model complexity as demanded by challenging visual tasks. As complexity is increased, so is the need for large amounts of labeled data to…

Computer Vision and Pattern Recognition · Computer Science 2017-08-22 Gustav Larsson

Semi-Supervised Learning for hyperspectral images by non parametrically predicting view assignment

Hyperspectral image (HSI) classification is gaining a lot of momentum in present time because of high inherent spectral information within the images. However, these images suffer from the problem of curse of dimensionality and usually…

Computer Vision and Pattern Recognition · Computer Science 2023-06-21 Shivam Pande , Nassim Ait Ali Braham , Yi Wang , Conrad M Albrecht , Biplab Banerjee , Xiao Xiang Zhu

Self-Supervised Pretraining Improves Self-Supervised Pretraining

While self-supervised pretraining has proven beneficial for many computer vision tasks, it requires expensive and lengthy computation, large amounts of data, and is sensitive to data augmentation. Prior work demonstrates that models…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Colorado J. Reed , Xiangyu Yue , Ani Nrusimha , Sayna Ebrahimi , Vivek Vijaykumar , Richard Mao , Bo Li , Shanghang Zhang , Devin Guillory , Sean Metzger , Kurt Keutzer , Trevor Darrell

Meta Co-Training: Two Views are Better than One

In many critical computer vision scenarios unlabeled data is plentiful, but labels are scarce and difficult to obtain. As a result, semi-supervised learning which leverages unlabeled data to boost the performance of supervised classifiers…

Computer Vision and Pattern Recognition · Computer Science 2025-05-29 Jay C. Rothenberger , Dimitrios I. Diochnos

Heterogeneous Self-Supervised Acoustic Pre-Training with Local Constraints

Self-supervised pre-training using unlabeled data is widely used in automatic speech recognition. In this paper, we propose a new self-supervised pre-training approach to dealing with heterogeneous data. Instead of mixing all the data and…

Machine Learning · Computer Science 2025-09-10 Xiaodong Cui , A F M Saif , Brian Kingsbury , Tianyi Chen

Self-supervised Heterogeneous Graph Pre-training Based on Structural Clustering

Recent self-supervised pre-training methods on Heterogeneous Information Networks (HINs) have shown promising competitiveness over traditional semi-supervised Heterogeneous Graph Neural Networks (HGNNs). Unfortunately, their performance…

Machine Learning · Computer Science 2023-04-13 Yaming Yang , Ziyu Guan , Zhe Wang , Wei Zhao , Cai Xu , Weigang Lu , Jianbin Huang

Self-Supervised Visual Representation Learning Using Lightweight Architectures

In self-supervised learning, a model is trained to solve a pretext task, using a data set whose annotations are created by a machine. The objective is to transfer the trained weights to perform a downstream task in the target domain. We…

Machine Learning · Computer Science 2021-10-22 Prathamesh Sonawane , Sparsh Drolia , Saqib Shamsi , Bhargav Jain

A Study on Self-Supervised Object Detection Pretraining

In this work, we study different approaches to self-supervised pretraining of object detection models. We first design a general framework to learn a spatially consistent dense representation from an image, by randomly sampling and…

Computer Vision and Pattern Recognition · Computer Science 2022-08-12 Trung Dang , Simon Kornblith , Huy Thong Nguyen , Peter Chin , Maryam Khademi

A Simple Data Mixing Prior for Improving Self-Supervised Learning

Data mixing (e.g., Mixup, Cutmix, ResizeMix) is an essential component for advancing recognition models. In this paper, we focus on studying its effectiveness in the self-supervised setting. By noticing the mixed images that share the same…

Computer Vision and Pattern Recognition · Computer Science 2022-06-16 Sucheng Ren , Huiyu Wang , Zhengqi Gao , Shengfeng He , Alan Yuille , Yuyin Zhou , Cihang Xie

SPDA: Superpixel-based Data Augmentation for Biomedical Image Segmentation

Supervised training a deep neural network aims to "teach" the network to mimic human visual perception that is represented by image-and-label pairs in the training data. Superpixelized (SP) images are visually perceivable to humans, but a…

Computer Vision and Pattern Recognition · Computer Science 2019-03-04 Yizhe Zhang , Lin Yang , Hao Zheng , Peixian Liang , Colleen Mangold , Raquel G. Loreto , David P. Hughes , Danny Z. Chen

Learning Deep Representations Using Convolutional Auto-encoders with Symmetric Skip Connections

Unsupervised pre-training was a critical technique for training deep neural networks years ago. With sufficient labeled data and modern training techniques, it is possible to train very deep neural networks from scratch in a purely…

Computer Vision and Pattern Recognition · Computer Science 2017-03-29 Jianfeng Dong , Xiao-Jiao Mao , Chunhua Shen , Yu-Bin Yang

USP: Unified Self-Supervised Pretraining for Image Generation and Understanding

Recent studies have highlighted the interplay between diffusion models and representation learning. Intermediate representations from diffusion models can be leveraged for downstream visual tasks, while self-supervised vision models can…

Computer Vision and Pattern Recognition · Computer Science 2025-07-23 Xiangxiang Chu , Renda Li , Yong Wang

Explorations in Self-Supervised Learning: Dataset Composition Testing for Object Classification

This paper investigates the impact of sampling and pretraining using datasets with different image characteristics on the performance of self-supervised learning (SSL) models for object classification. To do this, we sample two apartment…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Raynor Kirkson E. Chavez , Kyle Gabriel M. Reynoso

Location-Aware Self-Supervised Transformers for Semantic Segmentation

Pixel-level labels are particularly expensive to acquire. Hence, pretraining is a critical step to improve models on a task like semantic segmentation. However, prominent algorithms for pretraining neural networks use image-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Mathilde Caron , Neil Houlsby , Cordelia Schmid

A Large-Scale Analysis on Self-Supervised Video Representation Learning

Self-supervised learning is an effective way for label-free model pre-training, especially in the video domain where labeling is expensive. Existing self-supervised works in the video domain use varying experimental setups to demonstrate…

Computer Vision and Pattern Recognition · Computer Science 2023-11-22 Akash Kumar , Ashlesha Kumar , Vibhav Vineet , Yogesh Singh Rawat

Heuristic Vision Pre-Training with Self-Supervised and Supervised Multi-Task Learning

To mimic human vision with the way of recognizing the diverse and open world, foundation vision models are much critical. While recent techniques of self-supervised learning show the promising potentiality of this mission, we argue that…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Zhiming Qian

Semantic Positive Pairs for Enhancing Visual Representation Learning of Instance Discrimination Methods

Self-supervised learning algorithms (SSL) based on instance discrimination have shown promising results, performing competitively or even outperforming supervised learning counterparts in some downstream tasks. Such approaches employ data…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Mohammad Alkhalefi , Georgios Leontidis , Mingjun Zhong

Masked Self-Supervised Pre-Training for Text Recognition Transformers on Large-Scale Datasets

Self-supervised learning has emerged as a powerful approach for leveraging large-scale unlabeled data to improve model performance in various domains. In this paper, we explore masked self-supervised pre-training for text recognition…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Martin Kišš , Michal Hradiš

Scalable and Accurate Self-supervised Multimodal Representation Learning without Aligned Video and Text Data

Scaling up weakly-supervised datasets has shown to be highly effective in the image-text domain and has contributed to most of the recent state-of-the-art computer vision and multimodal neural networks. However, existing large-scale…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Vladislav Lialin , Stephen Rawls , David Chan , Shalini Ghosh , Anna Rumshisky , Wael Hamza