Related papers: Task-Oriented Pre-Training for Drivable Area Detec…

Train No Evil: Selective Masking for Task-Guided Pre-Training

Recently, pre-trained language models mostly follow the pre-train-then-fine-tune paradigm and have achieved great performance on various downstream tasks. However, since the pre-training stage is typically task-agnostic and the…

Computation and Language · Computer Science 2020-10-08 Yuxian Gu, Zhengyan Zhang, Xiaozhi Wang, Zhiyuan Liu, Maosong Sun

Improving Semantic Segmentation via Self-Training

Deep learning usually achieves the best results with complete supervision. In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models. In this paper, we show that we…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Yi Zhu, Zhongyue Zhang, Chongruo Wu, Zhi Zhang, Tong He, Hang Zhang, R. Manmatha, Mu Li, Alexander Smola

Self-Supervised Pretraining for Aerial Road Extraction

Deep neural networks for aerial image segmentation require large amounts of labeled data, but high-quality aerial datasets with precise annotations are scarce and costly to produce. To address this limitation, we propose a self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Rupert Polley, Sai Vignesh Abishek Deenadayalan, J. Marius Zöllner

From pre-training to downstream performance: Does domain-specific pre-training make sense?

Deep learning techniques have revolutionised medical imaging, improving diagnostic accuracy and enabling both more accurate and earlier disease detection. However, the relationship between pre-training strategies and downstream performance…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Felix Krones

Location-Aware Self-Supervised Transformers for Semantic Segmentation

Pixel-level labels are particularly expensive to acquire. Hence, pretraining is a critical step to improve models on a task like semantic segmentation. However, prominent algorithms for pretraining neural networks use image-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Mathilde Caron, Neil Houlsby, Cordelia Schmid

Deep Instruction Tuning for Segment Anything Model

Recently, the Segment Anything Model (SAM) has become a research hotspot in the fields of multimedia and computer vision, exhibiting powerful yet versatile capabilities on various (un)conditional image segmentation tasks. Although SAM can…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Xiaorui Huang, Gen Luo, Chaoyang Zhu, Bo Tong, Yiyi Zhou, Xiaoshuai Sun, Rongrong Ji

A Supervised Contrastive Learning Pretrain-Finetune Approach for Time Series

Foundation models have recently gained attention within the field of machine learning thanks to their efficiency in broad data processing. While researchers have attempted to extend this success to time series models, the main challenge is…

Machine Learning · Computer Science 2023-11-22 Trang H. Tran, Lam M. Nguyen, Kyongmin Yeo, Nam Nguyen, Roman Vaculin

Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving

A holistic understanding of multiple downstream tasks simultaneously requires extracting features with better transferability. Though many recent self-supervised pre-training methods have achieved impressive…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang

Weakly-guided Self-supervised Pretraining for Temporal Activity Detection

Temporal Activity Detection aims to predict activity classes per frame, in contrast to video-level predictions in Activity Classification (i.e., Activity Recognition). Due to the expensive frame-level annotations required for detection, the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Kumara Kahatapitiya, Zhou Ren, Haoxiang Li, Zhenyu Wu, Michael S. Ryoo, Gang Hua

Back-Translated Task Adaptive Pretraining: Improving Accuracy and Robustness on Text Classification

Language models (LMs) pretrained on a large text corpus and fine-tuned on a downstream task have become a de facto training strategy for several natural language processing (NLP) tasks. Recently, an…

Computation and Language · Computer Science 2021-07-23 Junghoon Lee, Jounghee Kim, Pilsung Kang

Pre-training Text Representations as Meta Learning

Pre-training text representations has recently been shown to significantly improve the state-of-the-art in many natural language processing tasks. The central goal of pre-training is to learn text representations that are useful for…

Computation and Language · Computer Science 2020-04-14 Shangwen Lv, Yuechen Wang, Daya Guo, Duyu Tang, Nan Duan, Fuqing Zhu, Ming Gong, Linjun Shou, Ryan Ma, Daxin Jiang, Guihong Cao, Ming Zhou, Songlin Hu

Improving Genomic Models via Task-Specific Self-Pretraining

Pretraining DNA language models (DNALMs) on the full human genome is resource-intensive, yet often considered necessary for strong downstream performance. Inspired by recent findings in NLP and long-context modeling, we explore an…

Genomics · Quantitative Biology 2025-06-24 Sohan Mupparapu, Parameswari Krishnamurthy, Ratish Puduppully

Task Specific Pretraining with Noisy Labels for Remote Sensing Image Segmentation

Compared to supervised deep learning, self-supervision provides remote sensing with a tool to reduce the amount of exact, human-crafted geospatial annotations. While image-level information for unsupervised pretraining efficiently works for…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Chenying Liu, Conrad M Albrecht, Yi Wang, Xiao Xiang Zhu

FineText: Text Classification via Attention-based Language Model Fine-tuning

Training deep neural networks from scratch on natural language processing (NLP) tasks requires a significant amount of manually labeled text and substantial time to converge, requirements that customers usually cannot satisfy. In this…

Computation and Language · Computer Science 2019-10-29 Yunzhe Tao, Saurabh Gupta, Satyapriya Krishna, Xiong Zhou, Orchid Majumder, Vineet Khare

Aligned Unsupervised Pretraining of Object Detectors with Self-training

The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Ioannis Maniadis Metaxas, Adrian Bulat, Ioannis Patras, Brais Martinez, Georgios Tzimiropoulos

Rethinking Pre-training and Self-training

Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet pre-training is commonly used to initialize the backbones of object detection and segmentation models. He et al., however, show a surprising result…

Computer Vision and Pattern Recognition · Computer Science 2020-11-17 Barret Zoph, Golnaz Ghiasi, Tsung-Yi Lin, Yin Cui, Hanxiao Liu, Ekin D. Cubuk, Quoc V. Le

SMART: Self-supervised Multi-task pretrAining with contRol Transformers

Self-supervised pretraining has been extensively studied in language and vision domains, where a unified model can be easily adapted to various downstream tasks by pretraining representations without explicit labels. When it comes to…

Machine Learning · Computer Science 2023-01-25 Yanchao Sun, Shuang Ma, Ratnesh Madaan, Rogerio Bonatti, Furong Huang, Ashish Kapoor

Medical Semantic Segmentation with Diffusion Pretrain

Recent advances in deep learning have shown that learning robust feature representations is critical for the success of many computer vision tasks, including medical image segmentation. In particular, both transformer and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-03 David Li, Anvar Kurmukov, Mikhail Goncharov, Roman Sokolov, Mikhail Belyaev

TSP: Temporally-Sensitive Pretraining of Video Encoders for Localization Tasks

Due to the large memory footprint of untrimmed videos, current state-of-the-art video localization methods operate atop precomputed video clip features. These features are extracted from video encoders typically trained for trimmed action…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Humam Alwassel, Silvio Giancola, Bernard Ghanem

A generalised pre-training strategy for deep learning networks in semantic segmentation of remotely sensed images

In the segmentation of remotely sensed images, deep learning models are typically pre-trained using large image databases like ImageNet before being fine-tuned on domain-specific datasets. However, the performance of these fine-tuned models is…

Computer Vision and Pattern Recognition · Computer Science 2026-05-01 Yuan Fang, Yuanzhi Cai, Jagannath Aryal, Qinfeng Zhu, Hong Huang, Cheng Zhang, Lei Fan