English
Related papers

Related papers: Task-Oriented Pre-Training for Drivable Area Detec…

200 papers

Recently, pre-trained language models mostly follow the pre-train-then-fine-tuning paradigm and have achieved great performance on various downstream tasks. However, since the pre-training stage is typically task-agnostic and the…

Computation and Language · Computer Science 2020-10-08 Yuxian Gu , Zhengyan Zhang , Xiaozhi Wang , Zhiyuan Liu , Maosong Sun

Deep learning usually achieves the best results with complete supervision. In the case of semantic segmentation, this means that large amounts of pixelwise annotations are required to learn accurate models. In this paper, we show that we…

Computer Vision and Pattern Recognition · Computer Science 2020-05-07 Yi Zhu , Zhongyue Zhang , Chongruo Wu , Zhi Zhang , Tong He , Hang Zhang , R. Manmatha , Mu Li , Alexander Smola

Deep neural networks for aerial image segmentation require large amounts of labeled data, but high-quality aerial datasets with precise annotations are scarce and costly to produce. To address this limitation, we propose a self-supervised…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Rupert Polley , Sai Vignesh Abishek Deenadayalan , J. Marius Zöllner

Deep learning techniques have revolutionised medical imaging, improving diagnostic accuracy and enabling both more accurate and earlier disease detection. However, the relationship between pre-training strategies and downstream performance…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Felix Krones

Pixel-level labels are particularly expensive to acquire. Hence, pretraining is a critical step to improve models on a task like semantic segmentation. However, prominent algorithms for pretraining neural networks use image-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-17 Mathilde Caron , Neil Houlsby , Cordelia Schmid

Recently, Segment Anything Model (SAM) has become a research hotspot in the fields of multimedia and computer vision, which exhibits powerful yet versatile capabilities on various (un) conditional image segmentation tasks. Although SAM can…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Xiaorui Huang , Gen Luo , Chaoyang Zhu , Bo Tong , Yiyi Zhou , Xiaoshuai Sun , Rongrong Ji

Foundation models have recently gained attention within the field of machine learning thanks to its efficiency in broad data processing. While researchers had attempted to extend this success to time series models, the main challenge is…

Machine Learning · Computer Science 2023-11-22 Trang H. Tran , Lam M. Nguyen , Kyongmin Yeo , Nam Nguyen , Roman Vaculin

Aiming towards a holistic understanding of multiple downstream tasks simultaneously, there is a need for extracting features with better transferability. Though many latest self-supervised pre-training methods have achieved impressive…

Computer Vision and Pattern Recognition · Computer Science 2022-09-20 Xiwen Liang , Yangxin Wu , Jianhua Han , Hang Xu , Chunjing Xu , Xiaodan Liang

Temporal Activity Detection aims to predict activity classes per frame, in contrast to video-level predictions in Activity Classification (i.e., Activity Recognition). Due to the expensive frame-level annotations required for detection, the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-07 Kumara Kahatapitiya , Zhou Ren , Haoxiang Li , Zhenyu Wu , Michael S. Ryoo , Gang Hua

Language models (LMs) pretrained on a large text corpus and fine-tuned on a downstream text corpus and fine-tuned on a downstream task becomes a de facto training strategy for several natural language processing (NLP) tasks. Recently, an…

Computation and Language · Computer Science 2021-07-23 Junghoon Lee , Jounghee Kim , Pilsung Kang

Pre-training text representations has recently been shown to significantly improve the state-of-the-art in many natural language processing tasks. The central goal of pre-training is to learn text representations that are useful for…

Computation and Language · Computer Science 2020-04-14 Shangwen Lv , Yuechen Wang , Daya Guo , Duyu Tang , Nan Duan , Fuqing Zhu , Ming Gong , Linjun Shou , Ryan Ma , Daxin Jiang , Guihong Cao , Ming Zhou , Songlin Hu

Pretraining DNA language models (DNALMs) on the full human genome is resource-intensive, yet often considered necessary for strong downstream performance. Inspired by recent findings in NLP and long-context modeling, we explore an…

Genomics · Quantitative Biology 2025-06-24 Sohan Mupparapu , Parameswari Krishnamurthy , Ratish Puduppully

Compared to supervised deep learning, self-supervision provides remote sensing a tool to reduce the amount of exact, human-crafted geospatial annotations. While image-level information for unsupervised pretraining efficiently works for…

Computer Vision and Pattern Recognition · Computer Science 2024-06-11 Chenying Liu , Conrad M Albrecht , Yi Wang , Xiao Xiang Zhu

Training deep neural networks from scratch on natural language processing (NLP) tasks requires significant amount of manually labeled text corpus and substantial time to converge, which usually cannot be satisfied by the customers. In this…

Computation and Language · Computer Science 2019-10-29 Yunzhe Tao , Saurabh Gupta , Satyapriya Krishna , Xiong Zhou , Orchid Majumder , Vineet Khare

The unsupervised pretraining of object detectors has recently become a key component of object detector training, as it leads to improved performance and faster convergence during the supervised fine-tuning stage. Existing unsupervised…

Computer Vision and Pattern Recognition · Computer Science 2024-07-09 Ioannis Maniadis Metaxas , Adrian Bulat , Ioannis Patras , Brais Martinez , Georgios Tzimiropoulos

Pre-training is a dominant paradigm in computer vision. For example, supervised ImageNet pre-training is commonly used to initialize the backbones of object detection and segmentation models. He et al., however, show a surprising result…

Computer Vision and Pattern Recognition · Computer Science 2020-11-17 Barret Zoph , Golnaz Ghiasi , Tsung-Yi Lin , Yin Cui , Hanxiao Liu , Ekin D. Cubuk , Quoc V. Le

Self-supervised pretraining has been extensively studied in language and vision domains, where a unified model can be easily adapted to various downstream tasks by pretraining representations without explicit labels. When it comes to…

Machine Learning · Computer Science 2023-01-25 Yanchao Sun , Shuang Ma , Ratnesh Madaan , Rogerio Bonatti , Furong Huang , Ashish Kapoor

Recent advances in deep learning have shown that learning robust feature representations is critical for the success of many computer vision tasks, including medical image segmentation. In particular, both transformer and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-03 David Li , Anvar Kurmukov , Mikhail Goncharov , Roman Sokolov , Mikhail Belyaev

Due to the large memory footprint of untrimmed videos, current state-of-the-art video localization methods operate atop precomputed video clip features. These features are extracted from video encoders typically trained for trimmed action…

Computer Vision and Pattern Recognition · Computer Science 2021-08-18 Humam Alwassel , Silvio Giancola , Bernard Ghanem

In the segmentation of remotely sensed images, deep learning models are typically pre-trained using large image databases like ImageNet before fine-tuned on domain-specific datasets. However, the performance of these fine-tuned models is…

Computer Vision and Pattern Recognition · Computer Science 2026-05-01 Yuan Fang , Yuanzhi Cai , Jagannath Aryal , Qinfeng Zhu , Hong Huang , Cheng Zhang , Lei Fan
‹ Prev 1 2 3 10 Next ›