Related papers: ETran: Energy-Based Transferability Estimation

An Information-Theoretic Approach to Transferability in Task Transfer Learning

Task transfer learning is a popular technique in image processing applications that uses pre-trained models to reduce the supervision cost of related tasks. An important question is to determine task transferability, i.e. given a common…

Machine Learning · Computer Science 2022-12-21 Yajie Bao , Yang Li , Shao-Lun Huang , Lin Zhang , Lizhong Zheng , Amir Zamir , Leonidas Guibas

Plausibility Verification For 3D Object Detectors Using Energy-Based Optimization

Environmental perception obtained via object detectors have no predictable safety layer encoded into their model schema, which creates the question of trustworthiness about the system's prediction. As can be seen from recent adversarial…

Computer Vision and Pattern Recognition · Computer Science 2022-11-11 Abhishek Vivekanandan , Niels Maier , J. Marius Zoellner

PAS: Estimating the target accuracy before domain adaptation

The goal of domain adaptation is to make predictions for unlabeled samples from a target domain with the help of labeled samples from a different but related source domain. The performance of domain adaptation methods is highly influenced…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Raphaella Diniz , Jackson de Faria , Martin Ester

Benchmarking Detection Transfer Learning with Vision Transformers

Object detection is a central downstream task used to test if pre-trained network parameters confer benefits, such as improved accuracy or training speed. The complexity of object detection methods can make this benchmarking non-trivial…

Computer Vision and Pattern Recognition · Computer Science 2021-11-23 Yanghao Li , Saining Xie , Xinlei Chen , Piotr Dollar , Kaiming He , Ross Girshick

ESOD:Edge-based Task Scheduling for Object Detection

Object Detection on the mobile system is a challenge in terms of everything. Nowadays, many object detection models have been designed, and most of them concentrate on precision. However, the computation burden of those models on mobile…

Computer Vision and Pattern Recognition · Computer Science 2021-10-25 Yihao Wang , Ling Gao , Jie Ren , Rui Cao , Hai Wang , Jie Zheng , Quanli Gao

Reinforcement Learning for Improving Object Detection

The performance of a trained object detection neural network depends a lot on the image quality. Generally, images are pre-processed before feeding them into the neural network and domain knowledge about the image dataset is used to choose…

Computer Vision and Pattern Recognition · Computer Science 2020-08-19 Siddharth Nayak , Balaraman Ravindran

What makes instance discrimination good for transfer learning?

Contrastive visual pretraining based on the instance discrimination pretext task has made significant progress. Notably, recent work on unsupervised pretraining has shown to surpass the supervised counterpart for finetuning downstream…

Computer Vision and Pattern Recognition · Computer Science 2021-01-20 Nanxuan Zhao , Zhirong Wu , Rynson W. H. Lau , Stephen Lin

Where is the Model Looking At?--Concentrate and Explain the Network Attention

Image classification models have achieved satisfactory performance on many datasets, sometimes even better than human. However, The model attention is unclear since the lack of interpretability. This paper investigates the fidelity and…

Computer Vision and Pattern Recognition · Computer Science 2020-09-30 Wenjia Xu , Jiuniu Wang , Yang Wang , Guangluan Xu , Wei Dai , Yirong Wu

Pre-Training Transformers as Energy-Based Cloze Models

We introduce Electric, an energy-based cloze model for representation learning over text. Like BERT, it is a conditional generative model of tokens given their contexts. However, Electric does not use masking or output a full distribution…

Computation and Language · Computer Science 2020-12-17 Kevin Clark , Minh-Thang Luong , Quoc V. Le , Christopher D. Manning

Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space

This paper addresses an important problem of ranking the pre-trained deep neural networks and screening the most transferable ones for downstream tasks. It is challenging because the ground-truth model ranking for each task can only be…

Machine Learning · Computer Science 2022-07-20 Wenqi Shao , Xun Zhao , Yixiao Ge , Zhaoyang Zhang , Lei Yang , Xiaogang Wang , Ying Shan , Ping Luo

Occam's model: Selecting simpler representations for better transferability estimation

Fine-tuning models that have been pre-trained on large datasets has become a cornerstone of modern machine learning workflows. With the widespread availability of online model repositories, such as Hugging Face, it is now easier than ever…

Machine Learning · Computer Science 2025-02-12 Prabhant Singh , Sibylle Hess , Joaquin Vanschoren

Online Anchor-based Training for Image Classification Tasks

In this paper, we aim to improve the performance of a deep learning model towards image classification tasks, proposing a novel anchor-based training methodology, named \textit{Online Anchor-based Training} (OAT). The OAT method, guided by…

Computer Vision and Pattern Recognition · Computer Science 2024-06-19 Maria Tzelepi , Vasileios Mezaris

Transfer of Pretrained Model Weights Substantially Improves Semi-Supervised Image Classification

Deep neural networks produce state-of-the-art results when trained on a large number of labeled examples but tend to overfit when small amounts of labeled examples are used for training. Creating a large number of labeled examples requires…

Computer Vision and Pattern Recognition · Computer Science 2021-09-13 Attaullah Sahito , Eibe Frank , Bernhard Pfahringer

Transferability Estimation using Bhattacharyya Class Separability

Transfer learning has become a popular method for leveraging pre-trained models in computer vision. However, without performing computationally expensive fine-tuning, it is difficult to quantify which pre-trained source models are suitable…

Computer Vision and Pattern Recognition · Computer Science 2022-04-12 Michal Pándy , Andrea Agostinelli , Jasper Uijlings , Vittorio Ferrari , Thomas Mensink

Understanding new tasks through the lens of training data via exponential tilting

Deploying machine learning models to new tasks is a major challenge despite the large size of the modern training datasets. However, it is conceivable that the training data can be reweighted to be more representative of the new (target)…

Machine Learning · Computer Science 2023-02-22 Subha Maity , Mikhail Yurochkin , Moulinath Banerjee , Yuekai Sun

Understanding the Transferability of Representations via Task-Relatedness

The growing popularity of transfer learning, due to the availability of models pre-trained on vast amounts of data, makes it imperative to understand when the knowledge of these pre-trained models can be transferred to obtain…

Machine Learning · Computer Science 2024-10-30 Akshay Mehra , Yunbei Zhang , Jihun Hamm

SPOT: Scalable 3D Pre-training via Occupancy Prediction for Learning Transferable 3D Representations

Annotating 3D LiDAR point clouds for perception tasks is fundamental for many applications e.g., autonomous driving, yet it still remains notoriously labor-intensive. Pretraining-finetuning approach can alleviate the labeling burden by…

Computer Vision and Pattern Recognition · Computer Science 2025-07-17 Xiangchao Yan , Runjian Chen , Bo Zhang , Hancheng Ye , Renqiu Xia , Jiakang Yuan , Hongbin Zhou , Xinyu Cai , Botian Shi , Wenqi Shao , Ping Luo , Yu Qiao , Tao Chen , Junchi Yan

Estimating and Evaluating Regression Predictive Uncertainty in Deep Object Detectors

Predictive uncertainty estimation is an essential next step for the reliable deployment of deep object detectors in safety-critical tasks. In this work, we focus on estimating predictive distributions for bounding box regression output with…

Computer Vision and Pattern Recognition · Computer Science 2021-03-15 Ali Harakeh , Steven L. Waslander

MTTrans: Cross-Domain Object Detection with Mean-Teacher Transformer

Recently, DEtection TRansformer (DETR), an end-to-end object detection pipeline, has achieved promising performance. However, it requires large-scale labeled data and suffers from domain shift, especially when no labeled data is available…

Computer Vision and Pattern Recognition · Computer Science 2022-08-17 Jinze Yu , Jiaming Liu , Xiaobao Wei , Haoyi Zhou , Yohei Nakata , Denis Gudovskiy , Tomoyuki Okuno , Jianxin Li , Kurt Keutzer , Shanghang Zhang

Shedding More Light on Robust Classifiers under the lens of Energy-based Models

By reinterpreting a robust discriminative classifier as Energy-based Model (EBM), we offer a new take on the dynamics of adversarial training (AT). Our analysis of the energy landscape during AT reveals that untargeted attacks generate…

Computer Vision and Pattern Recognition · Computer Science 2024-09-11 Mujtaba Hussain Mirza , Maria Rosaria Briglia , Senad Beadini , Iacopo Masi