Related papers: ETran: Energy-Based Transferability Estimation

How to Train Your Energy-Based Model for Regression

Energy-based models (EBMs) have become increasingly popular within computer vision in recent years. While they are commonly employed for generative image modeling, recent work has also applied EBMs to regression tasks, achieving…

Computer Vision and Pattern Recognition · Computer Science 2020-08-17 Fredrik K. Gustafsson, Martin Danelljan, Radu Timofte, Thomas B. Schön

How to Determine the Most Powerful Pre-trained Language Model without Brute Force Fine-tuning? An Empirical Survey

Transferability estimation has attracted great attention in the computer vision field. Researchers try to estimate, at low computational cost, the performance of a model when transferred from a source task to a given target task.…

Computation and Language · Computer Science 2023-12-11 Jun Bai, Xiaofeng Zhang, Chen Li, Hanhua Hong, Xi Xu, Chenghua Lin, Wenge Rong

Attributing Learned Concepts in Neural Networks to Training Data

By now there is substantial evidence that deep learning models learn certain human-interpretable features as part of their internal representations of data. As having the right (or wrong) concepts is critical to trustworthy machine learning…

Machine Learning · Computer Science 2023-12-29 Nicholas Konz, Charles Godfrey, Madelyn Shapiro, Jonathan Tu, Henry Kvinge, Davis Brown

RADAR: Relative Angular Divergence Across Representations

Machine learning methods rely on data. However, gathering suitable data can be challenging due to availability constraints, cost, or the need for domain expertise. Expanding datasets with additional sources is a common response to limited…

Machine Learning · Computer Science 2026-05-25 Xavier Cadet, Mateusz Nowak, Peter Chin

Object Detection with Transformers: A Review

The astounding performance of transformers in natural language processing (NLP) has motivated researchers to explore their applications in computer vision tasks. DEtection TRansformer (DETR) introduces transformers to object detection tasks…

Computer Vision and Pattern Recognition · Computer Science 2023-07-13 Tahira Shehzadi, Khurram Azeem Hashmi, Didier Stricker, Muhammad Zeshan Afzal

EvCenterNet: Uncertainty Estimation for Object Detection using Evidential Learning

Uncertainty estimation is crucial in safety-critical settings such as automated driving as it provides valuable information for several downstream tasks including high-level decision making and path planning. In this work, we propose…

Computer Vision and Pattern Recognition · Computer Science 2023-09-29 Monish R. Nallapareddy, Kshitij Sirohi, Paulo L. J. Drews-Jr, Wolfram Burgard, Chih-Hong Cheng, Abhinav Valada

Feature Intertwiner for Object Detection

A well-trained model should classify objects with a unanimous score for every category. This requires that the high-level semantic features be as alike as possible among samples. To achieve this, previous works focus on re-designing…

Computer Vision and Pattern Recognition · Computer Science 2019-03-29 Hongyang Li, Bo Dai, Shaoshuai Shi, Wanli Ouyang, Xiaogang Wang

Contextual object categorization with energy-based model

Object categorization is a hot topic in image mining. Contextual information between objects is an important part of the semantic knowledge in an image. However, previous research on object categorization has not made full use of…

Computer Vision and Pattern Recognition · Computer Science 2016-04-26 Changyong Ri, Duho Pak, Cholryong Choe, Suhyang Kim, Yonghak Sin

Rethinking Transformer-based Set Prediction for Object Detection

DETR is a recently proposed Transformer-based method which views object detection as a set prediction problem and achieves state-of-the-art performance but demands extra-long training time to converge. In this paper, we investigate the…

Computer Vision and Pattern Recognition · Computer Science 2021-10-13 Zhiqing Sun, Shengcao Cao, Yiming Yang, Kris Kitani

Don't Wait, Just Weight: Improving Unsupervised Representations by Learning Goal-Driven Instance Weights

In the absence of large labelled datasets, self-supervised learning techniques can boost performance by learning useful representations from unlabelled data, which is often more readily available. However, there is often a domain shift…

Machine Learning · Computer Science 2020-06-23 Linus Ericsson, Henry Gouk, Timothy M. Hospedales

Resolving Class Imbalance in Object Detection with Weighted Cross Entropy Losses

Object detection is an important task in computer vision that serves many real-world applications such as autonomous driving, surveillance and robotics. Along with the rapid growth of large-scale data, numerous state-of-the-art…

Computer Vision and Pattern Recognition · Computer Science 2020-06-03 Trong Huy Phan, Kazuma Yamamoto

A Framework for Fast Scalable BNN Inference using Googlenet and Transfer Learning

Efficient and accurate object detection in video and image analysis is one of the major beneficiaries of the advancement in computer vision systems with the help of deep learning. With the aid of deep learning, more powerful tools evolved,…

Computer Vision and Pattern Recognition · Computer Science 2021-01-06 Karthik E

Adaptive Deep Learning for Entity Resolution by Risk Analysis

State-of-the-art performance on entity resolution (ER) has been achieved by deep learning. However, deep models are usually trained on large quantities of accurately labeled training data, and cannot be easily tuned towards a target…

Machine Learning · Computer Science 2022-04-12 Zhaoqiang Chen, Qun Chen, Youcef Nafa, Tianyi Duan, Wei Pan, Lijun Zhang, Zhanhuai Li

Exploring Thermal Images for Object Detection in Underexposure Regions for Autonomous Driving

Underexposure regions are vital to construct a complete perception of the surroundings for safe autonomous driving. The availability of thermal cameras has provided an essential alternative to explore regions where other optical sensors lack…

Computer Vision and Pattern Recognition · Computer Science 2021-05-04 Farzeen Munir, Shoaib Azam, Muhammd Aasim Rafique, Ahmad Muqeem Sheri, Moongu Jeon, Witold Pedrycz

ETTrack: Enhanced Temporal Motion Predictor for Multi-Object Tracking

Many Multi-Object Tracking (MOT) approaches exploit motion information to associate all the detected objects across frames. However, many methods that rely on filtering-based algorithms, such as the Kalman Filter, often work well in linear…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Xudong Han, Nobuyuki Oishi, Yueying Tian, Elif Ucurum, Rupert Young, Chris Chatwin, Philip Birch

Contrastive Out-of-Distribution Detection for Pretrained Transformers

Pretrained Transformers achieve remarkable performance when training and test data are from the same distribution. However, in real-world scenarios, the model often faces out-of-distribution (OOD) instances that can cause severe semantic…

Computation and Language · Computer Science 2022-01-24 Wenxuan Zhou, Fangyu Liu, Muhao Chen

Object Detection as Probabilistic Set Prediction

Accurate uncertainty estimates are essential for deploying deep object detectors in safety-critical systems. The development and evaluation of probabilistic object detectors have been hindered by shortcomings in existing performance…

Computer Vision and Pattern Recognition · Computer Science 2022-12-09 Georg Hess, Christoffer Petersson, Lennart Svensson

OutfitTransformer: Learning Outfit Representations for Fashion Recommendation

Learning an effective outfit-level representation is critical for predicting the compatibility of items in an outfit, and retrieving complementary items for a partial outfit. We present a framework, OutfitTransformer, that uses the proposed…

Computer Vision and Pattern Recognition · Computer Science 2022-04-19 Rohan Sarkar, Navaneeth Bodla, Mariya I. Vasileva, Yen-Liang Lin, Anurag Beniwal, Alan Lu, Gerard Medioni

Self-Supervised Pretraining of 3D Features on any Point-Cloud

Pretraining on large labeled datasets is a prerequisite to achieve good performance in many computer vision tasks like 2D object recognition, video classification etc. However, pretraining is not widely used for 3D recognition tasks where…

Computer Vision and Pattern Recognition · Computer Science 2021-01-08 Zaiwei Zhang, Rohit Girdhar, Armand Joulin, Ishan Misra

AttTrack: Online Deep Attention Transfer for Multi-object Tracking

Multi-object tracking (MOT) is a vital component of intelligent video analytics applications such as surveillance and autonomous driving. The time and storage complexity required to execute deep learning models for visual object tracking…

Computer Vision and Pattern Recognition · Computer Science 2022-10-28 Keivan Nalaie, Rong Zheng