Related papers: Going deeper with Image Transformers

Boosting vision transformers for image retrieval

Vision transformers have achieved remarkable progress in vision tasks such as image classification and detection. However, in instance-level image retrieval, transformers have not yet shown good performance compared to convolutional…

Computer Vision and Pattern Recognition · Computer Science 2022-10-24 Chull Hwan Song , Jooyoung Yoon , Shunghyun Choi , Yannis Avrithis

A Comprehensive Study of Vision Transformers in Image Classification Tasks

Image Classification is a fundamental task in the field of computer vision that frequently serves as a benchmark for gauging advancements in Computer Vision. Over the past few years, significant progress has been made in image…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Mahmoud Khalil , Ahmad Khalil , Alioune Ngom

Deep transfer learning for image classification: a survey

Deep neural networks such as convolutional neural networks (CNNs) and transformers have achieved many successes in image classification in recent years. It has been consistently demonstrated that best practice for image classification is…

Computer Vision and Pattern Recognition · Computer Science 2025-12-09 Jo Plested , Musa Phiri , Tom Gedeon

Towards Efficient and Data Agnostic Image Classification Training Pipeline for Embedded Systems

Nowadays deep learning-based methods have achieved a remarkable progress at the image classification task among a wide range of commonly used datasets (ImageNet, CIFAR, SVHN, Caltech 101, SUN397, etc.). SOTA performance on each of the…

Computer Vision and Pattern Recognition · Computer Science 2021-08-17 Kirill Prokofiev , Vladislav Sovrasov

A Transformer-in-Transformer Network Utilizing Knowledge Distillation for Image Recognition

This paper presents a novel knowledge distillation neural architecture leveraging efficient transformer networks for effective image classification. Natural images display intricate arrangements encompassing numerous extraneous elements.…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Dewan Tauhid Rahman , Yeahia Sarker , Antar Mazumder , Md. Shamim Anower

Transfer Learning for Illustration Classification

The field of image classification has shown an outstanding success thanks to the development of deep learning techniques. Despite the great performance obtained, most of the work has focused on natural images ignoring other domains like…

Computer Vision and Pattern Recognition · Computer Science 2019-02-07 Manuel Lagunas , Elena Garces

Do Better ImageNet Models Transfer Better?

Transfer learning is a cornerstone of computer vision, yet little work has been done to evaluate the relationship between architecture and transfer. An implicit hypothesis in modern computer vision research is that models that perform…

Computer Vision and Pattern Recognition · Computer Science 2019-06-18 Simon Kornblith , Jonathon Shlens , Quoc V. Le

Some Improvements on Deep Convolutional Neural Network Based Image Classification

We investigate multiple techniques to improve upon the current state of the art deep convolutional neural network based image classification pipeline. The techiques include adding more image transformations to training data, adding more…

Computer Vision and Pattern Recognition · Computer Science 2013-12-20 Andrew G. Howard

Tensor-to-Image: Image-to-Image Translation with Vision Transformers

Transformers gain huge attention since they are first introduced and have a wide range of applications. Transformers start to take over all areas of deep learning and the Vision transformers paper also proved that they can be used for…

Computer Vision and Pattern Recognition · Computer Science 2021-10-18 Yiğit Gündüç

Transformers For Recognition In Overhead Imagery: A Reality Check

There is evidence that transformers offer state-of-the-art recognition performance on tasks involving overhead imagery (e.g., satellite imagery). However, it is difficult to make unbiased empirical comparisons between competing deep…

Computer Vision and Pattern Recognition · Computer Science 2022-11-02 Francesco Luzi , Aneesh Gupta , Leslie Collins , Kyle Bradbury , Jordan Malof

ImageNet-Hard: The Hardest Images Remaining from a Study of the Power of Zoom and Spatial Biases in Image Classification

Image classifiers are information-discarding machines, by design. Yet, how these models discard information remains mysterious. We hypothesize that one way for image classifiers to reach high accuracy is to first zoom to the most…

Computer Vision and Pattern Recognition · Computer Science 2023-10-10 Mohammad Reza Taesiri , Giang Nguyen , Sarra Habchi , Cor-Paul Bezemer , Anh Nguyen

Deeper, Sharper, Faster: Application of Efficient Transformer to Galaxy Image Restoration

The Transformer architecture has revolutionized the field of deep learning over the past several years in diverse areas, including natural language processing, code generation, image recognition, time series forecasting, etc. We propose to…

Instrumentation and Methods for Astrophysics · Physics 2024-05-30 Hyosun Park , Yongsik Jo , Seokun Kang , Taehwan Kim , M. James Jee

Semi-Supervised Vision Transformers

We study the training of Vision Transformers for semi-supervised image classification. Transformers have recently demonstrated impressive performance on a multitude of supervised learning tasks. Surprisingly, we show Vision Transformers…

Computer Vision and Pattern Recognition · Computer Science 2022-07-19 Zejia Weng , Xitong Yang , Ang Li , Zuxuan Wu , Yu-Gang Jiang

Exploring Vision Transformers for Fine-grained Classification

Existing computer vision research in categorization struggles with fine-grained attributes recognition due to the inherently high intra-class variances and low inter-class variances. SOTA methods tackle this challenge by locating the most…

Computer Vision and Pattern Recognition · Computer Science 2021-07-01 Marcos V. Conde , Kerem Turgutlu

Pre-Trained Image Processing Transformer

As the computing power of modern hardware is increasing strongly, pre-trained deep learning models (e.g., BERT, GPT-3) learned on large-scale datasets have shown their effectiveness over conventional methods. The big progress is mainly…

Computer Vision and Pattern Recognition · Computer Science 2021-11-09 Hanting Chen , Yunhe Wang , Tianyu Guo , Chang Xu , Yiping Deng , Zhenhua Liu , Siwei Ma , Chunjing Xu , Chao Xu , Wen Gao

Deep Image Composition Meets Image Forgery

Image forgery is a topic that has been studied for many years. Before the breakthrough of deep learning, forged images were detected using handcrafted features that did not require training. These traditional methods failed to perform…

Computer Vision and Pattern Recognition · Computer Science 2024-04-29 Eren Tahir , Mert Bal

Understanding Transformer-based Vision Models through Inversion

Understanding the mechanisms underlying deep neural networks remains a fundamental challenge in machine learning and computer vision. One promising, yet only preliminarily explored approach, is feature inversion, which attempts to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-15 Jan Rathjens , Shirin Reyhanian , David Kappel , Laurenz Wiskott

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

While the Transformer architecture has become the de-facto standard for natural language processing tasks, its applications to computer vision remain limited. In vision, attention is either applied in conjunction with convolutional…

Computer Vision and Pattern Recognition · Computer Science 2021-06-04 Alexey Dosovitskiy , Lucas Beyer , Alexander Kolesnikov , Dirk Weissenborn , Xiaohua Zhai , Thomas Unterthiner , Mostafa Dehghani , Matthias Minderer , Georg Heigold , Sylvain Gelly , Jakob Uszkoreit , Neil Houlsby

TransBoost: Improving the Best ImageNet Performance using Deep Transduction

This paper deals with deep transductive learning, and proposes TransBoost as a procedure for fine-tuning any deep neural model to improve its performance on any (unlabeled) test set provided at training time. TransBoost is inspired by a…

Computer Vision and Pattern Recognition · Computer Science 2023-01-18 Omer Belhasin , Guy Bar-Shalom , Ran El-Yaniv

Restormer: Efficient Transformer for High-Resolution Image Restoration

Since convolutional neural networks (CNNs) perform well at learning generalizable image priors from large-scale data, these models have been extensively applied to image restoration and related tasks. Recently, another class of neural…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Syed Waqas Zamir , Aditya Arora , Salman Khan , Munawar Hayat , Fahad Shahbaz Khan , Ming-Hsuan Yang