Related papers: Fixing the train-test resolution discrepancy: FixE…

Fixing the train-test resolution discrepancy

Data-augmentation is key to the training of neural networks for image classification. This paper first shows that existing augmentations induce a significant discrepancy between the typical size of the objects seen by the classifier at…

Computer Vision and Pattern Recognition · Computer Science 2022-01-21 Hugo Touvron , Andrea Vedaldi , Matthijs Douze , Hervé Jégou

Self-training with Noisy Student improves ImageNet classification

We present Noisy Student Training, a semi-supervised learning approach that works well even when labeled data is abundant. Noisy Student Training achieves 88.4% top-1 accuracy on ImageNet, which is 2.0% better than the state-of-the-art…

Machine Learning · Computer Science 2020-06-22 Qizhe Xie , Minh-Thang Luong , Eduard Hovy , Quoc V. Le

Comparative Performance of Finetuned ImageNet Pre-trained Models for Electronic Component Classification

Electronic component classification and detection are crucial in manufacturing industries, significantly reducing labor costs and promoting technological and industrial development. Pre-trained models, especially those trained on ImageNet,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-01 Yidi Shao , Longfei Zhou , Fangshuo Tang , Xinyi Shi , Dalang Chen , Shengtao Xia

EfficientNetV2: Smaller Models and Faster Training

This paper introduces EfficientNetV2, a new family of convolutional networks that have faster training speed and better parameter efficiency than previous models. To develop this family of models, we use a combination of training-aware…

Computer Vision and Pattern Recognition · Computer Science 2021-06-25 Mingxing Tan , Quoc V. Le

Training EfficientNets at Supercomputer Scale: 83% ImageNet Top-1 Accuracy in One Hour

EfficientNets are a family of state-of-the-art image classification models based on efficiently scaled convolutional neural networks. Currently, EfficientNets can take on the order of days to train; for example, training an EfficientNet-B0…

Machine Learning · Computer Science 2020-11-06 Arissa Wongpanich , Hieu Pham , James Demmel , Mingxing Tan , Quoc Le , Yang You , Sameer Kumar

High-Performance Large-Scale Image Recognition Without Normalization

Batch normalization is a key component of most image classification models, but it has many undesirable properties stemming from its dependence on the batch size and interactions between examples. Although recent work has succeeded in…

Computer Vision and Pattern Recognition · Computer Science 2021-02-12 Andrew Brock , Soham De , Samuel L. Smith , Karen Simonyan

Efficient ResNets: Residual Network Design

ResNets (or Residual Networks) are one of the most commonly used models for image classification tasks. In this project, we design and train a modified ResNet model for CIFAR-10 image classification. In particular, we aimed at maximizing…

Computer Vision and Pattern Recognition · Computer Science 2023-06-22 Aditya Thakur , Harish Chauhan , Nikunj Gupta

Do Better ImageNet Models Transfer Better?

Transfer learning is a cornerstone of computer vision, yet little work has been done to evaluate the relationship between architecture and transfer. An implicit hypothesis in modern computer vision research is that models that perform…

Computer Vision and Pattern Recognition · Computer Science 2019-06-18 Simon Kornblith , Jonathon Shlens , Quoc V. Le

Study of Residual Networks for Image Recognition

Deep neural networks demonstrate to have a high performance on image classification tasks while being more difficult to train. Due to the complexity and vanishing gradient problem, it normally takes a lot of time and more computational…

Computer Vision and Pattern Recognition · Computer Science 2018-05-02 Mohammad Sadegh Ebrahimi , Hossein Karkeh Abadi

Automated Cleanup of the ImageNet Dataset by Model Consensus, Explainability and Confident Learning

The convolutional neural networks (CNNs) trained on ILSVRC12 ImageNet were the backbone of various applications as a generic classifier, a feature extractor or a base model for transfer learning. This paper describes automated heuristics…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Csaba Kertész

Semi-Supervised Noisy Student Pre-training on EfficientNet Architectures for Plant Pathology Classification

In recent years, deep learning has vastly improved the identification and diagnosis of various diseases in plants. In this report, we investigate the problem of pathology classification using images of a single leaf. We explore the use of…

Computer Vision and Pattern Recognition · Computer Science 2020-12-02 Sedrick Scott Keh

Do better ImageNet classifiers assess perceptual similarity better?

Perceptual distances between images, as measured in the space of pre-trained deep features, have outperformed prior low-level, pixel-based metrics on assessing perceptual similarity. While the capabilities of older and less accurate models…

Computer Vision and Pattern Recognition · Computer Science 2022-11-01 Manoj Kumar , Neil Houlsby , Nal Kalchbrenner , Ekin D. Cubuk

Establishing a stronger baseline for lightweight contrastive models

Recent research has reported a performance degradation in self-supervised contrastive learning for specially designed efficient networks, such as MobileNet and EfficientNet. A common practice to address this problem is to introduce a…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Wenye Lin , Yifeng Ding , Zhixiong Cao , Hai-tao Zheng

Efficient Conditional Pre-training for Transfer Learning

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning on the target dataset. This strategy helps reduce dependence on the target dataset and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shuvam Chakraborty , Burak Uzkent , Kumar Ayush , Kumar Tanmay , Evan Sheehan , Stefano Ermon

Resolution Switchable Networks for Runtime Efficient Image Recognition

We propose a general method to train a single convolutional neural network which is capable of switching image resolutions at inference. Thus the running speed can be selected to meet various computational resource limits. Networks trained…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Yikai Wang , Fuchun Sun , Duo Li , Anbang Yao

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper, we systematically study model scaling and identify that…

Machine Learning · Computer Science 2020-09-14 Mingxing Tan , Quoc V. Le

Revisiting ResNets: Improved Training and Scaling Strategies

Novel computer vision architectures monopolize the spotlight, but the impact of the model architecture is often conflated with simultaneous changes to training methodology and scaling strategies. Our work revisits the canonical ResNet (He…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Irwan Bello , William Fedus , Xianzhi Du , Ekin D. Cubuk , Aravind Srinivas , Tsung-Yi Lin , Jonathon Shlens , Barret Zoph

Toward Errorless Training ImageNet-1k

In this paper, we describe a feedforward artificial neural network trained on the ImageNet 2012 contest dataset [7] with the new method of [5] to an accuracy rate of 98.3% with a 99.69 Top-1 rate, and an average of 285.9 labels that are…

Computer Vision and Pattern Recognition · Computer Science 2025-08-22 Bo Deng , Levi Heath

Compounding the Performance Improvements of Assembled Techniques in a Convolutional Neural Network

Recent studies in image classification have demonstrated a variety of techniques for improving the performance of Convolutional Neural Networks (CNNs). However, attempts to combine existing techniques to create a practical model are still…

Computer Vision and Pattern Recognition · Computer Science 2020-03-16 Jungkyu Lee , Taeryun Won , Tae Kwan Lee , Hyemin Lee , Geonmo Gu , Kiho Hong

Comparative Evaluation of Deep Learning Models for Fake Image Detection

The growing sophistication of GAN-based image manipulation presents significant challenges for digital forensics. This study compares the performance of four pretrained CNN architectures including VGG16, ResNet50, EfficientNetB0, and…

Computer Vision and Pattern Recognition · Computer Science 2026-05-21 Akhitha Pakala , Mohammed Mahir Rahman , Shahzad Memon , Tauseef Ahmed