Related papers: Better Aggregation in Test-Time Augmentation

Improved Text Classification via Test-Time Augmentation

Test-time augmentation -- the aggregation of predictions across transformed examples of test inputs -- is an established technique to improve the performance of image classification models. Importantly, TTA can be used to improve model…

Machine Learning · Computer Science 2022-06-29 Helen Lu, Divya Shanmugam, Harini Suresh, John Guttag

Learning Loss for Test-Time Augmentation

Data augmentation has been actively studied for robust neural networks. Most of the recent data augmentation methods focus on augmenting datasets during the training phase. At the testing phase, simple transformations are still widely used…

Computer Vision and Pattern Recognition · Computer Science 2020-10-23 Ildoo Kim, Younghoon Kim, Sungwoong Kim

Test-Time Augmentation Meets Variational Bayes

Data augmentation is known to contribute significantly to the robustness of machine learning models. In most instances, data augmentation is utilized during the training phase. Test-Time Augmentation (TTA) is a technique that instead…

Machine Learning · Statistics 2024-09-20 Masanari Kimura, Howard Bondell

I Can't Believe TTA Is Not Better: When Test-Time Augmentation Hurts Medical Image Classification

Test-time augmentation (TTA)--aggregating predictions over multiple augmented copies of a test input--is widely assumed to improve classification accuracy, particularly in medical imaging where it is routinely deployed in production systems…

Computer Vision and Pattern Recognition · Computer Science 2026-04-14 Daniel Nobrega Medeiros

Rotational augmentation techniques: a new perspective on ensemble learning for image classification

The popularity of data augmentation techniques in machine learning has increased in recent years, as they enable the creation of new samples from existing datasets. Rotational augmentation, in particular, has shown great promise by…

Computer Vision and Pattern Recognition · Computer Science 2023-06-13 Unai Muñoz-Aseguinolaza, Basilio Sierra, Naiara Aginako

Test-time augmentation improves efficiency in conformal prediction

A conformal classifier produces a set of predicted classes and provides a probabilistic guarantee that the set includes the true class. Unfortunately, it is often the case that conformal classifiers produce uninformatively large sets. In…

Machine Learning · Computer Science 2025-05-30 Divya Shanmugam, Helen Lu, Swami Sankaranarayanan, John Guttag

Understanding Test-Time Augmentation

Test-Time Augmentation (TTA) is a very powerful heuristic that takes advantage of data augmentation during testing to produce averaged output. Despite the experimental effectiveness of TTA, there is insufficient discussion of its…

Machine Learning · Computer Science 2024-02-13 Masanari Kimura

BayTTA: Uncertainty-aware medical image classification with optimized test-time augmentation using Bayesian model averaging

Test-time augmentation (TTA) is a well-known technique employed during the testing phase of computer vision tasks. It involves aggregating multiple augmented versions of input data. Combining predictions using a simple average formulation…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Zeinab Sherkatghanad, Moloud Abdar, Mohammadreza Bakhtyari, Pawel Plawiak, Vladimir Makarenkov

Intelligent Multi-View Test Time Augmentation

In this study, we introduce an intelligent Test Time Augmentation (TTA) algorithm designed to enhance the robustness and accuracy of image classification models against viewpoint variations. Unlike traditional TTA methods that…

Image and Video Processing · Electrical Eng. & Systems 2024-06-14 Efe Ozturk, Mohit Prabhushankar, Ghassan AlRegib

Test-Time Training with Self-Supervision for Generalization under Distribution Shifts

In this paper, we propose Test-Time Training, a general approach for improving the performance of predictive models when training and test data come from different distributions. We turn a single unlabeled test sample into a self-supervised…

Machine Learning · Computer Science 2020-07-03 Yu Sun, Xiaolong Wang, Zhuang Liu, John Miller, Alexei A. Efros, Moritz Hardt

Test-time Adaptation vs. Training-time Generalization: A Case Study in Human Instance Segmentation using Keypoints Estimation

We consider the problem of improving the human instance segmentation mask quality for a given test image using keypoints estimation. We compare two alternative approaches. The first approach is a test-time adaptation (TTA) method, where we…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Kambiz Azarian, Debasmit Das, Hyojin Park, Fatih Porikli

Mean Embeddings with Test-Time Data Augmentation for Ensembling of Representations

Averaging predictions over a set of models -- an ensemble -- is widely used to improve predictive performance and uncertainty estimation of deep learning models. At the same time, many machine learning systems, such as search, matching, and…

Machine Learning · Computer Science 2021-07-15 Arsenii Ashukha, Andrei Atanov, Dmitry Vetrov

Test-time Augmentation for Factual Probing

Factual probing is a method that uses prompts to test if a language model "knows" certain world knowledge facts. A problem in factual probing is that small changes to the prompt can lead to large changes in model output. Previous work aimed…

Computation and Language · Computer Science 2023-10-27 Go Kamoda, Benjamin Heinzerling, Keisuke Sakaguchi, Kentaro Inui

Diverse Ensembles Improve Calibration

Modern deep neural networks can produce badly calibrated predictions, especially when train and test distributions are mismatched. Training an ensemble of models and averaging their predictions can help alleviate these issues. We propose a…

Machine Learning · Computer Science 2020-07-09 Asa Cooper Stickland, Iain Murray

Aleatoric uncertainty estimation with test-time augmentation for medical image segmentation with convolutional neural networks

Despite the state-of-the-art performance for medical image segmentation, deep convolutional neural networks (CNNs) have rarely provided uncertainty estimations regarding their segmentation outputs, e.g., model (epistemic) and image-based…

Computer Vision and Pattern Recognition · Computer Science 2019-02-12 Guotai Wang, Wenqi Li, Michael Aertsen, Jan Deprest, Sebastien Ourselin, Tom Vercauteren

Data Augmentation by Pairing Samples for Images Classification

Data augmentation is a widely used technique in many machine learning tasks, such as image classification, to virtually enlarge the training dataset size and avoid overfitting. Traditional data augmentation techniques for image…

Machine Learning · Computer Science 2018-04-12 Hiroshi Inoue

Adaptive Weighting Scheme for Automatic Time-Series Data Augmentation

Data augmentation methods have been shown to be a fundamental technique to improve generalization in tasks such as image, text and audio classification. Recently, automated augmentation methods have led to further improvements on image…

Machine Learning · Computer Science 2021-02-17 Elizabeth Fons, Paula Dawson, Xiao-jun Zeng, John Keane, Alexandros Iosifidis

Neural network ensembles: Evaluation of aggregation algorithms

Ensembles of artificial neural networks show improved generalization capabilities that outperform those of single networks. However, for aggregation to be effective, the individual networks must be as accurate and diverse as possible. An…

Artificial Intelligence · Computer Science 2007-05-23 P. M. Granitto, P. F. Verdes, H. A. Ceccatto

Combining Ensembles and Data Augmentation can Harm your Calibration

Ensemble methods which average over multiple neural network predictions are a simple approach to improve a model's calibration and robustness. Similarly, data augmentation techniques, which encode prior information in the form of invariant…

Machine Learning · Computer Science 2021-03-24 Yeming Wen, Ghassen Jerfel, Rafael Muller, Michael W. Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Aggregating multiple test results to improve medical decision-making

Gathering observational data for medical decision-making often involves uncertainties arising from both type I (false positive) and type II (false negative) errors. In this work, we develop a statistical model to study how medical…

Applications · Statistics 2025-10-21 Lucas Böttcher, Maria R. D'Orsogna, Tom Chou