Related papers: Learning Representational Invariances for Data-Eff…

Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

We address the problem of data augmentation for video action recognition. Standard augmentation strategies in video are hand-designed and sample the space of possible augmented data points either at random, without knowing which augmented…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Shreyank N Gowda , Marcus Rohrbach , Frank Keller , Laura Sevilla-Lara

Extending Temporal Data Augmentation for Video Action Recognition

Pixel space augmentation has grown in popularity in many Deep Learning areas, due to its effectiveness, simplicity, and low computational cost. Data augmentation for videos, however, still remains an under-explored research topic, as most…

Computer Vision and Pattern Recognition · Computer Science 2022-11-10 Artjoms Gorpincenko , Michal Mackiewicz

Exploring Temporally Dynamic Data Augmentation for Video Recognition

Data augmentation has recently emerged as an essential component of modern training recipes for visual recognition tasks. However, data augmentation for video recognition has been rarely explored despite its effectiveness. Few existing…

Computer Vision and Pattern Recognition · Computer Science 2022-07-01 Taeoh Kim , Jinhyung Kim , Minho Shim , Sangdoo Yun , Myunggu Kang , Dongyoon Wee , Sangyoun Lee

Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition

Deep-Learning-based video recognition has shown promising improvements along with the development of large-scale datasets and spatiotemporal network architectures. In image recognition, learning spatially invariant features is a key factor…

Computer Vision and Pattern Recognition · Computer Science 2020-08-14 Taeoh Kim , Hyeongmin Lee , MyeongAh Cho , Ho Seong Lee , Dong Heon Cho , Sangyoun Lee

Exploring Data Augmentations on Self-/Semi-/Fully- Supervised Pre-trained Models

Data augmentation has become a standard component of vision pre-trained models to capture the invariance between augmented views. In practice, augmentation techniques that mask regions of a sample with zero/mean values or patches from other…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Shentong Mo , Zhun Sun , Chao Li

Visual Data Augmentation through Learning

The rapid progress in machine learning methods has been empowered by i) huge datasets that have been collected and annotated, ii) improved engineering (e.g. data pre-processing/normalization). The existing datasets typically include several…

Computer Vision and Pattern Recognition · Computer Science 2018-01-23 Grigorios G. Chrysos , Yannis Panagakis , Stefanos Zafeiriou

Augmentation Invariant Manifold Learning

Data augmentation is a widely used technique and an essential ingredient in the recent advance in self-supervised representation learning. By preserving the similarity between augmented data, the resulting data representation can improve…

Machine Learning · Statistics 2025-01-16 Shulei Wang

Data Augmentation for Manipulation

The success of deep learning depends heavily on the availability of large datasets, but in robotic manipulation there are many learning problems for which such datasets do not exist. Collecting these datasets is time-consuming and…

Robotics · Computer Science 2022-07-21 Peter Mitrano , Dmitry Berenson

Generative Hints

Data augmentation is widely used in vision to introduce variation and mitigate overfitting, by enabling models to learn invariant properties. However, augmentation only indirectly captures these properties and does not explicitly constrain…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Andy Dimnaku , Abdullah Yusuf Kavranoglu , Yaser Abu-Mostafa

Image Data Augmentation for Deep Learning: A Survey

Deep learning has achieved remarkable results in many computer vision tasks. Deep neural networks typically rely on large amounts of training data to avoid overfitting. However, labeled data for real-world applications may be limited. By…

Computer Vision and Pattern Recognition · Computer Science 2023-11-07 Suorong Yang , Weikang Xiao , Mengchen Zhang , Suhan Guo , Jian Zhao , Furao Shen

VideoMix: Rethinking Data Augmentation for Video Classification

State-of-the-art video action classifiers often suffer from overfitting. They tend to be biased towards specific objects and scene cues, rather than the foreground action content, leading to sub-optimal generalization performances. Recent…

Computer Vision and Pattern Recognition · Computer Science 2020-12-08 Sangdoo Yun , Seong Joon Oh , Byeongho Heo , Dongyoon Han , Jinhyung Kim

Augmenting Imitation Experience via Equivariant Representations

The robustness of visual navigation policies trained through imitation often hinges on the augmentation of the training image-action pairs. Traditionally, this has been done by collecting data from multiple cameras, by using standard data…

Computer Vision and Pattern Recognition · Computer Science 2021-10-18 Dhruv Sharma , Alihusein Kuwajerwala , Florian Shkurti

Steerable Equivariant Representation Learning

Pre-trained deep image representations are useful for post-training tasks such as classification through transfer learning, image retrieval, and object detection. Data augmentations are a crucial aspect of pre-training robust…

Computer Vision and Pattern Recognition · Computer Science 2023-02-23 Sangnie Bhardwaj , Willie McClinton , Tongzhou Wang , Guillaume Lajoie , Chen Sun , Phillip Isola , Dilip Krishnan

Data-Efficient Augmentation for Training Neural Networks

Data augmentation is essential to achieve state-of-the-art performance in many deep learning applications. However, the most effective augmentation techniques become computationally prohibitive for even medium-sized datasets. To address…

Machine Learning · Computer Science 2023-07-21 Tian Yu Liu , Baharan Mirzasoleiman

Watching the World Go By: Representation Learning from Unlabeled Videos

Recent single image unsupervised representation learning techniques show remarkable success on a variety of tasks. The basic principle in these works is instance discrimination: learning to differentiate between two augmented versions of…

Computer Vision and Pattern Recognition · Computer Science 2020-05-08 Daniel Gordon , Kiana Ehsani , Dieter Fox , Ali Farhadi

Albumentations: fast and flexible image augmentations

Data augmentation is a commonly used technique for increasing both the size and the diversity of labeled training sets by leveraging input transformations that preserve output labels. In computer vision domain, image augmentations have…

Computer Vision and Pattern Recognition · Computer Science 2020-02-27 Alexander Buslaev , Alex Parinov , Eugene Khvedchenya , Vladimir I. Iglovikov , Alexandr A. Kalinin

Efficient Augmentation via Data Subsampling

Data augmentation is commonly used to encode invariances in learning methods. However, this process is often performed in an inefficient manner, as artificial examples are created by applying a number of transformations to all points in the…

Machine Learning · Computer Science 2019-03-04 Michael Kuchnik , Virginia Smith

Variable-Speed Teaching-Playback as Real-World Data Augmentation for Imitation Learning

Because imitation learning relies on human demonstrations in hard-to-simulate settings, the inclusion of force control in this method has resulted in a shortage of training data, even with a simple change in speed. Although the field of…

Robotics · Computer Science 2025-05-07 Nozomu Masuya , Hiroshi Sato , Koki Yamane , Takuya Kusume , Sho Sakaino , Toshiaki Tsuji

Effective Data Augmentation With Diffusion Models

Data augmentation is one of the most prevalent tools in deep learning, underpinning many recent advances, including those from classification, generative models, and representation learning. The standard approach to data augmentation…

Computer Vision and Pattern Recognition · Computer Science 2025-06-12 Brandon Trabucco , Kyle Doherty , Max Gurinas , Ruslan Salakhutdinov

Learning robust visual representations using data augmentation invariance

Deep convolutional neural networks trained for image object categorization have shown remarkable similarities with representations found across the primate ventral visual stream. Yet, artificial and biological networks still exhibit…

Computer Vision and Pattern Recognition · Computer Science 2019-06-12 Alex Hernández-García , Peter König , Tim C. Kietzmann