Related papers: SapAugment: Learning A Sample Adaptive Policy for …

Sample adaptive data augmentation with progressive scheduling

Data augmentation is a widely adopted technique utilized to improve the robustness of automatic speech recognition (ASR). Employing a fixed data augmentation strategy for all training data is a common practice. However, it is important to…

Sound · Computer Science 2024-12-03 Hongxuan Lu, Biao Li

MetaAugment: Sample-Aware Data Augmentation Policy Learning

Automated data augmentation has shown superior performance in image recognition. Existing works search for dataset-level augmentation policies without considering individual sample variations, which are likely to be sub-optimal. On the…

Machine Learning · Computer Science 2020-12-23 Fengwei Zhou, Jiawei Li, Chuanlong Xie, Fei Chen, Lanqing Hong, Rui Sun, Zhenguo Li

Efficient Augmentation via Data Subsampling

Data augmentation is commonly used to encode invariances in learning methods. However, this process is often performed in an inefficient manner, as artificial examples are created by applying a number of transformations to all points in the…

Machine Learning · Computer Science 2019-03-04 Michael Kuchnik, Virginia Smith

A Study On Data Augmentation In Voice Anti-Spoofing

In this paper, we perform an in-depth study of how data augmentation techniques improve synthetic or spoofed audio detection. Specifically, we propose methods to deal with channel variability, different audio compressions, different…

Sound · Computer Science 2021-10-22 Ariel Cohen, Inbal Rimon, Eran Aflalo, Haim Permuter

A Policy-based Approach to the SpecAugment Method for Low Resource E2E ASR

SpecAugment is a very effective data augmentation method for both HMM and E2E-based automatic speech recognition (ASR) systems. Especially, it also works in low-resource scenarios. However, SpecAugment masks the spectrum of time or the…

Sound · Computer Science 2022-10-18 Rui Li, Guodong Ma, Dexin Zhao, Ranran Zeng, Xiaoyu Li, Hao Huang

Automatic Data Augmentation Selection and Parametrization in Contrastive Self-Supervised Speech Representation Learning

Contrastive learning enables learning useful audio and speech representations without ground-truth labels by maximizing the similarity between latent representations of similar signal segments. In this framework various data augmentation…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-11 Salah Zaiem, Titouan Parcollet, Slim Essid

Population Based Training for Data Augmentation and Regularization in Speech Recognition

Varying data augmentation policies and regularization over the course of optimization has led to performance improvements over using fixed values. We show that population based training is a useful tool to continuously search those…

Computation and Language · Computer Science 2020-10-09 Daniel Haziza, Jérémy Rapin, Gabriel Synnaeve

Smart(Sampling)Augment: Optimal and Efficient Data Augmentation for Semantic Segmentation

Data augmentation methods enrich datasets with augmented data to improve the performance of neural networks. Recently, automated data augmentation methods have emerged, which automatically design augmentation strategies. Existing work…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Misgana Negassi, Diane Wagner, Alexander Reiterer

Data Augmentation via Structured Adversarial Perturbations

Data augmentation is a major component of many machine learning methods with state-of-the-art performance. Common augmentation strategies work by drawing random samples from a space of transformations. Unfortunately, such sampling…

Machine Learning · Computer Science 2020-11-06 Calvin Luo, Hossein Mobahi, Samy Bengio

Automatically Learning Data Augmentation Policies for Dialogue Tasks

Automatic data augmentation (AutoAugment) (Cubuk et al., 2019) searches for optimal perturbation policies via a controller trained using performance rewards of a sampled policy on the target task, hence reducing data-level model bias. While…

Computation and Language · Computer Science 2019-10-01 Tong Niu, Mohit Bansal

Self-paced Data Augmentation for Training Neural Networks

Data augmentation is widely used for machine learning; however, an effective method to apply data augmentation has not been established even though it includes several factors that should be tuned carefully. One such factor is sample…

Machine Learning · Computer Science 2020-10-30 Tomoumi Takase, Ryo Karakida, Hideki Asoh

AutoDO: Robust AutoAugment for Biased Data with Label Noise via Scalable Probabilistic Implicit Differentiation

AutoAugment has sparked an interest in automated augmentation methods for deep learning models. These methods estimate image transformation policies for train data that improve generalization to test data. While recent papers evolved in the…

Computer Vision and Pattern Recognition · Computer Science 2021-03-15 Denis Gudovskiy, Luca Rigazio, Shun Ishizaka, Kazuki Kozuka, Sotaro Tsukizawa

Auditory-Based Data Augmentation for End-to-End Automatic Speech Recognition

End-to-end models have achieved significant improvement on automatic speech recognition. One common method to improve performance of these models is expanding the data-space through data augmentation. Meanwhile, human auditory inspired…

Audio and Speech Processing · Electrical Eng. & Systems 2022-04-12 Zehai Tu, Jack Deadman, Ning Ma, Jon Barker

When to Learn What: Model-Adaptive Data Augmentation Curriculum

Data augmentation (DA) is widely used to improve the generalization of neural networks by enforcing the invariances and symmetries to pre-defined transformations applied to input data. However, a fixed augmentation policy may have different…

Computer Vision and Pattern Recognition · Computer Science 2023-10-04 Chengkai Hou, Jieyu Zhang, Tianyi Zhou

Unsupervised Data Augmentation for Consistency Training

Semi-supervised learning lately has shown much promise in improving deep learning models when labeled data is scarce. Common among recent approaches is the use of consistency training on a large amount of unlabeled data to constrain model…

Machine Learning · Computer Science 2020-11-06 Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, Quoc V. Le

A Bayesian Data Augmentation Approach for Learning Deep Models

Data augmentation is an essential part of the training process applied to deep learning models. The motivation is that a robust training process for deep learning models depends on large annotated datasets, which are expensive to be…

Computer Vision and Pattern Recognition · Computer Science 2017-10-31 Toan Tran, Trung Pham, Gustavo Carneiro, Lyle Palmer, Ian Reid

Boosting Model Resilience via Implicit Adversarial Data Augmentation

Data augmentation plays a pivotal role in enhancing and diversifying training data. Nonetheless, consistently improving model performance in varied learning scenarios, especially those with inherent data biases, remains challenging. To…

Machine Learning · Computer Science 2024-06-04 Xiaoling Zhou, Wei Ye, Zhemg Lee, Rui Xie, Shikun Zhang

On-the-Fly Data Augmentation via Gradient-Guided and Sample-Aware Influence Estimation

Data augmentation has been widely employed to improve the generalization of deep neural networks. Most existing methods apply fixed or random transformations. However, we find that sample difficulty evolves along with the model's…

Machine Learning · Computer Science 2025-10-02 Suorong Yang, Jie Zong, Lihang Wang, Ziheng Qin, Hai Gan, Pengfei Zhou, Kai Wang, Yang You, Furao Shen

Enhancing Sample Utilization through Sample Adaptive Augmentation in Semi-Supervised Learning

In semi-supervised learning, unlabeled samples can be utilized through augmentation and consistency regularization. However, we observed certain samples, even undergoing strong augmentation, are still correctly classified with high…

Computer Vision and Pattern Recognition · Computer Science 2023-09-08 Guan Gui, Zhen Zhao, Lei Qi, Luping Zhou, Lei Wang, Yinghuan Shi

Make More of Your Data: Minimal Effort Data Augmentation for Automatic Speech Recognition and Translation

Data augmentation is a technique to generate new training data based on existing data. We evaluate the simple and cost-effective method of concatenating the original data examples to build new training instances. Continued training with…

Computation and Language · Computer Science 2023-06-12 Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler