Related papers: Robustness to Programmable String Transformations …

Semantic Robustness of Models of Source Code

Deep neural networks are vulnerable to adversarial examples - small input perturbations that result in incorrect predictions. We study this problem for models of source code, where we want the network to be robust to source-code…

Machine Learning · Computer Science 2022-08-23 Goutham Ramakrishnan, Jordan Henkel, Zi Wang, Aws Albarghouthi, Somesh Jha, Thomas Reps

Certified Robustness to Programmable Transformations in LSTMs

Deep neural networks for natural language processing are fragile in the face of adversarial examples -- small input perturbations, like synonym substitution or word duplication, which cause a neural network to change its prediction. We…

Machine Learning · Computer Science 2021-09-08 Yuhao Zhang, Aws Albarghouthi, Loris D'Antoni

Fast Training of Deep Neural Networks Robust to Adversarial Perturbations

Deep neural networks are capable of training fast and generalizing well within many domains. Despite their promising performance, deep networks have shown sensitivities to perturbations of their inputs (e.g., adversarial examples) and their…

Machine Learning · Computer Science 2020-07-09 Justin Goodwin, Olivia Brown, Victoria Helus

Semantics-Preserving Adversarial Training

Adversarial training is a defense technique that improves adversarial robustness of a deep neural network (DNN) by including adversarial examples in the training data. In this paper, we identify an overlooked problem of adversarial training…

Machine Learning · Computer Science 2020-09-24 Wonseok Lee, Hanbit Lee, Sang-goo Lee

Soft Adversarial Training Can Retain Natural Accuracy

Adversarial training for neural networks has been in the limelight in recent years. The advancement in neural network architectures over the last decade has led to significant improvement in their performance. It sparked an interest in…

Machine Learning · Computer Science 2022-06-07 Abhijith Sharma, Apurva Narayan

A3T: Adversarially Augmented Adversarial Training

Recent research showed that deep neural networks are highly sensitive to so-called adversarial perturbations, which are tiny perturbations of the input data purposely designed to fool a machine learning classifier. Most classification…

Machine Learning · Computer Science 2018-01-15 Akram Erraqabi, Aristide Baratin, Yoshua Bengio, Simon Lacoste-Julien

Adversarial Training: embedding adversarial perturbations into the parameter space of a neural network to build a robust system

Adversarial training, in which a network is trained on both adversarial and clean examples, is one of the most trusted defense methods against adversarial attacks. However, there are three major practical difficulties in implementing and…

Machine Learning · Computer Science 2019-10-11 Shixian Wen, Laurent Itti

Adversarial Training Methods for Semi-Supervised Text Classification

Adversarial training provides a means of regularizing supervised learning algorithms while virtual adversarial training is able to extend supervised learning algorithms to the semi-supervised setting. However, both methods require making…

Machine Learning · Statistics 2021-11-17 Takeru Miyato, Andrew M. Dai, Ian Goodfellow

Adversarial and Random Transformations for Robust Domain Adaptation and Generalization

Data augmentation has been widely used to improve generalization in training deep neural networks. Recent works show that using worst-case transformations or adversarial augmentation strategies can significantly improve the accuracy and…

Computer Vision and Pattern Recognition · Computer Science 2022-11-15 Liang Xiao, Jiaolong Xu, Dawei Zhao, Erke Shang, Qi Zhu, Bin Dai

Adversarial Training Can Provably Improve Robustness: Theoretical Analysis of Feature Learning Process Under Structured Data

Adversarial training is a widely-applied approach to training deep neural networks to be robust against adversarial perturbation. However, although adversarial training has achieved empirical success in practice, it still remains unclear…

Machine Learning · Computer Science 2025-02-10 Binghui Li, Yuanzhi Li

On the Sensitivity of Adversarial Robustness to Input Data Distributions

Neural networks are vulnerable to small adversarial perturbations. Existing literature largely focused on understanding and mitigating the vulnerability of learned models. In this paper, we demonstrate an intriguing phenomenon about the…

Machine Learning · Computer Science 2019-02-25 Gavin Weiguang Ding, Kry Yik Chau Lui, Xiaomeng Jin, Luyu Wang, Ruitong Huang

Impact of Adversarial Training on Robustness and Generalizability of Language Models

Adversarial training is widely acknowledged as the most effective defense against adversarial attacks. However, it is also well established that achieving both robustness and generalization in adversarially trained models involves a…

Computation and Language · Computer Science 2023-12-12 Enes Altinisik, Hassan Sajjad, Husrev Taha Sencar, Safa Messaoud, Sanjay Chawla

Adversarial Robustness for Code

Machine learning and deep learning in particular has been recently used to successfully address many tasks in the domain of code such as finding and fixing bugs, code completion, decompilation, type inference and many others. However, the…

Machine Learning · Computer Science 2020-08-18 Pavol Bielik, Martin Vechev

Interpretable Adversarial Perturbation in Input Embedding Space for Text

Following great success in the image processing field, the idea of adversarial training has been applied to tasks in the natural language processing (NLP) field. One promising approach directly applies adversarial training developed in the…

Machine Learning · Computer Science 2018-05-09 Motoki Sato, Jun Suzuki, Hiroyuki Shindo, Yuji Matsumoto

A Curious Case of Searching for the Correlation between Training Data and Adversarial Robustness of Transformer Textual Models

Existing works have shown that fine-tuned textual transformer models achieve state-of-the-art prediction performances but are also vulnerable to adversarial text perturbations. Traditional adversarial evaluation is often done only…

Machine Learning · Computer Science 2024-07-03 Cuong Dang, Dung D. Le, Thai Le

Adversarial Training: A Survey

Adversarial training (AT) refers to integrating adversarial examples -- inputs altered with imperceptible perturbations that can significantly impact model predictions -- into the training process. Recent studies have demonstrated the…

Machine Learning · Computer Science 2024-10-22 Mengnan Zhao, Lihe Zhang, Jingwen Ye, Huchuan Lu, Baocai Yin, Xinchao Wang

Improving Neural Language Modeling via Adversarial Training

Recently, substantial progress has been made in language modeling by using deep neural networks. However, in practice, large scale neural language models have been shown to be prone to overfitting. In this paper, we present a simple yet…

Machine Learning · Computer Science 2019-09-10 Dilin Wang, Chengyue Gong, Qiang Liu

Certified Adversarial Robustness with Additive Noise

The existence of adversarial data examples has drawn significant attention in the deep-learning community; such data are seemingly minimally perturbed relative to the original data, but lead to very different outputs from a deep-learning…

Machine Learning · Computer Science 2019-11-12 Bai Li, Changyou Chen, Wenlin Wang, Lawrence Carin

Differentially Private Adversarial Robustness Through Randomized Perturbations

Deep Neural Networks, despite their great success in diverse domains, are provably sensitive to small perturbations on correctly classified examples and lead to erroneous predictions. Recently, it was proposed that this behavior can be…

Machine Learning · Computer Science 2020-09-29 Nan Xu, Oluwaseyi Feyisetan, Abhinav Aggarwal, Zekun Xu, Nathanael Teissier

Improving adversarial robustness of deep neural networks by using semantic information

The vulnerability of deep neural networks (DNNs) to adversarial attack, which is an attack that can mislead state-of-the-art classifiers into making an incorrect classification with high confidence by deliberately perturbing the original…

Machine Learning · Computer Science 2021-06-18 Lina Wang, Rui Tang, Yawei Yue, Xingshu Chen, Wei Wang, Yi Zhu, Xuemei Zeng