Related papers: Adversarial Training for Large Neural Language Mod…

Learning Globally Optimized Language Structure via Adversarial Training

Recent work has explored integrating autoregressive language models with energy-based models (EBMs) to enhance text generation capabilities. However, learning effective EBMs for text is challenged by the discrete nature of language. This…

Computation and Language · Computer Science 2023-11-14 Xuwang Yin

CAT:Collaborative Adversarial Training

Adversarial training can improve the robustness of neural networks. Previous methods focus on a single adversarial training strategy and do not consider the model property trained by different strategies. By revisiting the previous methods,…

Computer Vision and Pattern Recognition · Computer Science 2023-03-28 Xingbin Liu , Huafeng Kuang , Xianming Lin , Yongjian Wu , Rongrong Ji

Improving the robustness and accuracy of biomedical language models through adversarial training

Deep transformer neural network models have improved the predictive accuracy of intelligent text processing systems in the biomedical domain. They have obtained state-of-the-art performance scores on a wide variety of biomedical and…

Computation and Language · Computer Science 2021-11-17 Milad Moradi , Matthias Samwald

Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning

Adversarial training (i.e., training on adversarially perturbed input data) is a well-studied method for making neural networks robust to potential adversarial attacks during inference. However, the improved robustness does not come for…

Robotics · Computer Science 2023-01-27 Mathias Lechner , Alexander Amini , Daniela Rus , Thomas A. Henzinger

Amata: An Annealing Mechanism for Adversarial Training Acceleration

Despite the empirical success in various domains, it has been revealed that deep neural networks are vulnerable to maliciously perturbed input data that much degrade their performance. This is known as adversarial attacks. To counter…

Machine Learning · Computer Science 2021-08-17 Nanyang Ye , Qianxiao Li , Xiao-Yun Zhou , Zhanxing Zhu

TAVAT: Token-Aware Virtual Adversarial Training for Language Understanding

Gradient-based adversarial training is widely used in improving the robustness of neural networks, while it cannot be easily adapted to natural language processing tasks since the embedding space is discrete. In natural language processing…

Computation and Language · Computer Science 2020-12-07 Linyang Li , Xipeng Qiu

Generating Valid and Natural Adversarial Examples with Large Language Models

Deep learning-based natural language processing (NLP) models, particularly pre-trained language models (PLMs), have been revealed to be vulnerable to adversarial attacks. However, the adversarial examples generated by many mainstream…

Computation and Language · Computer Science 2023-11-21 Zimu Wang , Wei Wang , Qi Chen , Qiufeng Wang , Anh Nguyen

ATOM: Asynchronous Training of Massive Models for Deep Learning in a Decentralized Environment

The advent of the Transformer architecture has propelled the growth of natural language processing (NLP) models, leading to remarkable achievements in numerous NLP tasks. Yet, the absence of specialized hardware like expansive GPU memory…

Distributed, Parallel, and Cluster Computing · Computer Science 2024-03-18 Xiaofeng Wu , Jia Rao , Wei Chen

Adversarial Tuning: Defending Against Jailbreak Attacks for LLMs

Although safely enhanced Large Language Models (LLMs) have achieved remarkable success in tackling various complex tasks in a zero-shot manner, they remain susceptible to jailbreak attacks, particularly the unknown jailbreak attack. To…

Computation and Language · Computer Science 2024-06-12 Fan Liu , Zhao Xu , Hao Liu

Rethinking Textual Adversarial Defense for Pre-trained Language Models

Although pre-trained language models (PrLMs) have achieved significant success, recent studies demonstrate that PrLMs are vulnerable to adversarial attacks. By generating adversarial examples with slight perturbations on different levels…

Computation and Language · Computer Science 2022-08-23 Jiayi Wang , Rongzhou Bao , Zhuosheng Zhang , Hai Zhao

Adversarial Training of End-to-end Speech Recognition Using a Criticizing Language Model

In this paper we proposed a novel Adversarial Training (AT) approach for end-to-end speech recognition using a Criticizing Language Model (CLM). In this way the CLM and the automatic speech recognition (ASR) model can challenge and learn…

Computation and Language · Computer Science 2018-11-05 Alexander H. Liu , Hung-yi Lee , Lin-shan Lee

Stability and Generalization in Free Adversarial Training

While adversarial training methods have significantly improved the robustness of deep neural networks against norm-bounded adversarial perturbations, the generalization gap between their performance on training and test data is considerably…

Machine Learning · Computer Science 2025-01-08 Xiwei Cheng , Kexin Fu , Farzan Farnia

Fast Adversarial Training against Textual Adversarial Attacks

Many adversarial defense methods have been proposed to enhance the adversarial robustness of natural language processing models. However, most of them introduce additional pre-set linguistic knowledge and assume that the synonym candidates…

Computation and Language · Computer Science 2024-02-28 Yichen Yang , Xin Liu , Kun He

Semantics-Preserving Adversarial Training

Adversarial training is a defense technique that improves adversarial robustness of a deep neural network (DNN) by including adversarial examples in the training data. In this paper, we identify an overlooked problem of adversarial training…

Machine Learning · Computer Science 2020-09-24 Wonseok Lee , Hanbit Lee , Sang-goo Lee

Advancing NLP Security by Leveraging LLMs as Adversarial Engines

This position paper proposes a novel approach to advancing NLP security by leveraging Large Language Models (LLMs) as engines for generating diverse adversarial attacks. Building upon recent work demonstrating LLMs' effectiveness in…

Artificial Intelligence · Computer Science 2024-10-25 Sudarshan Srinivasan , Maria Mahbub , Amir Sadovnik

Distributed Adversarial Training to Robustify Deep Neural Networks at Scale

Current deep neural networks (DNNs) are vulnerable to adversarial attacks, where adversarial perturbations to the inputs can change or manipulate classification. To defend against such attacks, an effective and popular approach, known as…

Machine Learning · Computer Science 2022-09-08 Gaoyuan Zhang , Songtao Lu , Yihua Zhang , Xiangyi Chen , Pin-Yu Chen , Quanfu Fan , Lee Martie , Lior Horesh , Mingyi Hong , Sijia Liu

Simple Contrastive Representation Adversarial Learning for NLP Tasks

Self-supervised learning approach like contrastive learning is attached great attention in natural language processing. It uses pairs of training data augmentations to build a classification task for an encoder with well representation…

Computation and Language · Computer Science 2021-12-03 Deshui Miao , Jiaqi Zhang , Wenbo Xie , Jian Song , Xin Li , Lijuan Jia , Ning Guo

Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations

Model robustness against adversarial examples of single perturbation type such as the $\ell_{p}$-norm has been widely studied, yet its generalization to more realistic scenarios involving multiple semantic perturbations and their…

Computer Vision and Pattern Recognition · Computer Science 2023-03-23 Lei Hsiung , Yun-Yun Tsai , Pin-Yu Chen , Tsung-Yi Ho

ROOM: Adversarial Machine Learning Attacks Under Real-Time Constraints

Advances in deep learning have enabled a wide range of promising applications. However, these systems are vulnerable to Adversarial Machine Learning (AML) attacks; adversarially crafted perturbations to their inputs could cause them to…

Cryptography and Security · Computer Science 2022-01-06 Amira Guesmi , Khaled N. Khasawneh , Nael Abu-Ghazaleh , Ihsen Alouani

WAT: Improve the Worst-class Robustness in Adversarial Training

Deep Neural Networks (DNN) have been shown to be vulnerable to adversarial examples. Adversarial training (AT) is a popular and effective strategy to defend against adversarial attacks. Recent works (Benz et al., 2020; Xu et al., 2021; Tian…

Machine Learning · Computer Science 2023-02-09 Boqi Li , Weiwei Liu