Related papers: A Visual Analytics Framework for Adversarial Text …

Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

Today text classification models have been widely used. However, these classifiers are found to be easily fooled by adversarial examples. Fortunately, standard attacking methods generate adversarial texts in a pair-wise way, that is, an…

Computation and Language · Computer Science 2020-03-24 Yankun Ren , Jianbin Lin , Siliang Tang , Jun Zhou , Shuang Yang , Yuan Qi , Xiang Ren

A Generative Adversarial Attack for Multilingual Text Classifiers

Current adversarial attack algorithms, where an adversary changes a text to fool a victim model, have been repeatedly shown to be effective against text classifiers. These attacks, however, generally assume that the victim model is…

Computation and Language · Computer Science 2024-01-17 Tom Roth , Inigo Jauregi Unanue , Alsharif Abuadbba , Massimo Piccardi

Towards Crafting Text Adversarial Samples

Adversarial samples are strategically modified samples, which are crafted with the purpose of fooling a classifier at hand. An attacker introduces specially crafted adversarial samples to a deployed classifier, which are being…

Machine Learning · Computer Science 2017-07-11 Suranjana Samanta , Sameep Mehta

Visual Attack and Defense on Text

Modifying characters of a piece of text to their visual similar ones often ap-pear in spam in order to fool inspection systems and other conditions, which we regard as a kind of adversarial attack to neural models. We pro-pose a way of…

Computer Vision and Pattern Recognition · Computer Science 2020-08-25 Shengjun Liu , Ningkang Jiang , Yuanbin Wu

Identifying Adversarial Attacks on Text Classifiers

The landscape of adversarial attacks against text classifiers continues to grow, with new attacks developed every year and many of them available in standard toolkits, such as TextAttack and OpenAttack. In response, there is a growing body…

Computation and Language · Computer Science 2022-01-24 Zhouhang Xie , Jonathan Brophy , Adam Noack , Wencong You , Kalyani Asthana , Carter Perkins , Sabrina Reis , Sameer Singh , Daniel Lowd

Explaining Vulnerabilities to Adversarial Machine Learning through Visual Analytics

Machine learning models are currently being deployed in a variety of real-world applications where model predictions are used to make decisions about healthcare, bank loans, and numerous other critical tasks. As the deployment of artificial…

Human-Computer Interaction · Computer Science 2019-10-07 Yuxin Ma , Tiankai Xie , Jundong Li , Ross Maciejewski

Adversarial Texts with Gradient Methods

Adversarial samples for images have been extensively studied in the literature. Among many of the attacking methods, gradient-based methods are both effective and easy to compute. In this work, we propose a framework to adapt the gradient…

Computation and Language · Computer Science 2018-01-26 Zhitao Gong , Wenlu Wang , Bo Li , Dawn Song , Wei-Shinn Ku

Adversarial Decoding: Generating Readable Documents for Adversarial Objectives

We design, implement, and evaluate adversarial decoding, a new, generic text generation technique that produces readable documents for different adversarial objectives. Prior methods either produce easily detectable gibberish, or cannot…

Computation and Language · Computer Science 2025-03-07 Collin Zhang , Tingwei Zhang , Vitaly Shmatikov

A survey on text generation using generative adversarial networks

This work presents a thorough review concerning recent studies and text generation advancements using Generative Adversarial Networks. The usage of adversarial learning for text generation is promising as it provides alternatives to…

Computation and Language · Computer Science 2022-12-22 Gustavo Henrique de Rosa , João Paulo Papa

Identifying Adversarial Sentences by Analyzing Text Complexity

Attackers create adversarial text to deceive both human perception and the current AI systems to perform malicious purposes such as spam product reviews and fake political posts. We investigate the difference between the adversarial and the…

Computation and Language · Computer Science 2019-12-20 Hoang-Quoc Nguyen-Son , Tran Phuong Thao , Seira Hidano , Shinsaku Kiyomoto

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack

With the development of large language models (LLMs), detecting whether text is generated by a machine becomes increasingly challenging in the face of malicious use cases like the spread of false information, protection of intellectual…

Computation and Language · Computer Science 2024-04-03 Ying Zhou , Ben He , Le Sun

Adversarial Feature Matching for Text Generation

The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We…

Machine Learning · Statistics 2017-11-21 Yizhe Zhang , Zhe Gan , Kai Fan , Zhi Chen , Ricardo Henao , Dinghan Shen , Lawrence Carin

The Attack Generator: A Systematic Approach Towards Constructing Adversarial Attacks

Most state-of-the-art machine learning (ML) classification systems are vulnerable to adversarial perturbations. As a consequence, adversarial robustness poses a significant challenge for the deployment of ML-based systems in safety- and…

Machine Learning · Computer Science 2019-06-18 Felix Assion , Peter Schlicht , Florens Greßner , Wiebke Günther , Fabian Hüger , Nico Schmidt , Umair Rasheed

Grey-box Adversarial Attack And Defence For Sentiment Classification

We introduce a grey-box adversarial attack and defence framework for sentiment classification. We address the issues of differentiability, label preservation and input reconstruction for adversarial attack and defence in one unified…

Machine Learning · Computer Science 2021-03-23 Ying Xu , Xu Zhong , Antonio Jimeno Yepes , Jey Han Lau

HQA-VLAttack: Towards High Quality Adversarial Attack on Vision-Language Pre-Trained Models

Black-box adversarial attack on vision-language pre-trained models is a practical and challenging task, as text and image perturbations need to be considered simultaneously, and only the predicted results are accessible. Research on this…

Computer Vision and Pattern Recognition · Computer Science 2026-04-21 Han Liu , Jiaqi Li , Zhi Xu , Xiaotong Zhang , Xiaoming Xu , Fenglong Ma , Yuanman Li , Hong Yu

Online Alternate Generator against Adversarial Attacks

The field of computer vision has witnessed phenomenal progress in recent years partially due to the development of deep convolutional neural networks. However, deep learning models are notoriously sensitive to adversarial examples which are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-28 Haofeng Li , Yirui Zeng , Guanbin Li , Liang Lin , Yizhou Yu

Generating Textual Adversaries with Minimal Perturbation

Many word-level adversarial attack approaches for textual data have been proposed in recent studies. However, due to the massive search space consisting of combinations of candidate words, the existing approaches face the problem of…

Computation and Language · Computer Science 2022-11-15 Xingyi Zhao , Lu Zhang , Depeng Xu , Shuhan Yuan

Bridge the Gap Between CV and NLP! A Gradient-based Textual Adversarial Attack Framework

Despite recent success on various tasks, deep learning techniques still perform poorly on adversarial examples with small perturbations. While optimization-based methods for adversarial attacks are well-explored in the field of computer…

Computation and Language · Computer Science 2023-06-09 Lifan Yuan , Yichi Zhang , Yangyi Chen , Wei Wei

On Generation of Adversarial Examples using Convex Programming

It has been observed that deep learning architectures tend to make erroneous decisions with high reliability for particularly designed adversarial instances. In this work, we show that the perturbation analysis of these architectures…

Machine Learning · Computer Science 2018-12-05 Emilio Rafael Balda , Arash Behboodi , Rudolf Mathar

Contrasting Human- and Machine-Generated Word-Level Adversarial Examples for Text Classification

Research shows that natural language processing models are generally considered to be vulnerable to adversarial attacks; but recent work has drawn attention to the issue of validating these adversarial inputs against certain criteria (e.g.,…

Computation and Language · Computer Science 2021-09-10 Maximilian Mozes , Max Bartolo , Pontus Stenetorp , Bennett Kleinberg , Lewis D. Griffin