Related papers: Adversarial Decoding: Generating Readable Document…

A survey on text generation using generative adversarial networks

This work presents a thorough review concerning recent studies and text generation advancements using Generative Adversarial Networks. The usage of adversarial learning for text generation is promising as it provides alternatives to…

Computation and Language · Computer Science 2022-12-22 Gustavo Henrique de Rosa , João Paulo Papa

Generating Natural Language Adversarial Examples on a Large Scale with Generative Models

Today text classification models have been widely used. However, these classifiers are found to be easily fooled by adversarial examples. Fortunately, standard attacking methods generate adversarial texts in a pair-wise way, that is, an…

Computation and Language · Computer Science 2020-03-24 Yankun Ren , Jianbin Lin , Siliang Tang , Jun Zhou , Shuang Yang , Yuan Qi , Xiang Ren

A Visual Analytics Framework for Adversarial Text Generation

This paper presents a framework which enables a user to more easily make corrections to adversarial texts. While attack algorithms have been demonstrated to automatically build adversaries, changes made by the algorithms can often have poor…

Human-Computer Interaction · Computer Science 2020-12-21 Brandon Laughlin , Christopher Collins , Karthik Sankaranarayanan , Khalil El-Khatib

Adversarial Decomposition of Text Representation

In this paper, we present a method for adversarial decomposition of text representation. This method can be used to decompose a representation of an input sentence into several independent vectors, each of them responsible for a specific…

Computation and Language · Computer Science 2019-04-11 Alexey Romanov , Anna Rumshisky , Anna Rogers , David Donahue

Towards Crafting Text Adversarial Samples

Adversarial samples are strategically modified samples, which are crafted with the purpose of fooling a classifier at hand. An attacker introduces specially crafted adversarial samples to a deployed classifier, which are being…

Machine Learning · Computer Science 2017-07-11 Suranjana Samanta , Sameep Mehta

Towards More Robust Retrieval-Augmented Generation: Evaluating RAG Under Adversarial Poisoning Attacks

Retrieval-Augmented Generation (RAG) systems have emerged as a promising solution to mitigate LLM hallucinations and enhance their performance in knowledge-intensive domains. However, these systems are vulnerable to adversarial poisoning…

Information Retrieval · Computer Science 2025-07-29 Jinyan Su , Jin Peng Zhou , Zhengxin Zhang , Preslav Nakov , Claire Cardie

Adversarial Feature Matching for Text Generation

The Generative Adversarial Network (GAN) has achieved great success in generating realistic (real-valued) synthetic data. However, convergence issues and difficulties dealing with discrete data hinder the applicability of GAN to text. We…

Machine Learning · Statistics 2017-11-21 Yizhe Zhang , Zhe Gan , Kai Fan , Zhi Chen , Ricardo Henao , Dinghan Shen , Lawrence Carin

Generative Adversarial Networks for text using word2vec intermediaries

Generative adversarial networks (GANs) have shown considerable success, especially in the realistic generation of images. In this work, we apply similar techniques for the generation of text. We propose a novel approach to handle the…

Computation and Language · Computer Science 2019-04-05 Akshay Budhkar , Krishnapriya Vishnubhotla , Safwan Hossain , Frank Rudzicz

Online Alternate Generator against Adversarial Attacks

The field of computer vision has witnessed phenomenal progress in recent years partially due to the development of deep convolutional neural networks. However, deep learning models are notoriously sensitive to adversarial examples which are…

Computer Vision and Pattern Recognition · Computer Science 2020-10-28 Haofeng Li , Yirui Zeng , Guanbin Li , Liang Lin , Yizhou Yu

Towards Unsupervised Adversarial Document Detection in Retrieval Augmented Generation Systems

Retrieval augmented generation systems have become an integral part of everyday life. Whether in internet search engines, email systems, or service chatbots, these systems are based on context retrieval and answer generation with large…

Cryptography and Security · Computer Science 2026-03-19 Patrick Levi

Identifying Adversarial Sentences by Analyzing Text Complexity

Attackers create adversarial text to deceive both human perception and the current AI systems to perform malicious purposes such as spam product reviews and fake political posts. We investigate the difference between the adversarial and the…

Computation and Language · Computer Science 2019-12-20 Hoang-Quoc Nguyen-Son , Tran Phuong Thao , Seira Hidano , Shinsaku Kiyomoto

Text Adversarial Purification as Defense against Adversarial Attacks

Adversarial purification is a successful defense mechanism against adversarial attacks without requiring knowledge of the form of the incoming attack. Generally, adversarial purification aims to remove the adversarial perturbations…

Computation and Language · Computer Science 2023-05-04 Linyang Li , Demin Song , Xipeng Qiu

DANCin SEQ2SEQ: Fooling Text Classifiers with Adversarial Text Example Generation

Machine learning models are powerful but fallible. Generating adversarial examples - inputs deliberately crafted to cause model misclassification or other errors - can yield important insight into model assumptions and vulnerabilities.…

Machine Learning · Computer Science 2017-12-18 Catherine Wong

A Constraint-Enforcing Reward for Adversarial Attacks on Text Classifiers

Text classifiers are vulnerable to adversarial examples -- correctly-classified examples that are deliberately transformed to be misclassified while satisfying acceptability constraints. The conventional approach to finding adversarial…

Computation and Language · Computer Science 2024-05-21 Tom Roth , Inigo Jauregi Unanue , Alsharif Abuadbba , Massimo Piccardi

Adversarial Paraphrasing: A Universal Attack for Humanizing AI-Generated Text

The increasing capabilities of Large Language Models (LLMs) have raised concerns about their misuse in AI-generated plagiarism and social engineering. While various AI-generated text detectors have been proposed to mitigate these risks,…

Computation and Language · Computer Science 2025-10-31 Yize Cheng , Vinu Sankar Sadasivan , Mehrdad Saberi , Shoumik Saha , Soheil Feizi

Adversarial Attacks on AI-Generated Text Detection Models: A Token Probability-Based Approach Using Embeddings

In recent years, text generation tools utilizing Artificial Intelligence (AI) have occasionally been misused across various domains, such as generating student reports or creative writings. This issue prompts plagiarism detection services…

Computation and Language · Computer Science 2025-04-14 Ahmed K. Kadhim , Lei Jiao , Rishad Shafik , Ole-Christoffer Granmo

Learning Globally Optimized Language Structure via Adversarial Training

Recent work has explored integrating autoregressive language models with energy-based models (EBMs) to enhance text generation capabilities. However, learning effective EBMs for text is challenged by the discrete nature of language. This…

Computation and Language · Computer Science 2023-11-14 Xuwang Yin

A Generative Adversarial Attack for Multilingual Text Classifiers

Current adversarial attack algorithms, where an adversary changes a text to fool a victim model, have been repeatedly shown to be effective against text classifiers. These attacks, however, generally assume that the victim model is…

Computation and Language · Computer Science 2024-01-17 Tom Roth , Inigo Jauregi Unanue , Alsharif Abuadbba , Massimo Piccardi

Humanizing Machine-Generated Content: Evading AI-Text Detection through Adversarial Attack

With the development of large language models (LLMs), detecting whether text is generated by a machine becomes increasingly challenging in the face of malicious use cases like the spread of false information, protection of intellectual…

Computation and Language · Computer Science 2024-04-03 Ying Zhou , Ben He , Le Sun

Graded Suspiciousness of Adversarial Texts to Human

Adversarial examples pose a significant challenge to deep neural networks (DNNs) across both image and text domains, with the intent to degrade model performance through meticulously altered inputs. Adversarial texts, however, are distinct…

Machine Learning · Computer Science 2025-01-24 Shakila Mahjabin Tonni , Pedro Faustini , Mark Dras