Related papers: MathAttack: Attacking Large Language Models Toward…

ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving

While Large Language Models (LLMs) have shown impressive capabilities in math problem-solving tasks, their robustness to noisy inputs is not well-studied. We propose ArithmAttack to examine how robust the LLMs are when they encounter noisy…

Computation and Language · Computer Science 2026-03-17 Zain Ul Abedin, Shahzeb Qamar, Lucie Flek, Akbar Karimi

Adversarial Attacks and Defenses in Large Language Models: Old and New Threats

Over the past decade, there has been extensive research aimed at enhancing the robustness of neural networks, yet this problem remains vastly unsolved. Here, one major impediment has been the overestimation of the robustness of new defense…

Artificial Intelligence · Computer Science 2023-10-31 Leo Schwinn, David Dobre, Stephan Günnemann, Gauthier Gidel

Robustness of Large Language Models Against Adversarial Attacks

The increasing deployment of Large Language Models (LLMs) in various applications necessitates a rigorous evaluation of their robustness against adversarial attacks. In this paper, we present a comprehensive study on the robustness of GPT…

Computation and Language · Computer Science 2024-12-24 Yiyi Tao, Yixian Shen, Hang Zhang, Yanxin Shen, Lun Wang, Chuanqi Shi, Shaoshuai Du

Assessing Adversarial Robustness of Large Language Models: An Empirical Study

Large Language Models (LLMs) have revolutionized natural language processing, but their robustness against adversarial attacks remains a critical concern. We present a novel white-box style attack approach that exposes vulnerabilities in…

Computation and Language · Computer Science 2024-09-16 Zeyu Yang, Zhao Meng, Xiaochen Zheng, Roger Wattenhofer

Adversarial Examples for Evaluating Math Word Problem Solvers

Standard accuracy metrics have shown that Math Word Problem (MWP) solvers have achieved high performance on benchmark datasets. However, the extent to which existing MWP solvers truly understand language and its relation with numbers is…

Computation and Language · Computer Science 2021-09-14 Vivek Kumar, Rishabh Maheshwary, Vikram Pudi

RAFT: Realistic Attacks to Fool Text Detectors

Large language models (LLMs) have exhibited remarkable fluency across various tasks. However, their unethical applications, such as disseminating disinformation, have become a growing concern. Although recent works have proposed a number of…

Computation and Language · Computer Science 2024-10-07 James Wang, Ran Li, Junfeng Yang, Chengzhi Mao

Defending Large Language Models Against Attacks With Residual Stream Activation Analysis

The widespread adoption of Large Language Models (LLMs), exemplified by OpenAI's ChatGPT, brings to the forefront the imperative to defend against adversarial threats on these models. These attacks, which manipulate an LLM's output by…

Cryptography and Security · Computer Science 2025-04-04 Amelia Kawasaki, Andrew Davis, Houssam Abbas

PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts

The increasing reliance on Large Language Models (LLMs) across academia and industry necessitates a comprehensive understanding of their robustness to prompts. In response to this vital need, we introduce PromptRobust, a robustness…

Computation and Language · Computer Science 2024-07-17 Kaijie Zhu, Jindong Wang, Jiaheng Zhou, Zichen Wang, Hao Chen, Yidong Wang, Linyi Yang, Wei Ye, Yue Zhang, Neil Zhenqiang Gong, Xing Xie

An LLM can Fool Itself: A Prompt-Based Adversarial Attack

The wide-ranging applications of large language models (LLMs), especially in safety-critical domains, necessitate the proper evaluation of the LLM's adversarial robustness. This paper proposes an efficient tool to audit the LLM's…

Cryptography and Security · Computer Science 2023-10-23 Xilie Xu, Keyi Kong, Ning Liu, Lizhen Cui, Di Wang, Jingfeng Zhang, Mohan Kankanhalli

Red Teaming Language Model Detectors with Language Models

The prevalence and strong capability of large language models (LLMs) present significant safety and ethical risks if exploited by malicious users. To prevent the potentially deceptive usage of LLMs, recent works have proposed algorithms to…

Computation and Language · Computer Science 2023-10-20 Zhouxing Shi, Yihan Wang, Fan Yin, Xiangning Chen, Kai-Wei Chang, Cho-Jui Hsieh

Exposing the Achilles' Heel: Evaluating LLMs Ability to Handle Mistakes in Mathematical Reasoning

Large Language Models (LLMs) have been applied to Math Word Problems (MWPs) with transformative impacts, revolutionizing how these complex problems are approached and solved in various domains including educational settings. However, the…

Computation and Language · Computer Science 2024-06-18 Joykirat Singh, Akshay Nambi, Vibhav Vineet

Fine-tuned Large Language Models (LLMs): Improved Prompt Injection Attacks Detection

Large language models (LLMs) are becoming a popular tool as they have significantly advanced in their capability to tackle a wide range of language-based tasks. However, LLM applications are highly vulnerable to prompt injection attacks,…

Computation and Language · Computer Science 2024-11-11 Md Abdur Rahman, Fan Wu, Alfredo Cuzzocrea, Sheikh Iqbal Ahamed

GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers

Large language models (LLMs) have achieved impressive performance across various mathematical reasoning benchmarks. However, there are increasing debates regarding whether these models truly understand and apply mathematical knowledge or…

Computation and Language · Computer Science 2024-07-03 Qintong Li, Leyang Cui, Xueliang Zhao, Lingpeng Kong, Wei Bi

Reasoning Robustness of LLMs to Adversarial Typographical Errors

Large Language Models (LLMs) have demonstrated impressive capabilities in reasoning using Chain-of-Thought (CoT) prompting. However, CoT can be biased by users' instruction. In this work, we study the reasoning robustness of LLMs to…

Computation and Language · Computer Science 2024-11-11 Esther Gan, Yiran Zhao, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh

Solving Math Word Problems Using Estimation Verification and Equation Generation

Large Language Models (LLMs) excel at various tasks, including problem-solving and question-answering. However, LLMs often find Math Word Problems (MWPs) challenging because solving them requires a range of reasoning and mathematical…

Artificial Intelligence · Computer Science 2025-09-24 Mitchell Piehl, Dillon Wilson, Ananya Kalita, Jugal Kalita

Adversarial Math Word Problem Generation

Large language models (LLMs) have significantly transformed the educational landscape. As current plagiarism detection tools struggle to keep pace with LLMs' rapid advancements, the educational community faces the challenge of assessing…

Computation and Language · Computer Science 2024-06-18 Roy Xie, Chengxuan Huang, Junlin Wang, Bhuwan Dhingra

Cutting Through the Noise: Boosting LLM Performance on Math Word Problems

Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates…

Computation and Language · Computer Science 2025-09-17 Ujjwala Anantheswaran, Himanshu Gupta, Kevin Scaria, Shreyas Verma, Chitta Baral, Swaroop Mishra

One Model Transfer to All: On Robust Jailbreak Prompts Generation against LLMs

Safety alignment in large language models (LLMs) is increasingly compromised by jailbreak attacks, which can manipulate these models to generate harmful or unintended content. Investigating these attacks is crucial for uncovering model…

Cryptography and Security · Computer Science 2025-05-26 Linbao Li, Yannan Liu, Daojing He, Yu Li

Transfer Attacks and Defenses for Large Language Models on Coding Tasks

Modern large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities for coding tasks including writing and reasoning about code. They improve upon previous neural network models of code, such as code2seq or…

Machine Learning · Computer Science 2023-11-23 Chi Zhang, Zifan Wang, Ravi Mangal, Matt Fredrikson, Limin Jia, Corina Pasareanu

Breaking Down the Defenses: A Comparative Survey of Attacks on Large Language Models

Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text. However, with their rising prominence, the…

Cryptography and Security · Computer Science 2024-03-26 Arijit Ghosh Chowdhury, Md Mofijul Islam, Vaibhav Kumar, Faysal Hossain Shezan, Vaibhav Kumar, Vinija Jain, Aman Chadha