English
Related papers

Related papers: MathAttack: Attacking Large Language Models Toward…

200 papers

While Large Language Models (LLMs) have shown impressive capabilities in math problem-solving tasks, their robustness to noisy inputs is not well-studied. We propose ArithmAttack to examine how robust the LLMs are when they encounter noisy…

Computation and Language · Computer Science 2026-03-17 Zain Ul Abedin , Shahzeb Qamar , Lucie Flek , Akbar Karimi

Over the past decade, there has been extensive research aimed at enhancing the robustness of neural networks, yet this problem remains vastly unsolved. Here, one major impediment has been the overestimation of the robustness of new defense…

Artificial Intelligence · Computer Science 2023-10-31 Leo Schwinn , David Dobre , Stephan Günnemann , Gauthier Gidel

The increasing deployment of Large Language Models (LLMs) in various applications necessitates a rigorous evaluation of their robustness against adversarial attacks. In this paper, we present a comprehensive study on the robustness of GPT…

Computation and Language · Computer Science 2024-12-24 Yiyi Tao , Yixian Shen , Hang Zhang , Yanxin Shen , Lun Wang , Chuanqi Shi , Shaoshuai Du

Large Language Models (LLMs) have revolutionized natural language processing, but their robustness against adversarial attacks remains a critical concern. We presents a novel white-box style attack approach that exposes vulnerabilities in…

Computation and Language · Computer Science 2024-09-16 Zeyu Yang , Zhao Meng , Xiaochen Zheng , Roger Wattenhofer

Standard accuracy metrics have shown that Math Word Problem (MWP) solvers have achieved high performance on benchmark datasets. However, the extent to which existing MWP solvers truly understand language and its relation with numbers is…

Computation and Language · Computer Science 2021-09-14 Vivek Kumar , Rishabh Maheshwary , Vikram Pudi

Large language models (LLMs) have exhibited remarkable fluency across various tasks. However, their unethical applications, such as disseminating disinformation, have become a growing concern. Although recent works have proposed a number of…

Computation and Language · Computer Science 2024-10-07 James Wang , Ran Li , Junfeng Yang , Chengzhi Mao

The widespread adoption of Large Language Models (LLMs), exemplified by OpenAI's ChatGPT, brings to the forefront the imperative to defend against adversarial threats on these models. These attacks, which manipulate an LLM's output by…

Cryptography and Security · Computer Science 2025-04-04 Amelia Kawasaki , Andrew Davis , Houssam Abbas

The increasing reliance on Large Language Models (LLMs) across academia and industry necessitates a comprehensive understanding of their robustness to prompts. In response to this vital need, we introduce PromptRobust, a robustness…

Computation and Language · Computer Science 2024-07-17 Kaijie Zhu , Jindong Wang , Jiaheng Zhou , Zichen Wang , Hao Chen , Yidong Wang , Linyi Yang , Wei Ye , Yue Zhang , Neil Zhenqiang Gong , Xing Xie

The wide-ranging applications of large language models (LLMs), especially in safety-critical domains, necessitate the proper evaluation of the LLM's adversarial robustness. This paper proposes an efficient tool to audit the LLM's…

Cryptography and Security · Computer Science 2023-10-23 Xilie Xu , Keyi Kong , Ning Liu , Lizhen Cui , Di Wang , Jingfeng Zhang , Mohan Kankanhalli

The prevalence and strong capability of large language models (LLMs) present significant safety and ethical risks if exploited by malicious users. To prevent the potentially deceptive usage of LLMs, recent works have proposed algorithms to…

Computation and Language · Computer Science 2023-10-20 Zhouxing Shi , Yihan Wang , Fan Yin , Xiangning Chen , Kai-Wei Chang , Cho-Jui Hsieh

Large Language Models (LLMs) have been applied to Math Word Problems (MWPs) with transformative impacts, revolutionizing how these complex problems are approached and solved in various domains including educational settings. However, the…

Computation and Language · Computer Science 2024-06-18 Joykirat Singh , Akshay Nambi , Vibhav Vineet

Large language models (LLMs) are becoming a popular tool as they have significantly advanced in their capability to tackle a wide range of language-based tasks. However, LLMs applications are highly vulnerable to prompt injection attacks,…

Computation and Language · Computer Science 2024-11-11 Md Abdur Rahman , Fan Wu , Alfredo Cuzzocrea , Sheikh Iqbal Ahamed

Large language models (LLMs) have achieved impressive performance across various mathematical reasoning benchmarks. However, there are increasing debates regarding whether these models truly understand and apply mathematical knowledge or…

Computation and Language · Computer Science 2024-07-03 Qintong Li , Leyang Cui , Xueliang Zhao , Lingpeng Kong , Wei Bi

Large Language Models (LLMs) have demonstrated impressive capabilities in reasoning using Chain-of-Thought (CoT) prompting. However, CoT can be biased by users' instruction. In this work, we study the reasoning robustness of LLMs to…

Computation and Language · Computer Science 2024-11-11 Esther Gan , Yiran Zhao , Liying Cheng , Yancan Mao , Anirudh Goyal , Kenji Kawaguchi , Min-Yen Kan , Michael Shieh

Large Language Models (LLMs) excel at various tasks, including problem-solving and question-answering. However, LLMs often find Math Word Problems (MWPs) challenging because solving them requires a range of reasoning and mathematical…

Artificial Intelligence · Computer Science 2025-09-24 Mitchell Piehl , Dillon Wilson , Ananya Kalita , Jugal Kalita

Large language models (LLMs) have significantly transformed the educational landscape. As current plagiarism detection tools struggle to keep pace with LLMs' rapid advancements, the educational community faces the challenge of assessing…

Computation and Language · Computer Science 2024-06-18 Roy Xie , Chengxuan Huang , Junlin Wang , Bhuwan Dhingra

Large Language Models (LLMs) excel at various tasks, including solving math word problems (MWPs), but struggle with real-world problems containing irrelevant information. To address this, we propose a prompting framework that generates…

Computation and Language · Computer Science 2025-09-17 Ujjwala Anantheswaran , Himanshu Gupta , Kevin Scaria , Shreyas Verma , Chitta Baral , Swaroop Mishra

Safety alignment in large language models (LLMs) is increasingly compromised by jailbreak attacks, which can manipulate these models to generate harmful or unintended content. Investigating these attacks is crucial for uncovering model…

Cryptography and Security · Computer Science 2025-05-26 Linbao Li , Yannan Liu , Daojing He , Yu Li

Modern large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities for coding tasks including writing and reasoning about code. They improve upon previous neural network models of code, such as code2seq or…

Machine Learning · Computer Science 2023-11-23 Chi Zhang , Zifan Wang , Ravi Mangal , Matt Fredrikson , Limin Jia , Corina Pasareanu

Large Language Models (LLMs) have become a cornerstone in the field of Natural Language Processing (NLP), offering transformative capabilities in understanding and generating human-like text. However, with their rising prominence, the…

Cryptography and Security · Computer Science 2024-03-26 Arijit Ghosh Chowdhury , Md Mofijul Islam , Vaibhav Kumar , Faysal Hossain Shezan , Vaibhav Kumar , Vinija Jain , Aman Chadha
‹ Prev 1 2 3 10 Next ›