Related papers: Black-Box Adversarial Attacks on LLM-Based Code Co…

An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection

Large Language Models (LLMs) have transformed code completion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often fine-tune these models for specific applications, poisoning and…

Cryptography and Security · Computer Science 2024-06-12 Shenao Yan , Shen Wang , Yue Duan , Hanbin Hong , Kiho Lee , Doowon Kim , Yuan Hong

CodeLMSec Benchmark: Systematically Evaluating and Finding Security Vulnerabilities in Black-Box Code Language Models

Large language models (LLMs) for automatic code generation have achieved breakthroughs in several programming tasks. Their advances in competition-level programming problems have made them an essential pillar of AI-assisted pair…

Cryptography and Security · Computer Science 2023-10-24 Hossein Hajipour , Keno Hassler , Thorsten Holz , Lea Schönherr , Mario Fritz

Exploring Jailbreak Attacks on LLMs through Intent Concealment and Diversion

Although large language models (LLMs) have achieved remarkable advancements, their security remains a pressing concern. One major threat is jailbreak attacks, where adversarial prompts bypass model safeguards to generate harmful or…

Cryptography and Security · Computer Science 2025-05-21 Tiehan Cui , Yanxu Mao , Peipei Liu , Congying Liu , Datao You

CCTEST: Testing and Repairing Code Completion Systems

Code completion, a highly valuable topic in the software development domain, has been increasingly promoted for use by recent advances in large language models (LLMs). To date, visible LLM-based code completion frameworks such as GitHub…

Software Engineering · Computer Science 2023-05-09 Zongjie Li , Chaozheng Wang , Zhibo Liu , Haoxuan Wang , Dong Chen , Shuai Wang , Cuiyun Gao

RealSec-bench: A Benchmark for Evaluating Secure Code Generation in Real-World Repositories

Large Language Models (LLMs) have demonstrated remarkable capabilities in code generation, but their proficiency in producing secure code remains a critical, under-explored area. Existing benchmarks often fall short by relying on synthetic…

Cryptography and Security · Computer Science 2026-02-02 Yanlin Wang , Ziyao Zhang , Chong Wang , Xinyi Xu , Mingwei Liu , Yong Wang , Jiachi Chen , Zibin Zheng

Demonstration Attack against In-Context Learning for Code Intelligence

Recent advancements in large language models (LLMs) have revolutionized code intelligence by improving programming productivity and alleviating challenges faced by software developers. To further improve the performance of LLMs on specific…

Cryptography and Security · Computer Science 2024-10-07 Yifei Ge , Weisong Sun , Yihang Lou , Chunrong Fang , Yiran Zhang , Yiming Li , Xiaofang Zhang , Yang Liu , Zhihong Zhao , Zhenyu Chen

Security Attacks on LLM-based Code Completion Tools

The rapid development of large language models (LLMs) has significantly advanced code completion capabilities, giving rise to a new generation of LLM-based Code Completion Tools (LCCTs). Unlike general-purpose LLMs, these tools possess…

Computation and Language · Computer Science 2025-01-03 Wen Cheng , Ke Sun , Xinyu Zhang , Wei Wang

ShadowCode: Towards (Automatic) External Prompt Injection Attack against Code LLMs

Recent advancements have led to the widespread adoption of code-oriented large language models (Code LLMs) for programming tasks. Despite their success in deployment, their security research is left far behind. This paper introduces a new…

Cryptography and Security · Computer Science 2025-07-23 Yuchen Yang , Yiming Li , Hongwei Yao , Bingrun Yang , Yiling He , Tianwei Zhang , Dacheng Tao , Zhan Qin

Target-driven Attack for Large Language Models

Current large language models (LLM) provide a strong foundation for large-scale user-oriented natural language tasks. Many users can easily inject adversarial text or instructions through the user interface, thus causing LLM model security…

Computation and Language · Computer Science 2024-11-14 Chong Zhang , Mingyu Jin , Dong Shu , Taowen Wang , Dongfang Liu , Xiaobo Jin

ACE: A Security Architecture for LLM-Integrated App Systems

LLM-integrated app systems extend the utility of Large Language Models (LLMs) with third-party apps that are invoked by a system LLM using interleaved planning and execution phases to answer user queries. These systems introduce new attack…

Cryptography and Security · Computer Science 2026-05-15 Evan Li , Tushin Mallick , Evan Rose , William Robertson , Alina Oprea , Cristina Nita-Rotaru

Inducing Vulnerable Code Generation in LLM Coding Assistants

Due to insufficient domain knowledge, LLM coding assistants often reference related solutions from the Internet to address programming problems. However, incorporating external information into LLMs' code generation process introduces new…

Software Engineering · Computer Science 2025-04-23 Binqi Zeng , Quan Zhang , Chijin Zhou , Gwihwan Go , Yu Jiang , Heyuan Shi

LLM-CSEC: Empirical Evaluation of Security in C/C++ Code Generated by Large Language Models

The security of code generated by large language models (LLMs) is a significant concern, as studies indicate that such code often contains vulnerabilities and lacks essential defensive programming constructs. This work focuses on examining…

Artificial Intelligence · Computer Science 2025-11-25 Muhammad Usman Shahid , Chuadhry Mujeeb Ahmed , Rajiv Ranjan

DCE-LLM: Dead Code Elimination with Large Language Models

Dead code introduces several challenges in software development, such as increased binary size and maintenance difficulties. It can also obscure logical errors and be exploited for obfuscation in malware. For LLM-based code-related tasks,…

Software Engineering · Computer Science 2025-06-16 Minyu Chen , Guoqiang Li , Ling-I Wu , Ruibang Liu

"Someone Hid It": Query-Agnostic Black-Box Attacks on LLM-Based Retrieval

Large language models (LLMs) have been serving as effective backbones for retrieval systems, including Retrieval-Augmentation-Generation (RAG), Dense Information Retriever (IR), and Agent Memory Retrieval. Recent studies have demonstrated…

Cryptography and Security · Computer Science 2026-05-18 Jiate Li , Defu Cao , Li Li , Wei Yang , Yuehan Qin , Chenxiao Yu , Tiannuo Yang , Ryan A. Rossi , Yan Liu , Xiyang Hu , Yue Zhao

Black-box Optimization of LLM Outputs by Asking for Directions

We present a novel approach for attacking black-box large language models (LLMs) by exploiting their ability to express confidence in natural language. Existing black-box attacks require either access to continuous model outputs like logits…

Cryptography and Security · Computer Science 2025-10-21 Jie Zhang , Meng Ding , Yang Liu , Jue Hong , Florian Tramèr

HardSecBench: Benchmarking the Security Awareness of LLMs for Hardware Code Generation

Large language models (LLMs) are being increasingly integrated into practical hardware and firmware development pipelines for code generation. Existing studies have primarily focused on evaluating the functional correctness of LLM-generated…

Cryptography and Security · Computer Science 2026-01-21 Qirui Chen , Jingxian Shuai , Shuangwu Chen , Shenghao Ye , Zijian Wen , Xufei Su , Jie Jin , Jiangming Li , Jun Chen , Xiaobin Tan , Jian Yang

Certifying LLM Safety against Adversarial Prompting

Large language models (LLMs) are vulnerable to adversarial attacks that add malicious tokens to an input prompt to bypass the safety guardrails of an LLM and cause it to produce harmful content. In this work, we introduce erase-and-check,…

Computation and Language · Computer Science 2025-02-06 Aounon Kumar , Chirag Agarwal , Suraj Srinivas , Aaron Jiaxun Li , Soheil Feizi , Himabindu Lakkaraju

Construction and Evaluation of LLM-based agents for Semi-Autonomous penetration testing

With the emergence of high-performance large language models (LLMs) such as GPT, Claude, and Gemini, the autonomous and semi-autonomous execution of tasks has significantly advanced across various domains. However, in highly specialized…

Cryptography and Security · Computer Science 2025-02-24 Masaya Kobayashi , Masane Fuchi , Amar Zanashir , Tomonori Yoneda , Tomohiro Takagi

How Secure is Secure Code Generation? Adversarial Prompts Put LLM Defenses to the Test

Recent secure code generation methods, using vulnerability-aware fine-tuning, prefix-tuning, and prompt optimization, claim to prevent LLMs from producing insecure code. However, their robustness under adversarial conditions remains…

Cryptography and Security · Computer Science 2026-01-13 Melissa Tessa , Iyiola E. Olatunji , Aicha War , Jacques Klein , Tegawendé F. Bissyandé

Benchmarking and Defending Against Indirect Prompt Injection Attacks on Large Language Models

The integration of large language models with external content has enabled applications such as Microsoft Copilot but also introduced vulnerabilities to indirect prompt injection attacks. In these attacks, malicious instructions embedded…

Computation and Language · Computer Science 2025-01-28 Jingwei Yi , Yueqi Xie , Bin Zhu , Emre Kiciman , Guangzhong Sun , Xing Xie , Fangzhao Wu