Related papers: CoTDeceptor:Adversarial Code Obfuscation Against C…

A Systematic Study of Code Obfuscation Against LLM-based Vulnerability Detection

As large language models (LLMs) are increasingly adopted for code vulnerability detection, their reliability and robustness across diverse vulnerability types have become a pressing concern. In traditional adversarial settings, code…

Cryptography and Security · Computer Science 2025-12-19 Xiao Li , Yue Li , Hao Wu , Yue Zhang , Yechao Zhang , Fengyuan Xu , Sheng Zhong

The Code Barrier: What LLMs Actually Understand?

Understanding code represents a core ability needed for automating software development tasks. While foundation models like LLMs show impressive results across many software engineering challenges, the extent of their true semantic…

Software Engineering · Computer Science 2025-04-16 Serge Lionel Nikiema , Jordan Samhi , Abdoul Kader Kaboré , Jacques Klein , Tegawendé F. Bissyandé

Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns

The integration of large language models (LLMs) into various pipelines is increasingly widespread, effectively automating many manual tasks and often surpassing human capabilities. Cybersecurity researchers and practitioners have recognised…

Cryptography and Security · Computer Science 2024-05-01 Constantinos Patsakis , Fran Casino , Nikolaos Lykousas

"Digital Camouflage": The LLVM Challenge in LLM-Based Malware Detection

Large Language Models (LLMs) have emerged as promising tools for malware detection by analyzing code semantics, identifying vulnerabilities, and adapting to evolving threats. However, their reliability under adversarial compiler-level…

Cryptography and Security · Computer Science 2025-09-23 Ekin Böke , Simon Torka

Benchmarking Large Language Models for IoC Recovery under Adversarial Code Obfuscation and Encryption

Software obfuscation and encryption present persistent challenges for program comprehension and security analysis, particularly when adversaries conceal Indicators of Compromise (IoCs) such as IP addresses within source code. While Large…

Cryptography and Security · Computer Science 2026-05-11 Jaime Morales , Sergio Pastrana , Juan Tapiador

SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding

As large language models (LLMs) become increasingly integrated into real-world applications such as code generation and chatbot assistance, extensive efforts have been made to align LLM behavior with human values, including safety.…

Cryptography and Security · Computer Science 2024-07-29 Zhangchen Xu , Fengqing Jiang , Luyao Niu , Jinyuan Jia , Bill Yuchen Lin , Radha Poovendran

DeceptPrompt: Exploiting LLM-driven Code Generation via Adversarial Natural Language Instructions

With the advancement of Large Language Models (LLMs), significant progress has been made in code generation, enabling LLMs to transform natural language into programming code. These Code LLMs have been widely accepted by massive users and…

Cryptography and Security · Computer Science 2023-12-14 Fangzhou Wu , Xiaogeng Liu , Chaowei Xiao

An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities against Strong Detection

Large Language Models (LLMs) have transformed code completion tasks, providing context-based suggestions to boost developer productivity in software engineering. As users often fine-tune these models for specific applications, poisoning and…

Cryptography and Security · Computer Science 2024-06-12 Shenao Yan , Shen Wang , Yue Duan , Hanbin Hong , Kiho Lee , Doowon Kim , Yuan Hong

ConceptCoder: Improve Code Reasoning via Concept Learning

Large language models (LLMs) have shown promising results for software engineering applications, but still struggle with code reasoning tasks such as vulnerability detection (VD). We introduce ConceptCoder, a fine-tuning method that…

Software Engineering · Computer Science 2026-03-25 Md Mahbubur Rahman , Hengbo Tong , Wei Le

What You See Is Not Always What You Get: Evaluating GPT's Comprehension of Source Code

Recent studies have demonstrated outstanding capabilities of large language models (LLMs) in software engineering tasks, including code generation and comprehension. While LLMs have shown significant potential in assisting with coding, LLMs…

Software Engineering · Computer Science 2025-11-18 Jiawen Wen , Bangshuo Zhu , Huaming Chen

Detecting LLM-generated Code with Subtle Modification by Adversarial Training

With the rapid development of Large Language Models (LLMs), their powerful code-generation capabilities have been widely applied in tasks like code completion and automated development, demonstrating the value of improving coding…

Software Engineering · Computer Science 2025-07-18 Xin Yin , Xinrui Li , Chao Ni , Xiaodan Xu , Xiaohu Yang

An Insight into Security Code Review with LLMs: Capabilities, Obstacles, and Influential Factors

Security code review is a time-consuming and labor-intensive process typically requiring integration with automated security defect detection tools. However, existing security analysis tools struggle with poor generalization, high false…

Software Engineering · Computer Science 2026-05-12 Jiaxin Yu , Peng Liang , Yujia Fu , Amjed Tahir , Mojtaba Shahin , Chong Wang , Yangxiao Cai

CodeCipher: Learning to Obfuscate Source Code Against LLMs

While large code language models have made significant strides in AI-assisted coding tasks, there are growing concerns about privacy challenges. The user code is transparent to the cloud LLM service provider, inducing risks of unauthorized…

Computation and Language · Computer Science 2024-10-10 Yalan Lin , Chengcheng Wan , Yixiong Fang , Xiaodong Gu

Revealing Adversarial Smart Contracts through Semantic Interpretation and Uncertainty Estimation

Adversarial smart contracts, mostly on EVM-compatible chains like Ethereum and BSC, are deployed as EVM bytecode to exploit vulnerable smart contracts for financial gain. Detecting such malicious contracts at the time of deployment is an…

Cryptography and Security · Computer Science 2025-11-17 Yating Liu , Xing Su , Hao Wu , Sijin Li , Yuxi Cheng , Fengyuan Xu , Sheng Zhong

Detecting Adversarial Fine-tuning with Auditing Agents

Large Language Model (LLM) providers expose fine-tuning APIs that let end users fine-tune their frontier LLMs. Unfortunately, it has been shown that an adversary with fine-tuning access to an LLM can bypass safeguards. Particularly…

Cryptography and Security · Computer Science 2025-10-21 Sarah Egler , John Schulman , Nicholas Carlini

A Preliminary Study of Large Language Models for Multilingual Vulnerability Detection

Deep learning-based approaches, particularly those leveraging pre-trained language models (PLMs), have shown promise in automated software vulnerability detection. However, existing methods are predominantly limited to specific programming…

Software Engineering · Computer Science 2025-05-13 Junji Yu , Honglin Shu , Michael Fu , Dong Wang , Chakkrit Tantithamthavorn , Yasutaka Kamei , Junjie Chen

ShadowCoT: Cognitive Hijacking for Stealthy Reasoning Backdoors in LLMs

Chain-of-Thought (CoT) enhances an LLM's ability to perform complex reasoning tasks, but it also introduces new security issues. In this work, we present ShadowCoT, a novel backdoor attack framework that targets the internal reasoning…

Cryptography and Security · Computer Science 2026-04-23 Gejian Zhao , Hanzhou Wu , Xinpeng Zhang , Athanasios V. Vasilakos

ExplainableGuard: Interpretable Adversarial Defense for Large Language Models Using Chain-of-Thought Reasoning

Large Language Models (LLMs) are increasingly vulnerable to adversarial attacks that can subtly manipulate their outputs. While various defense mechanisms have been proposed, many operate as black boxes, lacking transparency in their…

Cryptography and Security · Computer Science 2025-11-19 Shaowei Guan , Yu Zhai , Zhengyu Zhang , Yanze Wang , Hin Chi Kwok

DCE-LLM: Dead Code Elimination with Large Language Models

Dead code introduces several challenges in software development, such as increased binary size and maintenance difficulties. It can also obscure logical errors and be exploited for obfuscation in malware. For LLM-based code-related tasks,…

Software Engineering · Computer Science 2025-06-16 Minyu Chen , Guoqiang Li , Ling-I Wu , Ruibang Liu

A Survey on Adversarial Machine Learning for Code Data: Realistic Threats, Countermeasures, and Interpretations

Code Language Models (CLMs) have achieved tremendous progress in source code understanding and generation, leading to a significant increase in research interests focused on applying CLMs to real-world software engineering tasks in recent…

Cryptography and Security · Computer Science 2024-11-13 Yulong Yang , Haoran Fan , Chenhao Lin , Qian Li , Zhengyu Zhao , Chao Shen , Xiaohong Guan