Related papers: MALSIGHT: Exploring Malicious Source Code and Beni…

A Vulnerability Code Intent Summary Dataset

In the era of Large Language Models (LLMs), the code summarization technique boosts a lot, along with the emergence of many new significant works. However, the potential of code summarization in the Computer Security Area still remains…

Software Engineering · Computer Science 2025-04-14 Yifan Huang, Weisong Sun, Yubin Qu

Binary Diff Summarization using Large Language Models

Security of software supply chains is necessary to ensure that software updates do not contain maliciously injected code or introduce vulnerabilities that may compromise the integrity of critical infrastructure. Verifying the integrity of…

Cryptography and Security · Computer Science 2025-09-30 Meet Udeshi, Venkata Sai Charan Putrevu, Prashanth Krishnamurthy, Prashant Anantharaman, Sean Carrick, Ramesh Karri, Farshad Khorrami

Exploring Large Language Models for Semantic Analysis and Categorization of Android Malware

Malware analysis is a complex process of examining and evaluating malicious software's functionality, origin, and potential impact. This arduous process typically involves dissecting the software to understand its components, infection…

Cryptography and Security · Computer Science 2025-01-10 Brandon J Walton, Mst Eshita Khatun, James M Ghawaly, Aisha Ali-Gombe

MalCVE: Malware Detection and CVE Association Using Large Language Models

Malicious software attacks are having an increasingly significant economic impact. Commercial malware detection software can be costly, and tools that attribute malware to the specific software vulnerabilities it exploits are largely…

Cryptography and Security · Computer Science 2026-02-03 Eduard Andrei Cristea, Petter Molnes, Jingyue Li

Exploring LLMs for Malware Detection: Review, Framework Design, and Countermeasure Approaches

The rising use of Large Language Models (LLMs) to create and disseminate malware poses a significant cybersecurity challenge due to their ability to generate and distribute attacks with ease. A single prompt can initiate a wide array of…

Cryptography and Security · Computer Science 2024-09-13 Jamal Al-Karaki, Muhammad Al-Zafar Khan, Marwan Omar

MaLAware: Automating the Comprehension of Malicious Software Behaviours using Large Language Models (LLMs)

Current malware (malicious software) analysis tools focus on detection and family classification but fail to provide clear and actionable narrative insights into the malignant activity of the malware. Therefore, there is a need for a tool…

Cryptography and Security · Computer Science 2025-04-03 Bikash Saha, Nanda Rani, Sandeep Kumar Shukla

Summary-Mediated Repair: Can LLMs use code summarisation as a tool for program repair?

Large Language Models (LLMs) often produce code with subtle implementation-level bugs despite strong benchmark performance. These errors are hard for LLMs to spot and can have large behavioural effects; yet when asked to summarise code,…

Software Engineering · Computer Science 2025-11-25 Lukas Twist

MALITE: Lightweight Malware Detection and Classification for Constrained Devices

Today, malware is one of the primary cyberthreats to organizations. Malware has pervaded almost every type of computing device including the ones having limited memory, battery and computation power such as mobile phones, tablets and…

Cryptography and Security · Computer Science 2023-09-08 Sidharth Anand, Barsha Mitra, Soumyadeep Dey, Abhinav Rao, Rupsa Dhar, Jaideep Vaidya

An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding

Binary code analysis plays a pivotal role in the field of software security and is widely used in tasks such as software maintenance, malware detection, software vulnerability discovery, patch analysis, etc. However, unlike source code,…

Software Engineering · Computer Science 2025-05-01 Xiuwei Shang, Zhenkan Fu, Shaoyin Cheng, Guoqiang Chen, Gangyang Li, Li Hu, Weiming Zhang, Nenghai Yu

Empirical Analysis and Detection of Hallucinations in LLM-Generated Bug Report Summaries

Large Language Models (LLMs) are increasingly used to generate summaries of software bug reports, including sections such as Steps-to-Reproduce (S2R), Actual Behavior (AB), and Expected Behavior (EB). However, these models frequently…

Software Engineering · Computer Science 2026-05-26 Hinduja Nirujan, Shreyas Patil, Abdallah Ayoub, Ahmad Abdel Latif, Gouri Ginde

Exploiting Method Names to Improve Code Summarization: A Deliberation Multi-Task Learning Approach

Code summaries are brief natural language descriptions of source code pieces. The main purpose of code summarization is to assist developers in understanding code and to reduce documentation workload. In this paper, we design a novel…

Computation and Language · Computer Science 2021-03-31 Rui Xie, Wei Ye, Jinan Sun, Shikun Zhang

Calibration of Large Language Models on Code Summarization

A brief, fluent, and relevant summary can be helpful during program comprehension; however, such a summary does require significant human effort to produce. Often, good summaries are unavailable in software projects, which makes maintenance…

Software Engineering · Computer Science 2025-06-03 Yuvraj Virk, Premkumar Devanbu, Toufique Ahmed

Beyond Classification: Evaluating LLMs for Fine-Grained Automatic Malware Behavior Auditing

Automated malware classification has achieved strong detection performance. Yet, malware behavior auditing seeks causal and verifiable explanations of malicious activities -- essential not only to reveal what malware does but also to…

Cryptography and Security · Computer Science 2025-09-19 Xinran Zheng, Xingzhi Qian, Yiling He, Shuo Yang, Lorenzo Cavallaro

Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models

Binary code summarization, while invaluable for understanding code semantics, is challenging due to its labor-intensive nature. This study delves into the potential of large language models (LLMs) for binary code comprehension. To this end,…

Cryptography and Security · Computer Science 2023-12-18 Xin Jin, Jonathan Larson, Weiwei Yang, Zhiqiang Lin

Source Code Summarization in the Era of Large Language Models

To support software developers in understanding and maintaining programs, various automatic (source) code summarization techniques have been proposed to generate a concise natural language summary (i.e., comment) for a given code snippet.…

Software Engineering · Computer Science 2025-08-26 Weisong Sun, Yun Miao, Yuekang Li, Hongyu Zhang, Chunrong Fang, Yi Liu, Gelei Deng, Yang Liu, Zhenyu Chen

Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning

We propose a novel method to detect and visualize malware through image classification. The executable binaries are represented as grayscale images obtained from the count of N-grams (N=2) of bytes in the Discrete Cosine Transform (DCT)…

Cryptography and Security · Computer Science 2021-01-27 Tajuddin Manhar Mohammed, Lakshmanan Nataraj, Satish Chikkagoudar, Shivkumar Chandrasekaran, B. S. Manjunath

Code Summarization Beyond Function Level

Code summarization is a critical task in natural language processing and software engineering, which aims to generate concise descriptions of source code. Recent advancements have improved the quality of these summaries, enhancing code…

Computation and Language · Computer Science 2025-02-25 Vladimir Makharev, Vladimir Ivanov

Can Large Language Models Serve as Evaluators for Code Summarization?

Code summarization facilitates program comprehension and software maintenance by converting code snippets into natural-language descriptions. Over the years, numerous methods have been developed for this task, but a key challenge remains:…

Software Engineering · Computer Science 2024-12-03 Yang Wu, Yao Wan, Zhaoyang Chu, Wenting Zhao, Ye Liu, Hongyu Zhang, Xuanhua Shi, Philip S. Yu

Assessing LLMs in Malicious Code Deobfuscation of Real-world Malware Campaigns

The integration of large language models (LLMs) into various pipelines is increasingly widespread, effectively automating many manual tasks and often surpassing human capabilities. Cybersecurity researchers and practitioners have recognised…

Cryptography and Security · Computer Science 2024-05-01 Constantinos Patsakis, Fran Casino, Nikolaos Lykousas

On Benchmarking Code LLMs for Android Malware Analysis

Large Language Models (LLMs) have demonstrated strong capabilities in various code intelligence tasks. However, their effectiveness for Android malware analysis remains underexplored. Decompiled Android malware code presents unique…

Cryptography and Security · Computer Science 2025-04-24 Yiling He, Hongyu She, Xingzhi Qian, Xinran Zheng, Zhuo Chen, Zhan Qin, Lorenzo Cavallaro