English
Related papers

Related papers: kTrans: Knowledge-Aware Transformer for Binary Cod…

200 papers

Binary code similarity detection (BCSD) has important applications in various fields such as vulnerability detection, software component analysis, and reverse engineering. Recent studies have shown that deep neural networks (DNNs) can…

Cryptography and Security · Computer Science 2022-05-26 Hao Wang , Wenjie Qu , Gilad Katz , Wenyu Zhu , Zeyu Gao , Han Qiu , Jianwei Zhuge , Chao Zhang

Binary code similarity detection is a core task in reverse engineering. It supports malware analysis and vulnerability discovery by identifying semantically similar code in different contexts. Modern methods have progressed from manually…

Artificial Intelligence · Computer Science 2025-09-30 Charles E. Gagnon , Steven H. H. Ding , Philippe Charland , Benjamin C. M. Fung

Many recent works have demonstrated the benefits of knowledge graph embeddings in completing monolingual knowledge graphs. Inasmuch as related knowledge bases are built in several different languages, achieving cross-lingual knowledge…

Artificial Intelligence · Computer Science 2017-05-19 Muhao Chen , Yingtao Tian , Mohan Yang , Carlo Zaniolo

A recent trend in binary code analysis promotes the use of neural solutions based on instruction embedding models. An instruction embedding model is a neural network that transforms sequences of assembly instructions into embedding vectors.…

Cryptography and Security · Computer Science 2022-08-16 Fiorella Artuso , Marco Mormando , Giuseppe A. Di Luna , Leonardo Querzoni

Function association is a useful process for binary reverse engineers. Search tools exist to perform association at scale, but they do not utilize the full range of capabilities that AI-enabled search provides. Prior work has explored the…

Cryptography and Security · Computer Science 2026-05-08 Eric Wolos , Michael Doyle

Knowledge bases of real-world facts about entities and their relationships are useful resources for a variety of natural language processing tasks. However, because knowledge bases are typically incomplete, it is useful to be able to…

Computation and Language · Computer Science 2017-03-09 Dat Quoc Nguyen , Kairit Sirts , Lizhen Qu , Mark Johnson

Currently, a growing number of mature natural language processing applications make people's life more convenient. Such applications are built by source code - the language in software engineering. However, the applications for…

Software Engineering · Computer Science 2021-05-13 Ahmed Elnaggar , Wei Ding , Llion Jones , Tom Gibbs , Tamas Feher , Christoph Angerer , Silvia Severini , Florian Matthes , Burkhard Rost

Binary code analysis has immense importance in the research domain of software security. Today, software is very often compiled for various Instruction Set Architectures (ISAs). As a result, cross-architecture binary code analysis has…

Software Engineering · Computer Science 2024-05-01 Iftakhar Ahmad , Lannan Luo

Encrypted traffic classification (TC) methods must adapt to new protocols and extensions as well as to advancements in other machine learning fields. In this paper, we adopt a transfer learning setup best known from computer vision. We…

Machine Learning · Computer Science 2026-01-21 Jan Luxemburk , Karel Hynek , Richard Plný , Tomáš Čejka

Binary embeddings provide efficient and powerful ways to perform operations on large scale data. However binary embedding typically requires long codes in order to preserve the discriminative power of the input space. Thus binary coding…

Data Structures and Algorithms · Computer Science 2015-12-08 Felix X. Yu , Aditya Bhaskara , Sanjiv Kumar , Yunchao Gong , Shih-Fu Chang

The task of multimodal referring expression comprehension (REC), aiming at localizing an image region described by a natural language expression, has recently received increasing attention within the research comminity. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2024-12-02 Zhi Zhang , Helen Yannakoudakis , Xiantong Zhen , Ekaterina Shutova

Though adversarial erasing has prevailed in weakly supervised semantic segmentation to help activate integral object regions, existing approaches still suffer from the dilemma of under-activation and over-expansion due to the difficulty in…

Computer Vision and Pattern Recognition · Computer Science 2024-07-04 Tao Chen , XiRuo Jiang , Gensheng Pei , Zeren Sun , Yucheng Wang , Yazhou Yao

In this work, we reimagine classical probing to evaluate knowledge transfer from simple source to more complex target tasks. Instead of probing frozen representations from a complex source task on diverse simple target probing tasks (as…

When applying the Transformer architecture to source code, designing a good self-attention mechanism is critical as it affects how node relationship is extracted from the Abstract Syntax Trees (ASTs) of the source code. We present Code…

Software Engineering · Computer Science 2024-04-10 Saeyoon Oh , Shin Yoo

Code search is a widely used technique by developers during software development. It provides semantically similar implementations from a large code corpus to developers based on their queries. Existing techniques leverage deep learning…

Software Engineering · Computer Science 2022-02-17 Weisong Sun , Chunrong Fang , Yuchen Chen , Guanhong Tao , Tingxu Han , Quanjun Zhang

Decompilation is foundational to binary analysis, yet conventional tools prioritize human readability over strict recompilability and verifiable runtime correctness. While recent LLM-based approaches attempt to refine decompiled pseudocode,…

Software Engineering · Computer Science 2026-03-17 Yuxin Cui , Zeyu Gao , Shuxian He , Siliang Qin , Chao Zhang

Currently, video transmission serves not only the Human Visual System (HVS) for viewing but also machine perception for analysis. However, existing codecs are primarily optimized for pixel-domain and HVS-perception metrics rather than the…

Image and Video Processing · Electrical Eng. & Systems 2025-03-28 Yuxiao Sun , Yao Zhao , Meiqin Liu , Chao Yao , Weisi Lin

In this work, we begin to investigate the possibility of training a deep neural network on the task of binary code understanding. Specifically, the network would take, as input, features derived directly from binaries and output English…

Machine Learning · Computer Science 2024-05-01 Alexander Interrante-Grant , Andy Davis , Heather Preslier , Tim Leek

Tackling binary program analysis problems has traditionally implied manually defining rules and heuristics, a tedious and time-consuming task for human analysts. In order to improve automation and scalability, we propose an alternative…

Cryptography and Security · Computer Science 2021-05-25 Shushan Arakelyan , Sima Arasteh , Christophe Hauser , Erik Kline , Aram Galstyan

Binary code similarity detection (BCSD) has various applications, including but not limited to vulnerability detection, plagiarism detection, and malware detection. Previous research efforts mainly focus on transforming binary code to…

Cryptography and Security · Computer Science 2023-06-27 Chensen Huang , Guibo Zhu , Guojing Ge , Taihao Li , Jinqiao Wang
‹ Prev 1 2 3 10 Next ›