Related papers: Semantic Data Representation for Explainable Windo…

Knowledge-Based Dataset for Training PE Malware Detection Models

Ontologies are a standard for semantic schemata in many knowledge-intensive domains of human interest. They are now becoming increasingly important also in areas until very recently dominated by subsymbolic representations and…

Cryptography and Security · Computer Science 2023-01-03 Peter Švec , Štefan Balogh , Martin Homola , Ján Kľuka

Towards an Automated Pipeline for Detecting and Classifying Malware through Machine Learning

The constant growth in the number of malware - software or code fragment potentially harmful for computers and information networks - and the use of sophisticated evasion and obfuscation techniques have seriously hindered classic…

Cryptography and Security · Computer Science 2021-06-11 Nicola Loi , Claudio Borile , Daniele Ucci

Explainable Malware Detection with Tailored Logic Explained Networks

Malware detection is a constant challenge in cybersecurity due to the rapid development of new attack techniques. Traditional signature-based approaches struggle to keep pace with the sheer volume of malware samples. Machine learning offers…

Cryptography and Security · Computer Science 2024-05-07 Peter Anthony , Francesco Giannini , Michelangelo Diligenti , Martin Homola , Marco Gori , Stefan Balogh , Jan Mojzis

Multimodal Techniques for Malware Classification

The threat of malware is a serious concern for computer networks and systems, highlighting the need for accurate classification techniques. In this research, we experiment with multimodal machine learning approaches for malware…

Cryptography and Security · Computer Science 2025-01-22 Jonathan Jiang , Mark Stamp

EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models

This paper describes EMBER: a labeled benchmark dataset for training machine learning models to statically detect malicious Windows portable executable files. The dataset includes features extracted from 1.1M binary files: 900K training…

Cryptography and Security · Computer Science 2018-04-18 Hyrum S. Anderson , Phil Roth

Machine Learning for Detecting Malware in PE Files

The increasing number of sophisticated malware poses a major cybersecurity threat. Portable executable (PE) files are a common vector for such malware. In this work we review and evaluate machine learning-based PE malware detection…

Cryptography and Security · Computer Science 2022-12-29 Collin Connors , Dilip Sarkar

Efficient Malware Analysis Using Metric Embeddings

In this paper, we explore the use of metric learning to embed Windows PE files in a low-dimensional vector space for downstream use in a variety of applications, including malware detection, family classification, and malware attribute…

Machine Learning · Computer Science 2022-12-07 Ethan M. Rudd , David Krisiloff , Scott Coull , Daniel Olszewski , Edward Raff , James Holt

Learning the PE Header, Malware Detection with Minimal Domain Knowledge

Many efforts have been made to use various forms of domain knowledge in malware detection. Currently there exist two common approaches to malware detection without domain knowledge, namely byte n-grams and strings. In this work we explore…

Machine Learning · Statistics 2017-11-15 Edward Raff , Jared Sylvester , Charles Nicholas

Machine Learning Transferability for Malware Detection

Malware continues to be a predominant operational risk for organizations, especially when obfuscation techniques are used to evade detection. Despite the ongoing efforts in the development of Machine Learning (ML) detection approaches,…

Cryptography and Security · Computer Science 2026-03-30 César Vieira , João Vitorino , Eva Maia , Isabel Praça

Learning from Context: Exploiting and Interpreting File Path Information for Better Malware Detection

Machine learning (ML) used for static portable executable (PE) malware detection typically employs per-file numerical feature vector representations as input with one or more target labels during training. However, there is much orthogonal…

Cryptography and Security · Computer Science 2019-05-20 Adarsh Kyadige , Ethan M. Rudd , Konstantin Berlin

Multi-feature Dataset for Windows PE Malware Classification

This paper describes a multi-feature dataset for training machine learning classifiers for detecting malicious Windows Portable Executable (PE) files. The dataset includes four feature sets from 18,551 binary samples belonging to five…

Cryptography and Security · Computer Science 2022-10-31 Muhammad Irfan Yousuf , Izza Anwer , Tanzeela Shakir , Minahil Siddiqui , Maysoon Shahid

Explainability Guided Adversarial Evasion Attacks on Malware Detectors

As the focus on security of Artificial Intelligence (AI) is becoming paramount, research on crafting and inserting optimal adversarial perturbations has become increasingly critical. In the malware domain, this adversarial sample generation…

Cryptography and Security · Computer Science 2024-05-06 Kshitiz Aryal , Maanak Gupta , Mahmoud Abdelsalam , Moustafa Saleh

MalwarePT: A Binary-Level Foundation Model for Malware Analysis

Automated malware analysis increasingly relies on machine learning, yet most existing methods remain task-specific and depend on handcrafted features or narrowly scoped models. Recent developments in binary-level foundation models suggest a…

Cryptography and Security · Computer Science 2026-05-19 Saastha Vasan , Yuzhou Nie , Kaie Chen , Yigitcan Kaya , Hojjat Aghakhani , Roman Vasilenko , Wenbo Guo , Christopher Kruegel , Giovanni Vigna

Ontology Embedding: A Survey of Methods, Applications and Resources

Ontologies are widely used for representing domain knowledge and meta data, playing an increasingly important role in Information Systems, the Semantic Web, Bioinformatics and many other domains. However, logical reasoning that ontologies…

Artificial Intelligence · Computer Science 2025-04-08 Jiaoyan Chen , Olga Mashkova , Fernando Zhapa-Camacho , Robert Hoehndorf , Yuan He , Ian Horrocks

Instance Attack:An Explanation-based Vulnerability Analysis Framework Against DNNs for Malware Detection

Deep neural networks (DNNs) are increasingly being applied in malware detection and their robustness has been widely debated. Traditionally an adversarial example generation scheme relies on either detailed model information (gradient-based…

Cryptography and Security · Computer Science 2022-09-07 Sun RuiJin , Guo ShiZe , Guo JinHong , Xing ChangYou , Yang LuMing , Guo Xi , Pan ZhiSong

A Comprehensive Study on Learning-Based PE Malware Family Classification Methods

Driven by the high profit, Portable Executable (PE) malware has been consistently evolving in terms of both volume and sophistication. PE malware family classification has gained great attention and a large number of approaches have been…

Cryptography and Security · Computer Science 2021-11-01 Yixuan Ma , Shuang Liu , Jiajun Jiang , Guanhong Chen , Keqiu Li

Automatic Malware Description via Attribute Tagging and Similarity Embedding

With the rapid proliferation and increased sophistication of malicious software (malware), detection methods no longer rely only on manually generated signatures but have also incorporated more general approaches like machine learning…

Machine Learning · Computer Science 2020-01-24 Felipe N. Ducau , Ethan M. Rudd , Tad M. Heppner , Alex Long , Konstantin Berlin

EMBERSim: A Large-Scale Databank for Boosting Similarity Search in Malware Analysis

In recent years there has been a shift from heuristics-based malware detection towards machine learning, which proves to be more robust in the current heavily adversarial threat landscape. While we acknowledge machine learning to be better…

Machine Learning · Computer Science 2023-10-04 Dragos Georgian Corlatescu , Alexandru Dinu , Mihaela Gaman , Paul Sumedrea

A Static Malware Detection System Using Data Mining Methods

A serious threat today is malicious executables. It is designed to damage computer system and some of them spread over network without the knowledge of the owner using the system. Two approaches have been derived for it i.e. Signature Based…

Cryptography and Security · Computer Science 2013-08-14 Usukhbayar Baldangombo , Nyamjav Jambaljav , Shi-Jinn Horng

Malware Classification Using Long Short-Term Memory Models

Signature and anomaly based techniques are the quintessential approaches to malware detection. However, these techniques have become increasingly ineffective as malware has become more sophisticated and complex. Researchers have therefore…

Cryptography and Security · Computer Science 2021-03-05 Dennis Dang , Fabio Di Troia , Mark Stamp