English

DeepObfusCode: Source Code Obfuscation Through Sequence-to-Sequence Networks

Cryptography and Security 2021-02-26 v3 Machine Learning Software Engineering

Abstract

The paper explores a novel methodology in source code obfuscation through the application of text-based recurrent neural network (RNN) encoder-decoder models in ciphertext generation and key generation. Sequence-to-sequence models are incorporated into the model architecture to generate obfuscated code, generate the deobfuscation key, and live execution. Quantitative benchmark comparison to existing obfuscation methods indicate significant improvement in stealth and execution cost for the proposed solution, and experiments regarding the model's properties yield positive results regarding its character variation, dissimilarity to the original codebase, and consistent length of obfuscated code.

Keywords

Cite

@article{arxiv.1909.01837,
  title  = {DeepObfusCode: Source Code Obfuscation Through Sequence-to-Sequence Networks},
  author = {Siddhartha Datta},
  journal= {arXiv preprint arXiv:1909.01837},
  year   = {2021}
}

Comments

Accepted in Advances in Intelligent Systems and Computing 2021 & Computing Conference 2021