English

Predicting Process Name from Network Data

Cryptography and Security 2021-09-09 v1 Artificial Intelligence Machine Learning Networking and Internet Architecture

Abstract

The ability to identify applications based on the network data they generate could be a valuable tool for cyber defense. We report on a machine learning technique capable of using netflow-like features to predict the application that generated the traffic. In our experiments, we used ground-truth labels obtained from host-based sensors deployed in a large enterprise environment; we applied random forests and multilayer perceptrons to the tasks of browser vs. non-browser identification, browser fingerprinting, and process name prediction. For each of these tasks, we demonstrate how machine learning models can achieve high classification accuracy using only netflow-like features as the basis for classification.

Keywords

Cite

@article{arxiv.2109.03328,
  title  = {Predicting Process Name from Network Data},
  author = {Justin Allen and David Knapp and Kristine Monteith},
  journal= {arXiv preprint arXiv:2109.03328},
  year   = {2021}
}

Comments

Presented at 1st International Workshop on Adaptive Cyber Defense, 2021 (arXiv:2108.08476)