BotGraph: Web Bot Detection Based on Sitemap

Cryptography and Security 2019-03-27 v2

Abstract

Web bots have long been blamed for consuming a large share of Internet traffic and undermining the interests of the sites they scrape. Traditional bot detection studies focus mainly on signature-based solutions, but advanced bots usually forge their identities to bypass such detection. As more sites migrate to the cloud, cloud providers have a new opportunity to address this problem with effective bot detection based on big data. In this paper, we present BotGraph, a behavior-based bot detection scheme that combines sitemaps and a convolutional neural network (CNN) to detect the inner behavior of bots. Experimental results show that BotGraph achieves ~95% recall and precision on 35-day production data traces from different customers, including the Bing search engine and several sites.

Cite

@article{arxiv.1903.08074,
  title  = {BotGraph: Web Bot Detection Based on Sitemap},
  author = {Yang Luo and Guozhen She and Peng Cheng and Yongqiang Xiong},
  journal= {arXiv preprint arXiv:1903.08074},
  year   = {2019}
}

Comments

7 pages, 3 figures
