English

Evolutionary Grammar-Based Fuzzing

Software Engineering 2020-08-05 v1

Abstract

A fuzzer provides randomly generated inputs to a targeted software to expose erroneous behavior. To efficiently detect defects, generated inputs should conform to the structure of the input format and thus, grammars can be used to generate syntactically correct inputs. In this context, fuzzing can be guided by probabilities attached to competing rules in the grammar, leading to the idea of probabilistic grammar-based fuzzing. However, the optimal assignment of probabilities to individual grammar rules to effectively expose erroneous behavior for individual systems under test is an open research question. In this paper, we present EvoGFuzz, an evolutionary grammar-based fuzzing approach to optimize the probabilities to generate test inputs that may be more likely to trigger exceptional behavior. The evaluation shows the effectiveness of EvoGFuzz in detecting defects compared to probabilistic grammar-based fuzzing (baseline). Applied to ten real-world applications with common input formats (JSON, JavaScript, or CSS3), the evaluation shows that EvoGFuzz achieved a significantly larger median line coverage for all subjects by up to 48% compared to the baseline. Moreover, EvoGFuzz managed to expose 11 unique defects, from which five have not been detected by the baseline.

Keywords

Cite

@article{arxiv.2008.01150,
  title  = {Evolutionary Grammar-Based Fuzzing},
  author = {Martin Eberlein and Yannic Noller and Thomas Vogel and Lars Grunske},
  journal= {arXiv preprint arXiv:2008.01150},
  year   = {2020}
}

Comments

Accepted at the 12th Symposium on Search-Based Software Engineering (SSBSE 2020)