English

Measuring source code conciseness across programming languages using compression

Programming Languages 2021-11-19 v1 Software Engineering

Abstract

It is well-known, and often a topic of heated debates, that programs in some programming languages are more concise than in others. This is a relevant factor when comparing or aggregating volume-impacted metrics on source code written in a combination of programming languages. In this paper, we present a model for measuring the conciseness of programming languages in a consistent, objective and evidence-based way. We present the approach, explain how it is founded on information theoretical principles, present detailed analysis steps and show the quantitative results of applying this model to a large benchmark of diverse commercial software applications. We demonstrate that our metric for language conciseness is strongly correlated with both an alternative analytical approach, and with a large scale developer survey, and show how its results can be applied to improve software metrics for multi-language applications.

Keywords

Cite

@article{arxiv.2111.09728,
  title  = {Measuring source code conciseness across programming languages using compression},
  author = {Lodewijk Bergmans and Xander Schrijen and Edwin Ouwehand and Magiel Bruntink},
  journal= {arXiv preprint arXiv:2111.09728},
  year   = {2021}
}