English

DPS: Design Pattern Summarisation Using Code Features

Software Engineering 2025-12-08 v3

Abstract

Automatic summarisation has been used efficiently in recent years to condense texts, conversations, audio, code, and various other artefacts. A range of methods, from simple template-based summaries to complex machine learning techniques -- and more recently, large language models -- have been employed to generate these summaries. Summarising software design patterns is important because it helps developers quickly understand and reuse complex design concepts, thereby improving software maintainability and development efficiency. However, the generation of summaries for software design patterns has not yet been explored. Our approach utilises code features and JavaParser to parse the code and create a JSON representation. Using an NLG library on this JSON representation, we convert it into natural language text that acts as a summary of the code, capturing the contextual information of the design pattern. Our empirical results indicate that the summaries generated by our approach capture the context in which patterns are applied in the codebase. Statistical evaluations demonstrate that our summaries closely align with human-written summaries, as evident from high values in the ROUGE-L, BLEU-4, NIST, and FrugalScore metrics. A follow-up survey further shows that DPS summaries were rated as capturing context better than human-generated summaries. Additionally, a time based task activity shows that summaries increase the time of understanding of design pattern for developer better than when the summaries are not present.

Keywords

Cite

@article{arxiv.2504.11081,
  title  = {DPS: Design Pattern Summarisation Using Code Features},
  author = {Najam Nazar and Sameer Sikka and Christoph Treude},
  journal= {arXiv preprint arXiv:2504.11081},
  year   = {2025}
}

Comments

32 pages, 2 figures, 8 tables