English

Predicting Issue Types with seBERT

Software Engineering 2022-05-04 v1 Computation and Language Machine Learning

Abstract

Pre-trained transformer models are the current state-of-the-art for natural language models processing. seBERT is such a model, that was developed based on the BERT architecture, but trained from scratch with software engineering data. We fine-tuned this model for the NLBSE challenge for the task of issue type prediction. Our model dominates the baseline fastText for all three issue types in both recall and precisio} to achieve an overall F1-score of 85.7%, which is an increase of 4.1% over the baseline.

Keywords

Cite

@article{arxiv.2205.01335,
  title  = {Predicting Issue Types with seBERT},
  author = {Alexander Trautsch and Steffen Herbold},
  journal= {arXiv preprint arXiv:2205.01335},
  year   = {2022}
}

Comments

Accepted for Publication at the NLBSE'22 Tool Competition

R2 v1 2026-06-24T11:05:35.386Z