AixBench: A Code Generation Benchmark Dataset

Yiyang Hao; Ge Li; Yongqiang Liu; Xiaowei Miao; He Zong; Siyuan Jiang; Yang Liu; He Wei

Software Engineering 2022-07-22 v2

Abstract

We present a benchmark dataset for evaluating the method-level code generation task. The benchmark contains a dataset of 175 samples for automated evaluation and a dataset of 161 samples for manual evaluation. We also present a new metric for automatically evaluating the correctness of the generated code, and a set of criteria for manually evaluating the overall quality of the generated code.
