AST-Transformer: Encoding Abstract Syntax Trees Efficiently for Code Summarization
Computation and Language
2021-12-03 v1 Software Engineering
Abstract
Code summarization aims to generate brief natural language descriptions for source code. As source code is highly structured and follows strict programming language grammars, its Abstract Syntax Tree (AST) is often leveraged to inform the encoder about the structural information. However, ASTs are usually much longer than the source code. Current approaches ignore the size limit and simply feed the whole linearized AST into the encoder. To address this problem, we propose AST-Transformer to efficiently encode tree-structured ASTs. Experiments show that AST-Transformer outperforms the state-of-arts by a substantial margin while being able to reduce of the computational complexity in the encoding process.
Keywords
Cite
@article{arxiv.2112.01184,
title = {AST-Transformer: Encoding Abstract Syntax Trees Efficiently for Code Summarization},
author = {Ze Tang and Chuanyi Li and Jidong Ge and Xiaoyu Shen and Zheling Zhu and Bin Luo},
journal= {arXiv preprint arXiv:2112.01184},
year = {2021}
}