Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Md Tahmid Rahman Laskar; Enamul Hoque; Jimmy Xiangji Huang

Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Computation and Language 2020-11-17 v1 Information Retrieval

Authors: Md Tahmid Rahman Laskar , Enamul Hoque , Jimmy Xiangji Huang

Abstract

Pre-training a transformer-based model for the language modeling task in a large dataset and then fine-tuning it for downstream tasks has been found very useful in recent years. One major advantage of such pre-trained language models is that they can effectively absorb the context of each word in a sentence. However, for tasks such as the answer selection task, the pre-trained language models have not been extensively used yet. To investigate their effectiveness in such tasks, in this paper, we adopt the pre-trained Bidirectional Encoder Representations from Transformer (BERT) language model and fine-tune it on two Question Answering (QA) datasets and three Community Question Answering (CQA) datasets for the answer selection task. We find that fine-tuning the BERT model for the answer selection task is very effective and observe a maximum improvement of 13.1% in the QA datasets and 18.7% in the CQA datasets compared to the previous state-of-the-art.

Keywords

bert model transformation pre-trained language model

Cite

@article{arxiv.2011.07208,
  title  = {Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection},
  author = {Md Tahmid Rahman Laskar and Enamul Hoque and Jimmy Xiangji Huang},
  journal= {arXiv preprint arXiv:2011.07208},
  year   = {2020}
}

Comments

Accepted to the AMMCS 2019 Proceedings

Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Abstract

Keywords

Cite

Comments

Related papers