Table Search Using a Deep Contextualized Language Model
Abstract
Pretrained contextualized language models such as BERT have achieved impressive results on various natural language processing benchmarks. Benefiting from multiple pretraining tasks and large scale training corpora, pretrained models can capture complex syntactic word relations. In this paper, we use the deep contextualized language model BERT for the task of ad hoc table retrieval. We investigate how to encode table content considering the table structure and input length limit of BERT. We also propose an approach that incorporates features from prior literature on table retrieval and jointly trains them with BERT. In experiments on public datasets, we show that our best approach can outperform the previous state-of-the-art method and BERT baselines with a large margin under different evaluation metrics.
Cite
@article{arxiv.2005.09207,
title = {Table Search Using a Deep Contextualized Language Model},
author = {Zhiyu Chen and Mohamed Trabelsi and Jeff Heflin and Yinan Xu and Brian D. Davison},
journal= {arXiv preprint arXiv:2005.09207},
year = {2020}
}
Comments
Accepted at SIGIR 2020 (Long)