Table Search Using a Deep Contextualized Language Model

Information Retrieval 2020-05-28 v2 Computation and Language Machine Learning

Abstract

Pretrained contextualized language models such as BERT have achieved impressive results on various natural language processing benchmarks. Benefiting from multiple pretraining tasks and large-scale training corpora, pretrained models can capture complex syntactic word relations. In this paper, we use the deep contextualized language model BERT for the task of ad hoc table retrieval. We investigate how to encode table content given the table structure and BERT's input length limit. We also propose an approach that incorporates features from prior literature on table retrieval and jointly trains them with BERT. In experiments on public datasets, we show that our best approach outperforms the previous state-of-the-art method and BERT baselines by a large margin under different evaluation metrics.
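
To make the input length constraint concrete, here is a minimal sketch of packing a query and table rows into a fixed token budget. It is an illustration only and not the paper's encoding method: the function name `pack_query_and_table`, the whitespace tokenizer, and the row-by-row truncation strategy are all assumptions. BERT itself uses WordPiece tokenization, and 512 tokens is the usual maximum input length.

```python
from typing import List

MAX_TOKENS = 512  # usual BERT maximum input length, special tokens included


def tokenize(text: str) -> List[str]:
    # Stand-in whitespace tokenizer (assumption); BERT uses WordPiece.
    return text.lower().split()


def pack_query_and_table(
    query: str, rows: List[List[str]], max_tokens: int = MAX_TOKENS
) -> List[str]:
    """Hypothetical helper: append whole rows until the budget is reached."""
    tokens = ["[CLS]"] + tokenize(query) + ["[SEP]"]
    for row in rows:
        row_tokens = tokenize(" ".join(row)) + ["[SEP]"]
        # Stop before a row that would overflow, so no row is cut in half.
        if len(tokens) + len(row_tokens) > max_tokens:
            break
        tokens.extend(row_tokens)
    return tokens


packed = pack_query_and_table(
    "world cup winners",
    [["Year", "Winner"], ["2014", "Germany"], ["2018", "France"]],
    max_tokens=12,
)
```

With a budget of 12 tokens, the third row does not fit and is dropped whole. A real system would likely choose which rows, columns, or cells to keep by relevance to the query instead of by position; this sketch does not attempt that, and it does not handle a query that alone exceeds the budget.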

Cite

@article{arxiv.2005.09207,
  title  = {Table Search Using a Deep Contextualized Language Model},
  author = {Zhiyu Chen and Mohamed Trabelsi and Jeff Heflin and Yinan Xu and Brian D. Davison},
  journal= {arXiv preprint arXiv:2005.09207},
  year   = {2020}
}

Comments

Accepted at SIGIR 2020 (Long)