Related papers: DC-BERT: Decoupling Question and Document for Effi…

Co-BERT: A Context-Aware BERT Retrieval Model Incorporating Local and Query-specific Context

BERT-based text ranking models have dramatically advanced the state of the art in ad-hoc retrieval, wherein most models tend to consider individual query-document pairs independently. In the meantime, the importance and usefulness to…

Information Retrieval · Computer Science 2021-04-20 Xiaoyang Chen, Kai Hui, Ben He, Xianpei Han, Le Sun, Zheng Ye

Unsupervised Domain Adaptation of Contextual Embeddings for Low-Resource Duplicate Question Detection

Answering questions is a primary goal of many conversational systems or search products. While most current systems have focused on answering questions against structured databases or curated knowledge graphs, on-line community forums or…

Computation and Language · Computer Science 2019-11-11 Alexandre Rochette, Yadollah Yaghoobzadeh, Timothy J. Hazen

Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges

Recent years have witnessed a substantial increase in the use of deep learning to solve various natural language processing (NLP) problems. Early deep learning models were constrained by their sequential or unidirectional nature, such that…

Information Retrieval · Computer Science 2024-03-05 Jiajia Wang, Jimmy X. Huang, Xinhui Tu, Junmei Wang, Angela J. Huang, Md Tahmid Rahman Laskar, Amran Bhuiyan

Table Search Using a Deep Contextualized Language Model

Pretrained contextualized language models such as BERT have achieved impressive results on various natural language processing benchmarks. Benefiting from multiple pretraining tasks and large scale training corpora, pretrained models can…

Information Retrieval · Computer Science 2020-05-28 Zhiyu Chen, Mohamed Trabelsi, Jeff Heflin, Yinan Xu, Brian D. Davison

TwinBERT: Distilling Knowledge to Twin-Structured BERT Models for Efficient Retrieval

Pre-trained language models like BERT have achieved great success in a wide variety of NLP tasks, while the superior performance comes with high demand in computational resources, which hinders the application in low-latency IR systems. We…

Information Retrieval · Computer Science 2020-02-18 Wenhao Lu, Jian Jiao, Ruofei Zhang

A Simple but Effective Method to Incorporate Multi-turn Context with BERT for Conversational Machine Comprehension

Conversational machine comprehension (CMC) requires understanding the context of multi-turn dialogue. Using BERT, a pre-trained language model, has been successful for single-turn machine comprehension, while modeling multiple turns of…

Computation and Language · Computer Science 2019-05-31 Yasuhito Ohsugi, Itsumi Saito, Kyosuke Nishida, Hisako Asano, Junji Tomita

Composite Re-Ranking for Efficient Document Search with BERT

Although considerable efforts have been devoted to transformer-based ranking models for document search, the relevance-efficiency tradeoff remains a critical problem for ad-hoc ranking. To overcome this challenge, this paper presents BECR…

Information Retrieval · Computer Science 2022-01-07 Yingrui Yang, Yifan Qiao, Jinjin Shao, Mayuresh Anand, Xifeng Yan, Tao Yang

DocBERT: BERT for Document Classification

We present, to our knowledge, the first application of BERT to document classification. A few characteristics of the task might lead one to think that BERT is not the most appropriate model: syntactic structures matter less for content…

Computation and Language · Computer Science 2019-08-23 Ashutosh Adhikari, Achyudh Ram, Raphael Tang, Jimmy Lin

Automated classification for open-ended questions with BERT

Manual coding of text data from open-ended questions into different categories is time consuming and expensive. Automated coding uses statistical/machine learning to train on a small subset of manually coded text answers. Recently,…

Applications · Statistics 2023-10-25 Hyukjun Gweon, Matthias Schonlau

BERT-CoQAC: BERT-based Conversational Question Answering in Context

As one promising way to inquire about any particular information through a dialog with the bot, question answering dialog systems have gained increasing research interests recently. Designing interactive QA systems has always been a…

Computation and Language · Computer Science 2021-04-26 Munazza Zaib, Dai Hoang Tran, Subhash Sagar, Adnan Mahmood, Wei E. Zhang, Quan Z. Sheng

Decoupled Transformer for Scalable Inference in Open-domain Question Answering

Large transformer models, such as BERT, achieve state-of-the-art results in machine reading comprehension (MRC) for open-domain question answering (QA). However, transformers have a high computational cost for inference which makes them…

Computation and Language · Computer Science 2021-08-06 Haytham ElFadeel, Stan Peshterliev

Context-Aware Classification of Legal Document Pages

For many business applications that require the processing, indexing, and retrieval of professional documents such as legal briefs (in PDF format etc.), it is often essential to classify the pages of any given document into their…

Computation and Language · Computer Science 2023-04-26 Pavlos Fragkogiannis, Martina Forster, Grace E. Lee, Dell Zhang

Pre-trained contextualized embedding models such as BERT are a standard building block in many natural language processing systems. We demonstrate that the sentence-level representations produced by some off-the-shelf contextualized…

Computation and Language · Computer Science 2022-06-06 Xiliang Zhu, David Rossouw, Shayna Gardiner, Simon Corston-Oliver

ED2LM: Encoder-Decoder to Language Model for Faster Document Re-ranking Inference

State-of-the-art neural models typically encode document-query pairs using cross-attention for re-ranking. To this end, models generally utilize an encoder-only (like BERT) paradigm or an encoder-decoder (like T5) approach. These paradigms,…

Computation and Language · Computer Science 2022-04-26 Kai Hui, Honglei Zhuang, Tao Chen, Zhen Qin, Jing Lu, Dara Bahri, Ji Ma, Jai Prakash Gupta, Cicero Nogueira dos Santos, Yi Tay, Don Metzler

Utilizing Bidirectional Encoder Representations from Transformers for Answer Selection

Pre-training a transformer-based model for the language modeling task in a large dataset and then fine-tuning it for downstream tasks has been found very useful in recent years. One major advantage of such pre-trained language models is…

Computation and Language · Computer Science 2020-11-17 Md Tahmid Rahman Laskar, Enamul Hoque, Jimmy Xiangji Huang

Simple Applications of BERT for Ad Hoc Document Retrieval

Following recent successes in applying BERT to question answering, we explore simple applications to ad hoc document retrieval. This required confronting the challenge posed by documents that are typically longer than the length of input…

Information Retrieval · Computer Science 2019-03-27 Wei Yang, Haotian Zhang, Jimmy Lin

BERT-QE: Contextualized Query Expansion for Document Re-ranking

Query expansion aims to mitigate the mismatch between the language used in a query and in a document. However, query expansion methods can suffer from introducing non-relevant information when expanding the query. To bridge this gap,…

Information Retrieval · Computer Science 2020-11-04 Zhi Zheng, Kai Hui, Ben He, Xianpei Han, Le Sun, Andrew Yates

Multi-passage BERT: A Globally Normalized BERT Model for Open-domain Question Answering

The BERT model has been successfully applied to open-domain QA tasks. However, previous work trains BERT by viewing passages corresponding to the same question as independent training instances, which may cause incomparable scores for answers…

Computation and Language · Computer Science 2019-10-03 Zhiguo Wang, Patrick Ng, Xiaofei Ma, Ramesh Nallapati, Bing Xiang

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

We introduce a new language representation model called BERT, which stands for Bidirectional Encoder Representations from Transformers. Unlike recent language representation models, BERT is designed to pre-train deep bidirectional…

Computation and Language · Computer Science 2019-05-28 Jacob Devlin, Ming-Wei Chang, Kenton Lee, Kristina Toutanova

Incorporating BERT into Parallel Sequence Decoding with Adapters

While large scale pre-trained language models such as BERT have achieved great success on various natural language understanding tasks, how to efficiently and effectively incorporate them into sequence-to-sequence models and the…

Computation and Language · Computer Science 2020-10-14 Junliang Guo, Zhirui Zhang, Linli Xu, Hao-Ran Wei, Boxing Chen, Enhong Chen