Related papers: DocBERT: BERT for Document Classification

Simple Applications of BERT for Ad Hoc Document Retrieval

Following recent successes in applying BERT to question answering, we explore simple applications to ad hoc document retrieval. This required confronting the challenge posed by documents that are typically longer than the length of input…

Information Retrieval · Computer Science 2019-03-27 Wei Yang , Haotian Zhang , Jimmy Lin

Simplified TinyBERT: Knowledge Distillation for Document Retrieval

Despite the effectiveness of utilizing the BERT model for document ranking, the high computational cost of such approaches limits their uses. To this end, this paper first empirically investigates the effectiveness of two knowledge…

Information Retrieval · Computer Science 2023-05-05 Xuanang Chen , Ben He , Kai Hui , Le Sun , Yingfei Sun

Comparative Study of Long Document Classification

The amount of information stored in the form of documents on the internet has been increasing rapidly. Thus it has become a necessity to organize and maintain these documents in an optimum manner. Text classification algorithms study the…

Computation and Language · Computer Science 2022-02-22 Vedangi Wagh , Snehal Khandve , Isha Joshi , Apurva Wani , Geetanjali Kale , Raviraj Joshi

A Sentence-level Hierarchical BERT Model for Document Classification with Limited Labelled Data

Training deep learning models with limited labelled data is an attractive scenario for many NLP tasks, including document classification. While with the recent emergence of BERT, deep learning language models can achieve reasonably good…

Computation and Language · Computer Science 2021-06-15 Jinghui Lu , Maeve Henchion , Ivan Bacher , Brian Mac Namee

Context-Aware Classification of Legal Document Pages

For many business applications that require the processing, indexing, and retrieval of professional documents such as legal briefs (in PDF format etc.), it is often essential to classify the pages of any given document into their…

Computation and Language · Computer Science 2023-04-26 Pavlos Fragkogiannis , Martina Forster , Grace E. Lee , Dell Zhang

BERT: A Review of Applications in Natural Language Processing and Understanding

In this review, we describe the application of one of the most popular deep learning-based language models - BERT. The paper describes the mechanism of operation of this model, the main areas of its application to the tasks of text…

Computation and Language · Computer Science 2021-03-23 M. V. Koroteev

Multi-Stage Document Ranking with BERT

The advent of deep neural networks pre-trained via language modeling tasks has spurred a number of successful applications in natural language processing. This work explores one such popular model, BERT, in the context of document ranking.…

Information Retrieval · Computer Science 2019-11-01 Rodrigo Nogueira , Wei Yang , Kyunghyun Cho , Jimmy Lin

Hierarchical Transformers for Long Document Classification

BERT, which stands for Bidirectional Encoder Representations from Transformers, is a recently introduced language representation model based upon the transfer learning paradigm. We extend its fine-tuning procedure to address one of its…

Computation and Language · Computer Science 2019-10-25 Raghavendra Pappagari , Piotr Żelasko , Jesús Villalba , Yishay Carmiel , Najim Dehak

PatentBERT: Patent Classification with Fine-Tuning a pre-trained BERT Model

In this work we focus on fine-tuning a pre-trained BERT model and applying it to patent classification. When applied to large datasets of over two millions patents, our approach outperforms the state of the art by an approach using CNN with…

Computation and Language · Computer Science 2019-07-02 Jieh-Sheng Lee , Jieh Hsiang

Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations

Although BERT is widely used by the NLP community, little is known about its inner workings. Several attempts have been made to shed light on certain aspects of BERT, often with contradicting conclusions. A much raised concern focuses on…

Computation and Language · Computer Science 2020-10-13 Nikolaos Manginas , Ilias Chalkidis , Prodromos Malakasiotis

Co-BERT: A Context-Aware BERT Retrieval Model Incorporating Local and Query-specific Context

BERT-based text ranking models have dramatically advanced the state-of-the-art in ad-hoc retrieval, wherein most models tend to consider individual query-document pairs independently. In the mean time, the importance and usefulness to…

Information Retrieval · Computer Science 2021-04-20 Xiaoyang Chen , Kai Hui , Ben He , Xianpei Han , Le Sun , Zheng Ye

Rapid Adaptation of BERT for Information Extraction on Domain-Specific Business Documents

Techniques for automatically extracting important content elements from business documents such as contracts, statements, and filings have the potential to make business operations more efficient. This problem can be formulated as a…

Computation and Language · Computer Science 2020-02-06 Ruixue Zhang , Wei Yang , Luyun Lin , Zhengkai Tu , Yuqing Xie , Zihang Fu , Yuhao Xie , Luchen Tan , Kun Xiong , Jimmy Lin

TopicBERT for Energy Efficient Document Classification

Prior research notes that BERT's computational cost grows quadratically with sequence length thus leading to longer training times, higher GPU memory constraints and carbon emissions. While recent work seeks to address these scalability…

Computation and Language · Computer Science 2020-11-02 Yatin Chaudhary , Pankaj Gupta , Khushbu Saxena , Vivek Kulkarni , Thomas Runkler , Hinrich Schütze

Simple BERT Models for Relation Extraction and Semantic Role Labeling

We present simple BERT-based models for relation extraction and semantic role labeling. In recent years, state-of-the-art performance has been achieved using neural models by incorporating lexical and syntactic features such as…

Computation and Language · Computer Science 2019-04-11 Peng Shi , Jimmy Lin

Selecting Between BERT and GPT for Text Classification in Political Science Research

Political scientists often grapple with data scarcity in text classification. Recently, fine-tuned BERT models and their variants have gained traction as effective solutions to address this issue. In this study, we investigate the potential…

Computation and Language · Computer Science 2024-11-11 Yu Wang , Wen Qu , Xin Ye

Comparing BERT against traditional machine learning text classification

The BERT model has arisen as a popular state-of-the-art machine learning model in the recent years that is able to cope with multiple NLP tasks such as supervised text classification without human supervision. Its flexibility to cope with…

Computation and Language · Computer Science 2023-04-26 Santiago González-Carvajal , Eduardo C. Garrido-Merchán

Linear Classifier: An Often-Forgotten Baseline for Text Classification

Large-scale pre-trained language models such as BERT are popular solutions for text classification. Due to the superior performance of these advanced methods, nowadays, people often directly train them for a few epochs and deploy the…

Computation and Language · Computer Science 2023-06-13 Yu-Chen Lin , Si-An Chen , Jie-Jyun Liu , Chih-Jen Lin

Utilizing BERT for Information Retrieval: Survey, Applications, Resources, and Challenges

Recent years have witnessed a substantial increase in the use of deep learning to solve various natural language processing (NLP) problems. Early deep learning models were constrained by their sequential or unidirectional nature, such that…

Information Retrieval · Computer Science 2024-03-05 Jiajia Wang , Jimmy X. Huang , Xinhui Tu , Junmei Wang , Angela J. Huang , Md Tahmid Rahman Laskar , Amran Bhuiyan

DC-BERT: Decoupling Question and Document for Efficient Contextual Encoding

Recent studies on open-domain question answering have achieved prominent performance improvement using pre-trained language models such as BERT. State-of-the-art approaches typically follow the "retrieve and read" pipeline and employ…

Computation and Language · Computer Science 2020-03-02 Yuyu Zhang , Ping Nie , Xiubo Geng , Arun Ramamurthy , Le Song , Daxin Jiang

StruBERT: Structure-aware BERT for Table Search and Matching

A large amount of information is stored in data tables. Users can search for data tables using a keyword-based query. A table is composed primarily of data values that are organized in rows and columns providing implicit structural…

Information Retrieval · Computer Science 2022-03-29 Mohamed Trabelsi , Zhiyu Chen , Shuo Zhang , Brian D. Davison , Jeff Heflin