Related papers: BagBERT: BERT-based bagging-stacking for multi-top…

Multi-label topic classification for COVID-19 literature with Bioformer

We describe Bioformer team's participation in the multi-label topic classification task for COVID-19 literature (track 5 of BioCreative VII). Topic classification is performed using different BERT models (BioBERT, PubMedBERT, and…

Computation and Language · Computer Science 2022-04-15 Li Fang , Kai Wang

Domain specific BERT representation for Named Entity Recognition of lab protocol

Supervised models trained to predict properties from representations have been achieving high accuracy on a variety of tasks. For instance, the BERT family seems to work exceptionally well on the downstream task from NER tagging to the…

Computation and Language · Computer Science 2020-12-22 Tejas Vaidhya , Ayush Kaushal

Enriching BERT with Knowledge Graph Embeddings for Document Classification

In this paper, we focus on the classification of books using short descriptive texts (cover blurbs) and additional metadata. Building upon BERT, a deep neural language model, we demonstrate how to combine text representations with metadata…

Computation and Language · Computer Science 2019-09-19 Malte Ostendorff , Peter Bourgonje , Maria Berger , Julian Moreno-Schneider , Georg Rehm , Bela Gipp

BERT-Based Multi-Head Selection for Joint Entity-Relation Extraction

In this paper, we report our method for the Information Extraction task in 2019 Language and Intelligence Challenge. We incorporate BERT into the multi-head selection framework for joint entity-relation extraction. This model extends…

Computation and Language · Computer Science 2019-09-27 Weipeng Huang , Xingyi Cheng , Taifeng Wang , Wei Chu

Multi-level biomedical NER through multi-granularity embeddings and enhanced labeling

Biomedical Named Entity Recognition (NER) is a fundamental task of Biomedical Natural Language Processing for extracting relevant information from biomedical texts, such as clinical records, scientific publications, and electronic health…

Computation and Language · Computer Science 2023-12-27 Fahime Shahrokh , Nasser Ghadiri , Rasoul Samani , Milad Moradi

DocBERT: BERT for Document Classification

We present, to our knowledge, the first application of BERT to document classification. A few characteristics of the task might lead one to think that BERT is not the most appropriate model: syntactic structures matter less for content…

Computation and Language · Computer Science 2019-08-23 Ashutosh Adhikari , Achyudh Ram , Raphael Tang , Jimmy Lin

Multi-BERT for Embeddings for Recommendation System

In this paper, we propose a novel approach for generating document embeddings using a combination of Sentence-BERT (SBERT) and RoBERTa, two state-of-the-art natural language processing models. Our approach treats sentences as tokens and…

Information Retrieval · Computer Science 2023-08-28 Shashidhar Reddy Javaji , Krutika Sarode

Students Need More Attention: BERT-based AttentionModel for Small Data with Application to AutomaticPatient Message Triage

Small and imbalanced datasets commonly seen in healthcare represent a challenge when training classifiers based on deep learning models. So motivated, we propose a novel framework based on BioBERT (Bidirectional Encoder Representations from…

Computation and Language · Computer Science 2020-06-23 Shijing Si , Rui Wang , Jedrek Wosik , Hao Zhang , David Dov , Guoyin Wang , Ricardo Henao , Lawrence Carin

Universal Text Representation from BERT: An Empirical Study

We present a systematic investigation of layer-wise BERT activations for general-purpose text representations to understand what linguistic information they capture and how transferable they are across different tasks. Sentence-level…

Computation and Language · Computer Science 2019-10-25 Xiaofei Ma , Zhiguo Wang , Patrick Ng , Ramesh Nallapati , Bing Xiang

Word Sense Induction with Knowledge Distillation from BERT

Pre-trained contextual language models are ubiquitously employed for language understanding tasks, but are unsuitable for resource-constrained systems. Noncontextual word embeddings are an efficient alternative in these settings. Such…

Computation and Language · Computer Science 2023-04-24 Anik Saha , Alex Gittens , Bulent Yener

Enriched BERT Embeddings for Scholarly Publication Classification

With the rapid expansion of academic literature and the proliferation of preprints, researchers face growing challenges in manually organizing and labeling large volumes of articles. The NSLP 2024 FoRC Shared Task I addresses this challenge…

Artificial Intelligence · Computer Science 2024-08-16 Benjamin Wolff , Eva Seidlmayer , Konrad U. Förstner

Layer-wise Guided Training for BERT: Learning Incrementally Refined Document Representations

Although BERT is widely used by the NLP community, little is known about its inner workings. Several attempts have been made to shed light on certain aspects of BERT, often with contradicting conclusions. A much raised concern focuses on…

Computation and Language · Computer Science 2020-10-13 Nikolaos Manginas , Ilias Chalkidis , Prodromos Malakasiotis

Establishing Strong Baselines for the New Decade: Sequence Tagging, Syntactic and Semantic Parsing with BERT

This paper presents new state-of-the-art models for three tasks, part-of-speech tagging, syntactic parsing, and semantic parsing, using the cutting-edge contextualized embedding framework known as BERT. For each task, we first replicate and…

Computation and Language · Computer Science 2020-05-26 Han He , Jinho D. Choi

Automated Text Mining of Experimental Methodologies from Biomedical Literature

Biomedical literature is a rapidly expanding field of science and technology. Classification of biomedical texts is an essential part of biomedicine research, especially in the field of biology. This work proposes the fine-tuned DistilBERT,…

Computation and Language · Computer Science 2024-04-23 Ziqing Guo

Language Representation Models for Fine-Grained Sentiment Classification

Sentiment classification is a quickly advancing field of study with applications in almost any field. While various models and datasets have shown high accuracy inthe task of binary classification, the task of fine-grained sentiment…

Computation and Language · Computer Science 2020-05-29 Brian Cheang , Bailey Wei , David Kogan , Howey Qiu , Masud Ahmed

Multi-Stage Document Ranking with BERT

The advent of deep neural networks pre-trained via language modeling tasks has spurred a number of successful applications in natural language processing. This work explores one such popular model, BERT, in the context of document ranking.…

Information Retrieval · Computer Science 2019-11-01 Rodrigo Nogueira , Wei Yang , Kyunghyun Cho , Jimmy Lin

Stacked DeBERT: All Attention in Incomplete Data for Text Classification

In this paper, we propose Stacked DeBERT, short for Stacked Denoising Bidirectional Encoder Representations from Transformers. This novel model improves robustness in incomplete data, when compared to existing systems, by designing a novel…

Computation and Language · Computer Science 2021-01-15 Gwenaelle Cunha Sergio , Minho Lee

Hierarchical Transformers for Long Document Classification

BERT, which stands for Bidirectional Encoder Representations from Transformers, is a recently introduced language representation model based upon the transfer learning paradigm. We extend its fine-tuning procedure to address one of its…

Computation and Language · Computer Science 2019-10-25 Raghavendra Pappagari , Piotr Żelasko , Jesús Villalba , Yishay Carmiel , Najim Dehak

BERT for Joint Intent Classification and Slot Filling

Intent classification and slot filling are two essential tasks for natural language understanding. They often suffer from small-scale human-labeled training data, resulting in poor generalization capability, especially for rare words.…

Computation and Language · Computer Science 2019-03-01 Qian Chen , Zhu Zhuo , Wen Wang

FAT ALBERT: Finding Answers in Large Texts using Semantic Similarity Attention Layer based on BERT

Machine based text comprehension has always been a significant research field in natural language processing. Once a full understanding of the text context and semantics is achieved, a deep learning model can be trained to solve a large…

Computation and Language · Computer Science 2020-09-03 Omar Mossad , Amgad Ahmed , Anandharaju Raju , Hari Karthikeyan , Zayed Ahmed