Related papers: RobBERT: a Dutch RoBERTa-based Language Model

BERTje: A Dutch BERT Model

The transformer-based pre-trained language model BERT has helped to improve state-of-the-art performance on many natural language processing (NLP) tasks. Using the same architecture and parameters, we developed and evaluated a monolingual…

Computation and Language · Computer Science 2019-12-23 Wietse de Vries , Andreas van Cranenburgh , Arianna Bisazza , Tommaso Caselli , Gertjan van Noord , Malvina Nissim

RobBERT-2022: Updating a Dutch Language Model to Account for Evolving Language Use

Large transformer-based language models, e.g. BERT and GPT-3, outperform previous architectures on most natural language processing tasks. Such language models are first pre-trained on gigantic corpora of text and later used as base-model…

Computation and Language · Computer Science 2022-11-16 Pieter Delobelle , Thomas Winters , Bettina Berendt

RobBERTje: a Distilled Dutch BERT Model

Pre-trained large-scale language models such as BERT have gained a lot of attention thanks to their outstanding performance on a wide range of natural language tasks. However, due to their large number of parameters, they are…

Computation and Language · Computer Science 2022-04-29 Pieter Delobelle , Thomas Winters , Bettina Berendt

Multilingual is not enough: BERT for Finnish

Deep learning-based language models pretrained on large unannotated text corpora have been demonstrated to allow efficient transfer learning for natural language processing, with recent approaches such as the transformer-based BERT model…

Computation and Language · Computer Science 2019-12-17 Antti Virtanen , Jenna Kanerva , Rami Ilo , Jouni Luoma , Juhani Luotolahti , Tapio Salakoski , Filip Ginter , Sampo Pyysalo

GottBERT: a pure German Language Model

Lately, pre-trained language models advanced the field of natural language processing (NLP). The introduction of Bidirectional Encoders for Transformers (BERT) and its optimized version RoBERTa have had significant impact and increased the…

Computation and Language · Computer Science 2025-06-13 Raphael Scheible , Fabian Thomczyk , Patric Tippmann , Victor Jaravine , Martin Boeker

RoBERTa: A Robustly Optimized BERT Pretraining Approach

Language model pretraining has led to significant performance gains but careful comparison between different approaches is challenging. Training is computationally expensive, often done on private datasets of different sizes, and, as we…

Computation and Language · Computer Science 2019-07-29 Yinhan Liu , Myle Ott , Naman Goyal , Jingfei Du , Mandar Joshi , Danqi Chen , Omer Levy , Mike Lewis , Luke Zettlemoyer , Veselin Stoyanov

HerBERT: Efficiently Pretrained Transformer-based Language Model for Polish

BERT-based models are currently used for solving nearly all Natural Language Processing (NLP) tasks and most often achieve state-of-the-art results. Therefore, the NLP community conducts extensive research on understanding these models, but…

Computation and Language · Computer Science 2021-05-06 Robert Mroczkowski , Piotr Rybak , Alina Wróblewska , Ireneusz Gawlik

belabBERT: a Dutch RoBERTa-based language model applied to psychiatric classification

Natural language processing (NLP) is becoming an important means for automatic recognition of human traits and states, such as intoxication, presence of psychiatric disorders, presence of airway disorders and states of stress. Such…

Computation and Language · Computer Science 2021-06-03 Joppe Wouts , Janna de Boer , Alban Voppel , Sanne Brederoo , Sander van Splunter , Iris Sommer

Comprehensive Study on German Language Models for Clinical and Biomedical Text Understanding

Recent advances in natural language processing (NLP) can be largely attributed to the advent of pre-trained language models such as BERT and RoBERTa. While these models demonstrate remarkable performance on general datasets, they can…

Computation and Language · Computer Science 2024-05-09 Ahmad Idrissi-Yaghir , Amin Dada , Henning Schäfer , Kamyar Arzideh , Giulia Baldini , Jan Trienes , Max Hasin , Jeanette Bewersdorff , Cynthia S. Schmidt , Marie Bauer , Kaleb E. Smith , Jiang Bian , Yonghui Wu , Jörg Schlötterer , Torsten Zesch , Peter A. Horn , Christin Seifert , Felix Nensa , Jens Kleesiek , Christoph M. Friedrich

Towards Fully Bilingual Deep Language Modeling

Language models based on deep neural networks have facilitated great advances in natural language processing and understanding tasks in recent years. While models covering a large number of languages have been introduced, their…

Computation and Language · Computer Science 2020-10-23 Li-Hsin Chang , Sampo Pyysalo , Jenna Kanerva , Filip Ginter

MedicalBERT: enhancing biomedical natural language processing using pretrained BERT-based model

Recent advances in natural language processing (NLP) have been driven bypretrained language models like BERT, RoBERTa, T5, and GPT. Thesemodels excel at understanding complex texts, but biomedical literature, withits domain-specific…

Computation and Language · Computer Science 2025-07-28 K. Sahit Reddy , N. Ragavenderan , Vasanth K. , Ganesh N. Naik , Vishalakshi Prabhu , Nagaraja G. S

Enhancing Multilingual Language Models for Code-Switched Input Data

Code-switching, or alternating between languages within a single conversation, presents challenges for multilingual language models on NLP tasks. This research investigates if pre-training Multilingual BERT (mBERT) on code-switched datasets…

Computation and Language · Computer Science 2025-03-12 Katherine Xie , Nitya Babbar , Vicky Chen , Yoanna Turura

Recent Advances in Natural Language Processing via Large Pre-Trained Language Models: A Survey

Large, pre-trained transformer-based language models such as BERT have drastically changed the Natural Language Processing (NLP) field. We present a survey of recent work that uses these large language models to solve NLP tasks via…

Computation and Language · Computer Science 2021-11-03 Bonan Min , Hayley Ross , Elior Sulem , Amir Pouran Ben Veyseh , Thien Huu Nguyen , Oscar Sainz , Eneko Agirre , Ilana Heinz , Dan Roth

EstBERT: A Pretrained Language-Specific BERT for Estonian

This paper presents EstBERT, a large pretrained transformer-based language-specific BERT model for Estonian. Recent work has evaluated multilingual BERT models on Estonian tasks and found them to outperform the baselines. Still, based on…

Computation and Language · Computer Science 2021-04-29 Hasan Tanvir , Claudia Kittask , Sandra Eiche , Kairit Sirts

AraBERT: Transformer-based Model for Arabic Language Understanding

The Arabic language is a morphologically rich language with relatively few resources and a less explored syntax compared to English. Given these limitations, Arabic Natural Language Processing (NLP) tasks like Sentiment Analysis (SA), Named…

Computation and Language · Computer Science 2021-03-09 Wissam Antoun , Fady Baly , Hazem Hajj

MathBERT: A Pre-trained Language Model for General NLP Tasks in Mathematics Education

Since the introduction of the original BERT (i.e., BASE BERT), researchers have developed various customized BERT models with improved performance for specific domains and tasks by exploiting the benefits of transfer learning. Due to the…

Computation and Language · Computer Science 2023-08-15 Jia Tracy Shen , Michiharu Yamashita , Ethan Prihar , Neil Heffernan , Xintao Wu , Ben Graff , Dongwon Lee

Are All Languages Created Equal in Multilingual BERT?

Multilingual BERT (mBERT) trained on 104 languages has shown surprisingly good cross-lingual performance on several NLP tasks, even without explicit cross-lingual signals. However, these evaluations have focused on cross-lingual transfer…

Computation and Language · Computer Science 2020-10-02 Shijie Wu , Mark Dredze

Multilingual BERT language model for medical tasks: Evaluation on domain-specific adaptation and cross-linguality

In multilingual healthcare applications, the availability of domain-specific natural language processing(NLP) tools is limited, especially for low-resource languages. Although multilingual bidirectional encoder representations from…

Computation and Language · Computer Science 2025-11-03 Yinghao Luo , Lang Zhou , Amrish Jhingoer , Klaske Vliegenthart Jongbloed , Carlijn Jordans , Ben Werkhoven , Tom Seinen , Erik van Mulligen , Casper Rokx , Yunlei Li

Revisiting Pre-Trained Models for Chinese Natural Language Processing

Bidirectional Encoder Representations from Transformers (BERT) has shown marvelous improvements across various NLP tasks, and consecutive variants have been proposed to further improve the performance of the pre-trained language models. In…

Computation and Language · Computer Science 2020-12-14 Yiming Cui , Wanxiang Che , Ting Liu , Bing Qin , Shijin Wang , Guoping Hu

TiBERT: Tibetan Pre-trained Language Model

The pre-trained language model is trained on large-scale unlabeled text and can achieve state-of-the-art results in many different downstream tasks. However, the current pre-trained language model is mainly concentrated in the Chinese and…

Computation and Language · Computer Science 2022-05-17 Yuan Sun , Sisi Liu , Junjie Deng , Xiaobing Zhao