Related papers: BERT Cannot Align Characters

Is Multilingual BERT Fluent in Language Generation?

The multilingual BERT model is trained on 104 languages and meant to serve as a universal language model and tool for encoding sentences. We explore how well the model performs on several languages across several tasks: a diagnostic…

Computation and Language · Computer Science 2019-10-10 Samuel Rönnqvist , Jenna Kanerva , Tapio Salakoski , Filip Ginter

GREEK-BERT: The Greeks visiting Sesame Street

Transformer-based language models, such as BERT and its variants, have achieved state-of-the-art performance in several downstream natural language processing (NLP) tasks on generic benchmark datasets (e.g., GLUE, SQUAD, RACE). However,…

Computation and Language · Computer Science 2020-09-04 John Koutsikakis , Ilias Chalkidis , Prodromos Malakasiotis , Ion Androutsopoulos

Cross-lingual Alignment Methods for Multilingual BERT: A Comparative Study

Multilingual BERT (mBERT) has shown reasonable capability for zero-shot cross-lingual transfer when fine-tuned on downstream tasks. Since mBERT is not pre-trained with explicit cross-lingual supervision, transfer performance can further be…

Computation and Language · Computer Science 2020-10-01 Saurabh Kulshreshtha , José Luis Redondo-García , Ching-Yun Chang

It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT

Recent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations, that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple…

Computation and Language · Computer Science 2020-10-19 Hila Gonen , Shauli Ravfogel , Yanai Elazar , Yoav Goldberg

Multilingual Alignment of Contextual Word Representations

We propose procedures for evaluating and strengthening contextual embedding alignment and show that they are useful in analyzing and improving multilingual BERT. In particular, after our proposed alignment procedure, BERT exhibits…

Computation and Language · Computer Science 2020-02-14 Steven Cao , Nikita Kitaev , Dan Klein

How Language-Neutral is Multilingual BERT?

Multilingual BERT (mBERT) provides sentence representations for 104 languages, which are useful for many multi-lingual tasks. Previous work probed the cross-linguality of mBERT using zero-shot transfer learning on morphological and…

Computation and Language · Computer Science 2019-11-11 Jindřich Libovický , Rudolf Rosa , Alexander Fraser

Mono vs Multilingual Transformer-based Models: a Comparison across Several Language Tasks

BERT (Bidirectional Encoder Representations from Transformers) and ALBERT (A Lite BERT) are methods for pre-training language models which can later be fine-tuned for a variety of Natural Language Understanding tasks. These methods have…

Computation and Language · Computer Science 2020-07-21 Diego de Vargas Feijo , Viviane Pereira Moreira

AMBERT: A Pre-trained Language Model with Multi-Grained Tokenization

Pre-trained language models such as BERT have exhibited remarkable performances in many tasks in natural language understanding (NLU). The tokens in the models are usually fine-grained in the sense that for languages like English they are…

Computation and Language · Computer Science 2021-05-28 Xinsong Zhang , Pengshuai Li , Hang Li

Are All Languages Created Equal in Multilingual BERT?

Multilingual BERT (mBERT) trained on 104 languages has shown surprisingly good cross-lingual performance on several NLP tasks, even without explicit cross-lingual signals. However, these evaluations have focused on cross-lingual transfer…

Computation and Language · Computer Science 2020-10-02 Shijie Wu , Mark Dredze

Phrase-BERT: Improved Phrase Embeddings from BERT with an Application to Corpus Exploration

Phrase representations derived from BERT often do not exhibit complex phrasal compositionality, as the model relies instead on lexical similarity to determine semantic relatedness. In this paper, we propose a contrastive fine-tuning…

Computation and Language · Computer Science 2021-10-15 Shufan Wang , Laure Thompson , Mohit Iyyer

Multilingual is not enough: BERT for Finnish

Deep learning-based language models pretrained on large unannotated text corpora have been demonstrated to allow efficient transfer learning for natural language processing, with recent approaches such as the transformer-based BERT model…

Computation and Language · Computer Science 2019-12-17 Antti Virtanen , Jenna Kanerva , Rami Ilo , Jouni Luoma , Juhani Luotolahti , Tapio Salakoski , Filip Ginter , Sampo Pyysalo

On the Language-specificity of Multilingual BERT and the Impact of Fine-tuning

Recent work has shown evidence that the knowledge acquired by multilingual BERT (mBERT) has two components: a language-specific and a language-neutral one. This paper analyses the relationship between them, in the context of fine-tuning on…

Computation and Language · Computer Science 2021-12-28 Marc Tanti , Lonneke van der Plas , Claudia Borg , Albert Gatt

Mitigating Language-Dependent Ethnic Bias in BERT

BERT and other large-scale language models (LMs) contain gender and racial bias. They also exhibit other dimensions of social bias, most of which have not been studied in depth, and some of which vary depending on the language. In this…

Computation and Language · Computer Science 2021-09-15 Jaimeen Ahn , Alice Oh

Cross-Linguistic Syntactic Difference in Multilingual BERT: How Good is It and How Does It Affect Transfer?

Multilingual BERT (mBERT) has demonstrated considerable cross-lingual syntactic ability, whereby it enables effective zero-shot cross-lingual transfer of syntactic knowledge. The transfer is more successful between some languages, but it is…

Computation and Language · Computer Science 2022-12-22 Ningyu Xu , Tao Gui , Ruotian Ma , Qi Zhang , Jingting Ye , Menghan Zhang , Xuanjing Huang

An Analysis of Social Biases Present in BERT Variants Across Multiple Languages

Although large pre-trained language models have achieved great success in many NLP tasks, it has been shown that they reflect human biases from their pre-training corpora. This bias may lead to undesirable outcomes when these models are…

Computation and Language · Computer Science 2022-11-29 Aristides Milios , Parishad BehnamGhader

A Study of Cross-Lingual Ability and Language-specific Information in Multilingual BERT

Recently, multilingual BERT works remarkably well on cross-lingual transfer tasks, superior to static non-contextualized word embeddings. In this work, we provide an in-depth experimental study to supplement the existing literature of…

Computation and Language · Computer Science 2020-04-21 Chi-Liang Liu , Tsung-Yuan Hsu , Yung-Sung Chuang , Hung-Yi Lee

Are Neural Language Models Good Plagiarists? A Benchmark for Neural Paraphrase Detection

The rise of language models such as BERT allows for high-quality text paraphrasing. This is a problem to academic integrity, as it is difficult to differentiate between original and machine-generated content. We propose a benchmark…

Computation and Language · Computer Science 2023-10-24 Jan Philip Wahle , Terry Ruas , Norman Meuschke , Bela Gipp

Cross-Lingual Relevance Transfer for Document Retrieval

Recent work has shown the surprising ability of multi-lingual BERT to serve as a zero-shot cross-lingual transfer model for a number of language processing tasks. We combine this finding with a similarly-recently proposal on sentence-level…

Information Retrieval · Computer Science 2019-11-11 Peng Shi , Jimmy Lin

German BERT Model for Legal Named Entity Recognition

The use of BERT, one of the most popular language models, has led to improvements in many Natural Language Processing (NLP) tasks. One such task is Named Entity Recognition (NER) i.e. automatic identification of named entities such as…

Computation and Language · Computer Science 2023-03-10 Harshil Darji , Jelena Mitrović , Michael Granitzer

Does BERT agree? Evaluating knowledge of structure dependence through agreement relations

Learning representations that accurately model semantics is an important goal of natural language processing research. Many semantic phenomena depend on syntactic structure. Recent work examines the extent to which state-of-the-art models…

Computation and Language · Computer Science 2019-08-28 Geoff Bacon , Terry Regier