Related papers: Identifying Necessary Elements for BERT's Multilin…

200 papers

Multilingual BERT (mBERT) provides sentence representations for 104 languages, which are useful for many multi-lingual tasks. Previous work probed the cross-linguality of mBERT using zero-shot transfer learning on morphological and…

Computation and Language · Computer Science 2019-11-11 Jindřich Libovický, Rudolf Rosa, Alexander Fraser

While recent work on multilingual language models has demonstrated their capacity for cross-lingual zero-shot transfer on downstream tasks, there is a lack of consensus in the community as to what shared properties between languages enable…

Computation and Language · Computer Science 2022-05-05 Ameet Deshpande, Partha Talukdar, Karthik Narasimhan

Multilingual BERT (mBERT) has demonstrated considerable cross-lingual syntactic ability, whereby it enables effective zero-shot cross-lingual transfer of syntactic knowledge. The transfer is more successful between some languages, but it is…

Computation and Language · Computer Science 2022-12-22 Ningyu Xu, Tao Gui, Ruotian Ma, Qi Zhang, Jingting Ye, Menghan Zhang, Xuanjing Huang

Multilingual BERT (mBERT) trained on 104 languages has shown surprisingly good cross-lingual performance on several NLP tasks, even without explicit cross-lingual signals. However, these evaluations have focused on cross-lingual transfer…

Computation and Language · Computer Science 2020-10-02 Shijie Wu, Mark Dredze

Recent work has exhibited the surprising cross-lingual abilities of multilingual BERT (M-BERT) -- surprising since it is trained without any cross-lingual objective and with no aligned data. In this work, we provide a comprehensive study of…

Computation and Language · Computer Science 2020-02-18 Karthikeyan K, Zihan Wang, Stephen Mayhew, Dan Roth

In this paper, we show that Multilingual BERT (M-BERT), released by Devlin et al. (2018) as a single language model pre-trained from monolingual corpora in 104 languages, is surprisingly good at zero-shot cross-lingual model transfer, in…

Computation and Language · Computer Science 2019-06-05 Telmo Pires, Eva Schlinger, Dan Garrette

Multilingual BERT (mBERT), a language model pre-trained on large multilingual corpora, has impressive zero-shot cross-lingual transfer capabilities and performs surprisingly well on zero-shot POS tagging and Named Entity Recognition (NER),…

Computation and Language · Computer Science 2022-05-18 Beiduo Chen, Wu Guo, Quan Liu, Kun Tao

Recently, multilingual BERT works remarkably well on cross-lingual transfer tasks, superior to static non-contextualized word embeddings. In this work, we provide an in-depth experimental study to supplement the existing literature of…

Computation and Language · Computer Science 2020-04-21 Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Hung-Yi Lee

Recently, multilingual BERT works remarkably well on cross-lingual transfer tasks, superior to static non-contextualized word embeddings. In this work, we provide an in-depth experimental study to supplement the existing literature of…

Computation and Language · Computer Science 2020-10-22 Chi-Liang Liu, Tsung-Yuan Hsu, Yung-Sung Chuang, Hung-yi Lee

Multilingual pretrained language models have demonstrated remarkable zero-shot cross-lingual transfer capabilities. Such transfer emerges by fine-tuning on a task of interest in one language and evaluating on a distinct language, not seen…

Computation and Language · Computer Science 2021-01-28 Benjamin Muller, Yanai Elazar, Benoît Sagot, Djamé Seddah

Cross-language entity linking grounds mentions in multiple languages to a single-language knowledge base. We propose a neural ranking architecture for this task that uses multilingual BERT representations of the mention and the context in a…

Computation and Language · Computer Science 2021-07-09 Elliot Schumacher, James Mayfield, Mark Dredze

Pretrained contextual representation models (Peters et al., 2018; Devlin et al., 2018) have pushed forward the state-of-the-art on many NLP tasks. A new release of BERT (Devlin, 2018) includes a model simultaneously pretrained on 104…

Computation and Language · Computer Science 2019-10-04 Shijie Wu, Mark Dredze

Recent work has found evidence that Multilingual BERT (mBERT), a transformer-based multilingual masked language model, is capable of zero-shot cross-lingual transfer, suggesting that some aspects of its representations are shared…

Computation and Language · Computer Science 2020-05-21 Ethan A. Chi, John Hewitt, Christopher D. Manning

The multilingual BERT model is trained on 104 languages and meant to serve as a universal language model and tool for encoding sentences. We explore how well the model performs on several languages across several tasks: a diagnostic…

Computation and Language · Computer Science 2019-10-10 Samuel Rönnqvist, Jenna Kanerva, Tapio Salakoski, Filip Ginter

NLP systems typically require support for more than one language. As different languages have different amounts of supervision, cross-lingual transfer benefits languages with little to no training data by transferring from other languages.…

Computation and Language · Computer Science 2022-07-13 Shijie Wu

Recent work has shown the surprising ability of multi-lingual BERT to serve as a zero-shot cross-lingual transfer model for a number of language processing tasks. We combine this finding with a similarly recent proposal on sentence-level…

Information Retrieval · Computer Science 2019-11-11 Peng Shi, Jimmy Lin

For many (minority) languages, the resources needed to train large models are not available. We investigate the performance of zero-shot transfer learning with as little data as possible, and the influence of language similarity in this…

Computation and Language · Computer Science 2021-08-03 Wietse de Vries, Martijn Bartelds, Malvina Nissim, Martijn Wieling

Multi-lingual contextualized embeddings, such as multilingual-BERT (mBERT), have shown success in a variety of zero-shot cross-lingual tasks. However, these models are limited by having inconsistent contextualized representations of…

Computation and Language · Computer Science 2020-07-14 Libo Qin, Minheng Ni, Yue Zhang, Wanxiang Che

Recent work has demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple…

Computation and Language · Computer Science 2020-10-19 Hila Gonen, Shauli Ravfogel, Yanai Elazar, Yoav Goldberg

Recent work has shown evidence that the knowledge acquired by multilingual BERT (mBERT) has two components: a language-specific and a language-neutral one. This paper analyses the relationship between them, in the context of fine-tuning on…

Computation and Language · Computer Science 2021-12-28 Marc Tanti, Lonneke van der Plas, Claudia Borg, Albert Gatt