Related papers: Are Multilingual Models Effective in Code-Switchin…

Emerging Cross-lingual Structure in Pretrained Language Models

We study the problem of multilingual masked language modeling, i.e. the training of a single model on concatenated text from multiple languages, and present a detailed study of several factors that influence why these models are so…

Computation and Language · Computer Science 2020-05-11 Shijie Wu, Alexis Conneau, Haoran Li, Luke Zettlemoyer, Veselin Stoyanov

Dual Language Models for Code Switched Speech Recognition

In this work, we present a simple and elegant approach to language modeling for bilingual code-switched text. Since code-switching is a blend of two or more different languages, a standard bilingual language model can be improved upon by…

Computation and Language · Computer Science 2018-08-06 Saurabh Garg, Tanmay Parekh, Preethi Jyothi

Code-Mixed Probes Show How Pre-Trained Models Generalise On Code-Switched Text

Code-switching is a prevalent linguistic phenomenon in which multilingual individuals seamlessly alternate between languages. Despite its widespread use online and recent research trends in this area, research in code-switching presents…

Computation and Language · Computer Science 2024-05-08 Frances A. Laureano De Leon, Harish Tayyar Madabushi, Mark Lee

Multilingual Large Language Models Are Not (Yet) Code-Switchers

Multilingual Large Language Models (LLMs) have recently shown great capabilities in a wide range of tasks, exhibiting state-of-the-art performance through zero-shot or few-shot prompting methods. While there have been extensive studies on…

Computation and Language · Computer Science 2023-10-24 Ruochen Zhang, Samuel Cahyawijaya, Jan Christian Blaise Cruz, Genta Indra Winata, Alham Fikri Aji

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Large language models (LLMs) exhibit remarkable multilingual capabilities despite the extreme language imbalance in the pre-training data. In this paper, we closely examine the reasons behind this phenomenon, focusing on the pre-training…

Computation and Language · Computer Science 2025-04-23 Zhijun Wang, Jiahuan Li, Hao Zhou, Rongxiang Weng, Jingang Wang, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang

Checks and Strategies for Enabling Code-Switched Machine Translation

Code-switching is a common phenomenon among multilingual speakers, where alternation between two or more languages occurs within the context of a single conversation. While multilingual humans can seamlessly switch back and forth between…

Computation and Language · Computer Science 2022-10-12 Thamme Gowda, Mozhdeh Gheini, Jonathan May

Enhancing Multilingual Language Models for Code-Switched Input Data

Code-switching, or alternating between languages within a single conversation, presents challenges for multilingual language models on NLP tasks. This research investigates if pre-training Multilingual BERT (mBERT) on code-switched datasets…

Computation and Language · Computer Science 2025-03-12 Katherine Xie, Nitya Babbar, Vicky Chen, Yoanna Turura

Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer

We introduce and demonstrate how to effectively train multilingual machine translation models with pixel representations. We experiment with two different data settings with a variety of language and script coverage, demonstrating improved…

Computation and Language · Computer Science 2023-10-25 Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post

Are pre-trained text representations useful for multilingual and multi-dimensional language proficiency modeling?

Development of language proficiency models for non-native learners has been an active area of interest in NLP research for the past few years. Although language proficiency is multidimensional in nature, existing research typically…

Computation and Language · Computer Science 2021-02-26 Taraka Rama, Sowmya Vajjala

Cross-lingual Models of Word Embeddings: An Empirical Comparison

Despite interest in using cross-lingual knowledge to learn word embeddings for various tasks, a systematic comparison of the possible approaches is lacking in the literature. We perform an extensive evaluation of four popular approaches of…

Computation and Language · Computer Science 2016-06-09 Shyam Upadhyay, Manaal Faruqui, Chris Dyer, Dan Roth

Hierarchical Meta-Embeddings for Code-Switching Named Entity Recognition

In countries that speak multiple main languages, mixing up different languages within a conversation is commonly called code-switching. Previous works addressing this challenge mainly focused on word-level aspects such as word embeddings.…

Computation and Language · Computer Science 2019-09-19 Genta Indra Winata, Zhaojiang Lin, Jamin Shin, Zihan Liu, Pascale Fung

Are Knowledge and Reference in Multilingual Language Models Cross-Lingually Consistent?

Cross-lingual consistency should be considered to assess cross-lingual transferability, maintain the factuality of the model knowledge across languages, and preserve the parity of language model performance. We are thus interested in…

Computation and Language · Computer Science 2025-10-02 Xi Ai, Mahardika Krisna Ihsani, Min-Yen Kan

A Survey Of Cross-lingual Word Embedding Models

Cross-lingual representations of words enable us to reason about word meaning in multilingual contexts and are a key facilitator of cross-lingual transfer when developing natural language processing models for low-resource languages. In…

Computation and Language · Computer Science 2019-10-08 Sebastian Ruder, Ivan Vulić, Anders Søgaard

Do Multi-Sense Embeddings Improve Natural Language Understanding?

Learning a distinct representation for each sense of an ambiguous word could lead to more powerful and fine-grained models of vector-space representations. Yet while 'multi-sense' methods have been proposed and tested on artificial…

Computation and Language · Computer Science 2015-11-25 Jiwei Li, Dan Jurafsky

Contextual Code Switching for Machine Translation using Language Models

Large language models (LLMs) have exerted a considerable impact on diverse language-related tasks in recent years. Their demonstrated state-of-the-art performance is achieved through methodologies such as zero-shot or few-shot prompting.…

Computation and Language · Computer Science 2023-12-21 Arshad Kaji, Manan Shah

Lessons learned in multilingual grounded language learning

Recent work has shown how to learn better visual-semantic embeddings by leveraging image descriptions in more than one language. Here, we investigate in detail which conditions affect the performance of this type of grounded language…

Computation and Language · Computer Science 2018-09-21 Ákos Kádár, Desmond Elliott, Marc-Alexandre Côté, Grzegorz Chrupała, Afra Alishahi

Analyzing the Evaluation of Cross-Lingual Knowledge Transfer in Multilingual Language Models

Recent advances in training multilingual language models on large datasets seem to have shown promising results in knowledge transfer across languages and achieve high performance on downstream tasks. However, we question to what extent the…

Computation and Language · Computer Science 2024-02-06 Sara Rajaee, Christof Monz

Meta-Transfer Learning for Code-Switched Speech Recognition

An increasing number of people in the world today speak a mixed-language as a result of being multilingual. However, building a speech recognition system for code-switching remains difficult due to the availability of limited resources and…

Computation and Language · Computer Science 2020-04-30 Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu, Pascale Fung

On the Limits of Model Merging for Multilinguality in Pre-Training

Endowing models with consistent multilingual performance can be achieved by mixing pre-training data, or post-training approaches such as language-specific model merging. In this work, we test whether merging can be applied to monolingually…

Computation and Language · Computer Science 2026-05-26 Seth Aycock, Fedor Vitiugin, Aleksandr Umnov, Christof Monz, Khalil Sima'an

Adapting Language Balance in Code-Switching Speech

Despite achieving impressive results on standard benchmarks, large foundational models still struggle against code-switching test cases. When data scarcity cannot be used as the usual justification for poor performance, the reason may lie…

Computation and Language · Computer Science 2025-10-22 Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel