Related papers: Benchmarking Evaluation Metrics for Code-Switching…

Assessing Evaluation Metrics for Speech-to-Speech Translation

Speech-to-speech translation combines machine translation with speech synthesis, introducing evaluation challenges not present in either task alone. How to automatically evaluate speech-to-speech translation is an open question which has…

Computation and Language · Computer Science 2021-10-27 Elizabeth Salesky, Julian Mäder, Severin Klinger

Code-Switching in End-to-End Automatic Speech Recognition: A Systematic Literature Review

Motivated by a growing research interest into automatic speech recognition (ASR), and the growing body of work for languages in which code-switching (CS) often occurs, we present a systematic literature review of code-switching in…

Computation and Language · Computer Science 2025-07-11 Maha Tufail Agro, Atharva Kulkarni, Karima Kadaoui, Zeerak Talat, Hanan Aldarmaki

Dual Language Models for Code Switched Speech Recognition

In this work, we present a simple and elegant approach to language modeling for bilingual code-switched text. Since code-switching is a blend of two or more different languages, a standard bilingual language model can be improved upon by…

Computation and Language · Computer Science 2018-08-06 Saurabh Garg, Tanmay Parekh, Preethi Jyothi

Meta-Transfer Learning for Code-Switched Speech Recognition

An increasing number of people in the world today speak a mixed-language as a result of being multilingual. However, building a speech recognition system for code-switching remains difficult due to the availability of limited resources and…

Computation and Language · Computer Science 2020-04-30 Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin, Zihan Liu, Peng Xu, Pascale Fung

Gujarati-English Code-Switching Speech Recognition using ensemble prediction of spoken language

An important and difficult task in code-switched speech recognition is to recognize the language, as lots of words in two languages can sound similar, especially in some accents. We focus on improving performance of end-to-end Automatic…

Computation and Language · Computer Science 2024-03-14 Yash Sharma, Basil Abraham, Preethi Jyothi

Adapting Language Balance in Code-Switching Speech

Despite achieving impressive results on standard benchmarks, large foundational models still struggle against code-switching test cases. When data scarcity cannot be used as the usual justification for poor performance, the reason may lie…

Computation and Language · Computer Science 2025-10-22 Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel

Sentiment Classification of Code-Switched Text using Pre-trained Multilingual Embeddings and Segmentation

With increasing globalization and immigration, various studies have estimated that about half of the world population is bilingual. Consequently, individuals concurrently use two or more languages or dialects in casual conversational…

Computation and Language · Computer Science 2022-11-01 Saurav K. Aryal, Howard Prioleau, Gloria Washington

A Survey of Code-switched Speech and Language Processing

Code-switching, the alternation of languages within a conversation or utterance, is a common communicative phenomenon that occurs in multilingual communities across the world. This survey reviews computational approaches for code-switched…

Computation and Language · Computer Science 2020-07-24 Sunayana Sitaram, Khyathi Raghavi Chandu, Sai Krishna Rallabandi, Alan W Black

Arabic Code-Switching Speech Recognition using Monolingual Data

Code-switching in automatic speech recognition (ASR) is an important challenge due to globalization. Recent research in multilingual ASR shows potential improvement over monolingual systems. We study key issues related to multilingual…

Computation and Language · Computer Science 2021-07-06 Ahmed Ali, Shammur Chowdhury, Amir Hussein, Yasser Hifny

Checks and Strategies for Enabling Code-Switched Machine Translation

Code-switching is a common phenomenon among multilingual speakers, where alternation between two or more languages occurs within the context of a single conversation. While multilingual humans can seamlessly switch back and forth between…

Computation and Language · Computer Science 2022-10-12 Thamme Gowda, Mozhdeh Gheini, Jonathan May

Integrating Knowledge in End-to-End Automatic Speech Recognition for Mandarin-English Code-Switching

Code-Switching (CS) is a common linguistic phenomenon in multilingual communities that consists of switching between languages while speaking. This paper presents our investigations on end-to-end speech recognition for Mandarin-English CS…

Computation and Language · Computer Science 2021-12-21 Chia-Yu Li, Ngoc Thang Vu

Exploring the Correlation between Human and Machine Evaluation of Simultaneous Speech Translation

Assessing the performance of interpreting services is a complex task, given the nuanced nature of spoken language translation, the strategies that interpreters apply, and the diverse expectations of users. The complexity of this task become…

Computation and Language · Computer Science 2024-06-17 Xiaoman Wang, Claudio Fantinuoli

Adapting the adapters for code-switching in multilingual ASR

Recently, large pre-trained multilingual speech models have shown potential in scaling Automatic Speech Recognition (ASR) to many low-resource languages. Some of these models employ language adapters in their formulation, which helps to…

Computation and Language · Computer Science 2023-10-12 Atharva Kulkarni, Ajinkya Kulkarni, Miguel Couceiro, Hanan Aldarmaki

Attention-Guided Adaptation for Code-Switching Speech Recognition

The prevalence of powerful multilingual models, such as Whisper, has significantly advanced research on speech recognition. However, these models often struggle with handling the code-switching setting, which is essential in…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-15 Bobbi Aditya, Mahdin Rohmatillah, Liang-Hsuan Tai, Jen-Tzung Chien

Testing Correctness, Fairness, and Robustness of Speech Emotion Recognition Models

Machine learning models for speech emotion recognition (SER) can be trained for different tasks and are usually evaluated based on a few available datasets per task. Tasks could include arousal, valence, dominance, emotional categories, or…

Audio and Speech Processing · Electrical Eng. & Systems 2025-02-13 Anna Derington, Hagen Wierstorf, Ali Özkil, Florian Eyben, Felix Burkhardt, Björn W. Schuller

Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition

Modeling code-switched speech is an important problem in automatic speech recognition (ASR). Labeled code-switched data are rare, so monolingual data are often used to model code-switched speech. These monolingual data may be more closely…

Computation and Language · Computer Science 2021-06-16 Andrew Slottje, Shannon Wotherspoon, William Hartmann, Matthew Snover, Owen Kimball

A New Benchmark for Evaluating Automatic Speech Recognition in the Arabic Call Domain

This work is an attempt to introduce a comprehensive benchmark for Arabic speech recognition, specifically tailored to address the challenges of telephone conversations in Arabic language. Arabic, characterized by its rich dialectal…

Artificial Intelligence · Computer Science 2024-05-31 Qusai Abo Obaidah, Muhy Eddin Za'ter, Adnan Jaljuli, Ali Mahboub, Asma Hakouz, Bashar Al-Rfooh, Yazan Estaitia

End-to-End Code Switching Language Models for Automatic Speech Recognition

In this paper, we particularly work on the code-switched text, one of the most common occurrences in the bilingual communities across the world. Due to the discrepancies in the extraction of code-switched text from an Automated Speech…

Computation and Language · Computer Science 2020-06-17 Ahan M. R., Shreyas Sunil Kulkarni

ChiEngMixBench: Evaluating Large Language Models on Spontaneous and Natural Chinese-English Code-Mixed Generation

Code-mixing is increasingly prevalent in interactions between humans and large language models, yet existing work often reduces it to a translation or convertibility problem, making it difficult to assess whether a model's switching…

Computation and Language · Computer Science 2026-01-26 Qingyan Yang, Tongxi Wang, Yunsheng Luo

Zero Resource Code-switched Speech Benchmark Using Speech Utterance Pairs For Multiple Spoken Languages

We introduce a new zero resource code-switched speech benchmark designed to directly assess the code-switching capabilities of self-supervised speech encoders. We showcase a baseline system of language modeling on discrete units to…

Audio and Speech Processing · Electrical Eng. & Systems 2024-03-19 Kuan-Po Huang, Chih-Kai Yang, Yu-Kuan Fu, Ewan Dunbar, Hung-yi Lee