Related papers: Arabic Code-Switching Speech Recognition using Mon…

Towards One Model to Rule All: Multilingual Strategy for Dialectal Code-Switching Arabic ASR

With the advent of globalization, there is an increasing demand for multilingual automatic speech recognition (ASR), handling language and dialectal variation of spoken content. Recent studies show its efficacy over monolingual systems. In…

Computation and Language · Computer Science 2021-07-06 Shammur Absar Chowdhury , Amir Hussein , Ahmed Abdelali , Ahmed Ali

Learning to Recognize Code-switched Speech Without Forgetting Monolingual Speech Recognition

Recently, there has been significant progress made in Automatic Speech Recognition (ASR) of code-switched speech, leading to gains in accuracy on code-switched datasets in many language pairs. Code-switched speech co-occurs with monolingual…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-02 Sanket Shah , Basil Abraham , Gurunath Reddy M , Sunayana Sitaram , Vikas Joshi

Dialectal Coverage And Generalization in Arabic Speech Recognition

Developing robust automatic speech recognition (ASR) systems for Arabic requires effective strategies to manage its diversity. Existing ASR systems mainly cover the modern standard Arabic (MSA) variety and few high-resource dialects, but…

Computation and Language · Computer Science 2025-06-02 Amirbek Djanibekov , Hawau Olamide Toyin , Raghad Alshalan , Abdullah Alitr , Hanan Aldarmaki

Investigations on Speech Recognition Systems for Low-Resource Dialectal Arabic-English Code-Switching Speech

Code-switching (CS), defined as the mixing of languages in conversations, has become a worldwide phenomenon. The prevalence of CS has been recently met with a growing demand and interest to build CS ASR systems. In this paper, we present…

Computation and Language · Computer Science 2021-08-31 Injy Hamed , Pavel Denisov , Chia-Yu Li , Mohamed Elmahdy , Slim Abdennadher , Ngoc Thang Vu

End-to-End Code Switching Language Models for Automatic Speech Recognition

In this paper, we particularly work on the code-switched text, one of the most common occurrences in the bilingual communities across the world. Due to the discrepancies in the extraction of code-switched text from an Automated Speech…

Computation and Language · Computer Science 2020-06-17 Ahan M. R. , Shreyas Sunil Kulkarni

Code Switched and Code Mixed Speech Recognition for Indic languages

Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and lexical information is typically language specific. Training multilingual system for Indic languages is even more tougher due to lack of…

Computation and Language · Computer Science 2022-06-14 Harveen Singh Chadha , Priyanshi Shah , Ankur Dhuriya , Neeraj Chhimwal , Anirudh Gupta , Vivek Raghavan

Leveraging Data Collection and Unsupervised Learning for Code-switched Tunisian Arabic Automatic Speech Recognition

Crafting an effective Automatic Speech Recognition (ASR) solution for dialects demands innovative approaches that not only address the data scarcity issue but also navigate the intricacies of linguistic diversity. In this paper, we address…

Audio and Speech Processing · Electrical Eng. & Systems 2023-09-26 Ahmed Amine Ben Abdallah , Ata Kabboudi , Amir Kanoun , Salah Zaiem

Adapting the adapters for code-switching in multilingual ASR

Recently, large pre-trained multilingual speech models have shown potential in scaling Automatic Speech Recognition (ASR) to many low-resource languages. Some of these models employ language adapters in their formulation, which helps to…

Computation and Language · Computer Science 2023-10-12 Atharva Kulkarni , Ajinkya Kulkarni , Miguel Couceiro , Hanan Aldarmaki

ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs

Motivated by the widespread increase in the phenomenon of code-switching between Egyptian Arabic and English in recent times, this paper explores the intricacies of machine translation (MT) and automatic speech recognition (ASR) systems,…

Computation and Language · Computer Science 2024-07-16 Ahmed Heakl , Youssef Zaghloul , Mennatullah Ali , Rania Hossam , Walid Gomaa

Transformer-Transducers for Code-Switched Speech Recognition

We live in a world where 60% of the population can speak two or more languages fluently. Members of these communities constantly switch between languages when having a conversation. As automatic speech recognition (ASR) systems are being…

Computation and Language · Computer Science 2021-02-16 Siddharth Dalmia , Yuzong Liu , Srikanth Ronanki , Katrin Kirchhoff

Language Modeling for Code-Switching: Evaluation, Integration of Monolingual Data, and Discriminative Training

We focus on the problem of language modeling for code-switched language, in the context of automatic speech recognition (ASR). Language modeling for code-switched language is challenging for (at least) three reasons: (1) lack of available…

Computation and Language · Computer Science 2019-11-12 Hila Gonen , Yoav Goldberg

Learning not to Discriminate: Task Agnostic Learning for Improving Monolingual and Code-switched Speech Recognition

Recognizing code-switched speech is challenging for Automatic Speech Recognition (ASR) for a variety of reasons, including the lack of code-switched training data. Recently, we showed that monolingual ASR systems fine-tuned on code-switched…

Audio and Speech Processing · Electrical Eng. & Systems 2020-06-11 Gurunath Reddy Madhumani , Sanket Shah , Basil Abraham , Vikas Joshi , Sunayana Sitaram

Textual Data Augmentation for Arabic-English Code-Switching Speech Recognition

The pervasiveness of intra-utterance code-switching (CS) in spoken content requires that speech recognition (ASR) systems handle mixed language. Designing a CS-ASR system has many challenges, mainly due to data scarcity, grammatical…

Computation and Language · Computer Science 2023-01-12 Amir Hussein , Shammur Absar Chowdhury , Ahmed Abdelali , Najim Dehak , Ahmed Ali , Sanjeev Khudanpur

Optimizing Bilingual Neural Transducer with Synthetic Code-switching Text Generation

Code-switching describes the practice of using more than one language in the same sentence. In this study, we investigate how to optimize a neural transducer based bilingual automatic speech recognition (ASR) model for code-switching…

Sound · Computer Science 2022-10-25 Thien Nguyen , Nathalie Tran , Liuhui Deng , Thiago Fraga da Silva , Matthew Radzihovsky , Roger Hsiao , Henry Mason , Stefan Braun , Erik McDermott , Dogan Can , Pawel Swietojanski , Lyan Verwimp , Sibel Oyman , Tresi Arvizo , Honza Silovsky , Arnab Ghoshal , Mathieu Martel , Bharat Ram Ambati , Mohamed Ali

Multi-Graph Decoding for Code-Switching ASR

In the FAME! Project, a code-switching (CS) automatic speech recognition (ASR) system for Frisian-Dutch speech is developed that can accurately transcribe the local broadcaster's bilingual archives with CS speech. This archive contains…

Computation and Language · Computer Science 2019-07-01 Emre Yılmaz , Samuel Cohen , Xianghu Yue , David van Leeuwen , Haizhou Li

Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter

Code-switching (CS) phenomenon occurs when words or phrases from different languages are alternated in a single sentence. Due to data scarcity, building an effective CS Automatic Speech Recognition (ASR) system remains challenging. In this…

Audio and Speech Processing · Electrical Eng. & Systems 2024-09-23 Yu Xi , Wen Ding , Kai Yu , Junjie Lai

ArTST: Arabic Text and Speech Transformer

We present ArTST, a pre-trained Arabic text and speech transformer for supporting open-source speech technologies for the Arabic language. The model architecture follows the unified-modal framework, SpeechT5, that was recently released for…

Computation and Language · Computer Science 2023-10-26 Hawau Olamide Toyin , Amirbek Djanibekov , Ajinkya Kulkarni , Hanan Aldarmaki

Reducing language context confusion for end-to-end code-switching automatic speech recognition

Code-switching deals with alternative languages in communication process. Training end-to-end (E2E) automatic speech recognition (ASR) systems for code-switching is especially challenging as code-switching training data are always…

Computation and Language · Computer Science 2022-06-30 Shuai Zhang , Jiangyan Yi , Zhengkun Tian , Jianhua Tao , Yu Ting Yeung , Liqun Deng

Using heterogeneity in semi-supervised transcription hypotheses to improve code-switched speech recognition

Modeling code-switched speech is an important problem in automatic speech recognition (ASR). Labeled code-switched data are rare, so monolingual data are often used to model code-switched speech. These monolingual data may be more closely…

Computation and Language · Computer Science 2021-06-16 Andrew Slottje , Shannon Wotherspoon , William Hartmann , Matthew Snover , Owen Kimball

Benchmarking Commercial ASR Systems on Code-Switching Speech: Arabic, Persian, and German

Code-switching -- the natural alternation between two languages within a single utterance -- remains one of the most challenging and under-studied conditions for automatic speech recognition (ASR). We present a benchmark evaluating five…

Computation and Language · Computer Science 2026-05-25 Sajjad Abdoli , Ghassan Al-Sumaidaee , Clayton W. Taylor , Ahmad ElShiekh , Ahmed Rashad