Related papers: Dual Script E2E framework for Multilingual and Cod…

Balanced End-to-End Monolingual pre-training for Low-Resourced Indic Languages Code-Switching Speech Recognition

The success in designing Code-Switching (CS) ASR often depends on the availability of the transcribed CS resources. Such dependency harms the development of ASR in low-resourced languages such as Bengali and Hindi. In this paper, we exploit…

Computation and Language · Computer Science 2022-02-16 Amir Hussein , Shammur Chowdhury , Najim Dehak , Ahmed Ali

Code Switched and Code Mixed Speech Recognition for Indic languages

Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and lexical information is typically language specific. Training multilingual system for Indic languages is even more tougher due to lack of…

Computation and Language · Computer Science 2022-06-14 Harveen Singh Chadha , Priyanshi Shah , Ankur Dhuriya , Neeraj Chhimwal , Anirudh Gupta , Vivek Raghavan

Multilingual and code-switching ASR challenges for low resource Indian languages

Recently, there is increasing interest in multilingual automatic speech recognition (ASR) where a speech recognition system caters to multiple low resource languages by taking advantage of low amounts of labeled corpora in multiple…

Computation and Language · Computer Science 2021-09-21 Anuj Diwan , Rakesh Vaideeswaran , Sanket Shah , Ankita Singh , Srinivasa Raghavan , Shreya Khare , Vinit Unni , Saurabh Vyas , Akash Rajpuria , Chiranjeevi Yarra , Ashish Mittal , Prasanta Kumar Ghosh , Preethi Jyothi , Kalika Bali , Vivek Seshadri , Sunayana Sitaram , Samarth Bharadwaj , Jai Nanavati , Raoul Nanavati , Karthik Sankaranarayanan , Tejaswi Seeram , Basil Abraham

Reducing language context confusion for end-to-end code-switching automatic speech recognition

Code-switching deals with alternative languages in communication process. Training end-to-end (E2E) automatic speech recognition (ASR) systems for code-switching is especially challenging as code-switching training data are always…

Computation and Language · Computer Science 2022-06-30 Shuai Zhang , Jiangyan Yi , Zhengkun Tian , Jianhua Tao , Yu Ting Yeung , Liqun Deng

Large-Scale Multilingual Speech Recognition with a Streaming End-to-End Model

Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic speech recognition (ASR) coverage of the world's languages. They have shown improvement over monolingual systems, and have simplified training and…

Audio and Speech Processing · Electrical Eng. & Systems 2019-09-13 Anjuli Kannan , Arindrima Datta , Tara N. Sainath , Eugene Weinstein , Bhuvana Ramabhadran , Yonghui Wu , Ankur Bapna , Zhifeng Chen , Seungji Lee

Language-agnostic Code-Switching in Sequence-To-Sequence Speech Recognition

Code-Switching (CS) is referred to the phenomenon of alternately using words and phrases from different languages. While today's neural end-to-end (E2E) models deliver state-of-the-art performances on the task of automatic speech…

Computation and Language · Computer Science 2023-07-04 Enes Yavuz Ugan , Christian Huber , Juan Hussain , Alexander Waibel

Multilingual Speech Recognition With A Single End-To-End Model

Training a conventional automatic speech recognition (ASR) system to support multiple languages is challenging because the sub-word unit, lexicon and word inventories are typically language specific. In contrast, sequence-to-sequence models…

Audio and Speech Processing · Electrical Eng. & Systems 2018-02-16 Shubham Toshniwal , Tara N. Sainath , Ron J. Weiss , Bo Li , Pedro Moreno , Eugene Weinstein , Kanishka Rao

DuDe: Dual-Decoder Multilingual ASR for Indian Languages using Common Label Set

In a multilingual country like India, multilingual Automatic Speech Recognition (ASR) systems have much scope. Multilingual ASR systems exhibit many advantages like scalability, maintainability, and improved performance over the monolingual…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-01 Arunkumar A , Mudit Batra , Umesh S

Building Robust and Scalable Multilingual ASR for Indian Languages

This paper describes the systems developed by SPRING Lab, Indian Institute of Technology Madras, for the ASRU MADASR 2.0 challenge. The systems developed focuses on adapting ASR systems to improve in predicting the language and dialect of…

Computation and Language · Computer Science 2025-11-20 Arjun Gangwar , Kaousheik Jayakumar , S. Umesh

End-to-end contextual speech recognition using class language models and a token passing decoder

End-to-end modeling (E2E) of automatic speech recognition (ASR) blends all the components of a traditional speech recognition system into a unified model. Although it simplifies training and decoding pipelines, the unified model is hard to…

Computation and Language · Computer Science 2018-12-06 Zhehuai Chen , Mahaveer Jain , Yongqiang Wang , Michael L. Seltzer , Christian Fuegen

The Tag-Team Approach: Leveraging CLS and Language Tagging for Enhancing Multilingual ASR

Building a multilingual Automated Speech Recognition (ASR) system in a linguistically diverse country like India can be a challenging task due to the differences in scripts and the limited availability of speech data. This problem can be…

Computation and Language · Computer Science 2023-06-01 Kaousheik Jayakumar , Vrunda N. Sukhadia , A Arunkumar , S. Umesh

End-to-End Code-Switching ASR for Low-Resourced Language Pairs

Despite the significant progress in end-to-end (E2E) automatic speech recognition (ASR), E2E ASR for low resourced code-switching (CS) speech has not been well studied. In this work, we describe an E2E ASR pipeline for the recognition of CS…

Computation and Language · Computer Science 2019-10-01 Xianghu Yue , Grandee Lee , Emre Yılmaz , Fang Deng , Haizhou Li

A two-stage transliteration approach to improve performance of a multilingual ASR

End-to-end Automatic Speech Recognition (ASR) systems are rapidly claiming to become state-of-art over other modeling methods. Several techniques have been introduced to improve their ability to handle multiple languages. However, due to…

Computation and Language · Computer Science 2024-10-22 Rohit Kumar

End-to-End ASR for Code-switched Hindi-English Speech

End-to-end (E2E) models have been explored for large speech corpora and have been found to match or outperform traditional pipeline-based systems in some languages. However, most prior work on end-to-end models use speech corpora exceeding…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-25 Brij Mohan Lal Srivastava , Basil Abraham , Sunayana Sitaram , Rupesh Mehta , Preethi Jyothi

Multilingual Speech Recognition using Knowledge Transfer across Learning Processes

Multilingual end-to-end(E2E) models have shown a great potential in the expansion of the language coverage in the realm of automatic speech recognition(ASR). In this paper, we aim to enhance the multilingual ASR performance in two ways,…

Computation and Language · Computer Science 2021-10-18 Rimita Lahiri , Kenichi Kumatani , Eric Sun , Yao Qian

Streaming Bilingual End-to-End ASR model using Attention over Multiple Softmax

Even with several advancements in multilingual modeling, it is challenging to recognize multiple languages using a single neural model, without knowing the input language and most multilingual models assume the availability of the input…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-23 Aditya Patil , Vikas Joshi , Purvi Agrawal , Rupesh Mehta

Hindi-English Code-Switching Speech Corpus

Code-switching refers to the usage of two languages within a sentence or discourse. It is a global phenomenon among multilingual communities and has emerged as an independent area of research. With the increasing demand for the…

Computation and Language · Computer Science 2018-10-02 Ganji Sreeram , Kunal Dhawan , Rohit Sinha

Enhancing Code-Switching ASR Leveraging Non-Peaky CTC Loss and Deep Language Posterior Injection

Code-switching-where multilingual speakers alternately switch between languages during conversations-still poses significant challenges to end-to-end (E2E) automatic speech recognition (ASR) systems due to phenomena of both acoustic and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-12-13 Tzu-Ting Yang , Hsin-Wei Wang , Yi-Cheng Wang , Berlin Chen

A Study of Multilingual End-to-End Speech Recognition for Kazakh, Russian, and English

We study training a single end-to-end (E2E) automatic speech recognition (ASR) model for three languages used in Kazakhstan: Kazakh, Russian, and English. We first describe the development of multilingual E2E ASR based on Transformer…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-04 Saida Mussakhojayeva , Yerbolat Khassanov , Huseyin Atakan Varol

Investigating Target Set Reduction for End-to-End Speech Recognition of Hindi-English Code-Switching Data

End-to-end (E2E) systems are fast replacing the conventional systems in the domain of automatic speech recognition. As the target labels are learned directly from speech data, the E2E systems need a bigger corpus for effective training. In…

Audio and Speech Processing · Electrical Eng. & Systems 2019-07-22 Kunal Dhawan , Ganji Sreeram , Kumar Priyadarshi , Rohit Sinha