English
Related papers

Related papers: Dual Script E2E framework for Multilingual and Cod…

200 papers

The success in designing Code-Switching (CS) ASR often depends on the availability of the transcribed CS resources. Such dependency harms the development of ASR in low-resourced languages such as Bengali and Hindi. In this paper, we exploit…

Computation and Language · Computer Science 2022-02-16 Amir Hussein , Shammur Chowdhury , Najim Dehak , Ahmed Ali

Training multilingual automatic speech recognition (ASR) systems is challenging because acoustic and lexical information is typically language specific. Training multilingual system for Indic languages is even more tougher due to lack of…

Computation and Language · Computer Science 2022-06-14 Harveen Singh Chadha , Priyanshi Shah , Ankur Dhuriya , Neeraj Chhimwal , Anirudh Gupta , Vivek Raghavan

Recently, there is increasing interest in multilingual automatic speech recognition (ASR) where a speech recognition system caters to multiple low resource languages by taking advantage of low amounts of labeled corpora in multiple…

Code-switching deals with alternative languages in communication process. Training end-to-end (E2E) automatic speech recognition (ASR) systems for code-switching is especially challenging as code-switching training data are always…

Computation and Language · Computer Science 2022-06-30 Shuai Zhang , Jiangyan Yi , Zhengkun Tian , Jianhua Tao , Yu Ting Yeung , Liqun Deng

Multilingual end-to-end (E2E) models have shown great promise in expansion of automatic speech recognition (ASR) coverage of the world's languages. They have shown improvement over monolingual systems, and have simplified training and…

Audio and Speech Processing · Electrical Eng. & Systems 2019-09-13 Anjuli Kannan , Arindrima Datta , Tara N. Sainath , Eugene Weinstein , Bhuvana Ramabhadran , Yonghui Wu , Ankur Bapna , Zhifeng Chen , Seungji Lee

Code-Switching (CS) is referred to the phenomenon of alternately using words and phrases from different languages. While today's neural end-to-end (E2E) models deliver state-of-the-art performances on the task of automatic speech…

Computation and Language · Computer Science 2023-07-04 Enes Yavuz Ugan , Christian Huber , Juan Hussain , Alexander Waibel

Training a conventional automatic speech recognition (ASR) system to support multiple languages is challenging because the sub-word unit, lexicon and word inventories are typically language specific. In contrast, sequence-to-sequence models…

Audio and Speech Processing · Electrical Eng. & Systems 2018-02-16 Shubham Toshniwal , Tara N. Sainath , Ron J. Weiss , Bo Li , Pedro Moreno , Eugene Weinstein , Kanishka Rao

In a multilingual country like India, multilingual Automatic Speech Recognition (ASR) systems have much scope. Multilingual ASR systems exhibit many advantages like scalability, maintainability, and improved performance over the monolingual…

Audio and Speech Processing · Electrical Eng. & Systems 2022-11-01 Arunkumar A , Mudit Batra , Umesh S

This paper describes the systems developed by SPRING Lab, Indian Institute of Technology Madras, for the ASRU MADASR 2.0 challenge. The systems developed focuses on adapting ASR systems to improve in predicting the language and dialect of…

Computation and Language · Computer Science 2025-11-20 Arjun Gangwar , Kaousheik Jayakumar , S. Umesh

End-to-end modeling (E2E) of automatic speech recognition (ASR) blends all the components of a traditional speech recognition system into a unified model. Although it simplifies training and decoding pipelines, the unified model is hard to…

Computation and Language · Computer Science 2018-12-06 Zhehuai Chen , Mahaveer Jain , Yongqiang Wang , Michael L. Seltzer , Christian Fuegen

Building a multilingual Automated Speech Recognition (ASR) system in a linguistically diverse country like India can be a challenging task due to the differences in scripts and the limited availability of speech data. This problem can be…

Computation and Language · Computer Science 2023-06-01 Kaousheik Jayakumar , Vrunda N. Sukhadia , A Arunkumar , S. Umesh

Despite the significant progress in end-to-end (E2E) automatic speech recognition (ASR), E2E ASR for low resourced code-switching (CS) speech has not been well studied. In this work, we describe an E2E ASR pipeline for the recognition of CS…

Computation and Language · Computer Science 2019-10-01 Xianghu Yue , Grandee Lee , Emre Yılmaz , Fang Deng , Haizhou Li

End-to-end Automatic Speech Recognition (ASR) systems are rapidly claiming to become state-of-art over other modeling methods. Several techniques have been introduced to improve their ability to handle multiple languages. However, due to…

Computation and Language · Computer Science 2024-10-22 Rohit Kumar

End-to-end (E2E) models have been explored for large speech corpora and have been found to match or outperform traditional pipeline-based systems in some languages. However, most prior work on end-to-end models use speech corpora exceeding…

Audio and Speech Processing · Electrical Eng. & Systems 2019-06-25 Brij Mohan Lal Srivastava , Basil Abraham , Sunayana Sitaram , Rupesh Mehta , Preethi Jyothi

Multilingual end-to-end(E2E) models have shown a great potential in the expansion of the language coverage in the realm of automatic speech recognition(ASR). In this paper, we aim to enhance the multilingual ASR performance in two ways,…

Computation and Language · Computer Science 2021-10-18 Rimita Lahiri , Kenichi Kumatani , Eric Sun , Yao Qian

Even with several advancements in multilingual modeling, it is challenging to recognize multiple languages using a single neural model, without knowing the input language and most multilingual models assume the availability of the input…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-23 Aditya Patil , Vikas Joshi , Purvi Agrawal , Rupesh Mehta

Code-switching refers to the usage of two languages within a sentence or discourse. It is a global phenomenon among multilingual communities and has emerged as an independent area of research. With the increasing demand for the…

Computation and Language · Computer Science 2018-10-02 Ganji Sreeram , Kunal Dhawan , Rohit Sinha

Code-switching-where multilingual speakers alternately switch between languages during conversations-still poses significant challenges to end-to-end (E2E) automatic speech recognition (ASR) systems due to phenomena of both acoustic and…

Audio and Speech Processing · Electrical Eng. & Systems 2024-12-13 Tzu-Ting Yang , Hsin-Wei Wang , Yi-Cheng Wang , Berlin Chen

We study training a single end-to-end (E2E) automatic speech recognition (ASR) model for three languages used in Kazakhstan: Kazakh, Russian, and English. We first describe the development of multilingual E2E ASR based on Transformer…

Audio and Speech Processing · Electrical Eng. & Systems 2021-08-04 Saida Mussakhojayeva , Yerbolat Khassanov , Huseyin Atakan Varol

End-to-end (E2E) systems are fast replacing the conventional systems in the domain of automatic speech recognition. As the target labels are learned directly from speech data, the E2E systems need a bigger corpus for effective training. In…

Audio and Speech Processing · Electrical Eng. & Systems 2019-07-22 Kunal Dhawan , Ganji Sreeram , Kumar Priyadarshi , Rohit Sinha
‹ Prev 1 2 3 10 Next ›