English
Related papers

Related papers: Meta-Transfer Learning for Code-Switched Speech Re…

200 papers

In this thesis, we address the data scarcity and limitations of linguistic theory by proposing language-agnostic multi-task training methods. First, we introduce a meta-learning-based approach, meta-transfer learning, in which information…

Computation and Language · Computer Science 2021-04-14 Genta Indra Winata

Transfer learning is a vital technique that generalizes models trained for one setting or task to other settings or tasks. For example in speech recognition, an acoustic model trained for one language can be used to recognize speech in…

Computation and Language · Computer Science 2015-11-20 Dong Wang , Thomas Fang Zheng

In this work, we present a simple and elegant approach to language modeling for bilingual code-switched text. Since code-switching is a blend of two or more different languages, a standard bilingual language model can be improved upon by…

Computation and Language · Computer Science 2018-08-06 Saurabh Garg , Tanmay Parekh , Preethi Jyothi

Best-performing speech models are trained on large amounts of data in the language they are meant to work for. However, most languages have sparse data, making training models challenging. This shortage of data is even more prevalent in…

Computation and Language · Computer Science 2024-10-08 David-Gabriel Ion , Răzvan-Alexandru Smădu , Dumitru-Clementin Cercel , Florin Pop , Mihaela-Claudia Cercel

The prevalence of the powerful multilingual models, such as Whisper, has significantly advanced the researches on speech recognition. However, these models often struggle with handling the code-switching setting, which is essential in…

Audio and Speech Processing · Electrical Eng. & Systems 2024-01-15 Bobbi Aditya , Mahdin Rohmatillah , Liang-Hsuan Tai , Jen-Tzung Chien

Although Automatic Speech Recognition (ASR) systems have achieved human-like performance for a few languages, the majority of the world's languages do not have usable systems due to the lack of large speech datasets to train these models.…

Computation and Language · Computer Science 2022-02-28 Hemant Yadav , Sunayana Sitaram

We live in a world where 60% of the population can speak two or more languages fluently. Members of these communities constantly switch between languages when having a conversation. As automatic speech recognition (ASR) systems are being…

Computation and Language · Computer Science 2021-02-16 Siddharth Dalmia , Yuzong Liu , Srikanth Ronanki , Katrin Kirchhoff

Neural machine translation is known to require large numbers of parallel training sentences, which generally prevent it from excelling on low-resource language pairs. This thesis explores the use of cross-lingual transfer learning on neural…

Computation and Language · Computer Science 2020-01-07 Tom Kocmi

Speech separation aims to separate multiple speech sources from a speech mixture. Although speech separation is well-solved on some existing English speech separation benchmarks, it is worthy of more investigation on the generalizability of…

Sound · Computer Science 2022-03-14 Kuan-Po Huang , Yuan-Kuei Wu , Hung-yi Lee

Learning what to share between tasks has been a topic of great importance recently, as strategic sharing of knowledge has been shown to improve downstream task performance. This is particularly important for multilingual applications, as…

Computation and Language · Computer Science 2020-10-06 Farhad Nooralahzadeh , Giannis Bekoulis , Johannes Bjerva , Isabelle Augenstein

For languages with no annotated resources, transferring knowledge from rich-resource languages is an effective solution for named entity recognition (NER). While all existing methods directly transfer from source-learned model to a target…

Computation and Language · Computer Science 2020-07-16 Qianhui Wu , Zijia Lin , Guoxin Wang , Hui Chen , Börje F. Karlsson , Biqing Huang , Chin-Yew Lin

Transfer learning has been proven as an effective technique for neural machine translation under low-resource conditions. Existing methods require a common target language, language relatedness, or specific training tricks and regimes. We…

Computation and Language · Computer Science 2020-07-09 Tom Kocmi , Ondřej Bojar

Meta-Learning is a subarea of Machine Learning that aims to take advantage of prior knowledge to learn faster and with fewer data [1]. There are different scenarios where meta-learning can be applied, and one of the most common is algorithm…

Machine Learning · Computer Science 2019-10-17 Gean Trindade Pereira , Moisés dos Santos , Edesio Alcobaça , Rafael Mantovani , André Carvalho

Speech processing systems currently do not support the vast majority of languages, in part due to the lack of data in low-resource languages. Cross-lingual transfer offers a compelling way to help bridge this digital divide by incorporating…

Audio and Speech Processing · Electrical Eng. & Systems 2021-11-03 Peter Wu , Jiatong Shi , Yifan Zhong , Shinji Watanabe , Alan W Black

The combination of multilingual pre-trained representations and cross-lingual transfer learning is one of the most effective methods for building functional NLP systems for low-resource languages. However, for extremely low-resource…

Computation and Language · Computer Science 2021-04-19 Mengzhou Xia , Guoqing Zheng , Subhabrata Mukherjee , Milad Shokouhi , Graham Neubig , Ahmed Hassan Awadallah

The success of pretrained cross-lingual language models relies on two essential abilities, i.e., generalization ability for learning downstream tasks in a source language, and cross-lingual transferability for transferring the task…

Computation and Language · Computer Science 2021-09-24 Zewen Chi , Heyan Huang , Luyang Liu , Yu Bai , Xian-Ling Mao

Code-switching, also called code-mixing, is the linguistics phenomenon where in casual settings, multilingual speakers mix words from different languages in one utterance. Due to its spontaneous nature, code-switching is extremely…

Computation and Language · Computer Science 2023-06-01 Shuyue Stella Li , Cihan Xiao , Tianjian Li , Bismarck Odoom

Large multilingual language models typically share their parameters across all languages, which enables cross-lingual task transfer, but learning can also be hindered when training updates from different languages are in conflict. In this…

Computation and Language · Computer Science 2022-11-02 Rochelle Choenni , Dan Garrette , Ekaterina Shutova

Representation learning of source code is essential for applying machine learning to software engineering tasks. Learning code representation from a multilingual source code dataset has been shown to be more effective than learning from…

Software Engineering · Computer Science 2022-12-07 Weiguo Pian , Hanyu Peng , Xunzhu Tang , Tiezhu Sun , Haoye Tian , Andrew Habib , Jacques Klein , Tegawendé F. Bissyandé

Most Transformer language models are primarily pretrained on English text, limiting their use for other languages. As the model sizes grow, the performance gap between English and other languages with fewer compute and data resources…

Computation and Language · Computer Science 2023-01-24 Malte Ostendorff , Georg Rehm
‹ Prev 1 2 3 10 Next ›