Related papers: Multi-task Learning for Low-resource Second Langua…

200 papers

Spoken language understanding (SLU) treats automatic speech recognition (ASR) and natural language understanding (NLU) as a unified task and usually suffers from data scarcity. We exploit an ASR and NLU joint training method based on meta…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-28 Yingying Gao, Junlan Feng, Chao Deng, Shilei Zhang

Second language acquisition (SLA) is a complex and dynamic process. Many SLA studies that have attempted to record and analyze this process have typically focused on a single modality (e.g., textual output of learners), covered only a short…

Computation and Language · Computer Science 2024-03-27 Masato Hagiwara, Joshua Tanner

Spoken Language Assessment (SLA) estimates a learner's oral proficiency from spontaneous speech. The growing population of L2 English speakers has intensified the demand for reliable SLA, a critical component of Computer Assisted Language…

Computation and Language · Computer Science 2025-09-22 Hong-Yun Lin, Jhen-Ke Lin, Chung-Chun Wang, Hao-Chien Lu, Berlin Chen

We explore the benefits that multitask learning offers to speech processing as we train models on dual objectives with automatic speech recognition and intent classification or sentiment classification. Our models, although being of modest…

Computation and Language · Computer Science 2022-11-28 Quentin Meeus, Marie-Francine Moens, Hugo Van hamme

We present a generative model for multitask conditional language generation. Our guiding hypothesis is that a shared set of latent skills underlies many disparate language generation tasks, and that explicitly modelling these skills in a…

Computation and Language · Computer Science 2020-02-25 Kris Cao, Dani Yogatama

Training Large Language Models (LLMs) from scratch requires immense computational resources, making it prohibitively expensive. Model scaling-up offers a promising solution by leveraging the parameters of smaller models to create larger…

Machine Learning · Computer Science 2025-02-20 Yifei Yang, Zouying Cao, Xinbei Ma, Yao Yao, Libo Qin, Zhi Chen, Hai Zhao

The utilization of speech Self-Supervised Learning (SSL) models achieves impressive performance on Automatic Speech Recognition (ASR). However, in low-resource language ASR, they encounter the domain mismatch problem between pre-trained and…

Learning general representations of text is a fundamental problem for many natural language understanding (NLU) tasks. Previously, researchers have proposed to use language model pre-training and multi-task learning to learn robust…

Computation and Language · Computer Science 2019-08-29 Zi-Yi Dou, Keyi Yu, Antonios Anastasopoulos

Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously. While sometimes the underlying task relationship structure is known, often the structure needs to be estimated from data…

In this paper, we proposed to apply a meta-learning approach to low-resource automatic speech recognition (ASR). We formulated ASR for different languages as different tasks, and meta-learned the initialization parameters from many…

Sound · Computer Science 2019-10-29 Jui-Yang Hsu, Yuan-Jui Chen, Hung-yi Lee

Self-supervised learning (SSL), as a newly emerging unsupervised representation learning paradigm, generally follows a two-stage learning pipeline: 1) learning invariant and discriminative representations with auto-annotation pretext(s),…

Machine Learning · Computer Science 2022-08-23 Jiayu Yao, Qingyuan Wu, Quan Feng, Songcan Chen

The lack of text data has been the major issue in code-switching language modeling. In this paper, we introduce a multi-task learning based language model that shares a syntax representation of languages to leverage linguistic information and…

Computation and Language · Computer Science 2018-10-05 Genta Indra Winata, Andrea Madotto, Chien-Sheng Wu, Pascale Fung

Deep learning based discriminative methods, being the state-of-the-art machine learning techniques, are ill-suited for learning from lower amounts of data. In this paper, we propose a novel framework, called simultaneous two sample learning…

Computation and Language · Computer Science 2017-12-18 Sri Harsha Dumpala, Rupayan Chakraborty, Sunil Kumar Kopparapu

Language models (LMs) are pre-trained on raw text datasets to generate text sequences token-by-token. While this approach facilitates the learning of world knowledge and reasoning, it does not explicitly optimize for linguistic competence.…

Computation and Language · Computer Science 2026-04-17 Atsuki Yamaguchi, Maggie Mi, Nikolaos Aletras

Transformer-based architectures have shown notable results on many downstream tasks, including question answering. The limited availability of data, on the other hand, impedes obtaining legitimate performance for low-resource languages. In this…

Computation and Language · Computer Science 2024-09-04 Hariom A. Pandya, Bhavik Ardeshna, Brijesh S. Bhatt

Multi-agent systems with smaller language models (SLMs) present a viable alternative to single agent systems powered by large language models (LLMs) for addressing complex problems. In this work, we study how these alternatives compare in…

Although large language models (LLMs) have achieved significant success in natural language processing, they still struggle with long-context comprehension. Traditional approaches to mitigating this issue typically rely on fine-tuning or…

Computation and Language · Computer Science 2025-02-25 Yifei Gao, Shaohong Chen, Lei Wang, Ruiting Dai, Ziyun Zhang, Kerui Ren, Jiaji Wu, Jun Cheng

Self-supervised learning (SSL) is able to build latent representations that generalize well to unseen data. However, only a few SSL techniques exist for the online continual learning (CL) setting, where data arrives in small minibatches, the model must comply…

Machine Learning · Computer Science 2025-07-16 Giacomo Cignoni, Andrea Cossu, Alexandra Gomez-Villa, Joost van de Weijer, Antonio Carta

Active learning (AL) is a training paradigm for selecting unlabeled samples for annotation to improve model performance on a test set, which is useful when only a limited number of samples can be annotated. These algorithms often work by…

Computation and Language · Computer Science 2026-04-13 Lorenzo Jaime Yu Flores, Cesare Spinoso di-Piano, Ori Ernst, David Ifeoluwa Adelani, Jackie Chi Kit Cheung

Multi-task learning (MTL) has recently contributed to learning better representations in service of various NLP tasks. MTL aims at improving the performance of a primary task, by jointly training on a secondary task. This paper introduces…

Machine Learning · Computer Science 2017-09-21 Davis Liang, Yan Shu