Related papers: Multi-task Learning for Low-resource Second Langua…

Meta Auxiliary Learning for Low-resource Spoken Language Understanding

Spoken language understanding (SLU) treats automatic speech recognition (ASR) and natural language understanding (NLU) as a unified task and usually suffers from data scarcity. We exploit an ASR and NLU joint training method based on meta…

Audio and Speech Processing · Electrical Eng. & Systems 2022-06-28 Yingying Gao , Junlan Feng , Chao Deng , Shilei Zhang

Project MOSLA: Recording Every Moment of Second Language Acquisition

Second language acquisition (SLA) is a complex and dynamic process. Many SLA studies that have attempted to record and analyze this process have typically focused on a single modality (e.g., textual output of learners), covered only a short…

Computation and Language · Computer Science 2024-03-27 Masato Hagiwara , Joshua Tanner

Session-Level Spoken Language Assessment with a Multimodal Foundation Model via Multi-Target Learning

Spoken Language Assessment (SLA) estimates a learner's oral proficiency from spontaneous speech. The growing population of L2 English speakers has intensified the demand for reliable SLA, a critical component of Computer Assisted Language…

Computation and Language · Computer Science 2025-09-22 Hong-Yun Lin , Jhen-Ke Lin , Chung-Chun Wang , Hao-Chien Lu , Berlin Chen

Multitask Learning for Low Resource Spoken Language Understanding

We explore the benefits that multitask learning offer to speech processing as we train models on dual objectives with automatic speech recognition and intent classification or sentiment classification. Our models, although being of modest…

Computation and Language · Computer Science 2022-11-28 Quentin Meeus , Marie-Francine Moens , Hugo Van hamme

Modelling Latent Skills for Multitask Language Generation

We present a generative model for multitask conditional language generation. Our guiding hypothesis is that a shared set of latent skills underlies many disparate language generation tasks, and that explicitly modelling these skills in a…

Computation and Language · Computer Science 2020-02-25 Kris Cao , Dani Yogatama

LESA: Learnable LLM Layer Scaling-Up

Training Large Language Models (LLMs) from scratch requires immense computational resources, making it prohibitively expensive. Model scaling-up offers a promising solution by leveraging the parameters of smaller models to create larger…

Machine Learning · Computer Science 2025-02-20 Yifei Yang , Zouying Cao , Xinbei Ma , Yao Yao , Libo Qin , Zhi Chen , Hai Zhao

How to Learn a New Language? An Efficient Solution for Self-Supervised Learning Models Unseen Languages Adaption in Low-Resource Scenario

The utilization of speech Self-Supervised Learning (SSL) models achieves impressive performance on Automatic Speech Recognition (ASR). However, in low-resource language ASR, they encounter the domain mismatch problem between pre-trained and…

Sound · Computer Science 2025-01-07 Shih-Heng Wang , Zih-Ching Chen , Jiatong Shi , Ming-To Chuang , Guan-Ting Lin , Kuan-Po Huang , David Harwath , Shang-Wen Li , Hung-yi Lee

Investigating Meta-Learning Algorithms for Low-Resource Natural Language Understanding Tasks

Learning general representations of text is a fundamental problem for many natural language understanding (NLU) tasks. Previously, researchers have proposed to use language model pre-training and multi-task learning to learn robust…

Computation and Language · Computer Science 2019-08-29 Zi-Yi Dou , Keyi Yu , Antonios Anastasopoulos

Multi-task Sparse Structure Learning

Multi-task learning (MTL) aims to improve generalization performance by learning multiple related tasks simultaneously. While sometimes the underlying task relationship structure is known, often the structure needs to be estimated from data…

Machine Learning · Computer Science 2017-01-31 Andre R. Goncalves , Puja Das , Soumyadeep Chatterjee , Vidyashankar Sivakumar , Fernando J. Von Zuben , Arindam Banerjee

Meta Learning for End-to-End Low-Resource Speech Recognition

In this paper, we proposed to apply meta learning approach for low-resource automatic speech recognition (ASR). We formulated ASR for different languages as different tasks, and meta-learned the initialization parameters from many…

Sound · Computer Science 2019-10-29 Jui-Yang Hsu , Yuan-Jui Chen , Hung-yi Lee

Learning Downstream Task by Selectively Capturing Complementary Knowledge from Multiple Self-supervisedly Learning Pretexts

Self-supervised learning (SSL), as a newly emerging unsupervised representation learning paradigm, generally follows a two-stage learning pipeline: 1) learning invariant and discriminative representations with auto-annotation pretext(s),…

Machine Learning · Computer Science 2022-08-23 Jiayu Yao , Qingyuan Wu , Quan Feng , Songcan Chen

Code-Switching Language Modeling using Syntax-Aware Multi-Task Learning

Lack of text data has been the major issue on code-switching language modeling. In this paper, we introduce multi-task learning based language model which shares syntax representation of languages to leverage linguistic information and…

Computation and Language · Computer Science 2018-10-05 Genta Indra Winata , Andrea Madotto , Chien-Sheng Wu , Pascale Fung

A Novel Approach for Effective Learning in Low Resourced Scenarios

Deep learning based discriminative methods, being the state-of-the-art machine learning techniques, are ill-suited for learning from lower amounts of data. In this paper, we propose a novel framework, called simultaneous two sample learning…

Computation and Language · Computer Science 2017-12-18 Sri Harsha Dumpala , Rupayan Chakraborty , Sunil Kumar Kopparapu

Enhancing Linguistic Competence of Language Models through Pre-training with Language Learning Tasks

Language models (LMs) are pre-trained on raw text datasets to generate text sequences token-by-token. While this approach facilitates the learning of world knowledge and reasoning, it does not explicitly optimize for linguistic competence.…

Computation and Language · Computer Science 2026-04-17 Atsuki Yamaguchi , Maggie Mi , Nikolaos Aletras

Cascading Adaptors to Leverage English Data to Improve Performance of Question Answering for Low-Resource Languages

Transformer based architectures have shown notable results on many down streaming tasks including question answering. The availability of data, on the other hand, impedes obtaining legitimate performance for low-resource languages. In this…

Computation and Language · Computer Science 2024-09-04 Hariom A. Pandya , Bhavik Ardeshna , Brijesh S. Bhatt

ProST: Progressive Sub-task Training for Pareto-Optimal Multi-agent Systems Using Small Language Models

Multi-agent systems with smaller language models (SLMs) present a viable alternative to single agent systems powered by large language models (LLMs) for addressing complex problems. In this work, we study how these alternatives compare in…

Computation and Language · Computer Science 2025-11-12 Biddut Sarker Bijoy , Mohammad Saqib Hasan , Pegah Alipoormolabashi , Avirup Sil , Aruna Balasubramanian , Niranjan Balasubramanian

Pay Attention to What You Need

Although large language models (LLMs) have achieved significant success in natural language processing, they still struggle with long-context comprehension. Traditional approaches to mitigating this issue typically rely on fine-tuning or…

Computation and Language · Computer Science 2025-02-25 Yifei Gao , Shaohong Chen , Lei Wang , Ruiting Dai , Ziyun Zhang , Kerui Ren , Jiaji Wu , Jun Cheng

CLA: Latent Alignment for Online Continual Self-Supervised Learning

Self-supervised learning (SSL) is able to build latent representations that generalize well to unseen data. However, only a few SSL techniques exist for the online CL setting, where data arrives in small minibatches, the model must comply…

Machine Learning · Computer Science 2025-07-16 Giacomo Cignoni , Andrea Cossu , Alexandra Gomez-Villa , Joost van de Weijer , Antonio Carta

Testing the Assumptions of Active Learning for Translation Tasks with Few Samples

Active learning (AL) is a training paradigm for selecting unlabeled samples for annotation to improve model performance on a test set, which is useful when only a limited number of samples can be annotated. These algorithms often work by…

Computation and Language · Computer Science 2026-04-13 Lorenzo Jaime Yu Flores , Cesare Spinoso di-Piano , Ori Ernst , David Ifeoluwa Adelani , Jackie Chi Kit Cheung

Deep Automated Multi-task Learning

Multi-task learning (MTL) has recently contributed to learning better representations in service of various NLP tasks. MTL aims at improving the performance of a primary task, by jointly training on a secondary task. This paper introduces…

Machine Learning · Computer Science 2017-09-21 Davis Liang , Yan Shu