Related papers: Language Modelling as a Multi-Task Problem

Multitask and Multilingual Modelling for Lexical Analysis

In Natural Language Processing (NLP), one traditionally considers a single task (e.g. part-of-speech tagging) for a single language (e.g. English) at a time. However, recent work has shown that it can be beneficial to take advantage of…

Computation and Language · Computer Science 2018-09-10 Johannes Bjerva

Interpretability of Language Models via Task Spaces

The usual way to interpret language models (LMs) is to test their performance on different benchmarks and subsequently infer their internal processes. In this paper, we present an alternative approach, concentrating on the quality of LM…

Computation and Language · Computer Science 2024-06-11 Lucas Weber, Jaap Jumelet, Elia Bruni, Dieuwke Hupkes

Establishing Trustworthiness: Rethinking Tasks and Model Evaluation

Language understanding is a multi-faceted cognitive capability, which the Natural Language Processing (NLP) community has striven to model computationally for decades. Traditionally, facets of linguistic intelligence have been…

Computation and Language · Computer Science 2023-10-24 Robert Litschko, Max Müller-Eberstein, Rob van der Goot, Leon Weber, Barbara Plank

One Model to Rule them all: Multitask and Multilingual Modelling for Lexical Analysis

When learning a new skill, you take advantage of your preexisting skills and knowledge. For instance, if you are a skilled violinist, you will likely have an easier time learning to play cello. Similarly, when learning a new language you…

Computation and Language · Computer Science 2017-11-06 Johannes Bjerva

Meta-Learning for Effective Multi-task and Multilingual Modelling

Natural language processing (NLP) tasks (e.g. question-answering in English) benefit from knowledge of other tasks (e.g. named entity recognition in English) and knowledge of other languages (e.g. question-answering in Spanish). Such shared…

Computation and Language · Computer Science 2021-03-23 Ishan Tarunesh, Sushil Khyalia, Vishwajeet Kumar, Ganesh Ramakrishnan, Preethi Jyothi

Do Language Models Understand Anything? On the Ability of LSTMs to Understand Negative Polarity Items

In this paper, we attempt to link the inner workings of a neural language model to linguistic theory, focusing on a complex phenomenon well discussed in formal linguistics: (negative) polarity items. We briefly discuss the leading…

Computation and Language · Computer Science 2018-09-03 Jaap Jumelet, Dieuwke Hupkes

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences. Nevertheless, it is still not understood…

Computation and Language · Computer Science 2023-02-16 Zhihan Zhang, Wenhao Yu, Mengxia Yu, Zhichun Guo, Meng Jiang

Multi-Task Learning in Natural Language Processing: An Overview

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffers from overfitting and data scarcity problems that are pervasive in NLP tasks.…

Artificial Intelligence · Computer Science 2024-04-30 Shijie Chen, Yu Zhang, Qiang Yang

A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives

Pretrained language models (PLMs) display impressive performances and have captured the attention of the NLP community. Establishing best practices in pretraining has, therefore, become a major focus of NLP research, especially since…

Computation and Language · Computer Science 2024-10-08 Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann

Multi-task Learning of Negation and Speculation for Targeted Sentiment Classification

The majority of work in targeted sentiment analysis has concentrated on finding better methods to improve the overall results. Within this paper we show that these models are not robust to linguistic phenomena, specifically negation and…

Computation and Language · Computer Science 2021-04-01 Andrew Moore, Jeremy Barnes

Multi-Task Learning for Speaker-Role Adaptation in Neural Conversation Models

Building a persona-based conversation agent is challenging owing to the lack of large amounts of speaker-specific conversation data for model training. This paper addresses the problem by proposing a multi-task learning approach to training…

Computation and Language · Computer Science 2017-10-23 Yi Luan, Chris Brockett, Bill Dolan, Jianfeng Gao, Michel Galley

Is Translation All You Need? A Study on Solving Multilingual Tasks with Large Language Models

Large language models (LLMs) have demonstrated multilingual capabilities, yet they are mostly English-centric due to the imbalanced training corpora. While prior works have leveraged this bias to enhance multilingual performance through…

Computation and Language · Computer Science 2025-04-22 Chaoqun Liu, Wenxuan Zhang, Yiran Zhao, Anh Tuan Luu, Lidong Bing

Unifying Molecular and Textual Representations via Multi-task Language Modelling

The recent advances in neural language models have also been successfully applied to the field of chemistry, offering generative solutions for classical problems in molecular design and synthesis planning. These new methods have the…

Machine Learning · Computer Science 2023-05-19 Dimitrios Christofidellis, Giorgio Giannone, Jannis Born, Ole Winther, Teodoro Laino, Matteo Manica

An Empirical Study on Crosslingual Transfer in Probabilistic Topic Models

Probabilistic topic modeling is a popular choice as the first step of crosslingual tasks to enable knowledge transfer and extract multilingual features. While many multilingual topic models have been developed, their assumptions on the…

Computation and Language · Computer Science 2019-06-11 Shudong Hao, Michael J. Paul

Language Model Behavior: A Comprehensive Survey

Transformer language models have received widespread public attention, yet their generated text is often surprising even to NLP researchers. In this survey, we discuss over 250 recent studies of English language model behavior before…

Computation and Language · Computer Science 2023-08-29 Tyler A. Chang, Benjamin K. Bergen

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science 2017-07-25 Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher

12-in-1: Multi-Task Vision and Language Representation Learning

Much of vision-and-language research focuses on a small but diverse set of independent tasks and supporting datasets often studied in isolation; however, the visually-grounded language understanding skills required for success at these…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee

Disentangling Dialect from Social Bias via Multitask Learning to Improve Fairness

Dialects introduce syntactic and lexical variations in language that occur in regional or social groups. Most NLP methods are not sensitive to such variations. This may lead to unfair behavior of the methods, conveying negative bias towards…

Computation and Language · Computer Science 2024-06-17 Maximilian Spliethöver, Sai Nikhil Menon, Henning Wachsmuth

Social Bias in Multilingual Language Models: A Survey

Pretrained multilingual models exhibit the same social bias as models processing English texts. This systematic review analyzes emerging research that extends bias evaluation and mitigation approaches into multilingual and non-English…

Computation and Language · Computer Science 2025-09-08 Lance Calvin Lim Gamboa, Yue Feng, Mark Lee

Training Models on Dialects of Translationese Shows How Lexical Diversity and Source-Target Syntactic Similarity Shape Learning

Machine-translated data is widely used in multilingual NLP, particularly when native text is scarce. However, translated text differs systematically from native text. This phenomenon is known as translationese, and it reflects both traces…

Computation and Language · Computer Science 2026-02-19 Jenny Kunz