Related papers: Multilingual Text Representation

A Comprehensive Survey on Word Representation Models: From Classical to State-Of-The-Art Word Representation Language Models

Word representation has always been an important research area in the history of natural language processing (NLP). Understanding such complex text data is imperative, given that it is rich in information and can be used widely across…

Computation and Language · Computer Science 2020-11-10 Usman Naseem , Imran Razzak , Shah Khalid Khan , Mukesh Prasad

Semantic Representations of Word Senses and Concepts

Representing the semantics of linguistic items in a machine-interpretable form has been a major goal of Natural Language Processing since its earliest days. Among the range of different linguistic items, words have attracted the most…

Computation and Language · Computer Science 2016-08-04 José Camacho-Collados , Ignacio Iacobacci , Roberto Navigli , Mohammad Taher Pilehvar

Multilingual Large Language Model: A Survey of Resources, Taxonomy and Frontiers

Multilingual Large Language Models are capable of using powerful Large Language Models to handle and respond to queries in multiple languages, which achieves remarkable success in multilingual natural language processing tasks. Despite…

Computation and Language · Computer Science 2024-04-09 Libo Qin , Qiguang Chen , Yuhang Zhou , Zhi Chen , Yinghui Li , Lizi Liao , Min Li , Wanxiang Che , Philip S. Yu

LLM for Everyone: Representing the Underrepresented in Large Language Models

Natural language processing (NLP) has witnessed a profound impact of large language models (LLMs) that excel in a multitude of tasks. However, the limitation of LLMs in multilingual settings, particularly in underrepresented languages,…

Computation and Language · Computer Science 2024-09-24 Samuel Cahyawijaya

Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation

The performance of multilingual pretrained models is highly dependent on the availability of monolingual or parallel text present in a target language. Thus, the majority of the world's languages cannot benefit from recent progress in NLP…

Computation and Language · Computer Science 2022-04-07 Xinyi Wang , Sebastian Ruder , Graham Neubig

Massively Multilingual Neural Machine Translation

Multilingual neural machine translation (NMT) enables training a single model that supports translation from multiple source languages into multiple target languages. In this paper, we push the limits of multilingual NMT in terms of number…

Computation and Language · Computer Science 2019-07-03 Roee Aharoni , Melvin Johnson , Orhan Firat

A Comprehensive Overview of Large Language Models

Large Language Models (LLMs) have recently demonstrated remarkable capabilities in natural language processing tasks and beyond. This success of LLMs has led to a large influx of research contributions in this direction. These works…

Computation and Language · Computer Science 2024-10-18 Humza Naveed , Asad Ullah Khan , Shi Qiu , Muhammad Saqib , Saeed Anwar , Muhammad Usman , Naveed Akhtar , Nick Barnes , Ajmal Mian

A Survey on Large Language Models with Multilingualism: Recent Advances and New Frontiers

The rapid development of Large Language Models (LLMs) demonstrates remarkable multilingual capabilities in natural language processing, attracting global attention in both academia and industry. To mitigate potential discrimination and…

Computation and Language · Computer Science 2025-01-08 Kaiyu Huang , Fengran Mo , Xinyu Zhang , Hongliang Li , You Li , Yuanchi Zhang , Weijian Yi , Yulong Mao , Jinchen Liu , Yuzhuang Xu , Jinan Xu , Jian-Yun Nie , Yang Liu

Survey of different Large Language Model Architectures: Trends, Benchmarks, and Challenges

Large Language Models (LLMs) represent a class of deep learning models adept at understanding natural language and generating coherent responses to various prompts or queries. These models far exceed the complexity of conventional neural…

Machine Learning · Computer Science 2024-12-05 Minghao Shao , Abdul Basit , Ramesh Karri , Muhammad Shafique

Evaluating Transformer-Based Multilingual Text Classification

As NLP tools become ubiquitous in today's technological landscape, they are increasingly applied to languages with a variety of typological structures. However, NLP research does not focus primarily on typological differences in its…

Computation and Language · Computer Science 2020-05-04 Sophie Groenwold , Samhita Honnavalli , Lily Ou , Aesha Parekh , Sharon Levy , Diba Mirza , William Yang Wang

How does a Multilingual LM Handle Multiple Languages?

Multilingual language models have significantly advanced due to rapid progress in natural language processing. Models like BLOOM 1.7B, trained on diverse multilingual datasets, aim to bridge linguistic gaps. However, their effectiveness in…

Computation and Language · Computer Science 2026-02-03 Santhosh Kakarla , Gautama Shastry Bulusu Venkata , Aishwarya Gaddam , Maheedhar Sai Omtri Mohan

Language Model Behavior: A Comprehensive Survey

Transformer language models have received widespread public attention, yet their generated text is often surprising even to NLP researchers. In this survey, we discuss over 250 recent studies of English language model behavior before…

Computation and Language · Computer Science 2023-08-29 Tyler A. Chang , Benjamin K. Bergen

The Less the Merrier? Investigating Language Representation in Multilingual Models

Multilingual Language Models offer a way to incorporate multiple languages in one model and utilize cross-language transfer learning to improve performance for different Natural Language Processing (NLP) tasks. Despite progress in…

Computation and Language · Computer Science 2023-10-23 Hellina Hailu Nigatu , Atnafu Lambebo Tonja , Jugal Kalita

The Call for Socially Aware Language Technologies

Language technologies have made enormous progress, especially with the introduction of large language models (LLMs). On traditional tasks such as machine translation and sentiment analysis, these models perform at near-human level. These…

Computation and Language · Computer Science 2025-02-24 Diyi Yang , Dirk Hovy , David Jurgens , Barbara Plank

What are the Goals of Distributional Semantics?

Distributional semantic models have become a mainstay in NLP, providing useful features for downstream tasks. However, assessing long-term progress requires explicit long-term goals. In this paper, I take a broad linguistic perspective,…

Computation and Language · Computer Science 2020-05-07 Guy Emerson

Multi-Step Reasoning with Large Language Models, a Survey

Large language models (LLMs) with billions of parameters exhibit in-context learning abilities, enabling few-shot learning on tasks that the model was not specifically trained for. Traditional models achieve breakthrough performance on…

Artificial Intelligence · Computer Science 2025-11-04 Aske Plaat , Annie Wong , Suzan Verberne , Joost Broekens , Niki van Stein , Thomas Back

Since the Scientific Literature Is Multilingual, Our Models Should Be Too

English has long been assumed the $\textit{lingua franca}$ of scientific research, and this notion is reflected in the natural language processing (NLP) research involving scientific document representation. In this position piece, we…

Computation and Language · Computer Science 2024-03-28 Abteen Ebrahimi , Kenneth Church

Probing LLMs for Multilingual Discourse Generalization Through a Unified Label Set

Discourse understanding is essential for many NLP tasks, yet most existing work remains constrained by framework-dependent discourse representations. This work investigates whether large language models (LLMs) capture discourse knowledge…

Computation and Language · Computer Science 2025-06-05 Florian Eichin , Yang Janet Liu , Barbara Plank , Michael A. Hedderich

Interactively Providing Explanations for Transformer Language Models

Transformer language models are state of the art in a multitude of NLP tasks. Despite these successes, their opaqueness remains problematic. Recent methods aiming to provide interpretability and explainability to black-box models primarily…

Computation and Language · Computer Science 2022-03-14 Felix Friedrich , Patrick Schramowski , Christopher Tauchmann , Kristian Kersting

History, Development, and Principles of Large Language Models-An Introductory Survey

Language models serve as a cornerstone in natural language processing (NLP), utilizing mathematical methods to generalize language laws and knowledge for prediction and generation. Over extensive research spanning decades, language modeling…

Computation and Language · Computer Science 2024-09-24 Zichong Wang , Zhibo Chu , Thang Viet Doan , Shiwen Ni , Min Yang , Wenbin Zhang