Related papers: Contextual Code Switching for Machine Translation …

Multilingual Large Language Models Are Not (Yet) Code-Switchers

Multilingual Large Language Models (LLMs) have recently shown great capabilities in a wide range of tasks, exhibiting state-of-the-art performance through zero-shot or few-shot prompting methods. While there have been extensive studies on…

Computation and Language · Computer Science 2023-10-24 Ruochen Zhang , Samuel Cahyawijaya , Jan Christian Blaise Cruz , Genta Indra Winata , Alham Fikri Aji

Adapting Large Language Models for Document-Level Machine Translation

Large language models (LLMs) have significantly advanced various natural language processing (NLP) tasks. Recent research indicates that moderately-sized LLMs often outperform larger ones after task-specific fine-tuning. This study focuses…

Computation and Language · Computer Science 2024-10-14 Minghao Wu , Thuy-Trang Vu , Lizhen Qu , George Foster , Gholamreza Haffari

Exploring the Capabilities of LLMs for Code Change Related Tasks

Developers deal with code-change-related tasks daily, e.g., reviewing code. Pre-trained code and code-change-oriented models have been adapted to help developers with such tasks. Recently, large language models (LLMs) have shown their…

Software Engineering · Computer Science 2024-07-04 Lishui Fan , Jiakun Liu , Zhongxin Liu , David Lo , Xin Xia , Shanping Li

LLMs Are Zero-Shot Context-Aware Simultaneous Translators

The advent of transformers has fueled progress in machine translation. More recently large language models (LLMs) have come to the spotlight thanks to their generality and strong performance in a wide range of language tasks, including…

Computation and Language · Computer Science 2024-06-26 Roman Koshkin , Katsuhito Sudoh , Satoshi Nakamura

Exploring Large Language Models for Code Explanation

Automating code documentation through explanatory text can prove highly beneficial in code understanding. Large Language Models (LLMs) have made remarkable strides in Natural Language Processing, especially within software engineering tasks…

Software Engineering · Computer Science 2023-10-26 Paheli Bhattacharya , Manojit Chakraborty , Kartheek N S N Palepu , Vikas Pandey , Ishan Dindorkar , Rakesh Rajpurohit , Rishabh Gupta

Multilingual Machine Translation with Large Language Models: Empirical Results and Analysis

Large language models (LLMs) have demonstrated remarkable potential in handling multilingual machine translation (MMT). In this paper, we systematically investigate the advantages and challenges of LLMs for MMT by answering two questions:…

Computation and Language · Computer Science 2024-06-17 Wenhao Zhu , Hongyi Liu , Qingxiu Dong , Jingjing Xu , Shujian Huang , Lingpeng Kong , Jiajun Chen , Lei Li

The Impact of Model Scaling on Seen and Unseen Language Performance

The rapid advancement of Large Language Models (LLMs), particularly those trained on multilingual corpora, has intensified the need for a deeper understanding of their performance across a diverse range of languages and model sizes. Our…

Computation and Language · Computer Science 2025-01-13 Rhitabrat Pokharel , Sina Bagheri Nezhad , Ameeta Agrawal , Suresh Singh

Adaptive Machine Translation with Large Language Models

Consistency is a key requirement of high-quality translation. It is especially important to adhere to pre-approved terminology and adapt to corrected translations in domain-specific projects. Machine translation (MT) has achieved…

Computation and Language · Computer Science 2023-05-10 Yasmin Moslem , Rejwanul Haque , John D. Kelleher , Andy Way

What do Large Language Models Need for Machine Translation Evaluation?

Leveraging large language models (LLMs) for various natural language processing tasks has led to superlative claims about their performance. For the evaluation of machine translation (MT), existing research shows that LLMs are able to…

Computation and Language · Computer Science 2024-10-10 Shenbin Qian , Archchana Sindhujan , Minnie Kabra , Diptesh Kanojia , Constantin Orăsan , Tharindu Ranasinghe , Frédéric Blain

Language Models can Exploit Cross-Task In-context Learning for Data-Scarce Novel Tasks

Large Language Models (LLMs) have transformed NLP with their remarkable In-context Learning (ICL) capabilities. Automated assistants based on LLMs are gaining popularity; however, adapting them to novel tasks is still challenging. While…

Computation and Language · Computer Science 2024-06-13 Anwoy Chatterjee , Eshaan Tanwar , Subhabrata Dutta , Tanmoy Chakraborty

How Multilingual Are Large Language Models Fine-Tuned for Translation?

A new paradigm for machine translation has recently emerged: fine-tuning large language models (LLM) on parallel text has been shown to outperform dedicated translation systems trained in a supervised fashion on much larger amounts of…

Computation and Language · Computer Science 2024-06-03 Aquia Richburg , Marine Carpuat

Investigating and Scaling up Code-Switching for Multilingual Language Model Pre-Training

Large language models (LLMs) exhibit remarkable multilingual capabilities despite the extreme language imbalance in the pre-training data. In this paper, we closely examine the reasons behind this phenomenon, focusing on the pre-training…

Computation and Language · Computer Science 2025-04-23 Zhijun Wang , Jiahuan Li , Hao Zhou , Rongxiang Weng , Jingang Wang , Xin Huang , Xue Han , Junlan Feng , Chao Deng , Shujian Huang

Large Language Models for Mental Health: A Multilingual Evaluation

Large Language Models (LLMs) have remarkable capabilities across NLP tasks. However, their performance in multilingual contexts, especially within the mental health domain, has not been thoroughly explored. In this paper, we evaluate…

Computation and Language · Computer Science 2026-02-03 Nishat Raihan , Sadiya Sayara Chowdhury Puspo , Ana-Maria Bucur , Stevie Chancellor , Marcos Zampieri

Context-Aware or Context-Insensitive? Assessing LLMs' Performance in Document-Level Translation

Large language models (LLMs) are increasingly strong contenders in machine translation. In this work, we focus on document-level translation, where some words cannot be translated without context from outside the sentence. Specifically, we…

Computation and Language · Computer Science 2025-02-17 Wafaa Mohammed , Vlad Niculae

When Many-Shot Prompting Fails: An Empirical Study of LLM Code Translation

Large Language Models (LLMs) with vast context windows offer new avenues for in-context learning (ICL), where providing many examples ("many-shot" prompting) is often assumed to enhance performance. We investigate this assumption for the…

Software Engineering · Computer Science 2025-12-10 Amirkia Rafiei Oskooei , Kaan Baturalp Cosdan , Husamettin Isiktas , Mehmet S. Aktas

Enhancing Code Generation for Low-Resource Languages: No Silver Bullet

The advent of Large Language Models (LLMs) has significantly advanced the field of automated code generation. LLMs rely on large and diverse datasets to learn syntax, semantics, and usage patterns of programming languages. For low-resource…

Software Engineering · Computer Science 2025-02-03 Alessandro Giagnorio , Alberto Martin-Lopez , Gabriele Bavota

On the Calibration of Massively Multilingual Language Models

Massively Multilingual Language Models (MMLMs) have recently gained popularity due to their surprising effectiveness in cross-lingual transfer. While there has been much work in evaluating these models for their performance on a variety of…

Computation and Language · Computer Science 2022-10-25 Kabir Ahuja , Sunayana Sitaram , Sandipan Dandapat , Monojit Choudhury

Beyond the Sentence: A Survey on Context-Aware Machine Translation with Large Language Models

Despite the popularity of the large language models (LLMs), their application to machine translation is relatively underexplored, especially in context-aware settings. This work presents a literature review of context-aware translation with…

Computation and Language · Computer Science 2025-06-10 Ramakrishna Appicharla , Baban Gain , Santanu Pal , Asif Ekbal

Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models

The training paradigm for machine translation has gradually shifted, from learning neural machine translation (NMT) models with extensive parallel corpora to instruction finetuning on multilingual large language models (LLMs) with…

Computation and Language · Computer Science 2024-02-08 Pengzhi Gao , Zhongjun He , Hua Wu , Haifeng Wang

Can Large Language Models Transform Computational Social Science?

Large Language Models (LLMs) are capable of successfully performing many language processing tasks zero-shot (without training data). If zero-shot LLMs can also reliably classify and explain social phenomena like persuasiveness and…

Computation and Language · Computer Science 2024-02-27 Caleb Ziems , William Held , Omar Shaikh , Jiaao Chen , Zhehao Zhang , Diyi Yang