Related papers: TranslateGemma Technical Report

A Paradigm Shift in Machine Translation: Boosting Translation Performance of Large Language Models

Generative Large Language Models (LLMs) have achieved remarkable advancements in various NLP tasks. However, these advances have not been reflected in the translation task, especially those with moderate model sizes (i.e., 7B or 13B…

Computation and Language · Computer Science 2024-02-07 Haoran Xu , Young Jin Kim , Amr Sharaf , Hany Hassan Awadalla

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Large language models (LLMs) have shown continuously improving multilingual capabilities, and even small-scale open-source models have demonstrated rapid performance enhancement. In this paper, we systematically explore the abilities of…

Computation and Language · Computer Science 2025-02-25 Menglong Cui , Pengzhi Gao , Wei Liu , Jian Luan , Bin Wang

EmbeddingGemma: Powerful and Lightweight Text Representations

We introduce EmbeddingGemma, a new lightweight, open text embedding model based on the Gemma 3 language model family. Our innovative training recipe strategically captures knowledge from larger models via encoder-decoder initialization and…

Computation and Language · Computer Science 2025-11-04 Henrique Schechter Vera , Sahil Dua , Biao Zhang , Daniel Salz , Ryan Mullins , Sindhu Raghuram Panyam , Sara Smoot , Iftekhar Naim , Joe Zou , Feiyang Chen , Daniel Cer , Alice Lisak , Min Choi , Lucas Gonzalez , Omar Sanseviero , Glenn Cameron , Ian Ballantyne , Kat Black , Kaifeng Chen , Weiyi Wang , Zhe Li , Gus Martins , Jinhyuk Lee , Mark Sherwood , Juyeong Ji , Renjie Wu , Jingxiao Zheng , Jyotinder Singh , Abheesht Sharma , Divyashree Sreepathihalli , Aashi Jain , Adham Elarabawy , AJ Co , Andreas Doumanoglou , Babak Samari , Ben Hora , Brian Potetz , Dahun Kim , Enrique Alfonseca , Fedor Moiseev , Feng Han , Frank Palma Gomez , Gustavo Hernández Ábrego , Hesen Zhang , Hui Hui , Jay Han , Karan Gill , Ke Chen , Koert Chen , Madhuri Shanbhogue , Michael Boratko , Paul Suganthan , Sai Meher Karthik Duddu , Sandeep Mariserla , Setareh Ariafar , Shanfeng Zhang , Shijie Zhang , Simon Baumgartner , Sonam Goenka , Steve Qiu , Tanmaya Dabral , Trevor Walker , Vikram Rao , Waleed Khawaja , Wenlei Zhou , Xiaoqi Ren , Ye Xia , Yichang Chen , Yi-Ting Chen , Zhe Dong , Zhongli Ding , Francesco Visin , Gaël Liu , Jiageng Zhang , Kathleen Kenealy , Michelle Casbon , Ravin Kumar , Thomas Mesnard , Zach Gleicher , Cormac Brick , Olivier Lacombe , Adam Roberts , Qin Yin , Yunhsuan Sung , Raphael Hoffmann , Tris Warkentin , Armand Joulin , Tom Duerig , Mojtaba Seyedhosseini

Gemma 3 Technical Report

We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer…

Computation and Language · Computer Science 2025-03-26 Gemma Team , Aishwarya Kamath , Johan Ferret , Shreya Pathak , Nino Vieillard , Ramona Merhej , Sarah Perrin , Tatiana Matejovicova , Alexandre Ramé , Morgane Rivière , Louis Rouillard , Thomas Mesnard , Geoffrey Cideron , Jean-bastien Grill , Sabela Ramos , Edouard Yvinec , Michelle Casbon , Etienne Pot , Ivo Penchev , Gaël Liu , Francesco Visin , Kathleen Kenealy , Lucas Beyer , Xiaohai Zhai , Anton Tsitsulin , Robert Busa-Fekete , Alex Feng , Noveen Sachdeva , Benjamin Coleman , Yi Gao , Basil Mustafa , Iain Barr , Emilio Parisotto , David Tian , Matan Eyal , Colin Cherry , Jan-Thorsten Peter , Danila Sinopalnikov , Surya Bhupatiraju , Rishabh Agarwal , Mehran Kazemi , Dan Malkin , Ravin Kumar , David Vilar , Idan Brusilovsky , Jiaming Luo , Andreas Steiner , Abe Friesen , Abhanshu Sharma , Abheesht Sharma , Adi Mayrav Gilady , Adrian Goedeckemeyer , Alaa Saade , Alex Feng , Alexander Kolesnikov , Alexei Bendebury , Alvin Abdagic , Amit Vadi , András György , André Susano Pinto , Anil Das , Ankur Bapna , Antoine Miech , Antoine Yang , Antonia Paterson , Ashish Shenoy , Ayan Chakrabarti , Bilal Piot , Bo Wu , Bobak Shahriari , Bryce Petrini , Charlie Chen , Charline Le Lan , Christopher A. Choquette-Choo , CJ Carey , Cormac Brick , Daniel Deutsch , Danielle Eisenbud , Dee Cattle , Derek Cheng , Dimitris Paparas , Divyashree Shivakumar Sreepathihalli , Doug Reid , Dustin Tran , Dustin Zelle , Eric Noland , Erwin Huizenga , Eugene Kharitonov , Frederick Liu , Gagik Amirkhanyan , Glenn Cameron , Hadi Hashemi , Hanna Klimczak-Plucińska , Harman Singh , Harsh Mehta , Harshal Tushar Lehri , Hussein Hazimeh , Ian Ballantyne , Idan Szpektor , Ivan Nardini , Jean Pouget-Abadie , Jetha Chan , Joe Stanton , John Wieting , Jonathan Lai , Jordi Orbay , Joseph Fernandez , Josh Newlan , Ju-yeong Ji , Jyotinder Singh , Kat Black , Kathy Yu , Kevin Hui , Kiran Vodrahalli , Klaus Greff , Linhai Qiu , Marcella Valentine , Marina Coelho , Marvin Ritter , Matt Hoffman , Matthew Watson , Mayank Chaturvedi , Michael Moynihan , Min Ma , Nabila Babar , Natasha Noy , Nathan Byrd , Nick Roy , Nikola Momchev , Nilay Chauhan , Noveen Sachdeva , Oskar Bunyan , Pankil Botarda , Paul Caron , Paul Kishan Rubenstein , Phil Culliton , Philipp Schmid , Pier Giuseppe Sessa , Pingmei Xu , Piotr Stanczyk , Pouya Tafti , Rakesh Shivanna , Renjie Wu , Renke Pan , Reza Rokni , Rob Willoughby , Rohith Vallu , Ryan Mullins , Sammy Jerome , Sara Smoot , Sertan Girgin , Shariq Iqbal , Shashir Reddy , Shruti Sheth , Siim Põder , Sijal Bhatnagar , Sindhu Raghuram Panyam , Sivan Eiger , Susan Zhang , Tianqi Liu , Trevor Yacovone , Tyler Liechty , Uday Kalra , Utku Evci , Vedant Misra , Vincent Roseberry , Vlad Feinberg , Vlad Kolesnikov , Woohyun Han , Woosuk Kwon , Xi Chen , Yinlam Chow , Yuvein Zhu , Zichuan Wei , Zoltan Egyed , Victor Cotruta , Minh Giang , Phoebe Kirk , Anand Rao , Kat Black , Nabila Babar , Jessica Lo , Erica Moreira , Luiz Gustavo Martins , Omar Sanseviero , Lucas Gonzalez , Zach Gleicher , Tris Warkentin , Vahab Mirrokni , Evan Senter , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , Yossi Matias , D. Sculley , Slav Petrov , Noah Fiedel , Noam Shazeer , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Jean-Baptiste Alayrac , Rohan Anil , Dmitry , Lepikhin , Sebastian Borgeaud , Olivier Bachem , Armand Joulin , Alek Andreev , Cassidy Hardin , Robert Dadashi , Léonard Hussenot

Large Language Models Are State-of-the-Art Evaluators of Translation Quality

We describe GEMBA, a GPT-based metric for assessment of translation quality, which works both with a reference translation and without. In our evaluation, we focus on zero-shot prompting, comparing four prompt variants in two modes, based…

Computation and Language · Computer Science 2023-06-02 Tom Kocmi , Christian Federmann

Scaling Model and Data for Multilingual Machine Translation with Open Large Language Models

Open large language models (LLMs) have demonstrated improving multilingual capabilities in recent years. In this paper, we present a study of open LLMs for multilingual machine translation (MT) across a range of languages, and investigate…

Computation and Language · Computer Science 2026-02-26 Yuzhe Shang , Pengzhi Gao , Wei Liu , Jian Luan , Jinsong Su

MetricX-25 and GemSpanEval: Google Translate Submissions to the WMT25 Evaluation Shared Task

In this paper, we present our submissions to the unified WMT25 Translation Evaluation Shared Task. For the Quality Score Prediction subtask, we create a new generation of MetricX with improvements in the input format and the training…

Computation and Language · Computer Science 2025-10-29 Juraj Juraska , Tobias Domhan , Mara Finkelstein , Tetsuji Nakagawa , Geza Kovacs , Daniel Deutsch , Pidong Wang , Markus Freitag

How Multilingual Are Large Language Models Fine-Tuned for Translation?

A new paradigm for machine translation has recently emerged: fine-tuning large language models (LLM) on parallel text has been shown to outperform dedicated translation systems trained in a supervised fashion on much larger amounts of…

Computation and Language · Computer Science 2024-06-03 Aquia Richburg , Marine Carpuat

A Novel Paradigm Boosting Translation Capabilities of Large Language Models

This paper presents a study on strategies to enhance the translation capabilities of large language models (LLMs) in the context of machine translation (MT) tasks. The paper proposes a novel paradigm consisting of three stages: Secondary…

Computation and Language · Computer Science 2024-04-16 Jiaxin Guo , Hao Yang , Zongyao Li , Daimeng Wei , Hengchao Shang , Xiaoyu Chen

LLaMAX2: Your Translation-Enhanced Model also Performs Well in Reasoning

General Large Language Models (LLMs) excel in reasoning, but those enhanced for translation struggle with reasoning tasks. To address this, we propose a novel translationenhanced recipe that begins with instruct models and applies…

Computation and Language · Computer Science 2025-10-13 Changjiang Gao , Zixian Huang , Jingyang Gong , Shujian Huang , Lei Li , Fei Yuan

LuxMT Technical Report

We introduce LuxMT, a machine translation system based on Gemma 3 27B and fine-tuned for translation from Luxembourgish (LB) into French (FR) and English (EN). To assess translation performance, we construct a novel benchmark covering…

Computation and Language · Computer Science 2026-02-18 Nils Rehlinger

Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level

General-purpose Large Language Models (LLMs) like GPT-4 have achieved remarkable advancements in machine translation (MT) by leveraging extensive web content. On the other hand, translation-specific LLMs are built by pre-training on…

Computation and Language · Computer Science 2024-10-30 Zhaopeng Feng , Ruizhe Chen , Yan Zhang , Zijie Meng , Zuozhu Liu

Translating Step-by-Step: Decomposing the Translation Process for Improved Translation Quality of Long-Form Texts

In this paper we present a step-by-step approach to long-form text translation, drawing on established processes in translation studies. Instead of viewing machine translation as a single, monolithic task, we propose a framework that…

Computation and Language · Computer Science 2024-09-12 Eleftheria Briakou , Jiaming Luo , Colin Cherry , Markus Freitag

CodeGemma: Open Code Models Based on Gemma

This paper introduces CodeGemma, a collection of specialized open code models built on top of Gemma, capable of a variety of code and natural language generation tasks. We release three model variants. CodeGemma 7B pretrained (PT) and…

Computation and Language · Computer Science 2024-06-21 CodeGemma Team , Heri Zhao , Jeffrey Hui , Joshua Howland , Nam Nguyen , Siqi Zuo , Andrea Hu , Christopher A. Choquette-Choo , Jingyue Shen , Joe Kelley , Kshitij Bansal , Luke Vilnis , Mateo Wirth , Paul Michel , Peter Choy , Pratik Joshi , Ravin Kumar , Sarmad Hashmi , Shubham Agrawal , Zhitao Gong , Jane Fine , Tris Warkentin , Ale Jakse Hartman , Bin Ni , Kathy Korevec , Kelly Schaefer , Scott Huffman

Domain Terminology Integration into Machine Translation: Leveraging Large Language Models

This paper discusses the methods that we used for our submissions to the WMT 2023 Terminology Shared Task for German-to-English (DE-EN), English-to-Czech (EN-CS), and Chinese-to-English (ZH-EN) language pairs. The task aims to advance…

Computation and Language · Computer Science 2025-03-04 Yasmin Moslem , Gianfranco Romani , Mahdi Molaei , Rejwanul Haque , John D. Kelleher , Andy Way

The Fine-Tuning Paradox: Boosting Translation Quality Without Sacrificing LLM Abilities

Fine-tuning large language models (LLMs) for machine translation has shown improvements in overall translation quality. However, it is unclear what is the impact of fine-tuning on desirable LLM behaviors that are not present in neural…

Computation and Language · Computer Science 2024-08-07 David Stap , Eva Hasler , Bill Byrne , Christof Monz , Ke Tran

Beyond Triplet: Leveraging the Most Data for Multimodal Machine Translation

Multimodal machine translation (MMT) aims to improve translation quality by incorporating information from other modalities, such as vision. Previous MMT systems mainly focus on better access and use of visual information and tend to…

Computation and Language · Computer Science 2023-09-06 Yaoming Zhu , Zewei Sun , Shanbo Cheng , Luyang Huang , Liwei Wu , Mingxuan Wang

GenTranslate: Large Language Models are Generative Multilingual Speech and Machine Translators

Recent advances in large language models (LLMs) have stepped forward the development of multilingual speech and machine translation by its reduced representation errors and incorporated external knowledge. However, both translation tasks…

Computation and Language · Computer Science 2024-05-17 Yuchen Hu , Chen Chen , Chao-Han Huck Yang , Ruizhe Li , Dong Zhang , Zhehuai Chen , Eng Siong Chng

Gemma 2: Improving Open Language Models at a Practical Size

In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical…

Computation and Language · Computer Science 2024-10-03 Gemma Team , Morgane Riviere , Shreya Pathak , Pier Giuseppe Sessa , Cassidy Hardin , Surya Bhupatiraju , Léonard Hussenot , Thomas Mesnard , Bobak Shahriari , Alexandre Ramé , Johan Ferret , Peter Liu , Pouya Tafti , Abe Friesen , Michelle Casbon , Sabela Ramos , Ravin Kumar , Charline Le Lan , Sammy Jerome , Anton Tsitsulin , Nino Vieillard , Piotr Stanczyk , Sertan Girgin , Nikola Momchev , Matt Hoffman , Shantanu Thakoor , Jean-Bastien Grill , Behnam Neyshabur , Olivier Bachem , Alanna Walton , Aliaksei Severyn , Alicia Parrish , Aliya Ahmad , Allen Hutchison , Alvin Abdagic , Amanda Carl , Amy Shen , Andy Brock , Andy Coenen , Anthony Laforge , Antonia Paterson , Ben Bastian , Bilal Piot , Bo Wu , Brandon Royal , Charlie Chen , Chintu Kumar , Chris Perry , Chris Welty , Christopher A. Choquette-Choo , Danila Sinopalnikov , David Weinberger , Dimple Vijaykumar , Dominika Rogozińska , Dustin Herbison , Elisa Bandy , Emma Wang , Eric Noland , Erica Moreira , Evan Senter , Evgenii Eltyshev , Francesco Visin , Gabriel Rasskin , Gary Wei , Glenn Cameron , Gus Martins , Hadi Hashemi , Hanna Klimczak-Plucińska , Harleen Batra , Harsh Dhand , Ivan Nardini , Jacinda Mein , Jack Zhou , James Svensson , Jeff Stanway , Jetha Chan , Jin Peng Zhou , Joana Carrasqueira , Joana Iljazi , Jocelyn Becker , Joe Fernandez , Joost van Amersfoort , Josh Gordon , Josh Lipschultz , Josh Newlan , Ju-yeong Ji , Kareem Mohamed , Kartikeya Badola , Kat Black , Katie Millican , Keelin McDonell , Kelvin Nguyen , Kiranbir Sodhia , Kish Greene , Lars Lowe Sjoesund , Lauren Usui , Laurent Sifre , Lena Heuermann , Leticia Lago , Lilly McNealus , Livio Baldini Soares , Logan Kilpatrick , Lucas Dixon , Luciano Martins , Machel Reid , Manvinder Singh , Mark Iverson , Martin Görner , Mat Velloso , Mateo Wirth , Matt Davidow , Matt Miller , Matthew Rahtz , Matthew Watson , Meg Risdal , Mehran Kazemi , Michael Moynihan , Ming Zhang , Minsuk Kahng , Minwoo Park , Mofi Rahman , Mohit Khatwani , Natalie Dao , Nenshad Bardoliwalla , Nesh Devanathan , Neta Dumai , Nilay Chauhan , Oscar Wahltinez , Pankil Botarda , Parker Barnes , Paul Barham , Paul Michel , Pengchong Jin , Petko Georgiev , Phil Culliton , Pradeep Kuppala , Ramona Comanescu , Ramona Merhej , Reena Jana , Reza Ardeshir Rokni , Rishabh Agarwal , Ryan Mullins , Samaneh Saadat , Sara Mc Carthy , Sarah Cogan , Sarah Perrin , Sébastien M. R. Arnold , Sebastian Krause , Shengyang Dai , Shruti Garg , Shruti Sheth , Sue Ronstrom , Susan Chan , Timothy Jordan , Ting Yu , Tom Eccles , Tom Hennigan , Tomas Kocisky , Tulsee Doshi , Vihan Jain , Vikas Yadav , Vilobh Meshram , Vishal Dharmadhikari , Warren Barkley , Wei Wei , Wenming Ye , Woohyun Han , Woosuk Kwon , Xiang Xu , Zhe Shen , Zhitao Gong , Zichuan Wei , Victor Cotruta , Phoebe Kirk , Anand Rao , Minh Giang , Ludovic Peran , Tris Warkentin , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , D. Sculley , Jeanine Banks , Anca Dragan , Slav Petrov , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Sebastian Borgeaud , Noah Fiedel , Armand Joulin , Kathleen Kenealy , Robert Dadashi , Alek Andreev

LexMatcher: Dictionary-centric Data Collection for LLM-based Machine Translation

The fine-tuning of open-source large language models (LLMs) for machine translation has recently received considerable attention, marking a shift towards data-centric research from traditional neural machine translation. However, the area…

Computation and Language · Computer Science 2024-10-28 Yongjing Yin , Jiali Zeng , Yafu Li , Fandong Meng , Yue Zhang