Related papers: CodeGemma: Open Code Models Based on Gemma

Gemma: Open Models Based on Gemini Research and Technology

This work introduces Gemma, a family of lightweight, state-of-the art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language…

Computation and Language · Computer Science 2024-04-17 Gemma Team , Thomas Mesnard , Cassidy Hardin , Robert Dadashi , Surya Bhupatiraju , Shreya Pathak , Laurent Sifre , Morgane Rivière , Mihir Sanjay Kale , Juliette Love , Pouya Tafti , Léonard Hussenot , Pier Giuseppe Sessa , Aakanksha Chowdhery , Adam Roberts , Aditya Barua , Alex Botev , Alex Castro-Ros , Ambrose Slone , Amélie Héliou , Andrea Tacchetti , Anna Bulanova , Antonia Paterson , Beth Tsai , Bobak Shahriari , Charline Le Lan , Christopher A. Choquette-Choo , Clément Crepy , Daniel Cer , Daphne Ippolito , David Reid , Elena Buchatskaya , Eric Ni , Eric Noland , Geng Yan , George Tucker , George-Christian Muraru , Grigory Rozhdestvenskiy , Henryk Michalewski , Ian Tenney , Ivan Grishchenko , Jacob Austin , James Keeling , Jane Labanowski , Jean-Baptiste Lespiau , Jeff Stanway , Jenny Brennan , Jeremy Chen , Johan Ferret , Justin Chiu , Justin Mao-Jones , Katherine Lee , Kathy Yu , Katie Millican , Lars Lowe Sjoesund , Lisa Lee , Lucas Dixon , Machel Reid , Maciej Mikuła , Mateo Wirth , Michael Sharman , Nikolai Chinaev , Nithum Thain , Olivier Bachem , Oscar Chang , Oscar Wahltinez , Paige Bailey , Paul Michel , Petko Yotov , Rahma Chaabouni , Ramona Comanescu , Reena Jana , Rohan Anil , Ross McIlroy , Ruibo Liu , Ryan Mullins , Samuel L Smith , Sebastian Borgeaud , Sertan Girgin , Sholto Douglas , Shree Pandya , Siamak Shakeri , Soham De , Ted Klimenko , Tom Hennigan , Vlad Feinberg , Wojciech Stokowiec , Yu-hui Chen , Zafarali Ahmed , Zhitao Gong , Tris Warkentin , Ludovic Peran , Minh Giang , Clément Farabet , Oriol Vinyals , Jeff Dean , Koray Kavukcuoglu , Demis Hassabis , Zoubin Ghahramani , Douglas Eck , Joelle Barral , Fernando Pereira , Eli Collins , Armand Joulin , Noah Fiedel , Evan Senter , Alek Andreev , Kathleen Kenealy

Code Llama: Open Foundation Models for Code

We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following…

Computation and Language · Computer Science 2024-02-02 Baptiste Rozière , Jonas Gehring , Fabian Gloeckle , Sten Sootla , Itai Gat , Xiaoqing Ellen Tan , Yossi Adi , Jingyu Liu , Romain Sauvestre , Tal Remez , Jérémy Rapin , Artyom Kozhevnikov , Ivan Evtimov , Joanna Bitton , Manish Bhatt , Cristian Canton Ferrer , Aaron Grattafiori , Wenhan Xiong , Alexandre Défossez , Jade Copet , Faisal Azhar , Hugo Touvron , Louis Martin , Nicolas Usunier , Thomas Scialom , Gabriel Synnaeve

Gemma 2: Improving Open Language Models at a Practical Size

In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical…

Computation and Language · Computer Science 2024-10-03 Gemma Team , Morgane Riviere , Shreya Pathak , Pier Giuseppe Sessa , Cassidy Hardin , Surya Bhupatiraju , Léonard Hussenot , Thomas Mesnard , Bobak Shahriari , Alexandre Ramé , Johan Ferret , Peter Liu , Pouya Tafti , Abe Friesen , Michelle Casbon , Sabela Ramos , Ravin Kumar , Charline Le Lan , Sammy Jerome , Anton Tsitsulin , Nino Vieillard , Piotr Stanczyk , Sertan Girgin , Nikola Momchev , Matt Hoffman , Shantanu Thakoor , Jean-Bastien Grill , Behnam Neyshabur , Olivier Bachem , Alanna Walton , Aliaksei Severyn , Alicia Parrish , Aliya Ahmad , Allen Hutchison , Alvin Abdagic , Amanda Carl , Amy Shen , Andy Brock , Andy Coenen , Anthony Laforge , Antonia Paterson , Ben Bastian , Bilal Piot , Bo Wu , Brandon Royal , Charlie Chen , Chintu Kumar , Chris Perry , Chris Welty , Christopher A. Choquette-Choo , Danila Sinopalnikov , David Weinberger , Dimple Vijaykumar , Dominika Rogozińska , Dustin Herbison , Elisa Bandy , Emma Wang , Eric Noland , Erica Moreira , Evan Senter , Evgenii Eltyshev , Francesco Visin , Gabriel Rasskin , Gary Wei , Glenn Cameron , Gus Martins , Hadi Hashemi , Hanna Klimczak-Plucińska , Harleen Batra , Harsh Dhand , Ivan Nardini , Jacinda Mein , Jack Zhou , James Svensson , Jeff Stanway , Jetha Chan , Jin Peng Zhou , Joana Carrasqueira , Joana Iljazi , Jocelyn Becker , Joe Fernandez , Joost van Amersfoort , Josh Gordon , Josh Lipschultz , Josh Newlan , Ju-yeong Ji , Kareem Mohamed , Kartikeya Badola , Kat Black , Katie Millican , Keelin McDonell , Kelvin Nguyen , Kiranbir Sodhia , Kish Greene , Lars Lowe Sjoesund , Lauren Usui , Laurent Sifre , Lena Heuermann , Leticia Lago , Lilly McNealus , Livio Baldini Soares , Logan Kilpatrick , Lucas Dixon , Luciano Martins , Machel Reid , Manvinder Singh , Mark Iverson , Martin Görner , Mat Velloso , Mateo Wirth , Matt Davidow , Matt Miller , Matthew Rahtz , Matthew Watson , Meg Risdal , Mehran Kazemi , Michael Moynihan , Ming Zhang , Minsuk Kahng , Minwoo Park , Mofi Rahman , Mohit Khatwani , Natalie Dao , Nenshad Bardoliwalla , Nesh Devanathan , Neta Dumai , Nilay Chauhan , Oscar Wahltinez , Pankil Botarda , Parker Barnes , Paul Barham , Paul Michel , Pengchong Jin , Petko Georgiev , Phil Culliton , Pradeep Kuppala , Ramona Comanescu , Ramona Merhej , Reena Jana , Reza Ardeshir Rokni , Rishabh Agarwal , Ryan Mullins , Samaneh Saadat , Sara Mc Carthy , Sarah Cogan , Sarah Perrin , Sébastien M. R. Arnold , Sebastian Krause , Shengyang Dai , Shruti Garg , Shruti Sheth , Sue Ronstrom , Susan Chan , Timothy Jordan , Ting Yu , Tom Eccles , Tom Hennigan , Tomas Kocisky , Tulsee Doshi , Vihan Jain , Vikas Yadav , Vilobh Meshram , Vishal Dharmadhikari , Warren Barkley , Wei Wei , Wenming Ye , Woohyun Han , Woosuk Kwon , Xiang Xu , Zhe Shen , Zhitao Gong , Zichuan Wei , Victor Cotruta , Phoebe Kirk , Anand Rao , Minh Giang , Ludovic Peran , Tris Warkentin , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , D. Sculley , Jeanine Banks , Anca Dragan , Slav Petrov , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Sebastian Borgeaud , Noah Fiedel , Armand Joulin , Kathleen Kenealy , Robert Dadashi , Alek Andreev

EmbeddingGemma: Powerful and Lightweight Text Representations

We introduce EmbeddingGemma, a new lightweight, open text embedding model based on the Gemma 3 language model family. Our innovative training recipe strategically captures knowledge from larger models via encoder-decoder initialization and…

Computation and Language · Computer Science 2025-11-04 Henrique Schechter Vera , Sahil Dua , Biao Zhang , Daniel Salz , Ryan Mullins , Sindhu Raghuram Panyam , Sara Smoot , Iftekhar Naim , Joe Zou , Feiyang Chen , Daniel Cer , Alice Lisak , Min Choi , Lucas Gonzalez , Omar Sanseviero , Glenn Cameron , Ian Ballantyne , Kat Black , Kaifeng Chen , Weiyi Wang , Zhe Li , Gus Martins , Jinhyuk Lee , Mark Sherwood , Juyeong Ji , Renjie Wu , Jingxiao Zheng , Jyotinder Singh , Abheesht Sharma , Divyashree Sreepathihalli , Aashi Jain , Adham Elarabawy , AJ Co , Andreas Doumanoglou , Babak Samari , Ben Hora , Brian Potetz , Dahun Kim , Enrique Alfonseca , Fedor Moiseev , Feng Han , Frank Palma Gomez , Gustavo Hernández Ábrego , Hesen Zhang , Hui Hui , Jay Han , Karan Gill , Ke Chen , Koert Chen , Madhuri Shanbhogue , Michael Boratko , Paul Suganthan , Sai Meher Karthik Duddu , Sandeep Mariserla , Setareh Ariafar , Shanfeng Zhang , Shijie Zhang , Simon Baumgartner , Sonam Goenka , Steve Qiu , Tanmaya Dabral , Trevor Walker , Vikram Rao , Waleed Khawaja , Wenlei Zhou , Xiaoqi Ren , Ye Xia , Yichang Chen , Yi-Ting Chen , Zhe Dong , Zhongli Ding , Francesco Visin , Gaël Liu , Jiageng Zhang , Kathleen Kenealy , Michelle Casbon , Ravin Kumar , Thomas Mesnard , Zach Gleicher , Cormac Brick , Olivier Lacombe , Adam Roberts , Qin Yin , Yunhsuan Sung , Raphael Hoffmann , Tris Warkentin , Armand Joulin , Tom Duerig , Mojtaba Seyedhosseini

TranslateGemma Technical Report

We present TranslateGemma, a suite of open machine translation models based on the Gemma 3 foundation models. To enhance the inherent multilingual capabilities of Gemma 3 for the translation task, we employ a two-stage fine-tuning process.…

Computation and Language · Computer Science 2026-01-21 Mara Finkelstein , Isaac Caswell , Tobias Domhan , Jan-Thorsten Peter , Juraj Juraska , Parker Riley , Daniel Deutsch , Geza Kovacs , Cole Dilanni , Colin Cherry , Eleftheria Briakou , Elizabeth Nielsen , Jiaming Luo , Kat Black , Ryan Mullins , Sweta Agrawal , Wenda Xu , Erin Kats , Stephane Jaskiewicz , Markus Freitag , David Vilar

Gemma 3 Technical Report

We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer…

Computation and Language · Computer Science 2025-03-26 Gemma Team , Aishwarya Kamath , Johan Ferret , Shreya Pathak , Nino Vieillard , Ramona Merhej , Sarah Perrin , Tatiana Matejovicova , Alexandre Ramé , Morgane Rivière , Louis Rouillard , Thomas Mesnard , Geoffrey Cideron , Jean-bastien Grill , Sabela Ramos , Edouard Yvinec , Michelle Casbon , Etienne Pot , Ivo Penchev , Gaël Liu , Francesco Visin , Kathleen Kenealy , Lucas Beyer , Xiaohai Zhai , Anton Tsitsulin , Robert Busa-Fekete , Alex Feng , Noveen Sachdeva , Benjamin Coleman , Yi Gao , Basil Mustafa , Iain Barr , Emilio Parisotto , David Tian , Matan Eyal , Colin Cherry , Jan-Thorsten Peter , Danila Sinopalnikov , Surya Bhupatiraju , Rishabh Agarwal , Mehran Kazemi , Dan Malkin , Ravin Kumar , David Vilar , Idan Brusilovsky , Jiaming Luo , Andreas Steiner , Abe Friesen , Abhanshu Sharma , Abheesht Sharma , Adi Mayrav Gilady , Adrian Goedeckemeyer , Alaa Saade , Alex Feng , Alexander Kolesnikov , Alexei Bendebury , Alvin Abdagic , Amit Vadi , András György , André Susano Pinto , Anil Das , Ankur Bapna , Antoine Miech , Antoine Yang , Antonia Paterson , Ashish Shenoy , Ayan Chakrabarti , Bilal Piot , Bo Wu , Bobak Shahriari , Bryce Petrini , Charlie Chen , Charline Le Lan , Christopher A. Choquette-Choo , CJ Carey , Cormac Brick , Daniel Deutsch , Danielle Eisenbud , Dee Cattle , Derek Cheng , Dimitris Paparas , Divyashree Shivakumar Sreepathihalli , Doug Reid , Dustin Tran , Dustin Zelle , Eric Noland , Erwin Huizenga , Eugene Kharitonov , Frederick Liu , Gagik Amirkhanyan , Glenn Cameron , Hadi Hashemi , Hanna Klimczak-Plucińska , Harman Singh , Harsh Mehta , Harshal Tushar Lehri , Hussein Hazimeh , Ian Ballantyne , Idan Szpektor , Ivan Nardini , Jean Pouget-Abadie , Jetha Chan , Joe Stanton , John Wieting , Jonathan Lai , Jordi Orbay , Joseph Fernandez , Josh Newlan , Ju-yeong Ji , Jyotinder Singh , Kat Black , Kathy Yu , Kevin Hui , Kiran Vodrahalli , Klaus Greff , Linhai Qiu , Marcella Valentine , Marina Coelho , Marvin Ritter , Matt Hoffman , Matthew Watson , Mayank Chaturvedi , Michael Moynihan , Min Ma , Nabila Babar , Natasha Noy , Nathan Byrd , Nick Roy , Nikola Momchev , Nilay Chauhan , Noveen Sachdeva , Oskar Bunyan , Pankil Botarda , Paul Caron , Paul Kishan Rubenstein , Phil Culliton , Philipp Schmid , Pier Giuseppe Sessa , Pingmei Xu , Piotr Stanczyk , Pouya Tafti , Rakesh Shivanna , Renjie Wu , Renke Pan , Reza Rokni , Rob Willoughby , Rohith Vallu , Ryan Mullins , Sammy Jerome , Sara Smoot , Sertan Girgin , Shariq Iqbal , Shashir Reddy , Shruti Sheth , Siim Põder , Sijal Bhatnagar , Sindhu Raghuram Panyam , Sivan Eiger , Susan Zhang , Tianqi Liu , Trevor Yacovone , Tyler Liechty , Uday Kalra , Utku Evci , Vedant Misra , Vincent Roseberry , Vlad Feinberg , Vlad Kolesnikov , Woohyun Han , Woosuk Kwon , Xi Chen , Yinlam Chow , Yuvein Zhu , Zichuan Wei , Zoltan Egyed , Victor Cotruta , Minh Giang , Phoebe Kirk , Anand Rao , Kat Black , Nabila Babar , Jessica Lo , Erica Moreira , Luiz Gustavo Martins , Omar Sanseviero , Lucas Gonzalez , Zach Gleicher , Tris Warkentin , Vahab Mirrokni , Evan Senter , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , Yossi Matias , D. Sculley , Slav Petrov , Noah Fiedel , Noam Shazeer , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Jean-Baptiste Alayrac , Rohan Anil , Dmitry , Lepikhin , Sebastian Borgeaud , Olivier Bachem , Armand Joulin , Alek Andreev , Cassidy Hardin , Robert Dadashi , Léonard Hussenot

Code Vulnerability Detection: A Comparative Analysis of Emerging Large Language Models

The growing trend of vulnerability issues in software development as a result of a large dependence on open-source projects has received considerable attention recently. This paper investigates the effectiveness of Large Language Models…

Software Engineering · Computer Science 2024-09-17 Shaznin Sultana , Sadia Afreen , Nasir U. Eisty

LLaMA: Open and Efficient Foundation Language Models

We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets…

Computation and Language · Computer Science 2023-02-28 Hugo Touvron , Thibaut Lavril , Gautier Izacard , Xavier Martinet , Marie-Anne Lachaux , Timothée Lacroix , Baptiste Rozière , Naman Goyal , Eric Hambro , Faisal Azhar , Aurelien Rodriguez , Armand Joulin , Edouard Grave , Guillaume Lample

PaliGemma: A versatile 3B VLM for transfer

PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model. It is trained to be a versatile and broadly knowledgeable base model that is effective to transfer. It…

Computer Vision and Pattern Recognition · Computer Science 2024-10-11 Lucas Beyer , Andreas Steiner , André Susano Pinto , Alexander Kolesnikov , Xiao Wang , Daniel Salz , Maxim Neumann , Ibrahim Alabdulmohsin , Michael Tschannen , Emanuele Bugliarello , Thomas Unterthiner , Daniel Keysers , Skanda Koppula , Fangyu Liu , Adam Grycner , Alexey Gritsenko , Neil Houlsby , Manoj Kumar , Keran Rong , Julian Eisenschlos , Rishabh Kabra , Matthias Bauer , Matko Bošnjak , Xi Chen , Matthias Minderer , Paul Voigtlaender , Ioana Bica , Ivana Balazevic , Joan Puigcerver , Pinelopi Papalampidi , Olivier Henaff , Xi Xiong , Radu Soricut , Jeremiah Harmsen , Xiaohua Zhai

Code Generation and Algorithmic Problem Solving Using Llama 3.1 405B

Code generation by Llama 3.1 models, such as Meta's Llama 3.1 405B, represents a significant advancement in the field of artificial intelligence, particularly in natural language processing and programming automation. This paper explores…

Computation and Language · Computer Science 2025-04-03 Aniket Deroy , Subhankar Maity

Llemma: An Open Language Model For Mathematics

We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma. On the MATH…

Computation and Language · Computer Science 2024-03-19 Zhangir Azerbayev , Hailey Schoelkopf , Keiran Paster , Marco Dos Santos , Stephen McAleer , Albert Q. Jiang , Jia Deng , Stella Biderman , Sean Welleck

Granite Code Models: A Family of Open Foundation Models for Code Intelligence

Large Language Models (LLMs) trained on code are revolutionizing the software development process. Increasingly, code LLMs are being integrated into software development environments to improve the productivity of human programmers, and…

Artificial Intelligence · Computer Science 2024-05-08 Mayank Mishra , Matt Stallone , Gaoyuan Zhang , Yikang Shen , Aditya Prasad , Adriana Meza Soria , Michele Merler , Parameswaran Selvam , Saptha Surendran , Shivdeep Singh , Manish Sethi , Xuan-Hong Dang , Pengyuan Li , Kun-Lung Wu , Syed Zawad , Andrew Coleman , Matthew White , Mark Lewis , Raju Pavuluri , Yan Koyfman , Boris Lublinsky , Maximilien de Bayser , Ibrahim Abdelaziz , Kinjal Basu , Mayank Agarwal , Yi Zhou , Chris Johnson , Aanchal Goyal , Hima Patel , Yousaf Shah , Petros Zerfos , Heiko Ludwig , Asim Munawar , Maxwell Crouse , Pavan Kapanipathi , Shweta Salaria , Bob Calio , Sophia Wen , Seetharami Seelam , Brian Belgodere , Carlos Fonseca , Amith Singhee , Nirmit Desai , David D. Cox , Ruchir Puri , Rameswar Panda

T5Gemma 2: Seeing, Reading, and Understanding Longer

We introduce T5Gemma 2, the next generation of the T5Gemma family of lightweight open encoder-decoder models, featuring strong multilingual, multimodal and long-context capabilities. T5Gemma 2 follows the adaptation recipe (via UL2) in…

Computation and Language · Computer Science 2025-12-25 Biao Zhang , Paul Suganthan , Gaël Liu , Ilya Philippov , Sahil Dua , Ben Hora , Kat Black , Gus Martins , Omar Sanseviero , Shreya Pathak , Cassidy Hardin , Francesco Visin , Jiageng Zhang , Kathleen Kenealy , Qin Yin , Xiaodan Song , Olivier Lacombe , Armand Joulin , Tris Warkentin , Adam Roberts

Code Generation with Small Language Models: A Codeforces-Based Study

Large Language Models (LLMs) demonstrate capabilities in code generation, potentially boosting developer productivity. However, their adoption remains limited by high computational costs, among other factors. Small Language Models (SLMs)…

Software Engineering · Computer Science 2025-09-23 Débora Souza , Rohit Gheyi , Lucas Albuquerque , Gustavo Soares , Márcio Ribeiro

LLM Benchmarking with LLaMA2: Evaluating Code Development Performance Across Multiple Programming Languages

The rapid evolution of large language models (LLMs) has opened new possibilities for automating various tasks in software development. This paper evaluates the capabilities of the Llama 2-70B model in automating these tasks for scientific…

Software Engineering · Computer Science 2025-07-09 Patrick Diehl , Nojoud Nader , Maxim Moraru , Steven R. Brandt

GEMMA-SQL: A Novel Text-to-SQL Model Based on Large Language Models

Text-to-SQL systems enable users to interact with structured databases using natural language, eliminating the need for specialized programming knowledge. In this work, we introduce GEMMA-SQL, a lightweight and efficient text-to-SQL model…

Computation and Language · Computer Science 2025-11-10 Hari Mohan Pandey , Anshul Gupta , Subham Sarkar , Minakshi Tomer , Schneider Johannes , Yan Gong

Large Language Models for Code Generation: A Comprehensive Survey of Challenges, Techniques, Evaluation, and Applications

Large Language Models (LLMs) have demonstrated their remarkable capabilities in numerous fields. This survey focuses on how LLMs empower users, regardless of their technical background, to use human languages to automatically generate…

Software Engineering · Computer Science 2025-04-03 Nam Huynh , Beiyu Lin

Code Vulnerability Detection Across Different Programming Languages with AI Models

Security vulnerabilities present in a code that has been written in diverse programming languages are among the most critical yet complicated aspects of source code to detect. Static analysis tools based on rule-based patterns usually do…

Cryptography and Security · Computer Science 2025-08-19 Hael Abdulhakim Ali Humran , Ferdi Sonmez

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Natural language to code generation is an important application area of LLMs and has received wide attention from the community. The majority of relevant studies have exclusively concentrated on increasing the quantity and functional…

Machine Learning · Computer Science 2023-11-28 Naman Jain , Tianjun Zhang , Wei-Lin Chiang , Joseph E. Gonzalez , Koushik Sen , Ion Stoica

CodeJudge: Evaluating Code Generation with Large Language Models

Large Language Models (LLMs) have shown promising performance in code generation. However, how to reliably evaluate code generated by LLMs remains an unresolved problem. This paper presents CodeJudge, a code evaluation framework that…

Machine Learning · Computer Science 2024-10-04 Weixi Tong , Tianyi Zhang