Related papers: CodeGemma: Open Code Models Based on Gemma

200 papers

This work introduces Gemma, a family of lightweight, state-of-the-art open models built from the research and technology used to create Gemini models. Gemma models demonstrate strong performance across academic benchmarks for language…

Computation and Language · Computer Science 2024-04-17 Gemma Team , Thomas Mesnard , Cassidy Hardin , Robert Dadashi , Surya Bhupatiraju , Shreya Pathak , Laurent Sifre , Morgane Rivière , Mihir Sanjay Kale , Juliette Love , Pouya Tafti , Léonard Hussenot , Pier Giuseppe Sessa , Aakanksha Chowdhery , Adam Roberts , Aditya Barua , Alex Botev , Alex Castro-Ros , Ambrose Slone , Amélie Héliou , Andrea Tacchetti , Anna Bulanova , Antonia Paterson , Beth Tsai , Bobak Shahriari , Charline Le Lan , Christopher A. Choquette-Choo , Clément Crepy , Daniel Cer , Daphne Ippolito , David Reid , Elena Buchatskaya , Eric Ni , Eric Noland , Geng Yan , George Tucker , George-Christian Muraru , Grigory Rozhdestvenskiy , Henryk Michalewski , Ian Tenney , Ivan Grishchenko , Jacob Austin , James Keeling , Jane Labanowski , Jean-Baptiste Lespiau , Jeff Stanway , Jenny Brennan , Jeremy Chen , Johan Ferret , Justin Chiu , Justin Mao-Jones , Katherine Lee , Kathy Yu , Katie Millican , Lars Lowe Sjoesund , Lisa Lee , Lucas Dixon , Machel Reid , Maciej Mikuła , Mateo Wirth , Michael Sharman , Nikolai Chinaev , Nithum Thain , Olivier Bachem , Oscar Chang , Oscar Wahltinez , Paige Bailey , Paul Michel , Petko Yotov , Rahma Chaabouni , Ramona Comanescu , Reena Jana , Rohan Anil , Ross McIlroy , Ruibo Liu , Ryan Mullins , Samuel L Smith , Sebastian Borgeaud , Sertan Girgin , Sholto Douglas , Shree Pandya , Siamak Shakeri , Soham De , Ted Klimenko , Tom Hennigan , Vlad Feinberg , Wojciech Stokowiec , Yu-hui Chen , Zafarali Ahmed , Zhitao Gong , Tris Warkentin , Ludovic Peran , Minh Giang , Clément Farabet , Oriol Vinyals , Jeff Dean , Koray Kavukcuoglu , Demis Hassabis , Zoubin Ghahramani , Douglas Eck , Joelle Barral , Fernando Pereira , Eli Collins , Armand Joulin , Noah Fiedel , Evan Senter , Alek Andreev , Kathleen Kenealy

We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following…

In this work, we introduce Gemma 2, a new addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters. In this new version, we apply several known technical…

Computation and Language · Computer Science 2024-10-03 Gemma Team , Morgane Riviere , Shreya Pathak , Pier Giuseppe Sessa , Cassidy Hardin , Surya Bhupatiraju , Léonard Hussenot , Thomas Mesnard , Bobak Shahriari , Alexandre Ramé , Johan Ferret , Peter Liu , Pouya Tafti , Abe Friesen , Michelle Casbon , Sabela Ramos , Ravin Kumar , Charline Le Lan , Sammy Jerome , Anton Tsitsulin , Nino Vieillard , Piotr Stanczyk , Sertan Girgin , Nikola Momchev , Matt Hoffman , Shantanu Thakoor , Jean-Bastien Grill , Behnam Neyshabur , Olivier Bachem , Alanna Walton , Aliaksei Severyn , Alicia Parrish , Aliya Ahmad , Allen Hutchison , Alvin Abdagic , Amanda Carl , Amy Shen , Andy Brock , Andy Coenen , Anthony Laforge , Antonia Paterson , Ben Bastian , Bilal Piot , Bo Wu , Brandon Royal , Charlie Chen , Chintu Kumar , Chris Perry , Chris Welty , Christopher A. Choquette-Choo , Danila Sinopalnikov , David Weinberger , Dimple Vijaykumar , Dominika Rogozińska , Dustin Herbison , Elisa Bandy , Emma Wang , Eric Noland , Erica Moreira , Evan Senter , Evgenii Eltyshev , Francesco Visin , Gabriel Rasskin , Gary Wei , Glenn Cameron , Gus Martins , Hadi Hashemi , Hanna Klimczak-Plucińska , Harleen Batra , Harsh Dhand , Ivan Nardini , Jacinda Mein , Jack Zhou , James Svensson , Jeff Stanway , Jetha Chan , Jin Peng Zhou , Joana Carrasqueira , Joana Iljazi , Jocelyn Becker , Joe Fernandez , Joost van Amersfoort , Josh Gordon , Josh Lipschultz , Josh Newlan , Ju-yeong Ji , Kareem Mohamed , Kartikeya Badola , Kat Black , Katie Millican , Keelin McDonell , Kelvin Nguyen , Kiranbir Sodhia , Kish Greene , Lars Lowe Sjoesund , Lauren Usui , Laurent Sifre , Lena Heuermann , Leticia Lago , Lilly McNealus , Livio Baldini Soares , Logan Kilpatrick , Lucas Dixon , Luciano Martins , Machel Reid , Manvinder Singh , Mark Iverson , Martin Görner , Mat Velloso , Mateo Wirth , Matt Davidow , Matt Miller , Matthew Rahtz , Matthew Watson , Meg Risdal , Mehran Kazemi , Michael Moynihan , Ming Zhang , Minsuk Kahng , Minwoo Park , Mofi Rahman , Mohit Khatwani , Natalie Dao , Nenshad Bardoliwalla , Nesh Devanathan , Neta Dumai , Nilay Chauhan , Oscar Wahltinez , Pankil Botarda , Parker Barnes , Paul Barham , Paul Michel , Pengchong Jin , Petko Georgiev , Phil Culliton , Pradeep Kuppala , Ramona Comanescu , Ramona Merhej , Reena Jana , Reza Ardeshir Rokni , Rishabh Agarwal , Ryan Mullins , Samaneh Saadat , Sara Mc Carthy , Sarah Cogan , Sarah Perrin , Sébastien M. R. Arnold , Sebastian Krause , Shengyang Dai , Shruti Garg , Shruti Sheth , Sue Ronstrom , Susan Chan , Timothy Jordan , Ting Yu , Tom Eccles , Tom Hennigan , Tomas Kocisky , Tulsee Doshi , Vihan Jain , Vikas Yadav , Vilobh Meshram , Vishal Dharmadhikari , Warren Barkley , Wei Wei , Wenming Ye , Woohyun Han , Woosuk Kwon , Xiang Xu , Zhe Shen , Zhitao Gong , Zichuan Wei , Victor Cotruta , Phoebe Kirk , Anand Rao , Minh Giang , Ludovic Peran , Tris Warkentin , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , D. Sculley , Jeanine Banks , Anca Dragan , Slav Petrov , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Sebastian Borgeaud , Noah Fiedel , Armand Joulin , Kathleen Kenealy , Robert Dadashi , Alek Andreev

We introduce EmbeddingGemma, a new lightweight, open text embedding model based on the Gemma 3 language model family. Our innovative training recipe strategically captures knowledge from larger models via encoder-decoder initialization and…

We present TranslateGemma, a suite of open machine translation models based on the Gemma 3 foundation models. To enhance the inherent multilingual capabilities of Gemma 3 for the translation task, we employ a two-stage fine-tuning process.…

We introduce Gemma 3, a multimodal addition to the Gemma family of lightweight open models, ranging in scale from 1 to 27 billion parameters. This version introduces vision understanding abilities, a wider coverage of languages and longer…

Computation and Language · Computer Science 2025-03-26 Gemma Team , Aishwarya Kamath , Johan Ferret , Shreya Pathak , Nino Vieillard , Ramona Merhej , Sarah Perrin , Tatiana Matejovicova , Alexandre Ramé , Morgane Rivière , Louis Rouillard , Thomas Mesnard , Geoffrey Cideron , Jean-bastien Grill , Sabela Ramos , Edouard Yvinec , Michelle Casbon , Etienne Pot , Ivo Penchev , Gaël Liu , Francesco Visin , Kathleen Kenealy , Lucas Beyer , Xiaohai Zhai , Anton Tsitsulin , Robert Busa-Fekete , Alex Feng , Noveen Sachdeva , Benjamin Coleman , Yi Gao , Basil Mustafa , Iain Barr , Emilio Parisotto , David Tian , Matan Eyal , Colin Cherry , Jan-Thorsten Peter , Danila Sinopalnikov , Surya Bhupatiraju , Rishabh Agarwal , Mehran Kazemi , Dan Malkin , Ravin Kumar , David Vilar , Idan Brusilovsky , Jiaming Luo , Andreas Steiner , Abe Friesen , Abhanshu Sharma , Abheesht Sharma , Adi Mayrav Gilady , Adrian Goedeckemeyer , Alaa Saade , Alex Feng , Alexander Kolesnikov , Alexei Bendebury , Alvin Abdagic , Amit Vadi , András György , André Susano Pinto , Anil Das , Ankur Bapna , Antoine Miech , Antoine Yang , Antonia Paterson , Ashish Shenoy , Ayan Chakrabarti , Bilal Piot , Bo Wu , Bobak Shahriari , Bryce Petrini , Charlie Chen , Charline Le Lan , Christopher A. Choquette-Choo , CJ Carey , Cormac Brick , Daniel Deutsch , Danielle Eisenbud , Dee Cattle , Derek Cheng , Dimitris Paparas , Divyashree Shivakumar Sreepathihalli , Doug Reid , Dustin Tran , Dustin Zelle , Eric Noland , Erwin Huizenga , Eugene Kharitonov , Frederick Liu , Gagik Amirkhanyan , Glenn Cameron , Hadi Hashemi , Hanna Klimczak-Plucińska , Harman Singh , Harsh Mehta , Harshal Tushar Lehri , Hussein Hazimeh , Ian Ballantyne , Idan Szpektor , Ivan Nardini , Jean Pouget-Abadie , Jetha Chan , Joe Stanton , John Wieting , Jonathan Lai , Jordi Orbay , Joseph Fernandez , Josh Newlan , Ju-yeong Ji , Jyotinder Singh , Kat Black , Kathy Yu , Kevin Hui , Kiran Vodrahalli , Klaus Greff , Linhai Qiu , Marcella Valentine , Marina Coelho , Marvin Ritter , Matt Hoffman , Matthew Watson , Mayank Chaturvedi , Michael Moynihan , Min Ma , Nabila Babar , Natasha Noy , Nathan Byrd , Nick Roy , Nikola Momchev , Nilay Chauhan , Noveen Sachdeva , Oskar Bunyan , Pankil Botarda , Paul Caron , Paul Kishan Rubenstein , Phil Culliton , Philipp Schmid , Pier Giuseppe Sessa , Pingmei Xu , Piotr Stanczyk , Pouya Tafti , Rakesh Shivanna , Renjie Wu , Renke Pan , Reza Rokni , Rob Willoughby , Rohith Vallu , Ryan Mullins , Sammy Jerome , Sara Smoot , Sertan Girgin , Shariq Iqbal , Shashir Reddy , Shruti Sheth , Siim Põder , Sijal Bhatnagar , Sindhu Raghuram Panyam , Sivan Eiger , Susan Zhang , Tianqi Liu , Trevor Yacovone , Tyler Liechty , Uday Kalra , Utku Evci , Vedant Misra , Vincent Roseberry , Vlad Feinberg , Vlad Kolesnikov , Woohyun Han , Woosuk Kwon , Xi Chen , Yinlam Chow , Yuvein Zhu , Zichuan Wei , Zoltan Egyed , Victor Cotruta , Minh Giang , Phoebe Kirk , Anand Rao , Kat Black , Nabila Babar , Jessica Lo , Erica Moreira , Luiz Gustavo Martins , Omar Sanseviero , Lucas Gonzalez , Zach Gleicher , Tris Warkentin , Vahab Mirrokni , Evan Senter , Eli Collins , Joelle Barral , Zoubin Ghahramani , Raia Hadsell , Yossi Matias , D. Sculley , Slav Petrov , Noah Fiedel , Noam Shazeer , Oriol Vinyals , Jeff Dean , Demis Hassabis , Koray Kavukcuoglu , Clement Farabet , Elena Buchatskaya , Jean-Baptiste Alayrac , Rohan Anil , Dmitry Lepikhin , Sebastian Borgeaud , Olivier Bachem , Armand Joulin , Alek Andreev , Cassidy Hardin , Robert Dadashi , Léonard Hussenot

The growing trend of vulnerability issues in software development as a result of a large dependence on open-source projects has received considerable attention recently. This paper investigates the effectiveness of Large Language Models…

Software Engineering · Computer Science 2024-09-17 Shaznin Sultana , Sadia Afreen , Nasir U. Eisty

We introduce LLaMA, a collection of foundation language models ranging from 7B to 65B parameters. We train our models on trillions of tokens, and show that it is possible to train state-of-the-art models using publicly available datasets…

PaliGemma is an open Vision-Language Model (VLM) that is based on the SigLIP-So400m vision encoder and the Gemma-2B language model. It is trained to be a versatile and broadly knowledgeable base model that is effective to transfer. It…

Code generation by Llama 3.1 models, such as Meta's Llama 3.1 405B, represents a significant advancement in the field of artificial intelligence, particularly in natural language processing and programming automation. This paper explores…

Computation and Language · Computer Science 2025-04-03 Aniket Deroy , Subhankar Maity

We present Llemma, a large language model for mathematics. We continue pretraining Code Llama on the Proof-Pile-2, a mixture of scientific papers, web data containing mathematics, and mathematical code, yielding Llemma. On the MATH…

We introduce T5Gemma 2, the next generation of the T5Gemma family of lightweight open encoder-decoder models, featuring strong multilingual, multimodal and long-context capabilities. T5Gemma 2 follows the adaptation recipe (via UL2) in…

Large Language Models (LLMs) demonstrate capabilities in code generation, potentially boosting developer productivity. However, their adoption remains limited by high computational costs, among other factors. Small Language Models (SLMs)…

Software Engineering · Computer Science 2025-09-23 Débora Souza , Rohit Gheyi , Lucas Albuquerque , Gustavo Soares , Márcio Ribeiro

The rapid evolution of large language models (LLMs) has opened new possibilities for automating various tasks in software development. This paper evaluates the capabilities of the Llama 2-70B model in automating these tasks for scientific…

Software Engineering · Computer Science 2025-07-09 Patrick Diehl , Nojoud Nader , Maxim Moraru , Steven R. Brandt

Text-to-SQL systems enable users to interact with structured databases using natural language, eliminating the need for specialized programming knowledge. In this work, we introduce GEMMA-SQL, a lightweight and efficient text-to-SQL model…

Computation and Language · Computer Science 2025-11-10 Hari Mohan Pandey , Anshul Gupta , Subham Sarkar , Minakshi Tomer , Schneider Johannes , Yan Gong

Large Language Models (LLMs) have demonstrated their remarkable capabilities in numerous fields. This survey focuses on how LLMs empower users, regardless of their technical background, to use human languages to automatically generate…

Software Engineering · Computer Science 2025-04-03 Nam Huynh , Beiyu Lin

Security vulnerabilities present in a code that has been written in diverse programming languages are among the most critical yet complicated aspects of source code to detect. Static analysis tools based on rule-based patterns usually do…

Cryptography and Security · Computer Science 2025-08-19 Hael Abdulhakim Ali Humran , Ferdi Sonmez

Natural language to code generation is an important application area of LLMs and has received wide attention from the community. The majority of relevant studies have exclusively concentrated on increasing the quantity and functional…

Machine Learning · Computer Science 2023-11-28 Naman Jain , Tianjun Zhang , Wei-Lin Chiang , Joseph E. Gonzalez , Koushik Sen , Ion Stoica

Large Language Models (LLMs) have shown promising performance in code generation. However, how to reliably evaluate code generated by LLMs remains an unresolved problem. This paper presents CodeJudge, a code evaluation framework that…

Machine Learning · Computer Science 2024-10-04 Weixi Tong , Tianyi Zhang