Related papers: Narrow Transformer: StarCoder-Based Java-LM For De…

A Comprehensive Review of State-of-The-Art Methods for Java Code Generation from Natural Language Text

Java code generation consists of automatically generating Java code from natural language text. This NLP task helps increase programmers' productivity by providing them with immediate solutions to the simplest and most repetitive…

Computation and Language · Computer Science 2023-06-13 Jessica López Espejel, Mahaman Sanoussi Yahaya Alassan, El Mehdi Chouham, Walid Dahhane, El Hassane Ettifouri

REMODEL-LLM: Transforming C code to Java using LLMs

The automated translation of C code to Java code is a notoriously difficult task, fraught with challenges stemming from fundamental paradigm shifts (procedural vs. Object Oriented), memory models (manual pointers vs. Garbage Collection),…

Software Engineering · Computer Science 2025-12-15 Aryan Gupta, Y. Raghu Reddy

An Empirical Study on the Code Refactoring Capability of Large Language Models

Large Language Models (LLMs) have shown potential to enhance software development through automated code generation and refactoring, reducing development time and improving code quality. This study empirically evaluates StarCoder2, an LLM…

Software Engineering · Computer Science 2024-11-05 Jonathan Cordeiro, Shayan Noei, Ying Zou

StarCoder 2 and The Stack v2: The Next Generation

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of…

Software Engineering · Computer Science 2024-03-01 Anton Lozhkov, Raymond Li, Loubna Ben Allal, Federico Cassano, Joel Lamy-Poirier, Nouamane Tazi, Ao Tang, Dmytro Pykhtar, Jiawei Liu, Yuxiang Wei, Tianyang Liu, Max Tian, Denis Kocetkov, Arthur Zucker, Younes Belkada, Zijian Wang, Qian Liu, Dmitry Abulkhanov, Indraneil Paul, Zhuang Li, Wen-Ding Li, Megan Risdal, Jia Li, Jian Zhu, Terry Yue Zhuo, Evgenii Zheltonozhskii, Nii Osae Osae Dade, Wenhao Yu, Lucas Krauß, Naman Jain, Yixuan Su, Xuanli He, Manan Dey, Edoardo Abati, Yekun Chai, Niklas Muennighoff, Xiangru Tang, Muhtasham Oblokulov, Christopher Akiki, Marc Marone, Chenghao Mou, Mayank Mishra, Alex Gu, Binyuan Hui, Tri Dao, Armel Zebaze, Olivier Dehaene, Nicolas Patry, Canwen Xu, Julian McAuley, Han Hu, Torsten Scholak, Sebastien Paquet, Jennifer Robinson, Carolyn Jane Anderson, Nicolas Chapados, Mostofa Patwary, Nima Tajbakhsh, Yacine Jernite, Carlos Muñoz Ferrandis, Lingming Zhang, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries

A Language Model of Java Methods with Train/Test Deduplication

This tool demonstration presents a research toolkit for a language model of Java source code. The target audience includes researchers studying problems at the granularity level of subroutines, statements, or variables in Java. In contrast…

Software Engineering · Computer Science 2023-05-16 Chia-Yi Su, Aakash Bansal, Vijayanta Jain, Sepideh Ghanavati, Collin McMillan

CoderUJB: An Executable and Unified Java Benchmark for Practical Programming Scenarios

In the evolving landscape of large language models (LLMs) tailored for software engineering, the need for benchmarks that accurately reflect real-world development scenarios is paramount. Current benchmarks are either too simplistic or fail…

Software Engineering · Computer Science 2024-03-29 Zhengran Zeng, Yidong Wang, Rui Xie, Wei Ye, Shikun Zhang

Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs

Over the past few years, Large Language Models of Code (Code LLMs) have started to have a significant impact on programming practice. Code LLMs are also emerging as building blocks for research in programming languages and software…

Programming Languages · Computer Science 2024-09-24 Federico Cassano, John Gouwar, Francesca Lucchetti, Claire Schlesinger, Anders Freeman, Carolyn Jane Anderson, Molly Q Feldman, Michael Greenberg, Abhinav Jangda, Arjun Guha

COBOL-Coder: Domain-Adapted Large Language Models for COBOL Code Generation and Translation

COBOL remains a critical language for mainframe systems, yet existing large language models (LLMs) struggle to generate and translate COBOL code correctly. This paper reports our experience in developing and evaluating domain-adapted LLMs…

Software Engineering · Computer Science 2026-04-07 Anh T. V. Dau, Shin Hwei Tan, Jinqiu Yang, Nghi D. Q. Bui, Anh Tuan Nguyen

TigerCoder: A Novel Suite of LLMs for Code Generation in Bangla

Despite being the 5th most spoken language, Bangla remains underrepresented in Large Language Models (LLMs), particularly for code generation. This primarily stems from the scarcity of high-quality data to pre-train and/or finetune such…

Computation and Language · Computer Science 2025-09-12 Nishat Raihan, Antonios Anastasopoulos, Marcos Zampieri

StarCoder: may the source be with you!

The BigCode community, an open-scientific collaboration working on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder and StarCoderBase: 15.5B parameter models with 8K context length, infilling…

Computation and Language · Computer Science 2023-12-14 Raymond Li, Loubna Ben Allal, Yangtian Zi, Niklas Muennighoff, Denis Kocetkov, Chenghao Mou, Marc Marone, Christopher Akiki, Jia Li, Jenny Chim, Qian Liu, Evgenii Zheltonozhskii, Terry Yue Zhuo, Thomas Wang, Olivier Dehaene, Mishig Davaadorj, Joel Lamy-Poirier, João Monteiro, Oleh Shliazhko, Nicolas Gontier, Nicholas Meade, Armel Zebaze, Ming-Ho Yee, Logesh Kumar Umapathi, Jian Zhu, Benjamin Lipkin, Muhtasham Oblokulov, Zhiruo Wang, Rudra Murthy, Jason Stillerman, Siva Sankalp Patel, Dmitry Abulkhanov, Marco Zocca, Manan Dey, Zhihan Zhang, Nour Fahmy, Urvashi Bhattacharyya, Wenhao Yu, Swayam Singh, Sasha Luccioni, Paulo Villegas, Maxim Kunakov, Fedor Zhdanov, Manuel Romero, Tony Lee, Nadav Timor, Jennifer Ding, Claire Schlesinger, Hailey Schoelkopf, Jan Ebert, Tri Dao, Mayank Mishra, Alex Gu, Jennifer Robinson, Carolyn Jane Anderson, Brendan Dolan-Gavitt, Danish Contractor, Siva Reddy, Daniel Fried, Dzmitry Bahdanau, Yacine Jernite, Carlos Muñoz Ferrandis, Sean Hughes, Thomas Wolf, Arjun Guha, Leandro von Werra, Harm de Vries

Analysis of MiniJava Programs via Translation to ML

MiniJava is a subset of the object-oriented programming language Java. Standard ML is the canonical representative of the ML family of functional programming languages, which includes F# and OCaml. Different program analysis and…

Programming Languages · Computer Science 2021-01-01 Martin Mariusz Lester

Syntax Is Not Enough: An Empirical Study of Small Transformer Models for Neural Code Repair

Automated program repair using neural models has shown promising results on benchmark datasets, yet practical deployment remains limited. In this study, we examine whether a small transformer model can meaningfully repair real-world Java…

Software Engineering · Computer Science 2025-12-30 Shaunak Samant

JaCoText: A Pretrained Model for Java Code-Text Generation

Pretrained transformer-based models have shown high performance in natural language generation tasks. However, a new wave of interest has surged: automatic programming language generation. This task consists of translating natural language…

Computation and Language · Computer Science 2023-03-24 Jessica López Espejel, Mahaman Sanoussi Yahaya Alassan, Walid Dahhane, El Hassane Ettifouri

SteloCoder: a Decoder-Only LLM for Multi-Language to Python Code Translation

With the recent focus on Large Language Models (LLMs), both StarCoder (Li et al., 2023) and Code Llama (Rozière et al., 2023) have demonstrated remarkable performance in code generation. However, there is still a need for improvement in…

Computation and Language · Computer Science 2023-12-18 Jialing Pan, Adrien Sadé, Jin Kim, Eric Soriano, Guillem Sole, Sylvain Flamant

Exploring and Evaluating Personalized Models for Code Generation

Large Transformer models achieved the state-of-the-art status for Natural Language Understanding tasks and are increasingly becoming the baseline model architecture for modeling source code. Transformers are usually pre-trained on large…

Software Engineering · Computer Science 2022-09-21 Andrei Zlotchevski, Dawn Drain, Alexey Svyatkovskiy, Colin Clement, Neel Sundaresan, Michele Tufano

GEB-1.3B: Open Lightweight Large Language Model

Recently developed large language models (LLMs) such as ChatGPT, Claude, and Llama have demonstrated impressive abilities, and even surpass human-level performance in several tasks. Despite their success, the resource-intensive demands of…

Computation and Language · Computer Science 2024-06-17 Jie Wu, Yufeng Zhu, Lei Shen, Xuqing Lu

ComplexCodeEval: A Benchmark for Evaluating Large Code Models on More Complex Code

In recent years, the application of large language models (LLMs) to code-related tasks has gained significant attention. However, existing evaluation benchmarks often focus on limited scenarios, such as code generation or completion, which…

Software Engineering · Computer Science 2024-09-17 Jia Feng, Jiachen Liu, Cuiyun Gao, Chun Yong Chong, Chaozheng Wang, Shan Gao, Xin Xia

From Code to Play: Benchmarking Program Search for Games Using Large Language Models

Large language models (LLMs) have shown impressive capabilities in generating program code, opening exciting opportunities for applying program synthesis to games. In this work, we explore the potential of LLMs to directly synthesize usable…

Artificial Intelligence · Computer Science 2025-07-16 Manuel Eberhardinger, James Goodman, Alexander Dockhorn, Diego Perez-Liebana, Raluca D. Gaina, Duygu Çakmak, Setareh Maghsudi, Simon Lucas

SantaCoder: don't reach for the stars!

The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state…

Software Engineering · Computer Science 2023-02-27 Loubna Ben Allal, Raymond Li, Denis Kocetkov, Chenghao Mou, Christopher Akiki, Carlos Munoz Ferrandis, Niklas Muennighoff, Mayank Mishra, Alex Gu, Manan Dey, Logesh Kumar Umapathi, Carolyn Jane Anderson, Yangtian Zi, Joel Lamy Poirier, Hailey Schoelkopf, Sergey Troshin, Dmitry Abulkhanov, Manuel Romero, Michael Lappert, Francesco De Toni, Bernardo García del Río, Qian Liu, Shamik Bose, Urvashi Bhattacharyya, Terry Yue Zhuo, Ian Yu, Paulo Villegas, Marco Zocca, Sourab Mangrulkar, David Lansky, Huu Nguyen, Danish Contractor, Luis Villa, Jia Li, Dzmitry Bahdanau, Yacine Jernite, Sean Hughes, Daniel Fried, Arjun Guha, Harm de Vries, Leandro von Werra

TinyLLaVA: A Framework of Small-scale Large Multimodal Models

We present the TinyLLaVA framework that provides a unified perspective in designing and analyzing the small-scale Large Multimodal Models (LMMs). We empirically study the effects of different vision encoders, connection modules, language…

Machine Learning · Computer Science 2024-02-23 Baichuan Zhou, Ying Hu, Xi Weng, Junlong Jia, Jie Luo, Xien Liu, Ji Wu, Lei Huang