Related papers: Stable Code Technical Report

Stable LM 2 1.6B Technical Report

We introduce StableLM 2 1.6B, the first in a new generation of our language model series. In this technical report, we present in detail the data and training procedure leading to the base and instruction-tuned versions of StableLM 2 1.6B.…

Computation and Language · Computer Science 2024-02-29 Marco Bellagente , Jonathan Tow , Dakota Mahan , Duy Phung , Maksym Zhuravinskyi , Reshinth Adithyan , James Baicoianu , Ben Brooks , Nathan Cooper , Ashish Datta , Meng Lee , Emad Mostaque , Michael Pieler , Nikhil Pinnaparju , Paulo Rocha , Harry Saini , Hannah Teufel , Niccolo Zanichelli , Carlos Riquelme

CodeShell Technical Report

Code large language models mark a pivotal breakthrough in artificial intelligence. They are specifically crafted to understand and generate programming languages, significantly boosting the efficiency of coding development workflows. In…

Software Engineering · Computer Science 2024-03-26 Rui Xie , Zhengran Zeng , Zhuohao Yu , Chang Gao , Shikun Zhang , Wei Ye

Code Llama: Open Foundation Models for Code

We release Code Llama, a family of large language models for code based on Llama 2 providing state-of-the-art performance among open models, infilling capabilities, support for large input contexts, and zero-shot instruction following…

Computation and Language · Computer Science 2024-02-02 Baptiste Rozière , Jonas Gehring , Fabian Gloeckle , Sten Sootla , Itai Gat , Xiaoqing Ellen Tan , Yossi Adi , Jingyu Liu , Romain Sauvestre , Tal Remez , Jérémy Rapin , Artyom Kozhevnikov , Ivan Evtimov , Joanna Bitton , Manish Bhatt , Cristian Canton Ferrer , Aaron Grattafiori , Wenhan Xiong , Alexandre Défossez , Jade Copet , Faisal Azhar , Hugo Touvron , Louis Martin , Nicolas Usunier , Thomas Scialom , Gabriel Synnaeve

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

Large Language Models (LLMs) have transformed software development by enabling code generation, automated debugging, and complex reasoning. However, their continued advancement is constrained by the scarcity of high-quality, publicly…

Software Engineering · Computer Science 2025-08-11 Wasi Uddin Ahmad , Aleksander Ficek , Mehrzad Samadi , Jocelyn Huang , Vahid Noroozi , Somshubra Majumdar , Boris Ginsburg

Multilingual E5 Text Embeddings: A Technical Report

This technical report presents the training methodology and evaluation results of the open-source multilingual E5 text embedding models, released in mid-2023. Three embedding models of different sizes (small / base / large) are provided,…

Computation and Language · Computer Science 2024-02-09 Liang Wang , Nan Yang , Xiaolong Huang , Linjun Yang , Rangan Majumder , Furu Wei

StarCoder 2 and The Stack v2: The Next Generation

The BigCode project, an open-scientific collaboration focused on the responsible development of Large Language Models for Code (Code LLMs), introduces StarCoder2. In partnership with Software Heritage (SWH), we build The Stack v2 on top of…

Software Engineering · Computer Science 2024-03-01 Anton Lozhkov , Raymond Li , Loubna Ben Allal , Federico Cassano , Joel Lamy-Poirier , Nouamane Tazi , Ao Tang , Dmytro Pykhtar , Jiawei Liu , Yuxiang Wei , Tianyang Liu , Max Tian , Denis Kocetkov , Arthur Zucker , Younes Belkada , Zijian Wang , Qian Liu , Dmitry Abulkhanov , Indraneil Paul , Zhuang Li , Wen-Ding Li , Megan Risdal , Jia Li , Jian Zhu , Terry Yue Zhuo , Evgenii Zheltonozhskii , Nii Osae Osae Dade , Wenhao Yu , Lucas Krauß , Naman Jain , Yixuan Su , Xuanli He , Manan Dey , Edoardo Abati , Yekun Chai , Niklas Muennighoff , Xiangru Tang , Muhtasham Oblokulov , Christopher Akiki , Marc Marone , Chenghao Mou , Mayank Mishra , Alex Gu , Binyuan Hui , Tri Dao , Armel Zebaze , Olivier Dehaene , Nicolas Patry , Canwen Xu , Julian McAuley , Han Hu , Torsten Scholak , Sebastien Paquet , Jennifer Robinson , Carolyn Jane Anderson , Nicolas Chapados , Mostofa Patwary , Nima Tajbakhsh , Yacine Jernite , Carlos Muñoz Ferrandis , Lingming Zhang , Sean Hughes , Thomas Wolf , Arjun Guha , Leandro von Werra , Harm de Vries

Large Language Models Versus Static Code Analysis Tools: A Systematic Benchmark for Vulnerability Detection

Modern software relies on a multitude of automated testing and quality assurance tools to prevent errors, bugs and potential vulnerabilities. This study sets out to provide a head-to-head, quantitative and qualitative evaluation of six…

Software Engineering · Computer Science 2025-08-07 Damian Gnieciak , Tomasz Szandala

StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models

Large Language Models (LLMs) have witnessed remarkable advancements in recent years, prompting the exploration of tool learning, which integrates LLMs with external tools to address diverse real-world challenges. Assessing the capability of…

Computation and Language · Computer Science 2025-03-06 Zhicheng Guo , Sijie Cheng , Hao Wang , Shihao Liang , Yujia Qin , Peng Li , Zhiyuan Liu , Maosong Sun , Yang Liu

A Static Evaluation of Code Completion by Large Language Models

Large language models trained on code have shown great potential to increase productivity of software developers. Several execution-based benchmarks have been proposed to evaluate functional correctness of model-generated code on simple…

Computation and Language · Computer Science 2023-06-07 Hantian Ding , Varun Kumar , Yuchen Tian , Zijian Wang , Rob Kwiatkowski , Xiaopeng Li , Murali Krishna Ramanathan , Baishakhi Ray , Parminder Bhatia , Sudipta Sengupta , Dan Roth , Bing Xiang

Nanbeige4-3B Technical Report: Exploring the Frontier of Small Language Models

We present Nanbeige4-3B, a family of small-scale but high-performing language models. Pretrained on 23T high-quality tokens and finetuned on over 30 million diverse instructions, we extend the boundary of the scaling law for small language…

Computation and Language · Computer Science 2025-12-09 Chen Yang , Guangyue Peng , Jiaying Zhu , Ran Le , Ruixiang Feng , Tao Zhang , Wei Ruan , Xiaoqi Liu , Xiaoxue Cheng , Xiyun Xu , Yang Song , Yanzipeng Gao , Yiming Jia , Yun Xing , Yuntao Wen , Zekai Wang , Zhenwei An , Zhicong Sun , Zongchao Chen

DynaCode: A Dynamic Complexity-Aware Code Benchmark for Evaluating Large Language Models in Code Generation

The rapid advancement of large language models (LLMs) has significantly improved their performance in code generation tasks. However, existing code benchmarks remain static, consisting of fixed datasets with predefined problems. This makes…

Computation and Language · Computer Science 2025-05-30 Wenhao Hu , Jinhao Duan , Chunchen Wei , Li Zhang , Yue Zhang , Kaidi Xu

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

Large Language Models (LLMs) demonstrate strong performance in real-world applications, yet existing open-source instruction datasets often concentrate on narrow domains, such as mathematics or coding, limiting generalization and widening…

Computation and Language · Computer Science 2025-06-16 Jijie Li , Li Du , Hanyu Zhao , Bo-wen Zhang , Liangdong Wang , Boyan Gao , Guang Liu , Yonghua Lin

Instella: Fully Open Language Models with Stellar Performance

Large language models (LLMs) have demonstrated remarkable performance across a wide range of tasks, yet the majority of high-performing models remain closed-source or partially open, limiting transparency and reproducibility. In this work,…

Computation and Language · Computer Science 2025-11-17 Jiang Liu , Jialian Wu , Xiaodong Yu , Yusheng Su , Prakamya Mishra , Gowtham Ramesh , Sudhanshu Ranjan , Chaitanya Manem , Ximeng Sun , Ze Wang , Pratik Prabhanjan Brahma , Zicheng Liu , Emad Barsoum

CodeAssistBench (CAB): Dataset & Benchmarking for Multi-turn Chat-Based Code Assistance

Programming assistants powered by large language models have improved dramatically, yet existing benchmarks still evaluate them in narrow code-generation settings. Recent efforts such as InfiBench and StackEval rely on Stack Overflow…

Software Engineering · Computer Science 2026-01-16 Myeongsoo Kim , Shweta Garg , Baishakhi Ray , Varun Kumar , Anoop Deoras

Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model

Diffusion-based language models (DLLMs) offer non-sequential, block-wise generation and richer data reuse compared to autoregressive (AR) models, but existing code DLLMs still lag behind strong AR baselines under comparable budgets. We…

Computation and Language · Computer Science 2026-01-26 Chenghao Fan , Wen Heng , Bo Li , Sichen Liu , Yuxuan Song , Jing Su , Xiaoye Qu , Kai Shen , Wei Wei

ML2B: Multi-Lingual ML Benchmark For AutoML

Large language models (LLMs) have recently demonstrated strong capabilities in generating machine learning (ML) code, enabling end-to-end pipeline construction from natural language instructions. However, existing benchmarks for ML code…

Computation and Language · Computer Science 2025-10-07 Ekaterina Trofimova , Zosia Shamina , Maria Selifanova , Artem Zaitsev , Remi Savchuk , Maxim Minets , Daria Ozerova , Emil Sataev , Denis Zuenko , Andrey E. Ustyuzhanin

TeleChat Technical Report

In this technical report, we present TeleChat, a collection of large language models (LLMs) with parameters of 3 billion, 7 billion and 12 billion. It includes pretrained language models as well as fine-tuned chat models that are aligned…

Computation and Language · Computer Science 2024-04-03 Zhongjiang He , Zihan Wang , Xinzhang Liu , Shixuan Liu , Yitong Yao , Yuyao Huang , Xuelong Li , Yongxiang Li , Zhonghao Che , Zhaoxi Zhang , Yan Wang , Xin Wang , Luwen Pu , Huinan Xu , Ruiyu Fang , Yu Zhao , Jie Zhang , Xiaomeng Huang , Zhilong Lu , Jiaxin Peng , Wenjun Zheng , Shiquan Wang , Bingkai Yang , Xuewei He , Zhuoru Jiang , Qiyi Xie , Yanhan Zhang , Zhongqiu Li , Lingling Shi , Weiwei Fu , Yin Zhang , Zilu Huang , Sishi Xiong , Yuxiang Zhang , Chao Wang , Shuangyong Song

Large Language Models Meet NL2Code: A Survey

The task of generating code from a natural language description, or NL2Code, is considered a pressing and significant challenge in code intelligence. Thanks to the rapid development of pre-training techniques, surging large language models…

Software Engineering · Computer Science 2023-05-09 Daoguang Zan , Bei Chen , Fengji Zhang , Dianjie Lu , Bingchao Wu , Bei Guan , Yongji Wang , Jian-Guang Lou

A Language Model of Java Methods with Train/Test Deduplication

This tool demonstration presents a research toolkit for a language model of Java source code. The target audience includes researchers studying problems at the granularity level of subroutines, statements, or variables in Java. In contrast…

Software Engineering · Computer Science 2023-05-16 Chia-Yi Su , Aakash Bansal , Vijayanta Jain , Sepideh Ghanavati , Collin McMillan

SecureCode: A Production-Grade Multi-Turn Dataset for Training Security-Aware Code Generation Models

AI coding assistants produce vulnerable code in 45% of security-relevant scenarios [veracode2025], yet no public training dataset teaches both traditional web security and AI/ML-specific defenses in a format suitable for instruction…

Cryptography and Security · Computer Science 2026-02-12 Scott Thornton