Related papers: SynConfRoute: Syntax-Aware Routing for Efficient C…

aiXcoder-7B: A Lightweight and Effective Large Language Model for Code Processing

Large Language Models (LLMs) have been widely used in code completion, and researchers are focusing on scaling up LLMs to improve their accuracy. However, larger LLMs have lower inference efficiency, affecting developers' experience and…

Computation and Language · Computer Science 2025-01-17 Siyuan Jiang, Jia Li, He Zong, Huanyu Liu, Hao Zhu, Shukai Hu, Erlu Li, Jiazheng Ding, Yu Han, Wei Ning, Gen Wang, Yihong Dong, Kechi Zhang, Ge Li

SynthCoder: A Synthetical Strategy to Tune LLMs for Code Completion

Code completion is a prominent application of Large Language Models (LLMs) in software engineering. Due to the near real-time response requirements of this task, base models with small to medium-sized parameters are typically employed,…

Software Engineering · Computer Science 2025-09-18 Dongjun Yu, Xiao Yan, Zhenrui Li, Jipeng Xiao, Haochuan He, Yongda Yu, Hao Zhang, Guoping Rong, Xiaobo Huang

CodeFuse-13B: A Pretrained Multi-lingual Code Large Language Model

Code Large Language Models (Code LLMs) have gained significant attention in the industry due to their wide applications in the full lifecycle of software engineering. However, the effectiveness of existing models in understanding…

Software Engineering · Computer Science 2024-01-18 Peng Di, Jianguo Li, Hang Yu, Wei Jiang, Wenting Cai, Yang Cao, Chaoyu Chen, Dajun Chen, Hongwei Chen, Liang Chen, Gang Fan, Jie Gong, Zi Gong, Wen Hu, Tingting Guo, Zhichao Lei, Ting Li, Zheng Li, Ming Liang, Cong Liao, Bingchang Liu, Jiachen Liu, Zhiwei Liu, Shaojun Lu, Min Shen, Guangpei Wang, Huan Wang, Zhi Wang, Zhaogui Xu, Jiawei Yang, Qing Ye, Gehao Zhang, Yu Zhang, Zelin Zhao, Xunjin Zheng, Hailian Zhou, Lifu Zhu, Xianying Zhu

MiniCPM-V 4.5: Cooking Efficient MLLMs via Architecture, Data, and Training Recipe

Multimodal Large Language Models (MLLMs) are undergoing rapid progress and represent the frontier of AI development. However, their training and inference efficiency have emerged as a core bottleneck in making MLLMs more accessible and…

Machine Learning · Computer Science 2025-09-24 Tianyu Yu, Zefan Wang, Chongyi Wang, Fuwei Huang, Wenshuo Ma, Zhihui He, Tianchi Cai, Weize Chen, Yuxiang Huang, Yuanqian Zhao, Bokai Xu, Junbo Cui, Yingjing Xu, Liqing Ruan, Luoyuan Zhang, Hanyu Liu, Jingkun Tang, Hongyuan Liu, Qining Guo, Wenhao Hu, Bingxiang He, Jie Zhou, Jie Cai, Ji Qi, Zonghao Guo, Chi Chen, Guoyang Zeng, Yuxuan Li, Ganqu Cui, Ning Ding, Xu Han, Yuan Yao, Zhiyuan Liu, Maosong Sun

Evaluating Small Language Models for Front-Door Routing: A Harmonized Benchmark and Synthetic-Traffic Experiment

Selecting the appropriate model at inference time -- the routing problem -- requires jointly optimizing output quality, cost, latency, and governance constraints. Existing approaches delegate this decision to LLM-based classifiers or…

Networking and Internet Architecture · Computer Science 2026-04-06 Warren Johnson, Charles Lee

SynapseRoute: An Auto-Route Switching Framework on Dual-State Large Language Model

With the widespread adoption of large language models (LLMs) in practical applications, selecting an appropriate model requires balancing not only performance but also operational cost. The emergence of reasoning-capable models has further…

Computation and Language · Computer Science 2025-07-04 Wencheng Zhang, Shiqin Qiao, Lingjie Luo, Yinfeng Li, Chuanyang Zheng, Qian Xu, Meng Li, Yong Gui, Yijun He, Jianing Qiu, Jindong Hong, Jiankai Sun

A Hierarchical and Evolvable Benchmark for Fine-Grained Code Instruction Following with Multi-Turn Feedback

Large language models (LLMs) have advanced significantly in code generation, yet their ability to follow complex programming instructions with layered and diverse constraints remains underexplored. Existing benchmarks often prioritize…

Software Engineering · Computer Science 2025-07-02 Guoliang Duan, Mingwei Liu, Yanlin Wang, Chong Wang, Xin Peng, Zibin Zheng

ConsRoute: Consistency-Aware Adaptive Query Routing for Cloud-Edge-Device Large Language Models

Large language models (LLMs) deliver impressive capabilities but incur substantial inference latency and cost, which hinders their deployment in latency-sensitive and resource-constrained scenarios. Cloud-edge-device collaborative inference…

Artificial Intelligence · Computer Science 2026-03-24 Haoyu Qiao, Hao Zhang, Shanwen Mao, Siyao Cheng, Jie Liu

SelfCodeAlign: Self-Alignment for Code Generation

Instruction tuning is a supervised fine-tuning approach that significantly improves the ability of large language models (LLMs) to follow human instructions. We propose SelfCodeAlign, the first fully transparent and permissive pipeline for…

Computation and Language · Computer Science 2024-11-04 Yuxiang Wei, Federico Cassano, Jiawei Liu, Yifeng Ding, Naman Jain, Zachary Mueller, Harm de Vries, Leandro von Werra, Arjun Guha, Lingming Zhang

Evaluating and Achieving Controllable Code Completion in Code LLM

Code completion has become a central task, gaining significant attention with the rise of large language model (LLM)-based tools in software engineering. Although recent advances have greatly improved LLMs' code completion abilities,…

Software Engineering · Computer Science 2026-01-23 Jiajun Zhang, Zeyu Cui, Lei Zhang, Jian Yang, Jiaxi Yang, Qiang Liu, Zilei Wang, Binyuan Hui, Liang Wang, Junyang Lin

CodeLutra: Boosting LLM Code Generation via Preference-Guided Refinement

Large Language Models (LLMs) have revolutionized code generation but require significant resources and often over-generalize, limiting their task-specific efficiency. Fine-tuning smaller, open-source LLMs provides a cost-effective…

Computation and Language · Computer Science 2025-06-27 Leitian Tao, Xiang Chen, Tong Yu, Tung Mai, Ryan Rossi, Yixuan Li, Saayan Mitra

Code Generation with Small Language Models: A Codeforces-Based Study

Large Language Models (LLMs) demonstrate capabilities in code generation, potentially boosting developer productivity. However, their adoption remains limited by high computational costs, among other factors. Small Language Models (SLMs)…

Software Engineering · Computer Science 2025-09-23 Débora Souza, Rohit Gheyi, Lucas Albuquerque, Gustavo Soares, Márcio Ribeiro

BigCodeBench: Benchmarking Code Generation with Diverse Function Calls and Complex Instructions

Task automation has been greatly empowered by the recent advances in Large Language Models (LLMs) via Python code, with tasks ranging from software engineering development to general-purpose reasoning. While current benchmarks have…

Software Engineering · Computer Science 2025-04-02 Terry Yue Zhuo, Minh Chien Vu, Jenny Chim, Han Hu, Wenhao Yu, Ratnadira Widyasari, Imam Nur Bani Yusuf, Haolan Zhan, Junda He, Indraneil Paul, Simon Brunner, Chen Gong, Thong Hoang, Armel Randy Zebaze, Xiaoheng Hong, Wen-Ding Li, Jean Kaddour, Ming Xu, Zhihan Zhang, Prateek Yadav, Naman Jain, Alex Gu, Zhoujun Cheng, Jiawei Liu, Qian Liu, Zijian Wang, Binyuan Hui, Niklas Muennighoff, David Lo, Daniel Fried, Xiaoning Du, Harm de Vries, Leandro Von Werra

Optimizing Large Language Models for OpenAPI Code Completion

Recent advancements in Large Language Models (LLMs) and their utilization in code generation tasks have significantly reshaped the field of software development. Despite the remarkable efficacy of code completion solutions in mainstream…

Software Engineering · Computer Science 2024-06-12 Bohdan Petryshyn, Mantas Lukoševičius

PerfRL: A Small Language Model Framework for Efficient Code Optimization

Code optimization is a challenging task requiring a substantial level of expertise from developers. Nonetheless, this level of human capacity is not sufficient considering the rapid evolution of new hardware architectures and software…

Machine Learning · Computer Science 2025-03-11 Shukai Duan, Nikos Kanakaris, Xiongye Xiao, Heng Ping, Chenyu Zhou, Nesreen K. Ahmed, Guixiang Ma, Mihai Capota, Theodore L. Willke, Shahin Nazarian, Paul Bogdan

Language Models for Code Completion: A Practical Evaluation

Transformer-based language models for automatic code completion have shown great promise so far, yet the evaluation of these models rarely uses real data. This study provides both quantitative and qualitative assessments of three public…

Software Engineering · Computer Science 2024-02-27 Maliheh Izadi, Jonathan Katzy, Tim van Dam, Marc Otten, Razvan Mihai Popescu, Arie van Deursen

Syntax Without Semantics: Teaching Large Language Models to Code in an Unseen Language

Large language models (LLMs) achieve high pass rates on code generation benchmarks, yet whether they can transfer this ability to languages absent from pretraining remains poorly understood. We introduce PyLang, a minimal imperative…

Computation and Language · Computer Science 2026-05-18 Vinayshekhar Bannihatti Kumar, Disha Makhija, Manoj Ghuhan Arivazhagan, Rashmi Gangadharaiah

PerfCodeGen: Improving Performance of LLM Generated Code with Execution Feedback

Large Language Models (LLMs) are widely adopted for assisting in software development tasks, yet their performance evaluations have narrowly focused on the functional correctness of generated code. Human programmers, however, require…

Software Engineering · Computer Science 2024-12-06 Yun Peng, Akhilesh Deepak Gotmare, Michael Lyu, Caiming Xiong, Silvio Savarese, Doyen Sahoo

From Context to Intent: Reasoning-Guided Function-Level Code Completion

The growing capabilities of Large Language Models (LLMs) have led to their widespread adoption for function completion within code repositories. Recent studies on such tasks show promising results when explicit instructions, often in the…

Software Engineering · Computer Science 2026-03-25 Yanzhou Li, Tianlin Li, Yiran Zhang, Shangqing Liu, Aishan Liu, Xianglong Liu, Yang Liu

TRACE: Evaluating Execution Efficiency of LLM-Based Code Translation

While Large Language Models (LLMs) have substantially improved the functional correctness of code translation, the critical dimension of execution efficiency remains overlooked. We present TRACE, the first…

Software Engineering · Computer Science 2026-04-15 Zhihao Gong, Zeyu Sun, Dong Huang, Qingyuan Liang, Jie M. Zhang, Dan Hao