Related papers: CodeACT: Code Adaptive Compute-efficient Tuning Fr…

Data-efficient LLM Fine-tuning for Code Generation

Large language models (LLMs) have demonstrated significant potential in code generation tasks. However, there remains a performance gap between open-source and closed-source models. To address this gap, existing approaches typically…

Computation and Language · Computer Science 2025-04-18 Weijie Lv , Xuan Xia , Sheng-Jun Huang

AdaptiveLLM: A Framework for Selecting Optimal Cost-Efficient LLM for Code-Generation Based on CoT Length

While Large Language Models (LLMs) have significantly advanced code generation efficiency, they face inherent challenges in balancing performance and inference costs across diverse programming tasks. Dynamically selecting the optimal LLM…

Software Engineering · Computer Science 2025-06-13 Junhang Cheng , Fang Liu , Chengru Wu , Li Zhang

ACT: Bridging the Gap in Code Translation through Synthetic Data Generation & Adaptive Training

Code translation is a crucial process in software development and migration projects, enabling interoperability between different programming languages and enhancing software adaptability and thus longevity. Traditional automated…

Artificial Intelligence · Computer Science 2025-07-23 Shreya Saxena , Siva Prasad , Zishan Ahmad , Vishal Vaddina

Code Comparison Tuning for Code Large Language Models

We present Code Comparison Tuning (CCT), a simple and effective tuning method for code large language models (Code LLMs) to better handle subtle code errors. Specifically, we integrate the concept of comparison into instruction tuning, both…

Computation and Language · Computer Science 2024-06-06 Yufan Jiang , Qiaozhi He , Xiaomin Zhuang , Zhihua Wu

Efficient Code LLM Training via Distribution-Consistent and Diversity-Aware Data Selection

Recent advancements in large language models (LLMs) have significantly improved code generation and program comprehension, accelerating the evolution of software engineering. Current methods primarily enhance model performance by leveraging…

Computation and Language · Computer Science 2025-07-04 Weijie Lyu , Sheng-Jun Huang , Xuan Xia

Revisiting Test-Time Scaling: A Survey and a Diversity-Aware Method for Efficient Reasoning

Test-Time Scaling (TTS) improves the reasoning performance of Large Language Models (LLMs) by allocating additional compute during inference. We conduct a structured survey of TTS methods and categorize them into sampling-based,…

Computation and Language · Computer Science 2025-06-06 Ho-Lam Chung , Teng-Yun Hsiao , Hsiao-Ying Huang , Chunerh Cho , Jian-Ren Lin , Zhang Ziwei , Yun-Nung Chen

CodeEvolve: LLM-Driven Evolutionary Optimization with Runtime-Enriched Target Selection for Multi-Language Code Enhancement

We present CodeEvolve, an evolutionary framework for improving program performance and code quality with Large Language Models (LLMs). CodeEvolve extends OpenEvolve with runtime-guided target selection, Monte Carlo Tree Search (MCTS),…

Software Engineering · Computer Science 2026-05-07 Ajay Krishna Borra , Wenzhuo Yang , Samarth Arora , Akhilesh Deepak Gotmare , Gokulakrishnan Gopalakrishnan , Tharun Gali , Madhav Rathi , Doyen Sahoo , Manpreet Singh , Mayuresh Verma , Laksh Venka , Shuchita Singh

OpenCodeInstruct: A Large-scale Instruction Tuning Dataset for Code LLMs

Large Language Models (LLMs) have transformed software development by enabling code generation, automated debugging, and complex reasoning. However, their continued advancement is constrained by the scarcity of high-quality, publicly…

Software Engineering · Computer Science 2025-08-11 Wasi Uddin Ahmad , Aleksander Ficek , Mehrzad Samadi , Jocelyn Huang , Vahid Noroozi , Somshubra Majumdar , Boris Ginsburg

ADEPT: Continual Pretraining via Adaptive Expansion and Dynamic Decoupled Tuning

Conventional continual pretraining (CPT) for large language model (LLM) domain adaptation often suffers from catastrophic forgetting and limited domain capacity. Existing strategies adopt layer expansion, introducing additional trainable…

Machine Learning · Computer Science 2025-10-14 Jinyang Zhang , Yue Fang , Hongxin Ding , Weibin Liao , Muyang Ye , Xu Chu , Junfeng Zhao , Yasha Wang

ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning

Prompt Tuning has been a popular Parameter-Efficient Fine-Tuning method attributed to its remarkable performance with few updated parameters on various large-scale pretrained Language Models (PLMs). Traditionally, each prompt has been…

Computation and Language · Computer Science 2024-10-21 Yu-Chen Lin , Wei-Hua Li , Jun-Cheng Chen , Chu-Song Chen

COMPASS: COntinual Multilingual PEFT with Adaptive Semantic Sampling

Large language models (LLMs) often exhibit performance disparities across languages, with naive multilingual fine-tuning frequently degrading performance due to negative cross-lingual interference. To address this, we introduce COMPASS…

Machine Learning · Computer Science 2026-04-23 Noah Flynn

Beyond Templates: Dynamic Adaptation of Reasoning Demonstrations via Feasibility-Aware Exploration

Large language models (LLMs) have shown remarkable reasoning capabilities, yet aligning such abilities to small language models (SLMs) remains a challenge due to distributional mismatches and limited model capacity. Existing reasoning…

Computation and Language · Computer Science 2025-05-28 Yong Wu , Weihang Pan , Ke Li , Chen Binhui , Ping Li , Binbin Lin

Contextual Attention Modulation: Towards Efficient Multi-Task Adaptation in Large Language Models

Large Language Models (LLMs) possess remarkable generalization capabilities but struggle with multi-task adaptation, particularly in balancing knowledge retention with task-specific specialization. Conventional fine-tuning methods suffer…

Artificial Intelligence · Computer Science 2025-10-21 Dayan Pan , Zhaoyang Fu , Jingyuan Wang , Xiao Han , Yue Zhu , Xiangyu Zhao

Enhancing Code Generation Performance of Smaller Models by Distilling the Reasoning Ability of LLMs

Large Language Models (LLMs) have recently made significant advances in code generation through the 'Chain-of-Thought' prompting technique. This technique empowers the model to autonomously devise "solution plans" to tackle intricate…

Software Engineering · Computer Science 2024-03-21 Zhihong Sun , Chen Lyu , Bolun Li , Yao Wan , Hongyu Zhang , Ge Li , Zhi Jin

TAROT: Test-driven and Capability-adaptive Curriculum Reinforcement Fine-tuning for Code Generation with Large Language Models

Large Language Models (LLMs) are changing the coding paradigm, known as vibe coding, yet synthesizing algorithmically sophisticated and robust code still remains a critical challenge. Incentivizing the deep reasoning capabilities of LLMs is…

Computation and Language · Computer Science 2026-02-18 Chansung Park , Juyong Jiang , Fan Wang , Sayak Paul , Jiasi Shen , Jing Tang , Jianguo Li

CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

Open large language models (LLMs) have significantly advanced the field of natural language processing, showcasing impressive performance across various tasks.Despite the significant advancements in LLMs, their effective operation still…

Computation and Language · Computer Science 2025-04-16 Xuechen Liang , Yangfan He , Meiling Tao , Yinghui Xia , Jianhui Wang , Tianyu Shi , Jun Wang , JingSong Yang

SLA-Awareness for AI-assisted coding

The integration of AI-assisted coding tools within development environments drastically reduces development time, and allows developers to focus more on creative and critical aspects of software engineering through the use of Code Large…

Software Engineering · Computer Science 2025-03-26 Kishanthan Thangarajah , Arthur Leung , Boyuan Chen , Ahmed E. Hassan

ACECode: A Reinforcement Learning Framework for Aligning Code Efficiency and Correctness in Code Language Models

CodeLLMs have demonstrated remarkable advancements in software engineering tasks. However, while these models can generate functionally correct code, they often produce code that is inefficient in terms of runtime. This inefficiency is…

Software Engineering · Computer Science 2024-12-24 Chengran Yang , Hong Jin Kang , Jieke Shi , David Lo

Rethinking the Sampling Criteria in Reinforcement Learning for LLM Reasoning: A Competence-Difficulty Alignment Perspective

Reinforcement learning exhibits potential in enhancing the reasoning abilities of large language models, yet it is hard to scale for the low sample efficiency during the rollout phase. Existing methods attempt to improve efficiency by…

Machine Learning · Computer Science 2026-02-02 Deyang Kong , Qi Guo , Xiangyu Xi , Wei Wang , Jingang Wang , Xunliang Cai , Shikun Zhang , Wei Ye

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning

Recent advancements in Large Language Models (LLMs) have emphasized the critical role of fine-tuning (FT) techniques in adapting LLMs to specific tasks, especially when retraining from scratch is computationally infeasible. Fine-tuning…

Artificial Intelligence · Computer Science 2025-10-23 Xiao Han , Zimo Zhao , Wanyu Wang , Maolin Wang , Zitao Liu , Yi Chang , Xiangyu Zhao