English
Related papers

Related papers: FastCode: Fast and Cost-Efficient Code Understandi…

200 papers

Code generation is a latency-sensitive task that demands high timeliness. However, with the growing interest and inherent difficulty in repository-level code generation, most existing code generation studies focus on improving the…

Artificial Intelligence · Computer Science 2025-10-01 Qianhui Zhao , Li Zhang , Fang Liu , Xiaoli Lian , Qiaoyuanhe Meng , Ziqian Jiao , Zetong Zhou , Jia Li , Lin Shi

Recent advancements in code-fluent Large Language Models (LLMs) enabled the research on repository-level code editing. In such tasks, the model navigates and modifies the entire codebase of a project according to request. Hence, such tasks…

Software Engineering · Computer Science 2024-06-10 Alexander Kovrigin , Aleksandra Eliseeva , Yaroslav Zharov , Timofey Bryksin

Large Language Models (LLMs) demonstrate strong capabilities in general coding tasks but encounter two key challenges when optimizing code: (i) the complexity of writing optimized code (such as performant CUDA kernels and competition-level…

Machine Learning · Computer Science 2026-01-12 Jiefu Ou , Sapana Chaudhary , Kaj Bostrom , Nathaniel Weir , Shuai Zhang , Huzefa Rangwala , George Karypis

In this work, we propose FastCoT, a model-agnostic framework based on parallel decoding without any further training of an auxiliary model or modification to the LLM itself. FastCoT uses a size-varying context window whose size changes with…

Computation and Language · Computer Science 2024-06-05 Hongxuan Zhang , Zhining Liu , Yao Zhao , Jiaqi Zheng , Chenyi Zhuang , Jinjie Gu , Guihai Chen

Despite the remarkable success of large language models (LLMs) on traditional natural language processing tasks, their planning ability remains a critical bottleneck in tackling complex multi-step reasoning tasks. Existing approaches mainly…

Computation and Language · Computer Science 2024-10-07 Jiaxin Wen , Jian Guan , Hongning Wang , Wei Wu , Minlie Huang

Current search techniques are limited to standard RAG query-document applications. In this paper, we propose a novel technique to expand the code and index for predicting the required APIs, directly enabling high-quality, end-to-end code…

Software Engineering · Computer Science 2025-10-01 Esakkivel Esakkiraja , Denis Akhiyarov , Aditya Shanmugham , Chitra Ganapathy

Understanding the purpose of source code is a critical task in software maintenance, onboarding, and modernization. While large language models (LLMs) have shown promise in generating code explanations, they often lack grounding in the…

Software Engineering · Computer Science 2025-11-06 Ziv Nevo , Orna Raz , Karen Yorav

Retrieval-Augmented Generation (RAG) frameworks aim to enhance Code Language Models (CLMs) by including another module for retrieving relevant context to construct the input prompt. However, these retrieval modules commonly use semantic…

Software Engineering · Computer Science 2025-10-16 Minh Nguyen

Reasoning-oriented Large Language Models (LLMs) often rely on generating explicit tokens step by step, and their effectiveness typically hinges on large-scale supervised fine-tuning or reinforcement learning. While Chain-of-Thought (CoT)…

Computation and Language · Computer Science 2025-09-30 Haoyu Zheng , Zhuonan Wang , Yuqian Yuan , Tianwei Lin , Wenqiao Zhang , Zheqi Lv , Juncheng Li , Siliang Tang , Yueting Zhuang , Hongyang He

While large language models (LLMs) excel at handling long-context sequences, they require substantial prefill computation and key-value (KV) cache, which can heavily burden computational efficiency and memory usage in both prefill and…

Machine Learning · Computer Science 2026-04-21 Dongwon Jo , Jiwon Song , Yulhwa Kim , Jae-Joon Kim

Quantitative research increasingly relies on unstructured financial content such as filings, earnings calls, and research notes, yet existing LLM and RAG pipelines struggle with point-in-time correctness, evidence attribution, and…

Computational Engineering, Finance, and Science · Computer Science 2025-09-29 Haoxue Wang , Keli Wen , Yuante Li , Qiancheng Qu , Xiangxu Mu , Xinjie Shen , Jiaqi Gao , Chenyang Chang , Chuhan Xie , San Yu Cheung , Zhuoyuan Hu , Xinyu Wang , Sirui Bi , Bi'an Du

Recent advances in large language models (LLMs) have demonstrated impressive capabilities in code-related tasks, such as code generation and automated program repair. Despite their promising performance, most existing approaches for code…

Software Engineering · Computer Science 2025-09-03 Yicong Zhao , Shisong Chen , Jiacheng Zhang , Zhixu Li

Code often suffers from performance bugs. These bugs necessitate the research and practice of code optimization. Traditional rule-based methods rely on manually designing and maintaining rules for specific performance bugs (e.g., redundant…

Software Engineering · Computer Science 2025-12-30 Yue Wu , Minghao Han , Ruiyin Li , Peng Liang , Amjed Tahir , Zengyang Li , Qiong Feng , Mojtaba Shahin

Recently deep learning based Natural Language Processing (NLP) models have shown great potential in the modeling of source code. However, a major limitation of these approaches is that they take source code as simple tokens of text and…

Neural and Evolutionary Computing · Computer Science 2020-07-15 Yasir Hussain , Zhiqiu Huang , Yu Zhou , Senzhang Wang

The goal of natural language semantic code search is to retrieve a semantically relevant code snippet from a fixed set of candidates using a natural language query. Existing approaches are neither effective nor efficient enough towards a…

Computation and Language · Computer Science 2021-10-18 Akhilesh Deepak Gotmare , Junnan Li , Shafiq Joty , Steven C. H. Hoi

Code intelligence is an emerging domain in software engineering, aiming to improve the effectiveness and efficiency of various code-related tasks. Recent research suggests that incorporating contextual information beyond the basic original…

Software Engineering · Computer Science 2026-02-10 Yanlin Wang , Kefeng Duan , Dewu Zheng , Ensheng Shi , Fengji Zhang , Yanli Wang , Jiachi Chen , Xilin Liu , Yuchi Ma , Hongyu Zhang , Qianxiang Wang , Zibin Zheng

The performance of Large Language Models (LLMs) is fundamentally determined by the contextual information provided during inference. This survey introduces Context Engineering, a formal discipline that transcends simple prompt design to…

Accurate requirement-to-code traceability is crucial for software maintenance. However, existing IR- and embedding-based methods are heavily dependent on lexical similarity, often yielding incomplete or inconsistent links across projects…

Software Engineering · Computer Science 2026-04-27 Yifei Wang , Jacky Keung , Xiaoxue Ma , Zhenyu Mao , Kehui Chen , Yishu Li

Context window efficiency is a practical constraint in large language model (LLM)-based developer tools. Paulsen [12] shows that all tested models degrade in accuracy well before their advertised context limits the Maximum Effective Context…

Software Engineering · Computer Science 2026-05-15 Shweta Mishra

Code completion can help developers improve efficiency and ease the development lifecycle. Although code completion is available in modern integrated development environments (IDEs), research lacks in determining what makes a good context…

Software Engineering · Computer Science 2025-10-13 Imranur Rahman , Md Rayhanur Rahman
‹ Prev 1 2 3 10 Next ›