English
Related papers

Related papers: SWE-Pruner: Self-Adaptive Context Pruning for Codi…

200 papers

Small language models (SLMs) have attracted considerable attention from both academia and industry due to their broad range of applications in edge devices. To obtain SLMs with strong performance, conventional approaches either pre-train…

Machine Learning · Computer Science 2025-11-17 Rui Pan , Shivanshu Shekhar , Boyao Wang , Shizhe Diao , Jipeng Zhang , Xingyuan Pan , Renjie Pi , Tong Zhang

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He , Kaushik Roy

Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the…

Computation and Language · Computer Science 2023-09-29 Xinyin Ma , Gongfan Fang , Xinchao Wang

Agents based on large language models have recently shown strong potential on real-world software engineering (SWE) tasks that require long-horizon interaction with repository-scale codebases. However, most existing agents rely on…

Computation and Language · Computer Science 2025-12-29 Shukai Liu , Jian Yang , Bo Jiang , Yizhi Li , Jinyang Guo , Xianglong Liu , Bryan Dai

Large language model agents have made strong progress on software engineering, yet current systems suffer from a context coupling problem: the standard code editing interface conflates code inspection, modification planning, and edit…

Software Engineering · Computer Science 2026-05-27 Yikai Zhang , Jiaxin Pei , Kenan Li , Qirui Jin , Maoquan Wang , Jin Pan , Yu Kang , Shengyu Fu , Elsie Nallipogu , Junjie Hu , Yufan Huang , Zijian Jin

LLM-powered coding agents spend the majority of their token budget reading repository files, yet much of the retrieved code is irrelevant to the task at hand. Existing learned pruners compress this context with a single-objective sequence…

Artificial Intelligence · Computer Science 2026-05-18 Jingjing Wang , Xiwen Chen , Wenhui Zhu , Huayu Li , Zhengxiao He , Feiyang Cai , Ana S. Carreon-Rascon , Xuanzhao Dong , Feng Luo

Autoregressive Transformers adopted in Large Language Models (LLMs) are hard to scale to long sequences. Despite several works trying to reduce their computational cost, most of LLMs still adopt attention layers between all pairs of tokens…

Computation and Language · Computer Science 2024-06-03 Sotiris Anagnostidis , Dario Pavllo , Luca Biggio , Lorenzo Noci , Aurelien Lucchi , Thomas Hofmann

Large language models are increasingly used as coding agents for software engineering tasks. Current benchmarks mainly evaluate whether the agent can correctly solve the request or fix the bugs. They largely treat tasks as independent and…

Software Engineering · Computer Science 2026-05-07 Jiayuan Zhu , Junde Wu , Minhao Hu , Shengda Zhu , Jiazhen Pan , Weixiang Shen , Yijun Yang , Fenglin Liu , Jianye Hao , Yueming Jin , Qirong Ho , Min Xu

Resolving real-world software engineering (SWE) issues with autonomous agents requires complex, long-horizon reasoning. Current pipelines are bottlenecked by unoptimized demonstration data, sparse execution rewards, and computationally…

Software Engineering · Computer Science 2026-04-17 Hao Han , Jin Xie , Xuehao Ma , Weiquan Zhu , Ziyao Zhang , ZhiLiang Long , Hongkai Chen , Qingwen Ye

Rapidly increasing context lengths have led to the assumption that large language models (LLMs) can directly reason over entire codebases. Concurrently, recent advances in LLMs have enabled strong performance on software engineering…

Software Engineering · Computer Science 2026-03-09 Ravi Raju , Mengmeng Ji , Shubhangi Upasani , Bo Li , Urmish Thakker

Neural network pruning remains essential for deploying deep learning models on resource-constrained devices, yet existing approaches primarily target parameter reduction without directly controlling computational cost. This yields…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Shahrzad Esmat , Mahdi Banisharif , Ali Jannesari

As Large Language Models (LLMs) continue to scale, post-training pruning has emerged as a promising approach to reduce computational costs while preserving performance. Existing methods such as SparseGPT and Wanda achieve high sparsity…

Computation and Language · Computer Science 2026-01-15 Sai Varun Kodathala , Rakesh Vunnam

LLM-based agents have shown promising capabilities in a growing range of software engineering (SWE) tasks. However, advancing this field faces two critical challenges. First, high-quality training data is scarce, especially data that…

Automating real-world software engineering tasks remains challenging for large language model (LLM)-based agents due to the need for long-horizon reasoning over large, evolving codebases and making consistent decisions across interdependent…

Software Engineering · Computer Science 2026-04-14 Mahir Labib Dihan , Md Ashrafur Rahman Khan

As Large Language Models (LLMs) are increasingly deployed for narrow tasks in resource-constrained settings, a central question arises: how much of an LLM is truly necessary for a given task? We present LLM-Sieve, a framework that prunes…

Machine Learning · Computer Science 2025-10-07 Waleed Reda , Abhinav Jangda , Krishna Chintalapudi

The rapid advancement of Large Language Models (LLMs) has inaugurated a transformative epoch in natural language processing, fostering unprecedented proficiency in text generation, comprehension, and contextual scrutiny. Nevertheless,…

Machine Learning · Computer Science 2024-04-22 Cangqing Wang , Yutian Yang , Ruisi Li , Dan Sun , Ruicong Cai , Yuzhu Zhang , Chengqian Fu , Lillian Floyd

Large language models (LLMs) have transformed the software engineering landscape. Recently, numerous LLM-based agents have been developed to address real-world software issue fixing tasks. Despite their state-of-the-art performance, Despite…

Software Engineering · Computer Science 2026-03-10 Xin-Cheng Wen , Binbin Chen , Haoxuan Lan , Hang Yu , Peng Di , Cuiyun Gao

Small language models (SLMs) offer compelling advantages in cost, latency, and adaptability, but have so far lagged behind larger models on long-horizon software engineering tasks such as SWE-bench, where they suffer from pervasive action…

Software Engineering · Computer Science 2026-02-26 Patrick Tser Jern Kon , Archana Pradeep , Ang Chen , Alexander P. Ellis , Warren Hunt , Zijian Wang , John Yang , Samuel Thompson

Soft context compression reduces the computational workload of processing long contexts in LLMs by encoding long context into a smaller number of latent tokens. However, existing frameworks apply uniform compression ratios, failing to…

Computation and Language · Computer Science 2026-03-30 Yijiong Yu , Shuai Yuan , Jie Zheng , Huazheng Wang , Ji Pei

Evaluating large language models (LLMs) for software engineering has been limited by narrow task coverage, language bias, and insufficient alignment with real-world developer workflows. Existing benchmarks often focus on algorithmic…

‹ Prev 1 2 3 10 Next ›