Related papers: SWE-Pruner: Self-Adaptive Context Pruning for Codi…

Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training

Small language models (SLMs) have attracted considerable attention from both academia and industry due to their broad range of applications in edge devices. To obtain SLMs with strong performance, conventional approaches either pre-train…

Machine Learning · Computer Science 2025-11-17 Rui Pan , Shivanshu Shekhar , Boyao Wang , Shizhe Diao , Jipeng Zhang , Xingyuan Pan , Renjie Pi , Tong Zhang

SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He , Kaushik Roy

LLM-Pruner: On the Structural Pruning of Large Language Models

Large language models (LLMs) have shown remarkable capabilities in language understanding and generation. However, such impressive capability typically comes with a substantial model size, which presents significant challenges in both the…

Computation and Language · Computer Science 2023-09-29 Xinyin Ma , Gongfan Fang , Xinchao Wang

Context as a Tool: Context Management for Long-Horizon SWE-Agents

Agents based on large language models have recently shown strong potential on real-world software engineering (SWE) tasks that require long-horizon interaction with repository-scale codebases. However, most existing agents rely on…

Computation and Language · Computer Science 2025-12-29 Shukai Liu , Jian Yang , Bo Jiang , Yizhi Li , Jinyang Guo , Xianglong Liu , Bryan Dai

SWE-Edit: Rethinking Code Editing for Efficient SWE-Agent

Large language model agents have made strong progress on software engineering, yet current systems suffer from a context coupling problem: the standard code editing interface conflates code inspection, modification planning, and edit…

Software Engineering · Computer Science 2026-05-27 Yikai Zhang , Jiaxin Pei , Kenan Li , Qirui Jin , Maoquan Wang , Jin Pan , Yu Kang , Shengyu Fu , Elsie Nallipogu , Junjie Hu , Yufan Huang , Zijian Jin

Context Pruning for Coding Agents via Multi-Rubric Latent Reasoning

LLM-powered coding agents spend the majority of their token budget reading repository files, yet much of the retrieved code is irrelevant to the task at hand. Existing learned pruners compress this context with a single-objective sequence…

Artificial Intelligence · Computer Science 2026-05-18 Jingjing Wang , Xiwen Chen , Wenhui Zhu , Huayu Li , Zhengxiao He , Feiyang Cai , Ana S. Carreon-Rascon , Xuanzhao Dong , Feng Luo

Dynamic Context Pruning for Efficient and Interpretable Autoregressive Transformers

Autoregressive Transformers adopted in Large Language Models (LLMs) are hard to scale to long sequences. Despite several works trying to reduce their computational cost, most of LLMs still adopt attention layers between all pairs of tokens…

Computation and Language · Computer Science 2024-06-03 Sotiris Anagnostidis , Dario Pavllo , Luca Biggio , Lorenzo Noci , Aurelien Lucchi , Thomas Hofmann

SWE Context Bench: A Benchmark for Context Learning in Coding

Large language models are increasingly used as coding agents for software engineering tasks. Current benchmarks mainly evaluate whether the agent can correctly solve the request or fix the bugs. They largely treat tasks as independent and…

Software Engineering · Computer Science 2026-05-07 Jiayuan Zhu , Junde Wu , Minhao Hu , Shengda Zhu , Jiazhen Pan , Weixiang Shen , Yijun Yang , Fenglin Liu , Jianye Hao , Yueming Jin , Qirong Ho , Min Xu

SWE-TRACE: Optimizing Long-Horizon SWE Agents Through Rubric Process Reward Models and Heuristic Test-Time Scaling

Resolving real-world software engineering (SWE) issues with autonomous agents requires complex, long-horizon reasoning. Current pipelines are bottlenecked by unoptimized demonstration data, sparse execution rewards, and computationally…

Software Engineering · Computer Science 2026-04-17 Hao Han , Jin Xie , Xuehao Ma , Weiquan Zhu , Ziyao Zhang , ZhiLiang Long , Hongkai Chen , Qingwen Ye

The Limits of Long-Context Reasoning in Automated Bug Fixing

Rapidly increasing context lengths have led to the assumption that large language models (LLMs) can directly reason over entire codebases. Concurrently, recent advances in LLMs have enabled strong performance on software engineering…

Software Engineering · Computer Science 2026-03-09 Ravi Raju , Mengmeng Ji , Shubhangi Upasani , Bo Li , Urmish Thakker

AgenticPruner: MAC-Constrained Neural Network Compression via LLM-Driven Strategy Search

Neural network pruning remains essential for deploying deep learning models on resource-constrained devices, yet existing approaches primarily target parameter reduction without directly controlling computational cost. This yields…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Shahrzad Esmat , Mahdi Banisharif , Ali Jannesari

LLMs can Compress LLMs: Adaptive Pruning by Agents

As Large Language Models (LLMs) continue to scale, post-training pruning has emerged as a promising approach to reduce computational costs while preserving performance. Existing methods such as SparseGPT and Wanda achieve high sparsity…

Computation and Language · Computer Science 2026-01-15 Sai Varun Kodathala , Rakesh Vunnam

SWE-rebench: An Automated Pipeline for Task Collection and Decontaminated Evaluation of Software Engineering Agents

LLM-based agents have shown promising capabilities in a growing range of software engineering (SWE) tasks. However, advancing this field faces two critical challenges. First, high-quality training data is scarce, especially data that…

Software Engineering · Computer Science 2025-11-05 Ibragim Badertdinov , Alexander Golubev , Maksim Nekrashevich , Anton Shevtsov , Simon Karasik , Andrei Andriushchenko , Maria Trofimova , Daria Litvintseva , Boris Yangel

SWE-Shepherd: Advancing PRMs for Reinforcing Code Agents

Automating real-world software engineering tasks remains challenging for large language model (LLM)-based agents due to the need for long-horizon reasoning over large, evolving codebases and making consistent decisions across interdependent…

Software Engineering · Computer Science 2026-04-14 Mahir Labib Dihan , Md Ashrafur Rahman Khan

How Many Parameters Does Your Task Really Need? Task Specific Pruning with LLM-Sieve

As Large Language Models (LLMs) are increasingly deployed for narrow tasks in resource-constrained settings, a central question arises: how much of an LLM is truly necessary for a given task? We present LLM-Sieve, a framework that prunes…

Machine Learning · Computer Science 2025-10-07 Waleed Reda , Abhinav Jangda , Krishna Chintalapudi

Adapting LLMs for Efficient Context Processing through Soft Prompt Compression

The rapid advancement of Large Language Models (LLMs) has inaugurated a transformative epoch in natural language processing, fostering unprecedented proficiency in text generation, comprehension, and contextual scrutiny. Nevertheless,…

Machine Learning · Computer Science 2024-04-22 Cangqing Wang , Yutian Yang , Ruisi Li , Dan Sun , Ruicong Cai , Yuzhu Zhang , Chengqian Fu , Lillian Floyd

SWE-Fuse: Empowering Software Agents via Issue-free Trajectory Learning and Entropy-aware RLVR Training

Large language models (LLMs) have transformed the software engineering landscape. Recently, numerous LLM-based agents have been developed to address real-world software issue fixing tasks. Despite their state-of-the-art performance, Despite…

Software Engineering · Computer Science 2026-03-10 Xin-Cheng Wen , Binbin Chen , Haoxuan Lan , Hang Yu , Peng Di , Cuiyun Gao

SWE-Prot\'eg\'e: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents

Small language models (SLMs) offer compelling advantages in cost, latency, and adaptability, but have so far lagged behind larger models on long-horizon software engineering tasks such as SWE-bench, where they suffer from pervasive action…

Software Engineering · Computer Science 2026-02-26 Patrick Tser Jern Kon , Archana Pradeep , Ang Chen , Alexander P. Ellis , Warren Hunt , Zijian Wang , John Yang , Samuel Thompson

Density-aware Soft Context Compression with Semi-Dynamic Compression Ratio

Soft context compression reduces the computational workload of processing long contexts in LLMs by encoding long context into a smaller number of latent tokens. However, existing frameworks apply uniform compression ratios, failing to…

Computation and Language · Computer Science 2026-03-30 Yijiong Yu , Shuai Yuan , Jie Zheng , Huazheng Wang , Ji Pei

SWE-Compass: Towards Unified Evaluation of Agentic Coding Abilities for Large Language Models

Evaluating large language models (LLMs) for software engineering has been limited by narrow task coverage, language bias, and insufficient alignment with real-world developer workflows. Existing benchmarks often focus on algorithmic…

Software Engineering · Computer Science 2025-11-12 Jingxuan Xu , Ken Deng , Weihao Li , Songwei Yu , Huaixi Tang , Haoyang Huang , Zhiyi Lai , Zizheng Zhan , Yanan Wu , Chenchen Zhang , Kepeng Lei , Yifan Yao , Xinping Lei , Wenqiang Zhu , Zongxian Feng , Han Li , Junqi Xiong , Dailin Li , Zuchen Gao , Kun Wu , Wen Xiang , Ziqi Zhan , Yuanxing Zhang , Wuxuan Gong , Ziyuan Gao , Guanxiang Wang , Yirong Xue , Mengtong Li , Mengfei Xie , Xiaojiang Zhang , Jinghui Wang , Wenhao Zhuang , Zheng Lin , Huiming Wang , Zhaoxiang Zhang , Yuqun Zhang , Haotian Zhang , Bin Chen , Jiaheng Liu