Related papers: CodeScout: Contextual Problem Statement Enhancemen…

CodeScout: An Effective Recipe for Reinforcement Learning of Code Search Agents

A prerequisite for coding agents to perform tasks on large repositories is code localization - the identification of relevant files, classes, and functions to work on. While repository-level code localization has been performed using…

Software Engineering · Computer Science 2026-03-19 Lintang Sutawika , Aditya Bharat Soni , Bharath Sriraam R R , Apurva Gandhi , Taha Yassine , Sanidhya Vijayvargiya , Yuchen Li , Xuhui Zhou , Yilin Zhang , Leander Melroy Maben , Graham Neubig

ContextBench: A Benchmark for Context Retrieval in Coding Agents

LLM-based coding agents have shown strong performance on automated issue resolution benchmarks, yet existing evaluations largely focus on final task success, providing limited insight into how agents retrieve and use code context during…

Machine Learning · Computer Science 2026-02-12 Han Li , Letian Zhu , Bohan Zhang , Rili Feng , Jiaming Wang , Yue Pan , Earl T. Barr , Federica Sarro , Zhaoyang Chu , He Ye

Rethinking Dataset Discovery with DataScout

Dataset Search -- the process of finding appropriate datasets for a given task -- remains a critical yet under-explored challenge in data science workflows. Assessing dataset suitability for a task (e.g., training a classification model) is…

Human-Computer Interaction · Computer Science 2025-07-28 Rachel Lin , Bhavya Chopra , Wenjing Lin , Shreya Shankar , Madelon Hulsebos , Aditya G. Parameswaran

ContextBuddy: AI-Enhanced Contextual Insights for Security Alert Investigation (Applied to Intrusion Detection)

Modern Security Operations Centres (SOCs) integrate diverse tools, such as SIEM, IDS, and XDR systems, offering rich contextual data, including alert enrichments, flow features, and similar case histories. Yet, analysts must still manually…

Cryptography and Security · Computer Science 2025-06-12 Ronal Singh , Mohan Baruwal Chhetri , Surya Nepal , Cecile Paris

SWE Context Bench: A Benchmark for Context Learning in Coding

Large language models are increasingly used as coding agents for software engineering tasks. Current benchmarks mainly evaluate whether the agent can correctly solve the request or fix the bugs. They largely treat tasks as independent and…

Software Engineering · Computer Science 2026-05-07 Jiayuan Zhu , Junde Wu , Minhao Hu , Shengda Zhu , Jiazhen Pan , Weixiang Shen , Yijun Yang , Fenglin Liu , Jianye Hao , Yueming Jin , Qirong Ho , Min Xu

PaperScout: An Autonomous Agent for Academic Paper Search with Process-Aware Sequence-Level Policy Optimization

Academic paper search is a fundamental task in scientific research, yet most existing approaches rely on rigid, predefined workflows that struggle with complex, conditional queries. To address this limitation, we propose PaperScout, an…

Artificial Intelligence · Computer Science 2026-03-19 Tingyue Pan , Jie Ouyang , Mingyue Cheng , Qingchuan Li , Zirui Liu , Daoyu Wang , Mingfan Pan , Shuo Yu , Qi Liu

DeepCodeSeek: Real-Time API Retrieval for Context-Aware Code Generation

Current search techniques are limited to standard RAG query-document applications. In this paper, we propose a novel technique to expand the code and index for predicting the required APIs, directly enabling high-quality, end-to-end code…

Software Engineering · Computer Science 2025-10-01 Esakkivel Esakkiraja , Denis Akhiyarov , Aditya Shanmugham , Chitra Ganapathy

Towards an Understanding of Context Utilization in Code Intelligence

Code intelligence is an emerging domain in software engineering, aiming to improve the effectiveness and efficiency of various code-related tasks. Recent research suggests that incorporating contextual information beyond the basic original…

Software Engineering · Computer Science 2026-02-10 Yanlin Wang , Kefeng Duan , Dewu Zheng , Ensheng Shi , Fengji Zhang , Yanli Wang , Jiachi Chen , Xilin Liu , Yuchi Ma , Hongyu Zhang , Qianxiang Wang , Zibin Zheng

Spec Kit Agents: Context-Grounded Agentic Workflows

Spec-driven development (SDD) with AI coding agents provides a structured workflow, but agents often remain "context blind" in large, evolving repositories, leading to hallucinated APIs and architectural violations. We present Spec Kit…

Software Engineering · Computer Science 2026-04-08 Pardis Taghavi , Santosh Bhavani

Context-Augmented Code Generation: How Product Context Improves AI Coding Agent Decision Compliance by 49%

AI coding agents powered by large language models can read codebases and produce functional code, but they routinely violate team-specific product decisions that are invisible in the source code alone. We introduce a controlled benchmark…

Software Engineering · Computer Science 2026-05-12 Drew Dillon , Kasyap Varanasi

CoSQA+: Pioneering the Multi-Choice Code Search Benchmark with Test-Driven Agents

Semantic code search, retrieving code that matches a given natural language query, is an important task to improve productivity in software engineering. Existing code search datasets face limitations: they rely on human annotators who…

Software Engineering · Computer Science 2026-02-05 Jing Gong , Yanghui Wu , Linxi Liang , Yanlin Wang , Jiachi Chen , Mingwei Liu , Zibin Zheng

Context Engineering for Multi-Agent LLM Code Assistants Using Elicit, NotebookLM, ChatGPT, and Claude Code

Large Language Models (LLMs) have shown promise in automating code generation and software engineering tasks, yet they often struggle with complex, multi-file projects due to context limitations and knowledge gaps. We propose a novel…

Software Engineering · Computer Science 2025-08-13 Muhammad Haseeb

Developer Experiences with a Contextualized AI Coding Assistant: Usability, Expectations, and Outcomes

In the rapidly advancing field of artificial intelligence, software development has emerged as a key area of innovation. Despite the plethora of general-purpose AI assistants available, their effectiveness diminishes in complex,…

Software Engineering · Computer Science 2023-12-01 Gustavo Pinto , Cleidson de Souza , Thayssa Rocha , Igor Steinmacher , Alberto de Souza , Edward Monteiro

Enriching Source Code with Contextual Data for Code Completion Models: An Empirical Study

Transformer-based pre-trained models have recently achieved great results in solving many software engineering tasks including automatic code completion which is a staple in a developer's toolkit. While many have striven to improve the…

Computation and Language · Computer Science 2023-04-25 Tim van Dam , Maliheh Izadi , Arie van Deursen

From Abstract to Contextual: What LLMs Still Cannot Do in Mathematics

Large language models now solve many benchmark math problems at near-expert levels, yet this progress has not fully translated into reliable performance in real-world applications. We study this gap through contextual mathematical…

Artificial Intelligence · Computer Science 2026-04-06 Bowen Cao , Dongdong Zhang , Yixia Li , Junpeng Liu , Shijue Huang , Chufan Shi , Hongyuan Lu , Yaokang Wu , Guanhua Chen , Wai Lam , Furu Wei

Contextual Semantic Parsing using Crowdsourced Spatial Descriptions

We describe a contextual parser for the Robot Commands Treebank, a new crowdsourced resource. In contrast to previous semantic parsers that select the most-probable parse, we consider the different problem of parsing using additional…

Computation and Language · Computer Science 2014-05-02 Kais Dukes

A Framework To Improve User Story Sets Through Collaboration

Agile methodologies have become increasingly popular in recent years. Due to its inherent nature, agile methodologies involve stakeholders with a wide range of expertise and require interaction between them, relying on collaboration and…

Software Engineering · Computer Science 2023-01-25 Salih Göktuğ Köse , Fatma Başak Aydemir

Safe Exploration for Optimizing Contextual Bandits

Contextual bandit problems are a natural fit for many information retrieval tasks, such as learning to rank, text classification, recommendation, etc. However, existing learning methods for contextual bandit problems have one of two…

Information Retrieval · Computer Science 2020-02-06 Rolf Jagerman , Ilya Markov , Maarten de Rijke

LLM Agents Improve Semantic Code Search

Code Search is a key task that many programmers often have to perform while developing solutions to problems. Current methodologies suffer from an inability to perform accurately on prompts that contain some ambiguity or ones that require…

Software Engineering · Computer Science 2024-08-22 Sarthak Jain , Aditya Dora , Ka Seng Sam , Prabhat Singh

SWE-Adept: An LLM-Based Agentic Framework for Deep Codebase Analysis and Structured Issue Resolution

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He , Kaushik Roy