English
Related papers

Related papers: CodeScout: Contextual Problem Statement Enhancemen…

200 papers

A prerequisite for coding agents to perform tasks on large repositories is code localization - the identification of relevant files, classes, and functions to work on. While repository-level code localization has been performed using…

LLM-based coding agents have shown strong performance on automated issue resolution benchmarks, yet existing evaluations largely focus on final task success, providing limited insight into how agents retrieve and use code context during…

Machine Learning · Computer Science 2026-02-12 Han Li , Letian Zhu , Bohan Zhang , Rili Feng , Jiaming Wang , Yue Pan , Earl T. Barr , Federica Sarro , Zhaoyang Chu , He Ye

Dataset Search -- the process of finding appropriate datasets for a given task -- remains a critical yet under-explored challenge in data science workflows. Assessing dataset suitability for a task (e.g., training a classification model) is…

Human-Computer Interaction · Computer Science 2025-07-28 Rachel Lin , Bhavya Chopra , Wenjing Lin , Shreya Shankar , Madelon Hulsebos , Aditya G. Parameswaran

Modern Security Operations Centres (SOCs) integrate diverse tools, such as SIEM, IDS, and XDR systems, offering rich contextual data, including alert enrichments, flow features, and similar case histories. Yet, analysts must still manually…

Cryptography and Security · Computer Science 2025-06-12 Ronal Singh , Mohan Baruwal Chhetri , Surya Nepal , Cecile Paris

Large language models are increasingly used as coding agents for software engineering tasks. Current benchmarks mainly evaluate whether the agent can correctly solve the request or fix the bugs. They largely treat tasks as independent and…

Software Engineering · Computer Science 2026-05-07 Jiayuan Zhu , Junde Wu , Minhao Hu , Shengda Zhu , Jiazhen Pan , Weixiang Shen , Yijun Yang , Fenglin Liu , Jianye Hao , Yueming Jin , Qirong Ho , Min Xu

Academic paper search is a fundamental task in scientific research, yet most existing approaches rely on rigid, predefined workflows that struggle with complex, conditional queries. To address this limitation, we propose PaperScout, an…

Artificial Intelligence · Computer Science 2026-03-19 Tingyue Pan , Jie Ouyang , Mingyue Cheng , Qingchuan Li , Zirui Liu , Daoyu Wang , Mingfan Pan , Shuo Yu , Qi Liu

Current search techniques are limited to standard RAG query-document applications. In this paper, we propose a novel technique to expand the code and index for predicting the required APIs, directly enabling high-quality, end-to-end code…

Software Engineering · Computer Science 2025-10-01 Esakkivel Esakkiraja , Denis Akhiyarov , Aditya Shanmugham , Chitra Ganapathy

Code intelligence is an emerging domain in software engineering, aiming to improve the effectiveness and efficiency of various code-related tasks. Recent research suggests that incorporating contextual information beyond the basic original…

Software Engineering · Computer Science 2026-02-10 Yanlin Wang , Kefeng Duan , Dewu Zheng , Ensheng Shi , Fengji Zhang , Yanli Wang , Jiachi Chen , Xilin Liu , Yuchi Ma , Hongyu Zhang , Qianxiang Wang , Zibin Zheng

Spec-driven development (SDD) with AI coding agents provides a structured workflow, but agents often remain "context blind" in large, evolving repositories, leading to hallucinated APIs and architectural violations. We present Spec Kit…

Software Engineering · Computer Science 2026-04-08 Pardis Taghavi , Santosh Bhavani

AI coding agents powered by large language models can read codebases and produce functional code, but they routinely violate team-specific product decisions that are invisible in the source code alone. We introduce a controlled benchmark…

Software Engineering · Computer Science 2026-05-12 Drew Dillon , Kasyap Varanasi

Semantic code search, retrieving code that matches a given natural language query, is an important task to improve productivity in software engineering. Existing code search datasets face limitations: they rely on human annotators who…

Software Engineering · Computer Science 2026-02-05 Jing Gong , Yanghui Wu , Linxi Liang , Yanlin Wang , Jiachi Chen , Mingwei Liu , Zibin Zheng

Large Language Models (LLMs) have shown promise in automating code generation and software engineering tasks, yet they often struggle with complex, multi-file projects due to context limitations and knowledge gaps. We propose a novel…

Software Engineering · Computer Science 2025-08-13 Muhammad Haseeb

In the rapidly advancing field of artificial intelligence, software development has emerged as a key area of innovation. Despite the plethora of general-purpose AI assistants available, their effectiveness diminishes in complex,…

Software Engineering · Computer Science 2023-12-01 Gustavo Pinto , Cleidson de Souza , Thayssa Rocha , Igor Steinmacher , Alberto de Souza , Edward Monteiro

Transformer-based pre-trained models have recently achieved great results in solving many software engineering tasks including automatic code completion which is a staple in a developer's toolkit. While many have striven to improve the…

Computation and Language · Computer Science 2023-04-25 Tim van Dam , Maliheh Izadi , Arie van Deursen

Large language models now solve many benchmark math problems at near-expert levels, yet this progress has not fully translated into reliable performance in real-world applications. We study this gap through contextual mathematical…

Artificial Intelligence · Computer Science 2026-04-06 Bowen Cao , Dongdong Zhang , Yixia Li , Junpeng Liu , Shijue Huang , Chufan Shi , Hongyuan Lu , Yaokang Wu , Guanhua Chen , Wai Lam , Furu Wei

We describe a contextual parser for the Robot Commands Treebank, a new crowdsourced resource. In contrast to previous semantic parsers that select the most-probable parse, we consider the different problem of parsing using additional…

Computation and Language · Computer Science 2014-05-02 Kais Dukes

Agile methodologies have become increasingly popular in recent years. Due to its inherent nature, agile methodologies involve stakeholders with a wide range of expertise and require interaction between them, relying on collaboration and…

Software Engineering · Computer Science 2023-01-25 Salih Göktuğ Köse , Fatma Başak Aydemir

Contextual bandit problems are a natural fit for many information retrieval tasks, such as learning to rank, text classification, recommendation, etc. However, existing learning methods for contextual bandit problems have one of two…

Information Retrieval · Computer Science 2020-02-06 Rolf Jagerman , Ilya Markov , Maarten de Rijke

Code Search is a key task that many programmers often have to perform while developing solutions to problems. Current methodologies suffer from an inability to perform accurately on prompts that contain some ambiguity or ones that require…

Software Engineering · Computer Science 2024-08-22 Sarthak Jain , Aditya Dora , Ka Seng Sam , Prabhat Singh

Large language models (LLMs) exhibit strong performance on self-contained programming tasks. However, they still struggle with repository-level software engineering (SWE), which demands (1) deep codebase navigation with effective context…

Software Engineering · Computer Science 2026-05-27 Kang He , Kaushik Roy
‹ Prev 1 2 3 10 Next ›