Related papers: CodeMatcher: Searching Code Based on Sequential Se…

200 papers

Developers often search and reuse existing code snippets in the process of software development. Code search aims to retrieve relevant code snippets from a codebase according to natural language queries entered by the developer. Up to now,…

Software Engineering · Computer Science 2022-04-28 Yi Cheng, Li Kuang

Semantic code search is the task of retrieving relevant code snippet given a natural language query. Different from typical information retrieval tasks, code search requires to bridge the semantic gap between the programming language and…

Computation and Language · Computer Science 2022-01-28 Chen Wu, Ming Yan

Code search is to search reusable code snippets from source code corpus based on natural languages queries. Deep learning-based methods of code search have shown promising results. However, previous methods focus on retrieval accuracy but…

Software Engineering · Computer Science 2022-04-01 Wenchao Gu, Yanlin Wang, Lun Du, Hongyu Zhang, Shi Han, Dongmei Zhang, Michael R. Lyu

Neural networks with deep architectures have demonstrated significant performance improvements in computer vision, speech recognition, and natural language processing. The challenges in information retrieval (IR), however, are different…

Information Retrieval · Computer Science 2021-03-23 Bhaskar Mitra

Information retrieval (IR) plays a crucial role in locating relevant resources from vast amounts of data, and its applications have evolved from traditional knowledge bases to modern retrieval models (RMs). The emergence of large language…

Computation and Language · Computer Science 2023-12-13 Jiazhan Feng, Chongyang Tao, Xiubo Geng, Tao Shen, Can Xu, Guodong Long, Dongyan Zhao, Daxin Jiang

To obtain code snippets for reuse, programmers prefer to search for related documents, e.g., blogs or Q&A, instead of code itself. The major reason is due to the semantic diversity and mismatch between queries and code snippets. Deep…

Software Engineering · Computer Science 2020-08-18 Zhensu Sun, Yan Liu, Chen Yang, Yu Qian

Pre-trained code models have emerged as the state-of-the-art paradigm for code search tasks. The paradigm involves pre-training the model on search-irrelevant tasks such as masked language modeling, followed by the fine-tuning stage, which…

Software Engineering · Computer Science 2024-11-25 Hande Dong, Jiayi Lin, Yanlin Wang, Yichong Leng, Jiawei Chen, Yutao Xie

The performance of neural code search is significantly influenced by the quality of the training data from which the neural models are derived. A large corpus of high-quality query and code pairs is demanded to establish a precise mapping…

Software Engineering · Computer Science 2022-02-15 Zhensu Sun, Li Li, Yan Liu, Xiaoning Du, Li Li

The vocabulary gap is a core challenge in information retrieval (IR). In e-commerce applications like product search, the vocabulary gap is reported to be a bigger challenge than in more traditional application areas in IR, such as news…

Information Retrieval · Computer Science 2020-07-21 Fatemeh Sarvi, Nikos Voskarides, Lois Mooiman, Sebastian Schelter, Maarten de Rijke

Code retrieval is to find the code snippet from a large corpus of source code repositories that highly matches the query of natural language description. Recent work mainly uses natural language processing techniques to process both query…

Artificial Intelligence · Computer Science 2021-06-23 Xiang Ling, Lingfei Wu, Saizhuo Wang, Gaoning Pan, Tengfei Ma, Fangli Xu, Alex X. Liu, Chunming Wu, Shouling Ji

Code writing is repetitive and predictable, inspiring us to develop various code intelligence techniques. This survey focuses on code search, that is, to retrieve code that matches a given query by effectively capturing the semantic…

Software Engineering · Computer Science 2023-12-14 Yutao Xie, Jiayi Lin, Hande Dong, Lei Zhang, Zhonghai Wu

Automated detection of semantically equivalent questions in longitudinal social science surveys is crucial for long-term studies informing empirical research in the social, economic, and health sciences. Retrieving equivalent questions…

Computation and Language · Computer Science 2025-07-08 Wing Yan Li, Zeqiang Wang, Jon Johnson, Suparna De

Deep research has emerged as an important task that aims to address hard queries through extensive open-web exploration. To tackle it, most prior work equips large language model (LLM)-based agents with opaque web search APIs, enabling…

Information Retrieval · Computer Science 2026-02-26 Chuan Meng, Litu Ou, Sean MacAvaney, Jeff Dalton

Despite the continuous efforts in improving both the effectiveness and efficiency of code search, two issues remained unsolved. First, programming languages have inherent strong structural linkages, and feature mining of code as text form…

Software Engineering · Computer Science 2022-08-09 Yi Hu, Bo Cai, Yaoxiang Yu

Making the content generated by Large Language Model (LLM), accurate, credible and traceable is crucial, especially in complex knowledge-intensive tasks that require multi-step reasoning and each step needs knowledge to solve.…

Computation and Language · Computer Science 2024-02-27 Shicheng Xu, Liang Pang, Huawei Shen, Xueqi Cheng, Tat-Seng Chua

Despite the substantial success of Information Retrieval (IR) in various NLP tasks, most IR systems predominantly handle queries and corpora in natural language, neglecting the domain of code retrieval. Code retrieval is critically…

Information Retrieval · Computer Science 2025-06-09 Xiangyang Li, Kuicai Dong, Yi Quan Lee, Wei Xia, Hao Zhang, Xinyi Dai, Yasheng Wang, Ruiming Tang

Code retrieval, which retrieves code snippets based on users' natural language descriptions, is widely used by developers and plays a pivotal role in real-world software development. The advent of deep learning has shifted the retrieval…

Software Engineering · Computer Science 2024-12-17 Wenchao Gu, Ensheng Shi, Yanlin Wang, Lun Du, Shi Han, Hongyu Zhang, Dongmei Zhang, Michael R. Lyu

As a primary means of information acquisition, information retrieval (IR) systems, such as search engines, have integrated themselves into our daily lives. These systems also serve as components of dialogue, question-answering, and…

Computation and Language · Computer Science 2025-09-18 Yutao Zhu, Huaying Yuan, Shuting Wang, Jiongnan Liu, Wenhan Liu, Chenlong Deng, Haonan Chen, Zheng Liu, Zhicheng Dou, Ji-Rong Wen

Information retrieval (IR) systems play a critical role in navigating information overload across various applications. Existing IR benchmarks primarily focus on simple queries that are semantically analogous to single- and multi-hop…

Information Retrieval · Computer Science 2025-11-25 Ganlin Xu, Zhitao Yin, Linghao Zhang, Jiaqing Liang, Weijia Lu, Xiaodong Zhang, Zhifei Yang, Sihang Jiang, Deqing Yang

Information Retrieval (IR) models need to deal with two difficult issues, vocabulary mismatch and term dependencies. Vocabulary mismatch corresponds to the difficulty of retrieving relevant documents that do not contain exact query terms…

Information Retrieval · Computer Science 2015-10-07 Benjamin Piwowarski, Sylvain Lamprier, Nicolas Despres