Related papers: Copilot Evaluation Harness: Evaluating LLM-Guided …

SIMCOPILOT: Evaluating Large Language Models for Copilot-Style Code Generation

We introduce SIMCOPILOT, a benchmark that simulates the role of large language models (LLMs) as interactive, "copilot"-style coding assistants. Targeting both completion (finishing incomplete methods or code blocks) and infill tasks…

Machine Learning · Computer Science 2025-05-29 Mingchao Jiang , Abhinav Jain , Sophia Zorek , Chris Jermaine

Optimizing Large Language Models for OpenAPI Code Completion

Recent advancements in Large Language Models (LLMs) and their utilization in code generation tasks have significantly reshaped the field of software development. Despite the remarkable efficacy of code completion solutions in mainstream…

Software Engineering · Computer Science 2024-06-12 Bohdan Petryshyn , Mantas Lukoševičius

A Survey on Evaluating Large Language Models in Code Generation Tasks

This paper provides a comprehensive review of the current methods and metrics used to evaluate the performance of Large Language Models (LLMs) in code generation tasks. With the rapid growth in demand for automated software development,…

Software Engineering · Computer Science 2025-03-05 Liguo Chen , Qi Guo , Hongrui Jia , Zhengran Zeng , Xin Wang , Yijiang Xu , Jian Wu , Yidong Wang , Qing Gao , Jindong Wang , Wei Ye , Shikun Zhang

LLM4DS: Evaluating Large Language Models for Data Science Code Generation

The adoption of Large Language Models (LLMs) for code generation in data science offers substantial potential for enhancing tasks such as data manipulation, statistical analysis, and visualization. However, the effectiveness of these models…

Software Engineering · Computer Science 2024-11-20 Nathalia Nascimento , Everton Guimaraes , Sai Sanjna Chintakunta , Santhosh Anitha Boominathan

Large Language Models for Code Generation: A Comprehensive Survey of Challenges, Techniques, Evaluation, and Applications

Large Language Models (LLMs) have demonstrated their remarkable capabilities in numerous fields. This survey focuses on how LLMs empower users, regardless of their technical background, to use human languages to automatically generate…

Software Engineering · Computer Science 2025-04-03 Nam Huynh , Beiyu Lin

Tool-Augmented LLMs as a Universal Interface for IDEs

Modern-day Integrated Development Environments (IDEs) have come a long way from the early text editing utilities to the complex programs encompassing thousands of functions to help developers. However, with the increasing number of…

Software Engineering · Computer Science 2024-02-20 Yaroslav Zharov , Yury Khudyakov , Evgeniia Fedotova , Evgeny Grigorenko , Egor Bogomolov

Examination of Code generated by Large Language Models

Large language models (LLMs), such as ChatGPT and Copilot, are transforming software development by automating code generation and, arguably, enable rapid prototyping, support education, and boost productivity. Therefore, correctness and…

Software Engineering · Computer Science 2024-08-30 Robin Beer , Alexander Feix , Tim Guttzeit , Tamara Muras , Vincent Müller , Maurice Rauscher , Florian Schäffler , Welf Löwe

Review of Tools for Zero-Code LLM Based Application Development

Large Language Models (LLMs) are transforming software creation by enabling zero code development platforms. Our survey reviews recent platforms that let users build applications without writing code, by leveraging LLMs as the brains of the…

Software Engineering · Computer Science 2025-10-23 Priyaranjan Pattnayak , Hussain Bohra

A Study on Developer Behaviors for Validating and Repairing LLM-Generated Code Using Eye Tracking and IDE Actions

The increasing use of large language model (LLM)-powered code generation tools, such as GitHub Copilot, is transforming software engineering practices. This paper investigates how developers validate and repair code generated by Copilot and…

Software Engineering · Computer Science 2024-05-28 Ningzhi Tang , Meng Chen , Zheng Ning , Aakash Bansal , Yu Huang , Collin McMillan , Toby Jia-Jun Li

Using an LLM to Help With Code Understanding

Understanding code is challenging, especially when working in new and complex development environments. Code comments and documentation can help, but are typically scarce or hard to navigate. Large language models (LLMs) are revolutionizing…

Software Engineering · Computer Science 2024-01-18 Daye Nam , Andrew Macvean , Vincent Hellendoorn , Bogdan Vasilescu , Brad Myers

Toward PDDL Planning Copilot

Large Language Models (LLMs) are increasingly being used as autonomous agents capable of performing complicated tasks. However, they lack the ability to perform reliable long-horizon planning on their own. This paper bridges this gap by…

Artificial Intelligence · Computer Science 2025-09-17 Yarin Benyamin , Argaman Mordoch , Shahaf S. Shperberg , Roni Stern

Large Language Models for Code Generation: The Practitioners Perspective

Large Language Models (LLMs) have emerged as coding assistants, capable of generating source code from natural language prompts. With the increasing adoption of LLMs in software development, academic research and industry based projects are…

Software Engineering · Computer Science 2025-01-29 Zeeshan Rasheed , Muhammad Waseem , Kai Kristian Kemell , Aakash Ahmad , Malik Abdul Sami , Jussi Rasku , Kari Systä , Pekka Abrahamsson

New Solutions on LLM Acceleration, Optimization, and Application

Large Language Models (LLMs) have become extremely potent instruments with exceptional capacities for comprehending and producing human-like text in a wide range of applications. However, the increasing size and complexity of LLMs present…

Machine Learning · Computer Science 2024-06-18 Yingbing Huang , Lily Jiaxin Wan , Hanchen Ye , Manvi Jha , Jinghua Wang , Yuhong Li , Xiaofan Zhang , Deming Chen

PromptPilot: Improving Human-AI Collaboration Through LLM-Enhanced Prompt Engineering

Effective prompt engineering is critical to realizing the promised productivity gains of large language models (LLMs) in knowledge-intensive tasks. Yet, many users struggle to craft prompts that yield high-quality outputs, limiting the…

Human-Computer Interaction · Computer Science 2025-10-02 Niklas Gutheil , Valentin Mayer , Leopold Müller , Jörg Rommelt , Niklas Kühl

Do Large Language Models Understand Performance Optimization?

Large Language Models (LLMs) have emerged as powerful tools for software development tasks such as code completion, translation, and optimization. However, their ability to generate efficient and correct code, particularly in complex…

Distributed, Parallel, and Cluster Computing · Computer Science 2025-03-19 Bowen Cui , Tejas Ramesh , Oscar Hernandez , Keren Zhou

Exploring and Characterizing Large Language Models For Embedded System Development and Debugging

Large language models (LLMs) have shown remarkable abilities to generate code, however their ability to develop software for embedded systems, which requires cross-domain knowledge of hardware and software has not been studied. In this…

Software Engineering · Computer Science 2023-11-23 Zachary Englhardt , Richard Li , Dilini Nissanka , Zhihan Zhang , Girish Narayanswamy , Joseph Breda , Xin Liu , Shwetak Patel , Vikram Iyer

Scratch Copilot Evaluation: Assessing AI-Assisted Creative Coding for Families

How can AI enhance creative coding experiences for families? This study explores the potential of large language models (LLMs) in helping families with creative coding using Scratch. Based on our previous user study involving a prototype AI…

Human-Computer Interaction · Computer Science 2023-05-18 Stefania Druga , Nancy Otero

A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models

The rapid development of large language models (LLMs) has significantly transformed the field of artificial intelligence, demonstrating remarkable capabilities in natural language processing and moving towards multi-modal functionality.…

Hardware Architecture · Computer Science 2024-10-11 Cong Guo , Feng Cheng , Zhixu Du , James Kiessling , Jonathan Ku , Shiyu Li , Ziru Li , Mingyuan Ma , Tergel Molom-Ochir , Benjamin Morris , Haoxuan Shan , Jingwei Sun , Yitu Wang , Chiyue Wei , Xueying Wu , Yuhao Wu , Hao Frank Yang , Jingyang Zhang , Junyao Zhang , Qilin Zheng , Guanglei Zhou , Hai , Li , Yiran Chen

AI-Based Measurement of Innovation: Mapping Expert Insight into Large Language Model Applications

Measuring innovation often relies on context-specific proxies and on expert evaluation. Hence, empirical innovation research is often limited to settings where such data is available. We investigate how large language models (LLMs) can be…

Computation and Language · Computer Science 2025-08-05 Robin Nowak , Patrick Figge , Carolin Haeussler

Agentic Auto-Scheduling: An Experimental Study of LLM-Guided Loop Optimization

Automatic code optimization remains a difficult challenge, particularly for complex loop nests on modern hardware. This paper investigates a novel approach to code optimization where Large Language Models (LLMs) guide the process through a…

Programming Languages · Computer Science 2025-12-30 Massinissa Merouani , Islem Kara Bernou , Riyadh Baghdadi