Software Engineering · Computer Science
DebugBench: Evaluating Debugging Capability of Large Language Models
Runchu Tian, Yining Ye, Yujia Qin, Xin Cong +7
2024-06-07
Software Engineering · Computer Science
ChatDBG: Augmenting Debugging with Large Language Models
Kyla H. Levin, Nicolas van Kempen, Emery D. Berger, Stephen N. Freund
2025-06-23
Programming Languages · Computer Science
Benchmarking Large Language Models for Automated Verilog RTL Code Generation
Shailja Thakur, Baleegh Ahmad, Zhenxing Fan, Hammond Pearce +4
2022-12-22
Software Engineering · Computer Science
An Empirical Study on the Capability of LLMs in Decomposing Bug Reports
Zhiyuan Chen, Vanessa Nava-Camal, Ahmad Suleiman, Yiming Tang +2
2025-04-30
Computation and Language · Computer Science
MdEval: Massively Multilingual Code Debugging
Shukai Liu, Linzheng Chai, Jian Yang, Jiajun Shi +14
2025-02-25
Human-Computer Interaction · Computer Science
SPROUT: an Interactive Authoring Tool for Generating Programming Tutorials with the Visualization of Large Language Models
Yihan Liu, Zhen Wen, Luoxuan Weng, Ollie Woodman +2
2024-10-29
Computation and Language · Computer Science
SimulBench: Evaluating Language Models with Creative Simulation Tasks
Qi Jia, Xiang Yue, Tianyu Zheng, Jie Huang +1
2024-09-13
Machine Learning · Computer Science
DevBench: A Realistic, Developer-Informed Benchmark for Code Generation Models
Adarsh Kumarappan, Pareesa Ameneh Golnari, Wen Wen, Xiaoyu Liu +4
2026-05-19
Software Engineering · Computer Science
ToolScan: A Benchmark for Characterizing Errors in Tool-Use LLMs
Shirley Kokane, Ming Zhu, Tulika Awalgaonkar, Jianguo Zhang +14
2025-06-27
Software Engineering · Computer Science
In-IDE Toolkit for Developers of AI-Based Features
Yaroslav Sokolov, Yury Khudyakov, Lenar Sharipov, Andrei Gasparian +2
2026-05-15
Software Engineering · Computer Science
Chain of Targeted Verification Questions to Improve the Reliability of Code Generated by LLMs
Sylvain Kouemo Ngassom, Arghavan Moradi Dakhel, Florian Tambon, Foutse Khomh
2024-05-24
Machine Learning · Computer Science
Towards a Neural Debugger for Python
Maximilian Beck, Jonas Gehring, Jannik Kossen, Gabriel Synnaeve
2026-03-11
Software Engineering · Computer Science
Codellm-Devkit: A Framework for Contextualizing Code LLMs with Program Analysis Insights
Rahul Krishna, Rangeet Pan, Raju Pavuluri, Srikanth Tamilselvam +2
2024-10-18
Computation and Language · Computer Science
VeriLLMed: Interactive Visual Debugging of Medical Large Language Models with Knowledge Graphs
Yurui Xiang, Xingyi Mao, Rui Sheng, Zixin Chen +6
2026-04-28
Subcellular Processes · Quantitative Biology
libRoadRunner: A High Performance SBML Simulation and Analysis Library
Endre T. Somogyi, Jean-Marie Bouteiller, James A. Glazier, Matthias König +3
2015-03-04
Human-Computer Interaction · Computer Science
ViseGPT: Towards Better Alignment of LLM-generated Data Wrangling Scripts and User Prompts
Jiajun Zhu, Xinyu Cheng, Zhongsu Luo, Yunfan Zhou +3
2025-08-05
Software Engineering · Computer Science
ScriptSmith: A Unified LLM Framework for Enhancing IT Operations via Automated Bash Script Generation, Assessment, and Refinement
Oishik Chatterjee, Pooja Aggarwal, Suranjana Samanta, Ting Dai +7
2024-09-27
Software Engineering · Computer Science
LLPut: Investigating Large Language Models for Bug Report-Based Input Generation
Alif Al Hasan, Subarna Saha, Mia Mohammad Imran, Tarannum Shaila Zaman
2025-12-16