Related papers: Explaining Competitive-Level Programming Solutions…

Competition-Level Problems are Effective LLM Evaluators

Large language models (LLMs) have demonstrated impressive reasoning capabilities, yet there is ongoing debate about these abilities and the potential data contamination problem recently. This paper aims to evaluate the reasoning capacities…

Computation and Language · Computer Science 2024-06-05 Yiming Huang , Zhenghao Lin , Xiao Liu , Yeyun Gong , Shuai Lu , Fangyu Lei , Yaobo Liang , Yelong Shen , Chen Lin , Nan Duan , Weizhu Chen

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

In the rapidly evolving field of Explainable Natural Language Processing (NLP), textual explanations, i.e., human-like rationales, are pivotal for explaining model predictions and enriching datasets with interpretable labels. Traditional…

Computation and Language · Computer Science 2025-11-12 Mahdi Dhaini , Juraj Vladika , Ege Erdogan , Zineb Attaoui , Gjergji Kasneci

Leveraging Large Language Models to Generate Answer Set Programs

Large language models (LLMs), such as GPT-3 and GPT-4, have demonstrated exceptional performance in various natural language processing tasks and have shown the ability to solve certain reasoning problems. However, their reasoning…

Artificial Intelligence · Computer Science 2023-07-18 Adam Ishay , Zhun Yang , Joohyung Lee

MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning

The recently released GPT-4 Code Interpreter has demonstrated remarkable proficiency in solving challenging math problems, primarily attributed to its ability to seamlessly reason with natural language, generate code, execute code, and…

Computation and Language · Computer Science 2023-10-06 Ke Wang , Houxing Ren , Aojun Zhou , Zimu Lu , Sichun Luo , Weikang Shi , Renrui Zhang , Linqi Song , Mingjie Zhan , Hongsheng Li

AutoCode: LLMs as Problem Setters for Competitive Programming

Writing competitive programming problems is exacting. Authors must: set constraints, input distributions, and edge cases that rule out shortcuts; target specific algorithms (e.g., max-flow, dynamic programming, data structures); and…

Software Engineering · Computer Science 2025-10-16 Shang Zhou , Zihan Zheng , Kaiyuan Liu , Zeyu Shen , Zerui Cheng , Zexing Chen , Hansen He , Jianzhu Yao , Huanzhi Mao , Qiuyang Mang , Tianfu Fu , Beichen Li , Dongruixuan Li , Wenhao Chai , Zhuang Liu , Aleksandra Korolova , Peter Henderson , Natasha Jaques , Pramod Viswanath , Saining Xie , Jingbo Shang

Towards Human-Level Text Coding with LLMs: The Case of Fatherhood Roles in Public Policy Documents

Recent advances in large language models (LLMs) like GPT-3.5 and GPT-4 promise automation with better results and less programming, opening up new opportunities for text analysis in political science. In this study, we evaluate LLMs on…

Computation and Language · Computer Science 2024-08-29 Lorenzo Lupo , Oscar Magnusson , Dirk Hovy , Elin Naurin , Lena Wängnerud

LLM-ProS: Analyzing Large Language Models' Performance in Competitive Problem Solving

The rapid advancement of large language models has opened new avenues for automating complex problem-solving tasks such as algorithmic coding and competitive programming. This paper introduces a novel evaluation technique, LLM-ProS, to…

Computation and Language · Computer Science 2026-03-03 Md Sifat Hossain , Anika Tabassum , Md. Fahim Arefin , Tarannum Shaila Zaman

Can LLMs Augment Low-Resource Reading Comprehension Datasets? Opportunities and Challenges

Large Language Models (LLMs) have demonstrated impressive zero shot performance on a wide range of NLP tasks, demonstrating the ability to reason and apply commonsense. A relevant application is to use them for creating high quality…

Computation and Language · Computer Science 2024-07-11 Vinay Samuel , Houda Aynaou , Arijit Ghosh Chowdhury , Karthik Venkat Ramanan , Aman Chadha

Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions

Recent advancements in Large Language Models (LLMs) have showcased striking results on existing logical reasoning benchmarks, with some models even surpassing human performance. However, the true depth of their competencies and robustness…

Computation and Language · Computer Science 2024-11-05 Pengfei Hong , Navonil Majumder , Deepanway Ghosal , Somak Aditya , Rada Mihalcea , Soujanya Poria

Grounding Data Science Code Generation with Input-Output Specifications

Large language models (LLMs) have recently demonstrated a remarkable ability to generate code from natural language (NL) prompts. However, in the real world, NL is often too ambiguous to capture the true intent behind programming problems,…

Machine Learning · Computer Science 2024-03-18 Yeming Wen , Pengcheng Yin , Kensen Shi , Henryk Michalewski , Swarat Chaudhuri , Alex Polozov

Solving General Natural-Language-Description Optimization Problems with Large Language Models

Optimization problems seek to find the best solution to an objective under a set of constraints, and have been widely investigated in real-world applications. Modeling and solving optimization problems in a specific domain typically require…

Optimization and Control · Mathematics 2024-07-12 Jihai Zhang , Wei Wang , Siyan Guo , Li Wang , Fangquan Lin , Cheng Yang , Wotao Yin

Explaining Code Examples in Introductory Programming Courses: LLM vs Humans

Worked examples, which present an explained code for solving typical programming problems are among the most popular types of learning content in programming classes. Most approaches and tools for presenting these examples to students are…

Computers and Society · Computer Science 2024-03-13 Arun-Balajiee Lekshmi-Narayanan , Priti Oli , Jeevan Chapagain , Mohammad Hassany , Rabin Banjade , Peter Brusilovsky , Vasile Rus

Natural Language Outlines for Code: Literate Programming in the LLM Era

We propose using natural language outlines as a novel modality and interaction surface for providing AI assistance to developers throughout the software development process. An NL outline for a code function comprises multiple statements…

Software Engineering · Computer Science 2025-04-21 Kensen Shi , Deniz Altınbüken , Saswat Anand , Mihai Christodorescu , Katja Grünwedel , Alexa Koenings , Sai Naidu , Anurag Pathak , Marc Rasi , Fredde Ribeiro , Brandon Ruffin , Siddhant Sanyam , Maxim Tabachnyk , Sara Toth , Roy Tu , Tobias Welp , Pengcheng Yin , Manzil Zaheer , Satish Chandra , Charles Sutton

Modeling Legal Reasoning: LM Annotation at the Edge of Human Agreement

Generative language models (LMs) are increasingly used for document class-prediction tasks and promise enormous improvements in cost and efficiency. Existing research often examines simple classification tasks, but the capability of LMs to…

Computation and Language · Computer Science 2023-10-31 Rosamond Thalken , Edward H. Stiglitz , David Mimno , Matthew Wilkens

CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities

Recent large language models (LLMs) have shown indications of mathematical reasoning ability on challenging competition-level problems, especially with self-generated verbalizations of intermediate reasoning steps (i.e., chain-of-thought…

Computation and Language · Computer Science 2024-06-11 Yujun Mao , Yoon Kim , Yilun Zhou

Automatically Generating CS Learning Materials with Large Language Models

Recent breakthroughs in Large Language Models (LLMs), such as GPT-3 and Codex, now enable software developers to generate code based on a natural language prompt. Within computer science education, researchers are exploring the potential…

Computers and Society · Computer Science 2022-12-13 Stephen MacNeil , Andrew Tran , Juho Leinonen , Paul Denny , Joanne Kim , Arto Hellas , Seth Bernstein , Sami Sarsa

Performance Review on LLM for solving leetcode problems

This paper presents a comprehensive performance evaluation of Large Language Models (LLMs) in solving programming challenges from Leetcode, a widely used platform for algorithm practice and technical interviews. We began by crawling the…

Software Engineering · Computer Science 2025-03-04 Lun Wang , Chuanqi Shi , Shaoshui Du , Yiyi Tao , Yixian Shen , Hang Zheng , Yanxin Shen , Xinyu Qiu

Benchmarking GPT-4 on Algorithmic Problems: A Systematic Evaluation of Prompting Strategies

Large Language Models (LLMs) have revolutionized the field of Natural Language Processing thanks to their ability to reuse knowledge acquired on massive text corpora on a wide variety of downstream tasks, with minimal (if any) tuning steps.…

Computation and Language · Computer Science 2024-07-12 Flavio Petruzzellis , Alberto Testolin , Alessandro Sperduti

Distilling Algorithmic Reasoning from LLMs via Explaining Solution Programs

Distilling explicit chain-of-thought reasoning paths has emerged as an effective method for improving the reasoning abilities of large language models (LLMs) across various tasks. However, when tackling complex tasks that pose significant…

Computation and Language · Computer Science 2024-04-15 Jierui Li , Raymond Mooney

Can LLMs Master Math? Investigating Large Language Models on Math Stack Exchange

Large Language Models (LLMs) have demonstrated exceptional capabilities in various natural language tasks, often achieving performances that surpass those of humans. Despite these advancements, the domain of mathematics presents a…

Computation and Language · Computer Science 2024-04-02 Ankit Satpute , Noah Giessing , Andre Greiner-Petter , Moritz Schubotz , Olaf Teschke , Akiko Aizawa , Bela Gipp