Related papers: CodeSCM: Causal Analysis for Multi-Modal Code Gene…

Benchmarking and Explaining Large Language Model-based Code Generation: A Causality-Centric Approach

While code generation has been widely used in various software development scenarios, the quality of the generated code is not guaranteed. This has been a particular concern in the era of large language models (LLMs)- based code generation,…

Software Engineering · Computer Science 2023-10-11 Zhenlan Ji , Pingchuan Ma , Zongjie Li , Shuai Wang

Language Models as Causal Effect Generators

In this work, we present sequence-driven structural causal models (SD-SCMs), a framework for specifying causal models with user-defined structure and language-model-defined mechanisms. We characterize how an SD-SCM enables sampling from…

Computation and Language · Computer Science 2025-09-24 Lucius E. J. Bynum , Kyunghyun Cho

The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code

Causal reasoning, the ability to identify cause-and-effect relationship, is crucial in human thinking. Although large language models (LLMs) succeed in many NLP tasks, it is still challenging for them to conduct complex causal reasoning…

Computation and Language · Computer Science 2023-05-31 Xiao Liu , Da Yin , Chen Zhang , Yansong Feng , Dongyan Zhao

Exploration of Masked and Causal Language Modelling for Text Generation

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti , Samuel Belkadi , Lifeng Han , Goran Nenadic

Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models

While Multi-modal Language Models (MLMs) demonstrate impressive multimodal ability, they still struggle on providing factual and precise responses for tasks like visual question answering (VQA). In this paper, we address this challenge from…

Artificial Intelligence · Computer Science 2023-12-13 Shitian Zhao , Zhuowan Li , Yadong Lu , Alan Yuille , Yan Wang

Causal methods for LLM development and evaluation

Large language model (LLM) development is currently driven by large-scale empirical iteration over data mixtures, reward models, routing strategies, and evaluation pipelines. Here, we argue that many central questions in LLM development and…

Machine Learning · Computer Science 2026-05-26 Dennis Frauen , Marie Brockschmidt , Konstantin Hess , Haorui Ma , Yuchen Ma , Abdurahman Maarouf , Maresa Schröder , Jonas Schweisthal , Yuxin Wang , Athiya Deviyani , Sonali Parbhoo , Rahul G. Krishnan , Stefan Feuerriegel

The Behavior of Large Language Models When Prompted to Generate Code Explanations

This paper systematically investigates the generation of code explanations by Large Language Models (LLMs) for code examples commonly encountered in introductory programming courses. Our findings reveal significant variations in the nature…

Software Engineering · Computer Science 2023-11-13 Priti Oli , Rabin Banjade , Jeevan Chapagain , Vasile Rus

Exploring Multi-Lingual Bias of Large Code Models in Code Generation

Code generation aims to synthesize code and fulfill functional requirements based on natural language (NL) specifications, which can greatly improve development efficiency. In the era of large language models (LLMs), large code models…

Software Engineering · Computer Science 2024-05-01 Chaozheng Wang , Zongjie Li , Cuiyun Gao , Wenxuan Wang , Ting Peng , Hailiang Huang , Yuetang Deng , Shuai Wang , Michael R. Lyu

An Empirical Study on the Effects of System Prompts in Instruction-Tuned Models for Code Generation

Instruction-tuned Language Models (ILMs) have become essential components of modern AI systems, demonstrating exceptional versatility across natural language and reasoning tasks. Among their most impactful applications is code generation,…

Software Engineering · Computer Science 2026-02-18 Zaiyu Cheng , Antonio Mastropaolo

Latent Causal Probing: A Formal Perspective on Probing with Causal Models of Data

As language models (LMs) deliver increasing performance on a range of NLP tasks, probing classifiers have become an indispensable technique in the effort to better understand their inner workings. A typical setup involves (1) defining an…

Computation and Language · Computer Science 2024-08-01 Charles Jin , Martin Rinard

Learning Latent Structural Causal Models

Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level…

Machine Learning · Computer Science 2022-10-26 Jithendaraa Subramanian , Yashas Annadani , Ivaxi Sheth , Nan Rosemary Ke , Tristan Deleu , Stefan Bauer , Derek Nowrouzezahrai , Samira Ebrahimi Kahou

Causal Discovery in Linear Structural Causal Models with Deterministic Relations

Linear structural causal models (SCMs) -- in which each observed variable is generated by a subset of the other observed variables as well as a subset of the exogenous sources -- are pervasive in causal inference and casual discovery.…

Machine Learning · Computer Science 2022-11-09 Yuqin Yang , Mohamed Nafea , AmirEmad Ghassami , Negar Kiyavash

Understanding Defects in Generated Codes by Language Models

This study investigates the reliability of code generation by Large Language Models (LLMs), focusing on identifying and analyzing defects in the generated code. Despite the advanced capabilities of LLMs in automating code generation,…

Software Engineering · Computer Science 2024-08-27 Ali Mohammadi Esfahani , Nafiseh Kahani , Samuel A. Ajila

Quantifying the Impact of Structured Output Format on Large Language Models through Causal Inference

Structured output from large language models (LLMs) has enhanced efficiency in processing generated information and is increasingly adopted in industrial applications. Prior studies have investigated the impact of structured output on LLMs'…

Computation and Language · Computer Science 2025-12-22 Han Yuan , Yue Zhao , Li Zhang , Wuqiong Luo , Zheng Ma

Rethinking Software Empirical Studies with Structural Causal Models

Causal Inference offers a fundamental approach for advancing empirical software engineering (ESE) beyond traditional statistical association, enabling researchers to rigorously identify and quantify causal relationships in software…

Software Engineering · Computer Science 2026-05-28 Daniel Rodriguez-Cardenas , Aya Garryyeva , David Nader Palacio , Antonio Mastropaolo , Denys Poshyvanyk

Benchmarking and Revisiting Code Generation Assessment: A Mutation-Based Approach

Code Large Language Models (CLLMs) have exhibited outstanding performance in program synthesis, attracting the focus of the research community. The evaluation of CLLM's program synthesis capability has generally relied on manually curated…

Software Engineering · Computer Science 2025-05-13 Longtian Wang , Tianlin Li , Xiaofei Xie , Yuhan Zhi , Jian Wang , Chao Shen

Automatically Generating CS Learning Materials with Large Language Models

Recent breakthroughs in Large Language Models (LLMs), such as GPT-3 and Codex, now enable software developers to generate code based on a natural language prompt. Within computer science education, researchers are exploring the potential…

Computers and Society · Computer Science 2022-12-13 Stephen MacNeil , Andrew Tran , Juho Leinonen , Paul Denny , Joanne Kim , Arto Hellas , Seth Bernstein , Sami Sarsa

Inducing Causal Structure for Abstractive Text Summarization

The mainstream of data-driven abstractive summarization models tends to explore the correlations rather than the causal relationships. Among such correlations, there can be spurious ones which suffer from the language prior learned from the…

Computation and Language · Computer Science 2023-08-25 Lu Chen , Ruqing Zhang , Wei Huang , Wei Chen , Jiafeng Guo , Xueqi Cheng

Causal Machine Learning: A Survey and Open Problems

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods that formalize the data-generation process as a structural causal model (SCM). This perspective enables us to reason about the effects of changes to this…

Machine Learning · Computer Science 2026-05-28 Jean Kaddour , Aengus Lynch , Qi Liu , Matt J. Kusner , Ricardo Silva

The Readability Spectrum: Patterns, Issues, and Prompt Effects in LLM-Generated Code

As Large Language Models (LLMs) are transforming software development, the functional quality of generated code has become a central focus, leaving readability, one of critical non-functional attributes, understudied. Given that…

Software Engineering · Computer Science 2026-05-14 Hengzhi Ye , Fengyuan Ran , Weiwei Xu , Minghui Zhou