Related papers: CodeSCM: Causal Analysis for Multi-Modal Code Gene…

200 papers

While code generation has been widely used in various software development scenarios, the quality of the generated code is not guaranteed. This has been a particular concern in the era of large language model (LLM)-based code generation,…

Software Engineering · Computer Science 2023-10-11 Zhenlan Ji, Pingchuan Ma, Zongjie Li, Shuai Wang

In this work, we present sequence-driven structural causal models (SD-SCMs), a framework for specifying causal models with user-defined structure and language-model-defined mechanisms. We characterize how an SD-SCM enables sampling from…

Computation and Language · Computer Science 2025-09-24 Lucius E. J. Bynum, Kyunghyun Cho

Causal reasoning, the ability to identify cause-and-effect relationships, is crucial in human thinking. Although large language models (LLMs) succeed in many NLP tasks, it is still challenging for them to conduct complex causal reasoning…

Computation and Language · Computer Science 2023-05-31 Xiao Liu, Da Yin, Chen Zhang, Yansong Feng, Dongyan Zhao

Large Language Models (LLMs) have revolutionised the field of Natural Language Processing (NLP) and have achieved state-of-the-art performance in practically every task in this field. However, the prevalent approach used in text generation,…

Computation and Language · Computer Science 2024-08-12 Nicolo Micheletti, Samuel Belkadi, Lifeng Han, Goran Nenadic

While Multi-modal Language Models (MLMs) demonstrate impressive multimodal ability, they still struggle to provide factual and precise responses for tasks like visual question answering (VQA). In this paper, we address this challenge from…

Artificial Intelligence · Computer Science 2023-12-13 Shitian Zhao, Zhuowan Li, Yadong Lu, Alan Yuille, Yan Wang

Large language model (LLM) development is currently driven by large-scale empirical iteration over data mixtures, reward models, routing strategies, and evaluation pipelines. Here, we argue that many central questions in LLM development and…

This paper systematically investigates the generation of code explanations by Large Language Models (LLMs) for code examples commonly encountered in introductory programming courses. Our findings reveal significant variations in the nature…

Software Engineering · Computer Science 2023-11-13 Priti Oli, Rabin Banjade, Jeevan Chapagain, Vasile Rus

Code generation aims to synthesize code and fulfill functional requirements based on natural language (NL) specifications, which can greatly improve development efficiency. In the era of large language models (LLMs), large code models…

Software Engineering · Computer Science 2024-05-01 Chaozheng Wang, Zongjie Li, Cuiyun Gao, Wenxuan Wang, Ting Peng, Hailiang Huang, Yuetang Deng, Shuai Wang, Michael R. Lyu

Instruction-tuned Language Models (ILMs) have become essential components of modern AI systems, demonstrating exceptional versatility across natural language and reasoning tasks. Among their most impactful applications is code generation,…

Software Engineering · Computer Science 2026-02-18 Zaiyu Cheng, Antonio Mastropaolo

As language models (LMs) deliver increasing performance on a range of NLP tasks, probing classifiers have become an indispensable technique in the effort to better understand their inner workings. A typical setup involves (1) defining an…

Computation and Language · Computer Science 2024-08-01 Charles Jin, Martin Rinard

Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level…

Linear structural causal models (SCMs), in which each observed variable is generated by a subset of the other observed variables as well as a subset of the exogenous sources, are pervasive in causal inference and causal discovery.…

Machine Learning · Computer Science 2022-11-09 Yuqin Yang, Mohamed Nafea, AmirEmad Ghassami, Negar Kiyavash

This study investigates the reliability of code generation by Large Language Models (LLMs), focusing on identifying and analyzing defects in the generated code. Despite the advanced capabilities of LLMs in automating code generation,…

Software Engineering · Computer Science 2024-08-27 Ali Mohammadi Esfahani, Nafiseh Kahani, Samuel A. Ajila

Structured output from large language models (LLMs) has enhanced efficiency in processing generated information and is increasingly adopted in industrial applications. Prior studies have investigated the impact of structured output on LLMs'…

Computation and Language · Computer Science 2025-12-22 Han Yuan, Yue Zhao, Li Zhang, Wuqiong Luo, Zheng Ma

Causal Inference offers a fundamental approach for advancing empirical software engineering (ESE) beyond traditional statistical association, enabling researchers to rigorously identify and quantify causal relationships in software…

Software Engineering · Computer Science 2026-05-28 Daniel Rodriguez-Cardenas, Aya Garryyeva, David Nader Palacio, Antonio Mastropaolo, Denys Poshyvanyk

Code Large Language Models (CLLMs) have exhibited outstanding performance in program synthesis, attracting the focus of the research community. The evaluation of CLLMs' program synthesis capability has generally relied on manually curated…

Software Engineering · Computer Science 2025-05-13 Longtian Wang, Tianlin Li, Xiaofei Xie, Yuhan Zhi, Jian Wang, Chao Shen

Recent breakthroughs in Large Language Models (LLMs), such as GPT-3 and Codex, now enable software developers to generate code based on a natural language prompt. Within computer science education, researchers are exploring the potential…

Computers and Society · Computer Science 2022-12-13 Stephen MacNeil, Andrew Tran, Juho Leinonen, Paul Denny, Joanne Kim, Arto Hellas, Seth Bernstein, Sami Sarsa

The mainstream of data-driven abstractive summarization models tends to explore the correlations rather than the causal relationships. Among such correlations, there can be spurious ones which suffer from the language prior learned from the…

Computation and Language · Computer Science 2023-08-25 Lu Chen, Ruqing Zhang, Wei Huang, Wei Chen, Jiafeng Guo, Xueqi Cheng

Causal Machine Learning (CausalML) is an umbrella term for machine learning methods that formalize the data-generation process as a structural causal model (SCM). This perspective enables us to reason about the effects of changes to this…

Machine Learning · Computer Science 2026-05-28 Jean Kaddour, Aengus Lynch, Qi Liu, Matt J. Kusner, Ricardo Silva

As Large Language Models (LLMs) are transforming software development, the functional quality of generated code has become a central focus, leaving readability, one of the critical non-functional attributes, understudied. Given that…

Software Engineering · Computer Science 2026-05-14 Hengzhi Ye, Fengyuan Ran, Weiwei Xu, Minghui Zhou