English
Related papers

Related papers: Causality for Large Language Models

200 papers

The causal capabilities of large language models (LLMs) are a matter of significant debate, with critical implications for the use of LLMs in societally impactful domains such as medicine, science, law, and policy. We conduct a "behavorial"…

Artificial Intelligence · Computer Science 2024-08-21 Emre Kıcıman , Robert Ness , Amit Sharma , Chenhao Tan

Causal inference has shown potential in enhancing the predictive accuracy, fairness, robustness, and explainability of Natural Language Processing (NLP) models by capturing causal relationships among variables. The emergence of generative…

Computation and Language · Computer Science 2025-03-24 Xiaoyu Liu , Paiheng Xu , Junda Wu , Jiaxin Yuan , Yifan Yang , Yuhang Zhou , Fuxiao Liu , Tianrui Guan , Haoliang Wang , Tong Yu , Julian McAuley , Wei Ai , Furong Huang

Causal inference is one of the hallmarks of human intelligence. While the field of CausalNLP has attracted much interest in the recent years, existing causal inference datasets in NLP primarily rely on discovering causality from empirical…

Computation and Language · Computer Science 2024-04-18 Zhijing Jin , Jiarui Liu , Zhiheng Lyu , Spencer Poff , Mrinmaya Sachan , Rada Mihalcea , Mona Diab , Bernhard Schölkopf

Causal inference has been a pivotal challenge across diverse domains such as medicine and economics, demanding a complicated integration of human knowledge, mathematical reasoning, and data mining capabilities. Recent advancements in…

Computation and Language · Computer Science 2025-02-11 Jing Ma

Large language models (LLMs) have shown various ability on natural language processing, including problems about causality. It is not intuitive for LLMs to command causality, since pretrained models usually work on statistical associations,…

Computation and Language · Computer Science 2024-08-27 Chenyang Zhang , Haibo Tong , Bin Zhang , Dongyu Zhang

Large language models (LLMs) have recently shown remarkable performance in language tasks and beyond. However, due to their limited inherent causal reasoning ability, LLMs still face challenges in handling tasks that require robust causal…

Computation and Language · Computer Science 2025-03-13 Xin Li , Zhuo Cai , Shoujin Wang , Kun Yu , Fang Chen

Recovering the structure of causal graphical models from observational data is an essential yet challenging task for causal discovery in scientific scenarios. Domain-specific causal discovery usually relies on expert validation or prior…

Artificial Intelligence · Computer Science 2025-08-27 Taiyu Ban , Lyuzhou Chen , Derui Lyu , Xiangyu Wang , Qinrui Zhu , Qiang Tu , Huanhuan Chen

This study investigates the efficacy of Large Language Models (LLMs) in causal discovery. Using newly available open-source LLMs, OLMo and BLOOM, which provide access to their pre-training corpora, we investigate how LLMs address causal…

Computation and Language · Computer Science 2025-10-13 Tao Feng , Lizhen Qu , Niket Tandon , Zhuang Li , Xiaoxi Kang , Gholamreza Haffari

Scaling laws have allowed Pre-trained Language Models (PLMs) into the field of causal reasoning. Causal reasoning of PLM relies solely on text-based descriptions, in contrast to causal discovery which aims to determine the causal…

Causal reasoning is a cornerstone of human intelligence and a critical capability for artificial systems aiming to achieve advanced understanding and decision-making. This thesis delves into various dimensions of causal reasoning and…

Computation and Language · Computer Science 2025-04-22 Zhijing Jin

Some argue scale is all what is needed to achieve AI, covering even causal models. We make it clear that large language models (LLMs) cannot be causal and give reason onto why sometimes we might feel otherwise. To this end, we define and…

Artificial Intelligence · Computer Science 2023-08-28 Matej Zečević , Moritz Willig , Devendra Singh Dhami , Kristian Kersting

Large Language Models (LLMs) have demonstrated remarkable capabilities in various reasoning and generation tasks. However, their proficiency in complex causal reasoning, discovery, and estimation remains an area of active development, often…

Artificial Intelligence · Computer Science 2025-09-03 Adib Bazgir , Amir Habibdoust , Yuwen Zhang , Xing Song

Large language model (LLM) development is currently driven by large-scale empirical iteration over data mixtures, reward models, routing strategies, and evaluation pipelines. Here, we argue that many central questions in LLM development and…

Causal learning is the cognitive process of developing the capability of making causal inferences based on available information, often guided by normative principles. This process is prone to errors and biases, such as the illusion of…

Large language models (LLMs) are increasingly used in domains where causal reasoning matters, yet it remains unclear whether their judgments reflect normative causal computation, human-like shortcuts, or brittle pattern matching. We…

Artificial Intelligence · Computer Science 2026-03-16 Hanna M. Dettki , Charley M. Wu , Bob Rehder

We assess the ability of large language models (LLMs) to answer causal questions by analyzing their strengths and weaknesses against three types of causal question. We believe that current LLMs can answer causal questions with existing…

Large Language Models (LLMs) have recently shown great promise in planning and reasoning applications. These tasks demand robust systems, which arguably require a causal understanding of the environment. While LLMs can acquire and reflect…

Artificial Intelligence · Computer Science 2024-10-29 John Gkountouras , Matthias Lindemann , Phillip Lippe , Efstratios Gavves , Ivan Titov

The ability to understand causality significantly impacts the competence of large language models (LLMs) in output explanation and counterfactual reasoning, as causality reveals the underlying data distribution. However, the lack of a…

Machine Learning · Computer Science 2024-09-30 Yu Zhou , Xingyu Wu , Beicheng Huang , Jibin Wu , Liang Feng , Kay Chen Tan

The ability to robustly identify causal relationships is essential for autonomous decision-making and adaptation to novel scenarios. However, accurately inferring causal structure requires integrating both world knowledge and abstract…

Machine Learning · Computer Science 2025-06-17 Khurram Yamin , Shantanu Gupta , Gaurav R. Ghosal , Zachary C. Lipton , Bryan Wilder

While code generation has been widely used in various software development scenarios, the quality of the generated code is not guaranteed. This has been a particular concern in the era of large language models (LLMs)- based code generation,…

Software Engineering · Computer Science 2023-10-11 Zhenlan Ji , Pingchuan Ma , Zongjie Li , Shuai Wang
‹ Prev 1 2 3 10 Next ›