Related papers: Language-Based Causal Representation Learning

Learning Local Causal World Models with State Space Models and Attention

World modelling, i.e. building a representation of the rules that govern the world so as to predict its evolution, is an essential ability for any agent interacting with the physical world. Despite their impressive performance, many…

Machine Learning · Computer Science 2025-05-06 Francesco Petri, Luigi Asprino, Aldo Gangemi

Causal Structure Learning

Graphical models can represent a multivariate distribution in a convenient and accessible form as a graph. Causal models can be viewed as a special class of graphical models that not only represent the distribution of the observed system…

Methodology · Statistics 2017-06-29 Christina Heinze-Deml, Marloes H. Maathuis, Nicolai Meinshausen

Causal Masking on Spatial Data: An Information-Theoretic Case for Learning Spatial Datasets with Unimodal Language Models

Language models are traditionally designed around causal masking. In domains with spatial or relational structure, causal masking is often viewed as inappropriate, and sequential linearizations are instead used. Yet the question of whether…

Artificial Intelligence · Computer Science 2025-11-03 Jared Junkin, Samuel Nathanson

Causal Representation Learning from Multiple Distributions: A General Setting

In many problems, the measured variables (e.g., image pixels) are just mathematical functions of the latent causal variables (e.g., the underlying concepts or objects). For the purpose of making predictions in changing environments or…

Machine Learning · Computer Science 2024-08-13 Kun Zhang, Shaoan Xie, Ignavier Ng, Yujia Zheng

Language Agents Meet Causality -- Bridging LLMs and Causal World Models

Large Language Models (LLMs) have recently shown great promise in planning and reasoning applications. These tasks demand robust systems, which arguably require a causal understanding of the environment. While LLMs can acquire and reflect…

Artificial Intelligence · Computer Science 2024-10-29 John Gkountouras, Matthias Lindemann, Phillip Lippe, Efstratios Gavves, Ivan Titov

Causal Representation Learning from General Environments under Nonparametric Mixing

Causal representation learning aims to recover the latent causal variables and their causal relations, typically represented by directed acyclic graphs (DAGs), from low-level observations such as image pixels. A prevailing line of research…

Machine Learning · Computer Science 2026-04-28 Ignavier Ng, Shaoan Xie, Xinshuai Dong, Peter Spirtes, Kun Zhang

Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and Acting

Large language models (LLMs) have shown great potential in decision-making due to the vast amount of knowledge stored within the models. However, these pre-trained models are prone to lack reasoning abilities and are difficult to adapt to…

Machine Learning · Computer Science 2025-06-02 Wei Chen, Jiahao Zhang, Haipeng Zhu, Boyan Xu, Zhifeng Hao, Keli Zhang, Junjian Ye, Ruichu Cai

Causal Structure Learning: a Bayesian approach based on random graphs

A Random Graph is a random object which takes its values in the space of graphs. We take advantage of the expressibility of graphs in order to model the uncertainty about the existence of causal relationships within a given set of variables.…

Artificial Intelligence · Computer Science 2026-04-30 Mauricio Gonzalez-Soto, Ivan R. Feliciano-Avelino, L. Enrique Sucar, Hugo J. Escalante Balderas

Systematic Evaluation of Causal Discovery in Visual Model Based Reinforcement Learning

Inducing causal relationships from observations is a classic problem in machine learning. Most work in causality starts from the premise that the causal variables themselves are observed. However, for AI agents such as robots trying to make…

Machine Learning · Statistics 2021-07-05 Nan Rosemary Ke, Aniket Didolkar, Sarthak Mittal, Anirudh Goyal, Guillaume Lajoie, Stefan Bauer, Danilo Rezende, Yoshua Bengio, Michael Mozer, Christopher Pal

Learning Latent Structural Causal Models

Causal learning has long concerned itself with the accurate recovery of underlying causal mechanisms. Such causal modelling enables better explanations of out-of-distribution data. Prior works on causal learning assume that the high-level…

Machine Learning · Computer Science 2022-10-26 Jithendaraa Subramanian, Yashas Annadani, Ivaxi Sheth, Nan Rosemary Ke, Tristan Deleu, Stefan Bauer, Derek Nowrouzezahrai, Samira Ebrahimi Kahou

Interpretable Imitation Learning with Dynamic Causal Relations

Imitation learning, which learns agent policy by mimicking expert demonstration, has shown promising results in many applications such as medical treatment regimes and self-driving vehicles. However, it remains a difficult task to interpret…

Machine Learning · Computer Science 2024-01-31 Tianxiang Zhao, Wenchao Yu, Suhang Wang, Lu Wang, Xiang Zhang, Yuncong Chen, Yanchi Liu, Wei Cheng, Haifeng Chen

Discovering and Reasoning of Causality in the Hidden World with Large Language Models

Revealing hidden causal variables alongside the underlying causal mechanisms is essential to the development of science. Despite the progress in the past decades, existing practice in causal discovery (CD) heavily relies on high-quality…

Machine Learning · Computer Science 2025-10-14 Chenxi Liu, Yongqiang Chen, Tongliang Liu, Mingming Gong, James Cheng, Bo Han, Kun Zhang

Structure Mapping for Transferability of Causal Models

Human beings learn causal models and constantly use them to transfer knowledge between similar environments. We use this intuition to design a transfer-learning framework using object-oriented representations to learn the causal…

Machine Learning · Computer Science 2020-07-21 Purva Pruthi, Javier González, Xiaoyu Lu, Madalina Fiterau

Speech World Model: Causal State-Action Planning with Explicit Reasoning for Speech

Current speech-language models (SLMs) typically use a cascade of speech encoder and large language model, treating speech understanding as a single black box. They analyze the content of speech well but reason weakly about other aspects,…

Audio and Speech Processing · Electrical Eng. & Systems 2025-12-08 Xuanru Zhou, Jiachen Lian, Henry Hong, Xinyi Yang, Gopala Anumanchipalli

Disentangled Representations for Causal Cognition

Complex adaptive agents consistently achieve their goals by solving problems that seem to require an understanding of causal information, information pertaining to the causal relationships that exist among elements of combined…

Artificial Intelligence · Computer Science 2024-07-02 Filippo Torresan, Manuel Baltieri

Large Language Models for Constrained-Based Causal Discovery

Causality is essential for understanding complex systems, such as the economy, the brain, and the climate. Constructing causal graphs often relies on either data-driven or expert-driven approaches, both fraught with challenges. The former…

Artificial Intelligence · Computer Science 2024-06-12 Kai-Hendrik Cohrs, Gherardo Varando, Emiliano Diaz, Vasileios Sitokonstantinou, Gustau Camps-Valls

CP-logic: A Language of Causal Probabilistic Events and Its Relation to Logic Programming

This paper develops a logical language for representing probabilistic causal laws. Our interest in such a language is twofold. First, it can be motivated as a fundamental study of the representation of causal knowledge. Causality has an…

Artificial Intelligence · Computer Science 2009-04-13 Joost Vennekens, Marc Denecker, Maurice Bruynooghe

Beyond identifiability: Learning causal representations with few environments and finite samples

We provide explicit, finite-sample guarantees for learning causal representations from data with a sublinear number of environments. Causal representation learning seeks to provide a rigorous foundation for the general representation…

Machine Learning · Statistics 2026-03-30 Inbeom Lee, Tongtong Jin, Bryon Aragam

Can Large Language Models Learn Independent Causal Mechanisms?

Despite impressive performance on language modelling and complex reasoning tasks, Large Language Models (LLMs) fall short on the same tasks in uncommon settings or with distribution shifts, exhibiting a lack of generalisation ability. By…

Computation and Language · Computer Science 2024-09-11 Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael Witbrock, Gillian Dobbie

Beyond Structural Causal Models: Causal Constraints Models

Structural Causal Models (SCMs) provide a popular causal modeling framework. In this work, we show that SCMs are not flexible enough to give a complete causal representation of dynamical systems at equilibrium. Instead, we propose a…

Artificial Intelligence · Computer Science 2019-08-07 Tineke Blom, Stephan Bongers, Joris M. Mooij