Related papers: Abstracting Causal Models

Approximate Causal Abstraction

Scientific models describe natural phenomena at different levels of abstraction. Abstract descriptions can provide the basis for interventions on the system and explanation of observed phenomena at a level of granularity that is coarser…

Artificial Intelligence · Computer Science 2019-07-02 Sander Beckers , Frederick Eberhardt , Joseph Y. Halpern

Causal Abstraction with Soft Interventions

Causal abstraction provides a theory describing how several causal models can represent the same system at different levels of detail. Existing theoretical proposals limit the analysis of abstract models to "hard" interventions fixing…

Artificial Intelligence · Computer Science 2022-11-23 Riccardo Massidda , Atticus Geiger , Thomas Icard , Davide Bacciu

Causal Abstraction Inference under Lossy Representations

The study of causal abstractions bridges two integral components of human intelligence: the ability to determine cause and effect, and the ability to interpret complex patterns into abstract concepts. Formally, causal abstraction frameworks…

Machine Learning · Computer Science 2025-09-29 Kevin Xia , Elias Bareinboim

Causal and Compositional Abstraction

Abstracting from a low level to a more explanatory high level of description, and ideally while preserving causal structure, is fundamental to scientific practice, to causal inference problems, and to robust, efficient and interpretable AI.…

Logic in Computer Science · Computer Science 2026-02-19 Robin Lorenz , Sean Tull

Causal Abstraction in Model Interpretability: A Compact Survey

The pursuit of interpretable artificial intelligence has led to significant advancements in the development of methods that aim to explain the decision-making processes of complex models, such as deep learning systems. Among these methods,…

Machine Learning · Computer Science 2024-10-29 Yihao Zhang

Compositional Abstraction Error and a Category of Causal Models

Interventional causal models describe several joint distributions over some variables used to describe a system, one for each intervention setting. They provide a formal recipe for how to move between the different joint distributions and…

Machine Learning · Statistics 2021-08-06 Eigil F. Rischel , Sebastian Weichwald

Abstracting Probabilistic Models: A Logical Perspective

Abstraction is a powerful idea widely used in science, to model, reason and explain the behavior of systems in a more tractable search space, by omitting irrelevant details. While notions of abstraction have matured for deterministic…

Artificial Intelligence · Computer Science 2020-01-14 Vaishak Belle

Causal Abstractions, Categorically Unified

We present a categorical framework for relating causal models that represent the same system at different levels of abstraction. We define a causal abstraction as natural transformations between appropriate Markov functors, which concisely…

Machine Learning · Statistics 2025-10-07 Markus Englberger , Devendra Singh Dhami

Combining Causal Models for More Accurate Abstractions of Neural Networks

Mechanistic interpretability aims to reverse engineer neural networks by uncovering which high-level algorithms they implement. Causal abstraction provides a precise notion of when a network implements an algorithm, i.e., a causal model of…

Machine Learning · Computer Science 2025-03-17 Theodora-Mara Pîslar , Sara Magliacane , Atticus Geiger

Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

Causal abstraction provides a theoretical foundation for mechanistic interpretability, the field concerned with providing intelligible algorithms that are faithful simplifications of the known, but opaque low-level details of black box AI…

Artificial Intelligence · Computer Science 2025-05-12 Atticus Geiger , Duligur Ibeling , Amir Zur , Maheep Chaudhary , Sonakshi Chauhan , Jing Huang , Aryaman Arora , Zhengxuan Wu , Noah Goodman , Christopher Potts , Thomas Icard

The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?

The concept of causal abstraction got recently popularised to demystify the opaque decision-making processes of machine learning models; in short, a neural network can be abstracted as a higher-level algorithm if there exists a function…

Machine Learning · Computer Science 2025-11-13 Denis Sutter , Julian Minder , Thomas Hofmann , Tiago Pimentel

Distributionally Robust Causal Abstractions

Causal Abstraction (CA) theory provides a principled framework for relating causal models that describe the same system at different levels of granularity while ensuring interventional consistency between them. Recent methods for learning…

Machine Learning · Computer Science 2026-01-23 Yorgos Felekis , Theodoros Damoulas , Paris Giampouras

How Causal Abstraction Underpins Computational Explanation

Explanations of cognitive behavior often appeal to computations over representations. What does it take for a system to implement a given computation over suitable representational vehicles within that system? We argue that the language of…

Machine Learning · Computer Science 2025-08-18 Atticus Geiger , Jacqueline Harding , Thomas Icard

Jointly Learning Consistent Causal Abstractions Over Multiple Interventional Distributions

An abstraction can be used to relate two structural causal models representing the same system at different levels of resolution. Learning abstractions which guarantee consistency with respect to interventional distributions would allow one…

Artificial Intelligence · Computer Science 2023-05-09 Fabio Massimo Zennaro , Máté Drávucz , Geanina Apachitei , W. Dhammika Widanage , Theodoros Damoulas

Efficient Discovery of Approximate Causal Abstractions via Neural Mechanism Sparsification

Neural networks are hypothesized to implement interpretable causal mechanisms, yet verifying this requires finding a causal abstraction -- a simpler, high-level Structural Causal Model (SCM) faithful to the network under interventions.…

Machine Learning · Computer Science 2026-03-02 Amir Asiaee

Neural Causal Abstractions

The abilities of humans to understand the world in terms of cause and effect relationships, as well as to compress information into abstract concepts, are two hallmark features of human intelligence. These two topics have been studied in…

Machine Learning · Computer Science 2024-02-26 Kevin Xia , Elias Bareinboim

Towards Computing an Optimal Abstraction for Structural Causal Models

Working with causal models at different levels of abstraction is an important feature of science. Existing work has already considered the problem of expressing formally the relation of abstraction between causal models. In this paper, we…

Artificial Intelligence · Computer Science 2022-08-02 Fabio Massimo Zennaro , Paolo Turrini , Theodoros Damoulas

Learning Causal Abstractions of Linear Structural Causal Models

The need for modelling causal knowledge at different levels of granularity arises in several settings. Causal Abstraction provides a framework for formalizing this problem by relating two Structural Causal Models at different levels of…

Machine Learning · Computer Science 2024-06-04 Riccardo Massidda , Sara Magliacane , Davide Bacciu

Abstraction between Structural Causal Models: A Review of Definitions and Properties

Structural causal models (SCMs) are a widespread formalism to deal with causal systems. A recent direction of research has considered the problem of relating formally SCMs at different levels of abstraction, by defining maps between SCMs…

Artificial Intelligence · Computer Science 2022-07-19 Fabio Massimo Zennaro

Multi-Level Causal Embeddings

Abstractions of causal models allow for the coarsening of models such that relations of cause and effect are preserved. Whereas abstractions focus on the relation between two models, in this paper we study a framework for causal embeddings…

Artificial Intelligence · Computer Science 2026-03-02 Willem Schooltink , Fabio Massimo Zennaro