Related papers: Causal and Compositional Abstraction

Causal Abstraction Inference under Lossy Representations

The study of causal abstractions bridges two integral components of human intelligence: the ability to determine cause and effect, and the ability to interpret complex patterns into abstract concepts. Formally, causal abstraction frameworks…

Machine Learning · Computer Science 2025-09-29 Kevin Xia , Elias Bareinboim

Causal Abstractions, Categorically Unified

We present a categorical framework for relating causal models that represent the same system at different levels of abstraction. We define a causal abstraction as natural transformations between appropriate Markov functors, which concisely…

Machine Learning · Statistics 2025-10-07 Markus Englberger , Devendra Singh Dhami

Compositional Abstraction Error and a Category of Causal Models

Interventional causal models describe several joint distributions over some variables used to describe a system, one for each intervention setting. They provide a formal recipe for how to move between the different joint distributions and…

Machine Learning · Statistics 2021-08-06 Eigil F. Rischel , Sebastian Weichwald

Causal Abstraction: A Theoretical Foundation for Mechanistic Interpretability

Causal abstraction provides a theoretical foundation for mechanistic interpretability, the field concerned with providing intelligible algorithms that are faithful simplifications of the known, but opaque low-level details of black box AI…

Artificial Intelligence · Computer Science 2025-05-12 Atticus Geiger , Duligur Ibeling , Amir Zur , Maheep Chaudhary , Sonakshi Chauhan , Jing Huang , Aryaman Arora , Zhengxuan Wu , Noah Goodman , Christopher Potts , Thomas Icard

Causal Abstraction with Soft Interventions

Causal abstraction provides a theory describing how several causal models can represent the same system at different levels of detail. Existing theoretical proposals limit the analysis of abstract models to "hard" interventions fixing…

Artificial Intelligence · Computer Science 2022-11-23 Riccardo Massidda , Atticus Geiger , Thomas Icard , Davide Bacciu

Learning Causal Abstractions of Linear Structural Causal Models

The need for modelling causal knowledge at different levels of granularity arises in several settings. Causal Abstraction provides a framework for formalizing this problem by relating two Structural Causal Models at different levels of…

Machine Learning · Computer Science 2024-06-04 Riccardo Massidda , Sara Magliacane , Davide Bacciu

Causal Abstraction in Model Interpretability: A Compact Survey

The pursuit of interpretable artificial intelligence has led to significant advancements in the development of methods that aim to explain the decision-making processes of complex models, such as deep learning systems. Among these methods,…

Machine Learning · Computer Science 2024-10-29 Yihao Zhang

How Causal Abstraction Underpins Computational Explanation

Explanations of cognitive behavior often appeal to computations over representations. What does it take for a system to implement a given computation over suitable representational vehicles within that system? We argue that the language of…

Machine Learning · Computer Science 2025-08-18 Atticus Geiger , Jacqueline Harding , Thomas Icard

Jointly Learning Consistent Causal Abstractions Over Multiple Interventional Distributions

An abstraction can be used to relate two structural causal models representing the same system at different levels of resolution. Learning abstractions which guarantee consistency with respect to interventional distributions would allow one…

Artificial Intelligence · Computer Science 2023-05-09 Fabio Massimo Zennaro , Máté Drávucz , Geanina Apachitei , W. Dhammika Widanage , Theodoros Damoulas

Neural Causal Abstractions

The abilities of humans to understand the world in terms of cause and effect relationships, as well as to compress information into abstract concepts, are two hallmark features of human intelligence. These two topics have been studied in…

Machine Learning · Computer Science 2024-02-26 Kevin Xia , Elias Bareinboim

Abstracting Causal Models

We consider a sequence of successively more restrictive definitions of abstraction for causal models, starting with a notion introduced by Rubenstein et al. (2017) called exact transformation that applies to probabilistic causal models,…

Artificial Intelligence · Computer Science 2019-07-11 Sander Beckers , Joseph Y. Halpern

Quantifying Consistency and Information Loss for Causal Abstraction Learning

Structural causal models provide a formalism to express causal relations between variables of interest. Models and variables can represent a system at different levels of abstraction, whereby relations may be coarsened and refined according…

Artificial Intelligence · Computer Science 2023-05-09 Fabio Massimo Zennaro , Paolo Turrini , Theodoros Damoulas

Causal Abstractions of Neural Networks

Structural analysis methods (e.g., probing and feature attribution) are increasingly important tools for neural network analysis. We propose a new structural analysis method grounded in a formal theory of causal abstraction that provides…

Artificial Intelligence · Computer Science 2021-10-28 Atticus Geiger , Hanson Lu , Thomas Icard , Christopher Potts

Aligning Graphical and Functional Causal Abstractions

Causal abstractions allow us to relate causal models on different levels of granularity. To ensure that the models agree on cause and effect, frameworks for causal abstractions define notions of consistency. Two distinct methods for causal…

Artificial Intelligence · Computer Science 2025-03-17 Willem Schooltink , Fabio Massimo Zennaro

Finding Alignments Between Interpretable Causal Variables and Distributed Neural Representations

Causal abstraction is a promising theoretical framework for explainable artificial intelligence that defines when an interpretable high-level causal model is a faithful simplification of a low-level deep learning system. However, existing…

Artificial Intelligence · Computer Science 2024-02-23 Atticus Geiger , Zhengxuan Wu , Christopher Potts , Thomas Icard , Noah D. Goodman

Abstraction between Structural Causal Models: A Review of Definitions and Properties

Structural causal models (SCMs) are a widespread formalism to deal with causal systems. A recent direction of research has considered the problem of relating formally SCMs at different levels of abstraction, by defining maps between SCMs…

Artificial Intelligence · Computer Science 2022-07-19 Fabio Massimo Zennaro

Continual Causal Abstractions

This short paper discusses continually updated causal abstractions as a potential direction of future research. The key idea is to revise the existing level of causal abstraction to a different level of detail that is both consistent with…

Artificial Intelligence · Computer Science 2023-01-10 Matej Zečević , Moritz Willig , Jonas Seng , Florian Peter Busch

Combining Causal Models for More Accurate Abstractions of Neural Networks

Mechanistic interpretability aims to reverse engineer neural networks by uncovering which high-level algorithms they implement. Causal abstraction provides a precise notion of when a network implements an algorithm, i.e., a causal model of…

Machine Learning · Computer Science 2025-03-17 Theodora-Mara Pîslar , Sara Magliacane , Atticus Geiger

Approximate Causal Abstraction

Scientific models describe natural phenomena at different levels of abstraction. Abstract descriptions can provide the basis for interventions on the system and explanation of observed phenomena at a level of granularity that is coarser…

Artificial Intelligence · Computer Science 2019-07-02 Sander Beckers , Frederick Eberhardt , Joseph Y. Halpern

The Non-Linear Representation Dilemma: Is Causal Abstraction Enough for Mechanistic Interpretability?

The concept of causal abstraction got recently popularised to demystify the opaque decision-making processes of machine learning models; in short, a neural network can be abstracted as a higher-level algorithm if there exists a function…

Machine Learning · Computer Science 2025-11-13 Denis Sutter , Julian Minder , Thomas Hofmann , Tiago Pimentel