Related papers: Identifying Representations for Intervention Extra…

Learning Representations that Support Extrapolation

Extrapolation -- the ability to make inferences that go beyond the scope of one's experiences -- is a hallmark of human intelligence. By contrast, the generalization exhibited by contemporary neural network algorithms is largely limited to…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-08 · Taylor W. Webb, Zachary Dulberg, Steven M. Frankland, Alexander A. Petrov, Randall C. O'Reilly, Jonathan D. Cohen

Learning Robust Intervention Representations with Delta Embeddings

Causal representation learning has attracted significant research interest during the past few years, as a means for improving model generalization and robustness. Causal representations of interventional image pairs (also called…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-03 · Panagiotis Alimisis, Christos Diou

Identifying Linearly-Mixed Causal Representations from Multi-Node Interventions

The task of inferring high-level causal variables from low-level observations, commonly referred to as causal representation learning, is fundamentally underconstrained. As such, recent works to address this problem focus on various…

Machine Learning · Statistics · 2024-03-26 · Simon Bing, Urmi Ninad, Jonas Wahl, Jakob Runge

On Linear Identifiability of Learned Representations

Identifiability is a desirable property of a statistical model: it implies that the true model parameters may be estimated to any desired precision, given sufficient computational resources and data. We study identifiability in the context…

Machine Learning · Statistics · 2020-07-09 · Geoffrey Roeder, Luke Metz, Diederik P. Kingma

Leveraging Task Structures for Improved Identifiability in Neural Network Representations

This work extends the theory of identifiability in supervised learning by considering the consequences of having access to a distribution of tasks. In such cases, we show that linear identifiability is achievable in the general multi-task…

Machine Learning · Statistics · 2024-08-26 · Wenlin Chen, Julien Horwood, Juyeon Heo, José Miguel Hernández-Lobato

The Extrapolation Power of Implicit Models

In this paper, we investigate the extrapolation capabilities of implicit deep learning models in handling unobserved data, where traditional deep neural networks may falter. Implicit models, distinguished by their adaptability in layer…

Machine Learning · Computer Science · 2024-07-22 · Juliette Decugis, Alicia Y. Tsai, Max Emerling, Ashwin Ganesh, Laurent El Ghaoui

Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models

A retrieval model should not only interpolate the training data but also extrapolate well to the queries that are different from the training data. While neural retrieval models have demonstrated impressive performance on ad-hoc search…

Information Retrieval · Computer Science · 2022-08-05 · Jingtao Zhan, Xiaohui Xie, Jiaxin Mao, Yiqun Liu, Jiafeng Guo, Min Zhang, Shaoping Ma

Beyond identifiability: Learning causal representations with few environments and finite samples

We provide explicit, finite-sample guarantees for learning causal representations from data with a sublinear number of environments. Causal representation learning seeks to provide a rigorous foundation for the general representation…

Machine Learning · Statistics · 2026-03-30 · Inbeom Lee, Tongtong Jin, Bryon Aragam

Properties from Mechanisms: An Equivariance Perspective on Identifiable Representation Learning

A key goal of unsupervised representation learning is "inverting" a data generating process to recover its latent properties. Existing work that provably achieves this goal relies on strong assumptions on relationships between the latent…

Machine Learning · Computer Science · 2021-11-01 · Kartik Ahuja, Jason Hartford, Yoshua Bengio

Towards Understanding Extrapolation: a Causal Lens

Canonical work handling distribution shifts typically necessitates an entire target distribution that lands inside the training distribution. However, practical scenarios often involve only a handful of target samples, potentially lying…

Machine Learning · Computer Science · 2025-01-17 · Lingjing Kong, Guangyi Chen, Petar Stojanov, Haoxuan Li, Eric P. Xing, Kun Zhang

A theory of independent mechanisms for extrapolation in generative models

Generative models can be trained to emulate complex empirical data, but are they useful to make predictions in the context of previously unobserved environments? An intuitive idea to promote such extrapolation capabilities is to have the…

Machine Learning · Computer Science · 2022-01-03 · Michel Besserve, Rémy Sun, Dominik Janzing, Bernhard Schölkopf

On Evolution-Based Models for Experimentation Under Interference

Causal effect estimation in networked systems is central to data-driven decision making. In such settings, interventions on one unit can spill over to others, and in complex physical or social systems, the interaction pathways driving these…

Machine Learning · Statistics · 2025-11-27 · Sadegh Shirani, Mohsen Bayati

Extrapolation Frameworks in Cognitive Psychology Suitable for Study of Image Classification Models

We study the functional task of deep learning image classification models and show that image classification requires extrapolation capabilities. This suggests that new theories have to be developed for the understanding of deep learning as…

Machine Learning · Computer Science · 2021-12-08 · Roozbeh Yousefzadeh, Jessica A. Mollick

Non-linear Interventions on Large Language Models

Intervention is one of the most representative and widely used methods for understanding the internal representations of large language models (LLMs). However, existing intervention methods are confined to linear interventions grounded in…

Computation and Language · Computer Science · 2026-05-15 · Sangwoo Kim

Nonparametric Identifiability of Causal Representations from Unknown Interventions

We study causal representation learning, the task of inferring latent causal variables and their causal relations from high-dimensional mixtures of the variables. Prior work relies on weak supervision, in the form of counterfactual pre- and…

Machine Learning · Statistics · 2023-10-31 · Julius von Kügelgen, Michel Besserve, Liang Wendong, Luigi Gresele, Armin Kekić, Elias Bareinboim, David M. Blei, Bernhard Schölkopf

Score-based Causal Representation Learning: Linear and General Transformations

This paper addresses intervention-based causal representation learning (CRL) under a general nonparametric latent causal model and an unknown transformation that maps the latent variables to the observed variables. Linear and general…

Machine Learning · Computer Science · 2025-07-22 · Burak Varıcı, Emre Acartürk, Karthikeyan Shanmugam, Abhishek Kumar, Ali Tajer

Reliable extrapolation of deep neural operators informed by physics or sparse observations

Deep neural operators can learn nonlinear mappings between infinite-dimensional function spaces via deep neural networks. As promising surrogate solvers of partial differential equations (PDEs) for real-time prediction, deep neural…

Machine Learning · Computer Science · 2023-05-17 · Min Zhu, Handi Zhang, Anran Jiao, George Em Karniadakis, Lu Lu

General Identifiability and Achievability for Causal Representation Learning

This paper focuses on causal representation learning (CRL) under a general nonparametric latent causal model and a general transformation model that maps the latent data to the observational data. It establishes identifiability and…

Machine Learning · Computer Science · 2024-02-15 · Burak Varıcı, Emre Acartürk, Karthikeyan Shanmugam, Ali Tajer

Task-Induced Representation Learning

In this work, we evaluate the effectiveness of representation learning approaches for decision making in visually complex environments. Representation learning is essential for effective reinforcement learning (RL) from high-dimensional…

Machine Learning · Computer Science · 2022-04-26 · Jun Yamada, Karl Pertsch, Anisha Gunjal, Joseph J. Lim

Identifiable Latent Polynomial Causal Models Through the Lens of Change

Causal representation learning aims to unveil latent high-level causal representations from observed low-level data. One of its primary tasks is to provide reliable assurance of identifying these latent causal models, known as…

Machine Learning · Computer Science · 2024-12-02 · Yuhang Liu, Zhen Zhang, Dong Gong, Mingming Gong, Biwei Huang, Anton van den Hengel, Kun Zhang, Javen Qinfeng Shi