Related papers: Optimal Transport for Structure Learning Under Mis…

Missing Data Imputation using Optimal Transport

Missing data is a crucial issue when applying machine learning algorithms to real-world datasets. Starting from the simple assumption that two batches extracted randomly from the same dataset should share the same distribution, we leverage…

Machine Learning · Statistics 2020-07-02 Boris Muzellec , Julie Josse , Claire Boyer , Marco Cuturi

Optimal transport and Wasserstein distances for causal models

In this paper, we introduce a variant of optimal transport adapted to the causal structure given by an underlying directed graph $G$. Different graph structures lead to different specifications of the optimal transport problem. For…

Statistics Theory · Mathematics 2024-07-08 Patrick Cheridito , Stephan Eckstein

Learning Causal Graphs via Monotone Triangular Transport Maps

We study the problem of causal structure learning from data using optimal transport (OT). Specifically, we first provide a constraint-based method which builds upon lower-triangular monotone parametric transport maps to design conditional…

Methodology · Statistics 2023-05-30 Sina Akbari , Luca Ganassali , Negar Kiyavash

Interpretable, multi-dimensional Evaluation Framework for Causal Discovery from observational i.i.d. Data

Nonlinear causal discovery from observational data imposes strict identifiability assumptions on the formulation of structural equations utilized in the data generating process. The evaluation of structure learning methods under assumption…

Machine Learning · Statistics 2024-12-17 Georg Velev , Stefan Lessmann

Optimal Transport with Heterogeneously Missing Data

We consider the problem of solving the optimal transport problem between two empirical distributions with missing values. Our main assumption is that the data is missing completely at random (MCAR), but we allow for heterogeneous…

Machine Learning · Statistics 2025-05-26 Linus Bleistein , Aurélien Bellet , Julie Josse

Optimal Experiment Design for Causal Discovery from Fixed Number of Experiments

We study the problem of causal structure learning over a set of random variables when the experimenter is allowed to perform at most $M$ experiments in a non-adaptive manner. We consider the optimal learning strategy in terms of minimizing…

Machine Learning · Computer Science 2017-03-01 AmirEmad Ghassami , Saber Salehkaleybar , Negar Kiyavash

Robust Causal Discovery under Imperfect Structural Constraints

Robust causal discovery from observational data under imperfect prior knowledge remains a significant and largely unresolved challenge. Existing methods typically presuppose perfect priors or can only handle specific, pre-identified error…

Machine Learning · Computer Science 2025-11-11 Zidong Wang , Xi Lin , Chuchao He , Xiaoguang Gao

A primer on optimal transport for causal inference with observational data

The theory of optimal transportation has developed into a powerful and elegant framework for comparing probability distributions, with wide-ranging applications in all areas of science. The fundamental idea of analyzing probabilities by…

Methodology · Statistics 2025-03-14 Florian F Gunsilius

Causal Discovery with Heterogeneous Observational Data

We consider the problem of causal discovery (structure learning) from heterogeneous observational data. Most existing methods assume a homogeneous sampling scheme, which leads to misleading conclusions when violated in many applications. To…

Methodology · Statistics 2022-02-01 Fangting Zhou , Kejun He , Yang Ni

Identification of Causal Structure in the Presence of Missing Data with Additive Noise Model

Missing data are an unavoidable complication frequently encountered in many causal discovery tasks. When a missing process depends on the missing values themselves (known as self-masking missingness), the recovery of the joint distribution…

Machine Learning · Computer Science 2023-12-20 Jie Qiao , Zhengming Chen , Jianhua Yu , Ruichu Cai , Zhifeng Hao

Designing Ambiguity Sets for Distributionally Robust Optimization Using Structural Causal Optimal Transport

Distributionally robust optimization tackles out-of-sample issues like overfitting and distribution shifts by adopting an adversarial approach over a range of possible data distributions, known as the ambiguity set. To balance conservatism…

Machine Learning · Computer Science 2025-10-02 Ahmad-Reza Ehyaei , Golnoosh Farnadi , Samira Samadi

Causal learning with sufficient statistics: an information bottleneck approach

The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional…

Machine Learning · Statistics 2020-10-13 Daniel Chicharro , Michel Besserve , Stefano Panzeri

On the Role of Entropy-based Loss for Learning Causal Structures with Continuous Optimization

Causal discovery from observational data is an important but challenging task in many scientific fields. Recently, a method with non-combinatorial directed acyclic constraint, called NOTEARS, formulates the causal structure learning problem…

Machine Learning · Computer Science 2023-10-31 Weilin Chen , Jie Qiao , Ruichu Cai , Zhifeng Hao

Distribution Shift in Missing Data Imputation: A Risk-Based Perspective and Importance-Weighted Correction under MAR

Missing data imputation, where a model is trained on observed data to estimate unobserved values, is a fundamental problem in machine learning. In this paper, we rigorously formulate imputation model learning as a mean-squared error risk…

Machine Learning · Statistics 2026-05-14 Luke Shannon , Song Liu , Katarzyna Reluga

Reliable Causal Discovery with Improved Exact Search and Weaker Assumptions

Many of the causal discovery methods rely on the faithfulness assumption to guarantee asymptotic correctness. However, the assumption can be approximately violated in many ways, leading to sub-optimal solutions. Although there is a line of…

Machine Learning · Computer Science 2022-01-19 Ignavier Ng , Yujia Zheng , Jiji Zhang , Kun Zhang

Optimal Transportation by Orthogonal Coupling Dynamics

Many numerical and learning algorithms rely on the solution of the Monge-Kantorovich problem and Wasserstein distances, which provide appropriate distributional metrics. While the natural approach is to treat the problem as an…

Optimization and Control · Mathematics 2025-12-11 Mohsen Sadr , Peyman Mohajerin Esfahani , Hossein Gorji

Learning Ultrametric Trees for Optimal Transport Regression

Optimal transport provides a metric which quantifies the dissimilarity between probability measures. For measures supported in discrete metric spaces, finding the optimal transport distance has cubic time complexity in the size of the…

Machine Learning · Computer Science 2024-01-30 Samantha Chen , Puoya Tabaghi , Yusu Wang

Co-clustering through Optimal Transport

In this paper, we present a novel method for co-clustering, an unsupervised learning approach that aims at discovering homogeneous groups of data instances and features by grouping them simultaneously. The proposed method uses the entropy…

Machine Learning · Statistics 2017-05-22 Charlotte Laclau , Ievgen Redko , Basarab Matei , Younès Bennani , Vincent Brault

A contribution to Optimal Transport on incomparable spaces

Optimal Transport is a theory that allows to define geometrical notions of distance between probability distributions and to find correspondences, relationships, between sets of points. Many machine learning applications are derived from…

Machine Learning · Statistics 2020-11-10 Titouan Vayer

Representation Learning via Adversarially-Contrastive Optimal Transport

In this paper, we study the problem of learning compact (low-dimensional) representations for sequential data that captures its implicit spatio-temporal cues. To maximize extraction of such informative cues from the data, we set the problem…

Machine Learning · Computer Science 2020-07-14 Anoop Cherian , Shuchin Aeron