Related papers: A Unifying Causal Framework for Analyzing Dataset …

Fairness Violations and Mitigation under Covariate Shift

We study the problem of learning fair prediction models for unseen test sets distributed differently from the train set. Stability against changes in data distribution is an important mandate for responsible deployment of models. The domain…

Machine Learning · Computer Science 2021-01-26 Harvineet Singh , Rina Singh , Vishwali Mhasawade , Rumi Chunara

Which Invariance Should We Transfer? A Causal Minimax Learning Approach

A major barrier to deploying current machine learning models lies in their non-reliability to dataset shifts. To resolve this problem, most existing studies attempted to transfer stable information to unseen environments. Particularly,…

Machine Learning · Statistics 2023-05-31 Mingzhou Liu , Xiangyu Zheng , Xinwei Sun , Fang Fang , Yizhou Wang

Evaluating Robustness and Uncertainty of Graph Models Under Structural Distributional Shifts

In reliable decision-making systems based on machine learning, models have to be robust to distributional shifts or provide the uncertainty of their predictions. In node-level problems of graph learning, distributional shifts can be…

Machine Learning · Computer Science 2023-11-02 Gleb Bazhenov , Denis Kuznedelev , Andrey Malinin , Artem Babenko , Liudmila Prokhorenkova

Evaluating Model Robustness and Stability to Dataset Shift

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria

"What is Different Between These Datasets?" A Framework for Explaining Data Distribution Shifts

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar , Zhicheng Guo , Cynthia Rudin

Explaining and Adapting Graph Conditional Shift

Graph Neural Networks (GNNs) have shown remarkable performance on graph-structured data. However, recent empirical studies suggest that GNNs are very susceptible to distribution shift. There is still significant ambiguity about why…

Machine Learning · Computer Science 2023-06-07 Qi Zhu , Yizhu Jiao , Natalia Ponomareva , Jiawei Han , Bryan Perozzi

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for…

Machine Learning · Computer Science 2022-02-14 Andrey Malinin , Neil Band , Ganshin , Alexander , German Chesnokov , Yarin Gal , Mark J. F. Gales , Alexey Noskov , Andrey Ploskonosov , Liudmila Prokhorenkova , Ivan Provilkov , Vatsal Raina , Vyas Raina , Roginskiy , Denis , Mariya Shmatova , Panos Tigas , Boris Yangel

Stable Prediction on Graphs with Agnostic Distribution Shift

Graph is a flexible and effective tool to represent complex structures in practice and graph neural networks (GNNs) have been shown to be effective on various graph tasks with randomly separated training and testing data. In real…

Machine Learning · Computer Science 2021-10-11 Shengyu Zhang , Kun Kuang , Jiezhong Qiu , Jin Yu , Zhou Zhao , Hongxia Yang , Zhongfei Zhang , Fei Wu

Preventing Failures Due to Dataset Shift: Learning Predictive Models That Transport

Classical supervised learning produces unreliable models when training and target distributions differ, with most existing solutions requiring samples from the target domain. We propose a proactive approach which learns a relationship in…

Machine Learning · Statistics 2019-03-01 Adarsh Subbaswamy , Peter Schulam , Suchi Saria

Counterfactual Normalization: Proactively Addressing Dataset Shift and Improving Reliability Using Causal Mechanisms

Predictive models can fail to generalize from training to deployment environments because of dataset shift, posing a threat to model reliability and the safety of downstream decisions made in practice. Instead of using samples from the…

Machine Learning · Statistics 2018-08-10 Adarsh Subbaswamy , Suchi Saria

A Meta Learning Approach to Discerning Causal Graph Structure

We explore the usage of meta-learning to derive the causal direction between variables by optimizing over a measure of distribution simplicity. We incorporate a stochastic graph representation which includes latent variables and allows for…

Machine Learning · Computer Science 2021-06-11 Justin Wong , Dominik Damjakob

Score-based Conditional Out-of-Distribution Augmentation for Graph Covariate Shift

Distribution shifts between training and testing datasets significantly impair the model performance on graph learning. A commonly-taken causal view in graph invariant learning suggests that stable predictive features of graphs are causally…

Machine Learning · Computer Science 2025-12-10 Bohan Wang , Yurui Chang , Wei Jin , Lu Lin

Minimax Optimal Estimation of Stability Under Distribution Shift

The performance of decision policies and prediction models often deteriorates when applied to environments different from the ones seen during training. To ensure reliable operation, we analyze the stability of a system under distribution…

Machine Learning · Statistics 2026-02-13 Hongseok Namkoong , Yuanzhe Ma , Peter W. Glynn

Covariate Shift in High-Dimensional Random Feature Regression

A significant obstacle in the development of robust machine learning models is covariate shift, a form of distribution shift that occurs when the input distributions of the training and test sets differ while the conditional label…

Machine Learning · Statistics 2021-11-17 Nilesh Tripuraneni , Ben Adlam , Jeffrey Pennington

Evaluating Robustness to Dataset Shift via Parametric Robustness Sets

We give a method for proactively identifying small, plausible shifts in distribution which lead to large differences in model performance. These shifts are defined via parametric changes in the causal mechanisms of observed variables, where…

Machine Learning · Computer Science 2023-01-18 Nikolaj Thams , Michael Oberst , David Sontag

Distributionally Robust Graph Learning from Smooth Signals under Moment Uncertainty

We consider the problem of learning a graph from a finite set of noisy graph signal observations, the goal of which is to find a smooth representation of the graph signal. Such a problem is motivated by the desire to infer relational…

Machine Learning · Computer Science 2023-02-08 Xiaolu Wang , Yuen-Man Pun , Anthony Man-Cho So

Mind the Graph When Balancing Data for Fairness or Robustness

Failures of fairness or robustness in machine learning predictive settings can be due to undesired dependencies between covariates, outcomes and auxiliary factors of variation. A common strategy to mitigate these failures is data balancing,…

Machine Learning · Computer Science 2025-11-10 Jessica Schrouff , Alexis Bellot , Amal Rannen-Triki , Alan Malek , Isabela Albuquerque , Arthur Gretton , Alexander D'Amour , Silvia Chiappa

Context-Specific Causal Graph Discovery with Unobserved Contexts: Non-Stationarity, Regimes and Spatio-Temporal Patterns

Real-world problems, for example in climate applications, often require causal reasoning on spatially gridded time series data or data with comparable structure. While the underlying system is often believed to behave similarly at different…

Machine Learning · Computer Science 2026-02-16 Martin Rabel , Jakob Runge

Robust Predictive Modeling Under Unseen Data Distribution Shifts: A Methodological Commentary

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

Dr. FERMI: A Stochastic Distributionally Robust Fair Empirical Risk Minimization Framework

While training fair machine learning models has been studied extensively in recent years, most developed methods rely on the assumption that the training and test data have similar distributions. In the presence of distribution shifts, fair…

Machine Learning · Computer Science 2023-09-22 Sina Baharlouei , Meisam Razaviyayn