English
Related papers

Related papers: A Unifying Causal Framework for Analyzing Dataset …

200 papers

We study the problem of learning fair prediction models for unseen test sets distributed differently from the train set. Stability against changes in data distribution is an important mandate for responsible deployment of models. The domain…

Machine Learning · Computer Science 2021-01-26 Harvineet Singh , Rina Singh , Vishwali Mhasawade , Rumi Chunara

A major barrier to deploying current machine learning models lies in their non-reliability to dataset shifts. To resolve this problem, most existing studies attempted to transfer stable information to unseen environments. Particularly,…

Machine Learning · Statistics 2023-05-31 Mingzhou Liu , Xiangyu Zheng , Xinwei Sun , Fang Fang , Yizhou Wang

In reliable decision-making systems based on machine learning, models have to be robust to distributional shifts or provide the uncertainty of their predictions. In node-level problems of graph learning, distributional shifts can be…

Machine Learning · Computer Science 2023-11-02 Gleb Bazhenov , Denis Kuznedelev , Andrey Malinin , Artem Babenko , Liudmila Prokhorenkova

As the use of machine learning in high impact domains becomes widespread, the importance of evaluating safety has increased. An important aspect of this is evaluating how robust a model is to changes in setting or population, which…

Machine Learning · Computer Science 2021-03-16 Adarsh Subbaswamy , Roy Adams , Suchi Saria

The performance of machine learning models relies heavily on the quality of input data, yet real-world applications often face significant data-related challenges. A common issue arises when curating training data or deploying models: two…

Machine Learning · Computer Science 2025-09-24 Varun Babbar , Zhicheng Guo , Cynthia Rudin

Graph Neural Networks (GNNs) have shown remarkable performance on graph-structured data. However, recent empirical studies suggest that GNNs are very susceptible to distribution shift. There is still significant ambiguity about why…

Machine Learning · Computer Science 2023-06-07 Qi Zhu , Yizhu Jiao , Natalia Ponomareva , Jiawei Han , Bryan Perozzi

There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for…

Graph is a flexible and effective tool to represent complex structures in practice and graph neural networks (GNNs) have been shown to be effective on various graph tasks with randomly separated training and testing data. In real…

Machine Learning · Computer Science 2021-10-11 Shengyu Zhang , Kun Kuang , Jiezhong Qiu , Jin Yu , Zhou Zhao , Hongxia Yang , Zhongfei Zhang , Fei Wu

Classical supervised learning produces unreliable models when training and target distributions differ, with most existing solutions requiring samples from the target domain. We propose a proactive approach which learns a relationship in…

Machine Learning · Statistics 2019-03-01 Adarsh Subbaswamy , Peter Schulam , Suchi Saria

Predictive models can fail to generalize from training to deployment environments because of dataset shift, posing a threat to model reliability and the safety of downstream decisions made in practice. Instead of using samples from the…

Machine Learning · Statistics 2018-08-10 Adarsh Subbaswamy , Suchi Saria

We explore the usage of meta-learning to derive the causal direction between variables by optimizing over a measure of distribution simplicity. We incorporate a stochastic graph representation which includes latent variables and allows for…

Machine Learning · Computer Science 2021-06-11 Justin Wong , Dominik Damjakob

Distribution shifts between training and testing datasets significantly impair the model performance on graph learning. A commonly-taken causal view in graph invariant learning suggests that stable predictive features of graphs are causally…

Machine Learning · Computer Science 2025-12-10 Bohan Wang , Yurui Chang , Wei Jin , Lu Lin

The performance of decision policies and prediction models often deteriorates when applied to environments different from the ones seen during training. To ensure reliable operation, we analyze the stability of a system under distribution…

Machine Learning · Statistics 2026-02-13 Hongseok Namkoong , Yuanzhe Ma , Peter W. Glynn

A significant obstacle in the development of robust machine learning models is covariate shift, a form of distribution shift that occurs when the input distributions of the training and test sets differ while the conditional label…

Machine Learning · Statistics 2021-11-17 Nilesh Tripuraneni , Ben Adlam , Jeffrey Pennington

We give a method for proactively identifying small, plausible shifts in distribution which lead to large differences in model performance. These shifts are defined via parametric changes in the causal mechanisms of observed variables, where…

Machine Learning · Computer Science 2023-01-18 Nikolaj Thams , Michael Oberst , David Sontag

We consider the problem of learning a graph from a finite set of noisy graph signal observations, the goal of which is to find a smooth representation of the graph signal. Such a problem is motivated by the desire to infer relational…

Machine Learning · Computer Science 2023-02-08 Xiaolu Wang , Yuen-Man Pun , Anthony Man-Cho So

Failures of fairness or robustness in machine learning predictive settings can be due to undesired dependencies between covariates, outcomes and auxiliary factors of variation. A common strategy to mitigate these failures is data balancing,…

Real-world problems, for example in climate applications, often require causal reasoning on spatially gridded time series data or data with comparable structure. While the underlying system is often believed to behave similarly at different…

Machine Learning · Computer Science 2026-02-16 Martin Rabel , Jakob Runge

Most research designing novel predictive models, or employing existing ones, assumes that training and testing data are independent and identically distributed. In practice, the data encountered at serving time often deviate from the…

Machine Learning · Computer Science 2026-03-30 Hanyu Duan , Yi Yang , Ahmed Abbasi , Kar Yan Tam

While training fair machine learning models has been studied extensively in recent years, most developed methods rely on the assumption that the training and test data have similar distributions. In the presence of distribution shifts, fair…

Machine Learning · Computer Science 2023-09-22 Sina Baharlouei , Meisam Razaviyayn
‹ Prev 1 2 3 10 Next ›