English

Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification

Methodology 2024-07-08 v2 Machine Learning

Abstract

Making predictions in an unseen environment given data from multiple training environments is a challenging task. We approach this problem from an invariance perspective, focusing on binary classification to shed light on general nonlinear data generation mechanisms. We identify a unique form of invariance that exists solely in a binary setting that allows us to train models invariant over environments. We provide sufficient conditions for such invariance and show it is robust even when environmental conditions vary greatly. Our formulation admits a causal interpretation, allowing us to compare it with various frameworks. Finally, we propose a heuristic prediction method and conduct experiments using real and synthetic datasets.

Keywords

Cite

@article{arxiv.2404.15245,
  title  = {Mining Invariance from Nonlinear Multi-Environment Data: Binary Classification},
  author = {Austin Goddard and Kang Du and Yu Xiang},
  journal= {arXiv preprint arXiv:2404.15245},
  year   = {2024}
}

Comments

Accepted to the 2024 International Symposium on Information Theory (ISIT)