Provable Compositional Generalization for Object-Centric Learning

Thaddäus Wiedemer; Jack Brady; Alexander Panfilov; Attila Juhos; Matthias Bethge; Wieland Brendel

Provable Compositional Generalization for Object-Centric Learning

Machine Learning 2024-11-13 v2

Authors: Thaddäus Wiedemer , Jack Brady , Alexander Panfilov , Attila Juhos , Matthias Bethge , Wieland Brendel

Abstract

Learning representations that generalize to novel compositions of known concepts is crucial for bridging the gap between human and machine perception. One prominent effort is learning object-centric representations, which are widely conjectured to enable compositional generalization. Yet, it remains unclear when this conjecture will be true, as a principled theoretical or empirical understanding of compositional generalization is lacking. In this work, we investigate when compositional generalization is guaranteed for object-centric representations through the lens of identifiability theory. We show that autoencoders that satisfy structural assumptions on the decoder and enforce encoder-decoder consistency will learn object-centric representations that provably generalize compositionally. We validate our theoretical result and highlight the practical relevance of our assumptions through experiments on synthetic image data.

Keywords

representation learning generalization in machine learning image representation learning

Cite

@article{arxiv.2310.05327,
  title  = {Provable Compositional Generalization for Object-Centric Learning},
  author = {Thaddäus Wiedemer and Jack Brady and Alexander Panfilov and Attila Juhos and Matthias Bethge and Wieland Brendel},
  journal= {arXiv preprint arXiv:2310.05327},
  year   = {2024}
}

Comments

Oral at ICLR 2024. The first four authors contributed equally

Provable Compositional Generalization for Object-Centric Learning

Abstract

Keywords

Cite

Comments

Related papers