English

Learning Primitive Relations for Compositional Zero-Shot Learning

Computer Vision and Pattern Recognition 2025-01-27 v1 Artificial Intelligence

Abstract

Compositional Zero-Shot Learning (CZSL) aims to identify unseen state-object compositions by leveraging knowledge learned from seen compositions. Existing approaches often independently predict states and objects, overlooking their relationships. In this paper, we propose a novel framework, learning primitive relations (LPR), designed to probabilistically capture the relationships between states and objects. By employing the cross-attention mechanism, LPR considers the dependencies between states and objects, enabling the model to infer the likelihood of unseen compositions. Experimental results demonstrate that LPR outperforms state-of-the-art methods on all three CZSL benchmark datasets in both closed-world and open-world settings. Through qualitative analysis, we show that LPR leverages state-object relationships for unseen composition prediction.

Keywords

Cite

@article{arxiv.2501.14308,
  title  = {Learning Primitive Relations for Compositional Zero-Shot Learning},
  author = {Insu Lee and Jiseob Kim and Kyuhong Shim and Byonghyo Shim},
  journal= {arXiv preprint arXiv:2501.14308},
  year   = {2025}
}

Comments

Accepted to ICASSP 2025

R2 v1 2026-06-28T21:15:52.471Z