English

Learning Clustering-based Prototypes for Compositional Zero-shot Learning

Computer Vision and Pattern Recognition 2025-02-25 v2

Abstract

Learning primitive (i.e., attribute and object) concepts from seen compositions is the primary challenge of Compositional Zero-Shot Learning (CZSL). Existing CZSL solutions typically rely on oversimplified data assumptions, e.g., modeling each primitive with a single centroid primitive representation, ignoring the natural diversities of the attribute (resp. object) when coupled with different objects (resp. attribute). In this work, we develop ClusPro, a robust clustering-based prototype mining framework for CZSL that defines the conceptual boundaries of primitives through a set of diversified prototypes. Specifically, ClusPro conducts within-primitive clustering on the embedding space for automatically discovering and dynamically updating prototypes. These representative prototypes are subsequently used to repaint a well-structured and independent primitive embedding space, ensuring intra-primitive separation and inter-primitive decorrelation through prototype-based contrastive learning and decorrelation learning. Moreover, ClusPro efficiently performs prototype clustering in a non-parametric fashion without the introduction of additional learnable parameters or computational budget during testing. Experiments on three benchmarks demonstrate ClusPro outperforms various top-leading CZSL solutions under both closed-world and open-world settings.

Keywords

Cite

@article{arxiv.2502.06501,
  title  = {Learning Clustering-based Prototypes for Compositional Zero-shot Learning},
  author = {Hongyu Qu and Jianan Wei and Xiangbo Shu and Wenguan Wang},
  journal= {arXiv preprint arXiv:2502.06501},
  year   = {2025}
}

Comments

Accepted to ICLR 2025; Project page: https://github.com/quhongyu/ClusPro

R2 v1 2026-06-28T21:38:38.248Z