English

Compressing Sentence Representation with maximum Coding Rate Reduction

Computation and Language 2023-04-26 v1 Machine Learning

Abstract

In most natural language inference problems, sentence representation is needed for semantic retrieval tasks. In recent years, pre-trained large language models have been quite effective for computing such representations. These models produce high-dimensional sentence embeddings. An evident performance gap between large and small models exists in practice. Hence, due to space and time hardware limitations, there is a need to attain comparable results when using the smaller model, which is usually a distilled version of the large language model. In this paper, we assess the model distillation of the sentence representation model Sentence-BERT by augmenting the pre-trained distilled model with a projection layer additionally learned on the Maximum Coding Rate Reduction (MCR2)objective, a novel approach developed for general-purpose manifold clustering. We demonstrate that the new language model with reduced complexity and sentence embedding size can achieve comparable results on semantic retrieval benchmarks.

Keywords

Cite

@article{arxiv.2304.12674,
  title  = {Compressing Sentence Representation with maximum Coding Rate Reduction},
  author = {Domagoj Ševerdija and Tomislav Prusina and Antonio Jovanović and Luka Borozan and Jurica Maltar and Domagoj Matijević},
  journal= {arXiv preprint arXiv:2304.12674},
  year   = {2023}
}

Comments

14 pages, 3 figures, accepted on ICT and Electronics Convention (MIPRO), Croatia

R2 v1 2026-06-28T10:16:55.315Z