English

MOROCCO: Model Resource Comparison Framework

Computation and Language 2021-04-30 v1

Abstract

The new generation of pre-trained NLP models push the SOTA to the new limits, but at the cost of computational resources, to the point that their use in real production environments is often prohibitively expensive. We tackle this problem by evaluating not only the standard quality metrics on downstream tasks but also the memory footprint and inference time. We present MOROCCO, a framework to compare language models compatible with \texttt{jiant} environment which supports over 50 NLU tasks, including SuperGLUE benchmark and multiple probing suites. We demonstrate its applicability for two GLUE-like suites in different languages.

Cite

@article{arxiv.2104.14314,
  title  = {MOROCCO: Model Resource Comparison Framework},
  author = {Valentin Malykh and Alexander Kukushkin and Ekaterina Artemova and Vladislav Mikhailov and Maria Tikhonova and Tatiana Shavrina},
  journal= {arXiv preprint arXiv:2104.14314},
  year   = {2021}
}
R2 v1 2026-06-24T01:37:54.043Z