Learnware: Small Models Do Big

Zhi-Hua Zhou; Zhi-Hao Tan

doi:10.1007/s11432-023-3823-6

Machine Learning, Artificial Intelligence; 2023-10-31 (v3)

Abstract

Current machine learning techniques draw complaints on several fronts, including the need for huge amounts of training data and proficient training skills, the difficulty of continual learning, the risk of catastrophic forgetting, and the leakage of private or proprietary data. Most research efforts have addressed these issues one at a time, paying less attention to the fact that they are entangled in practice. The prevailing big model paradigm, which has achieved impressive results in natural language processing and computer vision applications, has not yet addressed these issues, while becoming a serious source of carbon emissions. This article offers an overview of the learnware paradigm, which aims to spare users from building machine learning models from scratch, in the hope that small models can be reused for tasks even beyond their original purposes. The key ingredient is the specification, which enables a trained model to be adequately identified for reuse according to the requirements of future users who know nothing about the model in advance.

Keywords

machine learning; machine learning theory; unified modeling language

Cite

@article{arxiv.2210.03647,
  title  = {Learnware: Small Models Do Big},
  author = {Zhi-Hua Zhou and Zhi-Hao Tan},
  journal= {arXiv preprint arXiv:2210.03647},
  year   = {2023}
}

Comments

Cite as: Zhi-Hua Zhou, Zhi-Hao Tan. Learnware: Small models do big. Science China Information Sciences, 2024, 67(1): 112102.01-112102.12
