English

Cliff-Learning

Machine Learning 2023-06-08 v2 Artificial Intelligence Machine Learning

Abstract

We study the data-scaling of transfer learning from foundation models in the low-downstream-data regime. We observe an intriguing phenomenon which we call cliff-learning. Cliff-learning refers to regions of data-scaling laws where performance improves at a faster than power law rate (i.e. regions of concavity on a log-log scaling plot). We conduct an in-depth investigation of foundation-model cliff-learning and study toy models of the phenomenon. We observe that the degree of cliff-learning reflects the degree of compatibility between the priors of a learning algorithm and the task being learned.

Keywords

Cite

@article{arxiv.2302.07348,
  title  = {Cliff-Learning},
  author = {Tony T. Wang and Igor Zablotchi and Nir Shavit and Jonathan S. Rosenfeld},
  journal= {arXiv preprint arXiv:2302.07348},
  year   = {2023}
}

Comments

16 pages; v2 updates: improved layout, added acknowledgements

R2 v1 2026-06-28T08:40:16.621Z