Continual Learning in Linear Classification on Separable Data
Machine Learning
2023-06-07 v1 Numerical Analysis
Abstract
We analyze continual learning on a sequence of separable linear classification tasks with binary labels. We show theoretically that learning with weak regularization reduces to solving a sequential max-margin problem, which is a special case of the Projection Onto Convex Sets (POCS) framework. We then develop upper bounds on the forgetting and on other quantities of interest under various settings with recurring tasks, including cyclic and random task orderings. We discuss several practical implications for popular training practices such as regularization scheduling and weighting. Finally, we point out several theoretical differences between our continual classification setting and a recently studied continual regression setting.
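As a rough illustration of the sequential max-margin view, the sketch below assumes a simplified setting that the abstract does not specify: each task holds a single labeled sample (x, y), so the feasible set {w : y * <x, w> >= 1} is a halfspace and the Euclidean projection onto it has a closed form. Cycling through the tasks is then a POCS-style iteration. The function names, the toy data, and the number of cycles are all hypothetical choices made for this example and are not taken from the paper.

```python
import numpy as np


def project_onto_halfspace(w, x, y):
    """Euclidean projection of w onto the halfspace {v : y * <x, v> >= 1}."""
    margin = y * np.dot(x, w)
    if margin >= 1.0:
        # The constraint already holds, so the closest feasible point is w itself.
        return w.copy()
    # Otherwise move along y * x just far enough to reach margin exactly 1.
    return w + (1.0 - margin) / np.dot(x, x) * y * x


def sequential_max_margin(tasks, n_cycles, dim):
    """Cycle through single-sample tasks, projecting the iterate each time."""
    w = np.zeros(dim)
    for _ in range(n_cycles):
        for x, y in tasks:
            w = project_onto_halfspace(w, x, y)
    return w


def margins(w, tasks):
    """Margin y * <x, w> of the iterate w on every task."""
    return np.array([y * np.dot(x, w) for x, y in tasks])


# Hypothetical toy data: two jointly separable single-sample tasks in 2D.
# For example, w = (1, 1.5) attains margin 1 on both.
tasks = [
    (np.array([1.0, 0.0]), 1),
    (np.array([-0.5, 1.0]), 1),
]

w_one_pass = sequential_max_margin(tasks, n_cycles=1, dim=2)
w_many_passes = sequential_max_margin(tasks, n_cycles=50, dim=2)

print("margins after 1 cycle:  ", margins(w_one_pass, tasks))
print("margins after 50 cycles:", margins(w_many_passes, tasks))
```

In this toy run, fitting the second task pulls the first task's margin below 1 after a single pass, which is one simple way to picture forgetting. Repeating the cycle drives the iterate toward a point that satisfies both constraints. With several samples per task the projection is onto an intersection of halfspaces and would generally need a quadratic-program solver rather than this closed form.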
Cite
@article{arxiv.2306.03534,
title = {Continual Learning in Linear Classification on Separable Data},
author = {Itay Evron and Edward Moroshko and Gon Buzaglo and Maroun Khriesh and Badea Marjieh and Nathan Srebro and Daniel Soudry},
journal = {arXiv preprint arXiv:2306.03534},
year = {2023}
}