Sequential Function-Space Variational Inference via Gaussian Mixture Approximation
Abstract
Continual learning in neural networks aims to learn new tasks without forgetting old tasks. Sequential function-space variational inference (SFSVI) uses a Gaussian variational distribution to approximate the distribution of the outputs of the neural network corresponding to a finite number of selected inducing points. Since the posterior distribution of a neural network is multi-modal, a Gaussian distribution could only match one mode of the posterior distribution, and a Gaussian mixture distribution could be used to better approximate the posterior distribution. We propose an SFSVI method based on a Gaussian mixture variational distribution. We also compare different types of variational inference methods with a fixed pre-trained feature extractor (where continual learning is performed on the final layer) and without a fixed pre-trained feature extractor (where continual learning is performed on all layers). We find that in terms of final average accuracy, likelihood-focused Gaussian mixture SFSVI outperforms other sequential variational inference methods, especially in the latter case.
Cite
@article{arxiv.2503.07114,
title = {Sequential Function-Space Variational Inference via Gaussian Mixture Approximation},
author = {Menghao Waiyan William Zhu and Pengcheng Hao and Ercan Engin Kuruoğlu},
journal= {arXiv preprint arXiv:2503.07114},
year = {2025}
}