Conjugate-Gradient-like Based Adaptive Moment Estimation Optimization Algorithm for Deep Learning

Jiawu Tian; Liwei Xu; Xiaowei Zhang; Yongqi Li

Conjugate-Gradient-like Based Adaptive Moment Estimation Optimization Algorithm for Deep Learning

Machine Learning 2025-01-09 v4 Artificial Intelligence Computer Vision and Pattern Recognition Optimization and Control

Authors: Jiawu Tian , Liwei Xu , Xiaowei Zhang , Yongqi Li

Abstract

Training deep neural networks is a challenging task. In order to speed up training and enhance the performance of deep neural networks, we rectify the vanilla conjugate gradient as conjugate-gradient-like and incorporate it into the generic Adam, and thus propose a new optimization algorithm named CG-like-Adam for deep learning. Specifically, both the first-order and the second-order moment estimation of generic Adam are replaced by the conjugate-gradient-like. Convergence analysis handles the cases where the exponential moving average coefficient of the first-order moment estimation is constant and the first-order moment estimation is unbiased. Numerical experiments show the superiority of the proposed algorithm based on the CIFAR10/100 dataset.

Keywords

optimization stochastic gradient descent neural network training

Cite

@article{arxiv.2404.01714,
  title  = {Conjugate-Gradient-like Based Adaptive Moment Estimation Optimization Algorithm for Deep Learning},
  author = {Jiawu Tian and Liwei Xu and Xiaowei Zhang and Yongqi Li},
  journal= {arXiv preprint arXiv:2404.01714},
  year   = {2025}
}

Comments

32 pages, 13 figures

Conjugate-Gradient-like Based Adaptive Moment Estimation Optimization Algorithm for Deep Learning

Abstract

Keywords

Cite

Comments

Related papers