Pseudorehearsal in value function approximation

Artificial Intelligence 2017-03-22 v1

Abstract

Catastrophic forgetting is of special importance in reinforcement learning because the data distribution is generally non-stationary. We study and compare several pseudorehearsal approaches for Q-learning with function approximation on a pole-balancing task. We find that pseudorehearsal seems to assist learning even in such a very simple problem, given proper initialization of the rehearsal parameters.
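
As a rough illustration of the general idea, the sketch below combines a one-step Q-learning update with pseudorehearsal: random inputs are labeled with the approximator's current outputs, and those pseudo-items are replayed after each new update so that earlier estimates drift less. It is a minimal sketch under stated assumptions, not the procedure evaluated in the paper: the linear approximator, the uniform sampling range, and the names `predict`, `sgd_step`, `make_pseudo_items`, `q_update_with_rehearsal`, and `n_rehearse` are all hypothetical choices made for illustration.

```python
import random


def predict(weights, features):
    """Linear Q-values, one weight vector per action (an assumed model)."""
    return [sum(w * x for w, x in zip(row, features)) for row in weights]


def sgd_step(weights, features, action, target, lr):
    """Move Q(features, action) one gradient step toward target."""
    error = target - predict(weights, features)[action]
    weights[action] = [
        w + lr * error * x for w, x in zip(weights[action], features)
    ]


def make_pseudo_items(weights, n_items, n_features, rng):
    """Sample random inputs and record the approximator's current outputs."""
    items = []
    for _ in range(n_items):
        x = [rng.uniform(-1.0, 1.0) for _ in range(n_features)]
        items.append((x, predict(weights, x)))
    return items


def q_update_with_rehearsal(weights, transition, pseudo_items,
                            gamma, lr, rng, n_rehearse=4):
    """One Q-learning step followed by rehearsal on stored pseudo-items."""
    features, action, reward, next_features, done = transition

    # Standard one-step Q-learning target.
    bootstrap = 0.0 if done else gamma * max(predict(weights, next_features))
    sgd_step(weights, features, action, reward + bootstrap, lr)

    # Rehearsal: pull outputs on a few pseudo-items back toward the values
    # recorded before the new update was applied.
    k = min(n_rehearse, len(pseudo_items))
    for x, old_q in rng.sample(pseudo_items, k):
        for a, q in enumerate(old_q):
            sgd_step(weights, x, a, q, lr)


if __name__ == "__main__":
    rng = random.Random(0)
    weights = [[0.5, -0.2], [0.1, 0.3]]  # 2 actions, 2 features
    pseudo_items = make_pseudo_items(weights, 8, 2, rng)
    transition = ([1.0, 0.0], 0, 1.0, [0.0, 1.0], False)
    q_update_with_rehearsal(weights, transition, pseudo_items, 0.9, 0.1, rng)
    print(predict(weights, [1.0, 0.0]))
```

How the pseudo-items are sampled, how many are replayed per update, and how often they are regenerated are the kind of rehearsal parameters whose initialization the abstract refers to; the values used here are arbitrary.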

Cite

@article{arxiv.1703.07075,
  title  = {Pseudorehearsal in value function approximation},
  author = {Vladimir Marochko and Leonard Johard and Manuel Mazzara},
  journal= {arXiv preprint arXiv:1703.07075},
  year   = {2017}
}