Pseudorehearsal in value function approximation

Vladimir Marochko; Leonard Johard; Manuel Mazzara

Pseudorehearsal in value function approximation

Artificial Intelligence 2017-03-22 v1

Authors: Vladimir Marochko , Leonard Johard , Manuel Mazzara

Abstract

Catastrophic forgetting is of special importance in reinforcement learning, as the data distribution is generally non-stationary over time. We study and compare several pseudorehearsal approaches for Q-learning with function approximation in a pole balancing task. We have found that pseudorehearsal seems to assist learning even in such very simple problems, given proper initialization of the rehearsal parameters.

Keywords

machine learning theory continual learning reinforcement learning

Cite

@article{arxiv.1703.07075,
  title  = {Pseudorehearsal in value function approximation},
  author = {Vladimir Marochko and Leonard Johard and Manuel Mazzara},
  journal= {arXiv preprint arXiv:1703.07075},
  year   = {2017}
}

Related papers

View all related →

Artificial Intelligence · Computer Science

Pseudorehearsal in actor-critic agents

Marochko Vladimir, Leonard Johard, Manuel Mazzara

2017-04-18

Artificial Intelligence · Computer Science

Pseudorehearsal in actor-critic agents with neural network function approximation

Vladimir Marochko, Leonard Johard, Manuel Mazzara, Luca Longo