English

Time manipulation technique for speeding up reinforcement learning in simulations

Artificial Intelligence 2009-03-31 v1 Machine Learning Robotics

Abstract

A technique for speeding up reinforcement learning algorithms by using time manipulation is proposed. It is applicable to failure-avoidance control problems running in a computer simulation. Turning the time of the simulation backwards on failure events is shown to speed up the learning by 260% and improve the state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.

Keywords

Cite

@article{arxiv.0903.4930,
  title  = {Time manipulation technique for speeding up reinforcement learning in simulations},
  author = {Petar Kormushev and Kohei Nomoto and Fangyan Dong and Kaoru Hirota},
  journal= {arXiv preprint arXiv:0903.4930},
  year   = {2009}
}

Comments

12 pages

R2 v1 2026-06-21T12:45:32.546Z