Time manipulation technique for speeding up reinforcement learning in simulations

Petar Kormushev; Kohei Nomoto; Fangyan Dong; Kaoru Hirota

Artificial Intelligence 2009-03-31 v1 Machine Learning Robotics

Abstract

A technique that uses time manipulation to speed up reinforcement learning algorithms is proposed. It applies to failure-avoidance control problems that run in a computer simulation. Turning the simulation time backwards on failure events is shown to speed up learning by 260% and improve state space exploration by 12% on the cart-pole balancing task, compared to the conventional Q-learning and Actor-Critic algorithms.
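
The idea of turning simulation time backwards on failure can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a hypothetical toy task (a walker on a short line that must not fall off either end) in place of cart-pole, plain tabular Q-learning, and a hypothetical `rewind` parameter that sets how many recent states are kept. On a failure event the simulation is restored to the oldest saved state instead of being restarted from the initial state.

```python
import random
from collections import deque

# Hypothetical toy task (an assumption, not the paper's cart-pole setup):
# a walker on positions 0..N-1 fails when it steps off either end.
N = 11
ACTIONS = (-1, +1)

def step(pos, action, rng):
    """Advance the toy simulation one step; returns (new_pos, failed)."""
    drift = rng.choice((-1, 0, 1))  # random disturbance
    new_pos = pos + action + drift
    failed = new_pos < 0 or new_pos >= N
    return new_pos, failed

def train(steps=5000, rewind=3, alpha=0.5, gamma=0.9, eps=0.1, seed=0):
    """Tabular Q-learning that rewinds the simulation on failure."""
    rng = random.Random(seed)
    q = {(s, a): 0.0 for s in range(N) for a in ACTIONS}
    history = deque(maxlen=rewind)  # the most recent simulation states
    pos = N // 2
    failures = 0
    for _ in range(steps):
        history.append(pos)  # save the state before acting
        # Epsilon-greedy action selection.
        if rng.random() < eps:
            a = rng.choice(ACTIONS)
        else:
            a = max(ACTIONS, key=lambda act: q[(pos, act)])
        new_pos, failed = step(pos, a, rng)
        # Reward is -1 on failure and 0 otherwise (assumed reward scheme).
        if failed:
            target = -1.0
        else:
            target = gamma * max(q[(new_pos, b)] for b in ACTIONS)
        q[(pos, a)] += alpha * (target - q[(pos, a)])
        if failed:
            failures += 1
            pos = history[0]  # turn time back to the oldest saved state
            history.clear()
        else:
            pos = new_pos
    return q, failures
```

Calling `train()` returns the learned Q-table and the number of failure events. Because the rewind puts the agent back near the point of failure, it spends more of its steps in the states where its decisions matter, which is the intuition behind the speed-up the abstract reports. The sketch does not attempt to reproduce the reported figures.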
