Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih; Koray Kavukcuoglu; David Silver; Alex Graves; Ioannis Antonoglou; Daan Wierstra; Martin Riedmiller

Playing Atari with Deep Reinforcement Learning

Machine Learning 2013-12-20 v1

Authors: Volodymyr Mnih , Koray Kavukcuoglu , David Silver , Alex Graves , Ioannis Antonoglou , Daan Wierstra , Martin Riedmiller

View on arXiv ↗ PDF ↗

Abstract

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Keywords

reinforcement learning deep learning game

Cite

@article{arxiv.1312.5602,
  title  = {Playing Atari with Deep Reinforcement Learning},
  author = {Volodymyr Mnih and Koray Kavukcuoglu and David Silver and Alex Graves and Ioannis Antonoglou and Daan Wierstra and Martin Riedmiller},
  journal= {arXiv preprint arXiv:1312.5602},
  year   = {2013}
}

Comments

NIPS Deep Learning Workshop 2013

Playing Atari with Deep Reinforcement Learning

Abstract

Keywords

Cite

Comments

Related papers