Generative Adversarial Imitation Learning

Jonathan Ho; Stefano Ermon

Generative Adversarial Imitation Learning

Machine Learning 2016-06-14 v1 Artificial Intelligence

Authors: Jonathan Ho , Stefano Ermon

Abstract

Consider learning a policy from example expert behavior, without interaction with the expert or access to reinforcement signal. One approach is to recover the expert's cost function with inverse reinforcement learning, then extract a policy from that cost function with reinforcement learning. This approach is indirect and can be slow. We propose a new general framework for directly extracting a policy from data, as if it were obtained by reinforcement learning following inverse reinforcement learning. We show that a certain instantiation of our framework draws an analogy between imitation learning and generative adversarial networks, from which we derive a model-free imitation learning algorithm that obtains significant performance gains over existing model-free methods in imitating complex behaviors in large, high-dimensional environments.

Keywords

reinforcement learning machine learning theory imitation learning

Cite

@article{arxiv.1606.03476,
  title  = {Generative Adversarial Imitation Learning},
  author = {Jonathan Ho and Stefano Ermon},
  journal= {arXiv preprint arXiv:1606.03476},
  year   = {2016}
}

Generative Adversarial Imitation Learning

Abstract

Keywords

Cite

Related papers