Multi-Task Policy Search

Marc Peter Deisenroth; Peter Englert; Jan Peters; Dieter Fox

Multi-Task Policy Search

Machine Learning 2014-02-13 v2 Artificial Intelligence Machine Learning Robotics

Authors: Marc Peter Deisenroth , Peter Englert , Jan Peters , Dieter Fox

Abstract

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for continuous task variations, requiring more principled approaches to share and transfer knowledge among similar tasks. We present a novel approach for learning a nonlinear feedback policy that generalizes across multiple tasks. The key idea is to define a parametrized policy as a function of both the state and the task, which allows learning a single policy that generalizes across multiple known and unknown tasks. Applications of our novel approach to reinforcement and imitation learning in real-robot experiments are shown.

Keywords

multi-task learning online learning decision making under uncertainty

Cite

@article{arxiv.1307.0813,
  title  = {Multi-Task Policy Search},
  author = {Marc Peter Deisenroth and Peter Englert and Jan Peters and Dieter Fox},
  journal= {arXiv preprint arXiv:1307.0813},
  year   = {2014}
}

Comments

8 pages, double column. IEEE International Conference on Robotics and Automation, 2014

Multi-Task Policy Search

Abstract

Keywords

Cite

Comments

Related papers