We study offline multitask representation learning in reinforcement learning (RL), where a learner is provided with an offline dataset from different tasks that share a common representation and is asked to learn the shared representation. We theoretically investigate offline multitask low-rank RL, and propose a new algorithm called MORL for offline multitask representation learning. Furthermore, we examine downstream RL in reward-free, offline and online scenarios, where a new task is introduced to the agent that shares the same representation as the upstream offline tasks. Our theoretical results demonstrate the benefits of using the learned representation from the upstream offline task instead of directly learning the representation of the low-rank model.
@article{arxiv.2403.11574,
title = {Offline Multitask Representation Learning for Reinforcement Learning},
author = {Haque Ishfaq and Thanh Nguyen-Tang and Songtao Feng and Raman Arora and Mengdi Wang and Ming Yin and Doina Precup},
journal= {arXiv preprint arXiv:2403.11574},
year = {2024}
}
Comments
Accepted to 38th Conference on Neural Information Processing Systems (NeurIPS 2024)