Supervised Reward Inference

Will Schwarzer; Jordan Schneider; Philip S. Thomas; Scott Niekum

Supervised Reward Inference

Machine Learning 2025-02-26 v1

Authors: Will Schwarzer , Jordan Schneider , Philip S. Thomas , Scott Niekum

Abstract

Existing approaches to reward inference from behavior typically assume that humans provide demonstrations according to specific models of behavior. However, humans often indicate their goals through a wide range of behaviors, from actions that are suboptimal due to poor planning or execution to behaviors which are intended to communicate goals rather than achieve them. We propose that supervised learning offers a unified framework to infer reward functions from any class of behavior, and show that such an approach is asymptotically Bayes-optimal under mild assumptions. Experiments on simulated robotic manipulation tasks show that our method can efficiently infer rewards from a wide variety of arbitrarily suboptimal demonstrations.

Keywords

machine learning theory imitation learning reinforcement learning from human feedback

Cite

@article{arxiv.2502.18447,
  title  = {Supervised Reward Inference},
  author = {Will Schwarzer and Jordan Schneider and Philip S. Thomas and Scott Niekum},
  journal= {arXiv preprint arXiv:2502.18447},
  year   = {2025}
}

Comments

16 pages, 4 figures

Supervised Reward Inference

Abstract

Keywords

Cite

Comments

Related papers