Investigating Compounding Prediction Errors in Learned Dynamics Models

Nathan Lambert; Kristofer Pister; Roberto Calandra

Investigating Compounding Prediction Errors in Learned Dynamics Models

Machine Learning 2022-03-21 v1

Authors: Nathan Lambert , Kristofer Pister , Roberto Calandra

Abstract

Accurately predicting the consequences of agents' actions is a key prerequisite for planning in robotic control. Model-based reinforcement learning (MBRL) is one paradigm which relies on the iterative learning and prediction of state-action transitions to solve a task. Deep MBRL has become a popular candidate, using a neural network to learn a dynamics model that predicts with each pass from high-dimensional states to actions. These "one-step" predictions are known to become inaccurate over longer horizons of composed prediction - called the compounding error problem. Given the prevalence of the compounding error problem in MBRL and related fields of data-driven control, we set out to understand the properties of and conditions causing these long-horizon errors. In this paper, we explore the effects of subcomponents of a control problem on long term prediction error: including choosing a system, collecting data, and training a model. These detailed quantitative studies on simulated and real-world data show that the underlying dynamics of a system are the strongest factor determining the shape and magnitude of prediction error. Given a clearer understanding of compounding prediction error, researchers can implement new types of models beyond "one-step" that are more useful for control.

Keywords

multi-view learning machine learning reinforcement learning

Cite

@article{arxiv.2203.09637,
  title  = {Investigating Compounding Prediction Errors in Learned Dynamics Models},
  author = {Nathan Lambert and Kristofer Pister and Roberto Calandra},
  journal= {arXiv preprint arXiv:2203.09637},
  year   = {2022}
}

Comments

25 pages, 19 figures

Investigating Compounding Prediction Errors in Learned Dynamics Models

Abstract

Keywords

Cite

Comments

Related papers