A Concentration Bound for LSPE($\lambda$)

Siddharth Chandak; Vivek S. Borkar; Harsh Dolhare

A Concentration Bound for LSPE($\lambda$)

Machine Learning 2022-12-01 v5 Systems and Control Systems and Control

Authors: Siddharth Chandak , Vivek S. Borkar , Harsh Dolhare

Abstract

The popular LSPE( $\lambda$ ) algorithm for policy evaluation is revisited to derive a concentration bound that gives high probability performance guarantees from some time on.

A Concentration Bound for LSPE($\lambda$)

Abstract

Cite

Comments

Related papers