Reinforcement Learning for the Unit Commitment Problem

Gal Dalal; Shie Mannor

doi:10.1109/PTC.2015.7232646

Reinforcement Learning for the Unit Commitment Problem

Artificial Intelligence 2016-11-17 v1

Authors: Gal Dalal , Shie Mannor

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

In this work we solve the day-ahead unit commitment (UC) problem, by formulating it as a Markov decision process (MDP) and finding a low-cost policy for generation scheduling. We present two reinforcement learning algorithms, and devise a third one. We compare our results to previous work that uses simulated annealing (SA), and show a 27% improvement in operation costs, with running time of 2.5 minutes (compared to 2.5 hours of existing state-of-the-art).

Reinforcement Learning for the Unit Commitment Problem

Abstract

Keywords

Cite

Comments

Related papers