English

Simulating multi-exit evacuation using deep reinforcement learning

Machine Learning 2020-07-14 v1 Machine Learning

Abstract

Conventional simulations on multi-exit indoor evacuation focus primarily on how to determine a reasonable exit based on numerous factors in a changing environment. Results commonly include some congested and other under-utilized exits, especially with massive pedestrians. We propose a multi-exit evacuation simulation based on Deep Reinforcement Learning (DRL), referred to as the MultiExit-DRL, which involves in a Deep Neural Network (DNN) framework to facilitate state-to-action mapping. The DNN framework applies Rainbow Deep Q-Network (DQN), a DRL algorithm that integrates several advanced DQN methods, to improve data utilization and algorithm stability, and further divides the action space into eight isometric directions for possible pedestrian choices. We compare MultiExit-DRL with two conventional multi-exit evacuation simulation models in three separate scenarios: 1) varying pedestrian distribution ratios, 2) varying exit width ratios, and 3) varying open schedules for an exit. The results show that MultiExit-DRL presents great learning efficiency while reducing the total number of evacuation frames in all designed experiments. In addition, the integration of DRL allows pedestrians to explore other potential exits and helps determine optimal directions, leading to the high efficiency of exit utilization.

Keywords

Cite

@article{arxiv.2007.05783,
  title  = {Simulating multi-exit evacuation using deep reinforcement learning},
  author = {Dong Xu and Xiao Huang and Joseph Mango and Xiang Li and Zhenlong Li},
  journal= {arXiv preprint arXiv:2007.05783},
  year   = {2020}
}

Comments

25 pages, 5 figures, submitted to Transactions in GIS

R2 v1 2026-06-23T17:02:35.406Z