Graphics Processing Unit, or GPUs, have been successfully adopted both for graphic computation in 3D applications, and for general purpose application (GP-GPUs), thank to their tremendous performance-per-watt. Recently, there is a big interest in adopting them also within automotive and avionic industrial settings, imposing for the first time real-time constraints on the design of such devices. Unfortunately, it is extremely hard to extract timing guarantees from modern GPU designs, and current approaches rely on a model where the GPU is treated as a unique monolithic execution device. Unlike state-of-the-art of research, we try to open the box of modern GPU architectures, providing a clean way to exploit intra-GPU predictable execution.
@article{arxiv.2310.01212,
title = {Enabling predictable parallelism in single-GPU systems with persistent CUDA threads},
author = {Paolo Burgio},
journal= {arXiv preprint arXiv:2310.01212},
year = {2023}
}
Comments
3 pages, 2 figures, accepted at the 27th Euromicro Conference on Real-Time Systems as Work-in-Progress