Generative Adversarial Networks (GAN) are cutting-edge algorithms for generating new data samples based on the learned data distribution. However, its performance comes at a significant cost in terms of computation and memory requirements. In this paper, we proposed an HW/SW co-design approach for training quantized deconvolution GAN (QDCGAN) implemented on FPGA using a scalable streaming dataflow architecture capable of achieving higher throughput versus resource utilization trade-off. The developed accelerator is based on an efficient deconvolution engine that offers high parallelism with respect to scaling factors for GAN-based edge computing. Furthermore, various precisions, datasets, and network scalability were analyzed for low-power inference on resource-constrained platforms. Lastly, an end-to-end open-source framework is provided for training, implementation, state-space exploration, and scaling the inference using Vivado high-level synthesis for Xilinx SoC-FPGAs, and a comparison testbed with Jetson Nano.
@article{arxiv.2201.06878,
title = {Hardware-Efficient Deconvolution-Based GAN for Edge Computing},
author = {Azzam Alhussain and Mingjie Lin},
journal= {arXiv preprint arXiv:2201.06878},
year = {2022}
}
Comments
To be presented in the 56th Annual Conference on Information Sciences and System (CISS), 9-11 March 2022, and published in the IEEE Xplore Digital library