English

Blasting through lattice calculations using CUDA

High Energy Physics - Lattice 2009-01-22 v1

Abstract

Modern graphics hardware is designed for highly parallel numerical tasks and provides significant cost and performance benefits. Graphics hardware vendors are now making available development tools to support general purpose high performance computing. Nvidia's CUDA platform, in particular, offers direct access to graphics hardware through a programming language similar to C. Using the CUDA platform we have implemented a Wilson-Dirac operator which runs at an effective 68 Gflops on the Tesla C870. The recently released GeForce GTX 280 runs this same code at 92 Gflops, and we expect further improvement pending code optimization.

Keywords

Cite

@article{arxiv.0810.5365,
  title  = {Blasting through lattice calculations using CUDA},
  author = {Kipton Barros and Ronald Babich and Richard Brower and Michael A. Clark and Claudio Rebbi},
  journal= {arXiv preprint arXiv:0810.5365},
  year   = {2009}
}

Comments

7 pages, 3 figures, presented at the XXVI International Symposium on Lattice Field Theory (Lattice 2008), Williamsburg, Virginia, July 14-19, 2008

R2 v1 2026-06-21T11:36:21.810Z