English

PBBFMM3D: a parallel black-box algorithm for kernel matrix-vector multiplication

Mathematical Software 2021-04-30 v3

Abstract

Kernel matrix-vector product is ubiquitous in many science and engineering applications. However, a naive method requires O(N2)O(N^2) operations, which becomes prohibitive for large-scale problems. We introduce a parallel method that provably requires O(N)O(N) operations to reduce the computation cost. The distinct feature of our method is that it requires only the ability to evaluate the kernel function, offering a black-box interface to users. Our parallel approach targets multi-core shared-memory machines and is implemented using OpenMP. Numerical results demonstrate up to 19×19\times speedup on 32 cores. We also present a real-world application in geostatistics, where our parallel method was used to deliver fast principle component analysis of covariance matrices.

Keywords

Cite

@article{arxiv.1903.02153,
  title  = {PBBFMM3D: a parallel black-box algorithm for kernel matrix-vector multiplication},
  author = {Ruoxi Wang and Chao Chen and Jonghyun Lee and Eric Darve},
  journal= {arXiv preprint arXiv:1903.02153},
  year   = {2021}
}