AccFFT: A library for distributed-memory FFT on CPU and GPU architectures

A Gholami, J Hill, D Malhotra, G Biros - arXiv preprint arXiv:1506.07933, 2015 - arxiv.org
We present a new library for parallel distributed Fast Fourier Transforms (FFT). The
importance of FFT in science and engineering and the advances in high performance
computing necessitate further improvements. AccFFT extends existing FFT libraries for
CUDA-enabled Graphics Processing Units (GPUs) to distributed memory clusters. We use
overlapping communication method to reduce the overhead of PCIe transfers from/to GPU.
We present numerical results on the Maverick platform at the Texas Advanced Computing …

[CITATION][C] Accfft: a library for distributed-memory FFT on CPU and GPU architectures. CoRR abs/1506.07933 (2015)

A Gholami, J Hill, D Malhotra, G Biros - arXiv preprint arXiv:1506.07933, 2015
Showing the best results for this search. See all results