This article is cited in 1 scientific paper (total in 1 paper)
Hardware, software and distributed supercomputer systems
Effective computation of two-dimensional FFT on a homogeneous or heterogeneous cluster
D. Yu. Knyazkov, Ishlinsky Institute for Problems in Mechanics of the Russian Academy of Sciences
Abstract:
The paper considers the computation of the two-dimensional FFT on a supercomputer. It investigates the dependence of FFT computation time on matrix size for the MVS-100K, MVS-10P and HybriLIT supercomputers. A method of CPU-GPU load balancing for a heterogeneous cluster is proposed. For a Tesla K40 card, it is shown that the two-dimensional FFT computation time is almost equal to the data transfer time. The computation itself is 48 times faster on the GPU than on a two-processor node. (In Russian).
Key words and phrases:
HPC, supercomputer computations, fast Fourier transform, FFT, GPU computations.
Citation:
D. Yu. Knyazkov, “Effective computation of two-dimensional FFT on a homogeneous or heterogeneous cluster”, Program Systems: Theory and Applications, 8:1 (2017), 47–62
Linking options:
https://www.mathnet.ru/eng/ps248 https://www.mathnet.ru/eng/ps/v8/i1/p47