A. V. Zakirov, V. D. Levchenko, A. Yu. Perepelkina, Yasunari Zempo, “High performance FDTD code implementation for GPGPU supercomputers”, Keldysh Institute preprints, 2016, 044, 22 pp.

Preprints of the Keldysh Institute of Applied Mathematics

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Keldysh Institute preprints:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Preprints of the Keldysh Institute of Applied Mathematics, 2016, 044, 22 pp.
DOI: https://doi.org/10.20948/prepr-2016-44-e (Mi ipmp2120)

This article is cited in 5 scientific papers (total in 5 papers)

High performance FDTD code implementation for GPGPU supercomputers

A. V. Zakirov, V. D. Levchenko, A. Yu. Perepelkina, Yasunari Zempo

Full-text PDF (771 kB) Citations (5)

References:

PDF

HTML

DOI: https://doi.org/10.20948/prepr-2016-44-e

Abstract: An implementation of FDTD (Finite Difference Time Domain) method for solution of optical and other electrodynamic problems of high computational cost is described. The implementation is based on LRnLA (Locally Recursive non-Locally Asynchronous) algorithm DiamondTorre, which is developed specifically for GPGPU (General Purpose Graphical Processing Unit) hardware. The specifics of the DiamondTorre algorithms for staggered grid (Yee cell) and many-GPU devices are shown. The algorithm is implemented in software for real physics calculation with the use of CUDA, OpenMP, MPI technologies. The software performance limits are estimated through algorithms parameters and computer model of TSUBAME2.5. The real performance is tested on one GPU device, as well as on many-GPU cluster with strong and weak scaling tests. The performance of up to $0.65\cdot10^{12}$ cell updates per second for 3D domain with $0.3\cdot10^{12}$ Yee cells total is achieved.

Funding agency	Grant number
Russian Foundation for Basic Research	14-01-31483_мол_а
The work is supported by Hosei International Fellowship grant, RFBR grant no. 14-01-31483.

Document Type: Preprint

UDC: 519.688

Language: English

Citation: A. V. Zakirov, V. D. Levchenko, A. Yu. Perepelkina, Yasunari Zempo, “High performance FDTD code implementation for GPGPU supercomputers”, Keldysh Institute preprints, 2016, 044, 22 pp.

Citation in format AMSBIB

\Bibitem{ZakLevPer16}

\by A.~V.~Zakirov, V.~D.~Levchenko, A.~Yu.~Perepelkina, Yasunari~Zempo

\paper High performance FDTD code implementation for GPGPU supercomputers

\jour Keldysh Institute preprints

\yr 2016

\papernumber 044

\totalpages 22

\mathnet{http://mi.mathnet.ru/ipmp2120}

\crossref{https://doi.org/10.20948/prepr-2016-44-e}

Linking options:

https://www.mathnet.ru/eng/ipmp2120

https://www.mathnet.ru/eng/ipmp/y2016/p44

This publication is cited in the following 5 articles:

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Препринты Института прикладной математики им. М. В. Келдыша РАН

Statistics & downloads:
Abstract page:	294
Full-text PDF :	252
References:	27

Что такое QR-код?

Registration to the website

Logotypes