V. A. Platonov, A. V. Monakov, “Overlapping communications and computations in GPU-based iterative linear solvers”, Proceedings of ISP RAS, 28:1 (2016), 81

Loading [MathJax]/jax/output/SVG/config.js

Proceedings of the Institute for System Programming of the RAS

RUS ENG

JOURNALS PEOPLE ORGANISATIONS CONFERENCES SEMINARS VIDEO LIBRARY PACKAGE AMSBIB

JavaScript is disabled in your browser. Please switch it on to enable full functionality of the website

	General information
	Latest issue
	Archive

	Search papers
	Search references

	RSS
	Latest issue
	Current issues
	Archive issues
	What is RSS

Proceedings of ISP RAS:
Year:
Volume:
Issue:
Page:
	Find

Personal entry:
Login:
Password:
	Save password
	Enter
	Forgotten password?
	Register

Proceedings of the Institute for System Programming of the RAS, 2016, Volume 28, Issue 1, Pages 81–92
DOI: https://doi.org/10.15514/ISPRAS-2016-28(1)-5 (Mi tisp5)

Overlapping communications and computations in GPU-based iterative linear solvers

V. A. Platonov, A. V. Monakov

Institute for System Programming of the Russian Academy of Sciences RAS, 25 Alexander Solzhenitsyn Str., Moscow, 109004, Russian Federation

Full-text PDF (253 kB)

References:

PDF

HTML

DOI: https://doi.org/10.15514/ISPRAS-2016-28(1)-5

Abstract: Krylov subspace methods such as Conjugate Gradient and Biconjugate Gradient Stabilized methods are well known approaches for solving symmetric and asymmetric systems of linear algebraic equations, such as systems usually arising from partial differential equations in computational mathematics problems, like Navier-Stokes equations in fluid dynamics. With increasing sizes of meshes and numbers of computational nodes, network communications time may become an issue: stalls during global reductions have increasing duration, preventing useful computations. This happens because, in original formulations of methods, computing a dot product requires a global reduce operation, and its value is required on the next step, so each process has to stop until all others reach this point, like in a barrier synchronization. We research alternative formulations of conjugate gradient methods (Preconditioned Conjugate Gradient and BiCGStab) for GPU-based iterative linear system solvers. They allow to have an overlap of parallel computations and communications, at the cost of increased amount of computations and memory requirements. We describe an implementation of our approach for GPU-accelerated hybrid systems in OpenFOAM, an open source framework for computational fluid dynamics. Asynchronous collective communications from MPI-3 parallel programming API are used to avoid full barrier synchronization and reduce latency. Experimental results on 2 and 4 million cases from standard OpenFOAM problems are presented.

Keywords: OpenFOAM, GPU, MPI, conjugate gradient method, bicgstab method, AINV preconditioning.

Funding agency	Grant number
Russian Foundation for Basic Research	13-07-12102
The paper is supported by RFBR grant 13-07-12102

Bibliographic databases:

Document Type: Article

Language: Russian

Citation: V. A. Platonov, A. V. Monakov, “Overlapping communications and computations in GPU-based iterative linear solvers”, Proceedings of ISP RAS, 28:1 (2016), 81–92

Citation in format AMSBIB

\Bibitem{PlaMon16}

\by V.~A.~Platonov, A.~V.~Monakov

\paper Overlapping communications and computations in GPU-based iterative linear solvers

\jour Proceedings of ISP RAS

\yr 2016

\vol 28

\issue 1

\pages 81--92

\mathnet{http://mi.mathnet.ru/tisp5}

\crossref{https://doi.org/10.15514/ISPRAS-2016-28(1)-5}

\elib{https://elibrary.ru/item.asp?id=26166310}

Linking options:

https://www.mathnet.ru/eng/tisp5

https://www.mathnet.ru/eng/tisp/v28/i1/p81

Citing articles in Google Scholar: Russian citations, English citations
Related articles in Google Scholar: Russian articles, English articles

Proceedings of the Institute for System Programming of the RAS

Statistics & downloads:
Abstract page:	196
Full-text PDF :	68
References:	37

Registration to the website

Logotypes