|
Upravlenie Bol'shimi Sistemami, 2017, Issue 70, Pages 58–86
(Mi ubs935)
|
|
|
|
Network-based models in Control
On topological fault-tolerance of scalable computing systems
V. A. Melent'ev Rzhanov Institute of Semiconductor Physics Siberian Branch of RAS, Novosibirsk
Abstract:
Problems of the analysis of topological fault tolerance of the scalable computing system and ensuring its sustainability to fault of the given multiplicity are considered. The measure of topological fault tolerance is offered, which connects the computing system topology with its potential parallelism for the given fault multiplicity. The relationship between the functions of topological scalability and topological fault tolerance is defined. The dependence of the minimum of a topological fault tolerance by the girth of the system graph is shown. Model of parallel computings, and functions of the topological fault tolerance and scalability are adapted to the existence of unique nodes in information topology of the solved task. A method for configuring fault-tolerant subsystems for a deficient topological fault tolerance of a computing system is proposed, while providing the preassigned fault multiplicity for the solved task is achieved by duplicating subsystems which are configured for less, than the preassigned, fault multiplicity.
Keywords:
scalable computing systems, their topological fault-tolerance.
Received: September 20, 2016 Published: November 30, 2017
Citation:
V. A. Melent'ev, “On topological fault-tolerance of scalable computing systems”, UBS, 70 (2017), 58–86
Linking options:
https://www.mathnet.ru/eng/ubs935 https://www.mathnet.ru/eng/ubs/v70/p58
|
|