Baidu
map

Scalable, fault-tolerant job step management for high-performance systems

Solt, D; Hursey, J; Lauria, A; Guo, D; Guo, X

Solt, D (corresponding author), IBM Corp, Dallas, TX 75019 USA.

IBM JOURNAL OF RESEARCH AND DEVELOPMENT, 2020; 64 (3-4):

Abstract

Scientific applications on the CORAL systems demanded a fault-tolerant, scalable job launch infrastructure for complex workflows with multiple job ste......

Full Text Link


Baidu
map
Baidu
map
Baidu
map