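Chinese Comprehensive Science and Technology Core Journal (Peking University Core Journals list)

Source journal of the Chinese Science Citation Database (CSCD)

Indexed by Chemical Abstracts (CA), USA

Indexed by Mathematical Reviews (MR), USA

Indexed by Referativnyi Zhurnal (Abstract Journal), Russia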

Issue 5
Nov.  2014
ZHANG Xin-Zhou, ZHOU Min-Qi. Fault tolerance recovery techniques in large distributed parallel computing system[J]. Journal of East China Normal University (Natural Sciences), 2014, (5): 207-215. doi: 10.3969/j.issn.1000-5641.2014.05.018

Fault tolerance recovery techniques in large distributed parallel computing system

doi: 10.3969/j.issn.1000-5641.2014.05.018
  • Publish Date: 2014-09-25
  • Supercomputing systems today often take the form of large numbers of commodity machines linked together into a computing cluster. Like any distributed system, such a cluster can have a large number of independent hardware components cooperating on a single computation. Unfortunately, any of these components can fail at any time, potentially producing erroneous output. To improve the robustness of supercomputing applications in the presence of failures, many techniques have been developed to provide resilience to these kinds of system faults. This survey provides an overview of these fault tolerance techniques.


