TY - JOUR
T1 - Probabilistic diagnosis of clustered faults for hypercube-based multiprocessor system
AU - Lv, Mengjie
AU - Zhou, Shuming
AU - Sun, Xueli
AU - Lian, Guanqin
AU - Liu, Jiafei
AU - Wang, Dajin
N1 - Publisher Copyright:
© 2019 Elsevier B.V.
PY - 2019/11/12
Y1 - 2019/11/12
N2 - As the sizes of multiprocessor systems grow, chances of processors becoming faulty increase, making it an important issue to diagnose faulty nodes in the system. Different models have been proposed and studied. One proposed by Huang et al. employed a probabilistic fault model to determine the status of a cluster of nodes in rectangular grid structures [8]. Later, Tang et al. extended the diagnosis algorithm to more general, regular topologies, in which any pair of adjacent nodes has no common neighbors [25]. In a recent work by Lu et al., the algorithm was further extended to regular topologies where any pair of adjacent nodes has a certain number of common neighbors [18]. In this paper, we extend the threshold to apply the probabilistic diagnosis algorithm for hypercube-based multiprocessor system, and carry out an analysis on the algorithm's effectiveness. The analysis shows a very high rate of correct diagnosis, both for an individual node and for nodes as a whole. Although the analysis is done for a particular regular network (the hypercube), the outcome can serve as a useful reference, and can shed light on the effectiveness of the probabilistic diagnosis for a large group of triangle-free multiprocessor systems.
AB - As the sizes of multiprocessor systems grow, chances of processors becoming faulty increase, making it an important issue to diagnose faulty nodes in the system. Different models have been proposed and studied. One proposed by Huang et al. employed a probabilistic fault model to determine the status of a cluster of nodes in rectangular grid structures [8]. Later, Tang et al. extended the diagnosis algorithm to more general, regular topologies, in which any pair of adjacent nodes has no common neighbors [25]. In a recent work by Lu et al., the algorithm was further extended to regular topologies where any pair of adjacent nodes has a certain number of common neighbors [18]. In this paper, we extend the threshold to apply the probabilistic diagnosis algorithm for hypercube-based multiprocessor system, and carry out an analysis on the algorithm's effectiveness. The analysis shows a very high rate of correct diagnosis, both for an individual node and for nodes as a whole. Although the analysis is done for a particular regular network (the hypercube), the outcome can serve as a useful reference, and can shed light on the effectiveness of the probabilistic diagnosis for a large group of triangle-free multiprocessor systems.
KW - Clustered faults
KW - Diagnosis algorithm
KW - Fault tolerance
KW - Hypercube
KW - Probabilistic diagnosis model
KW - Reliability
UR - http://www.scopus.com/inward/record.url?scp=85071286520&partnerID=8YFLogxK
U2 - 10.1016/j.tcs.2019.06.023
DO - 10.1016/j.tcs.2019.06.023
M3 - Article
AN - SCOPUS:85071286520
SN - 0304-3975
VL - 793
SP - 113
EP - 131
JO - Theoretical Computer Science
JF - Theoretical Computer Science
ER -