Skip to main navigation Skip to search Skip to main content

Intermittent Fault Diagnosis of Data Center Network CSDC Under Probabilistic Fault Model

  • Limei Lin
  • , Yanze Huang
  • , Xiaoding Wang
  • , Dajin Wang
  • , Sun Yuan Hsieh
  • , Jie Wu

Research output: Contribution to journalArticlepeer-review

Abstract

As the core infrastructures in the information systems, the data center networks carry a large number of tasks of data processing and storage. In a data center network, intermittent faults are often difficult to be found and dealt with in time because of their hiddenness and uncertainty. Once these faults accumulate to a certain extent, they can cause serious network outages and even lead to the collapse of the entire data center. In order to discover and resolve these potential problems in a timely manner, it is crucial to apply intermittent fault diagnosis, thus ensures the continuous and stable operation of the data center. In this paper, we propose the intermittent fault diagnosability tPMCI(Cn) for an n -dimensional data center network CSDC under the Preparata/Metze/Chien model (PMC model) by establishing the fault tolerance of the network. Additionally, under the PMC model, we propose a probabilistic multiple intermittent fault diagnosis algorithm (PMIFDPMC) with time complexity O(nN) by preferentially generating weighted multiple test networks (GWMTN) where N is the scale of CSDC. Moreover, we apply the algorithm PMIFDPMC to a 7-dimensional CSDC and a real-world dataset of the Internet of Things. Across different scenarios of intermittent fault nodes, we calculate the Accuracy, Recall, FNR, G-mean, and F1-score using various testing iterations. The experimental results demonstrate that, as the number of testing iterations of algorithm PMIFDPMC increases, the quantity of intermittent fault nodes that are correctly diagnosed also increases. This highlights the favorable performance and effectiveness of algorithm PMIFDPMC on the real-world dataset of the Internet of Things.

Original languageEnglish
Pages (from-to)2347-2361
Number of pages15
JournalIEEE Transactions on Networking
Volume34
DOIs
StatePublished - 2026

Keywords

  • data center network
  • Fault diagnosis
  • PMC model
  • reliability

Fingerprint

Dive into the research topics of 'Intermittent Fault Diagnosis of Data Center Network CSDC Under Probabilistic Fault Model'. Together they form a unique fingerprint.

Cite this