Distributed probabilistic fault diagnosis for multiprocessor systems

Piotr Berman, Andrzej Pelc

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    36 Citations (Scopus)

    Abstract

    A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability of correctness also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approaches, the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.
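To make the link-count scale in the abstract concrete, the following minimal sketch counts the links of a hypercube, the one topology the abstract names explicitly. A d-dimensional hypercube on n = 2^d units has d·2^(d-1) = (n log2 n)/2 links, so it sits at the n log n order of magnitude discussed above. The function names `hypercube_links` and `link_count_table` are hypothetical helpers introduced here for illustration; this is not the authors' construction or their diagnosis algorithm.

```python
import math


def hypercube_links(d):
    """Return the set of undirected links of a d-dimensional hypercube (2**d units)."""
    n = 1 << d
    links = set()
    for u in range(n):
        for bit in range(d):
            v = u ^ (1 << bit)  # neighbour differs from u in exactly one bit
            links.add((min(u, v), max(u, v)))  # store each link once
    return links


def link_count_table(max_d):
    """For d = 1..max_d, return rows (n, links, links / (n * log2 n))."""
    rows = []
    for d in range(1, max_d + 1):
        n = 1 << d
        m = len(hypercube_links(d))
        rows.append((n, m, m / (n * math.log2(n))))
    return rows


if __name__ == "__main__":
    for n, m, ratio in link_count_table(8):
        print(f"n={n:4d}  links={m:5d}  links/(n log2 n)={ratio:.2f}")
```

The ratio in the last column stays at one half for every dimension, which is only an illustration of the growth rate; the constant k in the abstract's lower bound is not specified there and is not estimated by this sketch.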

    Original language: English (US)
    Title of host publication: Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)
    Publisher: Publ by IEEE
    Pages: 340-346
    Number of pages: 7
    ISBN (Print): 081862051X
    State: Published - 1990
    Event: 20th International Symposium on Fault-Tolerant Computing - FTCS 20 - Chapel Hill, NC, USA
    Duration: Jun 26 1990 to Jun 28 1990

    Fingerprint

    Failure analysis
    Monitoring

    All Science Journal Classification (ASJC) codes

    • Hardware and Architecture

    Cite this

    Berman, P., & Pelc, A. (1990). Distributed probabilistic fault diagnosis for multiprocessor systems. In Digest of Papers - FTCS (Fault-Tolerant Computing Symposium) (pp. 340-346). Publ by IEEE.
    Berman, Piotr ; Pelc, Andrzej. / Distributed probabilistic fault diagnosis for multiprocessor systems. Digest of Papers - FTCS (Fault-Tolerant Computing Symposium). Publ by IEEE, 1990. pp. 340-346
    @inproceedings{6d09ac785b5443139c7fb2e2f71fe249,
    title = "Distributed probabilistic fault diagnosis for multiprocessor systems",
    abstract = "A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.",
    author = "Piotr Berman and Andrzej Pelc",
    year = "1990",
    language = "English (US)",
    isbn = "081862051X",
    pages = "340--346",
    booktitle = "Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)",
    publisher = "Publ by IEEE",

    }

    Berman, P & Pelc, A 1990, Distributed probabilistic fault diagnosis for multiprocessor systems. in Digest of Papers - FTCS (Fault-Tolerant Computing Symposium). Publ by IEEE, pp. 340-346, 20th International Symposium on Fault-Tolerant Computing - FTCS 20, Chapel Hill, NC, USA, 6/26/90.

    TY - GEN

    T1 - Distributed probabilistic fault diagnosis for multiprocessor systems

    AU - Berman, Piotr

    AU - Pelc, Andrzej

    PY - 1990

    Y1 - 1990

    AB - A class of n-unit multiprocessor systems with O(n log n) interconnecting links is constructed, and a distributed probabilistic fault diagnosis algorithm whose probability of correctness converges to 1 as n → ∞ is proposed. For small probability of unit failure, a distributed diagnosis whose probability also converges to 1 as the size of the system grows is proposed for the hypercube. On the other hand, it is proved that if a class of systems has fewer than kn log n links for a small constant k, the probability of correctness of every fault diagnosis converges to 0 as n → ∞. By combining the probabilistic and the distributed approach the authors' model of fault diagnosis removes the major drawbacks of the PMC (Preparata-Metze-Chien) model: the assumption of tests with complete fault coverage and the assumption of a fault-free central monitoring unit capable of performing diagnosis.

    UR - http://www.scopus.com/inward/record.url?scp=0025665993&partnerID=8YFLogxK

    UR - http://www.scopus.com/inward/citedby.url?scp=0025665993&partnerID=8YFLogxK

    M3 - Conference contribution

    SN - 081862051X

    SP - 340

    EP - 346

    BT - Digest of Papers - FTCS (Fault-Tolerant Computing Symposium)

    PB - Publ by IEEE

    ER -

    Berman P, Pelc A. Distributed probabilistic fault diagnosis for multiprocessor systems. In Digest of Papers - FTCS (Fault-Tolerant Computing Symposium). Publ by IEEE. 1990. p. 340-346