Abstract
In this work a new, cellular, local diagnostic procedure for a class of massively parallel systems with a regular topology is reported. The fault model is proposed to be suited for a given realistic system therefore production and run-time failures are assumed. Appropriate cluster and random faults are possible; additionally, permanent and/or intermittent faults are permitted. The system architecture is proposed to be a regular network with low network connectivity, a high number of intelligent nodes, and with no passive hardware redundancy. The diagnostic procedure is organized in parallel communication rounds, and is the same for all system units.
Preview
Unable to display preview. Download preview PDF.
V. References
D.Fussel, P.Varman, "Fault-Tolerant Wafer-Scale Architecture for VLSI," Proc. 9th Annu. Symp. on Computer Architecture, April 1982, pp. 190–198.
R.Trobec, "A Local Distributed Diagnosis," Technical Report Jozef Stefan Institute, IJS-1432, December 1986.
R.C. Russell, I. Catt, "Wafer-Scale Integration — A Fault-Tolerant Procedure," IEEE Journal of Solid-State Circuits, Vol.SC-13, No.3, June 1978, pp. 339–344.
I. Koren, D.K. Pradhan, "Yield and Performance Enhancement Through Redundancy in VLSI and WSI Multiprocessor Systems," Proceeding of the IEEE, Vol.74, No.5, May 1986, pp. 699–711.
J.G.Kuhl, S.M.Reddy, "Distributed Fault-Tolerance for Large Multiprocessor System," Proc. 7th Annu. Symp. Comput. Arch., May 1980, pp. 23–30.
F.J.Meyer, D.K.Pradhan, "Dynamic Testing Strategy for Distributed System," Proc. of the 15th Inter. Symp. on Fault-Tolerant Computing Systems, June 1985, pp. 84–90.
P.Banerjee, J.A.Abraham, "Fault-Secure Algorithms for Multiple-Processors Systems," Proc. of the Inter. Conf. on Computer Architecture, June 1984, pp. 147–154.
F.R.K.Chung, F.T.Leighton, A.L.Rosenberg, "Diogenes: A Methodology for Designing Fault-Tolerant VLSI Processor Array," Proc. 13th Inter. Symp. on Fault-Tolerant Computing, 1983, pp. 26–32.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1989 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trobec, R. (1989). Cellular diagnostic in parallel systems. In: Wolf, G., Legendi, T., Schendel, U. (eds) Parcella '88. Parcella 1988. Lecture Notes in Computer Science, vol 342. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-50647-0_130
Download citation
DOI: https://doi.org/10.1007/3-540-50647-0_130
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-50647-8
Online ISBN: 978-3-540-46062-6
eBook Packages: Springer Book Archive