Skip to main content

A Tool for Measuring and Monitoring Distributed Systems During Operation

  • Conference paper

Part of the book series: Informatik-Fachberichte ((INFORMATIK,volume 168))

Abstract

Distributed systems demonstrate characteristics that will provide benefits, such as reliability, availability, ease of modularity, incremental growth, configuration flexibility, and high system performance. However, the increased activity in distributed systems have yielded a substantial lack in methods and tools for monitoring and measuring these systems to verify these features.

INCAS (Incremental Architecture for Distributed Systems) funded by the DFG as part of the Sonderforschungsbereich 124 “VLSI Design and Parallel Architectures”

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   54.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. B. Bates, J.C. Wileden: High-level Debugging of Distributed Systems: The Behavioral Abstraction Approach; The Journal of Systems and Software; pp.255–264; Mar. 1983.

    Google Scholar 

  2. H.K. Berg, M.G. Smith: A Distributed System Experimentation Facility; Proc. of the 3rd International Conf. on Distributed Computing Systems;pp.324–329; Oct. 1982.

    Google Scholar 

  3. K.M. Chandy, L. Lamport: Distributed Snapshots: Determining Global States of Distributed Systems; ACM Trans. Computer Systems 3(1); pp.63–75; 1985.

    Article  Google Scholar 

  4. R. Curtis, L. Wittie: BUGNET: A Debugging System for Parallel Programming Environments; IEEE Distributed Computer Systems Conference; pp.394–399; 1982.

    Google Scholar 

  5. D. Ferrari: Architecture and Instrumentation in a Modular Interactive System; Computer 6(11); pp. 25–29; Nov. 1973.

    Google Scholar 

  6. D. Ferrari: Computer Systems Performance Evaluation; Prentice-Hall, Inc., Englewood Cliffs, New Jersey; 1978.

    Google Scholar 

  7. D. Ferrari, V. Minetti: A Hybrid Measurement Tool for Minicomputers; Experimental Computer Performance and Evaluation; D. Ferrari and M. Spadoni (Eds); North-Holland Publishing Company, 1981.

    Google Scholar 

  8. D. Ferrari, G. Serazzi, A. Zeigner: Measurement and Tuning of Computer Systems; Prentice-Hall, Inc., Englewood Cliffs, New Jersey; 1983.

    Google Scholar 

  9. H. Fromm et al.: Experiences with Performance Measurement and Modeling of a Processor Array; IEEE Trans. Computers; 32(1); pp.15–31; Jan. 1983.

    Article  Google Scholar 

  10. H. Garcia-Molina et al.: Debugging a Distributed System; IEEE Trans. on Software Eng., Vol. SE-10, No.2; pp. 210–219; Mar. 1984.

    Article  Google Scholar 

  11. W. Gora, U. Herzog, S.K. Tripathi: Clock Synchronization on the Factory Floor; in: R. Rosenthal (Ed.): Proc. of the NBS Workshop on Factory Communications; March 1987; National Bureau of Standards, Gaithersburg; 1987.

    Google Scholar 

  12. R. Gusella, S. Latti: TEMPO-A Network Time Controller for a Distributed Berkeley Unix System; Distributed Processing Tech. Comm. Newsletter, 6(2); pp.7–15; IEEE; June 1984.

    Google Scholar 

  13. D. Haban, D. Wybranietz: Hardware Supported Monitoring in Distributed Computing Systems; University of Kaiserslautern; SFB 124 Report No. 23/86.

    Google Scholar 

  14. D. Haban: DTM — A Distributed Test Methodology; in Proc. 6th Symposium on Reliability in Distributed Software and Database Systems; pp.66–73; March 1987.

    Google Scholar 

  15. D. Haban, W. Weigel: Global Events and Global Breakpoints in Distributed Systems; Proc, of the Hawaii International Conference on System Sciences; Jan 1988.

    Google Scholar 

  16. P.K. Harter (Jr.), D.M. Heimbigner, R. King: IDD: An Interactive Distributed Debugger; In Proceedings of 5th Int. Conf. on Distributed Computing Systems; pp. 498–506; May 1985.

    Google Scholar 

  17. R. Hofmann et al.: Zählmonitor 4: Ein Monitorsystem für das Hardware-und Hybrid-Monitoring von Multiprozessor-und Multicomputer-Systemen; in: U. Herzog, M. Paterok (Hrsg.): 4.GI/ITG-Fachtagung Messung, Modellierung und Bewertung von Rechensystemen; Erlangen, Sept./Okt. 1987; Informatik-Fachberichte 154.

    Google Scholar 

  18. R. Ibbett: The Hardware Monitoring of a High Performance Processor; Computer Performance Evaluation; Advanced Book Program Cranfield Institute of Technology, (UK) pp. 274–292; Dec. 78.

    Google Scholar 

  19. K.W. Kolence, P.J. Kiviat: Software Unit Profiles and Kiviat Figures; ACM Sigmetrics Performance & Evaluation Review; June 1976.

    Google Scholar 

  20. H. Kopetz, W. Ochsenreiter: Interval Measurements in Distributed Real Time Systems; Proc of the 7th International Conf. on Distributed Computing Systems; Berlin; pp. 292–298; Sept. 1987.

    Google Scholar 

  21. L. Lamport: Time, Clocks and the Ordering of Events in a Distributed System; Commun. ACM 21(7); pp.558–565; 1978.

    Article  MATH  Google Scholar 

  22. L. Lamport, R. Shostak, M. Pease: The Byzantine General Problem; ACM Trans. Prog. Lang. Syst. 4(3); 1982.

    Google Scholar 

  23. J.E. Lambert, F. Halsall: Program Debugging and Performance Evaluation Aids for a Multi-Microprocessor System; Software & Microsystems, 3(1); Feb. 1984.

    Google Scholar 

  24. R. LeBlanc, A.D. Robbins: Event-Driven Monitoring of Distributed Systems; 5th Int. Conf. on Distributed Computer Systems; Denver; pp. 515–522; 1985.

    Google Scholar 

  25. T.J. LeBlanc, J.B. Mellor-Crummey: Debugging Parallel Programs with Instant Replay; IEEE Trans. Computers 36(4); April 1987.

    Google Scholar 

  26. K. Marzullo, S. Owicki: Maintaining the Time in a Distributed System; Proc. of the 2nd ACM Symp. on Princ. of Distributed Computing; 1983.

    Google Scholar 

  27. F. Mattern: Experiences with a New Distributed Termination Detection Algorithm; Proc. of the 2nd Int. Workshop on Distributed Algorithms; July 8–10, 1987; Amsterdam; Lecture Notes, Springer Verlag.

    Google Scholar 

  28. B.P. Miller: Performance Characterization of Distributed Programs; Ph.D. Dissertation; Technical Report UCB/CSD 84/197; University of California, Berkeley; May 1984.

    Google Scholar 

  29. B.P. Miller, C. Macrander, S. Sechrest: A Distributed Programs Monitor for Berkeley UNIX; Software-Practice and Experience 16(2) pp. 183–200; Feb. 1986.

    Article  Google Scholar 

  30. B.P. Miller, Yang Cui-Qing: IPS-An Interactive and Automatic Performance Tool for Parallel and Distributed Programs; Proc. of the 7th Int. Conf. on Distributed Computing Systems; Berlin; pp.482–489; Sept. 1987.

    Google Scholar 

  31. J. Nehmer et al.: Key Concepts of the INCAS Multicomputer Project; IEEE Trans. on Software Engineering, 13(8) pp. 913–923; Aug. 1987.

    Article  Google Scholar 

  32. Z. Segall, L. Rudolph: PIE: A Programming and Instrumentation Environment for Parallel Processing; IEEE Software; pp.22–37; Nov. 1985.

    Google Scholar 

  33. E.T. Smith: Debugging Technique for Communicating Loosely-Coupled Processes, University of Rochester, Ph.D.; 1982.

    Google Scholar 

  34. R. Snodgras: Monitoring Distributed Systems, A Relational Approach, Carnegie-Mellon University, Dept. of Computer Science; 1982.

    Google Scholar 

  35. L. Svobodova: Online System Performance Measurements with Software and Hybrid Monitors; Operating Systems Rev. 7(4); pp. 45–53; Oct. 1973.

    Article  Google Scholar 

  36. L. Svobodova: Computer System Measurability; IEEE Computer, June 76

    Google Scholar 

  37. L. Svobodova: Performance Monitoring in Computer Systems: A Structural Approach; ACM Operating System Review, Vol. 15, No.3; July 81.

    Google Scholar 

  38. W.A. Wulf et al.: Hydra/C.mmp: An Experimental Computer System; McGraw-Hill; 1981.

    Google Scholar 

  39. D. Wybranietz, R. Massar: An Overview of LADY; SFB 124 Report 12/85; Univ of Kaiserslautern; 1985.

    Google Scholar 

  40. D. Wybranietz: A Simulation System for Multicast Communications with Interactive Facilities; 15th Simula Users Conf., St. Helier, Jersey, Channel Islands; Sept. 87.

    Google Scholar 

  41. J. Yoyce et al.: Monitoring Distributed Systems; ACM Trans. on Computer Systems; 5(2); pp. 121–150; May 1987.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1988 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Haban, D., Wybranietz, D. (1988). A Tool for Measuring and Monitoring Distributed Systems During Operation. In: Kastens, U., Rammig, F.J. (eds) Architektur und Betrieb von Rechensystemen. Informatik-Fachberichte, vol 168. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-73451-9_23

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-73451-9_23

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-18994-7

  • Online ISBN: 978-3-642-73451-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics