Abstract
The growth in the scale of systems and networks has created many challenges for their management, especially for event processing. Our premise is that scaling event processing requires parallelism. To this end, we observe that event processing can be divided into intra-event processing such as filtering and inter-event processing such as root cause analysis. Since intra-event processing is easily parallelized, we propose an architecture in which intra-event processing elements (IAPs) are replicated to scale to larger event input rates. We address two challenges in this architecture. First, the IAPs are subject to overloads that require effective flow control, a capability that was not present in the components we used to build IAPs. Second, we need to balance the loads on IAPs to avoid creating resource bottlenecks. These challenges are further complicated by the presence of disturbances such as CPU intensive administrative tasks that reduce event processing rates. We address these challenges using designs based on control theory, a technique for analyzing stability, accuracy, and settling times. We demonstrate the effectiveness of our approaches with testbed experiments that include a disturbance in the form of a CPU intensive application.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Burns, L., Hellerstein, J.L., Ma, S., Perng, C.S., Rabenhorst, D.A., Taylor, D.: A systematic approach to discovering correlation rules for event management. In: IEEE/IFIP Integrated Network Management (May 2001)
Carzaniga, A., Rosenblum, D.S., Wolf, A.L.: Achieving scalability and expressiveness in an internet-scale event notification service. In: Proceedings of the Nineteenth Annual ACM Symposium on Principles of Distributed Computing, Portland, Oregon, pp. 219–227 (July 2000)
Chen, M., Zheng, A., Lloyd, J., Jordan, M., Brewer, E.: A statistical learning approach to failure diagnosis. In: International Conference on Autonomic Computing (ICAC 2004), New York, NY (May 2004)
Chen, M.Y., Kiciman, E., Fratkin, E., Fox, A., Brewer, E.A.: Pinpoint: Problem determination in large, dynamic internet services. In: DSN, pp. 595–604 (2002)
Hewlett-Packard Development Company. Hp OpenView (2005), http://www.openview.hp.com/
IBM Corporation. Tivoli, http://www.ibm.com/software/tivoli/
Microsoft Corporation. Microsoft Operations Manager, http://www.microsoft.com/mom/
Diao, Y., Hellerstein, J.L., Storm, A., Surendra, M., Lightstone, S., Parekh, S., Garcia-Arellano, C.: Using MIMO Linear Control for Load Balancing in Computing Systems. In: American Control Conference, pp. 2045–2050 (June 2004)
Hellerstein, J.L., Diao, Y., Parekh, S., Tilbury, D.M.: Feedback Control of Computing Systems. Wiley-IEEE Press, Chichester (2004)
Hofmeyr, S.A., Forrest, S., Somayaji, A.: Intrusion detection using sequences of system calls. Journal of Computer Security 6(3), 151–180 (1998)
Huebsch, R., Hellerstein, J.M., Lanham, N., Loo, B.T., Shenker, S., Stoica, I.: Querying the internet with PIER. In: Proceedings of the 29th VLDB Conference (2003)
Krishnamurthy, S., Chandrasekaran, S., Cooper, O., Deshpande, A., Franklin, M.J., Hellerstein, J.M., Hong, W., Madden, S., Reiss, F., Shah, M.A.: Telegraphcq: An architectural status report. IEEE Data Eng. Bull. 26(1), 11–18 (2003)
van Renesse, R., Birman, K.P., Vogels, W.: Astrolabe: A robust and scalable technology for distributed system monitoring, management, and data mining. ACM Transactions on Computer Systems 21(2), 164–206 (2003)
Vilalta, R., Apté, C., Hellerstein, J.L., Ma, S., Weiss, S.M.: Predictive algorithms in the management of computer systems. IBM Systems Journal 41(3), 461–474 (2002)
Yemini, S.A., Kliger, S., Mozes, E., Yemini, Y., Ohsie, D.: High speed and robust event correlation. IEEE Communications Magazine 34(5), 82–90 (1996)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 IFIP International Federation for Information Processing
About this paper
Cite this paper
Xu, W., Hellerstein, J.L., Kramer, B., Patterson, D. (2005). Control Considerations for Scalable Event Processing. In: Schönwälder, J., Serrat, J. (eds) Ambient Networks. DSOM 2005. Lecture Notes in Computer Science, vol 3775. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11568285_20
Download citation
DOI: https://doi.org/10.1007/11568285_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-29388-0
Online ISBN: 978-3-540-32244-3
eBook Packages: Computer ScienceComputer Science (R0)