Abstract
Nowadays cloud computing has become a major trend that enterprises and research organizations are pursuing with increasing zest. A potentially important application area for clouds is data analytics. In our previous publication, we introduced a novel cloud infrastructure, the CloudMiner, which facilitates data mining on massive scientific data. By providing a cloud platform which hosts data mining cloud services following the Software as a Service (SaaS) paradigm, CloudMiner offers the capability for realizing cloud-based data mining tasks upon traditional distributed databases and other dataset types. However, little attention has been paid to the issue of data stream management on the cloud so far. We have noticed the fact that some features of the cloud meet very well the requirements of data stream management. Consequently, we developed an innovative software framework, called the StreamMiner, which is introduced in this paper. It serves as an extension to the CloudMiner for facilitating, in particular, real-world data stream management and analysis using cloud services. In addition, we also introduce our tentative implementation of the framework. Finally, we present and discuss the first experimental performance results achieved with the first StreamMiner prototype.
Keywords
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Buyya, R., Broberg, J., Goscinski, A.: Cloud Computing: Principles and Paradigms. Wiley, Chichester (2011)
Perrott, R., Harmer, T., Lewis, R.: e-Science Infrastructure for Digital Media Broadcasting. Computer, 67–72 (2008)
Goscinski, A., Janciak, I., Han, Y., Brezany, P.: The CloudMiner: Moving Data Mining into Computational Clouds. In: Grid and Cloud Database Management. Springer, Berlin (2011)
Sempolinski, P., Thain, D.: A Comparison and Critique of Eucalyptus, OpenNebula and Nimbus. In: 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp. 417–426 (2010)
Laitkorpi, M., Selonen, P., Systa, T.: Towards a Model-Driven Process for Designing ReSTful Web Services. In: IEEE International Conference on Web Services, pp. 173–180 (2009)
Tilak, S., Hubbard, P., Miller, M., Fountain, T.: The Ring Buffer Network Bus (RBNB) DataTurbine Streaming Data Middleware for Environmental Observing Systems. In: IEEE International Conference on e-Science and Grid Computing, pp. 125–133 (2007)
Bifet, A., Holmes, G., Pfahringer, B., Kranen, P., Kremer, H., Jansen, H., Seidl, T.: MOA: Massive Online Analysis, a Framework for Stream Classification and Clustering. In: Journal of Machine Learning Research (JMLR) Workshop and Conference Proceedings (2010)
Feng, J., Wen, P., Liu, J., Li, H.: Elastic stream cloud (ESC): A stream-oriented cloud computing platform for Rich Internet Application. In: 2010 International Conference on High Performance Computing and Simulation, pp. 203–208 (2010)
Vijayakumar, S., Zhu, Q., Agrawal, G.: Dynamic Resource Provisioning for Data Streaming Applications in a Cloud Environment. In: 2010 IEEE Second International Conference on Cloud Computing Technology and Science, pp. 441–448 (2010)
Kleiminger, W., Kalyvianaki, E., Pietzuch, P.: Balancing load in stream processing with the cloud. In: 2011 IEEE 27th International Conference on Data Engineering Workshops, pp. 16–21 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Han, Y., Brezany, P., Goscinski, A. (2011). Stream Management within the CloudMiner. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds) Algorithms and Architectures for Parallel Processing. ICA3PP 2011. Lecture Notes in Computer Science, vol 7016. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24650-0_18
Download citation
DOI: https://doi.org/10.1007/978-3-642-24650-0_18
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24649-4
Online ISBN: 978-3-642-24650-0
eBook Packages: Computer ScienceComputer Science (R0)