Skip to main content

Web Service Based Algorithm Management Framework for Stream Data Processing

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2013)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 7827))

Included in the following conference series:

  • 2923 Accesses

Abstract

In recent ten years, In contrast to lots of work on developing algorithms for stream data mining, little work has been done in the management of various stream data processing algorithms, which make the application rate of algorithms quite low. In this research, we first establish a process model for data stream mining. An algorithm management framework based on web services for stream data processing, AMF4SDP, is then proposed. We analyze the construction of data stream processing algorithm repository and the architecture of the framework. Using the framework, a data stream oriented algorithm management prototype system is implemented on Eclipse. Experiments validate that the framework has high flexibility and self adaptability.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. He, H., et al.: Incremental learning from stream data. IEEE Transactions on Neural Networks 22(12, pt. 1), 1901–1914 (2011)

    Article  Google Scholar 

  2. Lian, X., Chen, L.: Similarity join processing on uncertain data streams. IEEE Transactions on Knowledge and Data Engineering 23(11), 1718–1734 (2011)

    Article  Google Scholar 

  3. Guha, S., Meyerson, A., Mishra, N., Motwani, R., O’Callaghan, L.: Clustering Data Streams: Theory and Practice. TKDE Special Issue on Clustering 15 (2003)

    Google Scholar 

  4. Indyk, P., Koudas, N., Muthukrishnan, S.: Identifying Representative Trends in Massive Time Series Data Sets Using Sketches. In: The 26th International Conference on Very Large Data Bases, Cairo, Egypt (2000)

    Google Scholar 

  5. Manku, G.S., Motwani, R.: Approiximate frequency counts over data streams. In: 28th International Conference on Very Large Data Bases, Hong Kong, China (2002)

    Google Scholar 

  6. Wang, H., Fan, W., Yu, P.S., Han, J.: Mining Concept-drifting Data Streams using Ensemble Classifiers. In: The 9th ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), Washington DC, USA (2003)

    Google Scholar 

  7. Zhu, X., Xiao, F., Huang, Z., Shen, G., Jin, L.: Description Logic Based Extended Predictive Model Markup Language EPMML. Chinese Journals of Computers 35(8), 1644–1655 (2012) (in Chinese Language)

    Google Scholar 

  8. Zhu, X., Wang, H., Gan, H., Gao, C.: Construction and Management of Automatical Reasoning Supported Data Mining Metadata. In: 2011 IEEE International Conference on Supernetworks and System Management, pp. 205–210 (2011)

    Google Scholar 

  9. Zhu, X., Yang, J.: An Extended Predictive Model Markup Language for Data Mining. In: Chen, L., Tang, C., Yang, J., Gao, Y. (eds.) WAIM 2010. LNCS, vol. 6184, pp. 218–231. Springer, Heidelberg (2010)

    Chapter  Google Scholar 

  10. Zhu, X.-D., Huang, Z.-Q.: Conceptual modeling rules extracting for data streams. Knowledge-Based Systems 21, 934–940 (2008)

    Article  MathSciNet  Google Scholar 

  11. Zhu, X., Huang, Z., Shen, G.: Description Logic based Consistency Checking upon Data Mining Metadata. In: Wang, G., Li, T., Grzymala-Busse, J.W., Miao, D., Skowron, A., Yao, Y. (eds.) RSKT 2008. LNCS (LNAI), vol. 5009, pp. 475–482. Springer, Heidelberg (2008)

    Chapter  Google Scholar 

  12. CRISP-DM.:CRoss Industry Standard Process for Data Mining (2011), http://www.crisp-dm.org

  13. DMG.:PMML Version 3.1. (2010), http://www.dmg.org/index.html

  14. Kang, J., Naughton, J.F., Viglas, S.D.: Evaluating Window Joins over Unbounded Streams. In: Proceedings of the 28th VLDB Conference, Hong Kong, China (2002)

    Google Scholar 

  15. Romei, A., Ruggieri, S., Turini, F.: KDDML:A middleware language and system for knowledge discovery in databases. Data & Knowledge Engineering 57, 179–220 (2006)

    Article  Google Scholar 

  16. Yang, L., Zuo, C., Wang, Y.G.: Research and Implementation of Service Oriented Architecture for Knowledge Discovery. Chinese Journal of Computers 28, 445–457 (2005)

    Google Scholar 

  17. Cheung, W.K., Zhang, X., Wong, H., Liu, J.: Service-Oriented Distributed Data Mining. IEEE Internet Computing (2006)

    Google Scholar 

  18. Lauinen, P., Tuovinen, L., Ring, J.: Smart Archive: A Component-based Data Mining Application Framework. In: The 5th International Conference on Intelligent Systems Design and Applications (2005)

    Google Scholar 

  19. Liu, G., Yuan, S., Dong, L., Li, Y.: An Implementation of Data Mining PMML Based on Work Flow. Journal of Chinese Computer Systems 28, 891–894 (2007)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhu, X., Wang, H. (2013). Web Service Based Algorithm Management Framework for Stream Data Processing. In: Hong, B., Meng, X., Chen, L., Winiwarter, W., Song, W. (eds) Database Systems for Advanced Applications. DASFAA 2013. Lecture Notes in Computer Science, vol 7827. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40270-8_17

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-40270-8_17

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-40269-2

  • Online ISBN: 978-3-642-40270-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics