Skip to main content

An architecture for distributed enterprise data mining

  • Track C3: Computational Science
  • Conference paper
  • First Online:
High-Performance Computing and Networking (HPCN-Europe 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1593))

Included in the following conference series:

Abstract

The requirements for data mining systems for large organisations and enterprises range from logical and physical distribution of large data and heterogeneous computational resources to the general need for high performance at a level that is sufficient for interactive work. This work categorises the requirements and describes the Kensington software architecture that addresses these demands. The system is capable of transparently supporting parallel computation at two levels, and we describe a configuration for trans-atlantic distributed parallel data mining that was demonstrated at the recent Supercomputing conference.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  • [CDG+97] J. Chattratichat, J. Darlington, M.Ghanem, Y. Guo, H. Hüning, M. Köhler, J. Sutiwaraphun, H. W. To, and D. Yang. Large scale data mining: Challenges and responses. In Proceedings of Third International Conference on Knowledge Discovery and Data Mining, pages 143–146, 1997.

    Google Scholar 

  • [CDG+98] J. Chattratichat, J. Darlington, Y. Guo, S. Hedvall, M. Köhler, A. Saleem, J. Sutiwaraphun, and D. Yang. A software architecture for deploying high-performance solutions on the internet. In High-Performance Computing and Networking, 1998.

    Google Scholar 

  • [CS96] P. K. Chan and S. J. Stolfo. Sharing learned models among remote database partitions by local meta-learning. In E. Simoudis, J. Han, and U. Fayyad, editors, The Second International Conference on Knowledge Discovery and Data Mining, pages 2–7. AAAI Press, 1996.

    Google Scholar 

  • [FPSS96] U. M. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. From data mining to knowledge discovery: An overview. In U. M. Fayyad, G. Piatetesky-Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining. MIT Press, 1996.

    Google Scholar 

  • [Pen97] Wanida Pensuwon. Parallel neural networks. Msc. thesis, Imperial College, University of London, 1997.

    Google Scholar 

  • [Qui93] J. Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman Publishers, 1993.

    Google Scholar 

  • [Rue97] Stefan Rueger. Parallel self-organising maps. In Proceedings of the Seventh Parallel Computing Workshop, Australian National University, Canberra, September 25–26, 1997.

    Google Scholar 

  • [Toi96] Hannu Toivonen. Discovery of Frequent Patterns in Large Data Collections. PhD thesis, Department of Computer Science, University of Finland, 1996.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Peter Sloot Marian Bubak Alfons Hoekstra Bob Hertzberger

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag

About this paper

Cite this paper

Chattratichat, J., Darlington, J., Guo, Y., Hedvall, S., Köhler, M., Syed, J. (1999). An architecture for distributed enterprise data mining. In: Sloot, P., Bubak, M., Hoekstra, A., Hertzberger, B. (eds) High-Performance Computing and Networking. HPCN-Europe 1999. Lecture Notes in Computer Science, vol 1593. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0100618

Download citation

  • DOI: https://doi.org/10.1007/BFb0100618

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-65821-4

  • Online ISBN: 978-3-540-48933-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics