An architecture for distributed enterprise data mining

Chattratichat, J.; Darlington, J.; Guo, Y.; Hedvall, S.; Köhler, M.; Syed, J.

doi:10.1007/BFb0100618

J. Chattratichat¹,
J. Darlington¹,
Y. Guo¹,
S. Hedvall¹,
M. Köhler¹ &
…
J. Syed¹

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1593))

Included in the following conference series:

International Conference on High-Performance Computing and Networking

232 Accesses
31 Citations

Abstract

The requirements for data mining systems for large organisations and enterprises range from logical and physical distribution of large data and heterogeneous computational resources to the general need for high performance at a level that is sufficient for interactive work. This work categorises the requirements and describes the Kensington software architecture that addresses these demands. The system is capable of transparently supporting parallel computation at two levels, and we describe a configuration for trans-atlantic distributed parallel data mining that was demonstrated at the recent Supercomputing conference.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

[CDG⁺97] J. Chattratichat, J. Darlington, M.Ghanem, Y. Guo, H. Hüning, M. Köhler, J. Sutiwaraphun, H. W. To, and D. Yang. Large scale data mining: Challenges and responses. In Proceedings of Third International Conference on Knowledge Discovery and Data Mining, pages 143–146, 1997.
Google Scholar
[CDG⁺98] J. Chattratichat, J. Darlington, Y. Guo, S. Hedvall, M. Köhler, A. Saleem, J. Sutiwaraphun, and D. Yang. A software architecture for deploying high-performance solutions on the internet. In High-Performance Computing and Networking, 1998.
Google Scholar
[CS96] P. K. Chan and S. J. Stolfo. Sharing learned models among remote database partitions by local meta-learning. In E. Simoudis, J. Han, and U. Fayyad, editors, The Second International Conference on Knowledge Discovery and Data Mining, pages 2–7. AAAI Press, 1996.
Google Scholar
[FPSS96] U. M. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. From data mining to knowledge discovery: An overview. In U. M. Fayyad, G. Piatetesky-Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining. MIT Press, 1996.
Google Scholar
[Pen97] Wanida Pensuwon. Parallel neural networks. Msc. thesis, Imperial College, University of London, 1997.
Google Scholar
[Qui93] J. Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman Publishers, 1993.
Google Scholar
[Rue97] Stefan Rueger. Parallel self-organising maps. In Proceedings of the Seventh Parallel Computing Workshop, Australian National University, Canberra, September 25–26, 1997.
Google Scholar
[Toi96] Hannu Toivonen. Discovery of Frequent Patterns in Large Data Collections. PhD thesis, Department of Computer Science, University of Finland, 1996.
Google Scholar

Download references

Author information

Authors and Affiliations

Data Mining Group, Imperial College Parallel Computing Centre, 180 Queen's Gate, SW7 2BZ, London, UK
J. Chattratichat, J. Darlington, Y. Guo, S. Hedvall, M. Köhler & J. Syed

Authors

J. Chattratichat
View author publications
You can also search for this author in PubMed Google Scholar
J. Darlington
View author publications
You can also search for this author in PubMed Google Scholar
Y. Guo
View author publications
You can also search for this author in PubMed Google Scholar
S. Hedvall
View author publications
You can also search for this author in PubMed Google Scholar
M. Köhler
View author publications
You can also search for this author in PubMed Google Scholar
J. Syed
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Peter Sloot Marian Bubak Alfons Hoekstra Bob Hertzberger

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Chattratichat, J., Darlington, J., Guo, Y., Hedvall, S., Köhler, M., Syed, J. (1999). An architecture for distributed enterprise data mining. In: Sloot, P., Bubak, M., Hoekstra, A., Hertzberger, B. (eds) High-Performance Computing and Networking. HPCN-Europe 1999. Lecture Notes in Computer Science, vol 1593. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0100618

Download citation

DOI: https://doi.org/10.1007/BFb0100618
Published: 17 November 2006
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65821-4
Online ISBN: 978-3-540-48933-7
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics