Abstract
The requirements for data mining systems for large organisations and enterprises range from logical and physical distribution of large data and heterogeneous computational resources to the general need for high performance at a level that is sufficient for interactive work. This work categorises the requirements and describes the Kensington software architecture that addresses these demands. The system is capable of transparently supporting parallel computation at two levels, and we describe a configuration for trans-atlantic distributed parallel data mining that was demonstrated at the recent Supercomputing conference.
Preview
Unable to display preview. Download preview PDF.
References
[CDG+97] J. Chattratichat, J. Darlington, M.Ghanem, Y. Guo, H. Hüning, M. Köhler, J. Sutiwaraphun, H. W. To, and D. Yang. Large scale data mining: Challenges and responses. In Proceedings of Third International Conference on Knowledge Discovery and Data Mining, pages 143–146, 1997.
[CDG+98] J. Chattratichat, J. Darlington, Y. Guo, S. Hedvall, M. Köhler, A. Saleem, J. Sutiwaraphun, and D. Yang. A software architecture for deploying high-performance solutions on the internet. In High-Performance Computing and Networking, 1998.
[CS96] P. K. Chan and S. J. Stolfo. Sharing learned models among remote database partitions by local meta-learning. In E. Simoudis, J. Han, and U. Fayyad, editors, The Second International Conference on Knowledge Discovery and Data Mining, pages 2–7. AAAI Press, 1996.
[FPSS96] U. M. Fayyad, G. Piatetsky-Shapiro, and P. Smyth. From data mining to knowledge discovery: An overview. In U. M. Fayyad, G. Piatetesky-Shapiro, P. Smyth, and R. Uthurusamy, editors, Advances in Knowledge Discovery and Data Mining. MIT Press, 1996.
[Pen97] Wanida Pensuwon. Parallel neural networks. Msc. thesis, Imperial College, University of London, 1997.
[Qui93] J. Ross Quinlan. C4.5: Programs for Machine Learning. Morgan Kaufman Publishers, 1993.
[Rue97] Stefan Rueger. Parallel self-organising maps. In Proceedings of the Seventh Parallel Computing Workshop, Australian National University, Canberra, September 25–26, 1997.
[Toi96] Hannu Toivonen. Discovery of Frequent Patterns in Large Data Collections. PhD thesis, Department of Computer Science, University of Finland, 1996.
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 1999 Springer-Verlag
About this paper
Cite this paper
Chattratichat, J., Darlington, J., Guo, Y., Hedvall, S., Köhler, M., Syed, J. (1999). An architecture for distributed enterprise data mining. In: Sloot, P., Bubak, M., Hoekstra, A., Hertzberger, B. (eds) High-Performance Computing and Networking. HPCN-Europe 1999. Lecture Notes in Computer Science, vol 1593. Springer, Berlin, Heidelberg. https://doi.org/10.1007/BFb0100618
Download citation
DOI: https://doi.org/10.1007/BFb0100618
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-65821-4
Online ISBN: 978-3-540-48933-7
eBook Packages: Springer Book Archive