Abstract
Hospitals throughout Europe hold vast amounts of data in the form of patient records. Performing on-the-fly analyses of these data and their actual transformation into information and knowledge may help improve medical procedures, treatments or prevent illnesses. Grid technology has recently emerged to address the needs for efficient and effective exploitation of heterogeneous and geographically distributed resources, such as large and distributed data, open source or proprietary programs for data analysis, massive storage devices and high-performance computers. A de facto standard framework for building grid environments is the Open Grid Service Architecture (OGSA) and the corresponding Web Service Resource Framework (WSRF). The Globus Toolkit version 4, is a fully WSRF-compliant grid middleware, which addresses the needs for secure, flexible, interoperable and seamless use of grid resources. The DataMiningGrid© (www.datamininggrid.org) system was recently built on top of existing Globus technology inter alia to address the requirements of a community of medical users and enable them to perform on-the-fly analysis of geographically distributed medical databases. DataMiningGrid© is a set of grid services and user-friendly workflow editing and managing tools, which facilitate manipulation of distributed data, registering, discovery and use of grid-enabled statistical and data mining programs, their execution in the grid environment and a provenance tracking mechanism. The software is now freely available at SourceForge.net under Apache License V2. The present work illustrates the use of the DataMiningGrid© system to perform analysis of nine regional medical databases in Slovenia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Open Grid Services Architecture (OGSA), http://www.globus.org/ogsa/
Foster I, Globus Toolkit Version 4 (2005) Software for Service- Oriented Systems, in Jin H, Reed D, and Jiang W, (editors): NPC 2005, LNCS 3779, pp 2–13
Stankovski V, May M, Franke J, Schuster A, McCourt D, Dubitzky W (2004) A Service-Centric Perspective for Data Mining in Complex Problem Solving Environments, HR Arabnia and J Ni (editors). Proc of Int”l Conference on Parallel and Distributed Processing Techniques and Applications (PDPTA'04), II, pp780–787
GridLab, Weka4WS at http://grid.deis.unical.it/weka4ws/
Witten IH, Frank E (2005) Data Mining: Practical machine learning tools and techniques. 2nd Edition. Morgan Kaufmann, San Francisco.
Triana at http://www.trianacode.org/
Antonioletti M, Atkinson M, Baxter R, Borley A, Chue Hong NP, Collins B et al. (2005) The design and implementation of Grid database services in OGSA-DAI. Concurrency and Computation: Practice and Experience 17(2–4):357–376
OGSA-DAI project Web site at www.ogsadai.org.uk/ under documentation
GridBus Service Broker, a grid scheduler for computational and data grids, www.Gridbus.org/broker/
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Stankovski, V., Swain, M., Stimec, M., Fidler Mis, N. (2007). Analyzing Distributed Medical Databases on DataMiningGrid©. In: Jarm, T., Kramar, P., Zupanic, A. (eds) 11th Mediterranean Conference on Medical and Biomedical Engineering and Computing 2007. IFMBE Proceedings, vol 16. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-73044-6_41
Download citation
DOI: https://doi.org/10.1007/978-3-540-73044-6_41
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-73043-9
Online ISBN: 978-3-540-73044-6
eBook Packages: EngineeringEngineering (R0)