Abstract
Grid systems, as large-scale distributed computing environments, are widely used by data mining communities. This paper proposes a set of system-level Grid services to form an infrastructure supporting data-intensive applications and data mining. ChinaGrid, aiming at integrate heterogeneous massive resources distributed on China Education and Research Network (CERNET), is a national-wide Grid project supported by the Chinese government. ChinaGrid Supporting Platform (CGSP) is a Grid middleware developed for the ChinaGrid. It provides a series of system-level services of the ChinaGrid, helps to build application portals and integrate Grid resources, and supports the secondary development of Grid services. The Data Management Services (DMS) is a group of Grid services in CGSP to manage storage and data resources, support transparent data access, and guarantee high-performance data transfer on the Grid. It consists of metadata management service, storage resource management service, replication management service, storage agent and transfer client. It offers the fundamental support for data mining applications on ChinaGrid. In this paper, we introduce the design principle and implementation of DMS.
Supported by the National Science Foundation of China under Grant Nos. 60673174, 90412010 and the National High-Tech Research and Development Plan of China under Grant No. 2006AA01A115.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Foster, I., Kesselman, C., Tuecke, S.: The anatomy of the Grid: Enabling scalable virtual organization. International Journal of Supercomputer Applications 15(3), 200–222 (2001)
Jin, H.: ChinaGrid: Making Grid Computing a Reality. In: Chen, Z., Chen, H., Miao, Q., Fu, Y., Fox, E., Lim, E.-p. (eds.) ICADL 2004. LNCS, vol. 3334, pp. 13–24. Springer, Heidelberg (2004)
ChinaGrid, http://www.chinagrid.edu.cn
ChinaGrid Supporting Platform, http://www.chinagrid.edu.cn/cgsp
Foster, I., Kesselman, C.: Globus: A Metacomputing Infrastructure Toolkit. International Journal of Supercomputer Applications 11(2), 115–129 (1998)
Rajasekar, A., Wan, M., Moore, R.: MySRB and SRB - components of a Data Grid. In: Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing, Edinburgh, pp. 301–310. Institute of Electrical and Electronics Engineers Inc, San Francisco (2002)
Kubiatowicz, J., Bindel, D., Chen, Y., et al.: OceanStore: An Architecture for Global-Scale Persistent Storage. ACM SIGPLAN Notices 35(11), 190–201 (2000)
The Web Services Resource Framework, http://www.globus.org/wsrf
Ganger, G.R., Strunk, J.D., Klosterman, A.J.: Self-* Storage: brick with automated administration. Technical Report CMU-CS-03-178. Carnegie Mellon University (August 2003)
Allcock, W., Bresnahan, J., Kettimuthu, R., Link, M., Dumitrescu, C., Raicu, I., Foster, I.: The Globus Striped GridFTP Framework and Server. In: Gschwind, T., Aßmann, U., Nierstrasz, O. (eds.) SC 2005. LNCS, vol. 3628, Springer, Heidelberg (2005)
Liu, L.K., Wu, Y.W., Yang, G.W., et al.: General Running Service: An Execution Framework for Executing Legacy Program on Grid. In: Fifth International Conference on Grid and Cooperative Computing Workshops, pp. 522–529.
He, F., Wu, Y.W., Yang, G.W., et al.: Grid Programming Environment over ChinaGrid Support Platform. In: Fifth International Conference on Grid and Cooperative Computing Workshops, pp. 530–535.
BitTorrent Protocol Specification, http://www.bittorrent.org/protocol.html
Guan, X.S., Jin, H., Xie, C., Wang, Q.C.: An Adaptive transfer Algorithm in GDSS. In: Second International Conference on Knowledge Economy and Development of Science and Technology(KEST2004), BeiJing, China, September 17-19 2004, pp. 226–233.
Jin, H., Ran, L., Wang, Z., et al.: Architecture Design of Global Distributed Storage System for Data Grid. High Technology Letters 9(4), 1–4 (2003)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wu, S., Wang, W., Xiong, M., Jin, H. (2007). Data Management Services in ChinaGrid for Data Mining Applications. In: Washio, T., et al. Emerging Technologies in Knowledge Discovery and Data Mining. PAKDD 2007. Lecture Notes in Computer Science(), vol 4819. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-77018-3_42
Download citation
DOI: https://doi.org/10.1007/978-3-540-77018-3_42
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-77016-9
Online ISBN: 978-3-540-77018-3
eBook Packages: Computer ScienceComputer Science (R0)