Journal of Grid Computing

, Volume 8, Issue 2, pp 341–364 | Cite as

The Back-End of a Two-Layer Model for a Federated National Datastore for Academic Research VOs that Integrates EGEE Data Management

  • Brian Coghlan
  • John Walsh
  • Stephen Childs
  • Geoff Quigley
  • David O’Callaghan
  • Gabriele Pierantoni
  • John Ryan
  • Neil Simon
  • Keith Rochford


This paper proposes an architecture for the back-end of a federated national datastore for use by academic research communities, developed by the e-INIS (Irish National e-InfraStructure) project, and describes in detail one member of the federation, the regional datastore at Trinity College Dublin. It builds upon existing infrastructure and services, including Grid-Ireland, the National Grid Initiative and EGEE, Europe’s leading Grid infrastructure. It assumes users are in distinct research communities and that their data access patterns can be described via two properties, denoted as mutability and frequency-of-access. The architecture is for a back-end—individual academic communities are best qualified to define their own front-end services and user interfaces. The proposal is designed to facilitate front-end development by placing minimal restrictions on how the front-end is implemented and on the internal community security policies. The proposal also seeks to ensure that the communities are insulated from the back-end and from each other in order to ensure quality of service and to decouple their front-end implementation from site-specific back-end implementations.


Digital repositories Data storage EGEE Data management Grid 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    CMIP5 - Coupled Model Intercomparison Project Phase 5.
  2. 2.
    Digital Humanities Observatory.
  3. 3.
    Heliophysics Integrated Observatory (HELIO).
  4. 4.
    Irish Centre for High End Computing (ICHEC).
  5. 5.
    ESFRI Working Group About Digital Repositories, ESFRI Position Paper. ESFRI (2007). Also available as (2007)
  6. 6.
    NSF07-28 Cyberinfrastructure Vision for 21st Century Discovery. “National Science Foundation” (2007). Also available as (2007)
  7. 7.
    OECD Principles and Guidelines for Access to Research Data from Public Funding. OECD (2007). Also available as (2007)
  8. 8.
    Enabling Inter-Repository Access Management between iRODS and Fedora. In: 4th International Conference on Open Repositories, OR09. Conference Presentations. Georgia Institute of Technology. (2009)
  9. 9.
    Blanke, T., Hedges, M., Dunn, S.: Arts and humanities e-science—current practices and future challenges. Future Gener. Comput. Syst. 25(4), 474–480. doi: 10.1016/j.future.2008.10.004 (2009)CrossRefGoogle Scholar
  10. 10.
    Cassidy, K., McCandless, J., Childs, S., Walsh, J., Coghlan, B., Dagger, D.: Combining a virtual Grid testbed and Grid elearning courseware. In: Proc. Cracow Grid Workshop 2006 (CGW06). Academic Computer Centre CYFRONET AGH, Cracow, Poland (2006)Google Scholar
  11. 11.
    Childs, S., Coghlan, B., O’Callaghan, D., Quigley, G., Walsh, J.: Centralised fabric management for a national Grid infrastructure. In: Cracow Grid Workshop (CGW’05). Cracow, Poland (2005)Google Scholar
  12. 12.
    Colarelli, D., Grunwald, D., Neufeld, M.: The case for massive arrays of idle disks (maid). In: In the 2002 Conference on File and Storage Technologies, p. 2002. On (2002)Google Scholar
  13. 13.
    Cornillon, P., Gallagher, J., Sgouros, T.: Opendap: accessing data in a distributed, heterogeneous environment. Data Sci. J. 2, 164–174. doi: 10.2481/dsj.2.164 (2003)CrossRefGoogle Scholar
  14. 14.
    Data Management Task Force: e-IRG Report on Data Management. e-Infrastructure Reflection Group (2009)Google Scholar
  15. 15.
    Foster, I., Kesselman, C., Tsudik, G., Tuecke, S.: A security architecture for computational Grids. In: Proc. 5th ACM Conference on Computer and Communications Security Conference, pp. 83–92 (1998)Google Scholar
  16. 16.
    Hedges, M., Blanke, T., Hasan, A.: Rule-based curation and preservation of data: a data Grid approach using iRODS. In: Future Generation Computer Systems-the International Journal of Grid Computing-Theory Methods and Applications, vol. 25(4), pp. 446–452. 3rd IEEE International Conference on e-Science and Grid Computing, Bangalore, India, 10–13 Dec 2007. doi: 10.1016/j.future.2008.10.003 (2009)
  17. 17.
    Hertel, C.R.: Implementing CIFS: the Common Internet File System. Prentice Hall PTR. (2003)
  18. 18.
    Klous, S., Frey, J., Son, S.C., Thain, D., Roy, A., Livny, M., van den Brand, J.: Transparent access to Grid resources for user software. Concurr. Comput.: Practice and Experience 18(7), 787–801. doi: 10.1002/cpe.961 (2006)CrossRefGoogle Scholar
  19. 19.
    Knopper, S., Koning, R., Roodhart, J., Grosso, P., de Laat, C.: Amsterdam cinegrid exchange—a distributed high-quality digital media solution. SNE Technical Report SNE-UVA-2009-01. Available at (2009)
  20. 20.
    Koblitz, B., Santos, N., Pose, V.: The AMGA metadata service. Journal of Grid Computing 6(1), 61–76. doi: 10.1007/s10723-007-9084-6 (2008)CrossRefGoogle Scholar
  21. 21.
    Laure, E., Gr, C., Fisher, S., Frohner, A., Kunszt, P., et al.: Programming the Grid with glite. In: Computational Methods in Science and Technology, vol. 12, pp. 33–45. (2006)
  22. 22.
    Mandrichenko, I., Allcock, W., Perelmutov, T.: GridFTP v2 Protocol Description. GGF Document Series GFD.47. Also available as (2005)
  23. 23.
    Oinn, T., Addis, M., Ferris, J., Marvin, D., Senger, M., Greenwood, M., Carver, T., Glover, K., Pocock, M.R., Wipat, A., Li, P.: Taverna: a tool for the composition and enactment of bioinformatics workflows. Bioinformatics 20(17), 3045–3054. doi: 10.1093/bioinformatics/bth361 (2004)CrossRefGoogle Scholar
  24. 24.
    Saklikar, S., Saha, S.: Next steps for security assertion markup language (SAML). In: SWS’07: Proceedings of the 2007 ACM Workshop on Secure Web Services, pp. 52–65. Assoc Computing Machinery, 1515 Broadway, New York, NY 10036-9998 USA. ACM Workshop on Secure Web Services, Fairfax, VA, 02 NOV 2007 (2007)Google Scholar
  25. 25.
    Satran, J., Meth, K., Sapuntzakis, C., Chadalapaka, M., Zeidner, E.: Internet Small Computer Systems Interface (iSCSI). RFC 3720 (Proposed Standard). Updated by RFCs 3980, 4850, 5048 (2004)
  26. 26.
    Schulz, G.: MAID 2.0: Energy Savings without Performance Compromises.
  27. 27.
    Sim, A., Shoshani, A., Badino, P., Barring, O., Baud, J., Corso, E., Witt, S.D., Donno, F., Gu, J., Haddox-Schatz, M., Hess, B., Jensen, J., Kowalski, A., Litmaath, M., Magnoni, L., Perelmutov, T., Petravick, D., Watson, C.: The storage resource manager interface specification version 2.2. Also available as GGF Document Series GFD.129 (2008)
  28. 28.
    Thain, D., Livny, M.: Parrot: Transparent user-level middleware for data-intensive computing. In: In Workshop on Adaptive Grid Middleware. (2003)
  29. 29.
    Whitehead, J.E., Wiggins, M.: Webdav: IETF standard for collaborative authoring on the web. IEEE Internet Computing 2(5), 34–40. doi: 10.1109/4236.722228 (1998)CrossRefGoogle Scholar
  30. 30.
    Witzig, C.: Shibboleth Interoperability Through a Short Lived Credential Service. EGEE-II. Report EGEE-II-MJRA1.4-770102-v0.96.doc (2006)Google Scholar
  31. 31.
    Zhang, S., Coddington, P., Wendelborn, A.: Davis: a generic interface for SRB and iRODS. DHPC Technical Report DHPC-197 (2009)Google Scholar

Copyright information

© Springer Science+Business Media B.V. 2010

Authors and Affiliations

  • Brian Coghlan
    • 1
  • John Walsh
    • 1
  • Stephen Childs
    • 1
  • Geoff Quigley
    • 1
  • David O’Callaghan
    • 1
  • Gabriele Pierantoni
    • 1
  • John Ryan
    • 1
  • Neil Simon
    • 1
  • Keith Rochford
    • 2
  1. 1.School of Computer Science and StatisticsTrinity College DublinDublin 2Ireland
  2. 2.Dublin Institute for Advanced StudiesDublin 2Ireland

Personalised recommendations