Skip to main content

Grid and Distributed Public Computing Schemes for Structural Proteomics: A Short Overview

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 4743))

Abstract

Grid and distributed public computing schemes has become an essential tool for many scientific fields including bioinformatics, computational biology and systems biology. The adoption of these technologies has given rise to a wide range of projects and contributions that provide various ways of setting up these environments and exploiting their potential resources and services for different domains of applications. This paper aims to provide a distilled overview of some of the major projects, technologies and resources employed in the area of structural proteomics. The major emphasis would be to briefly comment on various approaches related to the gridification and parallelization of some flagship legacy applications, tools and data resources related to key structural proteomics problems such as protein structure prediction, folding and comparison. The comments are based on theoretical analysis of some interesting parameters such as performance gain after gridification, user level interaction environments, workload distribution and the choice of deployment infrastructure and technologies. The study of these parameters would provide a basis for some motivating justification needed for further research and development in this domain.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://en.wikipedia.org/wiki/Biological_databases

  2. Rice, P., Longden, I., Bleasby, A.: EMBOSS: The European Molecular Biology Open Software Suite. Trends in Genetics 16, 276–277 (2000)

    Article  Google Scholar 

  3. Stevens, R.D., Robinson, A.J., Goble, C.A.: MyGrid: Personalized bioinformatics on the information grid. Bioinformatics 19, 302–304 (2003)

    Article  Google Scholar 

  4. Gagliardi, F., Jones, B., Grey, F., Begin, M.E., Heikkurinen, M.: Building an infrastructure for scientific Grid computing: status and goals of the EGEE project. Philosophical Transactions: Mathematical, Physical and Engineering Sciences 363, 1729–1742 (2005)

    Article  Google Scholar 

  5. Michael, P., Sergio, M., Peter, K.: The SwissBioGrid Project: Objectives, Preliminary Results and Lessons Learned. In: 2nd IEEE International Conference on e-Science and Grid Computing (e-Science 2006) - Workshop on Production Grids, IEEE Computer Society Press, Amsterdam, Netherlands (2006)

    Google Scholar 

  6. World Community Grid: Technology Solving Problems, http://www.worldcommunitygrid.org/

  7. Pande, V.S., Baker, I., Chapman, J., et al.: Atomistic Protein Folding Simulations on the Submillisecond Time Scale Using Worldwide Distributed Computing. Biopolymers 68, 91–109 (2003)

    Article  Google Scholar 

  8. Taufer, M., An, C., Kerstens, A., Brooks, C.L.: Predictor@Home: a protein structure prediction supercomputer based on public-resource computing. In: Proceedings of 19th IEEE International Parallel and Distributed Processing Symposium, IEEE Computer Society Press, Los Alamitos (2005)

    Google Scholar 

  9. Rosetta@Home: Protein Folding, Design and Docking, http://boinc.bakerlab.org/rosetta/

  10. Shah, A.A., Barthel, D., Lukasiak, P., Blacewicz, J., Krasnogor, N.: Web & Grid Technologies in Bioinformatics, Computational Biology and Systems Biology. Current Bioinformatics (submitted)

    Google Scholar 

  11. BioSimGrid: A distributed database for biomolecular simulations, http://www.biosimgrid.org/#about

  12. Zhang, Y., Skolnick, J.: The protein structure prediction problem could be solved using the current PDB library. Proceedings of US National Academy of Science 102, 1029–1034 (2005)

    Article  Google Scholar 

  13. Susumu, D., Kazutoshi, F., Hideo, M., Haruki, N., Shinji, S.: An Empirical Study of Grid Applications in Life Science Lesson learnt from Biogrid project in Japan. International Journal of Information Technology 11, 16–28 (2005)

    Google Scholar 

  14. Chikenji, G., Fujitsuka, Y., Takada, S.: A reversible fragment assembly method for De Novo protein structure prediction. Journal of Chemical Physics 119, 6895–6903 (2003)

    Article  Google Scholar 

  15. Birnbaum, A., Hayes, J., Wilfred, W., et al.: Grid Workflow Software for a High-Throughput Proteome Annotation Pipeline. In: Konagaya, A., Satou, K. (eds.) LSGRID 2004. LNCS (LNBI), vol. 3370, pp. 68–81. Springer, Heidelberg (2005)

    Google Scholar 

  16. Herrera, J., Huedo, E., Montero, R.S., Lorente, I.M.: Benchmarking of a joint IRISGrid/EGEE Testbed with a Bioinformatics Application. Scalable Computing: Practice and Experience 7, 25–32 (2006)

    Google Scholar 

  17. Huedo, E., Ruben, S.M., Ignacio, M.L.: Adaptive Grid Scheduling of a High-Throughput Bioinformatics Application. In: Wyrzykowski, R., Dongarra, J.J., Paprzycki, M., Waśniewski, J. (eds.) Parallel Processing and Applied Mathematics. LNCS, vol. 3019, pp. 840–847. Springer, Heidelberg (2004)

    Google Scholar 

  18. Tanimura, Y., Aoi, K., Hiroyasu, T., Miki, M., Okamoto, Y., Dongarra, J.: Implementation of Protein Tertiary Structure Prediction System with NetSolve. In: Proceedings of the High Performance Computing and Grid in Asia Pacific Region, Seventh International Conference on (HPCAsia 2004) (2004)

    Google Scholar 

  19. Natrajan, A., Crowley, M., Wilkins, D.N., Humphrey, M.A., Fox, A.D., Grimshaw, A.S., Brooks, C.L.: Studying Protein Folding on the Grid: Experiences using CHARMM on NPACI Resources under Legion. In: Proceeding of the HPDC Conference, San Francisco, CA, USA (2001)

    Google Scholar 

  20. UK, B., Taufer, M., Stricker, T., Settanni, G., Cavalli, A.: Implementation and characterization of protein folding on a desktop computational grid - is CHARMM a suitable candidate for the united devices metaprocessor? In: Technical Report 385, ETH Zurich, Institute for Computer systems (2002)

    Google Scholar 

  21. Ferrari, C., Guerra, C., Zanotti, G.: A grid-aware approach to protein structure comparison. Journal of Parallel Distributed Computing 63, 728–737 (2003)

    Article  Google Scholar 

  22. Barthel, D., Hirst, J.D., Blacewicz, J., Krasnogor, N.: Procksi: a metaserver for protein comparison using kolmogorov and other similarity measures. In: Late Breaking Papers. European Conference on Computational Biology, Eilat, Israel (2006)

    Google Scholar 

  23. Park, S.J., Yamamura, M.: FROG (Fitted Rotation and Orientation of protein structure by means of real-coded Genetic algorithm): Asynchronous Parallelizing for Protein Structure-Based Comparison on the Basis of Geometrical Similarity. Genome Informatics 13, 344–345 (2002)

    Google Scholar 

  24. Park, S., Yamamura, J.: Two-layer Protein Structure Comparison. In: Proceedings of the 15th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2003) (2003)

    Google Scholar 

  25. Cannataro, M., Comin, M., Ferrari, C., Guerra, C., Guzzo, A., Veltri, P.: Modeling a Protein Structure Comparison Application on the Grid Using PROTEUS. In: Herrero, P., Pérez, M.S., Robles, V. (eds.) SAG 2004. LNCS, vol. 3458, pp. 75–85. Springer, Heidelberg (2005)

    Google Scholar 

  26. Zamboulis, L., Fan, H., Belhajjame, K., Siepen, J., et al.: Data Access and Integration in the ISPIDER Proteomics Grid. In: Leser, U., Naumann, F., Eckman, B. (eds.) DILS 2006. LNCS (LNBI), vol. 4075, pp. 3–18. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  27. Allan, R., Keegan, R., Meredith, D., Winn, M., Winter, G.: e-HTPX– HPC, Grid and Web-Portal Technologies in High Throughput Protein Crystallography. In: Proceedings of the UK e-Science All Hands Meeting, Nottingham, UK (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Parimala Thulasiraman Xubin He Tony Li Xu Mieso K. Denko Ruppa K. Thulasiram Laurence T. Yang

Rights and permissions

Reprints and permissions

Copyright information

© 2007 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shah, A.A., Barthel, D., Krasnogor, N. (2007). Grid and Distributed Public Computing Schemes for Structural Proteomics: A Short Overview. In: Thulasiraman, P., He, X., Xu, T.L., Denko, M.K., Thulasiram, R.K., Yang, L.T. (eds) Frontiers of High Performance Computing and Networking ISPA 2007 Workshops. ISPA 2007. Lecture Notes in Computer Science, vol 4743. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74767-3_44

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-74767-3_44

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-74766-6

  • Online ISBN: 978-3-540-74767-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics