Abstract
The number of natural protein sequences is infinitely small as compared to the number of proteins theoretically possible. Thus, a huge number of protein sequences, defined as “never born proteins” or NBPs, have never been observed in nature. The study of the structural and functional properties of NBPs represents a way to improve our knowledge on the fundamental properties that make existing protein sequences so unique. Protein structure prediction tools combined with the use of large computing resources allow to tackle this problem. The study of NBPs requires the generation of a large library of non-natural protein sequences (105–107) and the prediction of their three-dimensional structure. On a single CPU it would require years to predict the structure of such a library of protein sequences. However, this is an embarrassingly parallel problem in which the same computation must be repeated several times and the use of grid infrastructures makes feasible to approach this problem in an acceptable time frame. Here we describe the set up of a simulation environment within the EUChinaGRID[1] infrastructure that allows non expert users to exploit grid resources for large-scale proteins structure prediction.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
The EUChinaGRID project, http://www.euchinagrid.org
Rohl, C.A., Strauss, C.E., Misura, K.M., Baker, D.: Protein structure prediction using Rosetta. Methods Enzymol. 383, 66–93 (2004)
GENIUS Grid Portal, https://genius.ct.infn.it/
Matsumoto, M., Nishimura, T.: Mersenne Twister: A 623-dimensionally equidistributed uniform pseudo-random number generator. ACM Transactions on Modeling and Computer Simulation 8, 3–30 (1998)
Source code for MT19937 available at the URL: http://www.math.sci.hiroshima-u.ac.jp/ ∼m-mat/MT/emt.html
Altschul, S.F., Gish, W., Miller, W., Myers, E.W., Lipman, D.J.: Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990)
Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Bourne, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)
McGuffin, L.J., Bryson, K., Jones, D.T.: The PSIPRED protein structure prediction server. Bioinformatics 16, 404–405 (2000)
National Center for Biotechnology Information, http://www.ncbi.nlm.nih.gov
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Rocca, G.L., Minervini, G., Evangelista, G., Luisi, P.L., Polticelli, F. (2009). Porting Biological Applications in Grid: An Experience within the EUChinaGRID Framework. In: Lin, S.C., Yen, E. (eds) Grid Computing. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-78417-5_6
Download citation
DOI: https://doi.org/10.1007/978-0-387-78417-5_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-78416-8
Online ISBN: 978-0-387-78417-5
eBook Packages: Computer ScienceComputer Science (R0)