Abstract
We discuss applicability of Principal Component Analysis and Particle Swarm Optimization in protein tertiary structure prediction. The proposed algorithm is based on establishing a low-dimensional space where the sampling (and optimization) is carried out via Particle Swarm Optimizer (PSO). The reduced space is found via Principal Component Analysis (PCA) performed for a set of previously found low-energy protein models. A high frequency term is added into this expansion by projecting the best decoy into the PCA basis set and calculating the residual model. Our results show that PSO improves the energy of the best decoy used in the PCA considering an adequate number of PCA terms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Zhang, Y.: Progress and challenges in protein structure prediction. Curr. Opin. Struct. Biol. 18, 342–348 (2008)
Bonneau, R., Strauss, C.E., Rohl, C.A., Chivian, D., Bradley, P., Malmstrom, L., Robertson, T., Baker, D.: De novo prediction of three-dimensional structures for major protein families. J. Mol. Biol. 322, 65–78 (2002)
Bradley, P., Chivian, D., Meiler, J., Misura, K., Rohl, C., Schief, W.W.W., Schueler-Furman, O., Murphy, P., Schonbrun, J., Rosetta predictions in: CASP5: successes, failures, and prospects for complete automation. Proteins 53, 457–468 (2003)
Chivian, D., Kim, D.E., Malmstrom, L., Bradley, P., Robertson, T., Murphy, P., Strauss, C.E., Bonneau, R., Rohl, C.A., Baker, D.: Automated prediction of CASP-5 structures using the Robetta server. Proteins 53, 524–533 (2003)
Sen, T.Z., Feng, Y., Garcia, J.V., Kloczkowski, A., Jernigan, R.L.: The extent of cooperativity of protein motions observed with elastic network models is similar for atomic and coarser-grained models. J. Chem. Theory Comput. 2, 696–704 (2006)
Gniewek, P., Kolinski, A., Jernigan, R.L., Kloczkowski, A.: Elastic network normal modes provide a basis for protein structure refinement. J. Chem. Phys. 136, 195101 (2012)
Fernández-Martínez, J.L.: Model reduction and uncertainty analysis in inverse problems. Lead. Edge 34, 1006–1016 (2015)
Price, S.L.: From crystal structure prediction to polymorph prediction: interpreting the crystal energy landscape. Phys. Chem. Chem. Phys. 10, 1996–2009 (2008)
Fernández-Martínez, J.L., et al.: On the topography of the cost functional in linear and nonlinear inverse problems. Geophysics 77, W1–W15 (2012)
Fernández-Martínez, J.L., García-Gonzale, E.: Stochastic stability analysis of the linear continuous and discrete PSO models. Trans. Evol. Comp. 15, 405–423 (2011)
Fernández-Martínez, J.L., García-Gonzalo, E.: Stochastic stability and numerical analysis of two novel algorithms of the PSO family: PP-PSO and RR-PSO. Int. J. Artif. Intell. Tools 21, 1240011 (2012)
Jolliffe, I.T.: Principal Component Analysis. Springer, Heidelberg (2002). https://doi.org/10.1007/b98835
Kennedy, J., Eberhart, R.: A new optimizers using particle swarm theory. In: Proceedings of Sixth International Symposium Micromachine Human Science, vol. 1, pp. 39–46 (1995)
Fernández-Martínez, J.L., García-Gonzalo, E.: The generalized PSO a new door to PSO evolution. J. Artif. Evol. Appl. 2008, 861275 (2008)
Fernández-Martínez, J.L., García-Gonzalo, E.: The PSO family: deduction, stochastic analysis and comparison. Swarm Intell 3, 245–273 (2009)
Gront, D., Kolinski, A.: BioShell – A package of tools for structural biology prediction. Bioinformatics 22, 621–622 (2006)
Gront, D., Kolinski, A.: Utility library for structural bioinformatics. Bioinformatics 24, 584–585 (2008)
Gniewek, P., Kolinski, A., Jernigan, R.L., Kloczkowski, A.: BioShell - threading: a versatile monte carlo package for protein threading. BMC Bioinform. 22, Article no. 22 (2014)
Aramini, J.M., et al.: Solution NMR structure of a putative Uracil DNA glycosylase from Methanosarcina acetivorans. Northeast Structural Genomics Consortium Target MvR76 (2010)
Ramelot, T.A., et al.: Solution NMR structure of the PBS linker Polypeptide domain (fragment 254-400) of Phycobilisome linker protein ApcE from Synechocystis sp. PCC 6803. Northeast Structural Genomics Consortium Target SgR209C
Eletsky, A., et al.: Solution NMR structure of the N-terminal domain of putative ATP-dependent DNA Helicase RecG-related Protein from Nitrosomonas europaea. Northeast Structural Genomics Consortium Target NeR70A (2010)
Heidebrecht, T., et al.: The structural basis for recognition of J-base containing DNA by a Novel DNA-binding domain in JBP1. Northeast Structural Genomics Consortium and others (2010)
Cuff, M.E., et al.: The lactose-specific IIB component domain structure of the phosphoenolpyruvate: carbohydrate phosphotransferase system (PTS) from Streptococcus pneumoniae. Midwest Center for Structural Genomics Target TIGR4 (2010)
Ramagopal, U.A. et al.: Structure of putative HAD superfamily (subfamily III A) hydrolase from Legionella pneumophila. 3N1U, New York Structural Genomics Research Center Target (2010)
Oke, M., et al.: Crystal structure of the hypothetical protein PA0856 from Pseudomonas Aeruginosa. Joint Center for Structural Genomics NP_249547.1 (2010)
Zhang, R., et al.: The crystal structure of functionally unknown protein from Neisseria Meningitidis MC58. Midwest Center for Structural Genomics Target 3NYM (2008)
Forouhar, F., et al.: Crystal structure of the N-terminal domain of DNA-binding protein SATB1 from Homo Sapiens. Northeast Structural Genomics Consortium Target HR4435B (2010)
Acknowledgements
A. K. acknowledges financial support from NSF grant DBI 1661391 and from The Research Institute at Nationwide Children’s Hospital.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG, part of Springer Nature
About this paper
Cite this paper
Álvarez, Ó., Fernández-Martínez, J.L., Fernández-Brillet, C., Cernea, A., Fernández-Muñiz, Z., Kloczkowski, A. (2018). On the Use of Principal Component Analysis and Particle Swarm Optimization in Protein Tertiary Structure Prediction. In: Rutkowski, L., Scherer, R., Korytkowski, M., Pedrycz, W., Tadeusiewicz, R., Zurada, J. (eds) Artificial Intelligence and Soft Computing. ICAISC 2018. Lecture Notes in Computer Science(), vol 10842. Springer, Cham. https://doi.org/10.1007/978-3-319-91262-2_10
Download citation
DOI: https://doi.org/10.1007/978-3-319-91262-2_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-91261-5
Online ISBN: 978-3-319-91262-2
eBook Packages: Computer ScienceComputer Science (R0)