Abstract
Protein structure prediction (PSP) is one of the most important problems in computational biology. This chapter introduces a novel hybrid Estimation of Distribution Algorithm (EDA) to solve the PSP problem on HP model. Firstly, a composite fitness function containing the information of folding structure core (H-Core) is introduced to replace the traditional fitness function of HP model. The new fitness function is expected to select better individuals for probabilistic model of EDA. Secondly, local search with guided operators is utilized to refine found solutions for improving efficiency of EDA. Thirdly, an improved backtracking-based repairing method is introduced to repair invalid individuals sampled by the probabilistic model of EDA. It can significantly reduce the number of backtracking searching operation and the computational cost for long sequence protein. Experimental results demonstrate that the new method outperforms the basic EDAs method. At the same time, it is very competitive with other existing algorithms for the PSP problem on lattice HP models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Armananzas, R., Inza, I., Santana, R., Saeys, Y., Flores, J.L., Lozano, J.A., Peer, Y.V., Blanco, R., Robles, V., Bielza, C., Larranaga, P.: A review of estimation of distribution algorithms in bioinformatics. BioData Mining I(6), 1–12 (2008)
Baluja, S.: Population-based incremental learning: A method for integrating genetic search based function optimization and competitive learning. Tech. Rep. CMU-CS-94C163, Carnegie Mellon University (1994)
Baluja, S., Davies, S.: Using optimal dependency-trees for combinatorial optimization: Learning the structure of the search space. In: Proc. the 14th International Conference on Machine Learning, pp. 30–38 (1997)
Berger, B., Leight, T.: Protein folding in the hydrophobic-hydrophilic (HP) model is NP-complete. Journal of Computational Biology 5(1), 27–40 (1998)
Chen, B., Li, L., Hu, J.: An improved backtracking method for EDAs based protein folding. In: Proc. of ICROS-SICE International Joint Conference 2009, Fukuoka Japan, pp. 4669–4673 (2009)
Chen, B., Li, L., Hu, J.: A novel EDAs based method for HP model protein folding. In: Proc. 2009 IEEE Congress on Evolutionary Computation (CEC 2009), Trondheim Norway, pp. 309–315 (2009)
Cotta, C.: Protein structure prediction using evolutionary algorithms hybridized with backtracking. In: Mira, J. (ed.) Evolutionary Computation in Bioinformatics, pp. 321–328. Springer, Berlin (2003)
Crescenzi, P., Goldman, D., Papadimitriou, C., Piccolboni, A., Yannakakis, M.: On the complexity of protein folding. Journal of Computational Biology 5(3), 423–466 (1998)
Flores, S., Smith, J.: Study of fitness landscapes for the HP model of protein structure prediction. In: Proc. 2003 IEEE Congress on Evolutionary Computation (CEC 2003), Trondheim Norway, pp. 2338–2345 (2003)
Goldberg, D.E.: The Design of Innovation: Genetic Algorithms and Evolutionary Computation. Kluwer Academic Publishers, Dordrecht (2002)
Greenwood, G.W., Shin, J.M.: On the evolutionary search for solutions to the protein folding problem. In: Fogel, G.B. (ed.) Artificial Neural Nets Problem Methods, pp. 115–136. Elsevier Science and Technology Books, Amsterdam (2002)
Harik, G.R., Lobo, F.G., Goldberg, D.E.: The compact genetic algorithm. IEEE Transactions on Evolutionary Computation 3(4), 287–297 (1999)
Hoque, T., Chetty, M., Dooley, L.S.: A guided genetic algorithm for protein folding prediction using 3D hydrophobic-hydrophilic model. In: Proc. 2006 IEEE Congress on Evolutionary Computation (CEC 2006), pp. 2339–2346 (2006)
Jeremy, S.D., Charles Jr., L.I., Paul, V.: Mimic: Finding optima by estimating probability densities. In: Advances in Neural Information Processing Systems, p. 424. The MIT Press, Cambridge (1996)
Krasnogor, N., Hart, W.E., Smith, J., Pelta, D.A.: Protein structure prediction with evolutionary algorithms. In: Proc. of Genetic Evol.Comput. Conf., Orlando, pp. 1596–1601 (1999)
Larranaga, P., Lozano, J.A.: Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation. Kluwer Academic, Dordrecht (2002)
Lau, K.F., Dill, K.A.: A lattice statistical mechanics model of the conformational and sequence spaces of proteins. Marcromolecules 22(10), 3986–3997 (1989)
Lozano, J.A., Larranaga, P., Inza, I., Bengoetxea, E.: Towards a New Evolutionary Computation: Advances on Estimation of Distribution Algorithms. Springer, Berlin (2006)
Mendiburu, A., Lozano, J.A., Miguel-Alonso, J.: Parallel implementation of EDAs based on probabilistic graphical models. IEEE Transactions on Evolutionary Computation 9(4), 406–423 (2005)
Muhlenbein, H., Mahnig, T., Rodriguez, A.O.: Schemata, distributions and graphical models in evolutionary optimization. Journal of Heuristics 5(2), 213–247 (1999)
Muhlenbein, H., PaaB, G.: From recombination of genes to the estimation of distributions i. binary parameters. In: Ebeling, W., Rechenberg, I., Voigt, H.-M., Schwefel, H.-P. (eds.) PPSN 1996. LNCS, vol. 1141, pp. 178–187. Springer, Heidelberg (1996)
Pelikan, M., Goldber, D., Lobo, F.: A survey of optimization by building and using probabilistic models. Computational Optimization and Applications 21, 5–20 (2002)
Pelikan, M., Goldberg, D.E., Cantu-Paz, E.: Boa: The bayesian optimization algorithm, pp. 525–532. Morgan Kaufmann, San Francisco (1999)
Santana, R., Larranaga, P., Lozano, J.A.: Protein folding in simplified models with estimation of distribution algorithms. IEEE Transactions on Evolutionary Computation 12(4), 418–438 (2008)
Santana, R., Ponce, L., Ochoa, A.: The edge incident model. In: Proc. the Second Symposium on Artificial Intelligence (CIMAF 1999), pp. 352–359 (1999)
Song, J., Cheng, J., Zheng, T., Mao, J.: A novel genetic algorithm for HP model protein folding. In: Proc. of 6th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2005), pp. 935–937 (2005)
Unger, R., Moult, J.: Genetic algorithms for protein folding simulations. Journal of Molecular Biology 231(1), 75–81 (1993)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Chen, B., Hu, J. (2010). Protein Structure Prediction Based on HP Model Using an Improved Hybrid EDA. In: Chen, Yp. (eds) Exploitation of Linkage Learning in Evolutionary Algorithms. Evolutionary Learning and Optimization, vol 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12834-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-12834-9_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-12833-2
Online ISBN: 978-3-642-12834-9
eBook Packages: EngineeringEngineering (R0)