Skip to main content

Protein Structure Prediction Based on HP Model Using an Improved Hybrid EDA

  • Chapter
Exploitation of Linkage Learning in Evolutionary Algorithms

Part of the book series: Evolutionary Learning and Optimization ((ALO,volume 3))

Abstract

Protein structure prediction (PSP) is one of the most important problems in computational biology. This chapter introduces a novel hybrid Estimation of Distribution Algorithm (EDA) to solve the PSP problem on HP model. Firstly, a composite fitness function containing the information of folding structure core (H-Core) is introduced to replace the traditional fitness function of HP model. The new fitness function is expected to select better individuals for probabilistic model of EDA. Secondly, local search with guided operators is utilized to refine found solutions for improving efficiency of EDA. Thirdly, an improved backtracking-based repairing method is introduced to repair invalid individuals sampled by the probabilistic model of EDA. It can significantly reduce the number of backtracking searching operation and the computational cost for long sequence protein. Experimental results demonstrate that the new method outperforms the basic EDAs method. At the same time, it is very competitive with other existing algorithms for the PSP problem on lattice HP models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Armananzas, R., Inza, I., Santana, R., Saeys, Y., Flores, J.L., Lozano, J.A., Peer, Y.V., Blanco, R., Robles, V., Bielza, C., Larranaga, P.: A review of estimation of distribution algorithms in bioinformatics. BioData Mining I(6), 1–12 (2008)

    Google Scholar 

  2. Baluja, S.: Population-based incremental learning: A method for integrating genetic search based function optimization and competitive learning. Tech. Rep. CMU-CS-94C163, Carnegie Mellon University (1994)

    Google Scholar 

  3. Baluja, S., Davies, S.: Using optimal dependency-trees for combinatorial optimization: Learning the structure of the search space. In: Proc. the 14th International Conference on Machine Learning, pp. 30–38 (1997)

    Google Scholar 

  4. Berger, B., Leight, T.: Protein folding in the hydrophobic-hydrophilic (HP) model is NP-complete. Journal of Computational Biology 5(1), 27–40 (1998)

    Article  Google Scholar 

  5. Chen, B., Li, L., Hu, J.: An improved backtracking method for EDAs based protein folding. In: Proc. of ICROS-SICE International Joint Conference 2009, Fukuoka Japan, pp. 4669–4673 (2009)

    Google Scholar 

  6. Chen, B., Li, L., Hu, J.: A novel EDAs based method for HP model protein folding. In: Proc. 2009 IEEE Congress on Evolutionary Computation (CEC 2009), Trondheim Norway, pp. 309–315 (2009)

    Google Scholar 

  7. Cotta, C.: Protein structure prediction using evolutionary algorithms hybridized with backtracking. In: Mira, J. (ed.) Evolutionary Computation in Bioinformatics, pp. 321–328. Springer, Berlin (2003)

    Google Scholar 

  8. Crescenzi, P., Goldman, D., Papadimitriou, C., Piccolboni, A., Yannakakis, M.: On the complexity of protein folding. Journal of Computational Biology 5(3), 423–466 (1998)

    Article  Google Scholar 

  9. Flores, S., Smith, J.: Study of fitness landscapes for the HP model of protein structure prediction. In: Proc. 2003 IEEE Congress on Evolutionary Computation (CEC 2003), Trondheim Norway, pp. 2338–2345 (2003)

    Google Scholar 

  10. Goldberg, D.E.: The Design of Innovation: Genetic Algorithms and Evolutionary Computation. Kluwer Academic Publishers, Dordrecht (2002)

    MATH  Google Scholar 

  11. Greenwood, G.W., Shin, J.M.: On the evolutionary search for solutions to the protein folding problem. In: Fogel, G.B. (ed.) Artificial Neural Nets Problem Methods, pp. 115–136. Elsevier Science and Technology Books, Amsterdam (2002)

    Google Scholar 

  12. Harik, G.R., Lobo, F.G., Goldberg, D.E.: The compact genetic algorithm. IEEE Transactions on Evolutionary Computation 3(4), 287–297 (1999)

    Article  Google Scholar 

  13. Hoque, T., Chetty, M., Dooley, L.S.: A guided genetic algorithm for protein folding prediction using 3D hydrophobic-hydrophilic model. In: Proc. 2006 IEEE Congress on Evolutionary Computation (CEC 2006), pp. 2339–2346 (2006)

    Google Scholar 

  14. Jeremy, S.D., Charles Jr., L.I., Paul, V.: Mimic: Finding optima by estimating probability densities. In: Advances in Neural Information Processing Systems, p. 424. The MIT Press, Cambridge (1996)

    Google Scholar 

  15. Krasnogor, N., Hart, W.E., Smith, J., Pelta, D.A.: Protein structure prediction with evolutionary algorithms. In: Proc. of Genetic Evol.Comput. Conf., Orlando, pp. 1596–1601 (1999)

    Google Scholar 

  16. Larranaga, P., Lozano, J.A.: Estimation of Distribution Algorithms. A New Tool for Evolutionary Computation. Kluwer Academic, Dordrecht (2002)

    Google Scholar 

  17. Lau, K.F., Dill, K.A.: A lattice statistical mechanics model of the conformational and sequence spaces of proteins. Marcromolecules 22(10), 3986–3997 (1989)

    Article  Google Scholar 

  18. Lozano, J.A., Larranaga, P., Inza, I., Bengoetxea, E.: Towards a New Evolutionary Computation: Advances on Estimation of Distribution Algorithms. Springer, Berlin (2006)

    MATH  Google Scholar 

  19. Mendiburu, A., Lozano, J.A., Miguel-Alonso, J.: Parallel implementation of EDAs based on probabilistic graphical models. IEEE Transactions on Evolutionary Computation 9(4), 406–423 (2005)

    Article  Google Scholar 

  20. Muhlenbein, H., Mahnig, T., Rodriguez, A.O.: Schemata, distributions and graphical models in evolutionary optimization. Journal of Heuristics 5(2), 213–247 (1999)

    Article  Google Scholar 

  21. Muhlenbein, H., PaaB, G.: From recombination of genes to the estimation of distributions i. binary parameters. In: Ebeling, W., Rechenberg, I., Voigt, H.-M., Schwefel, H.-P. (eds.) PPSN 1996. LNCS, vol. 1141, pp. 178–187. Springer, Heidelberg (1996)

    Chapter  Google Scholar 

  22. Pelikan, M., Goldber, D., Lobo, F.: A survey of optimization by building and using probabilistic models. Computational Optimization and Applications 21, 5–20 (2002)

    Article  MATH  MathSciNet  Google Scholar 

  23. Pelikan, M., Goldberg, D.E., Cantu-Paz, E.: Boa: The bayesian optimization algorithm, pp. 525–532. Morgan Kaufmann, San Francisco (1999)

    Google Scholar 

  24. Santana, R., Larranaga, P., Lozano, J.A.: Protein folding in simplified models with estimation of distribution algorithms. IEEE Transactions on Evolutionary Computation 12(4), 418–438 (2008)

    Article  Google Scholar 

  25. Santana, R., Ponce, L., Ochoa, A.: The edge incident model. In: Proc. the Second Symposium on Artificial Intelligence (CIMAF 1999), pp. 352–359 (1999)

    Google Scholar 

  26. Song, J., Cheng, J., Zheng, T., Mao, J.: A novel genetic algorithm for HP model protein folding. In: Proc. of 6th International Conference on Parallel and Distributed Computing, Applications and Technologies (PDCAT 2005), pp. 935–937 (2005)

    Google Scholar 

  27. Unger, R., Moult, J.: Genetic algorithms for protein folding simulations. Journal of Molecular Biology 231(1), 75–81 (1993)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Chen, B., Hu, J. (2010). Protein Structure Prediction Based on HP Model Using an Improved Hybrid EDA. In: Chen, Yp. (eds) Exploitation of Linkage Learning in Evolutionary Algorithms. Evolutionary Learning and Optimization, vol 3. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-12834-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-12834-9_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-12833-2

  • Online ISBN: 978-3-642-12834-9

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics