Skip to main content
Log in

EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou’s PseAAC

  • Published:
Journal of Computer-Aided Molecular Design Aims and scope Submit manuscript

Abstract

The function of a protein is generally related to its subcellular localization. Therefore, knowing its subcellular localization is helpful in understanding its potential functions and roles in biological processes. This work develops a hybrid method for computationally predicting the subcellular localization of eukaryotic protein. The method is called EuLoc and incorporates the Hidden Markov Model (HMM) method, homology search approach and the support vector machines (SVM) method by fusing several new features into Chou’s pseudo-amino acid composition. The proposed SVM module overcomes the shortcoming of the homology search approach in predicting the subcellular localization of a protein which only finds low-homologous or non-homologous sequences in a protein subcellular localization annotated database. The proposed HMM modules overcome the shortcoming of SVM in predicting subcellular localizations using few data on protein sequences. Several features of a protein sequence are considered, including the sequence-based features, the biological features derived from PROSITE, NLSdb and Pfam, the post-transcriptional modification features and others. The overall accuracy and location accuracy of EuLoc are 90.5 and 91.2 %, respectively, revealing a better predictive performance than obtained elsewhere. Although the amounts of data of the various subcellular location groups in benchmark dataset differ markedly, the accuracies of 12 subcellular localizations of EuLoc range from 82.5 to 100 %, indicating that this tool is much more balanced than other tools. EuLoc offers a high, balanced predictive power for each subcellular localization. EuLoc is now available on the web at http://euloc.mbc.nctu.edu.tw/.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  1. Nakai K (2000) Adv Protein Chem 54:277

    Article  CAS  Google Scholar 

  2. Chou KC, Shen HB (2007) Anal Biochem 370:1

    Article  CAS  Google Scholar 

  3. Chou KC (2011) J Theor Biol 273:236

    Article  CAS  Google Scholar 

  4. Emanuelsson O, Nielsen H, Brunak S, von Heijne G (2000) J Mol Biol 300:1005

    Article  CAS  Google Scholar 

  5. Nair R, Rost B (2003) Proteins 53:917

    Article  CAS  Google Scholar 

  6. Park KJ, Kanehisa M (2003) 19:1656

  7. Scott MS, Thomas DY, Hallett MT (2004) Genome Res 14:1957

    Article  CAS  Google Scholar 

  8. Bhasin M, Garg A, Raghava GP (2005) Bioinformatics 21:2522

    Article  CAS  Google Scholar 

  9. Gardy JL, Laird MR, Chen F, Rey S, Walsh CJ, Ester M, Brinkman FS (2005) Bioinformatics 21:617

    Article  CAS  Google Scholar 

  10. Xie D, Li A, Wang M, Fan Z, Feng H (2005) Nucleic Acids Res 33:W105

    Article  CAS  Google Scholar 

  11. Guda C (2006) Nucleic Acids Res 34:W210

    Article  CAS  Google Scholar 

  12. Hoglund A, Donnes P, Blum T, Adolph HW, Kohlbacher O (2006) Bioinformatics 22:1158

    Article  Google Scholar 

  13. Pierleoni A, Martelli PL, Fariselli P, Casadio R (2006) Bioinformatics 22(14):E408

    Google Scholar 

  14. Yu CS, Chen YC, Lu CH, Hwang JK (2006) Proteins 64:643

    Article  CAS  Google Scholar 

  15. Shatkay H, Hoglund A, Brady S, Blum T, Donnes P, Kohlbacher O (2007) Bioinformatics 23:1410

    Article  CAS  Google Scholar 

  16. Chang JM, Su EC, Lo A, Chiu HS, Sung TY, Hsu WL (2008) Proteins 72(2):693

  17. Fyshe A, Liu Y, Szafron D, Greiner R, Lu P (2008) Bioinformatics 24:2512

    Article  CAS  Google Scholar 

  18. Garg A, Raghava GP (2008) BMC Bioinform 9:503

    Article  Google Scholar 

  19. Huang WL, Tung CW, Ho SW, Hwang SF, Ho SY (2008) BMC Bioinform 9:80

    Article  Google Scholar 

  20. Nasibov E, Kandemir-Cavas C (2008) Comput Biol Chem 32:448

    Article  CAS  Google Scholar 

  21. Chou KC, Shen HB (2008) Nat Protoc 3:153

    Article  CAS  Google Scholar 

  22. Chou KC, Shen HB (2007) J Proteome Res 6:1728

    Article  CAS  Google Scholar 

  23. Shen HB, Chou KC (2007) Biochem Biophys Res Commun 355:1006

    Article  CAS  Google Scholar 

  24. Chou KC, Shen HB (2007) J Cell Biochem 100:665

    Article  CAS  Google Scholar 

  25. Shen HB, Chou KC (2007) Protein Eng Des Sel 20:39

    Article  CAS  Google Scholar 

  26. Chou KC, Shen HB (2006) J Proteome Res 5:3420

    Article  CAS  Google Scholar 

  27. Shen HB, Chou KC (2007) Biopolymers 85:233

    Article  CAS  Google Scholar 

  28. Nakashima H, Nishikawa K (1994) J Mol Biol 238:54

    Article  CAS  Google Scholar 

  29. Chou KC, Elrod DW (1999) Protein Eng 12:107

    Article  CAS  Google Scholar 

  30. Chou KC, Cai YD (2002) J Biol Chem 277:45765

    Article  CAS  Google Scholar 

  31. Chou KC (2001) Proteins 43:246

    Article  CAS  Google Scholar 

  32. Zhou GP, Doctor K (2003) Proteins 50:44

    Article  CAS  Google Scholar 

  33. Chou KC, Wu ZC, Xiao X (2011) PLoS ONE 6:e18258

    Article  CAS  Google Scholar 

  34. Wu ZC, Xiao X, Chou KC (2012) Protein Pept Lett 19:4

    Article  CAS  Google Scholar 

  35. Chou KC, Wu ZC, Xiao X (2012) Mol BioSyst 8:629

    Article  CAS  Google Scholar 

  36. Wu ZC, Xiao X, Chou KC (2011) Mol BioSyst 7:3287

    Article  CAS  Google Scholar 

  37. Xiao X, Wu ZC, Chou KC (2011) J Theor Biol 284:42

    Article  CAS  Google Scholar 

  38. Mei S (2012) J Theor Biol 310:80

    Article  CAS  Google Scholar 

  39. Xiao X, Wu ZC, Chou KC (2011) PLoS ONE 6:e20592

    Article  CAS  Google Scholar 

  40. Lee TY, Chen YJ, Lu CT, Ching WC, Teng YC, Huang HD (2012) Bioinformatics 28:2293

    Article  CAS  Google Scholar 

  41. Lee TY, Lin ZQ, Hsieh SJ, Bretana NA, Lu CT (2011) Bioinformatics 27:1780

    Article  CAS  Google Scholar 

  42. Lee TY, Chen YJ, Lu TC, Huang HD (2011) PLoS ONE 6:e21849

    Article  CAS  Google Scholar 

  43. Lee TY, Bretana NA, Lu CT (2011) BMC Bioinformatics 12:261

    Article  CAS  Google Scholar 

  44. Lee TY, Bo-Kai Hsu J, Chang WC, Huang HD (2011) Nucleic Acids Res 39:D777

    Article  Google Scholar 

  45. Lee TY, Hsu JB, Lin FM, Chang WC, Hsu PC, Huang HD (2010) J Comput Chem 31:2759

    Article  CAS  Google Scholar 

  46. Wong YH, Lee TY, Liang HK, Huang CM, Wang TY, Yang YH, Chu CH, Huang HD, Ko MT, Hwang JK (2007) Nucleic Acids Res 35:W588

    Article  Google Scholar 

  47. Huang HD, Lee TY, Tzeng SW, Horng JT (2005) Nucleic Acids Res 33:W226

    Article  CAS  Google Scholar 

  48. Qiu JD, Huang JH, Shi SP, Liang RP (2010) Protein Pept Lett 17:715

    Article  CAS  Google Scholar 

  49. Chen C, Shen ZB, Zou XY (2012) Protein Pept Lett 19:422

    Article  CAS  Google Scholar 

  50. Gu Q, Ding YS, Zhang TL (2010) Protein Pept Lett 17:559

    Article  CAS  Google Scholar 

  51. Li LQ, Zhang Y, Zou LY, Zhou Y, Zheng XQ (2012) Protein Pept Lett 19:375

    Article  CAS  Google Scholar 

  52. Zia Ur R, Khan A (2012) Protein Pept Lett 19:890

    Article  Google Scholar 

  53. Mohabatkar H, Mohammad Beigi M, Esmaeili A (2011) J Theor Biol 281:18

    Article  CAS  Google Scholar 

  54. Zeng YH, Guo YZ, Xiao RQ, Yang L, Yu LZ, Li ML (2009) J Theor Biol 259:366

    Article  CAS  Google Scholar 

  55. Chen C, Chen L, Zou X, Cai P (2009) Protein Pept Lett 16:27

    Article  Google Scholar 

  56. Ding H, Luo LF, Lin H (2009) Protein Pept Lett 16:351

    Article  CAS  Google Scholar 

  57. Zhou XB, Chen C, Li ZC, Zou XY (2007) J Theor Biol 248:546

    Article  CAS  Google Scholar 

  58. Georgiou DN, Karakasidis TE, Nieto JJ, Torres A (2009) J Theor Biol 257:17

    Article  CAS  Google Scholar 

  59. Yu LZ, Guo YZ, Li YZ, Li GB, Li ML, Luo JS, Xiong WJ, Qin WL (2010) J Theor Biol 267:1

    Article  CAS  Google Scholar 

  60. Jiang XY, Wei R, Zhang TL, Gu Q (2008) Protein Pept Lett 15:392

    Article  CAS  Google Scholar 

  61. Li FM, Li QZ (2008) Protein Pept Lett 15:612

    Article  Google Scholar 

  62. Lin H, Ding H, Guo FB, Zhang AY, Huang J (2008) Protein Pept Lett 15:739

    Article  CAS  Google Scholar 

  63. Zhang GY, Li HC, Gao JQ, Fang BS (2008) Protein Pept Lett 15:1132

    Article  CAS  Google Scholar 

  64. Han L, Cui J, Lin H, Ji Z, Cao Z, Li Y, Chen Y (2006) Proteomics 6:4023

    Article  CAS  Google Scholar 

  65. Veropoulos K, Cristianini N, Campbell C (1999) Proceedings of the international joint conference on artificial intelligence (IJCAI99), workshop ML3, p 55

  66. Nair R, Rost B (2002) Protein Sci 11:2836

    Article  CAS  Google Scholar 

  67. Nielsen H, Engelbrecht J, von Heijne G, Brunak S (1996) Proteins 24:165

    Article  CAS  Google Scholar 

  68. Chou KC, Shen HB (2010) PLoS ONE 5:e9931

    Article  Google Scholar 

  69. Chou KC, Shen HB (2010) PLoS ONE 5:e11335

    Article  Google Scholar 

  70. UniProt C (2008) Nucleic Acids Res 36(Database issue):D190

  71. Boeckmann B, Blatter MC, Famiglietti L, Hinz U, Lane L, Roechert B, Bairoch A (2005) C R Biol 328:882

    Article  CAS  Google Scholar 

  72. Esmaeili M, Mohabatkar H, Mohsenzadeh S (2010) J Theor Biol 263:203

    Article  CAS  Google Scholar 

  73. Mohabatkar H (2010) Protein Pept Lett 17:1207

    Article  CAS  Google Scholar 

  74. Lin H (2008) J Theor Biol 252:350

    Article  CAS  Google Scholar 

  75. Chou KC (2009) Curr Proteomics 6:262

    Article  CAS  Google Scholar 

  76. Carrie C, Giraud E, Whelan J (2009) FEBS J 276:1187

    Article  CAS  Google Scholar 

  77. Millar AH, Whelan J, Small I (2006) Curr Opin Plant Biol 9:610

    Article  CAS  Google Scholar 

  78. Bannai H, Tamada Y, Maruyama O, Nakai K, Miyano S (2002) Bioinformatics 18:298

    Article  CAS  Google Scholar 

  79. von Heijne G (1990) Curr Opin Cell Biol 2:604

    Article  Google Scholar 

  80. Hurtley SM (1996) Protein targeting. Oxford University Press, Oxford

  81. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Nucleic Acids Res 25:3389

    Article  CAS  Google Scholar 

  82. Bateman A, Birney E, Cerruti L, Durbin R, Etwiller L, Eddy SR, Griffiths-Jones S, Howe KL, Marshall M, Sonnhammer ELL (2002) Nucleic Acids Res 30:276

    Article  CAS  Google Scholar 

  83. Sigrist CJA, Cerutti L, Hulo N, Gattiker A, Falquet L, Pagni M, Bairoch A, Bucher P (2002) Briefings Bioinform 3:265

    Article  CAS  Google Scholar 

  84. Nair R, Carter P, Rost B (2003) Nucleic Acids Res 31:397

    Article  CAS  Google Scholar 

  85. Li ZR, Lin HH, Han LY, Jiang L, Chen X, Chen YZ (2006) Nucleic Acids Res 34:W32

    Article  CAS  Google Scholar 

  86. Solito E, Christian HC, Festa M, Mulla A, Tierney T, Flower RJ, Buckingham JC (2006) Faseb J 20:1498

    Article  CAS  Google Scholar 

  87. Jensen LJ, Gupta R, Blom N, Devos D, Tamames J, Kesmir C, Nielsen H, Staerfeldt HH, Rapacki K, Workman C, Andersen CA, Knudsen S, Krogh A, Valencia A, Brunak S (2002) J Mol Biol 319:1257

    Article  CAS  Google Scholar 

  88. Mizushima S (1984) Mol Cell Biochem 60:5

    Google Scholar 

  89. Eichler J (2001) Eur J Biochem 268:4366

    Article  CAS  Google Scholar 

  90. Pal-Bhowmick I, Vora HK, Jarori GK (2007) Malar J 6:45

    Article  Google Scholar 

  91. Kiemer L, Bendtsen JD, Blom N (2005) Bioinformatics 21(7):1269

    Google Scholar 

  92. Shien DM, Lee TY, Chang WC, Hsu JB, Horng JT, Hsu PC, Wang TY, Huang HD (2009) J Comput Chem 30(9):1532

    Google Scholar 

  93. Gupta R, Jung E, Brunak S (2004) [online] Available http://www.cbs.dtu.dk/services/NetNGlyc/

  94. Hansen JE, Lund O, Tolstrup N, Gooley AA, Williams KL, Brunak S (1998) Glycoconj J 15:115

    Article  CAS  Google Scholar 

  95. Blom N, Gammeltoft S, Brunak S (1999) J Mol Biol 294:1351

    Article  CAS  Google Scholar 

  96. Chang WC, Lee TY, Shien DM, Hsu JB, Horng JT, Hsu PC, Wang TY, Huang HD, Pan RL (2009) J Comput Chem 30(15):2526

    Google Scholar 

  97. Eddy SR (1998) Bioinformatics 14:755

    Article  CAS  Google Scholar 

  98. Chang CC, Lin CJ (2001) Software available at http://www. csie. ntu. edu. tw/cjlin/libsvm 80:604

  99. Zakeri P, Moshiri B, Sadeghi M (2011) J Theor Biol 269:208

    Article  CAS  Google Scholar 

  100. Nanni L, Lumini A, Gupta D, Garg A (2011) IEEE/ACM Trans Comput Biol Bioinform 9(2):467

    Google Scholar 

  101. Jiawei Han MK (2006) Data mining: concepts and techniques. Morgan Kaufmann, San Francisco

  102. Witten IH, Frank E (2005) Data mining: practical machine learning tools and techniques. Morgan Kaufmann, San Francisco

  103. Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) Genome Res 14:1188

    Article  CAS  Google Scholar 

  104. Schneider TD, Stephens RM (1990) Nucleic Acids Res 18:6097

    Article  CAS  Google Scholar 

  105. Cokol M, Nair R, Rost B (2000) EMBO Rep 1:411

    Article  CAS  Google Scholar 

  106. Schaecher SR, Diamond MS, Pekosz A (2008) J Virol 82:9477

    Article  CAS  Google Scholar 

  107. Ladd AN, Cooper TA (2004) J Cell Sci 117:3519

    Article  CAS  Google Scholar 

  108. Hirata T, Okabe M, Kobayashi A, Ueda K, Matsuo M (2009) Biosci Biotechnol Biochem 73(3):619

    Google Scholar 

  109. Eisenhaber B, Eisenhaber F (2007) Curr Protein Pept Sci 8:197

    Article  CAS  Google Scholar 

  110. Lee TY, Huang HD, Hung JH, Huang HY, Yang YS, Wang TH (2006) Nucleic Acids Res 34:D622

    Article  CAS  Google Scholar 

Download references

Acknowledgments

The authors would like to thank the National Science Council of the Republic of China, No. NSC 101-2628-E-155-002-MY2, 99-2221-E-008-083-MY3, NSC 101-2311-B-009-003-MY3 and NSC 100-2627-B-009-002. This work was supported in part by the UST-UCSD International Center of Excellence in Advanced Bioengineering sponsored by the Taiwan National Science Council I-RiCE Program under Grant Number: NSC 101-2911-I-009-101, and Veterans General Hospitals and University System of Taiwan (VGHUST) Joint Research Program under Grant Number: VGHUST101-G5-1-1. This work was also partially supported by MOE ATU.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tzong-Yi Lee.

Electronic supplementary material

Rights and permissions

Reprints and permissions

About this article

Cite this article

Chang, TH., Wu, LC., Lee, TY. et al. EuLoc: a web-server for accurately predict protein subcellular localization in eukaryotes by incorporating various features of sequence segments into the general form of Chou’s PseAAC. J Comput Aided Mol Des 27, 91–103 (2013). https://doi.org/10.1007/s10822-012-9628-0

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10822-012-9628-0

Keywords

Navigation