Skip to main content

Structure-Based Prediction of Protein Phosphorylation Sites Using an Ensemble Approach

  • Conference paper
Book cover Intelligent Computing in Bioinformatics (ICIC 2014)

Part of the book series: Lecture Notes in Computer Science ((LNBI,volume 8590))

Included in the following conference series:

  • 3434 Accesses

Abstract

As one of the most prevailing post-translational modifications, phosphorylation is vital in regulating almost every cellular behavior. In this paper, we propose a new computational method that can effectively identify phosphorylation sites by using optimally chosen properties. The highlight of our method is that the optimal combination of features was selected from a set of 165 novel structural neighborhood properties by a random forest feature selection method. And then an ensemble learning method based on support vector machine was used to build the prediction model. Experimental results obtained from cross validation and independent test suggested that our method achieved a significant improvement on the prediction quality. Promising results were obtained after being compared with the state-of-the-art approaches using independent dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Steen, H., Jebanathirajah, J.A., Rush, J., Morrice, N., Kirschner, M.W.: Phosphorylation Analysis by Mass Spectrometry Myths, Facts, and the Consequences for Qualitative and Quantitative Measurements. Molecular & Cellular Proteomics 5(1), 172–181 (2006)

    Article  Google Scholar 

  2. Delom, F., Chevet, E.: Phosphoprotein Analysis: From Proteins to Proteomes. Proteome Science 4(1), 15 (2006)

    Article  Google Scholar 

  3. Pinna, L.A., Ruzzene, M.: How Do Protein Kinases Recognize Their Substrates?: Biochimica et Biophysica Acta (BBA)-Molecular Cell Research 1314(3), 191–225 (1996)

    Google Scholar 

  4. Iakoucheva, L.M., Radivojac, P., Brown, C.J., O’Connor, T.R., Sikes, J.G., Obradovic, Z., Dunker, A.K.: The Importance of Intrinsic Disorder for Protein Phosphorylation. Nucleic Acids Research 32(3), 1037–1049 (2004)

    Article  Google Scholar 

  5. Blom, N., Gammeltoft, S., Brunak, S.: Sequence and Structure-Based Prediction of Eukaryotic Protein Phosphorylation Sites. Journal of Molecular Biology 294(5), 1351–1362 (1999)

    Article  Google Scholar 

  6. Xue, Y., Ren, J., Gao, X., Jin, C., Wen, L., Yao, X.: Gps 2.0, a Tool to Predict Kinase-Specific Phosphorylation Sites in Hierarchy. Molecular & Cellular Proteomics 7(9), 1598–1608 (2008)

    Article  Google Scholar 

  7. Xue, Y., Li, A., Wang, L., Feng, H., Yao, X.: Ppsp: Prediction of Pk-Specific Phosphorylation Site with Bayesian Decision Theory. BMC Bioinformatics 7(1), 163 (2006)

    Article  Google Scholar 

  8. Wong, Y.-H., Lee, T.-Y., Liang, H.-K., Huang, C.-M., Wang, T.-Y., Yang, Y.-H., Chu, C.-H., Huang, H.-D., Ko, M.-T., Hwang, J.-K.: Kinasephos 2.0: A Web Server for Identifying Protein Kinase-Specific Phosphorylation Sites Based on Sequences and Coupling Patterns. Nucleic Acids Research 35(suppl. 2), W588–W594 (2007)

    Google Scholar 

  9. Zanzoni, A., Ausiello, G., Via, A., Gherardini, P.F., Helmer-Citterich, M.: Phospho3d: A Database of Three-Dimensional Structures of Protein Phosphorylation Sites. Nucleic Acids Research 35(suppl. 1), D229–D231 (2007)

    Google Scholar 

  10. Zanzoni, A., Carbajo, D., Diella, F., Gherardini, P.F., Tramontano, A., Helmer-Citterich, M., Via, A.: Phospho3d 2.0: An Enhanced Database of Three-Dimensional Structures of Phosphorylation Sites. Nucleic Acids Research 39(suppl. 1), D268–D271 (2011)

    Google Scholar 

  11. Huang, Y., Niu, B., Gao, Y., Fu, L., Li, W.: Cd-Hit Suite: A Web Server for Clustering and Comparing Biological Sequences. Bioinformatics 26(5), 680–682 (2010)

    Article  Google Scholar 

  12. Deng, L., Guan, J., Wei, X., Yi, Y., Zhang, Q.C., Zhou, S.: Boosting Prediction Performance of Protein–Protein Interaction Hot Spots by Using Structural Neighborhood Properties. Journal of Computational Biology 20(11), 878–891 (2013)

    Article  MathSciNet  Google Scholar 

  13. Liaw, A., Wiener, M.: Classification and Regression by Randomforest. R News 2(3), 18–22 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Gao, Y., Hao, W., Chen, Z., Deng, L. (2014). Structure-Based Prediction of Protein Phosphorylation Sites Using an Ensemble Approach. In: Huang, DS., Han, K., Gromiha, M. (eds) Intelligent Computing in Bioinformatics. ICIC 2014. Lecture Notes in Computer Science(), vol 8590. Springer, Cham. https://doi.org/10.1007/978-3-319-09330-7_15

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-09330-7_15

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-09329-1

  • Online ISBN: 978-3-319-09330-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics