Skip to main content

Identification of Interface Residues Involved in Protein-Protein Interactions Using Naïve Bayes Classifier

  • Conference paper
Book cover Advanced Data Mining and Applications (ADMA 2008)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5139))

Included in the following conference series:

Abstract

The identification of interface residues involved in protein-protein interactions(PPIs) has broad application in rational drug design and metabolic etc. Here a Naïve Bayes classifier for PPIs prediction with features including protein sequence profile and residue accessible surface area was proposed. This method adequately used the character of Naïve Bayes classifier which assumed independence of the attributes given the class. Our test results on a diversity dataset made up of only hetero-complex proteins achieved 68.1% overall accuracy with a correlation coefficient of 0.201, 40.2% specificity and 49.9% sensitivity in identify interface residues as estimated by leave-one-out cross-validation. This result indicated that the method performed substantially better than chance (zero correlation). Examination of the predictions in the context of 3-dimensional structures of proteins demonstrated the effectiveness of this method in identifying protein-protein sites.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhou, H.-X., Qin, S.: Interaction-site prediction for protein complexes: a critical assessment. Bioinformatics 23(17), 2203–2209 (2007)

    Article  Google Scholar 

  2. Ben-Shem, A., Frolow, F., Nelson, N.: Crystal structure of plant photosystem I. Nature 426, 630–635 (2003)

    Article  Google Scholar 

  3. Lanman, J., Lam, T.T., Bames, S., Sakalian, M., Emmett, M.R., Marshall, A.G., Prevelige, J., Peter, E.: Identification of novel interactions in HIV-1 capsid protein assembly by high-resolution mass Spectrometry. J. Mol. Biol. 325, 759–772 (2003)

    Article  Google Scholar 

  4. Trester-Zedlitz, M., Kamada, K., Burley, S.K., Fenyo, D., Chait, B.T., Muir, T.W.: A modular cross-linking approach for exploring protein interactions. J. Am. Chem. Soc. 125, 2416–2425 (2003)

    Article  Google Scholar 

  5. Koike, A., Takagi, T.: Prediction of protein-protein interaction sites using support vector machines. Protein Engineering, Design & Selection 17(2), 165–173 (2004)

    Article  Google Scholar 

  6. Chung, J.-L., Wang, W., Bourne, P.E.: Exploiting sequence and structure homologs to identify protein-protein binding sites. PROTEINS:Structure, Function, and Bioinformatics 62, 630–640 (2006)

    Article  Google Scholar 

  7. Yan, C., Dobbs, D., Honavar, V.: A two-stage classifier for identification of protein-protein interface residues. Bioinformatics 20 (suppl. 1), i371-i378 (2004)

    Article  Google Scholar 

  8. Ofran, Y., Rost, B.: Predicted protein-protein interaction sites from local sequence information. FEBS Letters 544, 236–239 (2003)

    Article  Google Scholar 

  9. Chen, H., Zhou, H.-X.: Prediction of Interface residues in protein-protein complexes by a consensus neural network method: test against NMR data. PROTEINS: Structure, Function, and Bioinformatics 61, 21–35 (2005)

    Article  Google Scholar 

  10. LI, M.-H., Lin, L., Wang, X.-L., Liu, T.: Protein-protein interaction site prediction based on conditional random fields. Bioinformatics 23(5), 597–604 (2007)

    Article  Google Scholar 

  11. James, R., Bradford, C.J., Needham, A.J., Bulpitt, D.R.: Westhead: Insights into protein-protein interfaces using a Bayesian network prediction method. J. Mol. Biol. 362, 365–386 (2006)

    Article  Google Scholar 

  12. Bradford, J.R., Westhead, D.R.: Improved prediction of protein-protein binding sites using a support vector machines approach. Bioinformatics 21(8), 1487–1494 (2005)

    Article  Google Scholar 

  13. Berman, H.M., Westbrook, J., Feng, Z., Gilliland, G., Bhat, T.N., Weissig, H., Shindyalov, I.N., Boume, P.E.: The Protein Data Bank. Nucleic Acids Res. 28, 235–242 (2000)

    Article  Google Scholar 

  14. Xennarios, I., Salwinski, L., Duan, X.J., Higney, P., Kim, S., Eisenberg, D.: DIP: The Database of Interacting Proteins. A research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 30, 303–305 (2002)

    Article  Google Scholar 

  15. Chakrabarti, P., Janin, J.: Dissecting protein-protein recognition sites. PROTEINS: Structure, Function, and Genetics 47, 334–343 (2002)

    Article  Google Scholar 

  16. Res, I., Mihalek, I., Lichtarge, O.: An evolution based classifier for prediction of protein interfaces without using protein structures. Bioinformatics 21(10), 2496–2501 (2005)

    Article  Google Scholar 

  17. Dong, Q., Wang, X., Lin, L., Guan, Y.: Exploiting residue-level and profile-level interface propensities for usage in binding sites prediction of proteins. BMC Bioinformatics 8(147), 1–13 (2007)

    Google Scholar 

  18. Kabsch, W., Sander, C.: Dictionary of protein secondary structure: pattern of hydrogen-bonded and geometrical features. Biopolymers 22, 2577–2637 (1983)

    Article  Google Scholar 

  19. Rost, B., Sander, C.: Conservation and prediction of solvent accessibility in protein families. PROTEINS: Structure, Function, and Genetics 20, 216–226 (1994)

    Article  Google Scholar 

  20. Jones, S., Thomton, J.M.: Principles of protein-protein interactions. Proc. Natl. Acad. Sci. USA 93, 13–20 (1996)

    Article  Google Scholar 

  21. Yan, C., Honavar, V., Dobbs, D.: Identification of interface residues in protease-inhibitor and antigen-antibody complexes: a support vector machine approach. Neural Comput. & Applic. 13, 123–129 (2004)

    Article  Google Scholar 

  22. Dodge, C., Schneider, R., Sander, C.: The HSSP database of protein structure-sequence alignments and family profiles. Nucleic Acids Res. 26, 313–315 (1998)

    Article  Google Scholar 

  23. Christian Borgelt’s Webpages, http://www.borgelt.net//bayes.html

  24. Honavar, V., Yan, C., Dobbs, D.: Predicting protein-protein interaction sites from amino acid sequence. Technical report ISU-CS-TR 02-11, Department of Computer Science, Iowa State University, pp. 2–11 (2002), http://archives.cs.iastate.edu/documents/disk0/00/00/02/88/index.html

  25. Stubbs, M.T., Laber, B., Bode, W., Huber, R., Jerala, R., Lenarcic, B., Turk, V.: The refined 2.4 A X-ray crystal structure of recombinant human stefin B in complex with the cysteine proteinase papain: a novel type of proteinase inhibitor interaction. EMBO J. 9, 1939–1947 (1990)

    Google Scholar 

  26. Rees, D.C., Lipscomb, W.N.: Refined crystal structure of the potato inhibitor complex of carboxypeptidase A at 2.5 A resolution. J. Mol. Biol. 160, 475–498 (1982)

    Article  Google Scholar 

  27. Deisenhofer, J.: Crystallographic refinement and atomic models of a human Fc fragment and its complex with fragment B of protein A from Staphylococcus aureus at 2.9- and 2.8-A resolution. Biochemistry 20, 2361–2370 (1981)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Wang, C., Cheng, J., Su, S., Xu, D. (2008). Identification of Interface Residues Involved in Protein-Protein Interactions Using Naïve Bayes Classifier. In: Tang, C., Ling, C.X., Zhou, X., Cercone, N.J., Li, X. (eds) Advanced Data Mining and Applications. ADMA 2008. Lecture Notes in Computer Science(), vol 5139. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88192-6_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-88192-6_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-88191-9

  • Online ISBN: 978-3-540-88192-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics