Abstract
Protein-protein interaction (PPI) plays an important role in the living organisms, and a major goal of proteomics is to determine the PPI networks for the whole organisms. So both experimental and computational approaches to predict PPIs are urgently needed in the field of proteomics. In this paper, four distinct protein encoding methods are proposed, based on the biological significance extracted from the categories of protein subcellular and functional localizations. And then, some classifiers are tested to prediction PPIs. To show the robustness of classification and ensure the reliability of results, each classifier is examined by many independent random experiments of 10-fold cross validations. The model of random forest achieves some promising performance of PPIs.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Uetz, P., Giot, L., Cagney, G., Mansfield, T.A., Judson, R.S., Knight, J.R., Lockshon, D., Narayan, V., Srinivasan, M., Pochart, P., Qureshi-Emili, A., Li, Y., Godwin, B., Conover, D., Kalbfleisch, T., Vijayadamodar, G., Yang, M.J., Johnston, M., Fields, S., Rothberg, J.M.: A comprehensive analysis of protein-protein interactions in saccharomyces cerevisiae. Nature 403, 623–627 (2000)
Gavin, A.C., Bösche, M., Krause, R., Grandi, P., Marzioch, M., Bauer, A., Schultz, J., Rick, J.M., Michon, A.M., Cruciat, C.M., Remor, M., Höfert, C., Schelder, M., Brajenovic, M., Ruffner, H., Merino, A., Klein, K., Hudak, M., Dickson, D., Rudi, T., Gnau, V., Bauch, A., Bastuck, S., Huhse, B., Leutwein, C., Heurtier, M.A., Copley, R.R., Edelmann, A., Querfurth, E., Rybin, V., Drewes, G., Raida, M., Bouwmeester, T., Bork, P., Seraphin, B., Kuster, B., Neubauer, G., Superti-Furga, G.: Functional organization of the yeast proteome by systematic analysis of protein complexes. Nature 415, 141–147 (2002)
Mrowka, R., Patzak, A., Herzel, H.: Is there a bias in proteome research? Genome Research 11, 1971–1973 (2001)
Sprinzaka, E., Sattath, S., Margalit, H.: How reliable are experimental protein-protein interaction data? Journal of Molecular Biology 327(5), 919–923 (2003)
Xenarios, I., Salwínski, L., Duan, X.J., Higney, P., Kim, S.M., Eisenberg, D.: Dip, the database of interacting proteins: a research tool for studying cellular networks of protein interactions. Nucleic Acids Research 30(1), 303–305 (2002)
Sprinzaka, E., Margalit, H.: Correlated sequence-signatures as markers of protein-protein interaction. Journal of Molecular Biology 311(4), 681–692 (2001)
Qi, Y.J., Joseph, Z.B., Seetharaman, J.K.: Evaluation of different biological data and computational classification methods for use in protein interaction prediction. PROTEINS: Structure, Function, and Bioinformatics 63, 490–500 (2006)
Nanni, L., Alessandra Lumini, A.: An ensemble of k-local hyperplanes for predicting protein-protein interactions. Bioinformatics 22(10), 1207–1210 (2006)
Shen, J.W., Zhang, J., Luo, X.M., Zhu, W.L., Yu, K.Q., Chen, K.X., Li, Y.X., Jiang, H.L.: Predicting protein-protein interactions based only on sequences information. PNAS 104(11), 4337–4341 (2007)
Marcotte, E., Pellegrini, M., Ng, H.L., Rice, D.W., Yeates, T.O., Eisenberg, D.: Detecting protein function and protein-protein interactions from genome sequences. Science 285, 751–753 (1999)
Dandekar, T., Snela, B., Huynena, M., Bork, P.: Conservation of gene order: a fingerprint of proteins that physically interact. Trends in Biochemical Sciences 23(9), 324–328 (1998)
Pellegrini, M., Marcotte, E.M., Thompson, M., Eisenberg, J., Yeates, T.O.: Assigning protein functions by comparative genome analysis:protein phylogenetic profiles. Proc. Natl. Acad. Sci. USA 96, 4285–4288 (1999)
Jansen, R., Yu, H.Y., Greenbaum, D., Kluger, Y., Krogan, N.J., Chung, S., Emili, A., Snyder, M., Greenblatt, J.F., Gerstein, M.: A bayesian networks approach for predicting protein-protein interactions from genomic data. Science 302(5644), 449–453 (2003)
Lin, N., Wu, B.L., Jansen, R., Gerstein, M., Zhao, H.Y.: Information assessment on predicting protein-protein interactions. BMC Bioinformatics 5, 154 (2004)
Wu, X.M., Zhu, L., Guo, J., Zhang, D.Y., Lin, K.: Predicting protein-protein interactions based only on sequences information. Nucleic Acids Research 34(7), 2137–2150 (2006)
Chen, X.W., Liu, M.: Prediction of protein-protein interactions using random decision forest framework. Bioinformatics 21(24), 4394–4400 (2005)
Mewes, H.W., Amid, C., Arnold, R., Frishman, D., Güldener, U., Mannhaupt, G., Münsterkötter, M., Pagel, P., Strack, N., Stümpflen, V., Warfsmann, J., Ruepp, A.: Mips: analysis and annotation of proteins from whole genomes. Nucleic Acids Research 32, 41–44 (2004)
Michael Cherry, J., Ball, C., Weng, S., Juvik, G., Schmidt, R., Adler, C., Dunn, B., Dwight, S., Riles, L., Mortimer, R.K., Botstein, D.: Genetic and physical maps of saccharomyces cerevisiae. Proteins 387(suppl.), 67–73 (1997)
Ben-Hur, A., Noble, W.S.: Choosing negative examples for the prediction of protein-protein interactions. BMC Bioinformatics 7(suppl. 1), S2 (2006)
The Gene Ontology Consortium: Gene ontology: tool for the unification of biology. Nature Genetics 25(1), 25–29 (2000)
Breiman, L.: Random forests. Machine Learning 45, 5–32 (2001)
Hall, M., Frank, E., Holmes, G., Pfahringer, B., Reutemann, P., Witten, I.: The weka data mining software: An update. SIGKDD Explorations 11, 1 (2009)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Cai, Y., Yu, J., Wang, H. (2010). Prediction of Protein-Protein Interactions Using Subcellular and Functional Localizations. In: Li, K., Jia, L., Sun, X., Fei, M., Irwin, G.W. (eds) Life System Modeling and Intelligent Computing. ICSEE LSMS 2010 2010. Lecture Notes in Computer Science(), vol 6330. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15615-1_34
Download citation
DOI: https://doi.org/10.1007/978-3-642-15615-1_34
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15614-4
Online ISBN: 978-3-642-15615-1
eBook Packages: Computer ScienceComputer Science (R0)