Prediction of Implicit Protein-Protein Interaction by Optimal Associative Feature Mining

Eom, Jae-Hong; Chang, Jeong-Ho; Zhang, Byoung-Tak

doi:10.1007/978-3-540-28651-6_13

Prediction of Implicit Protein-Protein Interaction by Optimal Associative Feature Mining

Jae-Hong Eom¹⁹,
Jeong-Ho Chang¹⁹ &
Byoung-Tak Zhang¹⁹

Conference paper

1303 Accesses
4 Citations

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3177))

Abstract

Proteins are known to perform a biological function by interacting with other proteins or compounds. Since protein-protein interaction is intrinsic to most cellular processes, protein interaction prediction is an important issue in post-genomic biology where abundant interaction data has been produced by many research groups. In this paper, we present an associative feature mining method to predict implicit protein-protein interactions of S.cerevisiae from public protein-protein interaction data. To overcome the dimensionality problem of conventional data mining approach, we employ feature dimension reduction filter (FDRF) method based on the information theory to select optimal informative features and to speed up the overall mining procedure. As a mining method to predict interaction, we use association rule discovery algorithm for associative feature and rule mining. Using the discovered associative feature we predict implicit protein interactions which have not been observed in training data. According to the experimental results, the proposed method accomplishes about 94.8% prediction accuracy with reduced computation time which is 32.5% faster than conventional method that has no feature filter.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Deng, M., et al.: Inferring domain–domain interactions from protein–protein interactions. Genome Res. 12, 1540–1548 (2002)
Article Google Scholar
Goffeau, A., et al.: Life with 6000 genes. Science 274, 546–567 (1996)
Article Google Scholar
Agrawal, R., et al.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD 1993, pp. 207–216 (1993)
Google Scholar
Satou, K., et al.: Extraction of substructures of proteins essential to their biological functions by a data mining technique. In: Proc. of ISMB 1997, vol. 5, pp. 254–257 (1997)
Google Scholar
Oyama, T., et al.: Extraction of knowledge on protein–protein interaction by association rule discovery. Bioinformatics 18, 705–714 (2002)
Article Google Scholar
Yu, L., Liu, H.: Feature selection for high dimensional data: a fast correlation-based filter solution. In: Proc. of ICML 2003, pp. 856–863 (2003)
Google Scholar
Mewes, H.W., et al.: MIPS: a database for genomes and protein sequences. Nucleic Acids Res. 30, 31–34 (2002)
Article Google Scholar
Xenarios, I., et al.: DIP: The Database of Interacting Proteins. A research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 30, 303–305 (2002)
Article Google Scholar
Christie, K.R., et al.: Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms. Nucleic Acids Res. 32, D311–D314 (2004)
Google Scholar
Quinlan, J.: C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco (1993)
Google Scholar
Press, W.H., et al.: Numerical recipes in C. Cambridge University Press, Cambridge (1988)
MATH Google Scholar
Csank, C., et al.: Three yeast proteome databases: YPD, PombePD, and CalPD (Myco- PathPD). Methods Enzymol 350, 347–373 (2002)
Article Google Scholar
Ito, T., et al.: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl Acad. Sci. USA 98, 4569–4574 (2001)
Article Google Scholar
Uetz, P., et al.: A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403, 623–627 (2000)
Article Google Scholar

Download references

Author information

Authors and Affiliations

Biointelligence Lab., School of Computer Science and Engineering, Seoul National University, Seoul, 151-744, South Korea
Jae-Hong Eom, Jeong-Ho Chang & Byoung-Tak Zhang

Authors

Jae-Hong Eom
View author publications
You can also search for this author in PubMed Google Scholar
Jeong-Ho Chang
View author publications
You can also search for this author in PubMed Google Scholar
Byoung-Tak Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

School of Engineering, Computing, and Mathematics, University of Exeter, EX4 4QF, Exeter, UK
Zheng Rong Yang
School of Electrical and Electronic Engineering, University of Manchester, UK
Hujun Yin
School of Engineering, Computer Science and Mathematics, University of Exeter, EX4 4QF, UK
Richard M. Everson

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Eom, JH., Chang, JH., Zhang, BT. (2004). Prediction of Implicit Protein-Protein Interaction by Optimal Associative Feature Mining. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_13

Download citation

DOI: https://doi.org/10.1007/978-3-540-28651-6_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-22881-3
Online ISBN: 978-3-540-28651-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics