Skip to main content

Prediction of Implicit Protein-Protein Interaction by Optimal Associative Feature Mining

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3177))

Abstract

Proteins are known to perform a biological function by interacting with other proteins or compounds. Since protein-protein interaction is intrinsic to most cellular processes, protein interaction prediction is an important issue in post-genomic biology where abundant interaction data has been produced by many research groups. In this paper, we present an associative feature mining method to predict implicit protein-protein interactions of S.cerevisiae from public protein-protein interaction data. To overcome the dimensionality problem of conventional data mining approach, we employ feature dimension reduction filter (FDRF) method based on the information theory to select optimal informative features and to speed up the overall mining procedure. As a mining method to predict interaction, we use association rule discovery algorithm for associative feature and rule mining. Using the discovered associative feature we predict implicit protein interactions which have not been observed in training data. According to the experimental results, the proposed method accomplishes about 94.8% prediction accuracy with reduced computation time which is 32.5% faster than conventional method that has no feature filter.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Deng, M., et al.: Inferring domain–domain interactions from protein–protein interactions. Genome Res. 12, 1540–1548 (2002)

    Article  Google Scholar 

  2. Goffeau, A., et al.: Life with 6000 genes. Science 274, 546–567 (1996)

    Article  Google Scholar 

  3. Agrawal, R., et al.: Mining association rules between sets of items in large databases. In: Proc. of ACM SIGMOD 1993, pp. 207–216 (1993)

    Google Scholar 

  4. Satou, K., et al.: Extraction of substructures of proteins essential to their biological functions by a data mining technique. In: Proc. of ISMB 1997, vol. 5, pp. 254–257 (1997)

    Google Scholar 

  5. Oyama, T., et al.: Extraction of knowledge on protein–protein interaction by association rule discovery. Bioinformatics 18, 705–714 (2002)

    Article  Google Scholar 

  6. Yu, L., Liu, H.: Feature selection for high dimensional data: a fast correlation-based filter solution. In: Proc. of ICML 2003, pp. 856–863 (2003)

    Google Scholar 

  7. Mewes, H.W., et al.: MIPS: a database for genomes and protein sequences. Nucleic Acids Res. 30, 31–34 (2002)

    Article  Google Scholar 

  8. Xenarios, I., et al.: DIP: The Database of Interacting Proteins. A research tool for studying cellular networks of protein interactions. Nucleic Acids Res. 30, 303–305 (2002)

    Article  Google Scholar 

  9. Christie, K.R., et al.: Saccharomyces Genome Database (SGD) provides tools to identify and analyze sequences from Saccharomyces cerevisiae and related sequences from other organisms. Nucleic Acids Res. 32, D311–D314 (2004)

    Google Scholar 

  10. Quinlan, J.: C4.5: Programs for machine learning. Morgan Kaufmann, San Francisco (1993)

    Google Scholar 

  11. Press, W.H., et al.: Numerical recipes in C. Cambridge University Press, Cambridge (1988)

    MATH  Google Scholar 

  12. Csank, C., et al.: Three yeast proteome databases: YPD, PombePD, and CalPD (Myco- PathPD). Methods Enzymol 350, 347–373 (2002)

    Article  Google Scholar 

  13. Ito, T., et al.: A comprehensive two-hybrid analysis to explore the yeast protein interactome. Proc. Natl Acad. Sci. USA 98, 4569–4574 (2001)

    Article  Google Scholar 

  14. Uetz, P., et al.: A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403, 623–627 (2000)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Eom, JH., Chang, JH., Zhang, BT. (2004). Prediction of Implicit Protein-Protein Interaction by Optimal Associative Feature Mining. In: Yang, Z.R., Yin, H., Everson, R.M. (eds) Intelligent Data Engineering and Automated Learning – IDEAL 2004. IDEAL 2004. Lecture Notes in Computer Science, vol 3177. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-28651-6_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-28651-6_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22881-3

  • Online ISBN: 978-3-540-28651-6

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics