Skip to main content

Mining High-Correlation Association Rules for Inferring Gene Regulation Networks

  • Conference paper
Data Warehousing and Knowledge Discovery (DaWaK 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5691))

Included in the following conference series:

Abstract

Construction gene regulation networks can provide insights into the understanding the molecular mechanisms underlying important biological processes. We present a novel association rule mining for building large-scale gene regulation networks from microarray data. Gene expression microarray data typically contains a very high gene dimension and a very low sample size, rendering a great challenge for existing association rule mining algorithms. In this paper, we develop a novel algorithm, HCMiner, to mine high-correlation association rules from microarray data. HCMiner initially overlapping partitions the dimension of genes according to their correlations and introduces the support-free framework for mining association rules. Several experiments on Yeast dataset show that the proposed algorithm outperforms existing algorithms with respect to scalability and effectiveness.

This research is partly supported by the National Natural Science Foundation of China (No. 60703105) and the Natural Science Foundation of Shaanxi Province (No. 2007F27). All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agencies.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Aggarwal, C., Procopiuc, C., Yu, P.: Finding localized associations in market basket data. IEEE Trans. Knowl. Data Eng. 14(1), 51–62 (2002)

    Article  Google Scholar 

  2. Agrawal, R., Imielinski, T., Swami, A.N.: Mining association rules between sets of items in large databases. In: Proc. ACM SIGMOD International Conference on Management of Data, pp. 207–216 (1993)

    Google Scholar 

  3. Albert, R.: Scale-free networks in cell biology. Journal Cell Sci. 118, 4947–4957 (2005)

    Article  Google Scholar 

  4. Barabasi, A., Oltvai, Z.: Network biology: understanding the cells functional organization. Nat. Rev. Genet. 5, 101–113 (2004)

    Article  Google Scholar 

  5. Becquet, C., Blachon, S., Jeudy, J.B.B., Gandrillon, O.: Strong-association-rule mining for large-scale gene-expression data analysis: a case study on human sage data. Genome Biology 3(12) (2002)

    Google Scholar 

  6. Cheng, Y., Church, G.M.: Biclustering of expression data. In: Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, pp. 93–103 (2000)

    Google Scholar 

  7. Cong, G., Tan, K.L., Tung, A., Pan, F.: Mining frequent closed patterns in microarray data. In: Proc. Fourth IEEE Intl. Conf. Data Mining (ICDM), pp. 363–366 (2004)

    Google Scholar 

  8. The Gene Ontology Consortium. The gene ontology (go) database and informatics resource. Nucleic Acids Research 32, 258–261 (2004)

    Google Scholar 

  9. Creighton, C., Hanash, S.: Mining gene expression databases for association rules. Bioinformatics 19, 79–869 (2003)

    Article  Google Scholar 

  10. Cho, R.J., et al.: A genome-wide transcriptional analysis of the mitotic cell cycle. Molecular Cell 2(1), 65–73 (1998)

    Article  Google Scholar 

  11. Friedman, N., Linial, M., Nachman, I., Pe’er, D.: Using bayesian network to analyze expression data. Journal of Computational Biology 7, 601–620 (2000)

    Article  Google Scholar 

  12. Gasch, A., Eisen, M.: Exploring the conditional coregulation of yeast gene expression through fuzzy k-means clustering. Genome Biol. 3 (2002)

    Google Scholar 

  13. Huang, Z., Li, J., Su, H., Watts, G., Chen, H.: Large-scale regulatory network analysis from microarray data: modified bayesian network learning and association rule mining. Decis. Support Syst. 43(4), 1207–1225 (2007)

    Article  Google Scholar 

  14. Kotlyar, M., Jurisica, I.: Predicting protein-protein interactions by association mining. Information Systems Frontiers 8(1), 37–47 (2006)

    Article  Google Scholar 

  15. Murphy, K., Mian, S.: Modeling gene expression data using dynamic bayesian networks. In: Technical Report, Computer Science Division, University of California, Berkeley (1999)

    Google Scholar 

  16. Oba, S., Sato, M., Takemasa, I., et al.: A bayesian missing value estimation method for gene expression profile data. Bioinformatics 19(16), 2088–2096 (2003)

    Article  Google Scholar 

  17. Oyama, T., Kitano, K., Satou, K., Ito, T.: Extraction of knowledge on protein protein interaction by association rule discovery. Bioinformatics 18(5), 705–714 (2002)

    Article  Google Scholar 

  18. Pan, F., Cong, G., Tung, K., Yang, J., Zaki, M.: Carpenter: Finding closed patterns in long biological datasets. In: Proc. ACM SIGKDD Intl. Conf. Knowledge Discovery and Data Mining (KDD), pp. 637–642 (2003)

    Google Scholar 

  19. Pandey, G., Steinbach, M., Gupta, R., Garg, T., Kumar, V.: Association analysis-based transformations for protein interaction networks: A function prediction case study. In: Proc. ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD), pp. 540–549 (2007)

    Google Scholar 

  20. Tsay, Y., Chang-Chien, Y.: An efficient cluster and decomposition algorithm for mining association rules. Inf. Sci. 160, 161–170 (2004)

    Article  Google Scholar 

  21. Tuzhilin, A., Adomavicius, G.: Handling very large numbers of association rules in the analysis of microarray data. In: Proc. of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 23–26 (2002)

    Google Scholar 

  22. Wahde, M., Hertz, J.: Modeling genetic regulatory dynamics in neural development. Journal of Computational Biology 8, 14863–14868 (2001)

    Article  Google Scholar 

  23. Xiong, H., He, X., Ding, C., Zhang, Y., Kumar, V., Holbrook, S.R.: Identification of functional modules in protein complexes via hyperclique pattern discovery. In: Proc. Pacific Symposium on Biocomputing (PSB), pp. 221–232 (2005)

    Google Scholar 

  24. Yeung, K., Medvedovic, M., Bumgarner, R.: From co-expression to co-regulation: how many microarray experiments do we need? Genome Biol. 5(7) (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Shang, X., Zhao, Q., Li, Z. (2009). Mining High-Correlation Association Rules for Inferring Gene Regulation Networks. In: Pedersen, T.B., Mohania, M.K., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2009. Lecture Notes in Computer Science, vol 5691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03730-6_20

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03730-6_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03729-0

  • Online ISBN: 978-3-642-03730-6

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics