Mining High-Correlation Association Rules for Inferring Gene Regulation Networks

Shang, Xuequn; Zhao, Qian; Li, Zhanhuai

doi:10.1007/978-3-642-03730-6_20

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 5691)

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

Abstract

Constructing gene regulation networks can provide insight into the molecular mechanisms underlying important biological processes. We present a novel association rule mining approach for building large-scale gene regulation networks from microarray data. Gene expression microarray data typically have a very high gene dimension and a very small sample size, which poses a great challenge for existing association rule mining algorithms. In this paper, we develop a novel algorithm, HCMiner, to mine high-correlation association rules from microarray data. HCMiner first partitions the gene dimension into overlapping groups according to the correlations among genes and then introduces a support-free framework for mining association rules. Experiments on a yeast dataset show that the proposed algorithm outperforms existing algorithms with respect to scalability and effectiveness.

This research is partly supported by the National Natural Science Foundation of China (No. 60703105) and the Natural Science Foundation of Shaanxi Province (No. 2007F27). All opinions, findings, conclusions and recommendations in this paper are those of the authors and do not necessarily reflect the views of the funding agencies.
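
The abstract names two ideas: grouping genes into overlapping partitions by correlation, and mining rules without a minimum-support threshold. The sketch below illustrates those two ideas only; it is not HCMiner, whose details are not given on this page. Everything in it is an assumption made for illustration: the use of Pearson correlation, the one-group-per-gene partitioning, the "above own mean" discretization, the confidence filter, the function names, and the toy expression values.

```python
from itertools import permutations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length profiles (0.0 if either is constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    vx = sum((a - mx) ** 2 for a in x)
    vy = sum((b - my) ** 2 for b in y)
    if vx == 0 or vy == 0:
        return 0.0
    return cov / sqrt(vx * vy)


def overlapping_partitions(expr, min_corr):
    """Assumed scheme: one group per gene, holding that gene and every gene
    whose correlation with it is at least min_corr. A gene may therefore
    appear in several groups. Singleton groups are dropped."""
    genes = sorted(expr)
    groups = set()
    for g in genes:
        group = frozenset(
            h for h in genes
            if h == g or pearson(expr[g], expr[h]) >= min_corr
        )
        if len(group) > 1:
            groups.add(group)
    return groups


def up_samples(expr):
    """Assumed discretization: a gene is 'up' in a sample if above its own mean."""
    up = {}
    for gene, values in expr.items():
        mean = sum(values) / len(values)
        up[gene] = {i for i, v in enumerate(values) if v > mean}
    return up


def mine_rules(expr, min_corr, min_conf):
    """Return (antecedent, consequent, confidence) for rules 'a up => b up'.
    No support threshold is applied: candidates are limited by the correlation
    groups and kept only if their confidence reaches min_conf."""
    up = up_samples(expr)
    rules = set()
    for group in overlapping_partitions(expr, min_corr):
        for a, b in permutations(sorted(group), 2):
            if not up[a]:
                continue  # antecedent never holds, confidence undefined
            conf = len(up[a] & up[b]) / len(up[a])
            if conf >= min_conf:
                rules.add((a, b, round(conf, 3)))
    return sorted(rules)


# Hypothetical toy data: 4 genes measured across 6 samples.
expr = {
    "g1": [1, 2, 3, 4, 5, 6],
    "g2": [2, 4, 6, 8, 10, 12],
    "g3": [6, 5, 4, 3, 2, 1],
    "g4": [3, 1, 4, 1, 5, 9],
}
rules = mine_rules(expr, min_corr=0.9, min_conf=0.8)
for antecedent, consequent, conf in rules:
    print(f"{antecedent} up => {consequent} up (confidence {conf})")
```

On this toy input only g1 and g2 are grouped together, so rules are generated for that pair alone. Restricting candidate rules to correlation groups is what keeps the search small when there are many more genes than samples, which is the setting the abstract describes.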

Author information

Authors and Affiliations

Institute of Computer Science and Engineering, Northwestern Polytechnical University, P.O. Box 168, Shaanxi, 710072, China
Xuequn Shang, Qian Zhao & Zhanhuai Li

Editor information

Editors and Affiliations

Department of Computer Science, Aalborg University, Selma Lagerlöfsvej 300, 9220, Aalborg Ø, Denmark
Torben Bach Pedersen
IBM India Research Lab, Plot No. 4, Block C, Institutional Area, Vasant Kunj, 110 070, New Delhi, India
Mukesh K. Mohania
Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstr. 9-11/188, 1040, Wien, Austria
A Min Tjoa

About this paper

Cite this paper

Shang, X., Zhao, Q., Li, Z. (2009). Mining High-Correlation Association Rules for Inferring Gene Regulation Networks. In: Pedersen, T.B., Mohania, M.K., Tjoa, A.M. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2009. Lecture Notes in Computer Science, vol 5691. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03730-6_20

DOI: https://doi.org/10.1007/978-3-642-03730-6_20
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03729-0
Online ISBN: 978-3-642-03730-6
eBook Packages: Computer Science (R0)
