Skip to main content

A Hough Transform-Based Biclustering Algorithm for Gene Expression Data

  • Conference paper
  • First Online:
Machine Learning and Cybernetics (ICMLC 2014)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 481))

Included in the following conference series:

  • 1586 Accesses

Abstract

In pattern classification, when the feature space is of high dimensionality or patterns are “similar” on a subset of features only, the traditional clustering methods do not show good performance. Biclustering is a class of methods that simultaneously carry out grouping on two dimensions and has many applications to different fields, especially gene expression data analysis. Because of simultaneous classification on both rows and columns of a data matrix, the biclustering problem is inherently intractable and computationally complex. One of the most complex models in biclustering problem is linear coherent model. Several biclustering algorithms based on this model have been proposed in recent years. However, none of them is able to perfectly recognize all linear patterns in a bicluster. In this work, we propose a novel algorithm based on Hough transform that can find all linear coherent patterns. In the sequel we apply it to gene expression data.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Madeira, S.C., Oliveira, A.L.: Biclustering algorithms for biological data analysis: a survey. IEEE Transactions on Computational Biology and Bioinformatics 1, 24–45 (2004)

    Article  Google Scholar 

  2. Zhao, H., Liew, A.W.C., Wang, D.Z., Yan, H.: Biclustering Analysis for Pattern Discovery: Current Techniques, Comparative Studies and Applications. Current Bioinformatics 7(1), 43–55 (2012)

    Article  Google Scholar 

  3. Li, G., Ma, Q., Tang, H., Paterson, A.H., Xu, Y.: QUBIC: a qualitative algorithm for analyses of gene expression data. Nucleic Acids Research 37, e101 (2009)

    Article  Google Scholar 

  4. Serin, A., Vingron, M.: DeBi: discovering differentially expressed biclusters using a frequent itemset approach. Algorithms for Molecular Biology 6, 18 (2011)

    Article  Google Scholar 

  5. Huang, Q., Tao, D., Li, X., Liew, A.W.C.: Parallelized evolutionary learning for detection of biclusters in gene expression data. IEEE/ACM Transactions on Computational Biology and Bioinformatics 9, 560–570 (2012)

    Article  Google Scholar 

  6. Huang, Q., Lu, M., Yan, H.: An evolutionary algorithm for discovering biclusters in gene expression data of breast cancer. In: IEEE Congress on Evolutionary Computation, pp. 829–834 (2008)

    Google Scholar 

  7. Divina, F., Aguilar-Ruiz, J.S.: Biclustering of expression data with evolutionary computation. IEEE Transactions on Knowledge and Data Engineering 18, 590–602 (2006)

    Article  Google Scholar 

  8. Chakraborty, A., Maka, H.: Biclustering of gene expression data using genetic algorithm. In: Proceeding of the IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, pp. 1–8 (2005)

    Google Scholar 

  9. Hochreiter, S., Bodenhofer, U., Heusel, M., Mayr, A., Mitterecker, A., Kasim, A., Khamiakova, T., Van Sanden, S., Lin, D., Talloen, W., Bijnens, L., Göhlmann, H.W., Shkedy, Z., Clevert, D.A.: FABIA: factor analysis for bicluster acquisition. Bioinformatics 26, 1520–1527 (2010)

    Article  Google Scholar 

  10. Zhao, H., Liew, A.W.C., Xie, X., Yan, H.: A new geometric biclustering algorithm based on the Hough transform for analysis of large-scale microarray data. Journal of Theoretical Biology 251, 264–274 (2007)

    Article  MathSciNet  Google Scholar 

  11. Gan, X., Liew, A.W.C., Yan, H.: Discovering biclusters in gene expression data based on high-dimensional linear geometries. BMC Bioinformatics 9, 209 (2008)

    Article  Google Scholar 

  12. Wang, D.Z., Yan, H.: A graph spectrum based geometric biclustering algorithm. Journal of Theoretical Biology 317, 200–211 (2013)

    Article  MathSciNet  Google Scholar 

  13. Wang, Z., Yu, C.W., Cheung, R.C.C., Yan, H.: Hypergraph based geometric biclustering algorithm. Pattern Recognition Letters 33, 1656–1665 (2012)

    Article  Google Scholar 

  14. Goldenshluger, A., Zeevi, A.: The Hough transform estimator. Ann. Stat. 32, 1908–1932 (2004)

    Article  MATH  MathSciNet  Google Scholar 

  15. Illingworth, J., Kittler, J.: A survey of the Hough transform. Comput. Vision Graphics Image Process 44, 87–116 (1988)

    Article  Google Scholar 

  16. Tavazoie, S., Hughes, J.D., Campbell, M.J., Cho, R.J., Church, G.M.: Systematic determination of genetic network architecture. Nat. Genet. 22, 281–285 (1999)

    Article  Google Scholar 

  17. Rosenwald, A., et al.: The use of molecular profiling to predict survival after chemotherapy for diffuse large-B-cell lymphoma. New Engl. J. Med. 346, 1937–1947 (2002)

    Article  Google Scholar 

  18. Ihmels, J., Bergmann, S., Barkai, N.: Defining transcription modules using large-scale gene expression data. Bioinformatics 20, 1993–2003 (2004)

    Article  Google Scholar 

  19. Murali, T.M., Kasif, S.: Extracting conserved gene expression motifs from gene expression data. Pacific Symposium on Biocomputing 8, 77–88 (2003)

    Google Scholar 

  20. Cheng, Y., Church, G.M.: Biclustering of expression data. In: Proceedings of the Eighth International Conference on Intelligent Systems for Molecular Biology, pp. 93–103 (2000)

    Google Scholar 

  21. Kluger, Y., Basri, R., Chang, J.T., Gerstein, M.: Spectral biclustering of microarray data: coclustering genes and conditions. Genome Research 13, 703–716 (2003)

    Article  Google Scholar 

  22. Turner, H., Bailey, T., Krzanowski, W.: Improved biclustering of microarray data demonstrated through systematic performance tests. Computational Statistics and Data Analysis 48, 235–254 (2005)

    Article  MATH  MathSciNet  Google Scholar 

  23. Ashburner, M., Ball, C.A., et al.: Gene Ontology: tool for the unification of biology. Nature Genetics 25, 25–29 (2000)

    Article  Google Scholar 

  24. Kanehisa, M., Goto, S.: KEGG: Kyoto Encyclopedia of Genes and Genomes. Nucleic Acids Res. 28, 27–30 (2000)

    Article  Google Scholar 

  25. Boyle, E.I., Weng, S., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO:TermFider–open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics 20, 3710–3715 (2004)

    Article  Google Scholar 

  26. Bindea, G., Mlecnik, B., et al.: ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics 25, 1091–1093 (2009)

    Article  Google Scholar 

  27. Gan, X., Liew, A.W.C., Yan, H.: Biclustering gene expression data based on a high dimensional geometric method. In: Proceedings of the Fourth International Conference on Machine Learning and Cybernetics (ICMLC 2005), vol. 6, pp. 3388–3393 (August 18 – 21, 2005)

    Google Scholar 

  28. Duda, R.O., Hart, P.E.: Use of the Hough transformation to detect lines and curves in pictures. Comm. ACM 15, 11–15 (1972)

    Article  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alan Wee-Chung Liew .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

To, C., Nguyen, T.T., Liew, A.WC. (2014). A Hough Transform-Based Biclustering Algorithm for Gene Expression Data. In: Wang, X., Pedrycz, W., Chan, P., He, Q. (eds) Machine Learning and Cybernetics. ICMLC 2014. Communications in Computer and Information Science, vol 481. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-45652-1_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-662-45652-1_11

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-662-45651-4

  • Online ISBN: 978-3-662-45652-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics