Skip to main content

Semi-possibilistic Biclustering Applied to Discrete and Continuous Data

  • Conference paper
Advanced Machine Learning Technologies and Applications (AMLTA 2012)

Abstract

In contrast to hard biclustering, possibilistic biclustering not only has the ability to cluster a group of genes together with a group of conditions as hard biclustering but also it has outlier rejection capabilities and can give insights towards the degree under which the participation of a row or a column is most effective. Several previous possibilistic approaches are based on computing the zeros of an objective function. However, they are sensitive to their input parameters and initial conditions beside that they don’t allow constraints on biclusters. This paper proposes an iterative algorithm that is able to produce k-possibly overlapping semi-possibilistic (soft) biclusters satisfying input constraints. The proposed algorithms basically alternate between a depth-first search and a breadth-first search to effectively minimize the underlying objective function. It allows constraints, applicable to any acceptable (dis)similarity measure for the type of the input dataset and it is not sensitive to initial conditions. Experimental results show the ability of our algorithm to offer substantial improvements over several previously proposed biclustering algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Zhiguan, W., Chi, W.: Hypergraph based geometric biclustering algorithm. Pattern Recognition Letters 33(12), 1656–1665 (2012)

    Article  Google Scholar 

  2. Sharara, H., Ismail, M.A.: BISOFT: A novel algorithm for clustering gene expression data. In: Proceedings of the 7th IEEE International Conference on Bioinformatics and Bioengineering, BIOCOMP 2008, pp. 974–981 (2007)

    Google Scholar 

  3. Cheng, Y., Church, G.M.: Biclustering of expression data. In: Proc. Eighth Int’l Conf. Intelligent Systems for Molecular Biology (ISMB 2000), pp. 93–103 (2000)

    Google Scholar 

  4. Filippone, M., Masulli, F., Rovetta, S., Mitra, S., Banka, H.: Possibilistic Approach to Biclustering: An Application to Oligonucleotide Microarray Data Analysis. In: Priami, C. (ed.) CMSB 2006. LNCS (LNBI), vol. 4210, pp. 312–322. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  5. Getz, G., et al.: Coupled Two-Way Clustering Analysis of Gene Microarray Data. Proc. Natural Academy of Sciences, US, 12079–12084 (2000)

    Google Scholar 

  6. Ismail, M.A., Kamel, M.S.: Multidimensional data clustering utilizing hybrid search strategies. Pattern Recognition 22(1), 75–89 (1989)

    Article  MathSciNet  MATH  Google Scholar 

  7. Liu, G., et al.: Distance Based Subspace Biclustering with Flexible Dimension Partitioning, pp. 1250–1254. IEEE (2007)

    Google Scholar 

  8. Mahfouz, M.A., Ismail, M.A.: BIDENS: Iterative Density Based Biclustering Algorithm with Application to Gene Expression Analysis. Proceedings of World Academy of Science, Engineering and Technology 37, 342–348 (2009)

    Google Scholar 

  9. Mahfouz, M.A., Ismail, M.A.: Enhanced Possibilistic Biclustering Algorithm. In: Proceedings of the 3rd IEEE International Conference on Bioinformatics and Biomedical Engineering, Beijing, China, 6 pages (2009)

    Google Scholar 

  10. Mahfouz, M.A., Ismail, M.A.: Distance Based Possibilistic Biclustering Algorithm. In: Proceedings of the 3rd IEEE International Conference on Bioinformatics and Biomedical Engineering, Beijing, China, 4 pages (2009)

    Google Scholar 

  11. Ng, R.T., Han, J.: Efficient and effective clustering methods for spatial data mining. In: Proceedings of the 20th VLDB Conference, Santiago, Chile, pp. 144–155 (1994)

    Google Scholar 

  12. Okada, Y., et al.: Module Discovery in Gene Expression Data Using Closed Itemset Mining Algorithm. IPSG Transactions in Bioinformatics 48, 39–48 (2007)

    Google Scholar 

  13. Pei, J., et al.: Fault-tolerant frequent pattern mining: Problems and challenges. In: Workshop on Research Issues in Data Mining and Knowledge Discovery (2001)

    Google Scholar 

  14. Sara, C.M., Arlindo, L.O.: Biclustering Algorithms for Biological Data Analysis: A Survey. IEEE Trans. Computational Biology and Bioinformatics 1 (2004)

    Google Scholar 

  15. Anindya, B., Rajat, K.: Bi-correlation clustering algorithm for determining a set of co-regulated genes. Bioinformatics 25(21), 2795–2801 (2009)

    Article  Google Scholar 

  16. Selim, S.Z., Ismail, M.A.: Soft clustering of multidimensional data A semi-fuzzy approach. Pattern Recogn. 17(5), 559–568 (1984)

    Article  MATH  Google Scholar 

  17. Christinat, Y., et al.: Gene Expression Data Analysis Using a Novel Approach to Biclustering Combining Discrete and Continuous Data. IEEE/ACM Transactions on Computational Biology And Bioinformatics 5(4) (2008)

    Google Scholar 

  18. Yang, J., et al.: Enhanced Biclustering on Expression Data. In: Proc. Third IEEE Conf. Bioinformatics and Bioeng., pp. 321–327 (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Mahfouz, M.A., Ismail, M.A. (2012). Semi-possibilistic Biclustering Applied to Discrete and Continuous Data. In: Hassanien, A.E., Salem, AB.M., Ramadan, R., Kim, Th. (eds) Advanced Machine Learning Technologies and Applications. AMLTA 2012. Communications in Computer and Information Science, vol 322. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-35326-0_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-35326-0_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-35325-3

  • Online ISBN: 978-3-642-35326-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics