Skip to main content

Heart Disease Diagnosis Using Co-clustering

  • Conference paper
  • First Online:
Scalable Information Systems (INFOSCALE 2014)

Abstract

Due to the advancement of information technology and its incorporation in various health applications, a huge amount of medical data is being produced continuously. Consequently, efficient techniques are required to analyse such large datasets and extract meaningful information as well as knowledge. Disease diagnosis is an important application domain of data mining techniques and can be resembled with the anomaly detection which is one of the primary tasks of data mining research. In past decades, heart disease caused the maximum death all over the world. As a result, heart disease diagnosis is a challenge for both data mining and health care communities. In this paper, co-clustering is introduced as a powerful data analysis tool to diagnose heart disease and extract the underlying data pattern of the datasets. The performance of the proposed method is evaluated using Cleveland Clinic Foundation Heart Disease dataset against other existing clustering based anomaly detection techniques. Experimental results reflect not only better accuracy but also meaningful information about the dataset which is helpful for further analysis of heart disease diagnosis.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 34.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 44.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. World Health Organization. http://www.who.int/en/

  2. Chandola, V., Banerjee, A.: Anomaly detection: a survey. ACM Comput. Surv. 41(3), 15:1–15:58 (2009)

    Article  Google Scholar 

  3. Ahmed, M., Mahmood, A., Hu, J.: Outlier detection. In: Khan Pathan, A.-S. (ed.) The State of the Art in Intrusion Prevention and Detection, pp. 3–23. CRC Press, Boca Raton (2014)

    Chapter  Google Scholar 

  4. Fang, X.: Are you becoming a diabetic? a data mining approach. In: Proceedings of the 6th International Conference on Fuzzy Systems and Knowledge Discovery - Volume 5, ser. FSKD 2009, pp. 18–22. IEEE Press, Piscataway (2009)

    Google Scholar 

  5. Ahmed. M., Naser. A.: A novel approach for outlier detection and clustering improvement. In: 2013 8th IEEE Conference on Industrial Electronics and Applications (ICIEA), pp. 577–582 (2013)

    Google Scholar 

  6. Tucker, L.R.: The extension of factor analysis to three-dimensional matrices. In: Gulliksen, H., Frederiksen, N. (eds.) Contributions to Mathematical Psychology, pp. 110–127. Holt, Rinehart and Winston, New York (1964)

    Google Scholar 

  7. Tucker, L.R.: Clustering with block mixture models. Pattern Recogn. 36(2), 463–473 (2003)

    Article  Google Scholar 

  8. Tucker, L.R.: Block clustering with bernoulli mixture models: comparison of different approaches. Comput. Stat. Data Anal. 52(6), 3233–3245 (2008)

    Article  Google Scholar 

  9. Banerjee, A., Dhillon, I., Ghosh, J., Merugu, S., Modha, D.S.: A generalized maximum entropy approach to bregman co-clustering and matrix approximation. J. Mach. Learn. Res. 8, 1919–1986 (2007)

    MATH  MathSciNet  Google Scholar 

  10. Knorr, E.M., Ng, R.T.: Algorithms for mining distance-based outliers in large datasets. In: Proceedings of the 24rd International Conference on Very Large Data Bases, ser. VLDB 1998, pp. 392–403. Morgan Kaufmann Publishers Inc., San Francisco (1998)

    Google Scholar 

  11. Ramaswamy, S., Rastogi, R., Shim, K.: Efficient algorithms for mining outliers from large data sets. SIGMOD Rec. 29(2), 427–438 (2000)

    Article  Google Scholar 

  12. Breunig, M.M., Kriegel, H.-P., Ng, R.T., Sander, J.: Lof: identifying density-based local outliers. SIGMOD Rec. 29(2), 93–104 (2000)

    Article  Google Scholar 

  13. He, Z., Xu, X., Deng, S.: Discovering cluster based local outliers. Pattern Recogn. Lett. 2003, 9–10 (2003)

    MATH  Google Scholar 

  14. Mennatallah Amer, M.G.: Nearest-neighbor and Clustering Based Anomaly Detection Algorithms For Rapidminer. Shaker Verlag GmbH, Aachen (2012)

    Google Scholar 

  15. Bache, K., Lichman, M.: UCI machine learning repository (2013). http://archive.ics.uci.edu/ml

  16. Shouman, M., Turner, T., Stocker, R.: Using decision tree for diagnosing heart disease patients. In: Proceedings of the Ninth Australasian Data Mining Conference - Volume 121, ser. AusDM 2011, pp. 23–30. Australian Computer Society Inc., Darlinghurst (2011)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Mohiuddin Ahmed .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Institute for Computer Sciences, Social Informatics and Telecommunications Engineering

About this paper

Cite this paper

Ahmed, M., Mahmood, A.N., Maher, M.J. (2015). Heart Disease Diagnosis Using Co-clustering. In: Jung, J., Badica, C., Kiss, A. (eds) Scalable Information Systems. INFOSCALE 2014. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 139. Springer, Cham. https://doi.org/10.1007/978-3-319-16868-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-16868-5_6

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-16867-8

  • Online ISBN: 978-3-319-16868-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics