Heart Disease Diagnosis Using Co-clustering

Ahmed, Mohiuddin; Mahmood, Abdun Naser; Maher, Michael J.

doi:10.1007/978-3-319-16868-5_6


Part of the book series: Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering (LNICST, volume 139)

Included in the following conference series:

International Conference on Scalable Information Systems


Abstract

Advances in information technology and its adoption in health applications mean that large volumes of medical data are produced continuously. Efficient techniques are therefore required to analyse such datasets and extract meaningful information and knowledge. Disease diagnosis is an important application domain for data mining and resembles anomaly detection, one of the primary tasks of data mining research. In past decades, heart disease has been the leading cause of death worldwide, which makes heart disease diagnosis a challenge for both the data mining and health care communities. In this paper, co-clustering is introduced as a powerful data analysis tool to diagnose heart disease and extract the underlying patterns in the data. The performance of the proposed method is evaluated on the Cleveland Clinic Foundation Heart Disease dataset against existing clustering-based anomaly detection techniques. Experimental results show not only better accuracy but also meaningful information about the dataset that is helpful for further analysis of heart disease diagnosis.



Author information

Authors and Affiliations

School of Engineering and Information Technology, UNSW Canberra, Canberra, ACT, 2600, Australia
Mohiuddin Ahmed, Abdun Naser Mahmood & Michael J. Maher


Corresponding author

Correspondence to Mohiuddin Ahmed.

Editor information

Editors and Affiliations

Chung-Ang University, Seoul, South Korea
Jason J. Jung
University of Craiova, Craiova, Romania
Costin Badica
Eötvös Loránd University, Budapest, Hungary
Attila Kiss


About this paper

Cite this paper

Ahmed, M., Mahmood, A.N., Maher, M.J. (2015). Heart Disease Diagnosis Using Co-clustering. In: Jung, J., Badica, C., Kiss, A. (eds) Scalable Information Systems. INFOSCALE 2014. Lecture Notes of the Institute for Computer Sciences, Social Informatics and Telecommunications Engineering, vol 139. Springer, Cham. https://doi.org/10.1007/978-3-319-16868-5_6


DOI: https://doi.org/10.1007/978-3-319-16868-5_6
Published: 07 April 2015
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-16867-8
Online ISBN: 978-3-319-16868-5
eBook Packages: Computer Science, Computer Science (R0)
