Skip to main content

Advertisement

Log in

Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique

  • Patient Facing Systems
  • Published:
Journal of Medical Systems Aims and scope Submit manuscript

Abstract

Now days, health prediction in modern life becomesvery much essential. Big data analysis plays a crucial role to predict future status of healthand offerspreeminenthealth outcome to people. Heart disease is a prevalent disease cause’s death around the world. A lotof research is going onpredictive analytics using machine learning techniques to reveal better decision making. Big data analysis fosters great opportunities to predict future health status from health parameters and provide best outcomes. WeusedBig Data Predictive Analytics Model for Disease Prediction using Naive Bayes Technique (BPA-NB). It providesprobabilistic classification based on Bayes’ theorem with independence assumptions between the features. Naive Bayes approach suitable for huge data sets especially for bigdata. The Naive Bayes approachtrain the heart disease data taken from UCI machine learning repository. Then, it was making predictions on the test data to predict the classification. The results reveal that the proposed BPA-NB scheme providesbetter accuracy about 97.12% to predict the disease rate. The proposed BPA-NB scheme used Hadoop-spark as big data computing tool to obtain significant insight on healthcare data. The experiments are done to predict different patients’ future health condition. It takes the training dataset to estimate the health parameters necessary for classification. The results show the early disease detection to figure out future health of patients.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Banu, N. K. S., Swamy, S., Prediction of heart disease at early stage using data mining and big data analytics: A survey, International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques , IEEE, Mysuru, India, 2016.

  2. Wang, L., and Alexander, C. A., Big Data in Medical Applications and Health Care. Current Research in Medicine 6:1–8, 2015.

    Article  Google Scholar 

  3. Palit, I., and Reddy, C. K., Scalable and Parallel Boosting with MapReduce. Ieee Transactions on Knowledge And Data Engineering 24(10):1904–1916, 2012.

    Article  Google Scholar 

  4. Kumar, P., Mohapatra, S. K., and Shih-Lin, W., Analyzing Healthcare Big Data With Prediction for Future Health Condition. IEEE Access 4:9786–9799, 2016.

    Article  Google Scholar 

  5. Alexander, C. A., and Wang, L., Big Data Analytics in Heart Attack Prediction. Journal of Nursing and Care 6(2):1–9, 2017.

    Article  Google Scholar 

  6. Ram, S., Zhang, W., and Williams, M., Predicting Asthma-Related Emergency Department Visits Using Big Data. IEEE Journal 19(4):1216–1223, 2015.

    Google Scholar 

  7. Chen, D., Chen, Y., Brownlow, B. N., and Kanjamala, P. P., Real-Time Daily Healthcare Data Into HDFS and Elastic Search Index Inside a Big Data Platform. IEEE Transaction 13(2):595–606, 2017.

    Google Scholar 

  8. Heureuxi, A. L., Grolingeri, K., Elyamany, H. F., and Miriama, Machine Learning With Big Data:Challenge and Approaces. IEEE Access 5:7776–7797, 2017.

    Article  Google Scholar 

  9. chen, M., Hao, Y., and Hwang, K., Disease Prediction by Machine Learning Over Big Data from Healthcare Communities. IEEE Access 5:8869–8879, 2017.

    Article  Google Scholar 

  10. Abdulsalamyassine, S., Mining Human Activity Patterns From Smart Home Big Data for Health Care Applications. IEEE Access 5:13131–13149, 2017.

    Article  Google Scholar 

  11. Wang, Y., and Kung, L. A., Terry Anthony Byrd, “Understanding itscapabilities and potential benefits for healthcare organizations”. Journal of Technological Forecasting and Social Change 126:3–13, 2018.

    Article  Google Scholar 

  12. Gao Zhu, F., A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis. IEEE TransactionsEmerging Topics in Computing 2(3):267–279, 2014.

    Article  Google Scholar 

  13. Rav, D., Wong, C., and Deligianni, F., Deep Learning for Health Informatics. IEEE Journal of Biomedical and Health Informatics 21(1):4–22, 2017.

    Article  Google Scholar 

  14. Dayal, M., and Singh, N., Indian Health Care Analysis using Big Data Programming Tool. Procedia Computer Science 89:521–527, 2016.

    Article  Google Scholar 

  15. Yang, L., and Zhou, Y., Exploring feature sets for two-phase biomedical named entity recognition using semi-CRFs. Journal of Knowledge and Information Systems 40(2):439–453, 2014.

    Article  Google Scholar 

  16. Wittek, P., and Daranyi, S., Accelerating text mining workloads in a mapreduce-based distributed GPU environment. Journal of Parallel and Distributed Computing 73(2):198–206, 2013.

    Article  Google Scholar 

  17. Mehta, N., and Pandit, A., Concurrence of big data analytics and healthcare: A systematic review. International Journal of Medical Informatics 114:57–65, 2018.

    Article  Google Scholar 

  18. Viceconti, M., Hunter, P., and Hose, R., Big Data, Big Knowledge: Big Data for Personalized Healthcare. IEEE Journal of Biomedical and Health Informatics 19:4–33, 2015.

    Article  Google Scholar 

  19. Andreu-Perez, J., Poon, C. C. Y., Merrifield, R. D., Wong, S. T. C., Yang, G-Z, Fellow, “Big Data for Health”, IEEE Journal of Biomedical and Health Informatics, Vol.16, Pp.16–35, 2015

  20. Tamano, S. N., and Araki, T., Optimizing multiple machine learning jobs on MapReduce, IEEE International Conference on Big Data Intelligence and Computing and Cyber Science and Technology, Vol.30, pp.59–66

  21. Yeh, J-F, Yeh, C-K, Yu, K-H, Li, Y-T, Tsai, W-L, Condition Random Fields-based Grammatical Error Detection for Chinese as Second Language, Department of Computer Science and Information Engineering (2014), Vol. 186, Pp. 537–566

  22. Vimal, S., Kalaivani, L., Kaliappan, M., Suresh, A., Gao, X.-Z., and Varatharajan, R., Development of secured data transmission using machine learning based discrete time partial observed markov model and energy optimization in Cognitive radio networks. Neural Comput & Applic, 2018. https://doi.org/10.1007/s00521-018-3788-3.

  23. Kannan, N., Sivasubramanian, S., Kaliappan, M., Vimal, S., and Suresh, A., Predictive big data analytic on demonetization data using support vector machine. Cluster Comput, 2018. https://doi.org/10.1007/s10586-018-2384-8 March 2018.

  24. SudhakarIlango, S., Vimal, S., Kaliappan, M., and Subbulakshmi, P., Optimization using Artificial Bee Colony based clustering approach for big data. Cluster Computing. https://doi.org/10.1007/s10586-017-1571-3.

  25. Kaliappan, M., Augustine, S., and Paramasivan, B., Enhancing energy efficiency and load balancing in mobile adhoc network using dynamic genetic algorithms. Journal of Network and Computer Applications 73:35–43, 2016.

    Article  Google Scholar 

  26. Suresh, A., Udendhran, R., and Balamurgan, M., Hybridized neural network and decision tree based classifier for prognostic decision making in breast cancers. Soft Computing, 2019. https://doi.org/10.1007/s00500-019-04066-4.

  27. Suresh, A., Udendhran, R., and Balamurgan, M., A Novel Internet of Things Framework Integrated with Real Time Monitoring for Intelligent Healthcare Environment. Journal of Medical System 43(6):165, 2019. https://doi.org/10.1007/s10916-019-1302-9.

    Article  CAS  Google Scholar 

  28. Suresh, A., Kumar, R., and Varatharajan, R., Health Care Data Analysis using Evolutionary Algorithm. Journal of Supercomputing, 2018. https://doi.org/10.1007/s11227-018-2302-0.

  29. Kaliappan, M., and Paramasivan, B., Enhancing secure routing in Mobile Ad Hoc Networks using a Dynamic Bayesian Signalling Game model. Journal of Computers & Electrical Engineering 41:301–313, 2015.

    Article  Google Scholar 

  30. Paramasivan, B., Viju, M. J., Kaliappan, P. M., Development of a Secure Routing Protocol using Game Theory Model in Mobile Ad Hoc Networks, Journal of Communications and Networks, 17, 1, 2015

    Article  Google Scholar 

  31. Vimal, S., Kalaivani, L., and Kaliappan, M., Collaborative approach on mitigating spectrum sensing data hijack attack and dynamic spectrum allocation based on CASG modeling in wireless cognitive radio networks. Cluster Computing, 2017. https://doi.org/10.1007/s10586-017-1092-0.

  32. Mariappan, E., Kaliappan, M., Vimal, S., Energy Efficient Routing protocol using Grover’s Searching algorithm using MANET, Asian Journal of Information Technology,Vol:15, no.24,2016

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to R. Venkatesh.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

This article is part of the Topical Collection on Patient Facing Systems

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Venkatesh, R., Balasubramanian, C. & Kaliappan, M. Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique. J Med Syst 43, 272 (2019). https://doi.org/10.1007/s10916-019-1398-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10916-019-1398-y

Keywords

Navigation