Abstract
Now days, health prediction in modern life becomesvery much essential. Big data analysis plays a crucial role to predict future status of healthand offerspreeminenthealth outcome to people. Heart disease is a prevalent disease cause’s death around the world. A lotof research is going onpredictive analytics using machine learning techniques to reveal better decision making. Big data analysis fosters great opportunities to predict future health status from health parameters and provide best outcomes. WeusedBig Data Predictive Analytics Model for Disease Prediction using Naive Bayes Technique (BPA-NB). It providesprobabilistic classification based on Bayes’ theorem with independence assumptions between the features. Naive Bayes approach suitable for huge data sets especially for bigdata. The Naive Bayes approachtrain the heart disease data taken from UCI machine learning repository. Then, it was making predictions on the test data to predict the classification. The results reveal that the proposed BPA-NB scheme providesbetter accuracy about 97.12% to predict the disease rate. The proposed BPA-NB scheme used Hadoop-spark as big data computing tool to obtain significant insight on healthcare data. The experiments are done to predict different patients’ future health condition. It takes the training dataset to estimate the health parameters necessary for classification. The results show the early disease detection to figure out future health of patients.
Similar content being viewed by others
References
Banu, N. K. S., Swamy, S., Prediction of heart disease at early stage using data mining and big data analytics: A survey, International Conference on Electrical, Electronics, Communication, Computer and Optimization Techniques , IEEE, Mysuru, India, 2016.
Wang, L., and Alexander, C. A., Big Data in Medical Applications and Health Care. Current Research in Medicine 6:1–8, 2015.
Palit, I., and Reddy, C. K., Scalable and Parallel Boosting with MapReduce. Ieee Transactions on Knowledge And Data Engineering 24(10):1904–1916, 2012.
Kumar, P., Mohapatra, S. K., and Shih-Lin, W., Analyzing Healthcare Big Data With Prediction for Future Health Condition. IEEE Access 4:9786–9799, 2016.
Alexander, C. A., and Wang, L., Big Data Analytics in Heart Attack Prediction. Journal of Nursing and Care 6(2):1–9, 2017.
Ram, S., Zhang, W., and Williams, M., Predicting Asthma-Related Emergency Department Visits Using Big Data. IEEE Journal 19(4):1216–1223, 2015.
Chen, D., Chen, Y., Brownlow, B. N., and Kanjamala, P. P., Real-Time Daily Healthcare Data Into HDFS and Elastic Search Index Inside a Big Data Platform. IEEE Transaction 13(2):595–606, 2017.
Heureuxi, A. L., Grolingeri, K., Elyamany, H. F., and Miriama, Machine Learning With Big Data:Challenge and Approaces. IEEE Access 5:7776–7797, 2017.
chen, M., Hao, Y., and Hwang, K., Disease Prediction by Machine Learning Over Big Data from Healthcare Communities. IEEE Access 5:8869–8879, 2017.
Abdulsalamyassine, S., Mining Human Activity Patterns From Smart Home Big Data for Health Care Applications. IEEE Access 5:13131–13149, 2017.
Wang, Y., and Kung, L. A., Terry Anthony Byrd, “Understanding itscapabilities and potential benefits for healthcare organizations”. Journal of Technological Forecasting and Social Change 126:3–13, 2018.
Gao Zhu, F., A Survey of Clustering Algorithms for Big Data: Taxonomy and Empirical Analysis. IEEE TransactionsEmerging Topics in Computing 2(3):267–279, 2014.
Rav, D., Wong, C., and Deligianni, F., Deep Learning for Health Informatics. IEEE Journal of Biomedical and Health Informatics 21(1):4–22, 2017.
Dayal, M., and Singh, N., Indian Health Care Analysis using Big Data Programming Tool. Procedia Computer Science 89:521–527, 2016.
Yang, L., and Zhou, Y., Exploring feature sets for two-phase biomedical named entity recognition using semi-CRFs. Journal of Knowledge and Information Systems 40(2):439–453, 2014.
Wittek, P., and Daranyi, S., Accelerating text mining workloads in a mapreduce-based distributed GPU environment. Journal of Parallel and Distributed Computing 73(2):198–206, 2013.
Mehta, N., and Pandit, A., Concurrence of big data analytics and healthcare: A systematic review. International Journal of Medical Informatics 114:57–65, 2018.
Viceconti, M., Hunter, P., and Hose, R., Big Data, Big Knowledge: Big Data for Personalized Healthcare. IEEE Journal of Biomedical and Health Informatics 19:4–33, 2015.
Andreu-Perez, J., Poon, C. C. Y., Merrifield, R. D., Wong, S. T. C., Yang, G-Z, Fellow, “Big Data for Health”, IEEE Journal of Biomedical and Health Informatics, Vol.16, Pp.16–35, 2015
Tamano, S. N., and Araki, T., Optimizing multiple machine learning jobs on MapReduce, IEEE International Conference on Big Data Intelligence and Computing and Cyber Science and Technology, Vol.30, pp.59–66
Yeh, J-F, Yeh, C-K, Yu, K-H, Li, Y-T, Tsai, W-L, Condition Random Fields-based Grammatical Error Detection for Chinese as Second Language, Department of Computer Science and Information Engineering (2014), Vol. 186, Pp. 537–566
Vimal, S., Kalaivani, L., Kaliappan, M., Suresh, A., Gao, X.-Z., and Varatharajan, R., Development of secured data transmission using machine learning based discrete time partial observed markov model and energy optimization in Cognitive radio networks. Neural Comput & Applic, 2018. https://doi.org/10.1007/s00521-018-3788-3.
Kannan, N., Sivasubramanian, S., Kaliappan, M., Vimal, S., and Suresh, A., Predictive big data analytic on demonetization data using support vector machine. Cluster Comput, 2018. https://doi.org/10.1007/s10586-018-2384-8 March 2018.
SudhakarIlango, S., Vimal, S., Kaliappan, M., and Subbulakshmi, P., Optimization using Artificial Bee Colony based clustering approach for big data. Cluster Computing. https://doi.org/10.1007/s10586-017-1571-3.
Kaliappan, M., Augustine, S., and Paramasivan, B., Enhancing energy efficiency and load balancing in mobile adhoc network using dynamic genetic algorithms. Journal of Network and Computer Applications 73:35–43, 2016.
Suresh, A., Udendhran, R., and Balamurgan, M., Hybridized neural network and decision tree based classifier for prognostic decision making in breast cancers. Soft Computing, 2019. https://doi.org/10.1007/s00500-019-04066-4.
Suresh, A., Udendhran, R., and Balamurgan, M., A Novel Internet of Things Framework Integrated with Real Time Monitoring for Intelligent Healthcare Environment. Journal of Medical System 43(6):165, 2019. https://doi.org/10.1007/s10916-019-1302-9.
Suresh, A., Kumar, R., and Varatharajan, R., Health Care Data Analysis using Evolutionary Algorithm. Journal of Supercomputing, 2018. https://doi.org/10.1007/s11227-018-2302-0.
Kaliappan, M., and Paramasivan, B., Enhancing secure routing in Mobile Ad Hoc Networks using a Dynamic Bayesian Signalling Game model. Journal of Computers & Electrical Engineering 41:301–313, 2015.
Paramasivan, B., Viju, M. J., Kaliappan, P. M., Development of a Secure Routing Protocol using Game Theory Model in Mobile Ad Hoc Networks, Journal of Communications and Networks, 17, 1, 2015
Vimal, S., Kalaivani, L., and Kaliappan, M., Collaborative approach on mitigating spectrum sensing data hijack attack and dynamic spectrum allocation based on CASG modeling in wireless cognitive radio networks. Cluster Computing, 2017. https://doi.org/10.1007/s10586-017-1092-0.
Mariappan, E., Kaliappan, M., Vimal, S., Energy Efficient Routing protocol using Grover’s Searching algorithm using MANET, Asian Journal of Information Technology,Vol:15, no.24,2016
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article is part of the Topical Collection on Patient Facing Systems
Rights and permissions
About this article
Cite this article
Venkatesh, R., Balasubramanian, C. & Kaliappan, M. Development of Big Data Predictive Analytics Model for Disease Prediction using Machine learning Technique. J Med Syst 43, 272 (2019). https://doi.org/10.1007/s10916-019-1398-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10916-019-1398-y