Population Classification upon Dietary Data Using Machine Learning Techniques with IoT and Big Data

  • Jangam J. S. Mani
  • Sandhya Rani Kasireddy
Part of the SpringerBriefs in Applied Sciences and Technology book series (BRIEFSAPPLSCIENCES)


In this digital age, data is generated monstrous from diverse sources like IoT enabled smart gadgets, and so on worldwide very swiftly in distinctive formats. This data with the traits say volume, velocity, variety and so on referred to as big data. Since a decade, big data technologies have been utilized in most of the companies even in healthcare alongside IoT to gain treasured insights in making knowledgeable selections spontaneously to improve medical treatment particularly for patients with complicated medical history having multiple health ailments. For healthy living, after water and oxygen, diet plays a critical role in offering the strength needed to assist the life’s existence-maintaining strategies and also the vitamins needed to construct and keep all body cells. The intent of this work is to offer a framework that classifies the population into four classes based on the quality of diet they devour within 30-days of dietary recall as balanced, unbalanced, nearly balanced, and nearly unbalanced using the machine learning techniques specifically logistic regression, linear discriminant analysis (LDA), and random forest. NHANES datasets had been used to assess the proposed framework alongside the metrics accuracy, precision, etc. This framework also allows us in gathering person’s health and dietary details dynamically anytime with the voice (IoT) to find out to which food regimen the person belongs to. This could be pretty beneficial for a person, medical doctors, and dieticians as nicely.


Healthcare Machine learning IoT Nutrition Big data 


  1. 1.
    Raghupathi W, Raghupathi V (2014) Big data analytics in healthcare: promise and potential. Health Inf Sci Syst 2(1):3Google Scholar
  2. 2.
    Akil L, Ahmad HA Relationships between obesity and cardiovascular diseases in four southern states and Colorado, Scholar
  3. 3.
    Gartner IT Glossary (n.d.) Retrieved from
  4. 4.
    Akred J Founder and CTO, silicon valley data science. What is big data?
  5. 5.
    Zikopoulos PC, Eaton C, deRoos D, Deutsch T, Lapis G (2012) Understanding big data. McGraw-Hill, New YorkGoogle Scholar
  6. 6.
    Tomines A, Readhead H, Readhead A, Teutsch S (2013) Applications of electronic health information in public health: uses, opportunities and barriers. eGEMs (Generating evidence & methods to improve patient outcomes)1(2), Article 5. Doi
  7. 7.
    Rajendra N et al (2015) Modern diet and its impact on human health. J Nutr Food Sci 5:6. Scholar
  8. 8.
    Vilchis-Gil J et al (2015) Food habits, physical activities and sedentary lifestyles of eutrophic and obese school children: a case–control study. BMC Public Health 15:124. Scholar
  9. 9.
    Carruthers K Internet of things and beyond: cyber-physical systems. IEEE Internet of Things, 10 May 2016, Retrieved 26 Dec 2017
  10. 10.
    Schatz B (2015) National surveys of population health: big data analytics for mobile health monitors. Big Data 3:219–229. Scholar
  11. 11.
  12. 12.
    Nilufer Hajra, Worldwide phenomenon: poor diet linked to death, global study reveals, 20th Sep 2017Google Scholar
  13. 13.
    Parthasarathy KS (2017) Childhood and adolescent obesity increases tenfold in four decades–analysis. Eurasia Rev News Anal Retrieved on 11 Nov 2017
  14. 14.
    Webber L, Kilpi F, Marsh T, Rtveladze K, Brown M, McPherson K (2017) High rates of obesity and non-communicable diseases predicted across Latin America. Barengo NC, ed. PLoS ONE. 2012;7(8):e39589. Scholar
  15. 15.
    Aisha M Mapped: the global epidemic of ‘lifestyle’ disease in charts. The Telegraph News, 29th Mar 2018, Retrieved on 13 May 2018
  16. 16.
    Mehtha N, Pandit A (2018) Concurrence of big data analytics and healthcare: a systematic review. Int J Med Inf 114:57–65CrossRefGoogle Scholar
  17. 17.
    Healthcare Technology Review: 2017, referral md, Retrieved 14 May 2018
  18. 18.
    Clark A, Ng JQ, Morlet N, Semmens JB (2016) Big data and ophthalmic research. Surv Ophthalmol 61:443–465. Scholar
  19. 19.
    Chen M, Hao Y, Hwang K, Wang L, Wang L (2017) Disease prediction by machine learning over big data from healthcare communities. IEEE Access 5:8869–8879. Scholar
  20. 20.
    Dinachandra Singh K, Alagarajan M, Ladusingh L (2015) What explains child malnutrition of indigenous people of Northeast India? PLoS ONE 10(6):e0130567. Scholar
  21. 21.
    Ahluwalia N, Dwyer J, Terry A, Moshfegh A, Johnson C (2016) Update on NHANES dietary data: focus on collection, release, analytical considerations, and uses to inform public policy. Adv Nutr 7(1):121–134. Scholar
  22. 22.
    Guenther PM, Kirkpatrick SI, Reedy J, Krebs-Smith SM, Buckman DW, Dodd KW, Casavale KO, Carroll RJ (2014) J Nutr 144(3):399–407. Published online 2014 Jan 22. doi:
  23. 23.
    Hearty AP, Gibney MJ Analysis of meal patterns with the use of supervised data mining techniques—artificial neural networks and decision trees, Scholar
  24. 24.
    Dezhi X, Ganegoda GU et al (2011) Rule based classification to detect malnutrition in children. Int J Comput Sci Eng (IJCSE)3(1). ISSN: 0975-3397Google Scholar
  25. 25.
    Park M, Kim H, Kim SK (2014) Knowledge discovery in a community data set: malnutrition among the elderly. Healthc Inf Res 20(1):30–38CrossRefGoogle Scholar
  26. 26.
    NHANES-National Health and Nutrition Examination Survey.
  27. 27.
    Oracle Text Application Developer’s Guide 12c Release 1, E41398-07, May 2015Google Scholar
  28. 28.
    Pang Ning Tan MS (2006) Introduction to data mining. Pearson Education Asia Ltd., China P. RGoogle Scholar
  29. 29.
    Polamuri S How multinomial logistic regression model works in machine learning Retrieved on 29th June 2017
  30. 30.
    Venables WN, Ripley BD (2002) Modern applied statistics with S, 4th edn. SpringerGoogle Scholar
  31. 31.
    Olson DL, Delen D (2008) Advanced data mining techniques, 1st edn. Springer (1 Feb 2008), p 138. ISBN 3-540-76916-1Google Scholar
  32. 32.
    Breiman L (2001) Random forests. Mach Learn 45:5–32. Scholar
  33. 33.
    Chakraborty DP (2010) Prediction accuracy of a sample-size estimation method for ROC studies. Acad Radiol 17(5):628–638. Scholar
  34. 34.
    Kajaree D, Behera R (2017) A survey on healthcare monitoring system using body sensor network. Int J Innov Res Comput Commun Eng 5(2):1302–1309Google Scholar
  35. 35.

Copyright information

© The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2019

Authors and Affiliations

  1. 1.Department of Computer ScienceSri Padmavati Mahila ViswavidyalayamTirupatiIndia
  2. 2.SPMVVTirupatiIndia

Personalised recommendations