Abstract
Advances in sensing and computing capabilities are making it possible to embed increasing computing power in small devices. This has enabled the sensing devices not just to passively capture data at very high resolution but also to take sophisticated actions in response. Combined with advances in communication, this results in an ecosystem of highly interconnected devices referred to as the Internet of Things—IoT. In conjunction, the advances in machine learning have allowed building models on this ever increasing amount of data. Consequently, devices all the way from heavy assets such as aircraft engines to wearables such as health monitors can all now not only generate massive amounts of data but can draw back on aggregate analytics to “improve” their performance over time. Big data analytics has been identified as a key enabler for the IoT. In this chapter, we discuss various avenues of the IoT where big data analytics either is already making a significant impact or is on the cusp of doing so. We also discuss social implications and areas of concern.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Note that we use the term “edge devices” loosely to encompass not just the devices such as RFID tags but also other sensors esp. MEMS, including embedded sensors, monitoring and diagnostic sensors aboard industrial assets and so on.
- 2.
We do briefly cover some of these categories since they are indeed critical and the effectiveness of IoT applications and capabilities are highly contingent on effective solutions in these areas.
- 3.
Cyber-physical systems consist of computational and physical components that are able to perceive real-time changes as a result of seamless integration [53].
- 4.
Oil and gas industry can be viewed in three different segments: Upstream (concerned with exploration, drilling/development and production), Midstream (concerend with trading, transportation and refining) and Downstream (concerned with bulk distribution and retail). Refining step has components in both midstream and downstream sectors.
- 5.
- 6.
- 7.
- 8.
Health Insurance Portability and Accountability Act.
References
Aggarwal, C.C., Abdelzaher, T.: Integrating sensors and social networks. In: Aggarwal, C.C. (ed.) Social Network Data Analytics, pp. 379–412. Springer, US (2011). doi:10.1007/978-1-4419-8462-3_14; ISBN:978-1-4419-8461-6; http://dx.doi.org/10.1007/978-1-4419-8462-3_14
Aggarwal, C.C., Ashish, N., Sheth, A.: The internet of things: a survey from the data-centric perspective. In: Managing and Mining Sensor Data, pp. 383–428. Springer (2013)
Baaziz, A., Quoniam, L.: How to use big data technologies to optimize operations in upstream petroleum industry. Int. J. Innov. (IJI) 1(1), 30–42 (2013)
Bengio, Y., Ducharme, R., Vincent, P., Jauvin, C.: A neural probabilistic language model. J. Mach. Learning Res. 3, 1137–1155 (2003)
Bettencourt, L.M.A.: The uses of big data in cities. Santa Fe Institute working paper 2013-09-029, September 2013. http://www.santafe.edu/media/workingpapers/13-09-029.pdf
Bosch MongoDB white-paper: IoT and big data. Technical report, October 2014. http://info.mongodb.com/rs/mongodb/images/MongoDB_BoschSI_IoT_BigData.pdf
Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends Mach. Learn. 3(1), 1–122 (2011)
Brasco, C., Eklund, N., Shah, M., Marthaler, D.: Predictive modeling of high-bypass turbofan engine deterioration. In: Proceedings of the Annual Conference of the Prognostics and Health Management Society (PHM 2013), vol. 4. PHM Society (2013). http://www.phmsociety.org/node/1104
Bui, N., Zorzi, M.: Health care applications: a solution based on the internet of things. In: Proceedings of the 4th International Symposium on Applied Sciences in Biomedical and Communication Technologies, ISABEL ’11, pp. 131:1–131:5. ACM, New York, NY, USA (2011). http://doi.acm.org/10.1145/2093698.2093829
Byrnes, N.: Cities find rewards in cheap technologies. MIT Technology Review, November 2014. http://www.technologyreview.com/news/532466/cities-find-rewards-in-cheap-technologies/
Chui, M., Löffler, M., Roberts, R.: The internet of things. McKinsey Quarterly 2, 1–9 (2010). http://www.mckinsey.com/insights/high_tech_telecoms_internet/the_internet_of_things
Cognizant Report: Reaping the benefits of the internet of things. Technical Report, May 2014. http://www.cognizant.com/InsightsWhitepapers/Reaping-the-Benefits-of-the-Internet-of-Things.pdf
Crankshaw, D., Bailis, P., Gonzalez, J.E., Li, H., Zhang, Z., Franklin, M.J., Ghodsi, A., Jordan, M.I.: The missing piece in complex analytics: low latency, scalable model management and serving with velox. In: Conference on Innovative Data Systems Research (CIDR). Asilomar, CA (2014)
Dean, J., Corrado, G., Monga, R., Chen, K., Devin, M., Mao, M., Ranzato, M., Senior, A., Tucker, P., Yang, K., Le, Q.V., Ng, A.Y.: Large scale distributed deep networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1223–1231. Curran Associates, Inc. (2012). http://papers.nips.cc/paper/4687-large-scale-distributed-deep-networks.pdf
Deb, B., Shah, M., Evans, S., Mehta, M., Gargulak, A., Lasky, T.: Towards systems level prognostics in the cloud. In: Proceedings of the IEEE Conference on Prognostics and Health Management (PHM), pp. 1–6. IEEE (2013). ISBN:978-1-4673-5722-7
Dohr, A., Modre-Opsrian, R., Drobics, M., Hayn, D., Schreier, G.: The internet of things for ambient assisted living. In: Seventh International Conference on Information Technology: New Generations (ITNG), 2010, pp. 804–809. IEEE (2010)
Doukas, C., Maglogiannis, I.: Bringing IoT and cloud computing towards pervasive healthcare. In: 2012 Sixth International Conference on Innovative Mobile and Internet Services in Ubiquitous Computing (IMIS), pp. 922–926, July 2012. doi:10.1109/IMIS.2012.26
Feblowitz, J.: The big deal about big data in upstream oil and gas. IDC Energy Insights, October 2012
Feigelson, E.D., Babu, G.J.: Big data in astronomy. Significance 9(4), 22–25 (2012)
Fung, B.C.M., Wang, K., Chen, R., Yu, P.S.: Privacy-preserving data publishing: A survey of recent developments. ACM Comput. Surv. 42(4), 14:1–14:53, June 2010. doi:10.1145/1749603.1749605; ISSN:0360-0300; http://doi.acm.org/10.1145/1749603.1749605
Garcia, A.B., Bentes, C., de Melo, R.C., Zadrozny, B., Penna, T.J.P.: Sensor data analysis for equipment monitoring. Knowled. Inform. Syst. 28(2), 333–364 (2011). doi:10.1007/s10115-010-0365-1; ISSN:0219-1377; http://dx.doi.org/10.1007/s10115-010-0365-1
Ghose, A., Bhaumik, C., Das, D., Agrawal, A.K.: Mobile healthcare infrastructure for home and small clinic. In: Proceedings of the 2nd ACM International Workshop on Pervasive Wireless Healthcare, MobileHealth ’12, pp. 15–20. ACM, New York, NY, USA (2012). doi:10.1145/2248341.2248347; ISBN:978-1-4503-1292-9; http://doi.acm.org/10.1145/2248341.2248347
Glas, B., Guajardo, J., Hacioglu, H., Ihle, M., Wehefritz, K., Yavuz, A.: Signal-based automotive communication security and its interplay with safety requirements. In: Proceedings of Embedded Security in Cars Conference, November 2012
Gonzalez, J.E., Xin, R.S., Dave, A., Crankshaw, D., Franklin, M.J., Stoica, I.: Graphx: graph processing in a distributed dataflow framework. In: 11th USENIX Symposium on Operating Systems Design and Implementation (OSDI 14), pp. 599–613. USENIX Association, Broomfield, CO, October 2014. ISBN:978-1-931971-16-4; https://www.usenix.org/conference/osdi14/technical-sessions/presentation/gonzalez
Gubbi, J., Buyya, R., Marusic, S., Palaniswami, M.: Internet of things (IoT): a vision, architectural elements, and future directions. Future Gen. Comput. Syst. 29, 1645–1660 (2013)
Hems, A., Soofi, A., Perez, E.: Drilling for new business value: how innovative oil and gas companies are using big data to outmaneuver the competition. A Microsoft White Pater, May 2013
Hesla, L.: Particle physics tames big data. Symmetry 1 (2012)
IBM White Paper: Predictive maintenance for manufacturing. IBM (2011)
Japkowicz, N., Shah, M.: Evaluating Learning Algorithms: A classification perspective. Cambridge University Press (2011)
Jeske, M., Grüner, M., Weiß, F.: Big data in logistics: A DHL perspective on how to move beyond the hype. DHL Customer Solutions and Innovation, December 2013. http://www.delivering-tomorrow.com/wp-content/uploads/2014/02/CSI_Studie_BIG_DATA_FINAL-ONLINE.pdf
Joint DHL Bosch KIT Report: Self-driving vehicles in logistics: A DHL perspective on implications and use cases for the logistics industry. Technical report (2014). http://www.delivering-tomorrow.com/wp-content/uploads/2014/12/dhl_self_driving_vehicles.pdf
Kleiner, A., Talwalkar, A., Sarkar, P., Jordan, M.I.: A scalable bootstrap for massive data. J. Royal Statis. Soc. 76, 795–816 (2013)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105. Curran Associates, Inc. (2012). http://papers.nips.cc/paper/4824-imagenet-classification-with-deep-convolutional-neural-networks.pdf
Kurtz, J., Hoy, P., McHargue, L., Ward, J.: Improving operational and financial results through predictive maintenance. IBM Smarter Analytics Leadership Summit, Feb 2013
Lawson, S.: IoT groups are like an orchestra tuning up: the music starts in 2016. Computer World, Dec 2014. http://www.computerworld.com/article/2863498/networking-hardware/IoT-groups-are-like-an-orchestra-tuning-up-the-music-starts-in-2016.html
Le, Q.V., Monga, R., Devin, M., Chen, K., Corrado, G.S., Dean, J., Ng, A.Y.: Building high-level features using large scale unsupervised learning. In: International Conference on Machine Learning (2012)
Lee, J., Lapira, E., Bagheri, B., Kao, H.: Recent advances and trends in predictive manufacturing systems in big data environment. Manuf. Lett. 1, 38–41 (2013)
Lee, J., Kao, H., Yang, S.: Service innovation and smart analytics for industry 4.0 and big data environment. Procedia CIRP 16, 3–8 (2014)
Leuth, K.L.: IoT market segments biggest opportunities in industrial manufacturing. IoT-Analytics (2014). http://IoT-analytics.com/IoT-market-segments-analysis/
Lin, J., Keogh, E., Lonardi, S., Chiu, B.: A symbolic representation of time series, with implications for streaming algorithms. In: Proceedings of the 8th ACM SIGMOD workshop on Research issues in data mining and knowledge discovery, pp. 2–11. ACM (2003)
Mackey, L., Talwalkar, A., Jordan, M.I.: Distributed matrix completion and robust factorization. J. Mach. Learn. Res. (2014)
Markkanen, A., Shey, D.: The intersection of analytics and the internet of things. IEEE Internet of Things Newsletter, Nov 2014. http://IoT.ieee.org/newsletter/november-2014/the-intersection-of-analytics-and-the-internet-of-things.html
Marz, N., Warren, J.: Big data: principles and best practices of scalable realtime data systems. Manning Publications Co. (2015)
Matwin, S.: Privacy-preserving data mining techniques: survey and challenges. In: Discrimination and Privacy in the Information Society, pp. 209–221. Springer (2013)
McKinsey Study: Connected car, automotive value chain unbound. Technical report (2014)
Metz, R.: Ces 2015: Wearables everywhere. MIT Technology Review, January 2015. http://www.technologyreview.com/news/533916/ces-2015-wearables-everywhere/
Middleton, P., Kjeldsen, P., Tully, J.: Forecast: The Internet of Things, worldwide, 2013. Gartner, November 2013
Mind Commerce LLC Report: Big data in extraction and natural resource industries: Mining, water, timber, oil and gas 2014–2019. Technical report, July 2014. http://www.researchandmarkets.com/research/3qpj9t/big_data_in
MIT Business Report: Cities get smarter. Technical report (2015)
Nambiar, R., Bhardwaj, R., Sethi, A., Vargheese, R.: A look at challenges and opportunities of big data analytics in healthcare. In: 2013 IEEE International Conference on Big Data, pp. 17–22. IEEE (2013)
Navarro-Arribas, G., Torra, V.: Advanced Research in Data Privacy (2014)
Nicholson, R.: Big data in the oil and gas industry. IDC Energy Insights, September 2012
NIST Report: Workshop report on foundations for innovation in cyber-physical systems. Technical report, Jan 2013. http://www.nist.gov/el/upload/CPS-WorkshopReport-1-30-13-Final.pdf
Orts, E., Spigonardo, J.: Sustainability in the age of big data. Special Report, Initiative for Global Environmental Leadership (IGEL), Knowledge at Wharton, September 2014. http://knowledge.wharton.upenn.edu/article/the-big-data-and-energy-synergy/
Páez, D., Aparicio, F., de Buenaga, M., Ascanio, J.R.: Big data and IoT for chronic patients monitoring. In: Ubiquitous Computing and Ambient Intelligence. Personalisation and User Adapted Services, pp. 416–423. Springer (2014)
Pan, X., Jegelka, S., Gonzalez, J., Bradley, J.K., Jordan, M.: Parallel double greedy submodular maximization. In: Advances in Neural Information Processing Systems 22, (2014)
Poulymenopoulou, M., Malamateniou, F., Vassilacopoulos, G.: Machine learning for knowledge extraction from phr big data. Stud. Health Technol. Inform. 202, 36–39 (2013)
Reddy, A.S.: Reaping the benefits of the internet of things. Cognizant Reports, May 2014
Salakhutdinov, R.: Learning deep generative models. Ph.D. thesis, University of Toronto, Toronto, Canada (2009)
Seshadri, M.: Big data science challenging the oil industry. Energyworld (2013). http://web.idg.no/app/web/online/event/energyworld/2013/emc.pdf
Socher, R., Pennington, J., Huang, E.H., Ng, A.Y., Manning, C.D.: Semi-supervised recursive autoencoders for predicting sentiment distributions. In: Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP ’11, pp. 151–161. Association for Computational Linguistics, Stroudsburg, PA, USA, 2011. ISBN:978-1-937284-11-4. http://dl.acm.org/citation.cfm?id=2145432.2145450
Sowe, S.K., Kimata, T., Mianxiong, D., Zettsu, K.: Managing heterogeneous sensor data on a big data platform: IoT services for data-intensive science. In: 2014 IEEE 38th International Computer Software and Applications Conference Workshops (COMPSACW), pp. 295–300, July 2014. doi:10.1109/COMPSACW.2014.52
Tracey, D., Sreenan, C.: A holistic architecture for the internet of things, sensing services and big data. In: 2013 13th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (CCGrid), pp. 546–553, May 2013. doi:10.1109/CCGrid.2013.100
Turner, V., Gantz, J.F., Reinsel, D., Minton, S.: The digital universe of opportunities: rich data and the increasing value of the internet of things. IDC White Paper, April 2014. http://idcdocserv.com/1678
Vandermerwe, S., Rada, J.: Servitization of business: adding value by adding services. Eur. Manage J. 6(6), 314–324 (1989)
Vermesan, O., Friess, P.: Internet of Things: Converging Technologies for Smart Environments and Integrated Ecosystems. River Publishers (2013)
Waller, M.A., Fawcett, S.E.: Data science, predictive analytics, and big data: a revolution that will transform supply chain design and management. J. Bus. Logist. 34(2), 77–84 (2013)
Wang, Y., Bai, H., Stanton, M., Chen, W., Chang, E.Y.: Plda: parallel latent dirichlet allocation for large-scale applications. In: Proceedings of the 5th International Conference on Algorithmic Aspects in Information and Management, AAIM ’09, pp. 301–314. Springer-Verlag, Berlin, Heidelberg (2009). doi:10.1007/978-3-642-02158-9_26; ISBN:978-3-642-02157-2; http://dx.doi.org/10.1007/978-3-642-02158-9_26
Witten, B.: Top 10 IoT security mishaps 2014. In: Industrial Internet Consortium Web blog post. IIC (2014). http://blog.iiconsortium.org/2014/12/top-10-IoT-security-mishaps-2014-.html
Yashiro, T., Kobayashi, S., Koshizuka, N., Sakamura, K.: An internet of things (IoT) architecture for embedded appliances. In: Humanitarian Technology Conference (R10-HTC), 2013 IEEE. Region, vol. 10, pp. 314–319 (2013). doi:10.1109/R10-HTC.2013.6669062
Yavuz, A.A.: Practical immutable signature bouquets (pisb) for authentication and integrity in outsourced databases. In: Data and Applications Security and Privacy XXVI, pp. 179–194. Springer (2013)
Zaki, M., Neely, A.: Optimising asset management within complex service networks: the role of data. Cambridge Service Alliance, working paper:1–11 (2014)
Zanella, A., Bui, N., Castellani, A., Vangelista, L., Zorzi, M.: Internet of things for smart cities. IoT J IEEE, 1(1):22–32 (2014). doi:10.1109/JIoT.2014.2306328; ISSN:2327-4662
Zaslavsky, A, Perera, C., Georgakopoulos, D.: Sensing as a service and big data. arXiv:1301.0159 (2013)
Zhai, K., Boyd-Graber, J., Asadi, N., Alkhouja, M.L.: Mr. lda: A flexible large scale topic modeling package using variational inference in mapreduce. In: Proceedings of the 21st International Conference on World Wide Web, WWW ’12, pp. 879–888, ACM, New York, NY, USA (2012). doi:10.1145/2187836.2187955; ISBN:978-1-4503-1229-5; http://doi.acm.org/10.1145/2187836.2187955
Zhou, Z., Chawla, N., Jin, Y., Williams, G.: Big data opportunities and challenges: discussions from data analytics perspectives [discussion forum]. IEEE Comput. Intell. Magaz. 9(4), 62–74 (2014)
Zicari, R.V., Akerkar, R. (ed.): Big data computing. In: Big Data: Challenges and Opportunities, pp. 103–128. Chapman and Hall/CRC (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Appendix
Appendix
Links to entities referred to in the article (in alphabetical order):
-
Amazon AWS for IoT: http://aws.amazon.com/iot/
-
Amazon Echo: http://www.amazon.com/oc/echo/ref_=ods_dp_ae
-
Beddit: http://www.beddit.com/
-
Bosch, ABB, LG and Cisco’s joint venture announced recently to cooperate on open standards for smart homes: http://www04.abb.com/global/seitp/seitp202.nsf/0/9421f99d7575ceccc1257c1d0033fa4a/file/8364IR_en_Red_Elephant_20131024_final.pdf
-
Bosch Indego: https://www.bosch-indego.com/gb/en/
-
Cloud Foundry: http://www.cloudfoundry.org/about/index.html
-
Cloudera and Hortonwork’s real-time offering: http://www.infoq.com/news/2014/01/Spark-Storm-Real-Time-Analytics
-
Ego LS: http://www.liquidimageco.com/
-
Fitbit: http://www.fitbit.com/
-
Hadoop Ecosystem: See, for instance, http://hadoopecosystemtable.github.io/
-
Hubject, a joint networking mobility initiative of the BMW group, Bosch, Daimler, EnBW, RWE and Siemens: https://www.bosch-si.com/solutions/mobility/our-solutions/hubject.html
-
IBM SyNAPSE: http://www.research.ibm.com/cognitive-computing/neurosynaptic-chips.shtml#fbid=CAQQuy4xAkK
-
Jawbone: https://jawbone.com/
-
Jibo: http://www.myjibo.com/
-
Microsoft’s IoT offerings: http://www.microsoft.com/en-us/server-cloud/internet-of-things.aspx
-
NVidia’s Tegra X1: http://www.nvidia.com/object/tegra-x1-processor.html
-
Pacif-i: http://bluemaestro.com/
-
Pandas: http://pandas.pydata.org/
-
Predictive Model Markup Language (PMML): http://www.dmg.org/v4-1/GeneralStructure.html
-
Qualcomm Zeroth: https://www.qualcomm.com/news/onq/2013/10/10/introducing-qualcomm-zeroth-processors-brain-inspired-computing
-
Spark: https://spark.apache.org/
-
Sproutling: http://www.sproutling.com/
-
Tado: https://www.tado.com/
-
Vessyl: https://www.myvessyl.com/
-
Withings Aura: http://www.withings.com/us/withings-aura.html
-
Zementis: http://zementis.com/
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Shah, M. (2016). Big Data and the Internet of Things. In: Japkowicz, N., Stefanowski, J. (eds) Big Data Analysis: New Algorithms for a New Society. Studies in Big Data, vol 16. Springer, Cham. https://doi.org/10.1007/978-3-319-26989-4_9
Download citation
DOI: https://doi.org/10.1007/978-3-319-26989-4_9
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-26987-0
Online ISBN: 978-3-319-26989-4
eBook Packages: EngineeringEngineering (R0)