Abstract
The era of big data has arrived. Big data and cloud computing go hand-in-hand. Internet of things (IoT) has resulted in a hyper-world consisting of the social, cyber, and physical worlds, with data as a bridge. These topics are closely related to data science and are introduced in this chapter.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Aguilera, M. K., Strom, R. E., Sturman, D. C., Astley, M., & Chandra, T. D. (1999). Matching events in a content-based subscription system. In Proceedings of the 18th Annual ACM Symposium on Principles of Distributed Computing (pp. 53–61). Atlanta, GA.
Amokrane, A., Zhani, M. F., Langar, R., Boutaba, R., & Pujolle, G. (2013). Greenhead: Virtual data center embedding across distributed infrastructures. IEEE Transactions on Cloud Computing, 1(1), 36–49.
Andreev, K., & Racke, H. (2004). Balanced graph partitioning. In Proceedings of the 16th Annual ACM Symposium on Parallelism in Algorithms and Architectures (pp. 120–124). Barcelona, Spain.
Bessani, A., Correia, M., Quaresma, B., Andre, F., & Sousa, P. (2011). DepSky: Dependable and secure storage in a cloud-of-clouds. In Proceedings of the 6th European Conference on Computer Systems (pp. 31–46).
Bhatotia, P., Wieder, A., Rodrigues, R., Acar, U. A., & Pasquin, R. (2011). Incoop: Mapreduce for incremental computations. In Proceedings of the 2nd ACM Symposium on Cloud Computing (Article No. 7, 14 pp.). Cascais, Portugal.
Bilal, K., Manzano, M., Khan, S. U., Calle, E., Li, K., & Zomaya, A. Y. (2013). On the characterization of the structural robustness of data center networks. IEEE Transactions on Cloud Computing, 1(1), 64–77.
Bu, Y., Howe, B., Balazinska, M., & Ernst, M. D. (2010). Haloop: Efficient iterative data processing on large clusters. Proceedings of the VLDB Endowment, 3(1), 285–296.
Chen, W., & Wassell, I. J. (2011). Energy efficient signal acquisition in wireless sensor networks: A compressive sensing framework. In Proceedings of the 6th International Symposium on Wireless and Pervasive Computing (pp. 1–6). Hong Kong, China.
Davis, D., Pilz, G., & Zhang, A. (Eds.). (2012). Cloud Infrastructure Management Interface (CIMI) Primer, DSP2027, v. 1.0.1. Distributed Management Task Force.
Dean, J., & Ghemawat, S. (2004). MapReduce: Simplified data processing on large clusters. In Proceedings of the 6th Symposium on Operating System Design and Implementation (pp. 137–150). San Francisco, CA.
Dean, J., & Ghemawat, S. (2008). MapReduce: Simplified data processing on large clusters. Communications of the ACM, 51(1), 107–113.
Dinh, T. T. A., Liu, R., Zhang, M., Chen, G., Ooi, B. C., & Wang, J. (2018). Untangling blockchain: A data processing view of blockchain systems. IEEE Transactions on Knowledge and Data Engineering, 30(7), 1366–1385.
Dong, Y., Yang, X., Li, X., Li, J., Tian, K., & Guan, H. (2010). High performance network virtualization with SR-IOV. In Proceedings of the 16th International Conference on High-Performance Computer Architecture (pp. 1–10). Bangalore, India.
Eagle, N., & Pentland, A. (2006). Reality mining: Sensing complex social systems. Personal and Ubiquitous Computing, 10(4), 255–268.
Ekanayake, J., Pallickara, S., & Fox, G. (2008). MapReduce for data intensive scientific analyses. In Proceedings of the IEEE 4th International Conference on eScience (pp. 277–284). Indianapolis, IN.
Fiege, L., Gartner, F. C., Kasten, O., & Zeidler, A. (2003). Supporting mobility in content-based publish/subscribe middleware. In Proceedings of the ACM/IFIP/USENIX 2003 International Conference on Middleware (pp. 103–122). Alzburg, Austria.
Ganti, R., Ye, F., & Lei, H. (2011). Mobile crowdsensing: Current state and future challenges. IEEE Communications Magazine, 49(11), 32–39.
Gulisano, V., Jimenez-Peris, R., Patino-Martinez, M., Soriente, C., & Valduriez, P. (2012). Streamcloud: An elastic and scalable data streaming system. IEEE Transactions on Parallel and Distributed Systems, 23(12), 2351–2365.
Guo, B., Chen, C., Zhang, D., Yu, Z., & Chin, A. (2016). Mobile crowd sensing and computing: When participatory sensing meets participatory social media. IEEE Communications Magazine, 54(2), 131–137.
Hacigumus, H., Iyer, B., Li, C., & Mehrotra, S. (2002). Executing SQL over encrypted data in the database-service-provider model. In Proceedings of ACM SIGMOD International Conference on Management of Data (pp. 216–227). Madison, WI.
Huang, T., Lan, L., Fang, X., An, P., Min, J., & Wang, F. (2015). Promises and challenges of big data computing in health sciences. Big Data Research, 2(1), 2–11.
Ingersoll, G. (2009). Introducing apache mahout: Scalable, commercial-friendly machine learning for building intelligent applications. IBM Corporation.
Isard, M., Budiu, M., Yu, Y., Birrell, A., & Fetterly, D. (2007). Dryad: Distributed data-parallel programs from sequential building blocks. In Proceedings of the 2nd ACM SIGOPS/EuroSys European Conference on Computer Systems (pp. 59-72). Lisbon, Portugal.
Jayalath, C., Stephen, J., & Eugster, P. (2014). Universal cross-cloud communication. IEEE Transactions on Cloud Computing, 2(2), 103–116.
Jin, H., Wang, X., Wu, S., Di, S., & Shi, X. (2015). Towards optimized fine-grained pricing of IaaS cloud platform. IEEE Transactions on Cloud Computing, 3(4), 436–448.
Koponen, T., Casado, M., Gude, N., Stribling, J., Poutievski, L., Zhu, M., et al. (2010). Onix: A distributed control platform for large-scale production networks. In Proceedings of the 9th USENIX Symposium on Operating Systems Design and Implementation (pp. 1–6). Vancouver, Canada.
Kreutz, D., Ramos, F. M., & Verissimo, P. (2013). Towards secure and dependable software-defined networks. In Proceedings of the 2nd ACM SIGCOMM Workshop on Hot Topics in Software Defined Networking (pp. 55–60). Hong Kong, China.
Lin, J., Yu, W., Zhang, N., Yang, X., Zhang, H., & Zhao, W. (2017). A Survey on Internet of things: Architecture, enabling technologies, security and privacy, and applications. IEEE Internet of Things Journal, 4(5), 1125–1142.
Malewicz, G., Austern, M. H., Bik, A. J., Dehnert, J. C., Horn, I., Leiser, N., et al. (2010). Pregel: A system for large-scale graph processing. In Proceedings of ACM SIGMOD International Conference on Management of Data (pp. 135–146). Indianapolis, IN.
Mashayekhy, L., Nejad, M. M., & Grosu, D. (2015). Cloud federations in the sky: Formation game and mechanism. IEEE Transactions on Cloud Computing, 3(1), 14–27.
Melnik, S., Gubarev, A., Long, J., Romer, G., Shivakumar, S., Tolton, M., et al. (2010). Dremel: Interactive analysis of web-scale datasets. In Proceedings of the 36th International Conference on Very Large Data Bases (pp. 330–339).
Mitton, N., Papavassiliou, S., Puliafito, A., & Trivedi, K. S. (2012). Combining cloud and sensors in a smart city environment. EURASIP Journal on Wireless Communications and Networking, 2012(247), 1–10.
Mont, M. C., McCorry, K., Papanikolaou, N., & Pearson, S. (2012). Security and privacy governance in cloud computing via SLAS and a policy orchestration service. In Proceedings of the 2nd International Conference on Cloud Computing and Services Science (pp. 670–674). Porto, Portugal.
Nakamoto, S. (2008). Bitcoin: A peer-to-peer electronic cash system. https://bitcoin.org/bitcoin.pdf.
Nurmi, D., Wolski, R., Grzegorczyk, C., Obertelli, G., Soman, S., Youseff, L., et al. (2009). The Eucalyptus open-source cloud computing system. Proceedings of the 9th IEEE/ACM International Symposium on Cluster Computing and the Grid (pp. 124–131). Shanghai, China.
Patel, M., Hu, Y., Hédé, P., Joubert, J., Thornton, C., Naughton, B., et al. (2014). Mobile-edge computing—Introductory technical white paper. White paper, Mobile-Edge Computing (MEC) Industry Initiative.
Rochwerger, B., Breitgand, D., Epstein, A., Hadas, D., Loy, I., Nagin, K., et al. (2011). Reservoir—When one cloud is not enough. Computer, 44(3), 44–51.
Sandholm, T., & Lai, K. (2010). Dynamic proportional share scheduling in Hadoop. In Proceedings of the 15th International Workshop on Job Scheduling Strategies for Parallel Processing, LNCS (Vol. 6253, pp. 110–131). Atlanta, GA. Berlin: Springer.
Schad, J., Dittrich, J., & Quiane-Ruiz, J.-A. (2010). Runtime measurements in the cloud: Observing, analyzing, and reducing variance. Proceedings of the VLDB Endowment, 3, 460–471.
Sempolinski, P., & Thain, D. (2010). A comparison and critique of Eucalyptus, OpenNebula and Nimbus. In Proceedings of IEEE 2nd International Conference on Cloud Computing Technology and Science (pp. 417–426). Indianapolis, IN.
Sotomayor, B., Montero, R. S., Llorente, I. M., & Foster, I. (2008). Capacity leasing in cloud systems using the OpenNebula engine. In Proceedings of Workshop on Cloud Computing and its Applications. Chicago, IL.
Vaquero, L. M., Celorio, A., Cuadrado, F., & Cuevas, R. (2015). Deploying large-scale datasets on-demand in the cloud: Treats and tricks on data distribution. IEEE Transactions on Cloud Computing, 3(2), 132–144.
Xiang, L., Luo, J., & Rosenberg, C. (2013). Compressed data aggregation: Energy-efficient and high-fidelity data collection. IEEE/ACM Transactions on Networking, 21(6), 1722–1735.
Xin, R. S., Rosen, J., Zaharia, M., Franklin, M. J., Shenker, S., & Stoica, I. (2013). Shark: SQL and rich analytics at scale. In Proceedings of ACM SIGMOD International Conference on Management of Data (pp. 13–24). New York.
Xiong, J., Liu, X., Yao, Z., Ma, J., Li, Q., Geng, K., et al. (2014). A secure data self-destructing scheme in cloud computing. IEEE Transactions on Cloud Computing, 2(4), 448–458.
Yan, Z., Ding, W., Yu, X., Zhu, H., & Deng, R. H. (2016). Deduplication on encrypted big data in cloud. IEEE Transactions on Big Data, 2(2), 138–150.
Yao, Y., Tai, J., Sheng, B., & Mi, N. (2015). LsPS: A job size-based scheduler for efficient task assignments in Hadoop. IEEE Transactions on Cloud Computing, 3(4), 411–424.
Zaharia, M., Borthakur, D., Sarma, J. S., Elmeleegy, K., Shenker, S., & Stoica, I. (2009). Job scheduling for multi-user mapreduce clusters. Technical Report UCB/EECS-2009-55, University of California, Berkeley.
Zhang, Q., Zhani, M. F., Yang, Y., Boutaba, R., & Wong, B. (2015). PRISM: Fine-grained resource-aware scheduling for MapReduce. IEEE Transactions on Cloud Computing, 3(2), 182–194.
Zhang, Y., Chen, S., Wang, Q., & Yu, G. (2015). i\(^2\)MapReduce: Incremental MapReduce for mining evolving big data. IEEE Transactions on Knowledge and Data Engineering, 27(7), 1906–1919.
Zhang, Y., Gao, Q., Gao, L., & Wang, C. (2012). iMapReduce: A distributed computing framework for iterative computation. Journal of Grid Computing, 10(1), 47–68.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2019 Springer-Verlag London Ltd., part of Springer Nature
About this chapter
Cite this chapter
Du, KL., Swamy, M.N.S. (2019). Big Data, Cloud Computing, and Internet of Things. In: Neural Networks and Statistical Learning. Springer, London. https://doi.org/10.1007/978-1-4471-7452-3_31
Download citation
DOI: https://doi.org/10.1007/978-1-4471-7452-3_31
Published:
Publisher Name: Springer, London
Print ISBN: 978-1-4471-7451-6
Online ISBN: 978-1-4471-7452-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)