Dimension Reduction for Big Data Analytics in Internet of Things

  • Waleed Ejaz
  • Alagan Anpalagan
Part of the SpringerBriefs in Electrical and Computer Engineering book series (BRIEFSELECTRIC)


The number of Internet of Things (IoT) devices continues to grow as sophisticated smart-city applications emerge. Forecasts put the number of IoT devices at 50 billion by 2025. These devices and sensors generate huge amounts of data in many formats, such as plain messages, images, audio, and video, and this data must be analyzed. However, the limited capabilities of IoT devices (such as low power and limited computational capability) call for efficient and robust methods for big data analytics. Numerous statistical techniques, such as regression analysis, support vector machines, ensembles, decision trees, analysis of variance, and correlation and autocorrelation, have led to massive amounts of data being processed in novel ways. It is important to reduce the number of variables in the data before processing it, and dimension reduction is considered an effective way to do so for data generated by IoT devices. In this chapter, we first present related work on dimension reduction in IoT systems. Then, we provide a detailed discussion of solutions for dimension reduction with several examples. Finally, we present conclusions and highlight open research areas for data reduction in IoT systems.
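As a rough illustration of what reducing the number of variables can look like, the sketch below applies principal component analysis (PCA), one common dimension reduction technique, to synthetic data. It is not taken from the chapter: the function names `pca_reduce` and `make_sensor_data`, the six-channel "sensor" layout, and the noise level are all assumptions made for the example.

```python
import numpy as np


def pca_reduce(X, k):
    """Project the rows of X onto the top-k principal components.

    Returns the reduced data, the component directions, and the
    fraction of total variance explained by each kept component.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)  # center each variable
    # SVD of the centered data: rows of Vt are the principal directions,
    # ordered by decreasing singular value.
    _, S, Vt = np.linalg.svd(Xc, full_matrices=False)
    components = Vt[:k]
    explained = (S ** 2) / np.sum(S ** 2)
    return Xc @ components.T, components, explained[:k]


def make_sensor_data(n=200, seed=0):
    """Hypothetical data: 6 correlated channels driven by 2 hidden factors."""
    rng = np.random.default_rng(seed)
    latent = rng.normal(size=(n, 2))          # two underlying factors
    mixing = rng.normal(size=(2, 6))          # how factors map to channels
    noise = 0.01 * rng.normal(size=(n, 6))    # small measurement noise
    return latent @ mixing + noise


readings = make_sensor_data()
reduced, components, ratio = pca_reduce(readings, k=2)
print(reduced.shape)   # 6 variables reduced to 2
print(ratio.sum())     # share of variance kept by the 2 components
```

Because the synthetic channels are driven by only two hidden factors, two components retain almost all of the variance. Real IoT data will rarely be this clean, so the number of components to keep is something to choose per data set.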



Copyright information

© The Author(s), under exclusive licence to Springer Nature Switzerland AG 2019

Authors and Affiliations

  1. Thompson Rivers University, Kamloops, Canada
  2. Ryerson University, Toronto, Canada
