Cloud Based Heterogeneous Big Data Integration and Data Analysis for Business Intelligence

  • T. JayarajEmail author
  • J. Abdul Samath
Conference paper
Part of the Lecture Notes on Data Engineering and Communications Technologies book series (LNDECT, volume 49)


Due to the enormous growth of information technology, a huge amount of big data is produced daily, wherein heterogeneity is considered as the main feature of big data. Heterogeneous data integration is still remaining as a bottleneck. It becomes as a very difficult task to integrate and complete the business information demands. Hence, in this research work we have presented a novel Heterogeneous Data Integration and Analysis framework for solving the challenges associated with heterogeneous big data. Big data analysis is an information extraction technique generally used by organizations for business intelligence. However, data mining doesn’t provide good performance for very large data set due to the problems of high computational cost and lack of memory. In this article, we have proposed Convolutional Neural Networks (CNN) architecture for heterogeneous big data analysis. Finally, experimental results make it clear that the proposed method is the fastest data integration framework and that it is also considered as a good analysis model for business.


Big data Heterogeneous data Data integration Data analysis Multisource data 


  1. 1.
    Gal, A., et al.: Heterogeneous data integration by learning to rerank schema matches. In: 2018 IEEE International Conference on Data Mining (ICDM) (2018)Google Scholar
  2. 2.
    Chen, W., et al.: Multi-source and heterogeneous data integration model for big data analytics in power DCS. In: IEEE 2016 International Conference on Cyber-Enabled Distributed Computing and Knowledge Discovery (CyberC) (2016)Google Scholar
  3. 3.
    Tripathi, D., et al.: Model for heterogeneous data integration on cloud. In: 2016 3rd International Conference on Computing for Sustainable Global Development (INDIACom) (2016)Google Scholar
  4. 4.
    Vidal, M.-E., et al.: Semantic data integration techniques for transforming big biomedical data into actionable knowledge. In: 2019 IEEE 32nd International Symposium on Computer-Based Medical Systems (CBMS) (2019)Google Scholar
  5. 5.
    Albawi, S., et al.: Understanding of a convolutional neural network. In: 2017 International Conference on Engineering and Technology (ICET) (2017)Google Scholar
  6. 6.
    Yang, J., el al.: Application of deep convolution neural network. In: 2017 14th International Computer Conference on Wavelet Active Media Technology and Information Processing (ICCWAMTIP) (2017)Google Scholar
  7. 7.
    Song, H., et al.: A review on the self and dual interactions between machine learning and optimisation. Prog. Artif. Intell. 8(2), 143–165 (2019)CrossRefGoogle Scholar
  8. 8.
    Huang, J.-C., et al.: Application and comparison of several machine learning algorithms and their integration models in regression problems. Neural Comput. Appl., 1–9Google Scholar
  9. 9.
    Ertam, F., et al.: Data classification with deep learning using Tensorflow. In: 2017 IEEE International Conference on Computer Science and Engineering (UBMK) (2017)Google Scholar
  10. 10.
    Suma, V.: Towards sustainable industrialization using big data and Internet of Things. J. ISMAC 1(01), 24–37 (2019)CrossRefGoogle Scholar
  11. 11.
    Suresh, R.M., et al.: An overview of data preprocessing in data and web usage mining. In: IEEE 2006 1st International Conference on Digital Information Management (2006)Google Scholar
  12. 12.
    Wei, J., et al.: Research on data preprocessing in supermarket customers data mining. In: IEEE 2010 2nd International Conference on Information Engineering and Computer Science (2010)Google Scholar
  13. 13.
    Qin, H., et al.: Research on extract, transform and load (ETL) in land and resources star schema data warehouse. In: IEEE 2012 Fifth International Symposium on Computational Intelligence and Design (2012)Google Scholar
  14. 14.
    Adnan, et al.: Performance analysis of extract, transform, load (ETL) in apache Hadoop atop NAS storage using ISCSI. In: IEEE 2017 4th International Conference on Computer Applications and Information Processing Technology (CAIPT) (2017)Google Scholar
  15. 15.
    Chemchem, A., et al.: Deep learning and data mining classification through the intelligent agent reasoning. In: IEEE 2018 6th International Conference on Future Internet of Things and Cloud Workshops (FiCloudW) (2018)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2020

Authors and Affiliations

  1. 1.Research and Development CentreBharathiar UniversityCoimbatoreIndia
  2. 2.Chikkana Government Arts and Science CollegeTiruppurIndia

Personalised recommendations