Cluster Computing

, Volume 22, Supplement 4, pp 8007–8015 | Cite as

The methods of big data fusion and semantic collision detection in Internet of Thing

  • Ruo HuEmail author
  • Hui-min Zhao
  • Yantai Wu


We sometimes find ourselves with plenty of data fusion in Internet of Thing, which necessitates an automatic removing semantic collision. For this, it is necessary to detect semantic collision, with a fairly reliable method to find many semantic collision and powerful enough to run in a reasonable time. Big data fusion in Internet of Thing represents today an important data quality challenge which leads to bad decision-making. This paper proposes and compares on real data effective fusion matching methods for automatic removing semantic collision of files based on names, working with Chinese texts or English texts, and the names of people or places, in East or in the West. After conducting a more complete classification of big data fusion than the usual classifications, we introduce several methods for big data fusion. Through a simple model, we highlight a global efficiency, accuracy and recover. We propose a new measuring mechanism between records, as well as rules for automatic big data fusion. Analyses made on Internet of Thing containing real data in western cities, and on a known standard Internet of Thing containing names of companies in the China, have shown better results than those of known methods, with a lesser complexity.


Big data fusion Semantic collision Internet of Thing Measuring mechanism Matching methods 



This study is supported by Natural Science Fund Project in Guangdong province (No.2015A030313671) and Major Project for Guangzhou collaborative innovation of industry-university-research (No.201704020196). This study is supported by Guangzhou Key Laboratory of Digital Content Processing and security technologies and Guangdong provincial Application-oriented technical research and development Special fund project (2016B010127006) and International Scientific and technological cooperation projects of Guangdong province (2017A050501039).


  1. 1.
    Duda, R.O., Hart, P.E., Stork, D.G.: Pattern Generalization, 2nd edn. Wiley Inter-science, Hoboken (2012)Google Scholar
  2. 2.
    Hu, R., Jiang, C.Y., Xu, H.: A new efficiency judging method for healthy big data system using heuristic algorithm. Basic Clin. Pharmacol. Toxicol. 118, 42 (2016)Google Scholar
  3. 3.
    Kishore, J.K., Patnaik, L.M., Mani, V., Agrawal, V.K.: Application of genetic programming for multi-category pattern generalization. IEEE Trans. Evolut. Comput. 4(3), 242–258 (2014)CrossRefGoogle Scholar
  4. 4.
    Robinson, D.: Implications of neural networks for how we think about brain function. Behav. Brain Sci. 15, 644–655 (2012)Google Scholar
  5. 5.
    Holland, J.H., Holyoak, K.J., Nisbett, R.E., Thagard, P.R.: Induction: Steps of Inference, Learning, and Discovery. Cambridge University Press, Cambridge (2013)Google Scholar
  6. 6.
    Hu, R.: Channel access controlling in wireless integrated information network using smart grid system. Appl. Math. Inf. Sci. 6(3), 813–820 (2012)MathSciNetGoogle Scholar
  7. 7.
    Erfani, T.S., Utyuzhnikov, S.V.: Directed search field: a method for even generation of cloud plan frontier in multi-objective optimization. J. Eng. Optim. 43(5), 1–18 (2014). CrossRefGoogle Scholar
  8. 8.
    Ruo, H.: New network access control method using intelligence agent technology. Appl. Math. Inf. Sci. 7, 44–48 (2013)CrossRefGoogle Scholar
  9. 9.
    O’Neill, M., Ryan, C.: Grammatical Evolution: Evolutionary Automatic Programming in an Arbitrary Language. Kluwer Academic Publishers, Dordrecht (2013)zbMATHGoogle Scholar
  10. 10.
    Schwab, I., Link, N.: Reusable artificial intelligence from big data analysis generalization. In: Genetic and Evolutionary Computing (ICGEC 2011), (2013)Google Scholar
  11. 11.
    Hu, R., Hu, H., Xu, H.: Abnormal access matching through big data analytics in health neural network. Basic Clin. Pharmacol. Toxicol. 118, 73–73 (2016)Google Scholar
  12. 12.
    Hu, R., Hu, H., Xiao, Z.H.: Matching unit of health neural network unit based on relation object framework. Basic Clin. Pharmacol. Toxicol 118, 72–73 (2016)Google Scholar
  13. 13.
    Kotanchek, M., Smits, G., Kordon, A.: Industrial strength genetic programming. In: Riolo, R., Kluwer, B.W. (eds.) GP Theory and Practice. Springer, New York (2003)Google Scholar
  14. 14.
    Smits, G., Kotanchek, M.: Cloud plan-front exploitation in big data analysis. In: Riolo, R., Kluwer, B.W. (eds.) GP Theory and Practice. Springer, New York (2004)Google Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2018

Authors and Affiliations

  1. 1.School of Computer ScienceGuangdong Polytechnic Normal UniversityGuangzhouChina
  2. 2.The Guangzhou Key Laboratory of Digital Content Processing and Security TechnologiesGuangzhouChina
  3. 3.School of Internet Finance and Information EngineeringGuangdong University of FinanceGuangzhouChina

Personalised recommendations