Supporting IoT Data Similarity at the Edge Towards Enabling Distributed Clustering

  • Hasibur RahmanEmail author
Part of the Advances in Intelligent Systems and Computing book series (AISC, volume 745)


Hundreds of billions of things are expected to be integrated for heterogeneous Internet-of-Things (IoT) applications, which promises to drive the Future Internet. This variant IoT data mandates intelligent solutions to make sense of current data in real-time closer to the data origin. Clustering physically distributed data would enable efficient utilization where finding similarity becomes the central issue. To counter this, Jaro-Winkler and Jaccard-like algorithm have been proposed and extended to a distributed protocol to enable distributed clustering at the edge. Performance study, on a scalable IoT platform and an edge device, shows feasibility and effectiveness of the approach with respect to efficiency and applicability.


IoT Distributed data Clustering similarity Edge computing 


  1. 1.
    Guillemin, P., Friess., P.: Internet of things strategic research roadmap. The Cluster of European Research Projects, Technical report, September 2009Google Scholar
  2. 2.
    Perera, C., et al.: Context aware computing for the Internet of Things: a survey. IEEE Commun. Surv. Tutor. 16(1), 414–454 (2014)CrossRefGoogle Scholar
  3. 3.
    Rahman, H., Rahmani, R.: Enabling distributed intelligence assisted Future Internet of Things Controller (FITC). Appl. Comput. Inf. (2017). CrossRefGoogle Scholar
  4. 4.
    Tele2 IoT talks, May 2017. Accessed 5 May 2017
  5. 5.
    Seth, A.: Internet of Things to smart IoT through semantic, cognitive, and perceptual computing. IEEE Intell. Syst. 31(2), 108–112 (2016)CrossRefGoogle Scholar
  6. 6.
    Maarala, A., Su, X., Riekki, J.: Semantic reasoning for context-aware internet of things applications. IEEE Internet Things J. PP(99), 1 (2016)Google Scholar
  7. 7.
    Rahman, H., Rahmani, R., Kanter, T.: Multi-modal Context-Aware reasoNer (CAN) at the edge of IoT. Procedia Comput. Sci. 109, 335–342 (2017)CrossRefGoogle Scholar
  8. 8.
    Perera, C., et. al.: Ca4iot: context awareness for Internet of Things. In: Proceedings of the IEEE International Conference on Green Computing and Communications (2012)Google Scholar
  9. 9.
    Rahman, H., et al.: Reasoning service enabling smarthome automation at the edge of context networks. In: Advances in Information Systems and Technologies. Springer (2016)CrossRefGoogle Scholar
  10. 10.
    Rahmani, A.M. et al: Exploiting smart e-Health gateways at the edge of healthcare Internet-of-Things: a fog computing approach. In: FGCS (2017)Google Scholar
  11. 11.
    TongKe, F.: Smart agriculture based on cloud computing and IOT. JCIT 8(2), 210–216 (2013)CrossRefGoogle Scholar
  12. 12.
    Rahmani, R., Rahman, H., Kanter, T.: On performance of logical-clustering of flow-sensors. Int. J. Comput. Sci. Issues (IJCSI) 10(5, No. 2), 1–13 (2013)Google Scholar
  13. 13.
    Rahmani, R., Rahman, H., Kanter, T.: Context-based logical clustering of flow sensors - exploiting hyperflow and hierarchical DHTs. In: Proceeding(s) of 4th International Conference on Next Generation Information Technology, CNIT (2013)Google Scholar
  14. 14.
    Tsai, C., et al.: Data mining for Internet of Things: a survey. IEEE Commun. Surv. Tutor. 16(1), 77–97 (2014). 1st Quart.MathSciNetCrossRefGoogle Scholar
  15. 15.
    Ienco, D., Pensa, R.G., Meo, R.: Context-based distance learning for categorical data clustering. In: Proceedings of the 8th International Symposium, IDA, pp. 83–94 (2009)CrossRefGoogle Scholar
  16. 16.
    Lulli, A. et. al.: Scalable k-NN based text clustering. In: 2015 IEEE International Conference on Big Data (Big Data). IEEE (2015)Google Scholar
  17. 17.
    Tara, L., Prasad, G.V.S.N.R.V.: PageRank technique along with probability-maximization in sentence-clustering. In: IJESC.
  18. 18.
    Kanungo, T., et al.: An efficient K-means clustering algorithm: analysis and implementation. IEEE Trans. Pattern Anal. Mach. Intell. 24(7), 881–892 (2002)CrossRefGoogle Scholar
  19. 19.
    Rahman, H., Rahmani, R., Kanter, T.: Enabling scalable publish/subscribe for logical-clustering in crowdsourcing via mediasense. In: IEEE SAI Conference (2014)Google Scholar
  20. 20.
    Ghahramani, Z.: Probabilistic machine learning and artificial intelligence. Nature 7553, 452–459 (2015)CrossRefGoogle Scholar
  21. 21.
    Cohen, W., Ravikumar, P., Fienberg, S.: A comparison of string metrics for matching names and records. In: The International Conference on KDD (2003)Google Scholar
  22. 22.
    Barberousse, A., Franceschelli, S., Imbert, C.: Computer simulations as experiments. Synthese 169(3), 557–574 (2009)MathSciNetCrossRefGoogle Scholar

Copyright information

© Springer International Publishing AG, part of Springer Nature 2018

Authors and Affiliations

  1. 1.Department of Computer and Systems Sciences (DSV)Stockholm UniversityKistaSweden

Personalised recommendations