Skip to main content

Different Methods for Cluster’s Representation and Their Impact on the Effectiveness of Searching Through Such a Structure

  • Conference paper
  • First Online:
Computational Collective Intelligence (ICCCI 2018)

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11056))

Included in the following conference series:

  • 1376 Accesses

Abstract

In this paper the topic of clustering and searching through clusters generated from real-world knowledge bases is discussed. Authors analyze three methods of cluster’s representatives creation, focusing on their advantages and flaws. What is more the authors introduce a concept of forward-chaining inference which uses a density-based DBSCAN algorithm to generate cluster of rules and speed up the whole inference process. The experiments were conducted on real-world knowledge bases with a relatively large number of rules to evaluate the efficiency of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    A cluster is considered relevant when its similarity (to the set of facts) is above a given threshold.

  2. 2.

    When using index structures, like R-trees [2], the average computational complexity is about \(O(\log n)\), where n is the number of instances [2].

  3. 3.

    The DBSCAN algorithm distinguishes core (objects in the center of the cluster) and border points (objects near the cluster’s border), and uses specific notations of reachability and connectedness to form a cluster, which are discussed in [2].

  4. 4.

    Because gaining access to large, real-world knowledge bases is very difficult.

  5. 5.

    More information about the structure and experimental evaluation concerning these knowledge bases can be found in [17].

  6. 6.

    The measure was chosen, because it is simple to comprehend and does not have a significant impact on the DBSCAN execution time.

  7. 7.

    Although 3 types of representatives were discussed in this paper, only the first two approaches were used in the experiments, because for the last one it is difficult to specify how many threshold values should be considered and this optimal threshold (percentage of rules considered) may vary for every knowledge base.

References

  1. Chemchem, A., Djenouri, Y., Drias, H.: Incremental induction rules clustering. In: 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA), pp. 492–497, May 2013

    Google Scholar 

  2. Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press (1996)

    Google Scholar 

  3. EXSYS: EXSYS Home Page (2018). http://www.exsys.com. Accessed Mar 2018

  4. Google: Google Knowledge Graph (2018). https://developers.google.com/knowledge-graph/. Accessed Mar 2018

  5. Hashizume, A., Yongguang, B., Du, X., Ishii, N.: Generating representative from clusters of association rules on numeric attributes. In: Liu, J., Cheung, Y., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 605–613. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45080-1_82

    Chapter  Google Scholar 

  6. He, J., et al.: Rule clustering and super-rule generation for transmembrane segments prediction. In: 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW 2005), pp. 224–227, August 2005

    Google Scholar 

  7. JESS: JESS Information (2016). http://herzberg.ca.sandia.gov. Accessed May 2016

  8. Latkowski, R., Mikołajczyk, M.: Data decomposition and decision rule joining for classification of data with missing values. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 254–263. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25929-9_30

    Chapter  MATH  Google Scholar 

  9. Liu, C., Urbani, J., Qi, G.: Efficient RDF stream reasoning with graphics processing units (GPUs). In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014 Companion, pp. 343–344. ACM, New York (2014)

    Google Scholar 

  10. Nalepa, G.J., Ligęza, A., Kaczor, K.: Overview of knowledge formalization with XTT2 rules. In: Bassiliades, N., Governatori, G., Paschke, A. (eds.) RuleML 2011. LNCS, vol. 6826, pp. 329–336. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22546-8_26

    Chapter  Google Scholar 

  11. Nowak-Brzezińska, A.: Mining rule-based knowledge bases inspired by rough set theory. Fundam. Inform. 148(1–2), 35–50 (2016)

    Article  MathSciNet  Google Scholar 

  12. Nowak-Brzezinska, A., Wakulicz-Deja, A.: Exploration of knowledge bases inspired by rough set theory. In: Proceedings of the 24th International Workshop on Concurrency, Specification and Programming, Rzeszow, Poland, September 28–30, 2015, vol. 1, pp. 64–75 (2015)

    Google Scholar 

  13. Pindur, R., Susmaga, R., Stefanowski, J.: Hyperplane aggregation of dominance decision rules. Fundam. Inform. 62(2), 117–137 (2004)

    MathSciNet  MATH  Google Scholar 

  14. Reynolds, A.P., Richards, G., Rayward-Smith, V.J.: The application of K-medoids and PAM to the clustering of rules. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 173–178. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-28651-6_25

    Chapter  Google Scholar 

  15. Rowe, M.C., Labhart, J., Bechtel, R., Matney, S., Carrow, S.: Forward chaining parallel inference. In: Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing, pp. 455–462 (1990)

    Google Scholar 

  16. Sikora, M., Gudys, A.: CHIRA - convex hull based iterative algorithm of rules aggregation. Fundam. Inform. 123(2), 143–170 (2013)

    MathSciNet  MATH  Google Scholar 

  17. Simiński, R.: The experimental evaluation of rules partitioning conception for knowledge base systems. In: Borzemski, L., Grzech, A., Świątek, J., Wilimowska, Z. (eds.) Information Systems Architecture and Technology: Proceedings of 37th International Conference on Information Systems Architecture and Technology – ISAT 2016 – Part I. AISC, vol. 521, pp. 79–89. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-46583-8_7

    Chapter  Google Scholar 

  18. Simiński, R., Nowak-Brzezińska, A.: KBExplorator and KBExpertLib as the Tools for building medical decision support systems. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 494–503. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_47

    Chapter  Google Scholar 

  19. Xiȩski, T., Simiński, R.: A performance study of two inference algorithms for a distributed expert system shell. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds.) BDAS 2017. CCIS, vol. 716, pp. 512–526. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58274-0_40

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Tomasz Xiȩski .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Xiȩski, T., Nowak-Brzezińska, A. (2018). Different Methods for Cluster’s Representation and Their Impact on the Effectiveness of Searching Through Such a Structure. In: Nguyen, N., Pimenidis, E., Khan, Z., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2018. Lecture Notes in Computer Science(), vol 11056. Springer, Cham. https://doi.org/10.1007/978-3-319-98446-9_27

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-98446-9_27

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-98445-2

  • Online ISBN: 978-3-319-98446-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics