Abstract
In this paper the topic of clustering and searching through clusters generated from real-world knowledge bases is discussed. Authors analyze three methods of cluster’s representatives creation, focusing on their advantages and flaws. What is more the authors introduce a concept of forward-chaining inference which uses a density-based DBSCAN algorithm to generate cluster of rules and speed up the whole inference process. The experiments were conducted on real-world knowledge bases with a relatively large number of rules to evaluate the efficiency of the proposed approach.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
A cluster is considered relevant when its similarity (to the set of facts) is above a given threshold.
- 2.
- 3.
The DBSCAN algorithm distinguishes core (objects in the center of the cluster) and border points (objects near the cluster’s border), and uses specific notations of reachability and connectedness to form a cluster, which are discussed in [2].
- 4.
Because gaining access to large, real-world knowledge bases is very difficult.
- 5.
More information about the structure and experimental evaluation concerning these knowledge bases can be found in [17].
- 6.
The measure was chosen, because it is simple to comprehend and does not have a significant impact on the DBSCAN execution time.
- 7.
Although 3 types of representatives were discussed in this paper, only the first two approaches were used in the experiments, because for the last one it is difficult to specify how many threshold values should be considered and this optimal threshold (percentage of rules considered) may vary for every knowledge base.
References
Chemchem, A., Djenouri, Y., Drias, H.: Incremental induction rules clustering. In: 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA), pp. 492–497, May 2013
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press (1996)
EXSYS: EXSYS Home Page (2018). http://www.exsys.com. Accessed Mar 2018
Google: Google Knowledge Graph (2018). https://developers.google.com/knowledge-graph/. Accessed Mar 2018
Hashizume, A., Yongguang, B., Du, X., Ishii, N.: Generating representative from clusters of association rules on numeric attributes. In: Liu, J., Cheung, Y., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 605–613. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45080-1_82
He, J., et al.: Rule clustering and super-rule generation for transmembrane segments prediction. In: 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW 2005), pp. 224–227, August 2005
JESS: JESS Information (2016). http://herzberg.ca.sandia.gov. Accessed May 2016
Latkowski, R., Mikołajczyk, M.: Data decomposition and decision rule joining for classification of data with missing values. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 254–263. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25929-9_30
Liu, C., Urbani, J., Qi, G.: Efficient RDF stream reasoning with graphics processing units (GPUs). In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014 Companion, pp. 343–344. ACM, New York (2014)
Nalepa, G.J., Ligęza, A., Kaczor, K.: Overview of knowledge formalization with XTT2 rules. In: Bassiliades, N., Governatori, G., Paschke, A. (eds.) RuleML 2011. LNCS, vol. 6826, pp. 329–336. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22546-8_26
Nowak-Brzezińska, A.: Mining rule-based knowledge bases inspired by rough set theory. Fundam. Inform. 148(1–2), 35–50 (2016)
Nowak-Brzezinska, A., Wakulicz-Deja, A.: Exploration of knowledge bases inspired by rough set theory. In: Proceedings of the 24th International Workshop on Concurrency, Specification and Programming, Rzeszow, Poland, September 28–30, 2015, vol. 1, pp. 64–75 (2015)
Pindur, R., Susmaga, R., Stefanowski, J.: Hyperplane aggregation of dominance decision rules. Fundam. Inform. 62(2), 117–137 (2004)
Reynolds, A.P., Richards, G., Rayward-Smith, V.J.: The application of K-medoids and PAM to the clustering of rules. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 173–178. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-28651-6_25
Rowe, M.C., Labhart, J., Bechtel, R., Matney, S., Carrow, S.: Forward chaining parallel inference. In: Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing, pp. 455–462 (1990)
Sikora, M., Gudys, A.: CHIRA - convex hull based iterative algorithm of rules aggregation. Fundam. Inform. 123(2), 143–170 (2013)
Simiński, R.: The experimental evaluation of rules partitioning conception for knowledge base systems. In: Borzemski, L., Grzech, A., Świątek, J., Wilimowska, Z. (eds.) Information Systems Architecture and Technology: Proceedings of 37th International Conference on Information Systems Architecture and Technology – ISAT 2016 – Part I. AISC, vol. 521, pp. 79–89. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-46583-8_7
Simiński, R., Nowak-Brzezińska, A.: KBExplorator and KBExpertLib as the Tools for building medical decision support systems. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 494–503. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_47
Xiȩski, T., Simiński, R.: A performance study of two inference algorithms for a distributed expert system shell. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds.) BDAS 2017. CCIS, vol. 716, pp. 512–526. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58274-0_40
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this paper
Cite this paper
Xiȩski, T., Nowak-Brzezińska, A. (2018). Different Methods for Cluster’s Representation and Their Impact on the Effectiveness of Searching Through Such a Structure. In: Nguyen, N., Pimenidis, E., Khan, Z., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2018. Lecture Notes in Computer Science(), vol 11056. Springer, Cham. https://doi.org/10.1007/978-3-319-98446-9_27
Download citation
DOI: https://doi.org/10.1007/978-3-319-98446-9_27
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98445-2
Online ISBN: 978-3-319-98446-9
eBook Packages: Computer ScienceComputer Science (R0)