Different Methods for Cluster’s Representation and Their Impact on the Effectiveness of Searching Through Such a Structure

Xiȩski, Tomasz; Nowak-Brzezińska, Agnieszka

doi:10.1007/978-3-319-98446-9_27

Tomasz Xiȩski¹⁷ &
Agnieszka Nowak-Brzezińska¹⁷

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 11056))

Included in the following conference series:

International Conference on Computational Collective Intelligence

1376 Accesses

Abstract

In this paper the topic of clustering and searching through clusters generated from real-world knowledge bases is discussed. Authors analyze three methods of cluster’s representatives creation, focusing on their advantages and flaws. What is more the authors introduce a concept of forward-chaining inference which uses a density-based DBSCAN algorithm to generate cluster of rules and speed up the whole inference process. The experiments were conducted on real-world knowledge bases with a relatively large number of rules to evaluate the efficiency of the proposed approach.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
A cluster is considered relevant when its similarity (to the set of facts) is above a given threshold.
2.
When using index structures, like R-trees [2], the average computational complexity is about \(O(\log n)\), where n is the number of instances [2].
3.
The DBSCAN algorithm distinguishes core (objects in the center of the cluster) and border points (objects near the cluster’s border), and uses specific notations of reachability and connectedness to form a cluster, which are discussed in [2].
4.
Because gaining access to large, real-world knowledge bases is very difficult.
5.
More information about the structure and experimental evaluation concerning these knowledge bases can be found in [17].
6.
The measure was chosen, because it is simple to comprehend and does not have a significant impact on the DBSCAN execution time.
7.
Although 3 types of representatives were discussed in this paper, only the first two approaches were used in the experiments, because for the last one it is difficult to specify how many threshold values should be considered and this optimal threshold (percentage of rules considered) may vary for every knowledge base.

References

Chemchem, A., Djenouri, Y., Drias, H.: Incremental induction rules clustering. In: 2013 8th International Workshop on Systems, Signal Processing and their Applications (WoSSPA), pp. 492–497, May 2013
Google Scholar
Ester, M., Kriegel, H.P., Sander, J., Xu, X.: A density-based algorithm for discovering clusters in large spatial databases with noise. In: Proceedings of 2nd International Conference on Knowledge Discovery and Data Mining, pp. 226–231. AAAI Press (1996)
Google Scholar
EXSYS: EXSYS Home Page (2018). http://www.exsys.com. Accessed Mar 2018
Google: Google Knowledge Graph (2018). https://developers.google.com/knowledge-graph/. Accessed Mar 2018
Hashizume, A., Yongguang, B., Du, X., Ishii, N.: Generating representative from clusters of association rules on numeric attributes. In: Liu, J., Cheung, Y., Yin, H. (eds.) IDEAL 2003. LNCS, vol. 2690, pp. 605–613. Springer, Heidelberg (2003). https://doi.org/10.1007/978-3-540-45080-1_82
Chapter Google Scholar
He, J., et al.: Rule clustering and super-rule generation for transmembrane segments prediction. In: 2005 IEEE Computational Systems Bioinformatics Conference - Workshops (CSBW 2005), pp. 224–227, August 2005
Google Scholar
JESS: JESS Information (2016). http://herzberg.ca.sandia.gov. Accessed May 2016
Latkowski, R., Mikołajczyk, M.: Data decomposition and decision rule joining for classification of data with missing values. In: Tsumoto, S., Słowiński, R., Komorowski, J., Grzymała-Busse, J.W. (eds.) RSCTC 2004. LNCS (LNAI), vol. 3066, pp. 254–263. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-25929-9_30
Chapter MATH Google Scholar
Liu, C., Urbani, J., Qi, G.: Efficient RDF stream reasoning with graphics processing units (GPUs). In: Proceedings of the 23rd International Conference on World Wide Web, WWW 2014 Companion, pp. 343–344. ACM, New York (2014)
Google Scholar
Nalepa, G.J., Ligęza, A., Kaczor, K.: Overview of knowledge formalization with XTT2 rules. In: Bassiliades, N., Governatori, G., Paschke, A. (eds.) RuleML 2011. LNCS, vol. 6826, pp. 329–336. Springer, Heidelberg (2011). https://doi.org/10.1007/978-3-642-22546-8_26
Chapter Google Scholar
Nowak-Brzezińska, A.: Mining rule-based knowledge bases inspired by rough set theory. Fundam. Inform. 148(1–2), 35–50 (2016)
Article MathSciNet Google Scholar
Nowak-Brzezinska, A., Wakulicz-Deja, A.: Exploration of knowledge bases inspired by rough set theory. In: Proceedings of the 24th International Workshop on Concurrency, Specification and Programming, Rzeszow, Poland, September 28–30, 2015, vol. 1, pp. 64–75 (2015)
Google Scholar
Pindur, R., Susmaga, R., Stefanowski, J.: Hyperplane aggregation of dominance decision rules. Fundam. Inform. 62(2), 117–137 (2004)
MathSciNet MATH Google Scholar
Reynolds, A.P., Richards, G., Rayward-Smith, V.J.: The application of K-medoids and PAM to the clustering of rules. In: Yang, Z.R., Yin, H., Everson, R.M. (eds.) IDEAL 2004. LNCS, vol. 3177, pp. 173–178. Springer, Heidelberg (2004). https://doi.org/10.1007/978-3-540-28651-6_25
Chapter Google Scholar
Rowe, M.C., Labhart, J., Bechtel, R., Matney, S., Carrow, S.: Forward chaining parallel inference. In: Proceedings of the Second IEEE Symposium on Parallel and Distributed Processing, pp. 455–462 (1990)
Google Scholar
Sikora, M., Gudys, A.: CHIRA - convex hull based iterative algorithm of rules aggregation. Fundam. Inform. 123(2), 143–170 (2013)
MathSciNet MATH Google Scholar
Simiński, R.: The experimental evaluation of rules partitioning conception for knowledge base systems. In: Borzemski, L., Grzech, A., Świątek, J., Wilimowska, Z. (eds.) Information Systems Architecture and Technology: Proceedings of 37th International Conference on Information Systems Architecture and Technology – ISAT 2016 – Part I. AISC, vol. 521, pp. 79–89. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-46583-8_7
Chapter Google Scholar
Simiński, R., Nowak-Brzezińska, A.: KBExplorator and KBExpertLib as the Tools for building medical decision support systems. In: Nguyen, N.-T., Manolopoulos, Y., Iliadis, L., Trawiński, B. (eds.) ICCCI 2016. LNCS (LNAI), vol. 9876, pp. 494–503. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-45246-3_47
Chapter Google Scholar
Xiȩski, T., Simiński, R.: A performance study of two inference algorithms for a distributed expert system shell. In: Kozielski, S., Mrozek, D., Kasprowski, P., Małysiak-Mrozek, B., Kostrzewa, D. (eds.) BDAS 2017. CCIS, vol. 716, pp. 512–526. Springer, Cham (2017). https://doi.org/10.1007/978-3-319-58274-0_40
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Institute of Computer Science, University of Silesia, ul. Bȩdzińska 39, 41-200, Sosnowiec, Poland
Tomasz Xiȩski & Agnieszka Nowak-Brzezińska

Authors

Tomasz Xiȩski
View author publications
You can also search for this author in PubMed Google Scholar
Agnieszka Nowak-Brzezińska
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tomasz Xiȩski .

Editor information

Editors and Affiliations

Faculty of Information Technology, Ton Duc Thang University, Ho Chi Minh City, Vietnam
Ngoc Thanh Nguyen
Department of Computer Science and Creative Technologies, University of the West of England, Bristol, United Kingdom
Elias Pimenidis
Department of Computer Science and Creative Technologies, University of the West of England, Bristol, United Kingdom
Zaheer Khan
Faculty of Computer Science and Management, Wrocław University of Science and Technology, Wrocław, Poland
Bogdan Trawiński

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Xiȩski, T., Nowak-Brzezińska, A. (2018). Different Methods for Cluster’s Representation and Their Impact on the Effectiveness of Searching Through Such a Structure. In: Nguyen, N., Pimenidis, E., Khan, Z., Trawiński, B. (eds) Computational Collective Intelligence. ICCCI 2018. Lecture Notes in Computer Science(), vol 11056. Springer, Cham. https://doi.org/10.1007/978-3-319-98446-9_27

Download citation

DOI: https://doi.org/10.1007/978-3-319-98446-9_27
Published: 08 August 2018
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-98445-2
Online ISBN: 978-3-319-98446-9
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics