Abstract
This paper proposes an adaptive document representation (concept vector space model) using Markov Chain model. The vector space representation is one of the most common models for representing documents in classification process. The document classification based on ontology classification approach is represented as a vector, whose components are ontology concepts and their relevance. The relevance is represented the by frequency of concepts’ occurrence. These concepts make various contributions in classification process. The contributions depend on the position of concepts where they are depicted in the ontology hierarchy. The hierarchy such as classes, subclasses and instances may have different values to represent the concepts’ importance. The weights to define concepts’ importance are generally selected by empirical analysis and are usually kept fixed. Thus, making it less effective and time consuming. We therefore propose a new model to automatically estimate weights of concepts within the ontology. This model initially maps the ontology to a Markov chain model and then calculates the transition probability matrix for this Markov chain. Further, the transition probability matrix is used to compute the probability of steady states based on left eigenvectors. Finally, the importance is calculated for each ontology concept. And, an enhanced concept vector space representation is created with concepts’ importance and concepts’ relevance. The concept vector space representation can be adapted for new ontology concepts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Raghavan, P.: Extracting and Exploiting Structure in Text Search. In: SIGMOD Conference, p. 635 (2003)
Al-Azmi, A.-A.R.: Data, Text, and Web Mining for Business Intellegence: A Survey. International Journal of Data Mining & Knowledge Management Process (IJDKP) 3(2) (2013)
Keikha, M., Khonsari, A., Oroumchian, F.: Rich document representation and classification: An analysis. Knowledge-Based Systems 22(1), 67–71 (2009)
Gu, H., Zhou, K.: Text Classification Based on Domain Ontology. Journal of Communication and Computer 3(5) (2006)
Yang, X., Sun, N., Zhang, Y., Kong, D.: General Framework for Text Classification based on Domain Ontology. In: 3rd International Workshop on Semantic Media Adaptation and Personalization (2008)
Fang, J., Guo, L., Wang, X., Yang, N.: Ontology-Based Automatic Classification and Ranking for Web Documents. In: Proceedings the Fourth International Conference on Fuzzy Systems and Knowlede Discovery, China, (2007)
da Costa Pereira, C., Tettamanzi, A.G.B.: An Evolutionary Approach to Ontology-Based User Model Acquisition. In: Di Gesú, V., Masulli, F., Petrosino, A. (eds.) WILF 2003. LNCS (LNAI), vol. 2955, pp. 25–32. Springer, Heidelberg (2006)
Frost, H.R., McCray, A.: Markov Chain Ontology Analysis (MCOA). BMC Bioinformatics 13(14), 23 (2012)
Brin, S., Page, L.: The anatomy of a large-scale hypertextual Web search engine. In: Proceedings of the 7th International Conference on World Wide Web, vol. 7 (1998)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer International Publishing Switzerland
About this paper
Cite this paper
Kastrati, Z., Imran, A.S. (2014). Adaptive Concept Vector Space Representation Using Markov Chain Model. In: Janowicz, K., Schlobach, S., Lambrix, P., Hyvönen, E. (eds) Knowledge Engineering and Knowledge Management. EKAW 2014. Lecture Notes in Computer Science(), vol 8876. Springer, Cham. https://doi.org/10.1007/978-3-319-13704-9_16
Download citation
DOI: https://doi.org/10.1007/978-3-319-13704-9_16
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-13703-2
Online ISBN: 978-3-319-13704-9
eBook Packages: Computer ScienceComputer Science (R0)