Abstract
In this paper, we elaborate on an approach to construction of semantic-linguistic feature vectors (FV) that are used in search. These FVs are built based on domain semantics encoded in an ontology and enhanced by a relevant terminology from Web documents. The value of this approach is twofold. First, it captures relevant semantics from an ontology, and second, it accounts for statistically significant collocations of other terms and phrases in relation to the ontology entities. The contribution of this paper is the FV construction process and its evaluation. Recommendations and lessons learnt are laid down.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, E., Ansa, O., Hovy, E.H., Martínez, D.: Enriching very large ontologies using the WWW. In: ECAI Workshop on Ontology Learning. CEUR-WS.org, vol. 31 (2000)
Bergamaschi, S., Bouquet, P., Giazomuzzi, D., Guerra, F., Po, L., Vincini, M.: An Incremental Method for the Lexical Annotation of Domain Ontologies. Int. J. on Semantic Web and Information Systems 3(3), 57–80 (2007)
Bouquet, P., Serafini, L., Zanobini, S.: Semantic Coordination: A New Approach and an Application. In: Fensel, D., Sycara, K., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, pp. 130–145. Springer, Heidelberg (2003)
Bry, F., Koch, C., Furche, T., Schaffert, S., Badea, L., Berger, S.: Querying the Web Reconsidered: Design Principles for Versatile Web Query Languages. Int. J. on Semantic Web and Information Systems 1(2), 1–21 (2005)
Castells, P., Fernandez, M., Vallet, D.: An adaptation of the vector-space model for ontology-based information retrieval. IEEE TKDE 19(2), 261–272 (2007)
Cilibrasi, R., Vitanyi, P.: The Google Similarity Distance. IEEE Transactions on Knowledge and Data Engineering 19(3), 370–383
Formica, A., Missikoff, M., Pourabbas, E., Taglino, F.: Weighted Ontology for Semantic Search. In: Meersman, R., Tari, Z. (eds.) OTM 2008, Part II. LNCS, vol. 5332, pp. 1289–1303. Springer, Heidelberg (2008)
Justeson, J.S., Katz, S.M.: Technical terminology: some linguistic properties and an algorithm for identification in text. Natural Language Engineering, vol. 1, pp. 9–27. Cambridge University Press, Cambridge (1995)
Lopez, V., Sabou, M., Motta, E.: PowerMap: Mapping the Real Semantic Web on the Fly. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L.M. (eds.) ISWC 2006. LNCS, vol. 4273, pp. 414–427. Springer, Heidelberg (2006)
Mitchell, T.M.: Machine Learning. McGraw-Hill, New York (1997)
Ogden, C.K., Richards, I.A.: The meaning of meaning: a study of the influence of language upon thought and of the science of symbolism. Kegan Paul, Trench, Trubner & Co., London (1930)
Osinski, S., Weiss, D.: A Concept-Driven Algorithm for Clustering Search Results. IEEE Intelligent Systems 20, 48–54 (2005)
Panagis, Y., Sakkopoulos, E., Garofalakis, J., Tsakalidis, A.: Optimisation mechanism for web search results using topic knowledge. Int. J. Knowledge and Learning 2(1/2), 140–153 (2006)
Solskinnsbakk, G., Gulla, J.: Ontological Profiles in Enterprise Search. Knowledge Engineering: Practice and Patterns, 302–317 (2008)
Strasunskas, D., Tomassen, S.L.: The role of ontology in enhancing semantic searches: the EvOQS framework and its initial validation. Int. J. Knowledge and Learning 4, 398–414 (2008)
Su, X., Gulla, J.A.: An information retrieval approach to ontology mapping. Data & Knowledge Engineering 58, 47–69 (2006)
Suomela, S., Kekalainen, J.: Ontology as a search-tool: A study of real user’s query formulation with and without conceptual support. In: Losada, D.E., Fernández-Luna, J.M. (eds.) ECIR 2005. LNCS, vol. 3408, pp. 315–329. Springer, Heidelberg (2005)
Tomassen, S.L., Strasunskas, D.: Construction of Ontology based Semantic-Linguistic Feature Vectors for Searching: the Process and Effect. In: WI-IAT 2009. IEEE Computer Society, Milano (2009)
Yang, H.-C.: A method for automatic construction of learning contents in semantic web by a text mining approach. Int. J. Knowledge and Learning 2(1/2), 89–105 (2006)
Zhou, X., Hu, X., Zhang, X.: Topic Signature Language Models for Ad hoc Retrieval. IEEE Transactions on Knowledge and Data Engineering 19, 1276–1287 (2007)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tomassen, S.L., Strasunskas, D. (2009). Semantic-Linguistic Feature Vectors for Search: Unsupervised Construction and Experimental Validation. In: Gómez-Pérez, A., Yu, Y., Ding, Y. (eds) The Semantic Web. ASWC 2009. Lecture Notes in Computer Science, vol 5926. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-10871-6_14
Download citation
DOI: https://doi.org/10.1007/978-3-642-10871-6_14
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-10870-9
Online ISBN: 978-3-642-10871-6
eBook Packages: Computer ScienceComputer Science (R0)