Abstract
The extraction of semantic relations from texts is currently gaining increasing interest. However, a large number of current methods are language and domain dependent, and the statistical and language-independent methods tend to work only with large amounts of text. This leaves out the extraction of semantic relations from standalone documents, such as single documents of unique subjects, reports from very specific domains, or small books.
We propose a statistical method to extract semantic relations using clusters of concepts. Clusters are areas in the documents where concepts occur more frequently. When clusters of different concepts occur in the same areas, they may represent highly related concepts.
Our method is language independent and we show comparative results for three different European languages.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Patwardhan, S., Pedersen, T.: Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 Workshop Making Sense of Sense, pp. 1–12 (2006)
Hsu, M.-H., Tsai, M.-F., Chen, H.-H.: Query expansion with conceptNet and wordNet: An intrinsic comparison. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 1–13. Springer, Heidelberg (2006)
Tikk, D., Yang, J.D., Bang, S.L.: Hierarchical text categorization using fuzzy relational thesaurus. KYBERNETIKA-PRAHA 39(5), 583–600 (2003)
Yousefi, J., Kosseim, L.: Using semantic constraints to improve question answering. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds.) NLDB 2006. LNCS, vol. 3999, pp. 118–128. Springer, Heidelberg (2006)
Sheth, A., Arpinar, I.B., Kashyap, V.: Relationships at the heart of semantic web: Modeling, discovering, and exploiting complex semantic relationships. In: Nikravesh, M., Azvine, B., Yager, R., Zadeh, L.A. (eds.) Enhancing the Power of the Internet. STUDFUZZ, vol. 139, pp. 63–94. Springer, Heidelberg (2003)
Ventura, J., Silva, J.F.: Mining concepts from texts. In: International Conference on Computer Science (2012)
Biemann, C.: Ontology Learning from Text: A Survey of Methods. LDV-Forum Journal 20(2), 75–93 (2005)
Gmez-Prez, A., Manzano-Macho, D.: Deliverable 1.5: A survey of ontology learning methods and techniques. Ontology Based Information Exchange for Knowledge Management and Electronic Commerce 29243 (2003)
Grefenstette, G.: Evaluation techniques for automatic semantic extraction: comparing syntactic and window based approaches. In: Corpus Processing for Lexical Acquisition, pp. 205–216. MIT Press, Cambridge (1996)
Akbik, A., Broß, J.: Wanderlust: Extracting Semantic Relations from Natural Language Text Using Dependency Grammar Patterns. In: Proceedings of the 18th International World Wide Web Conference, Madrid, Spain (2009)
Nakayama, K., Hara, T., Nishio, S.: Wikipedia Link Structure and Text Mining for Semantic Relation Extraction. In: SemSearch 2008, CEUR Workshop Proceedings (2008) ISSN 1613-0073
Ghani, R., Fano, A.: Using Text Mining to Infer Semantic Attributes for Retail Data Mining. In: Proceeding of the 2nd IEEE International Conference on Data Mining (ICDM 2002), Maebashi, Japan, pp. 195–203 (2002)
Snow, R., Jurafsky, A., Ng, A.: Learning syntactic patterns for automatic hypernym discovery. In: Advances in Neural Information Processing Systems (NIPS 2004), Vancouver, British Columbia (2004)
Mohit, B., Narayanan, S.: Semantic Extraction with Wide-Coverage Lexical Resources. In: Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, Edmonton, Canada, pp. 64–66 (2003)
Gildea, D., Jurafsky, D.: Automatic Labeling of Semantic Roles. Computational Linguistics 28(3), 245–288 (2002)
Ruiz-Casado, M., Alfonseca, E., Castells, P.: Automatic extraction of semantic relationships for wordNet by means of pattern learning from wikipedia. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 67–79. Springer, Heidelberg (2005)
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of the 17th International Conference on Computational Linguistics, Montreal, Canada, pp. 86–90 (1998)
Miller, G.A.: WordNet: A Lexical Database for English. Communications of the ACM 38(11), 39–41 (1995)
Deerwester, S., Harshman, R., Dumais, S., Furnas, G., Landauer, T.: Improving Information Retrieval with Latent Semantic Indexing. In: Proceedings of the 51st Annual Meeting of the American Society for Information Science, pp. 36–40 (1988)
Panchenko, A., Adeykin, S., Romanov, A., Romanov, P.: Extraction of Semantic Relations between Concepts with KNN Algorithms on Wikipedia. In: Proceedings of the 10th International Conference on Formal Concept Analysis, Leuven, Belgium (2012)
Terra, E., Clarke, C.L.A.: Frequency estimates for statistical word similarity measures. In: Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), pp. 244–251 (2003)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ventura, J., Silva, J. (2013). Using Clusters of Concepts to Extract Semantic Relations from Standalone Documents. In: Correia, L., Reis, L.P., Cascalho, J. (eds) Progress in Artificial Intelligence. EPIA 2013. Lecture Notes in Computer Science(), vol 8154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40669-0_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-40669-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40668-3
Online ISBN: 978-3-642-40669-0
eBook Packages: Computer ScienceComputer Science (R0)