Using Clusters of Concepts to Extract Semantic Relations from Standalone Documents

Ventura, João; Silva, Joaquim

doi:10.1007/978-3-642-40669-0_44

João Ventura²² &
Joaquim Silva²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 8154))

Included in the following conference series:

Portuguese Conference on Artificial Intelligence

2832 Accesses

Abstract

The extraction of semantic relations from texts is currently gaining increasing interest. However, a large number of current methods are language and domain dependent, and the statistical and language-independent methods tend to work only with large amounts of text. This leaves out the extraction of semantic relations from standalone documents, such as single documents of unique subjects, reports from very specific domains, or small books.

We propose a statistical method to extract semantic relations using clusters of concepts. Clusters are areas in the documents where concepts occur more frequently. When clusters of different concepts occur in the same areas, they may represent highly related concepts.

Our method is language independent and we show comparative results for three different European languages.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Patwardhan, S., Pedersen, T.: Using WordNet-based context vectors to estimate the semantic relatedness of concepts. In: Proceedings of the EACL 2006 Workshop Making Sense of Sense, pp. 1–12 (2006)
Google Scholar
Hsu, M.-H., Tsai, M.-F., Chen, H.-H.: Query expansion with conceptNet and wordNet: An intrinsic comparison. In: Ng, H.T., Leong, M.-K., Kan, M.-Y., Ji, D. (eds.) AIRS 2006. LNCS, vol. 4182, pp. 1–13. Springer, Heidelberg (2006)
Chapter Google Scholar
Tikk, D., Yang, J.D., Bang, S.L.: Hierarchical text categorization using fuzzy relational thesaurus. KYBERNETIKA-PRAHA 39(5), 583–600 (2003)
MATH Google Scholar
Yousefi, J., Kosseim, L.: Using semantic constraints to improve question answering. In: Kop, C., Fliedl, G., Mayr, H.C., Métais, E. (eds.) NLDB 2006. LNCS, vol. 3999, pp. 118–128. Springer, Heidelberg (2006)
Chapter Google Scholar
Sheth, A., Arpinar, I.B., Kashyap, V.: Relationships at the heart of semantic web: Modeling, discovering, and exploiting complex semantic relationships. In: Nikravesh, M., Azvine, B., Yager, R., Zadeh, L.A. (eds.) Enhancing the Power of the Internet. STUDFUZZ, vol. 139, pp. 63–94. Springer, Heidelberg (2003)
Chapter Google Scholar
Ventura, J., Silva, J.F.: Mining concepts from texts. In: International Conference on Computer Science (2012)
Google Scholar
Biemann, C.: Ontology Learning from Text: A Survey of Methods. LDV-Forum Journal 20(2), 75–93 (2005)
Google Scholar
Gmez-Prez, A., Manzano-Macho, D.: Deliverable 1.5: A survey of ontology learning methods and techniques. Ontology Based Information Exchange for Knowledge Management and Electronic Commerce 29243 (2003)
Google Scholar
Grefenstette, G.: Evaluation techniques for automatic semantic extraction: comparing syntactic and window based approaches. In: Corpus Processing for Lexical Acquisition, pp. 205–216. MIT Press, Cambridge (1996)
Google Scholar
Akbik, A., Broß, J.: Wanderlust: Extracting Semantic Relations from Natural Language Text Using Dependency Grammar Patterns. In: Proceedings of the 18th International World Wide Web Conference, Madrid, Spain (2009)
Google Scholar
Nakayama, K., Hara, T., Nishio, S.: Wikipedia Link Structure and Text Mining for Semantic Relation Extraction. In: SemSearch 2008, CEUR Workshop Proceedings (2008) ISSN 1613-0073
Google Scholar
Ghani, R., Fano, A.: Using Text Mining to Infer Semantic Attributes for Retail Data Mining. In: Proceeding of the 2nd IEEE International Conference on Data Mining (ICDM 2002), Maebashi, Japan, pp. 195–203 (2002)
Google Scholar
Snow, R., Jurafsky, A., Ng, A.: Learning syntactic patterns for automatic hypernym discovery. In: Advances in Neural Information Processing Systems (NIPS 2004), Vancouver, British Columbia (2004)
Google Scholar
Mohit, B., Narayanan, S.: Semantic Extraction with Wide-Coverage Lexical Resources. In: Proceedings of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies, Edmonton, Canada, pp. 64–66 (2003)
Google Scholar
Gildea, D., Jurafsky, D.: Automatic Labeling of Semantic Roles. Computational Linguistics 28(3), 245–288 (2002)
Article Google Scholar
Ruiz-Casado, M., Alfonseca, E., Castells, P.: Automatic extraction of semantic relationships for wordNet by means of pattern learning from wikipedia. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 67–79. Springer, Heidelberg (2005)
Chapter Google Scholar
Baker, C.F., Fillmore, C.J., Lowe, J.B.: The Berkeley FrameNet project. In: Proceedings of the 17th International Conference on Computational Linguistics, Montreal, Canada, pp. 86–90 (1998)
Google Scholar
Miller, G.A.: WordNet: A Lexical Database for English. Communications of the ACM 38(11), 39–41 (1995)
Article Google Scholar
Deerwester, S., Harshman, R., Dumais, S., Furnas, G., Landauer, T.: Improving Information Retrieval with Latent Semantic Indexing. In: Proceedings of the 51st Annual Meeting of the American Society for Information Science, pp. 36–40 (1988)
Google Scholar
Panchenko, A., Adeykin, S., Romanov, A., Romanov, P.: Extraction of Semantic Relations between Concepts with KNN Algorithms on Wikipedia. In: Proceedings of the 10th International Conference on Formal Concept Analysis, Leuven, Belgium (2012)
Google Scholar
Terra, E., Clarke, C.L.A.: Frequency estimates for statistical word similarity measures. In: Proceedings of the Human Language Technology and North American Chapter of Association of Computational Linguistics Conference 2003 (HLT/NAACL 2003), pp. 244–251 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

CITI/DI/FCT, Universidade Nova de Lisboa, Campus de Caparica, 2829-516, Caparica, Portugal
João Ventura & Joaquim Silva

Authors

João Ventura
View author publications
You can also search for this author in PubMed Google Scholar
Joaquim Silva
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Informatics Department, University of Lisbon, Campo Grande, 174-016, Lisbon, Portugal
Luís Correia
Information Systems Department, University of Minho, Campus de Azurém, 4800-058, Guimarães, Portugal
Luís Paulo Reis
Department of Education, University of the Azores, Campus de Angra do Heroísmo, Angra do Heroísma, 9700-042, Azores, Portugal
José Cascalho

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Ventura, J., Silva, J. (2013). Using Clusters of Concepts to Extract Semantic Relations from Standalone Documents. In: Correia, L., Reis, L.P., Cascalho, J. (eds) Progress in Artificial Intelligence. EPIA 2013. Lecture Notes in Computer Science(), vol 8154. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-40669-0_44

Download citation

DOI: https://doi.org/10.1007/978-3-642-40669-0_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-40668-3
Online ISBN: 978-3-642-40669-0
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics