Abstract
The primary goal of ontology development is to share and reuse domain knowledge among people or machines. This study focuses on the approach of extracting semantic relationships from unstructured textual documents related to medicinal herb from websites and proposes a lexical pattern technique to acquire semantic relationships such as synonym, hyponym, and part-of relationships. The results show of nine object properties (or relations) and 105 lexico-syntactic patterns have been identified manually, including one from the Hearst hyponym rules. The lexical patterns have linked 7252 terms that have the potential as ontological terms. Based on this study, it is believed that determining the lexical pattern at an early stage is helpful in selecting relevant term from a wide collection of terms in the corpus. However, the relations and lexico-syntactic patterns or rules have to be verified by domain expert before employing the rules to the wider collection in an attempt to find more possible rules.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Haase, P., Sure, Y.: State-of-the-Art on Ontology Evolution. Institute AIFB, University of Karlsruhe (2004), http://www.aifb.uni-karlsruhe.de/WBS/ysu/publications/SEKT-D3.1.1.1.b.pdf
Staab, S., Schnurr, H.-P., Studer, R., Sure, Y.: Knowledge processes and ontologies. IEEE Intelligent Systems, Special Issue on Knowledge Management 16(1), 26–34 (2001)
Fuller, S., Revere, D., Bugni, P.F., Martin, G.M.: A knowledgebase system to enhance scientific discovery: Telemakus. Biomedical Digital Libraries 1, 2 (2004)
Swanson, D.R., Smalheiser, N.R.: An interactive system for finding complementary literatures: A stimulus to scientific discovery. Artificial Intelligence 91(2), 183–203 (1997)
Alani, H., Kim, S., Millard, D.E., Weal, M.J., Hall, W., Lewis, P.H., Shadbolt, N.R.: Automatic Ontology-Based Knowledge Extraction from Web Documents. IEEE Intelligent Systems 18(1), 14–21 (2003)
Cimiano, P., Hotho, A., Staab, S.: Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis. Journal of Artificial Intelligence Research 24, 305–339 (2005)
Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics, pp. 539–545 (1992)
Kawtrakul, A., Suktarachan, M., Imsombut, A.: Automatic Thai Ontology Construction and Maintenance System. In: Workshop on Papillon, Grenoble, France (2004)
AGROVOC Thesaurus, http://jodi.ecs.soton.ac.uk/incoming/Soergel/JoDI_FAO_Soergl_revC.html
Imsombut, A., Kawtrakul, A.: Automatic building of an ontology on the basis of text corpora in Thai. Journal of Language Resources and Evaluation 42(2), 137–149 (2007)
Zaharudin, I., Noah, S.A., Noor, M.M.: Knowledge Acquisition from Textual Documents for the Construction of Medicinal herb Ontology Domain. J. Applied Science 9(4), 794–798 (2009)
Moldovan, D., Girju, R., Rus, V.: Domain-specific knowledge acquisition from text. In: Proceedings of the sixth conference on Applied natural language processing, Seattle, Washington, pp. 268–275 (2000)
Pantel, P., Pennacchiotti, M.: Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 113–120 (2006)
Maedche, A., Staab, S.: Ontology Learning for the Semantic Web. IEEE Intelligent Systems 16(2) (2001)
Xu., F., Kurz., D., Piskorski., J., Schmeier, S.: A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping. In: Proceedings of the 3rd International Conference on Language Resources an Evaluation (LREC 2002), Las Palmas, Canary Islands, Spain, May 29-31 (2002)
Girju, R., Badulescu, A., Moldovan, D.: Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations. In: The Proceedings of the Human Language Technology Conference, Edmonton, Canada (2003)
Genia Tagger, http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/tagger/
Celjuska, D., Vargas-Vera, M.: Ontosophie: A semi-automatic system for ontology population from text. In: Proceedings of the 3rd International Conference on Natural Language Processing, ICON (2004)
Ralph, M.W., Norman, K.S.: Meta-rules as a basis for processing ill-formed input. Computational Linguistics 9(3-4) (1983)
Zagibalov., T., Carroll, J.: Automatic seed word selection for unsupervised sentiment classification of Chinese text. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1 (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ibrahim, Z., Noah, S.A., Noor, M.M. (2010). Rules for Ontology Population from Text of Malaysia Medicinal Herbs Domain. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds) Rough Set and Knowledge Technology. RSKT 2010. Lecture Notes in Computer Science(), vol 6401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16248-0_55
Download citation
DOI: https://doi.org/10.1007/978-3-642-16248-0_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16247-3
Online ISBN: 978-3-642-16248-0
eBook Packages: Computer ScienceComputer Science (R0)