Rules for Ontology Population from Text of Malaysia Medicinal Herbs Domain

Ibrahim, Zaharudin; Noah, Shahrul Azman; Noor, Mahanem Mat

doi:10.1007/978-3-642-16248-0_55

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 6401)

Included in the following conference series:

International Conference on Rough Sets and Knowledge Technology

Abstract

The primary goal of ontology development is to share and reuse domain knowledge among people or machines. This study focuses on extracting semantic relationships from unstructured textual documents about medicinal herbs collected from websites, and proposes a lexical pattern technique to acquire semantic relationships such as synonym, hyponym, and part-of relationships. The results show that nine object properties (or relations) and 105 lexico-syntactic patterns were identified manually, including one from the Hearst hyponym rules. The lexical patterns linked 7252 terms that are potential ontological terms. Based on this study, it is believed that determining the lexical patterns at an early stage helps in selecting relevant terms from the wide collection of terms in the corpus. However, the relations and lexico-syntactic patterns or rules have to be verified by a domain expert before the rules are applied to a wider collection in an attempt to find more possible rules.
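
As an illustration of what a lexico-syntactic pattern for the hyponym relation can look like, the following is a minimal sketch of the widely known Hearst pattern "NP such as NP, NP and NP". It is not the authors' rule set: the abstract does not say which Hearst rule was adopted, and the regular expression, the function name `extract_hyponyms`, the `is-a` label, and the sample sentence are all assumptions made for this example. The sketch uses plain regular expressions instead of part-of-speech tagging, so it keeps only the single word before "such as" as the hypernym and assumes the enumeration runs to the end of the clause.

```python
import re

# Assumed pattern: "<hypernym> such as <term>, <term> and <term>".
# Only the word immediately before "such as" is kept as the hypernym,
# because no noun-phrase chunking is done in this sketch.
SUCH_AS = re.compile(
    r"\b(?P<hyper>\w+)\s*,?\s+such as\s+(?P<hypos>[^.;]+)",
    re.IGNORECASE,
)

# Splits an enumeration on commas and on the coordinators "and" / "or".
LIST_SEPARATOR = re.compile(r",\s*(?:and|or)\s+|,\s*|\s+(?:and|or)\s+")


def extract_hyponyms(sentence):
    """Return (hyponym, 'is-a', hypernym) triples matched by the pattern."""
    triples = []
    for match in SUCH_AS.finditer(sentence):
        hypernym = match.group("hyper")
        for part in LIST_SEPARATOR.split(match.group("hypos")):
            term = part.strip()
            if term:
                triples.append((term, "is-a", hypernym))
    return triples


if __name__ == "__main__":
    # Hypothetical sentence, not taken from the study's corpus.
    text = ("Local markets sell medicinal herbs such as tongkat ali, "
            "kacip fatimah and misai kucing.")
    for triple in extract_hyponyms(text):
        print(triple)
```

Candidate triples produced this way would still need the domain-expert verification that the abstract calls for, since a surface pattern cannot tell a true hyponym from an incidental match.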

References

Haase, P., Sure, Y.: State-of-the-Art on Ontology Evolution. Institute AIFB, University of Karlsruhe (2004), http://www.aifb.uni-karlsruhe.de/WBS/ysu/publications/SEKT-D3.1.1.1.b.pdf
Staab, S., Schnurr, H.-P., Studer, R., Sure, Y.: Knowledge processes and ontologies. IEEE Intelligent Systems, Special Issue on Knowledge Management 16(1), 26–34 (2001)
Fuller, S., Revere, D., Bugni, P.F., Martin, G.M.: A knowledgebase system to enhance scientific discovery: Telemakus. Biomedical Digital Libraries 1, 2 (2004)
Swanson, D.R., Smalheiser, N.R.: An interactive system for finding complementary literatures: A stimulus to scientific discovery. Artificial Intelligence 91(2), 183–203 (1997)
Alani, H., Kim, S., Millard, D.E., Weal, M.J., Hall, W., Lewis, P.H., Shadbolt, N.R.: Automatic Ontology-Based Knowledge Extraction from Web Documents. IEEE Intelligent Systems 18(1), 14–21 (2003)
Cimiano, P., Hotho, A., Staab, S.: Learning Concept Hierarchies from Text Corpora using Formal Concept Analysis. Journal of Artificial Intelligence Research 24, 305–339 (2005)
Hearst, M.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the Fourteenth International Conference on Computational Linguistics, pp. 539–545 (1992)
Kawtrakul, A., Suktarachan, M., Imsombut, A.: Automatic Thai Ontology Construction and Maintenance System. In: Workshop on Papillon, Grenoble, France (2004)
AGROVOC Thesaurus, http://jodi.ecs.soton.ac.uk/incoming/Soergel/JoDI_FAO_Soergl_revC.html
Imsombut, A., Kawtrakul, A.: Automatic building of an ontology on the basis of text corpora in Thai. Journal of Language Resources and Evaluation 42(2), 137–149 (2007)
Zaharudin, I., Noah, S.A., Noor, M.M.: Knowledge Acquisition from Textual Documents for the Construction of Medicinal herb Ontology Domain. J. Applied Science 9(4), 794–798 (2009)
Moldovan, D., Girju, R., Rus, V.: Domain-specific knowledge acquisition from text. In: Proceedings of the sixth conference on Applied natural language processing, Seattle, Washington, pp. 268–275 (2000)
Pantel, P., Pennacchiotti, M.: Espresso: Leveraging Generic Patterns for Automatically Harvesting Semantic Relations. In: Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics, Sydney, Australia, pp. 113–120 (2006)
Maedche, A., Staab, S.: Ontology Learning for the Semantic Web. IEEE Intelligent Systems 16(2) (2001)
Xu, F., Kurz, D., Piskorski, J., Schmeier, S.: A Domain Adaptive Approach to Automatic Acquisition of Domain Relevant Terms and their Relations with Bootstrapping. In: Proceedings of the 3rd International Conference on Language Resources and Evaluation (LREC 2002), Las Palmas, Canary Islands, Spain, May 29-31 (2002)
Girju, R., Badulescu, A., Moldovan, D.: Learning Semantic Constraints for the Automatic Discovery of Part-Whole Relations. In: The Proceedings of the Human Language Technology Conference, Edmonton, Canada (2003)
Genia Tagger, http://www-tsujii.is.s.u-tokyo.ac.jp/GENIA/tagger/
Celjuska, D., Vargas-Vera, M.: Ontosophie: A semi-automatic system for ontology population from text. In: Proceedings of the 3rd International Conference on Natural Language Processing, ICON (2004)
Ralph, M.W., Norman, K.S.: Meta-rules as a basis for processing ill-formed input. Computational Linguistics 9(3-4) (1983)
Zagibalov, T., Carroll, J.: Automatic seed word selection for unsupervised sentiment classification of Chinese text. In: Proceedings of the 22nd International Conference on Computational Linguistics, vol. 1 (2008)

Author information

Authors and Affiliations

Department of Information System Management, Faculty of Information Management, Universiti Teknologi MARA, 40450, Shah Alam, Selangor, Malaysia
Zaharudin Ibrahim
Department of Information Science, Faculty of Information Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia
Zaharudin Ibrahim & Shahrul Azman Noah
School of Biosciences and Biotechnology, Faculty of Science and Technology, Universiti Kebangsaan Malaysia, 43600, Selangor, Malaysia
Mahanem Mat Noor

Editor information

Editors and Affiliations

School of Computer and Information Technology, Beijing Jiaotong University, 100044, Beijing, China
Jian Yu
Faculty of Economics, University of Catania, Corso Italia, 55, 95129, Catania, Italy
Salvatore Greco
Department of Mathematics and Computing Science, Saint Mary’s University, B3H 3C3, Halifax, Nova Scotia, Canada
Pawan Lingras
Institute of Computer Science and Technology, Chongqing University of Posts and Telecommunications, 400065, Chongqing, China
Guoyin Wang
Institute of Mathematics, Warsaw University, Banacha 2, 02-097, Warsaw, Poland
Andrzej Skowron

Cite this paper

Ibrahim, Z., Noah, S.A., Noor, M.M. (2010). Rules for Ontology Population from Text of Malaysia Medicinal Herbs Domain. In: Yu, J., Greco, S., Lingras, P., Wang, G., Skowron, A. (eds) Rough Set and Knowledge Technology. RSKT 2010. Lecture Notes in Computer Science, vol 6401. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-16248-0_55

DOI: https://doi.org/10.1007/978-3-642-16248-0_55
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-16247-3
Online ISBN: 978-3-642-16248-0
eBook Packages: Computer Science, Computer Science (R0)
