Abstract
Biologists usually focus on only a small, individualized, sub-domain of the huge domain of biology. With respect to their sub-domain, they often need data collected from various different web resources. In this research, we provide a tool with which biologists can generate a sub-domain-size, user-specific ontology that can extract data from web resources. The central idea is to let a user provide a seed, which consists of a single data instance embedded within the concepts of interest. Given a seed, the system can generate an extraction ontology, match information with the user’s view based on the seed, and collect information from online repositories. Our initial experimentations indicate that our prototype system can successfully match source data with an ontology seed and gather information from different sources with respect to user-specific, personalized views.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Buitelaar, P., Olejnik, D., Sintek, M.: Ontolt: A protege plug-in for ontology extraction from text. In: Fensel, D., Sycara, K.P., Mylopoulos, J. (eds.) ISWC 2003. LNCS, vol. 2870, Springer, Heidelberg (2003)
Cimiano, P., Völker, J.: Text2Onto—a framework for ontology learning and data-driven change discovery. In: Montoyo, A., Muńoz, R., Métais, E. (eds.) NLDB 2005. LNCS, vol. 3513, pp. 227–238. Springer, Heidelberg (2005)
Ding, Y., Lonsdale, D.W., Embley, D.W., Hepp, M., Xu, L.: Generating ontologies via language components and ontology reuse. In: NLDB 2007. Proceedings of 12th International Conference on Applications of Natural Language to Information Systems, Paris, France, pp. 131–142 (June 2007)
Embley, D.W., Campbell, D.M., Jiang, Y.S., Liddle, S.W., Lonsdale, D.W., Ng, Y.-K., Smith, R.D.: Conceptual-model-based data extraction from multiple-record Web pages. Data & Knowledge Engineering 31(3), 227–251 (1999)
Navigli, R., Velardi, P., Cucchiarelli, A., Neri, F.: Quantitative and qualitative evaluation of the OntoLearn ontology learning system. In: Proceedings of the 20th International Conference on Computational Linguistics, Geneva, Switzerland, pp. 1043–1050 (August 2004)
Noy, N.F., Sintek, M., Decker, S., Crubezy, M., Fergerson, R.W., Musen, M.: Creating semantic web contents with Protege-2000. IEEE Intelligent Systems 16(2), 60–71 (2001)
Pivk, A.: Automatic ontology generation from web tabular structures. AI Communnications 19(1), 83–85 (2006)
Spyns, P., Oberle, D., Volz, R., Zheng, J., Jarrar, M., Sure, Y., Studer, R., Meersman, R.: OntoWeb—a semantic web community portal. In: Karagiannis, D., Reimer, U. (eds.) PAKM 2002. LNCS (LNAI), vol. 2569, pp. 189–200. Springer, Heidelberg (2002)
Tao, C., Embley, D.W.: Automatic hidden-web table interpretation by sibling page comparison. In: Afrati, F.N., Kolaitis, P. (eds.) ER 2007. Proceedings of the 26th International Conference on Conceptual Modeling, Auckland, New Zealand, pp. 389–404 (November 2007)
Tijerino, Y.A., Embley, D.W., Lonsdale, D.W., Ding, Y., Nagy, G.: Toward ontology generation from tables. World Wide Web: Internet and Web Information Systems 8(3), 251–285 (2004)
Wang, Y., Völker, J., Haase, P.: Towards semi-automatic ontology building supported by large-scale knowledge acquisition. In: AAAI Fall Symposium On Semantic Web for Collaborative Knowledge Acquisition, Arlington, Virginia, vol. FS-06-06, pp. 70–77 (October 2006)
Zhou, Y.: Generating data-extraction ontologies by example. Master’s thesis, Brigham Young University (December 2005)
Author information
Authors and Affiliations
Editor information
Rights and permissions
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Tao, C., Embley, D.W. (2007). Seed-Based Generation of Personalized Bio-ontologies for Information Extraction. In: Hainaut, JL., et al. Advances in Conceptual Modeling – Foundations and Applications. ER 2007. Lecture Notes in Computer Science, vol 4802. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-76292-8_9
Download citation
DOI: https://doi.org/10.1007/978-3-540-76292-8_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-76291-1
Online ISBN: 978-3-540-76292-8
eBook Packages: Computer ScienceComputer Science (R0)