Bootstrapping Domain Ontology for Semantic Web Services from Source Web Sites

  • Wensheng Wu
  • AnHai Doan
  • Clement Yu
  • Weiyi Meng
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3811)


The vision of Semantic Web services promises a network of interoperable Web services over different sources. A major challenge to realizing this vision is the lack of automated means of acquiring the domain ontologies needed to mark up the Web services. In this paper, we propose DeepMiner, a system that learns domain ontologies from source Web sites. Given a set of sources in a domain of interest, DeepMiner first learns a base ontology from their query interfaces. It then grows the current ontology by probing the sources and discovering additional concepts and instances in the data pages retrieved from them. We have evaluated DeepMiner in several real-world domains. Preliminary results indicate that DeepMiner discovers concepts and instances with high accuracy.
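
The two phases named in the abstract (a base ontology from query interfaces, then growth by probing) can be sketched roughly as below. This is a toy illustration under stated assumptions, not DeepMiner's actual algorithm: the Ontology class, the functions learn_base_ontology and grow_ontology, the dictionary representation of interfaces and data pages, the lowercase label matching, and the fake_probe stand-in for a real source are all hypothetical names and simplifications introduced here.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Set


@dataclass
class Ontology:
    # Hypothetical representation: concept name -> set of known instances.
    concepts: Dict[str, Set[str]] = field(default_factory=dict)

    def add(self, concept: str, instances: Iterable[str] = ()) -> None:
        self.concepts.setdefault(concept, set()).update(instances)


def learn_base_ontology(query_interfaces: List[Dict[str, List[str]]]) -> Ontology:
    """Phase 1 (assumed form): merge attribute labels and any listed
    values from each source's query interface into one ontology."""
    onto = Ontology()
    for interface in query_interfaces:
        for label, values in interface.items():
            # Naive matching: labels are equal if they agree after lowercasing.
            onto.add(label.strip().lower(), values)
    return onto


def grow_ontology(
    onto: Ontology,
    probe: Callable[[str, str], List[Dict[str, str]]],
    max_rounds: int = 3,
) -> Ontology:
    """Phase 2 (assumed form): probe sources with known instances and
    absorb the labeled values found on the returned data pages."""
    probed = set()
    for _ in range(max_rounds):
        pending = [
            (concept, value)
            for concept, values in onto.concepts.items()
            for value in sorted(values)
            if (concept, value) not in probed
        ]
        if not pending:
            break  # nothing new to probe with
        for concept, value in pending:
            probed.add((concept, value))
            for page in probe(concept, value):
                for label, instance in page.items():
                    onto.add(label.strip().lower(), [instance])
    return onto


def fake_probe(concept: str, value: str) -> List[Dict[str, str]]:
    # Stand-in for submitting a query to a real source (made-up data).
    if (concept, value) == ("format", "Hardcover"):
        return [{"Publisher": "Springer", "Format": "Hardcover"}]
    return []


interfaces = [
    {"Title": [], "Format": ["Hardcover", "Paperback"]},
    {"title": [], "Author": []},
]
onto = grow_ontology(learn_base_ontology(interfaces), fake_probe)
print(sorted(onto.concepts))
```

In this toy run the concept "publisher" appears only on a data page, never on a query interface, which is the kind of addition the probing phase is meant to capture. Real interface extraction, label matching, and data-page segmentation are far harder than the dictionary lookups used here.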


Data Region, Domain Ontology, Text Segment, Query Interface, Data Page
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.





Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Wensheng Wu (1)
  • AnHai Doan (1)
  • Clement Yu (2)
  • Weiyi Meng (3)
  1. University of Illinois, Urbana, USA
  2. University of Illinois, Chicago, USA
  3. Binghamton University, Binghamton, USA
