Abstract
On the deep web, large amounts of data are presented to users as tables returned through query forms. Because the data sources are heterogeneous, understanding and integrating these tables is a challenging task. This paper proposes a domain-ontology-based technique for integrating tables. Because the method is completely independent of table structure, it can efficiently handle the nesting and conjoining problems that arise in complex tables. Experimental results show that the method improves integration accuracy.
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Chen, K., Zuo, W., He, F., Chen, Y. (2011). Automatic Deep Web Table Segmentation by Domain Ontology. In: Yu, Y., Yu, Z., Zhao, J. (eds) Computer Science for Environmental Engineering and EcoInformatics. CSEEE 2011. Communications in Computer and Information Science, vol 159. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-22691-5_32
DOI: https://doi.org/10.1007/978-3-642-22691-5_32
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-22690-8
Online ISBN: 978-3-642-22691-5
eBook Packages: Computer Science, Computer Science (R0)