Skip to main content

Constructing Interface Schemas for Search Interfaces of Web Databases

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3806))

Abstract

Many databases have become Web-accessible through form-based search interfaces (i.e., search forms) that allow users to specify complex and precise queries to access the underlying databases. In general, such a Web search interface can be considered as containing an interface schema with multiple attributes and rich semantic/meta information; however, the schema is not formally defined on the search interface. Many Web applications, such as Web database integration and deep Web crawling, require the construction of the schemas. In this paper, we introduce a schema model for complex search interfaces, and present a tool (WISE-iExtractor) for automatically extracting and deriving all the needed information to construct the schemas. Our experimental results on real search interfaces indicate that this tool is highly effective.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bergamaschi, S., Castano, S., Vincini, M., Beneventano, D.: Semantic Integration of Heterogeneous Information Sources. Data & Knowledge Engineering 36, 215–249 (2001)

    Article  MATH  Google Scholar 

  2. Chang, K., He, B., Li, C., Patel, M., Zhang, Z.: Structured Databases on the Web: Observations and Implications. SIGMOD Record 33(3) (September 2004)

    Google Scholar 

  3. Chang, K., Garcia-Molina, H.: Mind Your Vocabulary: Query Mapping Across Heterogeneous Information Sources. In: SIGMOD Conference (1999)

    Google Scholar 

  4. Gal, A., Modica, G., Jamil, H.: OntoBuilder: Fully Automatic Extraction and Consolidation of Ontologies from Web Sources. In: ICDE Conference (2004)

    Google Scholar 

  5. He, B., Chang, K.: Statistical Schema Matching across Web Query Interfaces. In: SIGMOD Conference (2003)

    Google Scholar 

  6. He, B., Tao, T., Chang, K.: Clustering Structured Web Sources: a Schema-based, Model-Differentiation Approach. In: Lindner, W., Mesiti, M., Türker, C., Tzitzikas, Y., Vakali, A.I. (eds.) EDBT 2004. LNCS, vol. 3268, pp. 536–546. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  7. He, H., Meng, W., Yu, C., Wu, Z.: WISE-Integrator: An Automatic Integrator of Web Search Interfaces for E-commerce. In: VLDB Conference (2003)

    Google Scholar 

  8. He, H., Meng, W., Yu, C., Wu, Z.: Automatic Extraction of Web Search Interfaces for Interface Schema Integration. In: WWW Conference (2004)

    Google Scholar 

  9. Kaljuvee, O., Buyukkokten, O., Garcia-Molina, H., Paepcke, A.: Efficient Web Form Entry on PDAs. In: WWW Conference (2000)

    Google Scholar 

  10. Kushmerick, N.: Learning to Invoke Web Forms. In: Meersman, R., Tari, Z., Schmidt, D.C. (eds.) CoopIS 2003, DOA 2003, and ODBASE 2003. LNCS, vol. 2888, pp. 997–1013. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  11. Levy, A., Rajaraman, A., Ordille, J.: Querying Heterogeneous Information Sources Using Source Descriptions. In: VLDB Conference (1996)

    Google Scholar 

  12. Peng, Q., Meng, W., He, H., Yu, C.: WISE-Cluster: Clustering E-Commerce Search Engines Automatically. In: WIDM workshop (2004)

    Google Scholar 

  13. Raghavan, S., Garcia-Molina, H.: Crawling the Hidden Web. In: VLDB Conference (2001)

    Google Scholar 

  14. Wu, W., Yu, C., Doan, A., Meng, W.: An Interactive Clustering-based Approach to Integrating Source Query interfaces on the Deep Web. In: SIGMOD Conference (2004)

    Google Scholar 

  15. Wang, J., Lochovsky, F.H.: Data Extraction and Label Assignment for Web Databases. In: WWW Conference (2003)

    Google Scholar 

  16. Zhang, Z., He, B., Chang, K.: Understanding Web Query Interfaces: Best-Effort Parsing with Hidden Syntax. In: SIGMOD Conference (2004)

    Google Scholar 

  17. MetaQuerier: http://metaquerier.cs.uiuc.edu/formex

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

He, H., Meng, W., Yu, C., Wu, Z. (2005). Constructing Interface Schemas for Search Interfaces of Web Databases. In: Ngu, A.H.H., Kitsuregawa, M., Neuhold, E.J., Chung, JY., Sheng, Q.Z. (eds) Web Information Systems Engineering – WISE 2005. WISE 2005. Lecture Notes in Computer Science, vol 3806. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11581062_3

Download citation

  • DOI: https://doi.org/10.1007/11581062_3

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-30017-5

  • Online ISBN: 978-3-540-32286-3

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics