Skip to main content

Deep-Web Search

  • Reference work entry
  • First Online:
  • 34 Accesses

Synonyms

Hidden-web search

Definition

With the proliferation of dynamic Web sites, whose contents are provided by online databases in response to querying, deep-Web search aims at finding information from this “hidden” or “deep” Web. Current search engines crawl and index pages from statically linked Web pages, or the “surface” Web. As such crawlers cannot effectively query online databases, much of the deep Web is not generally covered by current search engines, and thus remains invisible to users. A deep-Web search system queries over online databases to help users search these databases uniformly.

Historical Background

In the last few years, the Web has been rapidly “deepened” by the massive networked databases on the Internet. While the surface Web has linked billions of static HTML pages, a far more significant amount of information is hidden in the deep Web, behind the query forms of searchable databases. A July 2000 survey [1] claims that there were 500 billion hidden pages in...

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   4,499.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   6,499.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Recommended Reading

  1. BrightPlanet.com. The deep web: surfacing hidden value. http://www.brightplanet.com/resources/details/deepweb.html. 2000.

  2. He B, Chang KCC. Statistical schema matching across web query interfaces. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2003. p. 217–28.

    Google Scholar 

  3. He H, Meng W, Lu Y, Yu CT, Wu Z. Towards deeper understanding of the search interfaces of the deep web. In: Proceedings of the 16th International World Wide Web Conference; 2007. p. 133–55.

    Article  Google Scholar 

  4. He B, Patel M, Zhang Z, Chang KCC. Accessing the deep web: a survey. Commun ACM. 2007;50(5):94–101.

    Article  Google Scholar 

  5. Raghavan S, Garcia-Molina H. Crawling the hidden web. In: Proceedings of the 27th International Conference on Very Large Data Bases; 2001. p. 129–38.

    Google Scholar 

  6. Wang J, Wen JR, Lochovsky FH, Ma WY. Instance-based schema matching for web databases by domain-specific query probing. In: Proceedings of the 30th International Conference on Very Large Data Bases; 2004. p. 408–19.

    Chapter  Google Scholar 

  7. Wu P, Wen JR, Liu H, Ma WY. Query selection techniques for efficient crawling of structured web sources. In: Proceedings of the 22nd International Conference on Data Engineering; 2006. p. 47.

    Google Scholar 

  8. Wu W, Yu CT, Doan A, Meng W. An interactive clustering-based approach to integrating source query interfaces on the deep web. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2004. p. 95–106.

    Google Scholar 

  9. Zhang Z, He B, Chang KCC. Understanding web query interfaces: best-effort parsing with hidden syntax. In: Proceedings of the ACM SIGMOD International Conference on Management of Data; 2004.p. 117–18.

    Google Scholar 

  10. Zhang Z, He B, Chang KCC. Light-weight domain-based form assistant: querying web databases on the fly. In: Proceedings of the 31st International Conference on Very Large Data Bases; 2005. p. 97–108.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kevin Chang .

Editor information

Editors and Affiliations

Section Editor information

Rights and permissions

Reprints and permissions

Copyright information

© 2018 Springer Science+Business Media, LLC, part of Springer Nature

About this entry

Check for updates. Verify currency and authenticity via CrossMark

Cite this entry

Chang, K. (2018). Deep-Web Search. In: Liu, L., Özsu, M.T. (eds) Encyclopedia of Database Systems. Springer, New York, NY. https://doi.org/10.1007/978-1-4614-8265-9_188

Download citation

Publish with us

Policies and ethics