Skip to main content
  • 1175 Accesses

Abstract

XML keyword search has emerged as one of the most effective paradigms for finding desired information in hierarchical XML documents. One of the key advantages of the keyword search is its simplicity, that is, users do not have to learn complex query languages and be familiar with the structures of the XML documents. In this chapter, the state-of-the-art meaningful keyword search semantics, such as SLCA, VLCA, and MLCA, are fully illustrated. Furthermore, both the inverted-lists-based and stack-based keyword search algorithms built upon the semantics are also studied. In addition, in order to address the new challenges in XML keyword search, that is, identifying the user search intention and resolving keyword ambiguity, we show an XML TF*IDF ranking strategy based on guidelines that a search engine should meet in both search intention identification and relevance-oriented ranking for search results.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 109.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 139.00
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bao, Z., Chen, B., Ling, T.W., Lu, J.: Effective xml keyword search with relevance oriented ranking. In: ICDE, Shanghai, pp. 517–528 (2009)

    Google Scholar 

  2. Bao, Z., Lu, J., Ling, T.W., Chen, B.: Towards an effective XML keyword search. Knowl. Data Eng. (TKDE) 22(8), 1077–1092 (2010)

    Article  Google Scholar 

  3. Cohen, S., Mamou, J., Kanza, Y., Sagiv, Y.: XSEarch: A semantic search engine for XML. In: VLDB, pp. 45−56 (2003)

    Google Scholar 

  4. Guo, L., Shao, F., Botev C., Shanmugasundaram, J.: XRANK: Ranked keyword search over XML documents. In: SIGMOD, San Diego, pp. 16−27 (2003)

    Google Scholar 

  5. Hristidis, V., Koundas, N., Papakonstantinou, Y., Srivastava, D.: Keyword proximity search in XML trees. IEEE Trans. Knowl. Data Eng. (TKDE) 18(4), 525–539 (2006)

    Article  Google Scholar 

  6. Huang, J., Xu, J., Zhou, J., Meng, X.: MLCEA: An entity based semantics for XML keyword search. J Comput. Res. Dev. 45(Suppl), 372–377 (2008). 10 (NDBC2008 Guilin)

    Google Scholar 

  7. Liu, Z., Chen, Y.: Identifying meaningful return information for XML keyword search. In: SIGMOD, Beijing, pp. 329−340 (2007)

    Google Scholar 

  8. Ling, T.W., Dobbie, G.: Using semantics in XML data management. In: WSPC, 31 March 2007

    Google Scholar 

  9. Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable LCAs over XML documents. In: CIKM, Lisbon, pp. 31–40 (2007)

    Google Scholar 

  10. Li, Y., Yu, C., Jagadish, H.V.: Schema-free XQuery. In: VLDB, Toronto (2004)

    Google Scholar 

  11. Sun C., Chan C.Y., Goenka, A.K.: Multiway slca-based keyword search in xml data. In: WWW, Banff, pp. 1043–1052 (2007)

    Google Scholar 

  12. Schmidt, A., Kersten, M.L., Windhouwer, M.: Querying XML documents made easy: Nearest concept queries. In: ICDE, Heidelberg (2001)

    Google Scholar 

  13. Salton, G., Mcgill, M.: Introduction to Modern Information Retrieval. McGraw-Hill Book Company, New York (1984)

    Google Scholar 

  14. Supasitthimethee, U., Shimizu, T., Yoshikawa, M., Porkaew, K.: XSemantic: An extension of LCA based XML semantic search. IEICE Trans. (IEICET) 92-D(5), 1079–1092 (2009)

    Google Scholar 

  15. Vesper, V.: Let’s do Dewey. http://www.mtsu.edu/vvesper/dewey.html

  16. Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD, Baltimore, pp. 537–538 (2005)

    Google Scholar 

  17. Zhou, J., Bao, Z., Ling, T.W., Meng, X.: MCN: A new semantics towards effective XML keyword search. In: DASFAA, Brisbane, pp. 511–526 (2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Rights and permissions

Reprints and permissions

Copyright information

© 2013 Tsinghua University Press, Beijing and Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Lu, J. (2013). Effective XML Keyword Search. In: An Introduction to XML Query Processing and Keyword Search. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-34555-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-34555-5_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-34554-8

  • Online ISBN: 978-3-642-34555-5

  • eBook Packages: Computer Science

Publish with us

Policies and ethics