Skip to main content

Object Semantics for XML Keyword Search

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 8422))

Abstract

It is well known that some XML elements correspond to objects (in the sense of object-orientation) and others do not. The question we consider in this paper is what benefits we can derive from paying attention to such object semantics, particularly for the problem of keyword queries. Keyword queries against XML data have been studied extensively in recent years, with several lowest-common-ancestor based schemes proposed for this purpose, including SLCA, MLCA, VLCA, and ELCA. It can be seen that identifying objects can help these techniques return more meaningful answers than just the LCA node (or subtree) by returning objects instead of nodes. It is more interesting to see that object semantics can also be used to benefit the search itself. For this purpose, we introduce a novel Nearest Common Object Node semantics (NCON), which includes not just common object ancestors but also common object descendants. We have developed XRich, a system for our NCON-based approach, and used it in our extensive experimental evaluation. The experimental results show that our proposed approach outperforms the state-of-the-art approaches in terms of both effectiveness and efficiency.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Bao, Z., Ling, T.W., Chen, B., Lu, J.: Efficient XML keyword search with relevance oriented ranking. In: ICDE (2009)

    Google Scholar 

  2. Ding, B., Yu, J.X., Wang, S., Qin, L., Zhang, X., Lin, X.: Finding top-k min-cost connected trees in database. In: ICDE (2007)

    Google Scholar 

  3. Dreyfus, S.E., Wagner, R.A.: The steiner problem in graphs. Networks (1971)

    Google Scholar 

  4. Fong, J., Wong, H.K., Cheng, Z.: Converting relational database into XML documents with DOM. Information & Software Technology (2003)

    Google Scholar 

  5. Guo, L., Shao, F., Botev, C., Shanmugasundaram, J.: XRANK: Ranked keyword search over XML documents. In: SIGMOD (2003)

    Google Scholar 

  6. He, H., Wang, H., Yang, J., Yu, P.S.: BLINKS: Ranked keyword searches on graphs. In: SIGMOD (2007)

    Google Scholar 

  7. Kargar, M., An, A.: Keyword search in graphs: Finding r-cliques. PVLDB (2011)

    Google Scholar 

  8. Kim, J., Jeong, D., Baik, D.-K.: A translation algorithm for effective RDB-to-XML schema conversion considering referential integrity information. Journal Inf. Sci. Eng. (2009)

    Google Scholar 

  9. Le, T.N., Wu, H., Ling, T.W., Li, L., Lu, J.: From structure-based to semantics-based: Effective XML keyword search. In: Ng, W., Storey, V.C., Trujillo, J.C. (eds.) ER 2013. LNCS, vol. 8217, pp. 356–371. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  10. Li, G., Feng, J., Wang, J., Zhou, L.: Effective keyword search for valuable LCAs over XML documents. In: CIKM (2007)

    Google Scholar 

  11. Li, G., Ooi, B.C., Feng, J., Wang, J., Zhou, L.: EASE: Efficient and adaptive keyword search on unstructured, semi-structured and structured data. In: SIGMOD (2008)

    Google Scholar 

  12. Li, L., Le, T.N., Wu, H., Ling, T.W., Bressan, S.: Discovering semantics from data-centric XML. In: Decker, H., Lhotská, L., Link, S., Basl, J., Tjoa, A.M. (eds.) DEXA 2013, Part I. LNCS, vol. 8055, pp. 88–102. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  13. Li, Y., Yu, C., Jagadish, H.V.: Schema-free XQuery. In: VLDB (2004)

    Google Scholar 

  14. Liu, Z., Chen, Y.: Identifying meaningful return information for XML keyword search. In: SIGMOD (2007)

    Google Scholar 

  15. Ribeiro, L., Härder, T.: Entity identification in XML documents. In: Grundlagen von Datenbanken (2006)

    Google Scholar 

  16. Tao, Y., Papadopoulos, S., Sheng, C., Stefanidis, K.: Nearest keyword search in XML documents. In: SIGMOD (2011)

    Google Scholar 

  17. Termehchy, A., Winslett, M.: EXTRUCT: Using deep structural information in XML keyword search. PVLDB (2010)

    Google Scholar 

  18. Truong, B.Q., Bhowmick, S.S., Dyreson, C.E., Sun, A.: MESSIAH: Missing element-conscious slca nodes search in XML data. In: SIGMOD (2013)

    Google Scholar 

  19. Wu, H., Bao, Z.: Object-oriented XML keyword search. In: Jeusfeld, M., Delcambre, L., Ling, T.-W. (eds.) ER 2011. LNCS, vol. 6998, pp. 402–410. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  20. Xu, Y., Papakonstantinou, Y.: Efficient keyword search for smallest LCAs in XML databases. In: SIGMOD (2005)

    Google Scholar 

  21. Zhou, J., Bao, Z., Wang, W., Ling, T.W., Chen, Z., Lin, X., Guo, J.: Fast slca and elca computation for XML keyword queries based on set intersection. In: ICDE (2012)

    Google Scholar 

  22. Zhou, R., Liu, C., Li, J.: Fast ELCA computation for keyword queries on XML data. In: EDBT (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this paper

Cite this paper

Le, T.N., Ling, T.W., Jagadish, H.V., Lu, J. (2014). Object Semantics for XML Keyword Search. In: Bhowmick, S.S., Dyreson, C.E., Jensen, C.S., Lee, M.L., Muliantara, A., Thalheim, B. (eds) Database Systems for Advanced Applications. DASFAA 2014. Lecture Notes in Computer Science, vol 8422. Springer, Cham. https://doi.org/10.1007/978-3-319-05813-9_21

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05813-9_21

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05812-2

  • Online ISBN: 978-3-319-05813-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics