XEdge: An Efficient Method for Returning Meaningful Clustered Results for XML Keyword Search

  • Wenxin Liang
  • Yuanyuan Gan
  • Xianchao Zhang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 8506)

Abstract

In this paper, we investigate the problem of returning meaningful clustered results for XML keyword search. We begin by presenting a multi-granularity computing methodology that makes full use of the structural information of XML trees to extract features. In this method, we first propose the concept of Cluster Compactness Granularity (CCG), which partitions the search results into clusters according to the connection compactness between LCA (lowest common ancestor) nodes, so that users can find their desired answers precisely and quickly. We then propose the concept of Subtree Compactness Granularity (SCG) to rank individual results within clusters and to measure query result relevance. Furthermore, we define a novel semantics, Compact LCA (CLCA), which not only improves accuracy by eliminating redundant LCAs that do not contribute to meaningful answers, but also overcomes the shielding effects of SLCA-based methods. Using the proposed CCG and SCG features and the CLCA semantics, we finally implement an efficient algorithm called XEdge for generating meaningful clustered results. Compared with existing methods such as XSeek and XKLUSTER, the experimental results demonstrate the effectiveness of the proposed multi-granularity clustering methodology, the validity of the complemented ranking strategy, and the meaningfulness of the CLCA semantics.
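
As background for the LCA and SLCA notions mentioned above, the following is a minimal sketch of the standard definitions only. It is not the XEdge algorithm, and it does not implement CLCA, CCG, or SCG, whose definitions are not given in this abstract. It assumes nodes are identified by Dewey labels (tuples of child positions from the root); the function names and the tiny example tree are hypothetical.

```python
from itertools import product

def lca(labels):
    """Lowest common ancestor of Dewey labels: their longest common prefix."""
    prefix = []
    for parts in zip(*labels):
        if len(set(parts)) != 1:
            break
        prefix.append(parts[0])
    return tuple(prefix)

def all_lcas(matches):
    """LCAs of every combination that picks one matching node per keyword."""
    return {lca(combo) for combo in product(*matches)}

def is_proper_ancestor(a, b):
    """True if label a is a strict prefix of label b."""
    return len(a) < len(b) and b[:len(a)] == a

def slcas(lcas):
    """Smallest LCAs: LCAs that have no other LCA as a descendant."""
    return {a for a in lcas
            if not any(is_proper_ancestor(a, b) for b in lcas)}

# Hypothetical tree: root (0,) with two subtrees (0, 0) and (0, 1),
# each containing one match for each of two keywords.
matches = [
    [(0, 0, 0), (0, 1, 0)],  # nodes matching the first keyword
    [(0, 0, 1), (0, 1, 1)],  # nodes matching the second keyword
]

candidates = all_lcas(matches)
print(sorted(candidates))         # root plus the two subtree roots
print(sorted(slcas(candidates)))  # the root is filtered out
```

In this example the root is an LCA but not an SLCA, because it has descendant LCAs. Ancestors discarded in this way are one reading of the "shielding effects" the abstract attributes to SLCA-based methods, though the abstract itself does not define the term.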

Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  • Wenxin Liang (1)
  • Yuanyuan Gan (1)
  • Xianchao Zhang (1)
  1. School of Software, Dalian University of Technology, Dalian, China