A New Path Expression Computing Approach for XML Data

  • Jianhua Lv
  • Guoren Wang
  • Jeffrey Xu Yu
  • Ge Yu
  • Hongjun Lu
  • Bing Sun
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2590)


Most query languages in XML database systems use Regular Path Expressions (RPE) to query or extract data from databases and some query processing and optimization techniques have been proposed for RPEs. Conceptually XML documents are collections of path instances. Each path instance should conform to an XML element tag sequence, called path schema. A RPE query can be written as an automaton that can represent a language, while path schemas can be seen as sentences. In this paper, a novel RPE computing approach, automaton match (AM), is proposed. AM queries the RPEs by matching the automatons with path schemas. The experimental results show AM is quite efficient for computing RPE queries.


Finite State Automaton Path Expression Semistructured Data Path Schema Document Object Model 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. [1]
    M. Altmel and M. Franklin. Efficient filtering of XML documents for selective dissemination of information. Proc. of the 26th VLDB Conf., Cairo, Egypt, 2000, 53–63. 36, 38Google Scholar
  2. [2]
    A. Bonifati and S. Ceri. Comparative analysis of five XML query languages. SIGMOD Record, 2000, 29(1): 68–79. 35CrossRefGoogle Scholar
  3. [3]
    B. F. Cooper, N. Sample, M. J. Franklin, G. R. Hjaltason and M. Shadmon. A Fast Index for Semistructured Data. Proc. of the 27th VLDB Conf., Roma, Italy, 2001, 341–350. 36Google Scholar
  4. [4]
    R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. Proc. of the 23rd VLDB Conf., Athens, Greece, 1997, 436–445.Google Scholar
  5. [5]
    Q. Li and B. Moon. Indexing and querying XML Data for regular path expressions. Proc. of the 27th VLDB Conf., Roma, Italy, 2001, 361–370. 36Google Scholar
  6. [6]
    H. Lu, G. Wang, G. Yu, Y. Bao, J. Lv and Y. Yu. Xbase: Making your gigabyte disk queriable. Proc. of the 2002 ACM SIGMOD Conf. 2002. USA. 36, 43Google Scholar
  7. [7]
    J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. SIGMOD Record, 26(3):54–66.Google Scholar
  8. [8]
    J. McHugh and J. Widom. Query optimization for XML. Proc. of the 25th VLDB Conf., Edinburgh, Scotland, 1999, 315–326. 36Google Scholar
  9. [9]
    A. Renner. XML Data and Object Databases: A Perfect Couple? Proc. of the 17th ICDE Conf., Heidelberg, Germany, 2001, 143–148. 35Google Scholar
  10. [10]
    A. Schmidt, F. Waas, M. Kersten, M. J. Carey, I. Manolescu and R. Busse. XMark: A Benchmark for XML Data Management. Proc. of the 28th VLDB Conf., Hong Kong, China, 2002. 43Google Scholar
  11. [11]
    M. Yoshikawa, T. Amagasa, T. Shimura, and S. Uemura. XRel: A path-based approach to storage and retrieval of XML documents using relational databases. ACM Transactions on Internet Technology, 2001, 1(1).Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Jianhua Lv
    • 1
  • Guoren Wang
    • 1
  • Jeffrey Xu Yu
    • 2
  • Ge Yu
    • 1
  • Hongjun Lu
    • 3
  • Bing Sun
    • 1
  1. 1.Northeastern University of ChinaShenyangChina
  2. 2.The Chinese University of Hong KongHong KongChina
  3. 3.The Hong Kong University of Science and TechnologyHong KongChina

Personalised recommendations