Skip to main content

Oxone: A Scalable Solution for Detecting Superior Quality Deltas on Ordered Large XML Documents

  • Conference paper
Conceptual Modeling - ER 2006 (ER 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4215))

Included in the following conference series:

Abstract

Recently, a number of relational-based approaches for detecting the changes to XML data have been proposed to address the scalability problem of main memory-based approaches (e.g., X-Diff, XyDiff). These approaches store the XML documents in the relational database and issue SQL queries (whenever appropriate) to detect the changes. In this paper, we propose a relational-based ordered XML change detection technique (called Oxone) that uses a schema-conscious approach as the underlying storage strategy for XML data. Previous efforts have focused on detecting changes to ordered XML in an schema-oblivious storage environment. Although the schema-oblivious approach produces better result quality compared to XyDiff (a main memory-based ordered XML change detection approach), its performance degrade with increase in data size and is slower than XyDiff for smaller data set. We propose a technique to overcome these limitations. Our experimental results show that Oxone is up to 22 times faster and more scalable than the relational-based schema-oblivious approach. The performances of Oxone and XyDiff (C version) are comparable. However, more importantly, our approach is more scalable compared to XyDiff for larger datasets and has much superior the result quality of deltas than XyDiff.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cobena, G., Abiteboul, S., Marian, A.: Detecting Changes in XML Documents. In: ICDE (2002)

    Google Scholar 

  2. Leonardi, E., Bhowmick, S.S.: Xandy: A Scalable Change Detection Technique for Ordered XML Documents Using Relational Databases. DKE Journal (to appear)

    Google Scholar 

  3. Leonardi, E., Bhowmick, S.S., Madria, S.: Xandy: Detecting Changes on Large Unordered XML Documents Using Relational Databases. In: DASFAA, China (2005)

    Google Scholar 

  4. Leonardi, E., Bhowmick, S.S.: Detecting Changes on Unordered XML Documents Using Relational Databases: A Schema-Conscious Approach. In: CIKM (2005)

    Google Scholar 

  5. Papadimitriou, C., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Prentice-Hall, Englewood Cliffs (1982)

    MATH  Google Scholar 

  6. Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational Databases for Querying XML Documents: Limitations and Opportunities. The VLDB Journal (1999)

    Google Scholar 

  7. Lu, H., Jiang, H., Xu, J.X., Yu, G., et al.: What Makes the Differences: Benchmarking XML Database Implementations. ACM TOIT 5(1) (2005)

    Google Scholar 

  8. Wang, Y., DeWitt, D.J., Cai, J.: X-Diff: An Effective Change Detection Algorithm for XML Documents. In: ICDE, Bangalore (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Leonardi, E., Bhowmick, S.S. (2006). Oxone: A Scalable Solution for Detecting Superior Quality Deltas on Ordered Large XML Documents. In: Embley, D.W., Olivé, A., Ram, S. (eds) Conceptual Modeling - ER 2006. ER 2006. Lecture Notes in Computer Science, vol 4215. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11901181_16

Download citation

  • DOI: https://doi.org/10.1007/11901181_16

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-47224-7

  • Online ISBN: 978-3-540-47227-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics