Skip to main content

Searching Proper Replications in Mapping XML into Relations

  • Conference paper
Database Systems for Advanced Applications (DASFAA 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2973))

Included in the following conference series:

  • 969 Accesses

Abstract

In this paper, we introduce two efficient search methods that heuristically find proper replications close to the optimal value. These searched replications can notably reduce the overall query cost in mapping XML into relations. The necessity of such an automatic tool arises from the complexity of XML, i.e., the fact that it is difficult to search the optimal replication set to enhance query performance because of mass complex XML data and queries. Moreover, it was observed that the search problem requires exponential time of the number of given queries. Another important problem in implementing them is that exact query cost can not be measured within reasonable time. That is, actually executing an XML query over actual relational schema costs much time whenever a replication is evaluated in the algorithms, and effective estimation technique is needed. To overcome the obstacle, we considered the histogram-based estimation in commercial RDBMS. It is very simple and can support exact estimation. In addition, it can resolve the trait of highly skewed structure in XML very well. Finally, the effect of two search methods and the estimation are analyzed through some experiments.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Goldberg, D.E.: Genetic algorithms in search, optimization, and machine learning. Addison-Wesley Publishing Company, Reading (1989)

    MATH  Google Scholar 

  2. Kanne, C.C., Moerkotte, G.: Efficient storage of XML data. In: Proceedings of International Conference on DATA ENGINEERING, California, USA, p. 198 (2000)

    Google Scholar 

  3. Graefe, G., McKenna, W.J.: The volcano optimizer generator: Extensibility and efficient search. In: Proceedings of International Conference on DATA ENGINEERING, pp. 209–218 (1993)

    Google Scholar 

  4. Piatetsky-Shapiro, G., Connell, C.: Accurate estimation of the number of tuples satisfying a condition. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 256–276 (1984)

    Google Scholar 

  5. Freire, J., Haritsa, J.R., Ramanath, M., Roy, P., Siméon, J.: StatiX: Making XML count. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 181–191 (2002)

    Google Scholar 

  6. Kim, J., Park, S.: Performance Enhancement through structural redundancy in mapping XML into relations. In: Proceedings of the Eighth database systems for advanced applications, pp. 345–353. IEEE Computer Society Press, Los Alamitos (2003)

    Google Scholar 

  7. McHugh, J., Widom, J.: Query optimization for XML. In: Proceedings of VLDB, pp. 315–326 (1999)

    Google Scholar 

  8. Shanmugasundaram, J., Shekita, E., Barr, R., Carey, M., Lindsay, B., Pirahesh, H., Reinwald, B.: Efficiently publishing relational data as XML documents. In: Proceedings of VLDB, Cairo, Egypt, pp. 65–76 (2000)

    Google Scholar 

  9. Polyzotis, N., Garofalakis, M.: Statistical synopses for graph-structured XML databases. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 358–369 (2002)

    Google Scholar 

  10. Selinger, P., Astrahan, M., Chamberlin, D., Lorie, R., Price, T.: Access path selection in a relational database management system. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 23–34 (1979)

    Google Scholar 

  11. Lipton, R.J., Naughton, J.F., Schneider, D.A.: Practical selectivity estimation through adaptive sampling. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 1–11 (1990)

    Google Scholar 

  12. Poosala, V., Ioannidis, Y., Haas, P., Shekita, E.: Improved histograms for selectivity estimation of range predicates. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 294–305 (1996)

    Google Scholar 

  13. Sun, W., Ling, Y., Rish, N., Deng, Y.: An instant and accurate size estimation method for joins and selections in a retrieval-intensive environment. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 79–88 (1993)

    Google Scholar 

  14. Ioannidis, Y., Christodoulakis, S.: Optimal histograms for limiting worst-case error propagation in the size of join results. ACM TODS 18(4), 709–748 (1993)

    Article  Google Scholar 

  15. Wu, Y., Patel, J.M., Jagadish, H.V.: Estimating answer sizes for XML queries. In: Jensen, C.S., Jeffery, K., Pokorný, J., Šaltenis, S., Bertino, E., Böhm, K., Jarke, M. (eds.) EDBT 2002. LNCS, vol. 2287, pp. 590–608. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  16. Kim, J.-H., Park, S.: Flexible modification of relational schema by x2RMap in storing XML into relations. In: Shafazand, H., Tjoa, A.M. (eds.) EurAsia-ICT 2002. LNCS, vol. 2510, p. 330. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kim, J., Park, S. (2004). Searching Proper Replications in Mapping XML into Relations. In: Lee, Y., Li, J., Whang, KY., Lee, D. (eds) Database Systems for Advanced Applications. DASFAA 2004. Lecture Notes in Computer Science, vol 2973. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24571-1_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24571-1_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21047-4

  • Online ISBN: 978-3-540-24571-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics