Skip to main content

Mapping XML Schema to Relations Using Genetic Algorithm

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3215))

Abstract

As web-applications grow in number and complexity, there is a need for efficient mappings from XML schemas to the flat relational tables so that existing functions in relational database systems can be utilized. However, many of the existing mapping methods, such as the model-based or the structure-based methods, do not exploit query history for better query performance. In this paper, we propose the use of genetic algorithm (GA) in a cost-based approach for converting a XML schema to relational tables. By formulating the mapping problem as a cost optimization task with respect to a set of weighted frequent queries, we can obtain an efficient mapping that minimizes the queries execution time. In our experiments, we show that the mapping obtained by GA is superior to other non-cost-based approaches. In particular, the GA approach has out-performed the greedy heuristic in the browsing queries where the accessed attributes are many and scattered.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Cheng, J., Xu, J.: XML and DB2. In: Proceedings of 16th International Conference on Data Engineering, pp. 569–576 (2000)

    Google Scholar 

  2. Oracle Corporaion: XML Support in Oracle 8 and beyond, Technical white paper, http://otn.oracle.com/tech/xml/htdocs/xml_twp.html

  3. Florescu, D., Kossman, D.: Storing and Querying XML Data Using an RDBMS. IEEE Data Engineering Bulletin 22(3), 27–34 (1999)

    Google Scholar 

  4. Pankowski, T.: XML-SQL: An XML Query Language Based on SQL and Path Tables. In: Chaudhri, A.B., Unland, R., Djeraba, C., Lindner, W. (eds.) EDBT 2002. LNCS, vol. 2490, pp. 184–209. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Shanmugasundaram, J., Tufte, K., Zhang, C., He, G., DeWitt, D.J., Naughton, J.F.: Relational Databases for Querying XML Documents: Limitations and Opportunities. In: VLDB 1999, pp. 302–314 (1999)

    Google Scholar 

  6. Mani, M., Lee, D.: XML to Relational Conversion using Theory of Regular Tree Grammars. In: Bressan, S., Chaudhri, A.B., Li Lee, M., Yu, J.X., Lacroix, Z. (eds.) CAiSE 2002 and VLDB 2002. LNCS, vol. 2590, pp. 81–103. Springer, Heidelberg (2003)

    Chapter  Google Scholar 

  7. Lee, D., Mani, M., Chu, W.W.: Efficient Schema Conversions between XML and Relational Models (Invited Paper). In: Workshop on Knowledge Transformations for the Semantic Web (2002)

    Google Scholar 

  8. Lee, D., Chu, W.W.: CPI: Constraints-Preserving Inlining Algorithm for Mapping XML DTD to Relational Schema. Data and Knowledge Engineering (DKE) 39(1), 3–25 (2001)

    Article  Google Scholar 

  9. Bohannon, P., Friere, J., Roy, P., Simeon, J.: From XML Schema to Relations: A Cost-based Approach to XML Storage. In: Proceedings of the 2002 International Conference on Data Engineering, Febuary 2002, pp. 64–80 (2002)

    Google Scholar 

  10. Ng, V., Law, D.M., Gorla, N., Chan, C.K.: Applying Genetic Algorithms in Database Partitioning. In: Proceedings of the 2003 ACM Symposium on Applied Computing, pp. 544–549 (2003)

    Google Scholar 

  11. Medline DTD, http://www.nlm.nih.gov/databases/dtd/nlmmedlinecitation_021101.dtd

  12. Niamir, B.: Attribute partitioning in a Self-Adaptive Relational Database System, PhD Dissertation, MIT Lab. for Computer Science (1978)

    Google Scholar 

  13. Chambers, L.: Practical Handbook of Genetic Algorithms, vol. 1. CRC Press, Boca Raton (1995)

    MATH  Google Scholar 

  14. Internet Movies Database, http://www.imdb.com

  15. Deutsch, M.F. Suciu, D.: Storing semistructured data with stored. In: SIGMOD Int’l. Conf. on Management of Data, pp. 431–442 (1999)

    Google Scholar 

  16. Wang, W.Q., Lee, M.-L., Ooi, B.C., Tan, K.-L.: XStorM: A Scalable Storage Mapping Scheme for XML Data. In: Poster Proceedings of the Tenth International World Wide Web Conference, pp. 176–177 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ng, V., Kong, C.C., Chan, S. (2004). Mapping XML Schema to Relations Using Genetic Algorithm. In: Negoita, M.G., Howlett, R.J., Jain, L.C. (eds) Knowledge-Based Intelligent Information and Engineering Systems. KES 2004. Lecture Notes in Computer Science(), vol 3215. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30134-9_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-30134-9_33

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-23205-6

  • Online ISBN: 978-3-540-30134-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics