Skip to main content

ENAXS: Efficient Native XML Storage System

  • Conference paper
  • First Online:
Web Technologies and Applications (APWeb 2003)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2642))

Included in the following conference series:

Abstract

XML is a self-describing meta-language and fast emerging as a dominant standard for Web data exchange among various applications. With the tremendous growth of XML documents, an efficient storage system is required to manage them. The conventional databases, which require all data to adhere to an explicitly specified rigid schema, are unable to provide an efficient storage for tree-structured XML documents. A new data model that is specifically designed for XML documents is required. In this paper, we propose a new storage system, named Efficient Native XML Storage System (ENAXS), for large and complex XML documents. ENAXS stores all XML documents in its native format to overcome the deficiencies of the conventional databases, achieve optimal storage utilization and support efficient query processing. In addition, we propose a path-based indexing scheme which is embedded in ENAXS for fast data retrieval. We have implemented ENAXS and evaluated its performance with real data sets. Experimental results show the efficiency and scalability of the proposed system in utilizing storage space and executing various types of queries.

This work is partially supported by the SingAREN21 research grant M48020004

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. D. Alin, F. Mary, and D. Suciu. Storing Semistructured Data with STORED. SIGMOD Record, pages 431–442, 1999.

    Google Scholar 

  2. V. Christophides, S. Abiteboul, S. Cluet, and M. Scholl. From Structured Documents to Novel Query Facilities. In Proc. ACM SIGMOD Conf., Minneapolis, Minnesota, May 1994.

    Google Scholar 

  3. C. W. Chung, J. K. Min, and K. Shim. APEX: An Adaptive Path Index for XML Data. ACM SIGMOD, 4(6), June 2002.

    Google Scholar 

  4. T. S. Chung, S. Park, S. Y. Han, and H. J. Kim. Extracting Object-Oriented Database Schemas from XML DTDs Using Inheritance. In Proc. 2nd Int. Conf. EC-Web, Munich, Germany, September 2001.

    Google Scholar 

  5. B. F. Cooper, S. Neal, J. F. Michael, R. H. Gisli, and S. Moshe. A Fast Index for Semistructured Data. In Proc. 27th Int. Conf. on Very Large Data Bases, pages 341–350, Roma, Italy, 2001.

    Google Scholar 

  6. D. Florescu and D. Kossmann. Storing and Querying XML Data using an RDBMS. In Bullettin of the Technical Committee on Data Engineering, 22(3):27–34, September 1999.

    Google Scholar 

  7. R. Goldman and J. Widom. DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases. In Proc. 23rd Int. Conf. on Very Large Data Bases, Athens, Greece, 1997.

    Google Scholar 

  8. H. V. Jagadish, Shurug AI-Khalifa, Laks V. S., Andrew Nierman, Stylianos Paparizons, Jignesh Patel, Divesh Srivastava, and Yuqing Wu. TIMBER: A Native XML Database. VLDB Journal (To appear), 2002.

    Google Scholar 

  9. S. Jayavel, T. Kristin, H. Gang, Z. Chun, D. David, and N. Jeffrey. Relational Databases for Querying XML Documents: Limitations and Opportunities. In Proc. 25th Int. Conf. on Very Large Data Bases, Edinburgh, Scotland, 1999.

    Google Scholar 

  10. C. C. Kanne and G. Moerkotte. Efficient Storage of XML Data. In Proc. 16th Int. Conf. on Data Engineering, San Diego, CA, February 2000.

    Google Scholar 

  11. M. Klettke and H. Meyer. XML and Object Relational Database Systems Enchancing Structural Mapping Based on Statistics. In Int. Workshop on the Web and Database (WebDB), Dallas, 2000.

    Google Scholar 

  12. K. Loney and G. Koch. Oracle 8i: The Complete Reference. McGrawHill, 2000.

    Google Scholar 

  13. J. McHugh, S. Abiteboul, R. Goldman, D. Quass, and J. Widom. Lore: A Database Management System for Semistructured Data. SIGMOD Record, 26(3), September 1997.

    Google Scholar 

  14. D. Quass, J. Widom, R. Goldman, K. Haas, Q. Luoan J. MchHugh, S. Nestorov, A. Rajaraman, H. Rivero, S. Abiteboul, J. Ullman, and J. Wiener. LORE: A Lightweight Object REpository for Semistructured Data. ACM SIGMOD, 25(2):549–549, June 1996.

    Article  Google Scholar 

  15. M. Rays. Bringing the Internet to Your Database: Using SQL Server 2000 and XML to Bulid Loosely-Coupled Systems. In Proc. 17th IEEE Int. Conf. on Data Engineering, Heidelberg, Germany, April 2001.

    Google Scholar 

  16. F. Rizzolo and A. Mendelzon. Indexing XML Data with ToXin. In Proc. 4th Int. Workshop on the Web and Database (in Conjunction with ACM SIGMOD), Santa Barbara, CA, May 2001.

    Google Scholar 

  17. H. Schoning. Tamino: A DBMS Designed for XML. In Proc. 17th Int. Conf. on Data Engineering, pages 149–154, Heidelberg, Germany, April 2001.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Win, KM., Ng, WK., Lim, EP. (2003). ENAXS: Efficient Native XML Storage System. In: Zhou, X., Orlowska, M.E., Zhang, Y. (eds) Web Technologies and Applications. APWeb 2003. Lecture Notes in Computer Science, vol 2642. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-36901-5_6

Download citation

  • DOI: https://doi.org/10.1007/3-540-36901-5_6

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-02354-8

  • Online ISBN: 978-3-540-36901-1

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics