Abstract
GML has become the standard format for geographical data transfer, exchange and storage. Usually, in GML documents there are many verbose tags and a large amount of coordinate data, which makes them be of extremely large volume. Thus, it is necessary to compress these documents to reduce storage and transmission cost. GML data is often stored and transferred in the form of multiple documents. Although some GML compressors have been developed recently, all of them can process only a single GML document at a time. In this paper, we propose a stream compressor for GML documents, called GDScomp, which can compress a stream of multiple GML documents effectively. It shares the structural information among multiple GML documents by a common dictionary to employ the dynamic compression method and uses the delta compression method for the coordinate data. Experimental results show that GDScomp can achieve satisfactory compression performance when compressing GML documents streams.
Keywords
This work was supported by National Natural Science Foundation of China under grants No. 60873040 and No. 60873070, and China 863 Program under grant No. 2009AA01Z135.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Open GIS Consortium. Open GIS Geography Markup Language (GML) Implementation Specification Version 3.1.0, http://www.opengis.net/gml
Corcoles, J., Gonzalez, P.: A specification of a spatial query language over GML. In: Proceedings of ACM-GIS 2001, pp. 112–117 (2001)
Guan, J., Zhou, S.: GPress: Towards Effective GML Documents Compression. In: Proceedings of the 23rd International Conference on Data Engineering (ICDE 2007), pp. 1473–1474 (2007)
Guan, J., Zhou, S., Chen, Y.: An Effective GML Documents Compressor. IEICE Transactions on Information and Systems E91-D(7), 1982–1990 (2008)
Gzip (2000), http://www.gzip.org/
Moffat, A.: Implementing the PPM data compression scheme. IEEE Transactions on Communications 38(11), 1917–1921 (1990)
Liefke, H., Suciu, D.: XMill: An efficient compressor for XML data. In: Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD 2000), pp. 153–164 (2000)
Tolani, P., Haritsa, J.: XGRIND: A query-friendly XML compressor. In: IEEE Proceedings of the 18th International Conference on Data Engineering (ICDE 2002), pp. 225–234 (2002)
Cheney, J.: Compressing XML with multiplexed hierarchical PPM models. In: Proceedings of the IEEE Data Compression Conference (DCC 2001), pp. 163–172 (2001)
Girardot, M., Sundaresan, N.: Millau: An encoding format for efficient representation and exchange of XML over the Web. In: Proceedings of the 9th International WWW Conference, pp. 747–765 (2000)
Gao, J., Yang, D., Tang, S., et al.: XPath Evaluation Oriented XML Data Stream Compression. Journal of Software 16(2), 123–232 (2005)
Ziv, J., Lempel, A.: A universal algorithm for sequential data compression. IEEE Transactions on Information Theory 23(3), 337–343 (1977)
Sakr, S.: An Empirical Evaluation of XML Compression Tools. In: Proceedings of DASFAA Workshops 2009, pp. 49–63 (2009)
CityGML (2002), http://www.citygml.org/
German Working Committee of the Surveying Authorities. Authoritative Real Estate Cadastre Information System (ALKIS), http://www.lv-bw.de/alkis.info/nas-bsp.html
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2011 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yu, Y., Li, Y., Zhou, S. (2011). A GML Documents Stream Compressor. In: Xu, J., Yu, G., Zhou, S., Unland, R. (eds) Database Systems for Adanced Applications. DASFAA 2011. Lecture Notes in Computer Science, vol 6637. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-20244-5_7
Download citation
DOI: https://doi.org/10.1007/978-3-642-20244-5_7
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-20243-8
Online ISBN: 978-3-642-20244-5
eBook Packages: Computer ScienceComputer Science (R0)