Skip to main content

Parallelization of an XML Data Compressor on Multi-cores

  • Conference paper
Parallel Processing and Applied Mathematics (PPAM 2011)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 7204))

  • 1707 Accesses

Abstract

Because of a growing interest in using XML for massive complex data there has been considerable research on designing XML compressors. This paper presents our research aimed at building parallel XML compressors, using Java and OpenMP (with C++). Our findings show that OpenMP is a preferred choice achieving better results than Java using a multi-core platform.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. bzip2, http://www.bzip.org/ (retrieved on April 20, 2010)

  2. C++ library: boost, http://www.boost.org/users/news/version_1_45_0 (retrieved on April 20, 2010)

  3. gzip compression, http://www.gzip.org/ (retrieved on April 20, 2010)

  4. OpenMP, http://openmp.org/wp/about-openmp/ (retrieved on April 20, 2010)

  5. Oracle Berkeley DB 11g, http://www.oracle.com/technetwork/database/berkeleydb/overview/index.html (retrieved on April 20, 2010)

  6. Parallel BZIP2, http://compression.ca/pbzip2 (retrieved on April 20, 2010)

  7. pigz - parallel gzip, http://www.zlib.net/pigz (retrieved on April 20, 2010)

  8. TBB, http://www.threadingbuildingblocks.org/ (retrieved on April 20, 2010)

  9. Wratislavia XML corpus, http://www.ii.uni.wroc.pl/~inikep/research/Wratislavia/ (retrieved on April 20, 2010)

  10. Xerces, http://xerces.apache.org/xerces-j/ (retrieved on April 20, 2010)

  11. xmlgen - the benchmark data generator, http://www.xml-benchmark.org/generator.html (retrieved on April 20, 2010)

  12. Barney, B.: OpenMP Tutorial, https://computing.llnl.gov/tutorials/openMP/ (retrieved on April 20, 2010)

  13. Butenhof, D.: Programming with POSIX threads. Addison-Wesley professional computing series. Addison-Wesley (1997), http://books.google.ca/books?id=_xvnuFzo7q0C

  14. Müldner, T., Fry, C., Miziołek, J., Corbin, T.: Updates of compressed dynamic xml documents. In: Eight International Network Conference, INC 2010, Heidelberg, Germany, pp. 315–324 (July 2010)

    Google Scholar 

  15. Müldner, T., Fry, C., Miziołek, J., Durno, S.: XSAQCT: XML queryable compressor. Balisage: The Markup Conference, Montréal, Canada (August 2009), http://www.balisage.net/Proceedings/vol3/html/Muldner01/BalisageVol3-Muldner01.html

  16. Oaks, S., Wong, H.: Java Threads, 3rd edn. O’Reilly Media, Inc. (2004)

    Google Scholar 

  17. Osthoff, C., Schepke, C., Panetta, J., Grunmann, P., Maillard, N., Navaux, P., Dias, P.L.S., Lopes, P.P.: I/O Performance Evaluation on Multicore Clusters with Atmospheric Model Environment. In: Proceedings of the 2010 22nd International Symposium on Computer Architecture and High Performance Computing Workshops, SBAC-PADW 2010, pp. 49–54. IEEE Computer Society, Washington, DC (2010), http://dx.doi.org/10.1109/SBAC-PADW.2010.15

    Chapter  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Müldner, T., Fry, C., Corbin, T., Miziołek, J.K. (2012). Parallelization of an XML Data Compressor on Multi-cores. In: Wyrzykowski, R., Dongarra, J., Karczewski, K., Waśniewski, J. (eds) Parallel Processing and Applied Mathematics. PPAM 2011. Lecture Notes in Computer Science, vol 7204. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31500-8_11

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-31500-8_11

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-31499-5

  • Online ISBN: 978-3-642-31500-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics