Skip to main content

Research on Light-Weight Compression Schemes Based on Simulative Column-Store

  • Conference paper
  • First Online:
Book cover Big Data Computing and Communications (BigCom 2015)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9196))

Included in the following conference series:

  • 2133 Accesses

Abstract

Column-stores have achieved significant improvement over row-oriented databases and various light-weight compression schemes make this architecture more efficient. However these performance benefits cannot be utilized directly by most relational DBMSs though they still occupy the largest share of database market. In this paper we explore the potential of row-stores by simulating column-store within a traditional engine and additionally introduce light-weight compression algorithms that are commonly used in column-stores. Besides we propose a novel light-weight algorithm in the context of simulative column-store. Experimental analysis demonstrates that these optimization techniques bring encouraging performance gains in certain OLAP query scenarios.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Abadi, D., Madden, S., Ferreira, M.: Integrating compression and execution in column-oriented database systems. In: Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pp. 671–682. ACM (2006)

    Google Scholar 

  2. Abadi, D.J.: Query execution in column-oriented database systems. Ph.D. thesis, Massachusetts Institute of Technology (2008)

    Google Scholar 

  3. Abadi, D.J., Boncz, P.A., Harizopoulos, S.: Column-oriented database systems. Proceedings of the VLDB Endowment 2(2), 1664–1665 (2009)

    Article  Google Scholar 

  4. Abadi, D.J., Madden, S.R., Hachem, N.: Column-stores vs. row-stores: how different are they really? In: Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pp. 967–980. ACM (2008)

    Google Scholar 

  5. Apte, T., Ingle, M., Goyal, A.: Dictionary symbol encoding: A column-store case study. In: 2014 World Congress on Computer Applications and Information Systems (WCCAIS), pp. 1–4. IEEE (2014)

    Google Scholar 

  6. Boncz, P.A., Kersten, M.L.: Monet: An impressionist sketch of an advanced database system. In: Proc. IEEE BIWIT Workshop. Citeseer (1994)

    Google Scholar 

  7. Bruno, N.: Teaching an old elephant new tricks. arXiv preprint (2009). 0909.1758

  8. El-Helw, A., Ross, K.A., Bhattacharjee, B., Lang, C.A., Mihaila, G.A.: Column-oriented query processing for row stores. In: Proceedings of the ACM 14th International Workshop on Data Warehousing and OLAP, pp. 67–74. ACM (2011)

    Google Scholar 

  9. Ferreira, M.C.: Compression and query execution within column oriented databases. Ph.D. thesis, Massachusetts Institute of Technology (2005)

    Google Scholar 

  10. Hall, A., Bachmann, O., Büssow, R., Gănceanu, S., Nunkesser, M.: Processing a trillion cells per mouse click. Proceedings of the VLDB Endowment 5(11), 1436–1446 (2012)

    Article  Google Scholar 

  11. Harizopoulos, S., Liang, V., Abadi, D.J., Madden, S.: Performance tradeoffs in read-optimized databases. In: Proceedings of the 32nd International Conference on Very Large Data Bases, pp. 487–498. VLDB Endowment (2006)

    Google Scholar 

  12. Jagodits, S.K.G.C.T., Valduriez, H.B.P.: A query processing strategy for the decomposed storage model. In: Proceedings, p. 636. Order from IEEE Computer Society (1987)

    Google Scholar 

  13. Lemire, D., Kaser, O.: Reordering columns for smaller indexes. Information Sciences 181(12), 2550–2570 (2011)

    Article  MathSciNet  Google Scholar 

  14. Lemire, D., Kaser, O., Gutarra, E.: Reordering rows for better compression: Beyond the lexicographic order. ACM Transactions on Database Systems (TODS) 37(3), 20 (2012)

    Article  Google Scholar 

  15. MacNicol, R., French, B.: Sybase iq multiplex-designed for analytics. In: Proceedings of the Thirtieth International Conference on Very Large Data Bases, vol. 30, pp. 1227–1230. VLDB Endowment (2004)

    Google Scholar 

  16. Stonebraker, M., Abadi, D.J., Batkin, A., Chen, X., Cherniack, M., Ferreira, M., Lau, E., Lin, A., Madden, S., O’Neil, E., et al.: C-store: a column-oriented dbms. In: Proceedings of the 31st International Conference on Very Large Data Bases, pp. 553–564. VLDB Endowment (2005)

    Google Scholar 

  17. Yu, L., Zhang, Y., Wang, S., Zhang, Q.: Research on simulative column-storage model policy based on row-storage model. J. Comput. Res. Dev. 47, 78–885 (2010)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Meng Huang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2015 Springer International Publishing Switzerland

About this paper

Cite this paper

Huang, M., Qiu, X., Li, S., Liu, D. (2015). Research on Light-Weight Compression Schemes Based on Simulative Column-Store. In: Wang, Y., Xiong, H., Argamon, S., Li, X., Li, J. (eds) Big Data Computing and Communications. BigCom 2015. Lecture Notes in Computer Science(), vol 9196. Springer, Cham. https://doi.org/10.1007/978-3-319-22047-5_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-22047-5_5

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-22046-8

  • Online ISBN: 978-3-319-22047-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics