Skip to main content

A Cost Model for DBaaS Storage

  • Conference paper
  • First Online:
Database and Expert Systems Applications (DEXA 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9827))

Included in the following conference series:

Abstract

Cloud infrastructures employ hybrid storage systems that incorporate various types of devices (flash memory solid-state and hard disk drives). Dealing with such heterogeneity makes the use of data placements strategies necessary. These strategies generally rely on cost modeling techniques. In this paper, we propose a cost model for the storage of database objects in a Cloud infrastructure. Our cost model increments the existing work by including: (1) storage cost, which comprises the occupation, the energy and the endurance costs, (2) the penalty cost that could arise from the SLA (Service Level Agreement) violation, and (3) the migration cost resulting from the object movement between storage systems. We also evaluate the relevance of our model and its usability throughout examples.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Cheng, Y., Iqbal, M.S., Gupta, A., Butt, A.R.: Pricing games for hybrid object stores in the cloud: provider vs. tenant. In: 7th USENIX Workshop on Hot Topics in Cloud Computing (HotCloud 2015) (2015)

    Google Scholar 

  2. Li, Z., Mukker, A., Zadok, E.: On the importance of evaluating storage systems’ $costs. In: 6th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage 2014) (2014)

    Google Scholar 

  3. Cheng, Y., Iqbal, M.S., Gupta, A., Butt, A.R.: CAST: tiering storage for data analytics in the cloud. In: Proceedings of the 24th International Symposium on High-Performance Parallel and Distributed Computing (2015)

    Google Scholar 

  4. Zhang, N., Tatemura, J., Patel, J.M., Hacigümüş, H.: Towards cost-effective storage provisioning for DBMSs. Proc. VLDB Endow. 5, 274–285 (2011)

    Article  Google Scholar 

  5. Tai, J., Sheng, B., Yao, Y., Mi, N.: Live data migration for reducing SLA violations in multi-tiered storage systems. In: 2014 IEEE International Conference on Cloud Engineering (IC2E), pp. 361–366 (2014)

    Google Scholar 

  6. Zhang, G., Chiu, L., Liu, L.: Adaptive data migration in multi-tiered storage based cloud environment. In: 2010 IEEE 3rd International Conference on Cloud Computing (CLOUD), pp. 148–155 (2010)

    Google Scholar 

  7. Ouarnoughi, H., Boukhobza, J., Singhoff, F., Rubini, S.: A cost model for virtual machine storage in cloud IaaS context. In: 2016 24th Euromicro International Conference on Parallel, Distributed, and Network-Based Processing (PDP), pp. 664–671 (2016)

    Google Scholar 

  8. Boukhobza, J.: Flashing in the Cloud: Shedding some Light on NAND Flash Memory Storage Systems. IGI Global, Hershey (2013)

    Book  Google Scholar 

  9. Shriver, E.: Performance modeling for realistic storage devices (1997)

    Google Scholar 

  10. Sharaf, M.A., Chrysanthis, P.K., Labrinidis, A., Amza, C.: Optimizing I/O-intensive transactions in highly interactive applications. In: Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pp. 785–798. ACM, New York (2009)

    Google Scholar 

  11. Oh, Y., Choi, J., Lee, D., Noh, S.H.: Caching less for better performance: balancing cache size and update cost of flash memory cache in hybrid storage systems. In: FAST (2012)

    Google Scholar 

  12. Guerra, J., Pucha, H., Glider, J.S., Belluomini, W., Rangaswami, R.: Cost effective storage using extent based dynamic tiering. In: FAST, pp. 20–20 (2011)

    Google Scholar 

  13. Kim, Y., Gupta, A., Urgaonkar, B., Berman, P., Sivasubramaniam, A.: HybridStore: a cost-efficient, high-performance storage system combining SSDs and HDDs. In: 2011 IEEE 19th International Symposium on Modeling, Analysis Simulation of Computer and Telecommunication Systems (MASCOTS), pp. 227–236 (2011)

    Google Scholar 

  14. Kim, Y., Gupta, A., Urgaonkar, B., Berman, P., Sivasubramaniam, A.: HybridPlan: a capacity planning technique for projecting storage requirements in hybrid storage systems. J. Supercomput. 67, 277–303 (2013)

    Article  Google Scholar 

  15. Lin, L., Zhu, Y., Yue, J., Cai, Z., Segee, B.: Hot random off-loading: a hybrid storage system with dynamic data migration. In: 2011 IEEE 19th International Symposium on Modeling, Analysis Simulation of Computer and Telecommunication Systems (MASCOTS), pp. 318–325 (2011)

    Google Scholar 

  16. Lu, C., Alvarez, G.A., Wilkes, J.: Aqueduct: online data migration with performance guarantees. In: Proceedings of the 1st USENIX Conference on File and Storage Technologies. USENIX Association, Berkeley (2002)

    Google Scholar 

  17. Sundaram, V., Wood, T., Shenoy, P.: Efficient data migration in self-managing storage systems. In: IEEE International Conference on Autonomic Computing, 2006, ICAC 2006, pp. 297–300 (2006)

    Google Scholar 

  18. Dutta, A.K., Hasan, R.: How much does storage really cost? Towards a full cost accounting model for data storage. In: Altmann, J., Vanmechelen, K., Rana, O.F. (eds.) GECON 2013. LNCS, vol. 8193, pp. 29–43. Springer, Heidelberg (2013)

    Chapter  Google Scholar 

  19. Moore, R.L., D’Aoust, J., McDonald, R.H., Minor, D.: Disk and tape storage cost models. In: Archiving Conference, pp. 29–32. Society for Imaging Science and Technology (2007)

    Google Scholar 

  20. Gurumurthi, S., Sivasubramaniam, A., Kandemir, M., Franke, H.: DRPM: dynamic speed control for power management in server class disks. In: Proceedings of 30th Annual International Symposium on Computer Architecture, 2003, pp. 169–179 (2003)

    Google Scholar 

  21. Hylick, A., Sohan, R., Rice, A., Jones, B.: An analysis of hard drive energy consumption. In: IEEE International Symposium on Modeling, Analysis and Simulation of Computers and Telecommunication Systems, 2008, MASCOTS 2008, pp. 1–10 (2008)

    Google Scholar 

  22. Son, S.W., Chen, G., Kandemir, M.: Disk layout optimization for reducing energy consumption. In: Proceedings of the 19th Annual International Conference on Supercomputing, pp. 274–283. ACM, New York (2005)

    Google Scholar 

  23. Pinheiro, E., Weber, W.-D., Barroso, L.A.: Failure trends in a large disk drive population. In: FAST, pp. 17–23 (2007)

    Google Scholar 

  24. Garg, S.K., Gopalaiyengar, S.K., Buyya, R.: SLA-based resource provisioning for heterogeneous workloads in a virtualized cloud datacenter. In: Xiang, Y., Cuzzocrea, A., Hobbs, M., Zhou, W. (eds.) ICA3PP 2011, Part I. LNCS, vol. 7016, pp. 371–384. Springer, Heidelberg (2011)

    Chapter  Google Scholar 

  25. Canim, M., Mihaila, G.A., Bhattacharjee, B., Ross, K.A., Lang, C.A.: An object placement advisor for DB2 using solid state storage. Proc. VLDB Endow. 2, 1318–1329 (2009)

    Article  Google Scholar 

  26. Du, L.: Pricing and resource allocation in a cloud computing market. In: Proceedings of the 2012 12th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing (Ccgrid 2012), pp. 817–822. IEEE Computer Society, Washington (2012)

    Google Scholar 

  27. Ouarnoughi, H., Boukhobza, J., Singhoff, F., Rubini, S.: A multi-level I/O tracer for timing and performance storage systems in IaaS cloud. In: REACTION (2014)

    Google Scholar 

Download references

Acknowledgement

This work is supported by the PHC (Partenariat Hubert Curien) Tassili GHEEMaS project (number 16MDU964).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jalil Boukhobza .

Editor information

Editors and Affiliations

Appendix A: Notations

Appendix A: Notations

Variable

Description

Customers

\( \varvec{U} \),\( \varvec{u}_{\varvec{k}} \)

The set of customer in cloud, kth customer in cloud \( {\text{k}} \in \{1,{\text{K}}\} \)

\( \varvec{w}_{{\varvec{u}_{\varvec{k}} }} \)

The workload of customer \( u_{k} \)

\( \varvec{pn}_{{\varvec{u}_{\varvec{k}} }} \)

The penalty of customer \( u_{k} \)

\( \varvec{iops}_{sla,u_{k}} \)

The IOPS SLA of customer \( u_{k} \)

\( \varvec{iops}_{offered,u} (\varvec{u}_{\varvec{k}} ) \)

The IOPS offered to customer \( {\text{u}}_{\text{k}} \) in the period \( T \)

\( \varvec{t}_{exe,u} (\varvec{u}_{\varvec{k}} ) \)

The time need to handle the I/O workload of customer \( {\text{u}}_{\text{k}} \)

\( \varvec{io}_{req,u} (\varvec{u}_{\varvec{k}} ) \)

The total number of the IO requests issued from customer \( {\text{u}}_{\text{k}} \)

Objects

\( O , O_{{u_{k} }} \)

The set of cloud objects, The set of objects of customer \( u_{k} \)

\( O_{{d_{j} }} \)

The set of objects hosted in device \( d_{j} \)

\( o_{{i,u_{k} }} \), \( s_{{o_{{i,u_{k} }} }} \)

The \( i th \) object of customer \( u_{k} i \in \{1,I\} \), its size

\( req_{{op,o_{{i,u_{k} }} }} \)

The average IOPS of type \( op \) issued to the object \( o_{i,k} \)

Migration

\( O_{mv} ,o_{{mv_{m} ,d_{s} ,d_{d} }} \)

The set of objects to move, the \( m th \) objects to move

\( pr_{{o_{{mv_{m} }} }} \)

The priorty of \( m th \) objects to move

Devices

\( D \), \( d_{j} \)

The set of device, The \( j th \) device \( j \in \{1,J\} \)

\( p_{{d_{j} }} \),\( c_{{d_{j} }} \),\( wo_{{d_{j} }} \)

The price, The capacity, The wear out of device \( d_{j} \)

\( iops_{{op,d_{j} }} \)

The max throughput of device \( d_{j} \) for operation type \( op \)

\( t_{{op,d_{j} }} \)

The response time of device \( d_{j} \) for operation type \( op \)

\( t_{atv} (d_{j} ) \),\( t_{idl} (d_{j} ) \)

The active time, the idle time of device

\( seek_{{d_{j} }} \)

The average seek distance of device \( dj \)

\( msr_{{op,d_{j} }} \)

The set of experimental measures taken for the device \( d_{j} \) and I/O operations of type \( op \)

\( wo_{w} \)

The impact of workload on the lifetime of device

\( E_{ss,t} \),\( E_{d,t} \),\( E_{up} \)

storage system energy, device energy, energy unit price

\( P,P_{{atv,d_{j} }} \),\( P_{{idl,d_{j} }} \)

The power, the active power, the idle power of device \( d_{j} \)

General

\( io_{size} \)

The size of I/O block (database block)

\( T \)

Period of time

\( op \)

\( op \subseteq OP, OP = \{rr,sr,rw,sw\} \),\( rr \) :random read. \( sr \): sequential read,\( rw \) :random write, and \( sw \) :sequential write.

Cost notations

\( Cost_{pl,T} \)

The placment cost for given period \( T \)

\( Cost_{stg,T} \)

The storage cost for given period \( T \)

\( Cost_{pnl,T} \)

The penalty cost for given period \( T \)

\( Cost_{pnl,u,T} (u_{k}) \)

The penalty cost of customer \( u_{k} \) for given period \( T \)

\( Cost_{mgr,T} \)

The Migration Cost for given period \( T \)

\( Cost_{mng,T} \)

The Management Cost for given period \( T \)

\( Cost_{occp,T} \)

The occupation cost for given period \( T \)

\( Cost_{w,T} \)

The workload cost for given period \( T \)

\( Cost_{amz,T} \)

The amortized cost for given period \( T \)

\( Cost_{amz,1} \)

The amortized cost for one unite of time

\( Cost_{erg,T} \)

The energy cost for given period \( T \)

\( Cost_{edr,T} \)

The Endurance cost of stotage system for given period \( T \)

\( Cost_{edr,d,T} (d_{j}) \)

The endurence cost of device \( d_{j} \) for given period \( T \)

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing Switzerland

About this paper

Cite this paper

Boukhelef, D., Boukhobza, J., Boukhalfa, K. (2016). A Cost Model for DBaaS Storage. In: Hartmann, S., Ma, H. (eds) Database and Expert Systems Applications. DEXA 2016. Lecture Notes in Computer Science(), vol 9827. Springer, Cham. https://doi.org/10.1007/978-3-319-44403-1_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-44403-1_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-44402-4

  • Online ISBN: 978-3-319-44403-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics