Skip to main content

Performance Driven Database Design for Scalable Web Applications

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5739))

Abstract

Scaling up web applications requires distribution of load across multiple application servers and across multiple database servers. Distributing load across multiple application servers is fairly straightforward; however distributing load (select and UDI queries) across multiple database servers is more complex because of the synchronization requirements for multiple copies of the data. Different techniques have been investigated for data placement across multiple database servers, such as replication, partitioning and de-normalization. In this paper, we describe our architecture that utilizes these data placement techniques for determining the best possible layout of data. Our solution is general, and other data placement techniques can be integrated within our system. Once the data is laid out on the different database servers, our efficient query router routes the queries to the appropriate database server/(s). Our query router maintains multiple connections for a database server so that many queries are executed simultaneously on a database server, thus increasing the utilization of each database server. Our query router also implements a locking mechanism to ensure that the queries on a database server are executed in order. We have implemented our solutions in our system, that we call SIPD (System for Intelligent Placement of Data). Preliminary experimental results illustrate the significant performance benefits achievable by our system.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Amiri, K., Park, S., Tewari, R., Padmanabhan, S.: DBProxy: A Dynamic Data Cache for Web Applications. In: IEEE Int’l Conference on Data Engineering (ICDE), Bangalore, India (March 2003)

    Google Scholar 

  2. Böhm, K., Mlivoncic, M., Weber, R.: Quality-aware and load-sensitive planning of image similarity queries. In: Proceedings of the 17th International Conference on Data Engineering, Washington, DC, USA, pp. 401–410 (2001)

    Google Scholar 

  3. Cecchet, E.: C-JDBC: A Middleware Framework for Database Clustering. IEEE Data Engineering Bulletin 27(2), 19–26 (2004)

    Google Scholar 

  4. Gao, L., Dahlin, M., Nayate, A., Zheng, J., Iyengar, A.: Application Specific Data Replication for Edge Services. In: Int’l World Wide Web Conf. (WWW), Budapest, Hungary (May 2003)

    Google Scholar 

  5. Groothuyse, T., Sivasubramanian, S., Pierre, G.: GlobeTP: Template-Based Database Replication for Scalable Web Applications. In: Int’l World Wide Web Conf. (WWW), Alberta, Canada (May 2007)

    Google Scholar 

  6. Heffernan, N.T., Turner, T.E., Lourenco, A.L.N., Macasek, M.A., Nuzzo-Jones, G., Koedinger, K.R.: The ASSISTment Builder: Towards an Analysis of Cost Effectiveness of ITS creation. In: FLAIRS, Florida, USA (2006)

    Google Scholar 

  7. Inmon, W.H.: Information Engineering for the Practitioner: Putting Theory Into Practice. Prentice Hall, Englewood Cliffs (1988)

    Google Scholar 

  8. Kemme, B., Alonso, G.: Don’t be Lazy, be Consistent: Postgres-R, a New Way to Implement Database Replication. In: Int’l Conference on Very Large Data Bases (VLDB), Cairo, Egypt (September 2000)

    Google Scholar 

  9. Plattner, C., Alonso, G.: Ganymed: Scalable Replication for Transactional Web Applications. In: Jacobsen, H.-A. (ed.) Middleware 2004. LNCS, vol. 3231, pp. 155–174. Springer, Heidelberg (2004)

    Chapter  Google Scholar 

  10. Ramakrishnan, R., Gehrke, J.: Database Management Systems, 3rd edn. McGraw-Hill, New York (2003)

    MATH  Google Scholar 

  11. Röhm, U., Böhm, K., Schek, H.-J.: Cache-aware query routing in a cluster of databases. In: Proceedings of the 17th International Conference on Data Engineering, Washington, DC, USA, pp. 641–650 (2001)

    Google Scholar 

  12. Ronstrom, M., Thalmann, L.: MySQL Cluster Architecture Overview. MySQL Teachnical White Paper (April 2004)

    Google Scholar 

  13. Schkolnick, M., Sorenson, P.: Denormalization: A Performance Oriented Database Design Technique. In: AICA Congress, Bologna, Italy (1980)

    Google Scholar 

  14. Sivasubramanian, S., Pierre, G., van Steen, M.: GlobeDB: Autonomic Data Replication for Web Applications. In: Int’l World Wide Web Conf. (WWW), Chiba, Japan (May 2005)

    Google Scholar 

  15. Tolia, N., Satyanarayanan, M.: Consistency-Preserving Caching of Dynamic Database Content. In: Int’l World Wide Web Conf. (WWW), Alberta, Canada (May 2007)

    Google Scholar 

  16. Westland, J.C.: Economic Incentives for Database Normalization. Information Processing and Management 28(5), 647–662 (1992)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Patvarczki, J., Mani, M., Heffernan, N. (2009). Performance Driven Database Design for Scalable Web Applications. In: Grundspenkis, J., Morzy, T., Vossen, G. (eds) Advances in Databases and Information Systems. ADBIS 2009. Lecture Notes in Computer Science, vol 5739. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03973-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03973-7_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03972-0

  • Online ISBN: 978-3-642-03973-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics