Skip to main content

SQL TVF Controlling Forms - Express Structured Parallel Data Intensive Computing

  • Conference paper
  • 1143 Accesses

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5181))

Abstract

A key issue in supporting the synthesis of data intensive computation and data management is to liberate users from low-level parallel programming, by specifying applications functionally independent of the underlying server infrastructure, and further, by providing high-level primitives to express the control flow of applying functions to data partitions. Currently only few such primitives, e.g. Map-Reduce and Cross-Apply, are available, and their expressive power is limited to “flat parallel computing”. To deal with “structured parallel computing” where a function is applied to multiple objects with execution order dependencies, a general framework for creating and combining such primitives is required.

We propose the SQL-FCF framework as the database centric solution to the above problem. We embed into SQL queries the Function Controlling Forms (FCFs) to specify the flow control of applying Table Valued Functions (TVFs) to multiple data partitions. We further support the extensibility of this framework by allowing new FCFs to be defined from existing ones with SQL phrases. Based on this approach, we provided a SQL based high-level interface for “structured parallel computing” in architecting a hydrologic scientific computation platform. Envisioning that the simple parallel computing primitives will evolve and form a general framework, our effort is a step towards that goal.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Asanovic, K., Bodik, R., Catanzo, B.C., Gebis, J.J., Husbands, P., Keutzer, K., Patterson, D.A., Plishker, W.L., Shalf, J., Williams, S.W., Yelick, K.A.: The landscape of parallel computing research: A view from Berkeley, Technical Report UCB/EECS-2006-183, U. C., Berkeley, December 18 (2006)

    Google Scholar 

  2. Backus, J.: Can Programming be Liberated from the von Neumann Style? A functional style and its algebra of programs. CACM 21(8) (1978)

    Google Scholar 

  3. Barclay, T., Gray, J., Chong, W.: TerraServer Bricks – A High Availability Cluster Alternative, Technical Report, MSR-TR-2004-107 (October 2004)

    Google Scholar 

  4. Barroso, L.A., Dean, J., H"olze, U.: Web search for a planet:”The Google cluster architecture. IEEE Micro 23(2), 22–28 (2003)

    Article  Google Scholar 

  5. Ben-gan, I., et al.: Inside Microsoft SQL Server 2005: T-SQL Programming (2006)

    Google Scholar 

  6. Bryant, R.E.: Data-Intensive Supercomputing: The case for DISC, CMU-CS-07-128 (2007)

    Google Scholar 

  7. Chen, Q., Hsu, M., Dayal, U.: A Data-Warehouse/OLAP Framework for Scalable Telecommunication Tandem Traffic Analysis. In: ICDE 2000, pp. 201–210 (2000)

    Google Scholar 

  8. Chen, Q., Hsu, M.: Inter-Enterprise Collaborative Business Process Management. In: Proc. of 17th Int’l Conf. on Data Engineering (ICDE-2001), Germany (2001)

    Google Scholar 

  9. Chen, Q., Dayal, U., Hsu, M.: A Distributed OLAP Infrastructure for E-Commerce. In: Proc. Fourth IFCIS CoopIS Conference, UK (1999)

    Google Scholar 

  10. Chen, Q., Kambayashi, Y.: Nested Relation Based Database Knowledge Representation. In: ACM-SIGMOD Conference, pp. 328–337 (1991)

    Google Scholar 

  11. Chen, Q., Gardarin, G.: An Implementation Model for Reasoning with Complex Objects. In: ACM-SIGMOD Conference, pp. 164–172 (1988)

    Google Scholar 

  12. Dean, J.: Experiences with MapReduce, an abstraction for large-scale computation. In: International Conference on Parallel Architecture and Compilation Techniques. ACM, New York (2006)

    Google Scholar 

  13. DeWitt, D., Gray, J.: Parallel Database Systems: the Future of High Performance Database Systems. CACM 35(6) (June 1992)

    Google Scholar 

  14. Gray, J., Liu, D.T., Nieto-Santisteban, M.A., Szalay, A.S., Heber, G., DeWitt, D.: Scientific Data Management in the Coming Decade. SIGMOD Record 34(4) (2005)

    Google Scholar 

  15. Hsu, M., Xiong, Y.: Building a Scalable Web Query System. In: Bhalla, S. (ed.) DNIS 2007. LNCS, vol. 4777, Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  16. HP Neoview enterprise datawarehousing platform, http://h71028.www7.hp.com/ERC/downloads/4AA0-7932ENW.pdf

  17. Netz, A., Chaudhuri, S., Bernhardt, J., Fayyad, U.: Integration of data mining and relational databases. In: Proceeding of the 26th Conference on Very Large Databases, pp. 719–722 (2000)

    Google Scholar 

  18. Olston, C., Reed, B., Srivastava, U., Kumar, R., Tomkins, A.: Pig Latin: A Not-So-Foreign Language for Data Processing. In: VLDB (2008)

    Google Scholar 

  19. Saarenvirta, G.: Operational Data Mining. DB2 Magazine 6 (2001)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Sourav S. Bhowmick Josef Küng Roland Wagner

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Chen, Q., Hsu, M. (2008). SQL TVF Controlling Forms - Express Structured Parallel Data Intensive Computing. In: Bhowmick, S.S., Küng, J., Wagner, R. (eds) Database and Expert Systems Applications. DEXA 2008. Lecture Notes in Computer Science, vol 5181. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-85654-2_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-85654-2_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-85653-5

  • Online ISBN: 978-3-540-85654-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics