Abstract
Modern data analytic flows involve complex data computations that may span multiple execution engines and need to be optimized for a variety of objectives like performance, fault-tolerance, freshness, and so on. In this paper, we present optimization techniques and tradeoffs in terms of a real-world, cyber-physical flow that starts with raw time series sensor data and external event data, and through a series of analytic operations produces automated actions and actionable insights.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Gupta, C., et al.: Better drilling through sensor analytics: A case study in live operational intelligence. In: SensorKDD, pp. 8–15 (2011)
Missier, P., Soiland-Reyes, S., Owen, S., Tan, W., Nenadic, A., Dunlop, I., Williams, A., Oinn, T., Goble, C.: Taverna, Reloaded. In: Gertz, M., Ludäscher, B. (eds.) SSDBM 2010. LNCS, vol. 6187, pp. 471–481. Springer, Heidelberg (2010)
Simitsis, A., Wilkinson, K., Dayal, U., Castellanos, M.: Optimizing ETL workflows for fault-tolerance. In: ICDE, pp. 385–396 (2010)
Simitsis, A., Wilkinson, K., Castellanos, M., Dayal, U.: Optimizing Analytic Data Flows for Multiple Execution Engines. In: SIGMOD Conference (2012)
Gupta, C., Mehta, A., Dayal, U.: Pqr: Predicting query execution times for autonomous workload management. In: ICAC, pp. 13–22 (2008)
Verma, A., Cherkasova, L., Campbell, R.H.: Aria: automatic resource inference and allocation for mapreduce environments. In: ICAC, pp. 235–244 (2011)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Simitsis, A., Gupta, C., Wilkinson, K., Dayal, U. (2012). Optimizing Flows for Real Time Operations Management. In: Ailamaki, A., Bowers, S. (eds) Scientific and Statistical Database Management. SSDBM 2012. Lecture Notes in Computer Science, vol 7338. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31235-9_44
Download citation
DOI: https://doi.org/10.1007/978-3-642-31235-9_44
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31234-2
Online ISBN: 978-3-642-31235-9
eBook Packages: Computer ScienceComputer Science (R0)