Towards Scheduling Data-Intensive and Privacy-Aware Workflows in Clouds
Nowadays, business or scientific workflows with a massive of data are springing up in clouds. To avoid security and privacy leakage issues, users’ privacy or sensitive data may be restricted to being processed in some specified and trusted cloud datacenters. Meanwhile, users may also pay attention to the cost incurred by renting cloud resources. Therefore, new workflow scheduling algorithms should be developed to achieve a balance between economically utilizing the cloud resources and protection of users’ data privacy and security. In this paper, we propose a cost-aware scheduling algorithm for executing multiple data-intensive and privacy-aware workflow instances in clouds. Our proposed algorithm is based on the strategy of batch processing, the ideas of simulated annealing algorithm and the particle swarm optimization, the coding strategy of which is devised to minimize the total execution cost while meeting specified privacy protection constraints. The experimental results demonstrate the effectiveness of our algorithm.
KeywordsPrivacy protection Cloud Workflow scheduling Cost Batch processing Particle swarm optimization
This paper was supported by National Natural Science Fund of China, under grant number 61402167, 61572187, 61402168, and National Science and Technology Support Project of China, under grant number 2015BAF32B01.
- 2.Chen, C., Liu, J., Wen, Y., Chen, J., Zhou, D.: A hybrid genetic algorithm for privacy and cost aware scheduling of data intensive workflow in cloud. In: Wang, G., Zomaya, A., Perez, G.M., Li, K. (eds.) ICA3PP 2015. LNCS, vol. 9528, pp. 578–591. Springer, Cham (2015). doi: 10.1007/978-3-319-27119-4_40 CrossRefGoogle Scholar
- 3.Kennedy, J., Eberhart, R.: Particle swarm optimization. In: Proceedings of IEEE International Conference on Neutral Networks. pp. 1942–1948. IEEE Service Center, Piscataway (1995)Google Scholar