A Cloud-Based Workflow Management Solution for Collaborative Analytics
Collaborative analytics aims to support reuse and collaboration in the data analysis process through the sharing of analytics methods, algorithms, and computation resources. However, realizing collaborative analytics is challenging because of large data sets, high-throughput demands, and computationally intensive requirements. In this demonstration, we present a cloud-based workflow management solution that allows collaborative analytics to run in a cloud computing environment. Our solution provides sharing of analytics resources, recommendation of analytic workflows, dynamic scheduling and provisioning for scalable data analytics, high availability through fault tolerance, and real-time monitoring and tracking of collaborative analytics status. Examples of a generic data mining analysis and climate change analytics are given to show that our work can be applied to a wide variety of real-world studies.
Keywords: Cloud Environment, Dynamic Schedule, Cloud Computing Environment, Hybrid Heuristic, Collaborative Analytic