Traps in Multisource Heterogeneous Big Data Processing
The importance of big data values and application efforts has reached a universal consensus in most fields. While because of the model difference of data storage, computing and analysis, the big data processing performance and big data values show greatly uneven in different scenarios. In this paper, we analyze the traps which may greatly impact big data processing results, and give our suggestions to solve these problems for the multisource and heterogeneous characteristics of big data.
KeywordsFault tolerance Data credibility Data fusion
- 5.Rui, H., Lizy, K., Jianfeng, Z.: Benchmarking big data systems: A review. IEEE Trans. Serv. Comput. 11(3), 1–17 (2017)Google Scholar
- 7.Aishwarya G., Ramnatthan A., et al.: Redundancy does not imply fault tolerance: analysis of distributed storage reactions to single errors and corruptions. In: Proceedings of 15th USENIX Conference on File and Storage Technologies, pp. 149–165 (2017)Google Scholar