Abstract
A parallel forecasting approach used in weather prediction which has important aspect of the modern society, especially with the realization of modern smart cities. A new approach is considering here which will provide excellent result for large semi-structured data. Thus, it is very important to analyze weather data keeping in mind the enormity of the available data sizes. Here it is presented a methodology for weather data analysis keeping in mind the big-data nature of the data sizes pertaining to weather data. Here also it has been taken date-wise atmospheric conditions collected of decade. The traditional k-means clustering is used to form clusters which represents association in between related dates of current year’s and previous year’s weather data. Such associations predict atmospheric conditions of one year’s weather condition on the bases of previous data. Incremental k-means clustering algorithm is used to process current year’s weather parameters as new data and it shows that the calculated weather condition falls under one of the existing clusters to represent similar atmospheric conditions. The total work has been divided by two parts: first, Storing NCDC semi-structured data on hadoop cluster and second, fitting a clustering methodology for predicting weather conditions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kalyankar, M., and S. Alaspurkar. “Data Mining Technique to Analyse the Metrological Data.” International Journal of Advanced Research in Computer Science and Software Engineering 3.2 (2013): 114–118.
Rajinikanth, T. V., Balaram VVSSS, and N. Rajasekhar. “Analysis of Indian weather data sets using data mining techniques.” ACITY, WiMoN, CSIA, AIAA, DPPR, NECO, InWeS 1.1 (2014): 89–94.
Chauhan, Divya, and Jawahar Thakur. “Data Mining Techniques for Weather Prediction: A Review.” International Journal on Recent and Innovation Trends in Computing and Communication 2.8.
Yang, Jie, and Xiaoping Li. “Mapreduce based method for big data semantic clustering.” Systems, Man, and Cybernetics (SMC), 2013 IEEE International Conference on. IEEE, 2013.
Polato, Ivanilton, et al. “A comprehensive view of Hadoop research—A systematic literature review.” Journal of Network and Computer Applications46 (2014): 1–25.
Suguna, Nambiraj, and Keppana G. Thanushkodi. “Predicting missing attribute values using k-means clustering.” Journal of Computer Science 7.2 (2011): 216.
Oyelade, O. J., O. O. Oladipupo, and I. C. Obagbuwa. “Application of k Means Clustering algorithm for prediction of Students Academic Performance.” arXiv preprint arXiv:1002.2425(2010).
Pham, Duc Truong, Stefan Simeonov Dimov, and C. D. Nguyen. “An incremental K-means algorithm.” Proceedings of the Institution of Mechanical Engineers, Part C: Journal of Mechanical Engineering Science 218.7 (2004): 783–795.
Chakraborty, Sanjay, N. K. Nagwani, and Lopamudra Dey. “Weather Forecasting using Incremental K-Means Clustering.” Data Mining and Knowledge Engineering 4.5 (2012): 214–219.
Dey, Ratul Dey Sanjay Chakraborty Lopamudra. “Weather forecasting using Convex hull & K-Means Techniques An Approach.” arXiv preprint arXiv:1501.06456(2015).
[Online]. Available: ftp://ncdc.noaa.gov/pub/data/noaa/.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Sahoo, S. (2017). A Parallel Forecasting Approach Using Incremental K-means Clustering Technique. In: Behera, H., Mohapatra, D. (eds) Computational Intelligence in Data Mining. Advances in Intelligent Systems and Computing, vol 556. Springer, Singapore. https://doi.org/10.1007/978-981-10-3874-7_16
Download citation
DOI: https://doi.org/10.1007/978-981-10-3874-7_16
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-3873-0
Online ISBN: 978-981-10-3874-7
eBook Packages: EngineeringEngineering (R0)