Application of Classification Techniques for Prediction of Water Quality of 17 Selected Indian Rivers
Objective: In this study, prediction using classification techniques are used to predict the water quality of the 17 selected rivers in the year 2011 using their water quality in 2008 to interpret whether the water quality has improved or deteriorated. Methods/Analysis: For this prediction, we have used data mining classification techniques using Waikato Environment for Knowledge Analysis (WEKA) API to the dataset of selected 17 Indian rivers. The data used for prediction was created from ambient water quality of Aquatic Resources in India in 2008 and 2011. Data is obtained from data portal which was published under National Data Sharing and Accessibility Policy (NDSAP) and the contributor was Ministry of Environment and Forests Central Pollution Control Board (CPCB). Findings: Out of the four techniques used, prediction of classes, i.e. excellent, good, average and fair is best done by Naive Bayes followed by J48, SMO and REPTree technique.
KeywordsPrediction using classification techniques Weka Data mining Water quality Indian rivers
I profoundly thank Bharati Vidyapeeth’s College of Engineering for constant support and encouragement.
- 1.Masethe, H. D., & Masethe, M. A. (2014). Prediction of heart disease using classification algorithms. In Proceedings of the World Congress on Engineering and Computer Science 2014 (Vol. II), October 22–24, 2014, San Francisco, USA.Google Scholar
- 2.Saini, P., & Jain, A. K. (2013). Prediction using classification technique for the students’ enrollment process in higher educational institutions. International Journal of Computer Applications (0975–8887), 84(14).Google Scholar
- 3.Padmapriya, A. Dr. (2012). Prediction of higher education admissibility using classification algorithms. International Journal of Advanced Research in Computer Science and Software Engineering, 2(11).Google Scholar
- 4.Sudhakar, K., & Manimekalai, M. Dr. (2014). Study of heart disease prediction using data mining. International Journal of Advanced Research in Computer Science and Software Engineering, 4(1).Google Scholar
- 5.Srivastava, G., & Kumar, P. (2013). Water quality index with missing parameters. IJRET: International Journal of Research in Engineering and Technology, 02(04), 609–614.Google Scholar
- 6.National River Conservation Directorate (NRCD) http://envfor.nic.in/division/national-river-conservation-directorate-nrcd. Date accessed on 30/9/2016.
- 9.Ministry of Environment and Forests https://data.gov.in/ministrydepartment/ministry-environment-and-forests.
- 10.Sujatha, M., Prabhakar, S., & Lavanya Devi, G. Dr. (2013). A survey of classification techniques in data mining. International Journal of Innovations in Engineering and Technology (IJIET), 2(4). ISSN 2319-1058.Google Scholar
- 11.Bhargavi, P., & Jyothi, S. Dr. (2009). Applying Naive Bayes data mining technique for classification of agricultural land soils. IJCSNS International Journal of Computer Science and Network Security, 9(8).Google Scholar
- 12.Patil, T. R., & Sherekar, S. S. Mrs. (2013). Performance analysis of Naive Bayes and J48 classification algorithm for data classification. International Journal of Computer Science and Applications, 6(2). ISSN 0974-1011.Google Scholar
- 13.Platt, J. C. (1998). Sequential minimal optimization: A fast algorithm for training support vector machines (Technical Report MSR-TR-98-14), April 21, 1998.Google Scholar
- 14.Kalmegh, S. (2015). Analysis of WEKA data mining algorithm REPTree, simple cart and RandomTree for classification of Indian News. IJISET—International Journal of Innovative Science, Engineering & Technology, 2(2). ISSN 2348–7968.Google Scholar
- 15.Eclipse IDE http://www.eclipse.org/users/. Date accessed on 11/2/2017.
- 16.Weka website (Latest version 3.6) http://www.cs.waikato.ac.nz/ml/weka/. Date accessed on 30/9/2016.