Abstract
Missing hydrometric data is a critical issue for water resources management projects and problems related to flow damage and risk assessment. Though numerous ways can be found in the literature to impute them (i.e. Box-Jenkins models, Linear regression models, case deletion, listwise and pairwise deletion, etc.), not all will render effective on a given dataset. in tropical river basin, it’s still needed to develop proven and simplified methods to deal with hydrometric data missingness and scarcity. This paper presents the analyses including an assessment of the condition of the existing hydrometric data and works related to the way in which the record was treated for flow forecasting purposes and the construction of the artificial neural network (ANN) models used for predicting the flows. The study was led based on 15-min rainfall, water surface elevation and discharge data, derived from the continuous real-time monitoring station located in the del Medio River Basin from the years 2012 to 2016. As a result, the proposed modeling approach followed two modeling methods, one employing the missing data record and the other was used a multiple imputation (MI) technique to impute the missing data and forecast flow for 1, 2 and 4 h ahead under each approach. The statistical metrics results for the two-modeling approaches, suggest the non-imputed data scenario to rule out the imputed data. This means it is recommended to further optimize the MI technique if to be used effectively to fill in the missing required days of measurements for estimating H3 gaps and afterwards to forecast the flow employing multilayer perceptron (MLP), artificial neural networks (ANNs) with 10-fold cross-validation.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Minera Panama, Environmental Impact Assessment Study (2010)
Christian, W.D., Robert, W.: An artificial neural network approach to rainfall runoff modeling. Hydrol. Sci. J. 43(1), 47–66 (1998)
Toth, E., Brath, A., Montanari, A.: Comparison of short-term rainfall prediction models for real-time flood forecasting. J. Hydrol. 239, 132–147 (2000). Elsevier
Hsieh, W.H., Tang, B.: Applying neural network models to prediction and analysis in meteorology and oceanography. Bull. Amer. Met. Soc. 79, 855–1870 (1998)
FLOW3D Homepage. https://www.flow3d.com/. Accessed 18 Apr 2017
HEC-RAS Homepage. http://www.hec.usace.army.mil/software/hec-ras/downloads.aspx. Accessed 18 Apr 2017
MIKE HYDRO RIVER Homepage. https://www.mikepoweredbydhi.com/products/mike-hydro-river. Accessed 18 Apr 2017
OPENFOAM Homepage. http://www.openfoam.com/. Accessed 18 Apr 2017
REEF3D Homepage. https://reef3d.wordpress.com/. Accessed 18 Apr 2017
Frank, E., Hall, M.A., Witten, I.H.: The WEKA workbench. In: Data Mining: Practical Machine Learning Tools and Techniques, 4th edn. Morgan Kaufmann, Burlington (2016). (Online Appendix)
Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data, 2nd edn. Wiley, Hoboken (2002)
Wilkinson, L., Task Force on Statistical Inference. Statistical methods in psychology journals: guidelines and explanations. Am. Psychol. 54, 594–604 (1999)
Bodner, T.E.: Missing data: prevalence and reporting practices. Psychol. Rep. 99, 675–680 (2006)
Peugh, J.L., Enders, C.K.: Missing data in educational research: a review of reporting practices and suggestions for improvement. Rev. Educ. Res. 74, 525–556 (2004)
Rubin, D.B.: Inference and missing data. Biometrika 63(3), 581–592 (1976)
R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2017). https://www.R-project.org/
van Buuren, S., Groothuis-Oudshoorn, K.: Mice: multivariate Imputation by chained equations in R. J. Stat. Softw. 45(3) (2011). doi:10.18637/jss.v045.i03
Acknowledgement
The authors of this experiment will like to express their appreciation to Minera Panama S.A., Environmental Department for providing the necessary data. This work has been partially supported by the Spanish MICINN under projects: TRA2015–63708-R, and TRA2016-78886-C3-1-R.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer International Publishing AG
About this paper
Cite this paper
Simmonds, J., Gómez, J.A., Ledezma, A. (2017). Data Preprocessing to Enhance Flow Forecasting in a Tropical River Basin. In: Boracchi, G., Iliadis, L., Jayne, C., Likas, A. (eds) Engineering Applications of Neural Networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham. https://doi.org/10.1007/978-3-319-65172-9_36
Download citation
DOI: https://doi.org/10.1007/978-3-319-65172-9_36
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-65171-2
Online ISBN: 978-3-319-65172-9
eBook Packages: Computer ScienceComputer Science (R0)