Skip to main content

Data Preprocessing to Enhance Flow Forecasting in a Tropical River Basin

  • Conference paper
  • First Online:
  • 2911 Accesses

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 744))

Abstract

Missing hydrometric data is a critical issue for water resources management projects and problems related to flow damage and risk assessment. Though numerous ways can be found in the literature to impute them (i.e. Box-Jenkins models, Linear regression models, case deletion, listwise and pairwise deletion, etc.), not all will render effective on a given dataset. in tropical river basin, it’s still needed to develop proven and simplified methods to deal with hydrometric data missingness and scarcity. This paper presents the analyses including an assessment of the condition of the existing hydrometric data and works related to the way in which the record was treated for flow forecasting purposes and the construction of the artificial neural network (ANN) models used for predicting the flows. The study was led based on 15-min rainfall, water surface elevation and discharge data, derived from the continuous real-time monitoring station located in the del Medio River Basin from the years 2012 to 2016. As a result, the proposed modeling approach followed two modeling methods, one employing the missing data record and the other was used a multiple imputation (MI) technique to impute the missing data and forecast flow for 1, 2 and 4 h ahead under each approach. The statistical metrics results for the two-modeling approaches, suggest the non-imputed data scenario to rule out the imputed data. This means it is recommended to further optimize the MI technique if to be used effectively to fill in the missing required days of measurements for estimating H3 gaps and afterwards to forecast the flow employing multilayer perceptron (MLP), artificial neural networks (ANNs) with 10-fold cross-validation.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

References

  1. Minera Panama, Environmental Impact Assessment Study (2010)

    Google Scholar 

  2. Christian, W.D., Robert, W.: An artificial neural network approach to rainfall runoff modeling. Hydrol. Sci. J. 43(1), 47–66 (1998)

    Article  Google Scholar 

  3. Toth, E., Brath, A., Montanari, A.: Comparison of short-term rainfall prediction models for real-time flood forecasting. J. Hydrol. 239, 132–147 (2000). Elsevier

    Article  Google Scholar 

  4. Hsieh, W.H., Tang, B.: Applying neural network models to prediction and analysis in meteorology and oceanography. Bull. Amer. Met. Soc. 79, 855–1870 (1998)

    Article  Google Scholar 

  5. FLOW3D Homepage. https://www.flow3d.com/. Accessed 18 Apr 2017

  6. HEC-RAS Homepage. http://www.hec.usace.army.mil/software/hec-ras/downloads.aspx. Accessed 18 Apr 2017

  7. MIKE HYDRO RIVER Homepage. https://www.mikepoweredbydhi.com/products/mike-hydro-river. Accessed 18 Apr 2017

  8. OPENFOAM Homepage. http://www.openfoam.com/. Accessed 18 Apr 2017

  9. REEF3D Homepage. https://reef3d.wordpress.com/. Accessed 18 Apr 2017

  10. Frank, E., Hall, M.A., Witten, I.H.: The WEKA workbench. In: Data Mining: Practical Machine Learning Tools and Techniques, 4th edn. Morgan Kaufmann, Burlington (2016). (Online Appendix)

    Google Scholar 

  11. Little, R.J.A., Rubin, D.B.: Statistical Analysis with Missing Data, 2nd edn. Wiley, Hoboken (2002)

    MATH  Google Scholar 

  12. Wilkinson, L., Task Force on Statistical Inference. Statistical methods in psychology journals: guidelines and explanations. Am. Psychol. 54, 594–604 (1999)

    Google Scholar 

  13. Bodner, T.E.: Missing data: prevalence and reporting practices. Psychol. Rep. 99, 675–680 (2006)

    Article  Google Scholar 

  14. Peugh, J.L., Enders, C.K.: Missing data in educational research: a review of reporting practices and suggestions for improvement. Rev. Educ. Res. 74, 525–556 (2004)

    Article  Google Scholar 

  15. Rubin, D.B.: Inference and missing data. Biometrika 63(3), 581–592 (1976)

    Article  MathSciNet  MATH  Google Scholar 

  16. R Core Team. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2017). https://www.R-project.org/

  17. van Buuren, S., Groothuis-Oudshoorn, K.: Mice: multivariate Imputation by chained equations in R. J. Stat. Softw. 45(3) (2011). doi:10.18637/jss.v045.i03

Download references

Acknowledgement

The authors of this experiment will like to express their appreciation to Minera Panama S.A., Environmental Department for providing the necessary data. This work has been partially supported by the Spanish MICINN under projects: TRA2015–63708-R, and TRA2016-78886-C3-1-R.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jose Simmonds .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2017 Springer International Publishing AG

About this paper

Cite this paper

Simmonds, J., Gómez, J.A., Ledezma, A. (2017). Data Preprocessing to Enhance Flow Forecasting in a Tropical River Basin. In: Boracchi, G., Iliadis, L., Jayne, C., Likas, A. (eds) Engineering Applications of Neural Networks. EANN 2017. Communications in Computer and Information Science, vol 744. Springer, Cham. https://doi.org/10.1007/978-3-319-65172-9_36

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-65172-9_36

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-65171-2

  • Online ISBN: 978-3-319-65172-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics