Abstract
The complex physical processes in a typical coastal aquifer system with transient inputs to numerical simulation models (NSM) result in substantial computational burden in a coupled simulation–optimization (S/O) approach. In such situations, an approximate emulator of the complex physical processes provides a computationally efficient alternative to the NSM. The reliability of these surrogate models (SM) within the coupled S/O approach depends on how accurately they capture the trend of the underlying physical processes. Moreover, these SMs are often associated with prediction uncertainties, which hinder optimality of the solution of the coupled S/O methodology. In this review article, we summarize ensemble approach of combining data-driven SMs to address this prediction uncertainty. Different techniques of ensemble formation as well as their relative advantages and disadvantages are also discussed. Although a wide range of data-driven SMs have been used to approximate associated physical processes of coastal aquifers, the use of ensemble SMs is quite limited. Moreover, these ensemble-based modelling approaches are based on manipulating the training data set, i.e., using different realizations of training data set to train individual SMs within the ensemble. Although ensemble formation by combining multiple SMs based on different algorithms can be found in other application domains, the application of ensemble SMs in the prediction of saltwater intrusion processes has not been developed yet. In addition, more advanced ensemble surrogate-modelling approaches are yet to be established in the context of developing regional scale saltwater intrusion management models.
Similar content being viewed by others
References
Abd-Elhamid HF, Javadi AA (2011) A cost-effective method to control seawater intrusion in coastal aquifers. Water Resour Manage 25(11):2755–2780. https://doi.org/10.1007/s11269-011-9837-7
Ataie-Ashtiani B, Ketabchi H (2011) Elitist continuous ant colony optimization algorithm for optimal management of coastal aquifers. Water Resour Manage 25(1):165–190. https://doi.org/10.1007/s11269-010-9693-x
Ataie-Ashtiani B, Ketabchi H, Rajabi MM (2014) Optimal management of a freshwater lens in a small island using surrogate models and evolutionary algorithms. J Hydrol Eng 19(2):339–354. https://doi.org/10.1061/(ASCE)HE.1943-5584.0000809
Bazi Y, Alajlan N, Melgani F (2012) Improved estimation of water chlorophyll concentration with semisupervised gaussian process regression. IEEE Geosci Remote Sens Lett 50(7):2733–2743. https://doi.org/10.1109/TGRS.2011.2174246
Bera P, Prasher SO, Patel RM, Madani A, Lacroix R, Gaynor JD, Tan CS, Kim SH (2006) Application of MARS in simulating pesticide concentrations in soil. T Asabe 49(1):297–307
Bhattacharjya RK, Datta B (2005) Optimal management of coastal aquifers using linked simulation optimization approach. Water Resour Manage 19(3):295–320. https://doi.org/10.1007/s11269-005-3180-9
Bhattacharjya RK, Datta B (2009) ANN-GA-based model for multiple objective management of coastal aquifers. J Water Resour Plan Manage 135(5):314–322. https://doi.org/10.1061/(ASCE)0733-9496(2009)135:5(314)
Bhattacharjya RK, Datta B, Satish MG (2007) Artificial neural networks approximation of density dependent saltwater intrusion process in coastal aquifers. J Hydrol Eng 12(3):273–282. https://doi.org/10.1061/(ASCE)1084-0699(2007)12:3(273)
Bishop CM (1995) Neural networks for pattern recognition. Oxford University Press Inc, Oxford
Bishop C (2006) Pattern recognition and machine learning. Springer-Verlag, New York
Blanning RW (1975) The construction and implementation of metamodels. Simulation 24:177–184
Breiman L (1996) Bagging predictors. Mach Learn 24(2):123–140. https://doi.org/10.1023/a:1018054314350
Cherkassky V (1998) Fuzzy inference systems: a critical review. In: Kaynak O, Zadeh LA, Türkşen B, Rudas IJ (eds) Computational intelligence: soft computing and fuzzy-neuro integration with applications. Springer, Berlin, pp 177–197. https://doi.org/10.1007/978-3-642-58930-0_10
Christelis V, Mantoglou A (2016) Pumping optimization of coastal aquifers assisted by adaptive metamodelling methods and radial basis functions. Water Resour Manage 30(15):5845–5859. https://doi.org/10.1007/s11269-016-1337-3
Christelis V, Regis RG, Mantoglou A (2017) Surrogate-based pumping optimization of coastal aquifers under limited computational budgets. J Hydroinform. https://doi.org/10.2166/hydro.2017.063
Deb K, Goel T (2001) Controlled elitist non-dominated sorting genetic algorithms for better convergence. In: Zitzler E, Thiele L, Deb K, Coello Coello CA, Corne D (eds) Evolutionary multi-criterion optimization: first international conference, EMO 2001 Zurich, Switzerland, March 7–9, 2001 proceedings. Springer, Berlin, pp 67–81. https://doi.org/10.1007/3-540-44719-9_5
Deb K, Agrawal S, Pratap A, Meyarivan T (2000) A fast elitist non-dominated sorting genetic algorithm for multi-objective optimization: NSGA-II. In: Schoenauer M, Deb K, Rudolph G et al (eds) Parallel problem solving from nature PPSN VI: 6th international conference Paris, France, September 18–20, 2000 proceedings. Springer, Berlin, pp 849–858. https://doi.org/10.1007/3-540-45356-3_83
Dempster AP (1968) A generalization of bayesian inference. J R Stat Soc Ser B (Methodol) 30(2):205–247
Dhar A, Datta B (2009) Saltwater intrusion management of coastal aquifers. I: linked simulation-optimization. J Hydrol Eng 14(12):1263–1272. https://doi.org/10.1061/(ASCE)HE.1943-5584.0000097
Dietterich TG (2000) An experimental comparison of three methods for constructing ensembles of decision trees: bagging, boosting, and randomization. Mach Learn 40(2):139–157. https://doi.org/10.1023/a:1007607513941
El-Baroudy I, Elshorbagy A, Carey SK, Giustolisi O, Savic D (2010) Comparison of three data-driven techniques in modelling the evapotranspiration process. J Hydroinform 12(4):365–379. https://doi.org/10.2166/hydro.2010.029
Forrester AIJ, Sóbester A, Keane AJ (2008) Constructing a surrogate. Engineering design via surrogate modelling. Wiley, Oxford, pp 33–76. https://doi.org/10.1002/9780470770801.ch2
Freund Y, Schapire RE (1996) Experiments with a new boosting algorithm. Paper presented at the proceedings of the thirteenth international conference on international conference on machine learning, Bari, Italy
Friedman JH (1991) Multivariate adaptive regression splines (with Discussion). Ann Stat 19(1):1–67. https://doi.org/10.1214/aos/1176347963
Giustolisi O, Savic DA (2006) A symbolic data-driven technique based on evolutionary polynomial regression. J Hydroinform 8(3):207–222. https://doi.org/10.2166/hydro.2006.020
Goel T, Haftka RT, Shyy W, Queipo NV (2007) Ensemble of surrogates. Struct Multidiscip O 33(3):199–216. https://doi.org/10.1007/s00158-006-0051-9
Green PJ, Silverman BW (1993) Nonparametric regression and generalized linear models: a roughness penalty approach. Taylor & Francis, Abingdon
Hastie T, Tibshirani R, Friedman J (2008) The elements of statistical learning, 2nd edn. Springer, New York
Holman D, Sridharan M, Gowda P, Porter D, Marek T, Howell T, Moorhead J (2014) Gaussian process models for reference ET estimation from alternative meteorological data sources. J Hydrol 517:28–35. https://doi.org/10.1016/j.jhydrol.2014.05.001
Hou Z, Lu W, Xue H, Lin J (2017) A comparative research of different ensemble surrogate models based on set pair analysis for the DNAPL-contaminated aquifer remediation strategy optimization. J Contam Hydrol 203:28–37. https://doi.org/10.1016/j.jconhyd.2017.06.003
Hsieh WW, Tang B (1998) Applying neural network models to prediction and data analysis in meteorology and oceanography. Bull Am Meteorol Soc 79(9):1855–1870. https://doi.org/10.1175/1520-0477(1998)079%3c1855:annmtp%3e2.0.co;2
Hussain MS, Javadi AA, Ahangar-Asr A, Farmani R (2015) A surrogate model for simulation–optimization of aquifer systems subjected to seawater intrusion. J Hydrol 523:542–554. https://doi.org/10.1016/j.jhydrol.2015.01.079
Jang J-SR (1993) ANFIS: adaptive-network-based fuzzy inference system. IEEE Trans Syst Man Cybern 23(3):665–685. https://doi.org/10.1109/21.256541
Jang J-SR, Sun CT, Mizutani E (1997) Neuro-fuzzy and soft computing: a computational approach to learning and machine intelligence. Prentice Hall, Upper Saddle River
Javadi AA, Abd-Elhamid HF, Farmani R (2012) A simulation-optimization model to control seawater intrusion in coastal aquifers using abstraction/recharge wells. Int J Numer Anal Methods Geomech 36(16):1757–1779. https://doi.org/10.1002/nag.1068
Jovanović RŽ, Sretenović AA, Živković BD (2015) Ensemble of various neural networks for prediction of heating energy consumption. Energ Build 94:189–199. https://doi.org/10.1016/j.enbuild.2015.02.052
Karterakis SM, Karatzas GP, Nikolos IK, Papadopoulou MP (2007) Application of linear programming and differential evolutionary optimization methodologies for the solution of coastal subsurface water management problems subject to environmental criteria. J Hydrol 342(3):270–282. https://doi.org/10.1016/j.jhydrol.2007.05.027
Ketabchi H, Ataie-Ashtiani B (2015a) Assessment of a parallel evolutionary optimization approach for efficient management of coastal aquifers. Environ Model Softw 74:21–38. https://doi.org/10.1016/j.envsoft.2015.09.002
Ketabchi H, Ataie-Ashtiani B (2015b) Evolutionary algorithms for the optimal management of coastal groundwater: a comparative study toward future challenges. J Hydrol 520:193–213. https://doi.org/10.1016/j.jhydrol.2014.11.043
Kourakos G, Mantoglou A (2009) Pumping optimization of coastal aquifers based on evolutionary algorithms and surrogate modular neural network models. Adv Water Resour 32(4):507–521. https://doi.org/10.1016/j.advwatres.2009.01.001
Kourakos G, Mantoglou A (2011) Simulation and multi-objective management of coastal aquifers in semi-arid regions. Water Resour Manage 25(4):1063–1074. https://doi.org/10.1007/s11269-010-9677-x
Koza JR (1994) Genetic programming as a means for programming computers by natural selection. Stat Comput 4(2):87–112. https://doi.org/10.1007/bf00175355
Kurtulus B, Razack M (2010) Modeling daily discharge responses of a large karstic aquifer using soft computing methods: artificial neural network and neuro-fuzzy. J Hydrol 381(1–2):101–111. https://doi.org/10.1016/j.jhydrol.2009.11.029
Langevin CD, Thorne DTJ, Dausman AM, Sukop MC, Guo W (2007) SEAWAT Version 4: a computer program for simulation of multi-species solute and heat transport. US Geol Surv Tech Methods Book 6(Chapter A22):39
Lin H-CJ, Rechards DR, Talbot CA, Yeh G-T, Cheng J-R, Cheng H-P, Jones NL (1997) FEMWATER: a three-dimensional finite element computer model for simulating density-dependent flow and transport in variable saturated media. Technical Rep No CHL-97-12 Vicksburg, MS: US Army Engineer Waterways Experiment Station Coastal and Hydraulics Laboratory
Mantoglou A, Papantoniou M (2008) Optimal design of pumping networks in coastal aquifers using sharp interface models. J Hydrol 361(1):52–63. https://doi.org/10.1016/j.jhydrol.2008.07.022
MATLAB (2017) What is sugeno-type fuzzy inference? MATLAB documentation http://aumathworks.com/help/fuzzy/what-is-sugeno-type-fuzzy-inference.html Accessed 12 Dec 2017
McLaughlin JD (2008) Parallel processing of reactive transport models using OpenMP. M.Sc. Thesis, Brigham Young University, Provo, UT, USA
Melin P, Soto J, Castillo O, Soria J (2012) A new approach for time series prediction using ensembles of ANFIS models. Expert Syst Appl 39(3):3494–3506. https://doi.org/10.1016/j.eswa.2011.09.040
Müller J, Piché R (2011) Mixture surrogate models based on Dempster-Shafer theory for global optimization problems. J Glob Optim 51(1):79–104. https://doi.org/10.1007/s10898-010-9620-y
Papadopoulou MP, Nikolos IK, Karatzas GP (2010) Computational benefits using artificial intelligent methodologies for the solution of an environmental design problem: saltwater intrusion. Water Sci Technol 62(7):1479–1490
Parasuraman K, Elshorbagy A (2008) Toward improving the reliability of hydrologic prediction: model structure uncertainty and its quantification using ensemble-based genetic programming framework. Water Resour Res 44(12):W12406. https://doi.org/10.1029/2007WR006451
Park CH, Aral MM (2004) Multi-objective optimization of pumping rates and well placement in coastal aquifers. J Hydrol 290(1):80–99. https://doi.org/10.1016/j.jhydrol.2003.11.025
Pebesma EJ, Heuvelink GBM (1999) Latin hypercube sampling of gaussian random fields. Technometrics 41(4):303–312. https://doi.org/10.2307/1271347
Pham DT, Castellani M (2014) Benchmarking and comparison of nature-inspired population-based continuous optimisation algorithms. Soft Comput 18(5):871–903. https://doi.org/10.1007/s00500-013-1104-9
Pillay N (2004) An investigation into the use of genetic programming for the induction of noviceprocedural programming solution algorithms in intelligent programming tutors. Dissertation, University of KwaZulu-Natal, Durban
Piret C (2007) Analytical and numerical advances in radial basis functions. PhD dissertation, Univ of Colorado, Boulder, CO
Qahman K, Larabi A, Ouazar D, Naji A, Cheng AHD (2009) Optimal extraction of groundwater in Gaza coastal aquifer. J Water Resour Protect 4:249–259
Rajabi MM, Ketabchi H (2017) Uncertainty-based simulation-optimization using Gaussian process emulation: Application to coastal groundwater management. J Hydrol 555(Supplement C):518–534. https://doi.org/10.1016/j.jhydrol.2017.10.041
Rasmussen CE, Williams CKI (2005) Gaussian processes for machine learning. The MIT, Cambridge
Razavi S, Tolson BA, Burn DH (2012) Review of surrogate modeling in water resources. Water Resour Res 48(7):W07401. https://doi.org/10.1029/2011WR011527
Rezania M, Javadi AA, Giustolisi O (2008) An evolutionary-based data mining technique for assessment of civil engineering systems. Eng Comput 25(6):500–517. https://doi.org/10.1108/02644400810891526
Rogers LL, Dowla FU, Johnson VM (1995) Optimal field-scale groundwater remediation using neural networks and the genetic algorithm. Environ Sci Technol 29(5):1145–1155
Roy DK, Datta B (2017a) Fuzzy c-mean clustering based inference system for saltwater intrusion processes prediction in coastal aquifers. Water Resour Manage 31(1):355–376. https://doi.org/10.1007/s11269-016-1531-3
Roy DK, Datta B (2017b) Genetic algorithm tuned fuzzy inference system to evolve optimal groundwater extraction strategies to control saltwater intrusion in multi-layered coastal aquifers under parameter uncertainty. Model Earth Syst Environ. https://doi.org/10.1007/s40808-017-0398-5
Roy DK, Datta B (2017c) Multivariate adaptive regression spline ensembles for management of multilayered coastal aquifers. J Hydrol Eng 22(9):04017031
Roy DK, Datta B (2017d) Optimal management of groundwater extraction to control saltwater intrusion in multi-layered coastal aquifers using ensembles of adaptive neuro-fuzzy inference system. In: World Environmental and Water Resources Congress 2017, American society of civil engineers, May 21–25, 2017 Sacramento, California, USA, pp 139–150
Roy DK, Datta B (2018a) Comparative efficiency of different artificial intelligence based models for predicting density dependent saltwater intrusion processes in coastal aquifers and saltwater intrusion management utilizing the best performing model. Desalin Water Treat 105:160–180
Roy DK, Datta B (2018b) A surrogate based multi-objective management model to control saltwater intrusion in multi-layered coastal aquifer systems. Civ Eng Environ Syst. https://doi.org/10.1080/10286608.2018.1431777
Schapire RE (1990) The strength of weak learnability. Mach Learn 5(2):197–227. https://doi.org/10.1007/bf00116037
Shafer G (1976) A mathematical theory of evidence. Princeton University Press, Princeton
Shiri J, Kişi Ö (2011) Comparison of genetic programming with neuro-fuzzy systems for predicting short-term water table depth fluctuations. Comput Geosci 37(10):1692–1701. https://doi.org/10.1016/j.cageo.2010.11.010
Shu C, Ouarda TBMJ (2007) Flood frequency analysis at ungauged sites using artificial neural networks in canonical correlation analysis physiographic space. Water Resour Res 43(7):W07438. https://doi.org/10.1029/2006WR005142
Sóbester A, Forrester AIJ, Toal DJJ, Tresidder E, Tucker S (2014) Engineering design applications of surrogate-assisted optimization techniques. Optim Eng 15(1):243–265. https://doi.org/10.1007/s11081-012-9199-x
SPM (2016) SPM® (version 8.2), Salford predictive modeller. Salford Systems, San Diego
Sreekanth J, Datta B (2010) Multi-objective management of saltwater intrusion in coastal aquifers using genetic programming and modular neural network based surrogate models. J Hydrol 393(3–4):245–256. https://doi.org/10.1016/j.jhydrol.2010.08.023
Sreekanth J, Datta B (2011a) Comparative evaluation of genetic programming and neural network as potential surrogate models for coastal aquifer management. Water Resour Manage 25(13):3201–3218. https://doi.org/10.1007/s11269-011-9852-8
Sreekanth J, Datta B (2011b) Coupled simulation-optimization model for coastal aquifer management using genetic programming-based ensemble surrogate models and multiple-realization optimization. Water Resour Res 47(4):W04516. https://doi.org/10.1029/2010WR009683
Sreekanth J, Datta B (2011c) Optimal combined operation of production and barrier wells for the control of saltwater intrusion in coastal groundwater well fields. Desalin Water Treat 32(1–3):72–78. https://doi.org/10.5004/dwt.2011.2680
Sreekanth J, Datta B (2014a) Design of an optimal compliance monitoring network and feedback information for adaptive management of saltwater intrusion in coastal aquifers. J Water Resour Plan Manage 140(10):04014026. https://doi.org/10.1061/(ASCE)WR.1943-5452.0000406
Sreekanth J, Datta B (2014b) Stochastic and robust multi-objective optimal management of pumping from coastal aquifers under parameter uncertainty. Water Resour Manage 28(7):2005–2019. https://doi.org/10.1007/s11269-014-0591-5
Sreekanth J, Datta B (2015) Review: simulation-optimization models for the management and monitoring of coastal aquifers. Hydrogeol J 23(6):1155–1166. https://doi.org/10.1007/s10040-015-1272-z
Sugeno M (1985) Industrial applications of fuzzy control. Elsevier Science Inc., New York City
Sugeno M, Yasukawa T (1993) A fuzzy-logic-based approach to qualitative modeling. IEEE Trans Fuzzy Syst 1(1):7. https://doi.org/10.1109/tfuzz.1993.390281
Sun AY, Wang D, Xu X (2014) Monthly streamflow forecasting using Gaussian process regression. J Hydrol 511:72–81. https://doi.org/10.1016/j.jhydrol.2014.01.023
Takagi T, Sugeno M (1985) Fuzzy identification of systems and its applications to modeling and control. IEEE Trans Syst Man Cybern SMC 15(1):116–132. https://doi.org/10.1109/tsmc.1985.6313399
Talbi E-G, Mostaghim S, Okabe T, Ishibuchi H, Rudolph G, Coello Coello CA (2008) Parallel approaches for multiobjective optimization. In: Branke J, Deb K, Miettinen K, Słowiński R (eds) Multiobjective optimization: interactive and evolutionary approaches. Springer, Berlin, pp 349–372. https://doi.org/10.1007/978-3-540-88908-3_13
Tang G, D’Azevedo EF, Zhang F, Parker JC, Watson DB, Jardine PM (2010) Application of a hybrid MPI/OpenMP approach for parallel groundwater model calibration using multi-core computers. Comput Geosci 36(11):1451–1460. https://doi.org/10.1016/j.cageo.2010.04.013
Therrien R, McLaren R, Sudicky E, Panday S (2010) HydroGeoSphere—a three-dimensional numerical model describing fully-integrated subsurface and surface flow and solute transport. University of Waterloo and Université Laval, Canada
Viana FAC, Haftka RT, Steffen V (2009) Multiple surrogates: how cross-validation errors can help us to obtain the best predictor. Struct Multidiscip O 39(4):439–457. https://doi.org/10.1007/s00158-008-0338-0
Wang W-C, Chau K-W, Cheng C-T, Qiu L (2009a) A comparison of performance of several artificial intelligence methods for forecasting monthly discharge time series. J Hydrol 374(3):294–306. https://doi.org/10.1016/j.jhydrol.2009.06.019
Wang WS, Jin JL, Ding J, Li YQ (2009b) A new approach to water resources system assessment—set pair analysis method. Sci China Ser E Technol Sci 52(10):3017–3023
Wolpert DH, Macready WG (1997) No free lunch theorems for optimization. IEEE Trans Evol Comput 1(1):67–82. https://doi.org/10.1109/4235.585893
Yan S, Minsker B (2006) Optimal groundwater remediation design using an adaptive neural network genetic algorithm. Water Resour Res 42(5):W05407. https://doi.org/10.1029/2005WR004303
Yang I-T, Hsieh Y-M, Kung L-O (2012) Parallel computing platform for multiobjective simulation optimization of bridge maintenance planning. J Constr Eng M 138(2):215–226. https://doi.org/10.1061/(ASCE)CO.1943-7862.0000421
Zerpa LE, Queipo NV, Pintos S, Salager J-L (2005) An optimization methodology of alkaline–surfactant–polymer flooding processes using field scale numerical simulation and multiple surrogates. J Petrol Sci Eng 47(3):197–208. https://doi.org/10.1016/j.petrol.2005.03.002
Zhang W, Goh ATC (2016) Multivariate adaptive regression splines and neural network models for prediction of pile drivability. Geosci Front 7(1):45–52. https://doi.org/10.1016/j.gsf.2014.10.003
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
Rights and permissions
About this article
Cite this article
Roy, D.K., Datta, B. A Review of Surrogate Models and Their Ensembles to Develop Saltwater Intrusion Management Strategies in Coastal Aquifers. Earth Syst Environ 2, 193–211 (2018). https://doi.org/10.1007/s41748-018-0069-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s41748-018-0069-3