A novel hybrid intelligent model of support vector machines and the MultiBoost ensemble for landslide susceptibility modeling

  • Binh Thai Pham
  • Abolfazl Jaafari
  • Indra Prakash
  • Dieu Tien Bui
Original Paper
  • 42 Downloads

Abstract

The main aim of this study is to propose a novel hybrid intelligent model named MBSVM which is an integration of the MultiBoost ensemble and a support vector machine (SVM) for modeling of susceptibility of landslides in the Uttarakhand State, Northern India. Firstly, a geospatial database for the study area was prepared, which includes 391 historical landslides and 16 landslide-affecting factors. Then, the sensitivity of different combinations of these factors for modeling was validated using the forward elimination technique. The MBSVM landslide model was built using the datasets generated from the best selected factors and validated utilizing the area under the receiver operating characteristic (ROC) curve (AUC), statistical indexes, and the Wilcoxon signed-rank test. Results show that this novel hybrid model has good performance both in terms of goodness of fit with the training dataset (AUC = 0.972) and the capability to predict landslides with the testing dataset (AUC = 0.966). The efficiency of the proposed model was then validated by comparison with logistic regression (LR), a single SVM, and another hybrid model of the AdaBoost ensemble and an SVM (ABSVM). Comparison results show that the MBSVM outperforms the LR, single SVM, and hybrid ABSVM models. Thus, the proposed model is a promising and good alternative tool for landslide hazard assessment in landslide-prone areas.

Keywords

Landslide susceptibility mapping Machine learning Ensembles MultiBoost SVM GIS India 

Notes

Acknowledgements

We thank to the Director of BISAG, Government of Gujarat, Gandhinagar, Gujarat, India, for helping and supporting this study.

References

  1. Abdulwahid WM, Pradhan B (2016) Landslide vulnerability and risk assessment for multi-hazard scenarios using airborne laser scanning data (LiDAR) Landslides:1–20Google Scholar
  2. Althuwaynee OF, Pradhan B, Park H-J, Lee JH (2014) A novel ensemble bivariate statistical evidential belief function with knowledge-based analytical hierarchy process and multivariate statistical logistic regression for landslide susceptibility mapping. Catena 114:21–36CrossRefGoogle Scholar
  3. Antronico L, Borrelli L, Coscarelli R, Gullà G (2015) Time evolution of landslide damages to buildings: the case study of Lungro (Calabria, southern Italy). Bull Eng Geol Environ 74:47–59CrossRefGoogle Scholar
  4. Antronico L, Borrelli L, Coscarelli R, Pasqua A, Petrucci O, Gullà G (2013) Slope movements induced by rainfalls damaging an urban area: the Catanzaro case study (Calabria, southern Italy). Landslides 10:801–814CrossRefGoogle Scholar
  5. Araújo MB, New M (2007) Ensemble forecasting of species distributions trends in ecology & evolution 22:42-47Google Scholar
  6. Breiman L (1996) Bagging predictors. Mach Learn 24:123–140Google Scholar
  7. Breiman L (2001) Random forests. Mach Learn 45:5–32CrossRefGoogle Scholar
  8. Budimir M, Atkinson P, Lewis H (2015) A systematic review of landslide probability mapping using logistic regression. Landslides 12:419–436CrossRefGoogle Scholar
  9. Chen H, Lee C (2003) A dynamic model for rainfall-induced landslides on natural slopes. Geomorphology 51:269–288CrossRefGoogle Scholar
  10. Chen W, Pourghasemi HR, Kornejady A, Zhang N (2017a) Landslide spatial modeling: introducing new ensembles of ANN, MaxEnt, and SVM machine learning techniques. Geoderma 305:314–327CrossRefGoogle Scholar
  11. Chen W, Pourghasemi HR, Naghibi SA (2017b) A comparative study of landslide susceptibility maps produced using support vector machine with different kernel functions and entropy data mining models in China. Bull Eng Geol Environ:1–18Google Scholar
  12. Chung C-JF, Fabbri AG (1999) Probabilistic prediction models for landslide hazard mapping. Photogrammetric Eng Remote Sens 65:1389–1399Google Scholar
  13. Chung C-JF, Fabbri AG (2003) Validation of spatial prediction models for landslide hazard mapping. Natural Hazards 30:451–472CrossRefGoogle Scholar
  14. Dash M, Liu H (1997) Feature selection for classification. Intell Data Anal 1:131–156CrossRefGoogle Scholar
  15. Dickson ME, Perry GL (2016) Identifying the controls on coastal cliff landslides using machine-learning approaches. Environ Modell Soft 76:117–127CrossRefGoogle Scholar
  16. Dietterich TG (1998) Approximate statistical tests for comparing supervised classification learning algorithms. Neural Comput 10:1895–1923CrossRefGoogle Scholar
  17. Ding Q, Chen W, Hong H (2017) Application of frequency ratio, weights of evidence and evidential belief function models in landslide susceptibility mapping. Geocarto Int 32:619–639Google Scholar
  18. Erdal HI, Karakurt O (2013) Advancing monthly streamflow prediction accuracy of CART models using ensemble learning paradigms. J Hydrol 477:119–128CrossRefGoogle Scholar
  19. Fell R, Corominas J, Bonnard C, Cascini L, Leroi E, Savage W, on behalf of the JTC-1 Joint Technical Committee on Landslides and Engineered Slopes (2008) Guidelines for landslide susceptibility, hazard and risk zoning for land use planning. Eng Geol 102:85–98CrossRefGoogle Scholar
  20. Freund Y, Schapire RE (1995) A desicion-theoretic generalization of on-line learning and an application to boosting. In: European conference on computational learning theory. Springer, pp 23–37Google Scholar
  21. Freund Y, Schapire RE (1997) A decision-theoretic generalization of on-line learning and an application to boosting. J Comput Syst Sci 55:119–139CrossRefGoogle Scholar
  22. Frye C (2007) About the Geometrical Interval Classification Method. http://blogs.esri.com/esri/arcgis
  23. Glade T, Anderson MG, Crozier MJ (2006) Landslide hazard and risk. John Wiley & SonsGoogle Scholar
  24. Goh AT, Zhang Y, Zhang R, Zhang W, Xiao Y (2017) Evaluating stability of underground entry-type excavations using multivariate adaptive regression splines and logistic regression. Tunn Underground Space Technol 70:148–154CrossRefGoogle Scholar
  25. Gorsevski PV, Brown MK, Panter K, Onasch CM, Simic A, Snyder J (2016) Landslide detection and susceptibility mapping using LiDAR and an artificial neural network approach: a case study in the Cuyahoga Valley National Park. Ohio Landslides 13:467–484CrossRefGoogle Scholar
  26. Gullà G, Peduto D, Borrelli L, Antronico L, Fornaro G (2017) Geometric and kinematic characterization of landslides affecting urban areas: the Lungro case study (Calabria, southern Italy). Landslides 14:171–188CrossRefGoogle Scholar
  27. Guzzetti F, Reichenbach P, Ardizzone F, Cardinali M, Galli M (2006) Estimating the quality of landslide susceptibility models. Geomorphology 81:166–184CrossRefGoogle Scholar
  28. Hájek P, Olej V (2015) Word Categorization of Corporate Annual Reports for Bankruptcy Prediction by Machine Learning Methods. In: International Conference on Text, Speech, and Dialogue. Springer, pp 122–130Google Scholar
  29. Hájek P, Olej V, Myšková R (2015) Predicting financial distress of banks using random subspace ensembles of support vector machines. In: Artificial Intelligence Perspectives and Applications. Springer, pp 131–140Google Scholar
  30. Hattermann F et al (2017) Cross-scale intercomparison of climate change impacts simulated by regional and global hydrological models in eleven large river basins. Clim Change 141:561–576CrossRefGoogle Scholar
  31. Hoang N-D, Tien Bui D (2018) Predicting earthquake-induced soil liquefaction based on a hybridization of kernel Fisher discriminant analysis and a least squares support vector machine: a multi-dataset study. Bull Eng Geol Environ 77:191–204CrossRefGoogle Scholar
  32. Hoehler FK (2000) Bias and prevalence effects on kappa viewed in terms of sensitivity and specificity. J Clin Epidemiol 53:499–503CrossRefGoogle Scholar
  33. Hong H et al (2018) Landslide susceptibility mapping using J48 decision tree with AdaBoost, Bagging and Rotation Forest ensembles in the Guangchang area (China). CATENA 163:399–413CrossRefGoogle Scholar
  34. Hong H et al (2017) A novel hybrid integration model using support vector machines and random subspace for weather-triggered landslide susceptibility assessment in the Wuning area (China). Environ Earth Sci 76:652CrossRefGoogle Scholar
  35. Hosmer DW, Lemeshow S (2000) Interpretation of the fitted logistic regression model Applied Logistic Regression, Second Edition:47–90Google Scholar
  36. Huang J, Wu P, Zhao X (2013) Effects of rainfall intensity, underlying surface and slope gradient on soil infiltration under simulated rainfall experiments. Catena 104:93–102CrossRefGoogle Scholar
  37. Hyndman RJ, Koehler AB (2006) Another look at measures of forecast accuracy. Int J Forecast 22:679–688CrossRefGoogle Scholar
  38. Ilia I, Tsangaratos P (2016) Applying weight of evidence method and sensitivity analysis to produce a landslide susceptibility map. Landslides 13:379–397CrossRefGoogle Scholar
  39. Jaafari A, Najafi A, Pourghasemi H, Rezaeian J, Sattarian A (2014) GIS-based frequency ratio and index of entropy models for landslide susceptibility assessment in the Caspian forest, northern Iran. Int J Environ Sci Technol 11:909–926CrossRefGoogle Scholar
  40. Jaafari A, Najafi A, Rezaeian J, Sattarian A, Ghajar I (2015) Planning road networks in landslide-prone areas: a case study from the northern forests of Iran. Land Use Policy 47:198–208CrossRefGoogle Scholar
  41. Jaafari A, Rezaeian J, Omrani MSO (2017) Spatial prediction of slope failures in support of forestry operations safety. Croatian J Forest Eng 38:107–118Google Scholar
  42. Jelinek H, Abawajy J, Kelarev A, Chowdhury M, Stranieri A (2014) Decision trees and multi-level ensemble classifiers for neurological diagnostics. Austral J Med Sci 1:1–12Google Scholar
  43. Kelarev A, Dazeley R, Stranieri A, Yearwood J, Jelinek H (2012) Detection of CAN by ensemble classifiers based on ripple down rules. In: Pacific Rim Knowledge Acquisition Workshop. Springer, pp 147–159Google Scholar
  44. Khosravi K et al (2018) A comparative assessment of decision trees algorithms for flash flood susceptibility modeling at Haraz watershed, northern Iran. Sci Total Environ 627:744–755CrossRefGoogle Scholar
  45. Komori O, Eguchi S, Ikeda S, Okamura H, Ichinokawa M, Nakayama S (2016) An asymmetric logistic regression model for ecological data. Methods Ecol Evol 7:249–260CrossRefGoogle Scholar
  46. Kotti M, Benetos E, Kotropoulos C, Pitas I (2007) A neural network approach to audio-assisted movie dialogue detection. Neurocomputing 71:157–166CrossRefGoogle Scholar
  47. Kumar* A, Mittal H, Kumar R, Ahluwalia RS (2017) Empirical Attenuation relationship for Peak Ground Horizontal Acceleration for North-East Himalaya. Vietnam J Earth Sci 39(1):47-57Google Scholar
  48. Landis JR, Koch GG (1977) The measurement of observer agreement for categorical data biometrics:159–174Google Scholar
  49. Lau NN (2018) Determination of ground displacement of 25 April 2015 Nepal earthquake by GNSS precise point positioning Vietnam J Earth Sci 40:17–25Google Scholar
  50. Margottini C, Canuti P, Sassa K (2013) Landslide science and practice vol 1. SpringerGoogle Scholar
  51. Marjanović M, Kovačević M, Bajat B, Voženílek V (2011) Landslide susceptibility assessment using SVM machine learning algorithm. Eng Geol 123:225–234CrossRefGoogle Scholar
  52. Meinhardt M, Fink M, Tünschel H (2015) Landslide susceptibility analysis in central Vietnam based on an incomplete landslide inventory: Comparison of a new method to calculate weighting factors by means of bivariate statistics) Geomorphology 234:80–97CrossRefGoogle Scholar
  53. Moosavi V, Niazi Y (2016) Development of hybrid wavelet packet-statistical models (WP-SM) for landslide susceptibility mapping. Landslides 13:97–114CrossRefGoogle Scholar
  54. Neuhäuser B, Terhorst B (2007) Landslide susceptibility assessment using “weights-of-evidence” applied to a study area at the Jurassic escarpment (SW-Germany). Geomorphology 86:12–24CrossRefGoogle Scholar
  55. Parhami B (2006) Introduction to parallel processing: algorithms and architectures. Springer Science & Business MediaGoogle Scholar
  56. Pham BT, Bui DT, Prakash I (2017a) Landslide Susceptibility Assessment Using Bagging Ensemble Based Alternating Decision Trees, Logistic Regression and J48 Decision Trees Methods: A Comparative Study Geotechnical and Geological Engineering:1–15Google Scholar
  57. Pham BT, Khosravi K, Prakash I (2017b) Application and comparison of decision tree-based machine learning methods in landside susceptibility assessment at Pauri Garhwal area, Uttarakhand, India Environmental Processes:1–20Google Scholar
  58. Pham BT, Nguyen V-T, Ngo V-L, Trinh PT, Ngo HTT, Bui DT (2017c) A Novel Hybrid Model of Rotation Forest Based Functional Trees for Landslide Susceptibility Mapping: A Case Study at Kon Tum Province, Vietnam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 186–201Google Scholar
  59. Pham BT, Pradhan B, Tien Bui D, Prakash I, Dholakia MB (2016a) A comparative study of different machine learning methods for landslide susceptibility assessment: a case study of Uttarakhand area (India). Environ Model Software 84:240–250.  https://doi.org/10.1016/j.envsoft.2016.07.005 CrossRefGoogle Scholar
  60. Pham BT, Prakash I (2017a) Evaluation and Comparison of LogitBoost Ensemble, Fisher’s Linear Discriminant Analysis, Logistic Regression, and Support Vector Machines Methods for Landslide Susceptibility Mapping Geocarto International:1–32Google Scholar
  61. Pham BT, Prakash I (2017b) A Novel Hybrid Intelligent Approach of Random Subspace Ensemble and Reduced Error Pruning Trees for Landslide Susceptibility Modeling: A Case Study at Mu Cang Chai District, Yen Bai Province, Viet Nam. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 255–269Google Scholar
  62. Pham BT, Prakash I (2017c) A novel hybrid model of Bagging-based Naïve Bayes Trees for landslide susceptibility assessment. Bulletin of Engineering Geology and the Environment:1–15Google Scholar
  63. Pham BT, Prakash I (2017d) Spatial Prediction of Rainfall Induced Shallow Landslides Using Adaptive-Network-Based Fuzzy Inference System and Particle Swarm Optimization: A Case Study at the Uttarakhand Area, India. In: Tien Bui D. NDA, Bui HB., Hoang ND (ed) International Conference on Geo-Spatial Technologies and Earth Resources. Springer, pp 224–238Google Scholar
  64. Pham BT, Prakash I, Bui DT (2017d) Spatial prediction of landslides using hybrid machine learning approach based on Random Subspace and Classification and Regression Trees Geomorphology:1–15Google Scholar
  65. Pham BT, Shirzadi A, Bui DT, Prakash I, Dholakia M (2017e) A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation Forest for landslide susceptibility modeling: a case study in the Himalayan area, India International Journal of Sediment ResearchGoogle Scholar
  66. Pham BT, Son LH, Hoang T-A, Nguyen D-M, Tien Bui D (2018a) Prediction of shear strength of soft soil using machine learning methods. Catena 166:181–191Google Scholar
  67. Pham BT, Tien Bui D, Pham HV, Le HQ, Prakash I, Dholakia MB (2016b) Landslide hazard assessment using random SubSpace fuzzy rules based classifier ensemble and probability analysis of rainfall data: a case study at mu Cang Chai District, Yen Bai Province (Viet Nam) journal of the Indian Society of Remote Sensing:1-11  https://doi.org/10.1007/s12524-016-0620-3
  68. Pham BT, Tien Bui D, Prakash I (2018b) Bagging based Support Vector Machines for spatial prediction of landslides. Enviro Earth Sci 77:146Google Scholar
  69. Pham BT, Tien Bui D, Prakash I, Dholakia MB (2017f) Hybrid integration of multilayer perceptron neural networks and machine learning ensembles for landslide susceptibility assessment at Himalayan area (India) using GIS. CATENA 149(Part 1):52–63.  https://doi.org/10.1016/j.catena.2016.09.007 CrossRefGoogle Scholar
  70. Pingale SM, Chandra H, Sharma H, Mishra SS (2012) Morphometric analysis of Maun watershed in Tehri-Garhwal district of Uttarakhand using GIS. Int J Geomat Geosci 3:373–387CrossRefGoogle Scholar
  71. Pourghasemi H, Moradi H, Aghda SF (2013) Landslide susceptibility mapping by binary logistic regression, analytical hierarchy process, and statistical index models and assessment of their performances. Nat Hazards 69:749–779CrossRefGoogle Scholar
  72. Pradhan B (2013) A comparative study on the predictive ability of the decision tree, support vector machine and neuro-fuzzy models in landslide susceptibility mapping using GIS. Comput Geosci 51:350–365CrossRefGoogle Scholar
  73. Ray RL, Jacobs JM (2007) Relationships among remotely sensed soil moisture, precipitation and landslide events. Nat Hazards 43:211–222CrossRefGoogle Scholar
  74. Rodriguez JJ, Kuncheva LI, Alonso CJ (2006) Rotation forest: a new classifier ensemble method. IEEE Trans Patt Anal Mach Intell 28:1619–1630CrossRefGoogle Scholar
  75. Samui P (2008) Slope stability analysis: a support vector machine approach. Environ Geol 56:255CrossRefGoogle Scholar
  76. Shibata R (1981) An optimal selection of regression variables Biometrika:45–54Google Scholar
  77. Shirzadi A et al (2017) Shallow landslide susceptibility assessment using a novel hybrid intelligence approach. Environ Earth Sci 76:60CrossRefGoogle Scholar
  78. Tien Bui D, Ho T-C, Pradhan B, Pham B-T, Nhu V-H, Revhaug I (2016a) GIS-based modeling of rainfall-induced landslides using data mining-based functional trees classifier with AdaBoost, Bagging, and MultiBoost ensemble frameworks. Environ Earth Sci 75:1–22.  https://doi.org/10.1007/s12665-016-5919-4 CrossRefGoogle Scholar
  79. Tien Bui D, Hoang N-D (2017) A Bayesian framework based on a Gaussian mixture model and radial-basis-function Fisher discriminant analysis (BayGmmKda V1. 1) for spatial prediction of floods. Geosci Model Dev 10:3391CrossRefGoogle Scholar
  80. Tien Bui D, Nguyen QP, Hoang N-D, Klempe H (2016b) A novel fuzzy k-nearest neighbor inference model with differential evolution for spatial prediction of rainfall-induced shallow landslides in a tropical hilly area using gis Landslides:1–17Google Scholar
  81. Tien Bui D, Pham BT, Nguyen QP, Hoang N-D (2016c) Spatial prediction of rainfall-induced shallow landslides using hybrid integration approach of Least-Squares Support Vector Machines and differential evolution optimization: a case study in Central Vietnam. Int J Digit Earth 9:1–21.  https://doi.org/10.1080/17538947.2016.1169561 CrossRefGoogle Scholar
  82. Tien Bui D, Pradhan B, Lofman O, Revhaug I, Dick OB (2012) Landslide susceptibility mapping at Hoa Binh province (Vietnam) using an adaptive neuro-fuzzy inference system and GIS. Comput Geosci 45:199–211CrossRefGoogle Scholar
  83. Tien Bui D, Tuan TA, Klempe H, Pradhan B, Revhaug I (2016d) Spatial prediction models for shallow landslide hazards: a comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 13:361–378.  https://doi.org/10.1007/s10346-015-0557-6 CrossRefGoogle Scholar
  84. Tsangaratos P, Ilia I (2016a) Comparison of a logistic regression and Naïve Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. CATENA 145:164–179CrossRefGoogle Scholar
  85. Tsangaratos P, Ilia I (2016b) Landslide susceptibility mapping using a modified decision tree classifier in the Xanthi perfection, Greece. Landslides 13:305–320CrossRefGoogle Scholar
  86. Vapnik V (1995) The nature of statistical learning theory. Ed. by Springer. Springer-Verlag, VerlagCrossRefGoogle Scholar
  87. Varnes DJ (1984) Landslide hazard zonation: a review of principles and practice, vol 3. UNESCO, ParisGoogle Scholar
  88. Webb GI (2000) Multiboosting: a technique for combining boosting and wagging. Mach Learn 40:159–196CrossRefGoogle Scholar
  89. Zhang C, Ma Y (2012) Ensemble machine learning. SpringerGoogle Scholar
  90. Zhang W, Goh AT (2016a) Evaluating seismic liquefaction potential using multivariate adaptive regression splines and logistic regression. Geomech Eng 10:269–280CrossRefGoogle Scholar
  91. Zhang W, Goh AT (2016b) Multivariate adaptive regression splines and neural network models for prediction of pile drivability. Geosci Front 7:45–52CrossRefGoogle Scholar
  92. Zhang W, Goh ATC (2013) Multivariate adaptive regression splines for analysis of geotechnical engineering systems. Comput Geotech 48:82–95CrossRefGoogle Scholar

Copyright information

© Springer-Verlag GmbH Germany, part of Springer Nature 2018

Authors and Affiliations

  • Binh Thai Pham
    • 1
    • 2
  • Abolfazl Jaafari
    • 3
  • Indra Prakash
    • 4
  • Dieu Tien Bui
    • 5
  1. 1.Geographic Information Science Research GroupTon Duc Thang UniversityHo Chi Minh CityVietnam
  2. 2.Faculty of Environment and Labour SafetyTon Duc Thang UniversityHo Chi Minh CityVietnam
  3. 3.Young Researchers and Elite Club, Karaj BranchIslamic Azad UniversityKarajIran
  4. 4.Department of Science & Technology, Bhaskarcharya Institute for Space Applications and Geo-Informatics (BISAG)Government of GujaratGandhinagarIndia
  5. 5.Geographic Information System Group, Department of Business Administration and Computer ScienceUniversity College of Southeast NorwayBø i TelemarkNorway

Personalised recommendations