Evaluating machine learning and statistical prediction techniques for landslide susceptibility modeling
- Author(s)
- J. N. Goetz, A. Brenning, H. Petschko, P. Leopold
- Abstract
Statistical and, more recently, machine learning prediction methods have been gaining popularity in the field of landslide susceptibility modeling. In particular, these data-driven approaches show promise for mapping landslide-prone areas across large regions, which may lack sufficient geotechnical data for physically based methods. Currently, there is no best method for empirical susceptibility modeling. Therefore, this study presents a comparison of traditional statistical and novel machine learning models applied to regional-scale landslide susceptibility modeling. The methods were evaluated by spatial k-fold cross-validation estimation of predictive performance, by assessment of variable importance to gain insight into model behavior, and by the appearance of the prediction (i.e. susceptibility) map. The modeling techniques applied were logistic regression (GLM), generalized additive models (GAM), weights of evidence (WOE), the support vector machine (SVM), random forest classification (RF), and bootstrap-aggregated classification trees (bundling) with penalized discriminant analysis (BPLDA). These modeling methods were tested for three areas in the province of Lower Austria, Austria. The areas are characterized by different geological and morphological settings.
Random forest and bundling classification techniques had the best overall predictive performances. However, the performances of the modeling techniques were, for the most part, not significantly different from each other; depending on the area of interest, the differences in overall median estimated area under the receiver operating characteristic curve (AUROC) ranged from 2.9 to 8.9 percentage points. The differences in overall median estimated true positive rate (TPR), measured at a 10% false positive rate (FPR), ranged from 11 to 15 percentage points. The relative importance of each predictor generally differed between the modeling methods. However, slope angle, surface roughness and plan curvature were consistently highly ranked variables. The prediction methods that create splits in the predictors (RF, BPLDA and WOE) resulted in heterogeneous prediction maps full of spatial artifacts. In contrast, the GAM, GLM and SVM produced smooth prediction surfaces. Overall, it is suggested that the framework of this model evaluation approach can be applied to assist in the selection of a suitable landslide susceptibility modeling technique.
- Organisation(s)
- Institut für Geographie und Regionalforschung
- External organisation(s)
- Austrian Institute of Technology, University of Waterloo (UW), Friedrich-Schiller-Universität Jena
- Journal
- Computers & Geosciences
- Volume
- 81
- Pages
- 1-11
- Number of pages
- 11
- ISSN
- 0098-3004
- DOI
- https://doi.org/10.1016/j.cageo.2015.04.007
- Publication date
- 08-2015
- Peer-reviewed
- Yes
- ÖFOS 2012
- 105404 Geomorphology
- Keywords
- ASJC Scopus subject areas
- Computers in Earth Sciences, Information systems
- Link to portal
- https://ucrisportal.univie.ac.at/de/publications/evaluating-machine-learning-and-statistical-prediction-techniques-for-landslide-susceptibility-modeling(f5d3f8a1-8ef6-4462-aeb0-76bd1539cdaa).html
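
The evaluation measures named in the abstract (AUROC, TPR at a 10% FPR, and spatial k-fold partitioning) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the function names `auroc`, `tpr_at_fpr` and `spatial_folds` are hypothetical, and using a plain k-means clustering of point coordinates to build spatially compact folds is an assumption made only for illustration.

```python
import numpy as np


def auroc(labels, scores):
    """AUROC via the rank-sum (Mann-Whitney U) formulation; tied scores share an average rank."""
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    n_pos = labels.sum()
    n_neg = labels.size - n_pos
    # 1-based ranks of the scores in ascending order.
    order = np.argsort(scores, kind="mergesort")
    ranks = np.empty(scores.size, dtype=float)
    ranks[order] = np.arange(1, scores.size + 1)
    # Replace the ranks of tied scores by their mean rank.
    for value in np.unique(scores):
        tied = scores == value
        ranks[tied] = ranks[tied].mean()
    u = ranks[labels].sum() - n_pos * (n_pos + 1) / 2.0
    return u / (n_pos * n_neg)


def tpr_at_fpr(labels, scores, max_fpr=0.10):
    """Highest TPR reachable by a score threshold whose FPR does not exceed max_fpr."""
    labels = np.asarray(labels, dtype=bool)
    scores = np.asarray(scores, dtype=float)
    best = 0.0
    for threshold in np.unique(scores):
        predicted = scores >= threshold
        fpr = (predicted & ~labels).sum() / (~labels).sum()
        if fpr <= max_fpr:
            tpr = (predicted & labels).sum() / labels.sum()
            best = max(best, tpr)
    return best


def spatial_folds(coords, k, n_iter=20, seed=0):
    """Assign points to k spatially compact folds (assumption: plain k-means on coordinates)."""
    coords = np.asarray(coords, dtype=float)
    rng = np.random.default_rng(seed)
    # Start from k distinct observations chosen at random.
    centers = coords[rng.choice(len(coords), size=k, replace=False)]
    for _ in range(n_iter):
        dist = np.linalg.norm(coords[:, None, :] - centers[None, :, :], axis=2)
        fold = dist.argmin(axis=1)
        for j in range(k):
            if np.any(fold == j):
                centers[j] = coords[fold == j].mean(axis=0)
    return fold
```

In a spatial cross-validation loop, each fold in turn would serve as the test set while a model is fitted on the remaining folds, and `auroc` and `tpr_at_fpr` would be computed on the held-out predictions; the median over folds and repetitions then gives a summary comparable in kind to the figures reported in the abstract.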