Next Article in Journal
Promising Generative Adversarial Network Based Sinogram Inpainting Method for Ultra-Limited-Angle Computed Tomography Imaging
Next Article in Special Issue
Climate-Triggered Insect Defoliators and Forest Fires Using Multitemporal Landsat and TerraClimate Data in NE Iran: An Application of GEOBIA TreeNet and Panel Data Analysis
Previous Article in Journal
A Method Framework for Automatic Airspace Reconfiguration-Monte Carlo Method for Eliminating Irregular Sector Shapes Generated by Region Growth Method
Previous Article in Special Issue
Estimation of Arsenic Content in Soil Based on Laboratory and Field Reflectance Spectroscopy
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Novel Performance Assessment Approach Using Photogrammetric Techniques for Landslide Susceptibility Mapping with Logistic Regression, ANN and Random Forest

1
Department of Geomatics Engineering, Hacettepe University, 06800 Beytepe Ankara, Turkey
2
Department of Geological Engineering, Hacettepe University, 06800 Beytepe Ankara, Turkey
*
Author to whom correspondence should be addressed.
Sensors 2019, 19(18), 3940; https://doi.org/10.3390/s19183940
Submission received: 13 August 2019 / Revised: 8 September 2019 / Accepted: 10 September 2019 / Published: 12 September 2019

Abstract

:
Prediction of possible landslide areas is the first stage of landslide hazard mitigation efforts and is also crucial for suitable site selection. Several statistical and machine learning methodologies have been applied for the production of landslide susceptibility maps. However, the performance assessment of such methods have conventionally been carried out by utilizing existing landslide inventories. The purpose of this study is to investigate the performances of landslide susceptibility maps produced with three different machine learning algorithms, i.e., random forest, artificial neural network, and logistic regression, in a recently constructed and activated dam reservoir and assess the external quality of each map by using pre- and post-event photogrammetric datasets. The methodology introduced here was applied using digital surface models generated from aerial photogrammetric flight data acquired before and after the dam construction. Aerial photogrammetric images acquired in 2012 and 2018 (after the dam was filled) were used to produce digital terrain models and orthophotos. The 2012 dataset was used for producing the landslide susceptibility maps and the results were evaluated by comparing the Euclidian distances between the two surface models. The results show that the random forest method outperforms the other two for predicting the future landslides.

1. Introduction

Although the Earth’s surface is dynamic and landslides are part of the dynamic processes, they are sometimes triggered by human intervention, especially for infrastructure construction purposes. Constructing a dam is an important interference to the topography and other nature characteristics of a region. In addition, proper site selection for such an expensive investment is crucial for its construction costs and life time. Landslides can be triggered by dams, and in return, they may affect the life time of dams. Therefore, producing accurate landslide inventories and reliable susceptibility maps are needed prior to site selection.
During the last three decades, thousands of landslide susceptibility maps have been produced and published in international literature. Several auto-production methods [1,2,3] for landslide susceptibility maps have been applied in the literature and their performances have generally been compared by employing landslide inventories that are prepared manually. However, the main purpose of a landslide susceptibility map is to assess the probability of landslide occurrence in an area and depict the findings spatially. In other words, a landslide susceptibility map aims at predicting any future landslide activity. For this reason, the performance assessment of a susceptibility computation method using existing landslide inventory is a little bit open to discussion. However, it has been almost impossible to assess the performance of a landslide susceptibility map up to now due to the lack of historical information and records. Additionally, according to [4], one of the emerging priorities and new frontiers in engineering geology is validation of models in order to provide more quantitative and reliable results. Gokceoglu and Sezer [5] stated that the increasing trend in the regional landslide susceptibility and hazard assessments is expected because regional landslide susceptibility assessment is considered as the first stage of landslide hazard mitigation efforts. A recent review on landslide susceptibility literature [6] has shown the rising trend in the number of publications on this topic. Despite the increasing number of studies, validation of the methods and the susceptibility maps rely only on the existing landslide inventories.
Among several landslide susceptibility map production methods proposed in the literature, artificial neural network (ANN) ([7,8,9,10,11,12]) and logistic regression (LR) (i.e., [13,14,15,16,17,18,19,20]) methods were widely used and often found successful. On the other hand, the application of random forest (RF) method is relatively new and rare (i.e., [21,22,23,24,25,26]). In this study, these three data-driven machine learning (ML) methods were selected for landslide susceptibility mapping and performance comparison. The main goal of this study is to assess the performances of the landslide susceptibility results obtained from these methods (i.e., ANN, LR, RF) in close vicinity of a recently built dam reservoir area in Turkey. Using the 2012 photogrammetric data, landslide susceptibility models were produced. Model training and prediction performances were evaluated separately. The model training performances of three different machine learning algorithms were controlled by the Receiver Operating Characteristic (ROC) curve approach. On the other hand, the prediction performances of the models were evaluated by a novel process. First, the comparison of the surface models of 2012 and 2018 was established. The Euclidean distance approach was used for this purpose. Depending on the Euclidean distance evaluations, surface deformations were determined. New landslides and new activities within the existing failures were defined by investigating the surface deformations observed between 2012 and 2018. As a result of this analysis, the distribution of the post-2012 landslide activities in the landslide susceptibility classes of the 2012 models were determined. This distribution corresponds to the prediction performance of the landslides susceptibility models of 2012. In the models having higher prediction performances, it is expected that the post-2012 landslide activities are observed in areas having higher landslide susceptibility values. As is widely known, slope instabilities around dam reservoirs can be triggered by water, and expected landslides serve to achieve the aims of this study by eliminating the need for natural triggers. The study area covers Kalekoy Dam, which is located in Bingol Province in the Eastern part of Turkey. The reservoir area is located in an active tectonic area and has steep and young morphology in a mountainous region. Consequently, the case investigated in the present study is one of the rare examples for performance assessment of regional landslide susceptibility approaches.

2. Characteristics of the Study Area

The study area, Kalekoy Dam, is located in an active tectonic valley and is limited by the lithological boundary of Late Miocene-Pliocene aged undifferentiated volcanic rocks [27]. Volcanic rocks are mainly composed of basalt, andesite, and pyroclastic layers [28]. It sometimes includes sedimentary intercalations, and is partly built from volcanic sediments [28]. The major geological structure observed in the study area is the Kalekoy segment of the Kavakbasi right lateral strike slip active fault [29]. As mentioned, the study area is an active tectonic valley mainly controlled by the Kalekoy segment. According to the Köppen climate classification, a “Dsa” climate is observed in the study area. “Dsa” means the winters are cold and the summers are arid hot [30]. The mean annual precipitation is given as 701 mm [31]. Due to high seismicity and active tectonism, the topography is very young and steep, and the lithological units have weak shear strength characteristics. Combination of these two adverse factors makes the region prone to landsliding. Solhan volcanic rock units constitute the most landslide-prone lithologies in the regions. The transformation of tuffs into clay by weathering processes cause a considerable reduction in shear strength. Additionally, due to the degradation and weathering processes of basalts, clays are also formed, and these clay layers constitute impermeable layers in the formation. Therefore, percolated surface water and groundwater are able to be accumulated on these layers and increase pore water pressures in the weathered parts of the Solhan volcanic rocks in particular. This simplified process causes many landslides in the region.
Figure 1 depicts location of the study area, the color-coded digital terrain model (DTM) of the year 2012 (before the dam construction), and the landslide inventory. The landslide inventory has been prepared manually using the orthophotos and the DTM of the year 2012. The area has been delineated according to a single lithology type to ensure that the whole area has the same lithological characteristics.
A total of 46 landslides occurring in the Solhan volcanic rocks were detected in the study area as shown in Figure 1. Although the landslide inventory of the study area was prepared from the DTM and the orthophotos, an extensive field study was also carried out in order to ground control of the landslide inventory (Figure 2). The landslides were classified as slide according to the classification system suggested by [32]. Additionally, according to [32], the activity states are mainly defined as dormant, the distribution of the activity is defined as retrogressing, and the main activity style is defined as successive. The minimum landslide area was calculated as 9725 m2, while the maximum landslide area was found to be 382,327 m2 (Table 1). To determine the landslide volumes, the empirical equation suggested by Larsen et al. [33] was implemented. Accordingly, the minimum and maximum landslide volumes were determined as 29,941 m3 and 802,017.7 m3, respectively. Considering the landslide dimensions defined by IAEG [34], the expected depths of failures were also determined. The minimum depth value was calculated as 19 m while that of maximum value was found to be 63 m. The mean depth value of all failures observed in the study area was calculated as 33 m. An additional landslide mapping was also carried out using the DTM produced from ortophotos taken in 2018. Both new occurrences of landslides and activities on the existing failures were determined in the data of 2018. The comparisons performed of which details are given in further sections of this paper, justify the landslide activity definitions given up to here.

3. Methodology and Data

A generic workflow of the study is provided in Figure 3. Aerial photogrammetric images acquired in two different years, 2012 and 2018, using a large-format aerial camera, were obtained from the General Directorate of Mapping (GDM) in Turkey together with image orientation and camera calibration parameters. The datasets were processed with photogrammetric methods to generate dense point cloud, high resolution digital surface models (DSMs), and orthophotos. The DTMs were produced from the DSMs through point cloud filtering. The buildings and the vegetation were removed in a filtering process and the remaining surface models include the terrain surface only. The topographic (or digital elevation model, DEM) derivatives, such as slope, aspect, curvature, etc., were computed from the DTMs. The landslide inventory and the feature images obtained from the topographic derivatives were used to produce the landslide susceptibility maps of the area by using the RF, LR, and ANN techniques. The DTMs of 2012 and 2018 were compared using the least squares surface matching and comparison approach developed by [35]. The results of the DTM comparison, which reflect the landsliding caused by the dam, were used for the visual inspection of the performances of the ML methods. A quantitative assessment is not yet possible since some of the landslides have been filled with water. More details on the data and the methods are given in the following sub-sections.

3.1. Photogrammetric Processing

Aerial photogrammetric flight missions have been regularly performed by GDM in Turkey since 1939 [36]. Although the standard outputs involve manually measured DTM and orthoimages, raw (geometrically and radiometrically unprocessed) images and their interior orientation (IO) and exterior orientation (EO) parameters were obtained from the GDM in order to produce dense DSM and DTM of the area. The IO (camera calibration) parameters provided by the camera vendor (Ultracam Eagle from Vexcel Imaging, Graz, Austria) and the EO parameters measured by onboard GNSS and INS instruments have been refined in a bundle block adjustment process using sufficient number of ground control points for every photogrammetric project. Therefore, the adjusted IO and EO parameters delivered by GDM ensure photogrammetric point positioning accuracy of better than one pixel (i.e., sub-pixel accuracy) as per the national map production standards using aerial photogrammetric methods in Turkey.
Data from two different flight missions, i.e., performed in years 2012 and 2018, were employed in this study. The main flight parameters are provided in Table 2. The ground sampling distances (GSD) of the photos vary due to the altitude variances of the terrain (ca. 1000–2100 m). The triangulation accuracies obtained from these photogrammetric blocks are around 1/3 pixel in planimetry and 1/2 pixel in height (ref.: personal communications with GDM personnel). The images and the orientation parameters were used for the generation of dense point cloud and orthoimage mosaics. The output DSMs were filtered to obtain DTMs, and then down-sampled by spatial interpolation to 10 m grid DTMs in order to achieve computational efficiency in the latter steps.

3.2. Topographic Derivatives

In the present study, only topographic parameters (i.e., derivatives computed from DTM of 2012) were employed. The area within the unique lithology was evaluated. Almost all of the landslides observed in the field are observed within this unit. As mentioned, the Solhan volcanics are prone to landslides in terms of their rock weathering and structural properties. The approach is that, when lithology is accepted as unique, the landslide susceptibility is controlled by the topographic parameters that vary depending on rock weathering and structural properties. In other words, the topography represents the rock mass strength characteristics that vary depending on rock weathering and structural properties. The Solhan volcanics are located in the Bingol complex and the East Anatolian Fault Zone. For this reason, these parameters were not considered when performing landslide susceptibility assessments. Among the topographic derivatives, slope, aspect, altitude, stream power index, distance to channel, distance to ridge, stream power index, general, plan, and profile curvatures were considered as conditioning parameters during the landslide susceptibility analyses.
Figure 4 shows the spatial distributions of the altitude, slope and aspect calculations from 2012 DTM. The main stream in the study area is Murat River and the general physiographic trend of the study area NW-SE. The lowest altitude is 1012 m while the maximum is 2061 m. Especially the northeast parts of the area show the maximum altitudes. The highest slopes are observed around Murat River and the north eastern parts of the area. The reservoir slopes are relatively high and therefore more susceptible to landsliding. Depending on the general physiographic trend of the area, the major slope aspects have SW and NE directions.
The parameter the stream power index (SPI) simply defines erosive power of flowing water based on the assumption that discharge is proportional to specific catchment area [37]. In order to evaluate the SPI parameter, Moore et al. [38] propose the following equation (Equation (1)):
S P I = A s × t a n β
where As is the specific catchment area (m2 m−1), β is the slope gradient (°).
Topographic wetness index (TWI) defines the effect of topography on the location and size of saturated source areas of runoff generation [38]. The parameter TWI has been significantly investigated in landslide susceptibility mapping and constitutes one of the most common used conditioning factors [39]. Moore et al. [38] also suggested an equation for the calculation of TWI under assumed steady-state conditions and uniform soil properties (Equation (2)). Both the SPI and TWI maps of the study area provided in Figure 5.
T W I = l n ( A s t a n β )
Slope curvatures are the second derivatives of DTMs. The curvature parameters define the rate of change of the first derivatives slope gradient and slope aspect in a certain direction [37]. In addition, the distances to the channels (drainage) and the ridges are also effective parameters on the landslide susceptibility and have been employed in previous studies [7]. The curvature maps (general, plan, and profile) and distance maps to the channels and the ridges produced from 2012 DTM are shown in Figure 6. The overall statistical values calculated from the topographic derivative parameters are given in Table 3. In addition, the histograms of all derivative calculations are shown in Figure 7.

3.3. Application of the ML Algorithms

LR, ANN, and RF are widely used supervised learning algorithms in machine learning. Supervised learning methods in principle use data samples and estimate the model parameters based on those data samples. Particularly for landslide susceptibility mapping, the model learns from actual landslide areas and no-landslide areas and gives predictions for the future landslides, known as probability of occurrence of landslide. On the other hand, one can consider landslide susceptibility mapping as a binary classification in supervised learning where each input pixel is actually labeled as 0 and 1. This classification output refers the landslide occurrence probability of each pixel in the whole dataset.
The method of LR relies on well-known sigmoid function and gives the probability of landslide occurrence in the range of 0 and 1. Many references can be found on the details of LR method [40,41]. The formula for logistic regression is given in Equation (3).
P ( y = 1 | x ; θ ) =   1 1 + e   θ x T
where θ   shows parameter vector, [ θ 0 ,   θ 1 ,   θ 2 ,   , θ n ]   , x shows feature vector, [ 1 ,   x 1 ,   x 2 ,   , x n ]   T .
Using independent variables   x , parameter vector θ is estimated iteratively with respect to a cost function. In this study, the cost function is binary cross entropy with L2 regularization as shown in Equation (4). Parameters estimated based on this cost function and updated in each iteration.
J ( θ ) =   i = 1 n ( y l o g ( p ) + ( 1 y ) l o g ( 1 p ) +   λ 2   j = 1 m θ 2
where p   refers likelihood of the occurance of the landslide, y   shows desired value, and λ is the regularization strength.
ANN is a more complex model in comparison to LR and aim at iterative determination of the weights with respect to the given true data samples. In the training stage, the network predicts each sample’s label using their weights and bias values. A cost function finds the distinction between the calculated label and true label. At the backpropagation stage, each weight receives an update according to gradient of the cost function. This process continues until a convergence is met or a maximum number of epoch is reached in the training. More details on the ANN method can be found in [42]. In Figure 8, a visually simplified version of ANN architecture used in this study is given.
According to Figure 8, the input layer has totally 10 neurons. In other words, the parameters are optimally estimated in a 10 dimensional space. The network has one hidden layer with 20 neurons. The last layer has only one neuron, which shows the landslide occurrence probability. The mathematical relationship is denoted in Equation (5).
h   = f (   x   w 1 +   b 1 ) p = h w 2 +   b 2
where
  • x refers feature matrix, the size of the matrix is k by n , where n is feature size, k is sample size;
  • w refers weights, the size of the weight matrix depends on the number of features and number of neurons in the hidden layer (or the number of neuron in the next layer);
  • b refers bias vector, the size of the vector dependent on the number of neurons in the next layer;
  • p refers output classification score between zero and one;
  • f ( ) is sigmoid activation function.
A few factors influence the success of the neural network applications. The first one is the dimensionality of the data. With a higher number of dimensions, more complexity occurs in the network although it helps to solve more complex classification problems. The second factor is the numbers of hidden layers and the number of neurons in the hidden layers. According to the heuristic approaches, doubling the number of features is a good choice for neural networks with one hidden layer. The third and the most important factor is the normalization of the data. In other words, all data should be in the same scale. Although neural networks are able to find a solution with non-normalized data, it takes longer and is not guaranteed to reach the global minimum of the system.
Random forests [43] can be shortly defined as an ensemble method of decision trees. It creates decision trees selecting randomly from the training data set. At the training stage, the decision trees have been evaluated separately and the best score is the average of the results of the trees. The feature selection plays a key role because RF tries to select the most important features when creating decision trees. There are a few important parameters for RF. The first one is the number of trees in the forest, which defines how many decision trees are created in the training. The general approach to set this parameter is that a large number of trees help to generalize the model better. Another parameter is the number of features, which refers the depth of the decision trees. With a larger number of features, better accuracy can be obtained. It should be noted that larger values increase the computational time and one should therefore consider the time in parameterization.
All three ML models were implemented in Python programming language using Scikit-learn [44] ML library. The parameters for each model are summarized in Table 4.

3.4. Performance Assessment Approaches to the Landslide Susceptibility Maps

Two main performance assessment approaches were implemented in this study. The first one is the conventional internal accuracy assessment approach based on the visual analysis of the Receiver Operating Characteristic (ROC) curves and the area under curve (AUC). The ROC curve has been a common way of visualizing a classifier’s performance, and the AUC is one of the best single number indicator [45]. According to the AUC, a larger area shows the better classification accuracy.
As the second approach, the performance of the landslide susceptibility maps was assessed externally by analyzing the landslides occurred after the susceptibility maps were produced. For this purpose, the susceptibility maps produced using the 2012 dataset were assessed with the terrain deformation information derived using 2012 and 2018 datasets. A least square surface matching algorithm and tool, i.e., LS3D, developed by [35] and later used for diverse applications ranging from cultural heritage and object modelling to geomorphology [46,47,48,49,50,51], was used here. Since 1D height differences along the z-axes may not truly represent the surface-to-surface distance, the Euclidean (3D) distances as provided in LS3D were analyzed here. Although the algorithm is powerful enough to estimate the transformation parameters between two or more 3D surfaces by minimizing the sum of the squares of the Euclidean distances between the surfaces, no surface matching was performed in this study since the absolute orientation of the photogrammetric datasets were very accurate.

4. Results

The ROC curves [52] and the AUCs for each model are demonstrated in Figure 9. This assessment method was used previously by several authors [53,54,55,56] to check the model performance. In the present study, the ROC curves were also used for same purpose. As shown in Figure 9, the RF method yields the best classification accuracy with an area value of 0.95, where ANN comes second in the performance with an area of 0.84. The LR algorithm leads to the least accurate results. Assessment of the ROC curve (Figure 9) shows that all ML approaches employed in the present study provide acceptable results.
The landslide susceptibility maps and the surface comparison results between 2012 and 2018 DTMs were provided in the following sub-sections for further visual analysis. As a novel performance assessment, the resulting maps and orthophotos in some parts of the areas were draped on the DTMs in a 3D geovisualization software.

4.1. Landslide Susceptibility Maps

The general overviews of the produced landslide susceptibility maps are provided in Figure 10, Figure 11 and Figure 12 in both 2D and 3D. From the figures, it can be seen that LR produces a very similar output to the altitude classification map (DTM) from 2012, as shown previously in Figure 4.

4.2. Surface Comparison Results

The surface comparison, as the novel part of the study, was performed between the 2012 and 2018 DTMs. The latter one was produced from the images acquired after the dam reservoir has been filled. Therefore, both construction efforts and landslides cause differences between both terrain models. The Euclidian distance residuals plot of the whole area obtained from LS3D are provided in Figure 13. The red and the blue areas in Figure 13 demonstrate the highest elevation changes. Two parts of the study area, one that comprise the road and the other one that comprise the dam construction site were analyzed visually and screenshots showing the performances of each ML and the terrain deformations are shown in Figure 14 and Figure 15. The darker blue areas in all residual plots represent landslides, although part of those are affected by the road construction. All ML methods used here detect the landslide areas, although LR and ANN methods are more conservative than the RF method as can be seen from Figure 14 and Figure 15.

5. Discussions and Conclusions

Regional landslide susceptibility assessment is one of the hot topics in the international literature since these assessments are used for landslide hazard mitigation efforts. Depending on advancements of the ML techniques, the performances of these assessments have improved significantly. However, during the performance assessment stage, existing landslide inventories have been used as an obligation. Similarly, for the model performance, the landslide inventory before 2012 was used. However, in the present study, landslide susceptibility maps were produced using the DTM of 2012 (before construction of the dam), and the DTM of 2018 (after construction of the dam) was used for comparison. Such comparison provides more realistic assessment because the main purpose of a landslide susceptibility map is to predict future landslides. In the present study, fully completed datasets were used for this purpose. The results and conclusions obtained from the present study can be drawn as follows:
  • Considering the geological and geomorphological characteristics of the study area, it is evident that the reservoir area is highly susceptible to landsliding, and a total of 46 landslides were mapped employing orthophotos of 2012. The landslide inventory was checked with extensive field observations. The study area is generally covered by sparse vegetation cover, and the landslides in the study area can be easily observed. Consequently, the reliability of the landslide inventory is high. The performance of a data-driven method is directly based on the quality of the data and the suitability of the method used.
  • Three landslide susceptibility maps were produced by applying three different ML techniques (LS, ANN, and RF). Considering the ROC assessment, it is possible to say that all maps show good performance. However, the RF method yields the best classification accuracy with an area value of 0.95, where ANN comes second in the performance with an area of 0.84.
  • Based on the surface comparison analysis, the most conservative results were given by LR, while the most representative results were provided by RF. Consequently, the RF is the most appropriate approach for the area considered in the present study. These results are supported by the model performance assessment. In other words, there is a good agreement between AUC values and Euclidean distance calculations introduced in the present study.
  • Performance assessment of landslide susceptibility map is still one of major research areas among the international landslide community. For this reason, such type cases are extremely important for accurate assessment of performances of the existing techniques. In other words, if landslide susceptibility maps are produced before occurring a triggering event, performance assessment can be performed accurately after triggering event using Euclidean distance calculations.
  • In addition to developing technologies and methods, the presence of more and representative historical data makes landslide susceptibility maps more efficient. Therefore, landslide susceptibility maps are becoming more effective and useful tools for reducing losses caused by landslides. Some recent studies [57,58] described new approaches (CitSci) and new techniques (Convolutional Neural Networks) to collect the landslide data and to check the quality of the data. Consequently, the approach introduced in the present study has promising results for increasing the quality of landslide susceptibility maps because of the increase in the amount and quality of data.

Author Contributions

C.G. performed conceptualization. E.S., H.A.N., and S.K. performed data curation. E.S. and S.K. carried out formal analysis. S.K. administrated the project and developed methodology. E.S. developed the software and carried out the application. S.K. and C.G. supervised the project. H.A.N. validated the outputs. S.K. and H.A.N. wrote the draft, and C.G. finalized the writing.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Akgun, A.; Sezer, E.A.; Nefeslioglu, H.A.; Gokceoglu, C.; Pradhan, B. An easy-to-use MATLAB program (MamLand) for the assessment of landslide susceptibility using a Mamdani fuzzy algorithm. Comput. Geosci. 2012, 38, 23–34. [Google Scholar] [CrossRef]
  2. Osna, T.; Sezer, E.A.; Akgun, A. GeoFIS: An integrated tool for the assessment of landslide susceptibility. Comput. Geosci. 2014, 66, 20–30. [Google Scholar] [CrossRef]
  3. Sezer, E.A.; Nefeslioglu, H.A.; Osna, T. An expert-based landslide susceptibility mapping (LSM) module developed for Netcad Architect Software. Comput. Geosci. 2017, 98, 26–37. [Google Scholar] [CrossRef]
  4. Juang, C.H.; Carranza-Torres, C.; Crosta, G.; Dong, J.; Gokceoglu, C.; Jibson, R.W.; Shakoor, A.; Tang, H.; van Asch, T.W.J.; Wasowski, J. Engineering geology—A fifty year perspective. Eng. Geol. 2016, 201, 67–70. [Google Scholar] [CrossRef]
  5. Gokceoglu, C.; Sezer, E. A statistical assessment on international landslide literature (1945–2008). Landslides 2009, 6, 345–351. [Google Scholar] [CrossRef]
  6. Reichenbach, P.; Rossia, M.; Malamud, B.D.; Mihir, M.; Guzzetti, F. A review of statistically-based landslide susceptibility models. Earth Sci. Rev. 2018, 180, 60–91. [Google Scholar] [CrossRef]
  7. Nefeslioglu, H.A.; Gokceoglu, C.; Sonmez, H. An assessment on the use of logistic regression and artificial neural networks with different sampling strategies for the preparation of landslide susceptibility maps. Eng. Geol. 2008, 97, 171–191. [Google Scholar] [CrossRef]
  8. Polykretis, C.; Ferentinou, M.; Chalkias, C. A comparative study of landslide susceptibility mapping using landslide susceptibility index and artificial neural networks in the Krios River and Krathis River catchments (northern Peloponnesus, Greece). Bull. Eng. Geol. Environ. 2015, 74, 27–45. [Google Scholar] [CrossRef]
  9. Tien Bui, D.; Tuan, T.A.; Klempe, H.; Pradhan, B.; Revhaug, I. Spatial prediction models for shallow landslide hazards: A comparative assessment of the efficacy of support vector machines, artificial neural networks, kernel logistic regression, and logistic model tree. Landslides 2016, 13, 361–378. [Google Scholar] [CrossRef]
  10. Chen, W.; Pourghasemi, H.R.; Zhao, Z. A GIS-based comparative study of Dempster-Shafer, logistic regression and artificial neural network models for landslide susceptibility mapping. Geocarto Int. 2017, 32, 367–385. [Google Scholar] [CrossRef]
  11. Pham, B.T.; Shirzadi, A.; Tien Bui, D.; Prakash, I.; Dholakia, M. A hybrid machine learning ensemble approach based on a radial basis function neural network and rotation forest for landslide susceptibility modeling: A case study in the Himalayan area, India. Int. J. Sediment Res. 2018, 33, 157–170. [Google Scholar] [CrossRef]
  12. Tian, Y.; Xu, C.; Hong, H.; Zhou, Q.; Wang, D. Mapping earthquake-triggered landslide susceptibility by use of artificial neural network (ANN) models an example of the 2013 Minxian (China) Mw 5.9 event. Geomat. Nat. Hazards Risk 2019, 10, 1–25. [Google Scholar] [CrossRef]
  13. Ayalew, L.; Yamagishi, H. The application of GIS-based logistic regression for landslide susceptibility mapping in the Kakuda Yahiko Mountains, Central Japan. Geomorphology 2005, 65, 15–31. [Google Scholar] [CrossRef]
  14. Can, T.; Nefeslioglu, H.A.; Gokceoglu, C.; Sonmez, H.; Duman, T.Y. Susceptibility assessments of shallow earthflows triggered by heavy rainfall at three catchments by logistic regression analyses. Geomorphology 2005, 72, 250–271. [Google Scholar] [CrossRef]
  15. Duman, T.Y.; Can, T.; Gokceoglu, C.; Nefeslioglu, H.A.; Sonmez, H. Application of logistic regression for landslide susceptibility zoning of Cekmece Area, Istanbul, Turkey. Environ. Geol. 2006, 51, 241–256. [Google Scholar] [CrossRef]
  16. Gorum, T.; Gonencgil, B.; Gokceoglu, C.; Nefeslioglu, H.A. Implementation of reconstructed geomorphologic units in landslide susceptibility mapping: The Melen Gorge (NW Turkey). Nat. Hazards 2008, 46, 323–351. [Google Scholar] [CrossRef]
  17. Nefeslioglu, H.A.; San, B.T.; Gokceoglu, C.; Duman, T.Y. An assessment on the use of Terra ASTER L3A data in landslide susceptibility mapping. J. Appl. Earth Obs. Geoinf. 2012, 14, 40–60. [Google Scholar] [CrossRef]
  18. Lee, S.; Won, J.S.; Jeon, S.W.; Park, I.; Lee, M.J. Spatial landslide hazard prediction using rainfall probability and a logistic regression model. Math. Geosci. 2015, 47, 565–589. [Google Scholar] [CrossRef]
  19. Tsangaratos, P.; Ilia, I. Comparison of a logistic regression and Naive Bayes classifier in landslide susceptibility assessments: The influence of models complexity and training dataset size. Catena 2016, 145, 164–179. [Google Scholar] [CrossRef]
  20. Du, G.L.; Zhang, Y.S.; Iqbal, J.; Yang, Z.H.; Yao, X. Landslide susceptibility mapping using integrated model of information value method and logistic regression in the Bailongjiang watershed, Gansu province, China. J. Mt. Sci. 2017, 14, 249–268. [Google Scholar] [CrossRef]
  21. Hong, H.Y.; Pourghasemi, H.R.; Pourtaghi, Z.S. Landslide susceptibility assessment in Lianhua County (China): A comparison between random forest data mining technique and bivariate and multivariate statistical models. Geomorphology 2016, 259, 105–118. [Google Scholar] [CrossRef]
  22. Hong, H.; Miao, Y.; Liua, J.; Zhu, A.X. Exploring the effects of the design and quantity of absence data on the performance of random forest-based landslide susceptibility mapping. Catena 2019, 176, 45–64. [Google Scholar] [CrossRef]
  23. Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Tien Bui, D.; Duan, Z.; Ma, J. A comparative study of logistic model tree, random forest, and classification and regression tree models for spatial prediction of landslide susceptibility. Catena 2017, 151, 147–160. [Google Scholar] [CrossRef] [Green Version]
  24. Chu, L.; Wang, L.J.; Jiang, J.; Liu, X.; Sawada, K.; Zhang, J. Comparison of landslide susceptibility maps using random forest and multivariate adaptive regression spline models in combination with catchment map units. Geosci. J. 2019, 23, 341–355. [Google Scholar] [CrossRef]
  25. Youssef, A.M.; Pourghasemi, H.R.; Pourtaghi, Z.S.; Al-Katheeri, M.M. Landslide susceptibility mapping using random forest, boosted regression tree, classification and regression tree, and general linear models and comparison of their performance at Wadi Tayyah Basin, Asir Region, Saudi Arabia. Landslides 2016, 13, 839–856. [Google Scholar] [CrossRef]
  26. Dou, J.; Yunus, A.P.; Tien Bui, D.; Merghadi, A.; Sahana, M.; Zhu, Z.; Chen, C.W.; Khosravi, K.; Yang, Y.; Thai Pham, B. Assessment of advanced random forest and decision tree algorithms for modeling rainfall-induced landslide susceptibility in the Izu-Oshima Volcanic Island, Japan. Sci. Total Environ. 2019, 662, 332–346. [Google Scholar] [CrossRef] [PubMed]
  27. Akbaş, B.; Akdeniz, N.; Aksay, A.; Altun, İ.; Balcı, V.; Bilginer, E.; Bilgiç, T.; Duru, M.; Ercan, T.; Gedik, İ.; et al. Türkiye Jeoloji Haritası; Maden Tetkik ve Arama Genel Müdürlüğü Yayını: Ankara, Turkey, 2016.
  28. Yilmaz, Y.; Saroglu, F.; Guner, Y. Doğu Anadolu’da Solhan (Muş) Volkanitleri’nin Petrojenetik İncelenmesi. Yerbilimleri 1987, 14, 133–163. [Google Scholar]
  29. Emre, Ö.; Duman, T.Y.; Özalp, S.; Elmacı, H.; Olgun, Ş.; Şaroğlu, F. 1/1.250.000 Ölçekli Türkiye Diri Fay Haritası; Maden Tetkik ve Arama Genel Müdürlüğü Özel Yayınlar Serisi; Maden Tetkik ve Arama Genel Müdürlüğü: Ankara, Turkey, 2013.
  30. MGM. Köppen İklim Sınıflandırmasına Göre Turkiye İklimi; Araştırma Dairesi Başkanlığı, Klimatoloji Şube Müdürlüğü; Meteoroloji Genel Müdürlüğü: Ankara, Turkey, 2016.
  31. MGM: Meteoroloji Genel Müdürlüğü. Available online: https://www.mgm.gov.tr/ (accessed on 13 August 2019).
  32. Cruden, D.M.; Varnes, D.J. Landslide types and processes. In Landslides: Investigation and Mitigation; Turner, A.K., Schuster, R.L., Eds.; National Research Council Transportation Research Board Special Report (Book 247); Transportation Research Board: Washington, DC, USA, 1996; pp. 36–75. [Google Scholar]
  33. Larsen, I.J.; Montgomery, D.R.; Korup, O. Landslide erosion controlled by hillslope material. Nat. Geosci. 2010, 3, 247–251. [Google Scholar] [CrossRef]
  34. IAEG (Commision on Landslides). Suggested nomenclature for landslides. Bull. Int. Assoc. Eng. Geol. 1990, 41, 13–16. [Google Scholar] [CrossRef]
  35. Gruen, A.; Akca, D. Least squares 3D surface and curve matching. ISPRS J. Photogramm. Remote Sens. 2005, 59, 151–174. [Google Scholar] [CrossRef] [Green Version]
  36. HGM: Sayisal Uretim Faaliyetlerinde Kilometre Taslari. Available online: https://www.harita.gov.tr/kilometre-taslari (accessed on 2 June 2019).
  37. Wilson, J.P.; Gallant, J.C. Terrain Analysis Principles and Applications; John Wiley and Sons, Inc.: Ottawa, ON, Canada, 2000. [Google Scholar]
  38. Moore, I.D.; Grayson, R.B.; Ladson, A.R. Digital terrain modeling: A review of hydrological, geomorphological, and biological applications. Hydrol. Process. 1991, 5, 3–30. [Google Scholar] [CrossRef]
  39. Gokceoglu, C.; Sonmez, H.; Nefeslioglu, H.A.; Duman, T.Y.; Can, T. The 17 March 2005 Kuzulu landslide (Sivas, Turkey) and landslide-susceptibility map of its near vicinity. Eng. Geol. 2005, 81, 65–83. [Google Scholar] [CrossRef]
  40. Hosmer, D.W.; Lemeshow, S. Applied Logistic Regression; John Wiley & Sons: New York, NY, USA, 2000; p. 375. [Google Scholar]
  41. Kleinbaum, D.G.; Klein, M. Logistic Regression: A Self Learning Text, 2nd ed.; Springer: New York, NY, USA, 2002; p. 513. [Google Scholar]
  42. Negnevitsky, M. Artificial Intelligence—A Guide to Intelligent Systems; Addison-Wesley Co.: Edinburgh, UK, 2002. [Google Scholar]
  43. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  44. Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine Learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
  45. Bradley, A.P. The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recognit. 1997, 30, 1145–1159. [Google Scholar] [CrossRef] [Green Version]
  46. Zhang, L.; Kocaman, S.; Akca, D.; Kornus, W.; Baltsavias, E. Test and performance evaluation of DMC images and new methods for their processing. In Proceedings of the ISPRS Commission I Symposium, Paris, France, 3–6 July 2006. [Google Scholar]
  47. Akca, D.; Gruen, A.; Alkis, Z.; Demir, N.; Breuckmann, B.; Erduyan, I.; Nadir, E. 3D modeling of the Weary Herakles statue with a coded structured light system. In International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences, Proceedings of the ISPRS Commission V Symposium, Dresden, Germany, 25–27 September 2006; Institute of Geodesy and Photogrammetry, Swiss Federal Institute of Technology: Zurich, Switzerland, 2006; Volume XXXVI, Part 5; pp. 14–19. [Google Scholar]
  48. Seybold, H.J.; Molnar, P.; Akca, D.; Doumi, M.; Cavalcanti Tavares, M.; Shinbrot, T.; Andrade, J.S., Jr.; Kinzelbach, W.; Herrmann, H.J. Topography of inland deltas: Observations, modeling, and experiments. Geophys. Res. Lett. 2010, 37, L08402. [Google Scholar] [CrossRef]
  49. Akca, D. Least Squares 3D Surface Matching. Ph.D. Thesis, Institute of Geodesy and Photogrammetry, ETH Zurich, Switzerland, 2007; p. 78. [Google Scholar]
  50. Akca, D. Co-registration of surfaces by 3D Least Squares matching. Photogramm. Eng. Remote Sens. 2010, 76, 307–318. [Google Scholar] [CrossRef]
  51. Akca, D.; Seybold, H.J. Monitoring of a laboratory-scale inland-delta formation using a structured-light system. Photogramm. Rec. 2016, 31, 121–142. [Google Scholar] [CrossRef]
  52. Swets, J.A. Measuring the accuracy of diagnostic systems. Science 1998, 240, 1285–1293. [Google Scholar] [CrossRef] [PubMed]
  53. Hong, H.Y.; Shahabi, H.; Shirzadi, A.; Chen, W.; Chapi, K.; Bin Ahmad, B.; Roodposhti, M.S.; Hesar, A.Y.; Tian, Y.Y.; Bui, D.T. Landslide susceptibility assessment at the Wuning area, China: A comparison between multi-criteria decision making, bivariate statistical and machine learning methods. Nat. Hazards 2019, 96, 173–212. [Google Scholar] [CrossRef]
  54. Chen, W.; Yan, X.S.; Zhao, Z.; Hong, H.Y.; Bui, D.T.; Pradhan, B. Spatial prediction of landslide susceptibility using data mining-based kernel logistic regression, naive Bayes and RBF Network models for the Long County area (China). Bull. Eng. Geol. Environ. 2019, 78, 247–266. [Google Scholar] [CrossRef]
  55. Rahali, H. Improving the reliability of landslide susceptibility mapping through spatial uncertainty analysis: A case study of Al Hoceima, Northern Morocco. Geocarto Int. 2019, 34, 43–77. [Google Scholar] [CrossRef]
  56. Ozer, B.C.; Mutlu, B.; Nefeslioglu, H.A.; Sezer, E.A.; Rouai, M.; Dekayir, A.; Gokceoglu, C. On the use of hierarchical fuzzy inference systems (HFIS) in expert-based landslide susceptibility mapping: The central part of the Rif Mountains (Morocco). Bull. Eng. Geol. Environ. 2019. [Google Scholar] [CrossRef]
  57. Kocaman, S.; Gokceoglu, C. A CitSci app for landslide data collection. Landslides 2019, 16, 611–615. [Google Scholar] [CrossRef]
  58. Can, R.; Kocaman, S.; Gokceoglu, C. A Convolutional neural network architecture for auto-detection of landslide photographs to assess citizen science and volunteered geographic information data quality. ISPRS Int. J. Geo Inf. 2019, 8, 300. [Google Scholar] [CrossRef]
Figure 1. Color-coded digital terrain model (DTM) (from year 2012) of the study area and the landslides.
Figure 1. Color-coded digital terrain model (DTM) (from year 2012) of the study area and the landslides.
Sensors 19 03940 g001
Figure 2. Photos showing the typical landslide morphology in the study area (Photos from the area taken before the dam construction).
Figure 2. Photos showing the typical landslide morphology in the study area (Photos from the area taken before the dam construction).
Sensors 19 03940 g002
Figure 3. Overall workflow of the methodology.
Figure 3. Overall workflow of the methodology.
Sensors 19 03940 g003
Figure 4. Spatial distribution of (a) altitude in meters, (b) slope in degrees, and (c) aspect as geographical directions. The parameters have been calculated from the DTM of year 2012.
Figure 4. Spatial distribution of (a) altitude in meters, (b) slope in degrees, and (c) aspect as geographical directions. The parameters have been calculated from the DTM of year 2012.
Sensors 19 03940 g004
Figure 5. Stream power index (SPI) and topographic wetness index (TWI) maps of the study area produced from 2012 DTM.
Figure 5. Stream power index (SPI) and topographic wetness index (TWI) maps of the study area produced from 2012 DTM.
Sensors 19 03940 g005
Figure 6. Distance maps to the channels and the ridges and the curvature (general, plan, and profile) parameters produced from 2012 DTM.
Figure 6. Distance maps to the channels and the ridges and the curvature (general, plan, and profile) parameters produced from 2012 DTM.
Sensors 19 03940 g006
Figure 7. Histograms of topographical parameters derived from 2012 DTM.
Figure 7. Histograms of topographical parameters derived from 2012 DTM.
Sensors 19 03940 g007
Figure 8. The ANN algorithm used in the study.
Figure 8. The ANN algorithm used in the study.
Sensors 19 03940 g008
Figure 9. ROC curves obtained from the classifications with LR, ANN, and RF.
Figure 9. ROC curves obtained from the classifications with LR, ANN, and RF.
Sensors 19 03940 g009
Figure 10. The landslide susceptibility map produced with the LR method in 2D (above) and 3D (below) visualization.
Figure 10. The landslide susceptibility map produced with the LR method in 2D (above) and 3D (below) visualization.
Sensors 19 03940 g010
Figure 11. The landslide susceptibility map produced with the ANN method in 2D (above) and 3D (below) visualization.
Figure 11. The landslide susceptibility map produced with the ANN method in 2D (above) and 3D (below) visualization.
Sensors 19 03940 g011
Figure 12. The landslide susceptibility map produced with the RF method in 2D (above) and 3D (below) visualization.
Figure 12. The landslide susceptibility map produced with the RF method in 2D (above) and 3D (below) visualization.
Sensors 19 03940 g012
Figure 13. The residual plot of the Euclidian differences obtained from the comparison between 2012 and 2018 DTMs of the study area.
Figure 13. The residual plot of the Euclidian differences obtained from the comparison between 2012 and 2018 DTMs of the study area.
Sensors 19 03940 g013
Figure 14. The ortophotos (a,b), ML outputs (ce), and plots of the Euclidian differences (f,g) obtained from the comparison between 2012 and 2018 DTMs of the study area. Dam construction area is shown here.
Figure 14. The ortophotos (a,b), ML outputs (ce), and plots of the Euclidian differences (f,g) obtained from the comparison between 2012 and 2018 DTMs of the study area. Dam construction area is shown here.
Sensors 19 03940 g014aSensors 19 03940 g014b
Figure 15. The ortophotos (a,b), ML outputs (ce), and plots of the Euclidian differences (f,g) obtained from the comparison between 2012 and 2018 DTMs of the study area. The road area is shown here.
Figure 15. The ortophotos (a,b), ML outputs (ce), and plots of the Euclidian differences (f,g) obtained from the comparison between 2012 and 2018 DTMs of the study area. The road area is shown here.
Sensors 19 03940 g015aSensors 19 03940 g015b
Table 1. Descriptive statistics of the landslide area, volume, and expected depth of failures.
Table 1. Descriptive statistics of the landslide area, volume, and expected depth of failures.
StatisticsArea (m2)Volume (m3)Expected Depth of Failure (m)
Min.9725.7829,941.1318.47
Max.382,327.003,982,504.1362.50
Mean71,114.29506,904.9332.68
Std. Deviation77,731.99802,017.7010.07
Table 2. Main specifications of the photogrammetric flight missions from 2012 and 2018.
Table 2. Main specifications of the photogrammetric flight missions from 2012 and 2018.
Flight YearNumber of PhotosFlight AltitudesGSDForward Lap & Side LapCameraImage Orientation Accuracy
2012348300 m–8900 m54 cm–58 cm~60%
~20%
Ultracam Eagle from Vexcel ImagingPlanimetry: 1/3 GSD
Height: 1/2 GSD
2018526650 m–6750 m43 cm–50 cm~70%
~30%
Table 3. Statistical summary of the values calculated from the topographic derivative parameters.
Table 3. Statistical summary of the values calculated from the topographic derivative parameters.
ParameterMaxMinMeanMedianStandard DeviationSkewnessKurtosis
Aspect6.2830.0003.1253.1621.752−0.017−1.090
Distance to Channel678.80.000113.92394.86887.9331.0921.513
Distance to Ridge492.50.000102.21289.44376.00.8930.529
Elevations2060.81011.51418.41373.6211.71.0430.704
General Curvature2.249−2.9430.0000.0030.079−1.55135.520
Plan Curvature0.079−0.0900.0000.0000.009−0.1542.039
Profile Curvature0.105−0.0990.0000.0000.008−0.5053.921
Slope1.3290.0000.3610.3620.1920.155−0.701
SPI7,899,360.00.000717.21329.77622,157.900166.12638,897.104
TWI24.0610.7945.2974.8551.9912.52311.358
Table 4. The model parameters used for ANN, logistic regression (LR), and random forest (RF) methods.
Table 4. The model parameters used for ANN, logistic regression (LR), and random forest (RF) methods.
ModelParameterValue
Logistic RegressionMax iteration1000
Loss functionBinary Cross Entropy with L2
Artifical Neural NetworkMax iteration1000
Learning Rate0.01
Optimization algorithm ADAM
Activation FunctionSigmoid
Loss FunctionBinary Cross Entropy with L2
Random ForestNumber of trees500

Share and Cite

MDPI and ACS Style

Sevgen, E.; Kocaman, S.; Nefeslioglu, H.A.; Gokceoglu, C. A Novel Performance Assessment Approach Using Photogrammetric Techniques for Landslide Susceptibility Mapping with Logistic Regression, ANN and Random Forest. Sensors 2019, 19, 3940. https://doi.org/10.3390/s19183940

AMA Style

Sevgen E, Kocaman S, Nefeslioglu HA, Gokceoglu C. A Novel Performance Assessment Approach Using Photogrammetric Techniques for Landslide Susceptibility Mapping with Logistic Regression, ANN and Random Forest. Sensors. 2019; 19(18):3940. https://doi.org/10.3390/s19183940

Chicago/Turabian Style

Sevgen, Eray, Sultan Kocaman, Hakan A. Nefeslioglu, and Candan Gokceoglu. 2019. "A Novel Performance Assessment Approach Using Photogrammetric Techniques for Landslide Susceptibility Mapping with Logistic Regression, ANN and Random Forest" Sensors 19, no. 18: 3940. https://doi.org/10.3390/s19183940

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop