Keywords

1 Introduction

Landslides are one of the most catastrophic geo-hazards that cause severe injuries, property destruction and deaths all over the world. The challenge that landslides have posed to property and infrastructure and human lives over the past few decades in Australia, have been summarized by Flentje (1998) and, Leventhal and Kotze (2008). In the Wollongong local government area in NSW, landslide costs have been reported by Osuchowski and Roberts (2011), and Tobin (2012). The total minimum estimated value of destroyed and damaged houses due to landsliding in Wollongong region is AUD $14 million and the cost of nearly AUD $300 million has been borne by government authorities over the period of 1950–2013 to restore landslide-related damages. This equates to a minimum annual expenditure of AUD $4.8 million.

With the increasing pressure to develop marginal land, consideration of landslide hazard and risk in land-use planning has become more important in recent decades. In order to assess and manage the landslide risk, the Australian Geomechanics Society Landslide Risk Management Guidelines (AGS 2007) and JTC 1 2008 (Fell et al. 2008) guidelines logically suggest that development of Landslide Inventories and then Landslide Susceptibility Zoning are essential. Application of decision trees (a knowledge-based data-mining technique) (Kohavi and Quinlan 1999; Quinlan 1986) in landslide susceptibility mapping has been gaining attention over past few decades due to its enhanced predictive capabilities and independence from subjective expert judgments (Flentje et al. 2007; Miner et al. 2010; Saito et al. 2009; Yeon et al. 2010). Until now, there have been limitations in integrating data mining and GIS techniques due to incompatibilities between data mining software requirements and readily accessible GIS datasets and tools. Besides converting GIS data into a structure which is readable in the data mining software See5 (Quinlan 1993), returning the See5 output and interpreting the confidence grid within a GIS environment is another important challenge to overcome when using data mining techniques for modelling of landslide susceptibility. The University of Wollongong Landslide Research Team has developed the LSDM toolbar for these reasons. These tools extract and convert data from any GIS layers (including from the Landslide Inventory), format the data to meet the input requirements of the See5 Data Mining software, call the See5 program, apply the results of the decision tree analysis to produce a validated numerical grid of landslide susceptibility and classify it according to the recommendations of the Landslide Risk Management guidelines (AGS 2007; Table 4).

2 Implementation of LSDM ArcGIS Add-in Toolbar

ArcGIS v.10 is enhanced with the embedded scripting language Python and a new desktop customization VB.NET add-In module. The customization capability of the ArcObjects software development kit integrated with .NET development environment (Visual Studio Express for .NET 2008), has been used in developing the LSDM toolbar. Working with ArcObjects (a library of Component Object Model components which forms the base of the ArcGIS) within a .NET development environment, enables access to a series of ArcGIS built-in tools such as data management, visualization and spatial algorithms. This has allowed the writers to develop the LSDM toolbar relatively quickly.

The Visual studio.net environment is used to implement the interface for receiving user commands. The LSDM toolbar comprised of six command buttons (Fig. 208.1) to conduct data mining and GIS tasks entirely within the desktop GIS environment.

Fig. 208.1
figure 1

LSDM toolbar

The landslide inventory (Flentje et al. 2012) is the most important evidence based input layer that is used in the preparation of landslide susceptibility and hazard zoning. A DEM and its derivatives and geology are the next primary inputs for building the model, but any layers deemed appropriate can be used. See5 gains information pixel by pixel from each data layer corresponding to landslide and non-landslide locations, the first tool component (button) in the LSDM toolbar records pixel X, Y location with the corresponding attribute values from each of the input model layers. DEM raster layer and other derivatives can be selected by the user to extract cell values into a text file with the extension of .data which is readable in See5 software. Also, a *.names file is created based on the user inputs to describe the attributes of the corresponding data layers. The second component of the tool calls See5 (a standalone license for the low cost Data Mining software See5 must be installed on the host workstation) to derive the rule sets from the input data. Analyzing the rules and calibrating the susceptibility model is also done automatically at this stage. The third component of the LSDM toolbar is used to re-map See5 rules into GIS map algebra functions. These functions apply the logic of the rules using the input data layers so that the modelled outcome merges all the See5 rule based predictions into a new floating point ESRI grid. This grid represents the landslide susceptibility with a numerical value assigned to each cell location. Also, this component provides the facility to produce the source files to draw Receiver Operating Characteristic (ROC) curves (Corominas et al. 2013) and success rate curves to assess the accuracy of the susceptibility model with respect to the Landslide Inventory distribution.

The fourth and final component can be used to visualize the ROC curves and success rate curves. Furthermore, this feature facilitates the assessment of susceptibility class boundaries (user defines the number of classes). These can be obtained by entering the percentage distribution of the landslide inventory being included in a particular class and based on these parameters; a classified map layer of the landslide susceptibility map is produced.

3 Application

A revised landslide susceptibility model has been developed using the LSDM toolbar for the Wollongong local government area in order to obtain an accurate and an up-to-date outcome using a high resolution DEM derived from the latest airborne laser scan data. Before arriving at the final result, several iterations of the susceptibility model building can be required to select the most significant landslide causative factors and filter off datasets or individual parameters which may mislead the model. A total number of 5,89,060 landslide and non-landslide pixels have been recorded with the attributes of 10 landslide causative factors identified as most relevant in the latest Wollongong study. A Landslide susceptibility model has been developed using 211 rules which depict the classification process of landslides and non-landslide pixels according to the cut-off values of one or more data layers. In the resulting susceptibility map, the high susceptibility class includes 57.1 % of the total landslide population whereas very low susceptibility class includes 4.1 % (Fig. 208.2).

Fig. 208.2
figure 2

Landslide susceptibility map of Wollongong local government area

4 Conclusion

The LSDM tool has been successfully trialled on very large grids of 3 billion pixels and it has been proven to be a powerful tool in providing resilient, quantifiable and repeatable landslide susceptibility models. The customization capabilities available with Arc-GIS have enabled the automation of the GIS data preparation for data mining, the actual data mining, calling the See5 software from within Arc and converting the final outcome into an ArcGIS grid. The LSDM toolbar rigorously classifies this grid into susceptibility classes based on the landslide distribution as per the AGS (2007) guidelines. This development has been very successful in saving time and providing a user friendly interface with built-in grid classification tools to produce an accurate and transparent outcome, essentially free from subjective expert user judgments. The integration of See5 and GIS techniques enables visualizing of the final rule based modelling outcome and assessing the accuracy both qualitatively and quantitatively. The alternative to date has been a highly time consuming and tedious series of manual processes. Given a landslide inventory and relevant data layers, months of work are now reduced potentially to a few minutes or hours of simple processing time. Upon completion of the first authors PhD, expected in late 2014, this LSDM toolbar will be made available at no cost from our University of Wollongong landslide research team website (http://eis.uow.edu.au/landslide).