Introduction

Land use/cover (LU/LC) and its underlying anthropogenic exploitation are crucial links between human activities and the natural environment on the Earth (Liu et al. 2017a, b; Muceku et al. 2016). Spatiotemporal LUCC simulations are beneficial and repeatable tools for studying and analyzing causes, trends, and consequences of alternative future landscape dynamics relative to socio-economic and natural environmental driving forces (Jia et al. 2018; Liu et al. 2017a, b; Verburg et al. 2004). The experimental modeling of land use changes is a complicated spatial dynamic and a nonlinear process (Verburg et al. 2002) to identify the patterns of changes and determine the effective variables that form the basis for modeling and prediction of future regional changes (Sangermano et al. 2012). Land change simulation is emphasized as the changing processes are accelerated due to socio-economic drivers or intense human activities.

Various approaches have been proposed to model LUCC including a multi-agent systems model (Tian and Qiao 2014; Castella and Verburg 2007), generalized linear models, unbalanced support-vector machines (Huang et al. 2009), or aggregated multivariate regression models (Yu et al. 2011; Zang and Huang 2006). The advantages and disadvantages of LUCC models have been discussed widely in the literature. For instance, the Markov chain approach can be used to predict land use changes (Liu et al. 2017a, b), but it fails to simulate the changes in the spatial distribution (Du et al. 2010). In addition, a Markov chain optimization procedure provides adequate information to support decisions; yet, it fails to present land use dynamic variations (Karimi et al. 2018; Yang et al. 2012). Hybrid models, obtained from integration of several individual models, have been recommended as a more viable modeling strategy (Marquez et al. 2019; Ghosh et al. 2017; Wang and Murayama 2017). For instance, the integration of the cellular automata (CA) and the Markov chain approaches has been used for simulation of LUCC, resulting in an improved modeling performance (Ghosh et al. 2017; Jokar Arsanjani et al. 2013). Moreover, to optimize probability maps of changes, genetic algorithm or neural network has been recently applied for LUCC modeling (Soares-Filho et al. 2013). The main goal behind integration of these models is to eliminate their defects while taking advantages of each of them.

Although integrated models are frequently and commonly used to study LUCC, there is still a limited understanding of the relationship between these factors and the causal chains constituting LUCC processes (Verburg et al. 2006). Generally, driving forces and variables of LUCC could be divided into two categories that include socio-economic and geographical factors (Jia et al. 2018). The first category often focuses on topography, location, and urban infrastructures, while the second category concentrates on GDP, amount, and density of population (Zhang et al. 2013). The selection of spatial and attribute variables for modeling and simulation procedures depends on such factors based on the objective of the study as well as the data availability. Location of a study area also affects the variables selection which specifies the parameters in the modeling procedure. But in addition to accessing the required data (Samardžić-Petrović et al. 2017), an important issue is which variables positively affect the final accuracy.

Improving the accuracy of modeling land use changes has always been a challenge in the field of land use change studies and researchers have explored different solutions for improving the accuracy of models (Shafizadeh-Moghadam et al. 2017; Kazemzadeh-Zow et al. 2017; Yirsaw et al. 2017; Chang-Martinez et al. 2015; Olmedo et al. 2015; Geri et al. 2011; Pontius and Malanson 2005). One option is to screen effective parameters by calibration procedure so as to improve the accuracy of models (Verstegen et al. 2014). In this view, it is expected that selection of effective variables, and excluding those that are considered less important, may improve the structure of a model and consequently enhance its performance (Wang et al. 2011). In addition, this approach can be effectively used to interpret and identify the factors that are important in land use changes. Given the difference in the causal-relationship patterns affecting LUCC processes in geographical regions, with the aid of this approach, we can advance our understanding of such differences and gain a better insight into the causal factors affecting the LUCC patterns. Furthermore, the analysis of the variable importance can help to reinforce the model structure, identify the critical inputs, and prioritize the tasks for updating, modifying, and simplifying a model (Ligmann-Zielinska 2013; Lilburne and Tarantola 2009). Moreover, according to the parsimony rule proposed by William of Ockham, the fourteenth-century philosopher, we should exclude factors that we do not have a proper understanding of and go straight forward to “the most accurate solution to a problem that is also the simplest and the shortest solution”. Hence, the inclusion of the variables that positively affect the accuracy of the results and the exclusion of the variables that are not important may contribute to end up with a parsimonious model and may also improve its accuracy.

In this sense, the main aim of this study is to explore whether and to what extent an adding/deleting approach can improve the accuracy of a hybrid LUCC model, in which cellular automata, Markov chain, and artificial neural network models are integrated. Adding/deleting approach is used as a technique for sensitivity analysis to measure the effect of parameters on different phenomena. To the best of our knowledge, this is the first time that an adding/deleting approach is developed for improving LUCC modeling. We have also applied special parameters for coastal regions such as “Distance to coastline”, “Distance to Mangrove forest”, and “Distance to Ports” as effective variables in LUCC modeling that could be considered as other innovations in this study. The result of the proposed approach has been compared with other commonly used hybrid modeling methods including CA_MC_ANN (cellular automata-Markov chain-artificial neural network), MC_ANN (Markov chain-artificial neural network), and CA_MC_LogReg (cellular automata-Markov chain-logistic regression) to validate the proposed method. Moreover, this study aims to obtain a sufficient understanding of the trend of LUCC in the Qeshm Island and identify the variables that have a positive impact on LUCC modeling process.

Materials and methods

Study area

Qeshm Island with an area of approximately 1500 km2, is located in the south of Iran as the biggest island in the Persian Gulf from 55°14′58″E to 56°17′27″E and from 27°00′00″N to 26°32′04″N (Fig. 1). Qeshm is a large and diverse landscape that represents great ecological richness due to its geomorphology, climate, and hydrology. We can also see a diverse architecture and culture in this coastal zone that contains active ports, which play an effective role in connecting the island, as Free Zone Area, to the other commercial centers in the region for trading purposes. In addition, fisheries, tourism, and energy are related activities that have shown to highly impact the economic performance of the area (Bayani 2016). Furthermore, Mangrove Forests in this island shaped the largest mangrove community in Iran’s coasts that turned this island into a very important area in terms of environmental resources.

Fig. 1
figure 1

Location of the study area in the south of Iran in the north of Strait of Hormuz between the Persian Gulf and Oman

Data preparation

Landsat data, in particular, are an irreplaceable source of satellite images, as they offer an extensive archive for retrospective Land Use/Land Cover (LULC) studies (Shafizadeh-Moghadam et al. 2017). We used Landsat images (the following table-totally 6 Scenes) from USGS Landsat Archive-Level-1 (Table 1):

Table 1 List of Landsat data

The topographic map of the Qeshm Island, generated by Iran’s National Cartographic Centre at the scale of 1:25,000 and the Google Earth software were used to extract training data (Keshtkar et al. 2017).

In this research, changes were detected using the post-classification method. First, cloud-free Landsat images were selected (level 1 products) and downloaded from the USGS archive as the dates of the images were close to the plant growth season (in the spring). The study area was identified and masked on the images and to use all of the image bands in the classification process, all bands (except for the thermal bands due to the spatial resolution) of each image were combined using the Layer-Stacking command and the mosaic of images was created for further processing. In the processing stage, first, the following six classes were defined in the classification phase: built-up, agricultural, dense vegetation, mangrove, water body, and barren land (Table 2).

Table 2 List of LULC classes and definitions for each class

Given the importance of the two classes, “Built-up” and “Agriculture”, we used an on-screen digitizing method to draw their boundaries using high-resolution satellite images derived from Google Earth software, in conjunction with topographic maps and pan-shaped images to make the identification of these classes more efficient. The remaining four land use classes were detected using the maximum likelihood (ML) method given the training samples from those classes. The maximum likelihood procedure is the most widely used classifier of remotely sensed imagery and is based on Bayesian probability theory (Eastman 2015, Ozturk 2015). We then merged the output of the classifier to the two classes, extracted from the on-screen digitizing, to obtain the final land use maps consisting of six classes. In order to evaluate the accuracy of the maps, 300 control points were randomly drawn from the study area for each year, and the true classes were identified at each location using the visual interpretation of the corresponding high-resolution images derived from Google Earth and also the pan-shaped Landsat images. By comparing the true classes with the generated land use maps, several accuracy metrics were calculated including the overall, user and producer accuracy as well as the kappa coefficient (Table 3).

Table 3 Summary of mapping accuracy obtained for different years

Figure 2 illustrates the classification results using the combination of on-screen digitizing and supervised classification-ML on the Qeshm Island in the form of Land use maps prepared in six classes.

Fig. 2
figure 2

Land use maps of Qeshm Island for 2002–2014

Simulation of LUCC

Cellular automata

Cellular automata, defined in a raster space, is a modeling technique that is widely used in simulations of natural and synthetic processes due to its discrete dynamic nature. In a model based on the cellular automata, space is defined as a raster lattice and each unit is called a cell. This model is developed based on interactions between the following components: (1) the network space, which is known as the lattice; (2) cells that constitute the lattice; (3) the cell state that represents the quality and the states of the cell; and (4) the transition rules that control the changes of the cell state (Mitsova et al. 2011). The transition rules can be either global or local. In this study, the Markov chain and artificial neural network (ANN) models were adopted in our hybrid model as a data-driven approach to generate global and local rules. The cellular automata then updated the lattice cells discretely and simultaneously according to the local and global rules. The value of each new cell is determined based on the values of a set of variables at the cell, as well as at the neighboring cells, as the output of the modeling process. The cellular automata model is obtained using eq. 1:

$$ {\boldsymbol{S}}_{\left(\mathbf{t},\mathbf{t}+\mathbf{1}\right)}=\mathbf{f}\left({\mathbf{S}}_{\left(\boldsymbol{t}\right)},\boldsymbol{N}\right) $$
(1)

where S denotes the limited and discrete cell state and N is the cell context. Further, t and t + 1 stand for the two different times, and f denotes the transition rules of the cell states in the local space (Sang et al. 2011).

Markov chain model

The Markov chain (MC) turned to be an important prediction method in the field of geographical research that can be used in the prediction of geographical features with no aftereffect (Sang et al. 2011). A model based on the Markov chain can express land use transitions between two periods of times that will be used to map the changes. The transitions are generated and put in a probability matrix of transitions that provides a basis for predicting and mapping future changes. The Markov chain model uses the conditional probability of Bayes’ formula to predict the land use changes (eq. 2):

$$ {\boldsymbol{S}}_{\left(\mathbf{t}+\mathbf{1}\right)}={\boldsymbol{p}}_{\left(\mathbf{ij}\right)}\ast {\mathbf{S}}_{\left(\boldsymbol{t}\right)} $$
(2)

where S(t) and S(t + 1) are the system states at times t and t + 1, respectively. In addition, p(ij) is the transfer probability matrix for one state, which is obtained using eq. (3) (Sang et al. 2011):

$$ \mathrm{pij}=\left[\begin{array}{cccc}{p}_{11}& {p}_{12}& \dots & {p}_{1n}\\ {}{p}_{21}& {p}_{22}& \dots & {p}_{2n}\\ {}...& ...& \dots & \dots \\ {}{p}_{n1}& {p}_{n2}& \dots & {p}_{nn}\end{array}\right]\kern1.00em \left[0\le {P}_{ij}<1\ \ and\sum \limits_{j=1}^N{P}_{ij}=1,\left(i,j=1,2,\dots, n\right)\right] $$
(3)

The Markov chain models are considered effective means of modeling land use evolution and provide indices for determining the directions and changes in land uses (Eastman 2015; Benito et al. 2010).

Multi-layer perceptron neural network

We used an artificial neural network (ANN) algorithm with a multi-layer perceptron (MLP) setting to calculate the probability of potential changes for the land uses in our study area. ANN is a learning algorithm, composed of several interconnected networks of processing units (artificial neurons) that is inspired by the biological neural network. This algorithm can learn from data and detect patterns with nonlinear structure by converting the system inputs to the user’s desired outputs. In ANN, artificial neurons are organized into layers. An MLP is a form of ANN with at least three layers that can be used to estimate the inherent relationships between the inputs and outputs of a model. It is also the most commonly used network model for image classification in RS (Yuan et al. 2009). The combination of the MLP approach and the back-propagation learning algorithms (or BP which is used in ANN to calculate the error contribution of each neuron after a batch of data is processed; for example, in image recognition multiple images) is one of the most common models of neural networks (Kazemzadeh-Zow et al. 2017). A typical MLP network is composed of an input layer, one or several hidden layers, and an output layer (Eastman 2015) which are used for entering data, processing data, analyzing data, and producing outputs, respectively (Yuan et al. 2009). In this research, the transition potential maps (TPM) values based on the land use changes between two time periods, 2002 and 2008, are used as the output layer (also called response variable), and the factors (also called the predictors) that derive the changes are organized in the input layer. In addition, the hidden layer was used to identify the relationships between the modified pixels and the criteria.

The MLP neural network links the response variable to the predictors using two procedures, forward and backward propagation steps, which are taken to make the corrective changes to the neuron connection weights. In the training stage, each sample is added to the model input layer and the receptor node collects all of the weight signals transmitted from all of the nodes to the node connected to the previous layer. Equation4 is utilized to calculate the input received by a simple node, j, in MLP.

$$ {\boldsymbol{net}}_{\boldsymbol{j}}=\sum \limits_{\boldsymbol{i}=\mathbf{1}}^{\boldsymbol{m}}{\boldsymbol{w}}_{\boldsymbol{i}\boldsymbol{j}}{\boldsymbol{o}}_{\boldsymbol{i}} $$
(4)

Where netj refers to the input that a single node j receives; wj represents the weight between node i and node j and oi is the output from node i. The output from a node i was calculated through eq. (5):

$$ {\boldsymbol{o}}_{\boldsymbol{j}}=\mathbf{f}\left({\boldsymbol{net}}_{\boldsymbol{j}}\right) $$
(5)

It must be noted that the f function is normally a sigmoidal nonlinear function used to calculate the total input weight before the signal reaches the next layer. During this process, after forwarding the signals, the activity of the output nodes is compared to the expected activity. However, under special conditions, the actual output of the modeling process differs from the target output due to the network errors. In this case, eq. 6, which is known as the delta rule, can be used to backward the errors so as to correct the relationships between the network components.

$$ {\boldsymbol{\Delta} \mathbf{w}}_{\mathbf{ji}\left(\mathbf{t}+\mathbf{1}\right)}={\boldsymbol{\upeta} \boldsymbol{\updelta}}_{\mathbf{ji}}{\mathbf{o}}_{\mathbf{i}}+\boldsymbol{\upalpha} {\boldsymbol{\Delta} \mathbf{w}}_{\mathbf{ji}\left(\mathbf{t}\right)} $$
(6)

In this relation, η is the learning speed, ais the stimulus factor, and δis the calculation error. The forward and backward transmission process continues until the features of all land use classes are taught to the network with the aim of obtaining the proper weight for the accurate relationship between the input layer and the hidden layer(s) as well as between the hidden layer(s) and the output layer. Equation (7) is used to calculate the number of hidden nodes.

$$ {\mathbf{N}}_{\mathbf{h}}=\mathbf{INT}\left(\sqrt{{\mathbf{N}}_{\mathbf{i}}\ast {\mathbf{N}}_{\mathbf{o}}}\right) $$
(7)

where Nh, Ni, and NO show the number of the hidden nodes, input nodes, and output nodes of the network, respectively (Eastman 2015).

Sensitivity analysis through adding/deleting approach

A challenging problem in modeling land use changes is the calibration of the models (Van Vliet et al. 2016; Wang et al. 2012; Wang et al. 2011; Silva and Clarke 2002). To achieve more accurate models, integrated methods are proposed with various algorithms (Yirsaw et al. 2017; Chang-Martinez et al. 2015; Muceku et al. 2015). Sensitivity analysis can be used for a better understanding of components on modeling procedure, identifying of errors, and performance of variables (Li et al. 2014). In order to improve the calibration of a land use change model, the present study utilizes a procedure to select the effective variables in the modeling and explores whether this procedure improves the accuracy of the model.

We used the adding/deleting procedure through the repetitive model calibration to examine the effect of the input variables that may also be considered as a sensitivity analysis procedure. This method measures the sensitivity of the model to various combinations of the variables by deleting and adding variables to the model (Ligmann-Zielinska 2013). Through a repetitive model calibration, every time a variable is excluded from the model to measure its impact on the performance of the model. Finally, the variables that have no positive effects on the accuracy of the model will be excluded from the model structure. This will result in a parsimonious model that is expected to also improve the accuracy of the modeling results.

CA-Markov

There are various approaches to modeling LULC, such as equation-based models, system models, statistical techniques, expert models, evolutionary models, cellular models (including cellular automata and Markov chain) multi-agent (Parker et al. 2003), and hybrid models including CA-Markov (Yirsaw et al. 2017; Chang-Martinez et al. 2015). In this study, a hybrid model based on coupling cellular automata and Markov chains (hear after CA-Markov) was used which is a multi-criteria evaluation method that simultaneously uses CA and Markov chain modules. Using the expected value of change through Markov chain analysis, CA–Markov applies a contiguity kernel to ‘grow out’ a land use map to a later time period using a CA function (Jokar Arsanjani et al. 2011). Thus, this method has the ability to convert the results of the Markov chain by means of a CA function to spatially explicit outcomes (Pontius et al. 2004).

CA–Markov model is a robust approach due to its ability to employ Geographic Information Systems (GIS) and remote sensing (RS) data and quantity estimating as well as modeling the spatial and temporal dynamics of LUCC (Jokar Arsanjani and Kainz 2011). In the CA-Markov, the CA approach is used to control Markov’s strict function for considering both the spatial and temporal changes at the same time. A range of data including demographic, biophysical, and socio-economic data can be used in the model to determine the initial conditions, and then parameterize the model to generate the probability of transitions, that can be used to define the neighborhood rules and preparing the transition potential maps (TPM) (Kamusoko et al. 2009). Figure 3 illustrates the overall process of this study.

Fig. 3
figure 3

Flowchart of the methodology and main steps in the study

Validation

In order to validate the modeling results, the produced maps by different approaches have been compared with land use map in 2014. To describe the difference between predicted maps and real map, disagreement parameters recommended by Pontius and Millones have been used in this study. According to this method, quantification error and allocation error are presented to explain the accuracy of the modeling process. Quantification error or quantity disagreement is defined as the number of pixels of a category in the predicted map which are not matched with the corresponding cells in the original map. Likewise, location error or allocation disagreement would happen when the position of a class in predicted maps is not compatible with a real map. In this method, the Figure of Merit (FOM) is derived through eq. (5) to show compatibility between changes in predicted and observed maps which range from 0 to 100% (Tajbakhsh et al. 2018; Pontius and Millones 2011).

$$ \mathrm{FOM}=\mathbf{B}/\left(\mathbf{A}+\mathbf{B}+\mathbf{C}+\mathbf{D}\right) $$
(8)

where A is the amount of observed change projected as persistence; B is the area of observed change predicted as change correctly; C is the observed change projected as change but in an incorrect class; and D is the area of error because of the observed persistence projected as change (Tajbakhsh et al. 2018; Pontius and Millones 2011).

In addition to the above method, the kappa coefficient which is still used in many similar studies (Alilou et al. 2018; Rimal et al. 2018) was used to validate the results of this study.

Results

Model configuration

In this study, the CA_Markov_ANN hybrid model integrated with adding/deleting applied to screen variables has been used to simulate LU/LC of 2014 based on LU/LC maps of 2002 and 2008. Markov chain and ANN output have been used as inputs of the CA model in order to predict LUCC. The initial required data for LUCC modeling are LU/LC and variables which affect LU/LC changes (Yang et al. 2015). To this end, at first, LU/LC maps were extracted for three time intervals (2002, 2008, and 2014), and effective variables on land changes were prepared according to the literature survey, previous studies carried out in this field and available information for the study area. Based on the former studies considering modeling LUCC, two types of information should be generated from the initial data for modeling process: (1) quantity of changes and (2) transition potential maps-TPM (Jia et al. 2018; Xu et al. 2018; Kazemzadeh-Zow et al. 2017; Zhai et al. 2016; Brown et al. 2013). For this purpose, Markov chain was used to generate “quantity of changes” for each class separately. In fact, based on the changes that took place between 2002 and 2008, the probability of a possible change has been estimated for 2014 through using MLP. At this stage, in order to improve the accuracy of the modeling results, the disturbing variables were identified by using the sensitivity analysis (adding/deleting approach) method and were eliminated from the model.

In this study, 17 variables were used in the study area (Table 5). It has been tried that these variables be selected from 4 mainly categories include utilities, socio-economic, physical, and finally environmental factors that were earlier considered by Aburas et al. in their study. First, all 17 variables (listed in Table 5) were introduced into the model and the model was run. To validate the result of modeling, the produced map was compared with observed LU/LC map of 2014. Each time, one variable was removed and the model was run to simulate LUCC for every class separately, and each time, the result was compared with the real map of 2014 and accuracy of the result was compared with the case where all variables have been used. Indeed, the variables which deletion increased the accuracy of the results were known as disturbing variables and were excluded from the model in the final model. This process was repeated for all six classes. Comparison between the results of the final and initial models demonstrated that the accuracy of the modeling improved by identifying and eliminating the disturbing variables.

Predicting LUCC was done through using the land change modeler (LCM) which is developed by Clark University in IDRISI-TerrSet software. Transition potential map (TPM) and the quantity of changes in land use maps were used to assess the simulation in 2014 (Tajbakhsh et al. 2018). Markov chain as the most common stochastic method (Kazemzadeh-Zow et al. 2017) was used to obtain TPM. The summary of Markov probability matrix (Table 4) shows that the highest probability is related to the transitions of different classes to “Built_up”. Among these classes, “Agriculture” has the highest probability to be changed into the “Built-up” areas, and then “Dense_Vegetation”, “Bare_Land”, “Water_Body”, and “Mangrove” were in the next ranks, respectively.

Table 4 A summary of the Markov probability matrix for simulating the transition between the primary and final cellular states

Modeling the transition potential maps using MLP neural network

Modeling of the potential land use change transitions links the changes between different classes in a time period to the variables that may derive the changes (such as slope, proximity to the road, and other variables). For example, it can show the potential of transition from barren-land to built-up at each cell in the form of probability. Switching from one land use class to another is introduced with the aid of sub-models known as the transition models which is a key parameter for MC (Zhai et al. 2016). Selection of the sub-models was also assessed in this research based on the dominance of the changes that had occurred in one region in different scenarios. In this regard, the scenarios containing variation prediction sub-models were selected using trials and errors, and eventually, the scenario offering the best description of the regional changes from 2008 to 2014 was used to predict the changes. In this study, in order to predict the changes in the land use map of 2014, 23 sub-models of the transitions from barren-land to built-up and agriculture, and also the transitions from agriculture to built-up were fitted using MLP-ANN. Figure 4 shows the rate of potential change of each class to other classes for 2014.

Fig. 4
figure 4

Changes potential maps based on land use classes for 2014. Agriculture (a), Barren_land (b), Built_up (c), Dense_vegetation (d), Mangrove (e), Water body (f)

Using adding/deleting approach to improve the model performance

In this study, through a repetitive modeling process, one of the variables was removed at each run, and its performance was compared with the model used all the variables. Figure 5 shows the final predicted map for 2014.

Fig. 5
figure 5

Land cover simulation for 2014

Table 5 shows the results of adding/deleting procedure in evaluating the effects of different variables on the model performance for each land use class. Therefore, in this process, the impact of each variable was measured on the modeling process and in order to increase the accuracy of the simulation, the inclusion of variables with a negative impact were prevented in the model. Out of the 17 variables used in the models, “Elevation”, “Distance to Mangrove forest”, and “Distance to the road” were the only variables that were involved in the modeling of the changes of all the six classes. Elevation which is known as a geographical driver was used to develop a new approach for LUCC simulation in China (Xu et al. 2018) and the positive impact of “Distance to Mangrove forest” as one of the specific variables in coastal areas was considerable. However, “Distance to Vegetation”, “Distance to built-up areas”, “Distance to coastline”, and “Soil Condition” had a positive effect on improving the accuracy of the models for only three land use classes.

Table 5 The values in the table show the amount of increment or decrement in the areas that are predicted correctly as the result of the deletion of the variable for each land use class. The negative values represent the variables whose elimination from the model reduces the accuracy of the simulation. Conversely, the positive numbers represent variables whose elimination improves the accuracy of the model. The last two rows of the table show the changes that were correctly predicted by the models before and after applying the adding/deleting procedure

According to the results of this section, the correctly predicted land use changes increased from 6.09 to 7.2 km2 after applying the adding/deleting procedure. Increasing the accuracy of modeling has predominantly occurred in Barren lands and Mangrove classes (Table 5).

Table 6 shows the level of accuracy for each land use class before and after using the adding/deleting procedure. The results revealed that, with the exception of the “Dense Vegetation” class, the accuracy of the predicted changes for all the land use classes increased after calibrating the model by adding/deleting procedure. For “Dense_Vegetation class”, the accuracy was reduced by approximately 0.01 km2, which is not a big number in comparison with the size of the study area. This class, which is much affected by the annual precipitation, suffered from large changes. In total, the areas that were correctly predicted increased by more than 1.1 km2 overall after using our approach, which represents more than 18% of the study area that has been changed.

Table 6 Validation of the model before and after adding/deleting approach

Among all the classes of the land use maps, the highest increment in the level of accuracy was attributed to Bare_Land, Mangrove, and Agriculture. In addition, the highest and the lowest numbers of the variables that had a positive effect on the accuracy of a class belonged to the prediction by the Water_Body class (i.e., 15 variables) and the Dense_Vegetation class (i.e., 10 variables), respectively (Fig. 6).

Fig. 6
figure 6

Green bars show the correct predicted changes after adding/deleting and the red bars show correct predicted changes before adding/deleting (a left, unit: km2)- ). The number of variables used to model in every class (b right)

Results of validation

To validate the methods of modeling in this study, the 3D method which is suggested by Pontius-Millones has been used. This technique has been recently used in similar studies (Varga et al. 2019; Tajbakhsh et al. 2018; Yao et al. 2017). In this method, the FOM is derived through the percentage of “change simulated correctly” dividing on all agreement/disagreements. As seen in Table 7, the main sector of the study area (more than 96%) is simulated correctly as persistence in four methods. It has also been reported in a study conducted by Tajbakhsh et al. (2018), and due to its extensive range, this section is removed from the graph in Fig. 7. According to table, the “change simulated correctly” range and the FOM factor (as the final factor for accuracy evaluation in 3D method) in the proposed method of this study are more than other methods used for comparison. Also, the percentage of “Persistence simulated as change” as one of the disagreements is lower than others. Details of this comparison are also evident in the diagram shown in Fig. 7.

Table 7 Validation of the used modeling approach in comparison with other modeling methods include CA_MC_ANN (cellular automata-Markov chain-artificial neural network), MC_ANN (Markov chain-artificial neural network), and CA_MC_LogReg (cellular automata-Markov chain-logistic regression)
Fig. 7
figure 7

Agreement and disagreement components in 1-CA_MC_LogReg, 2-MC_ANN, 3-CA_MC_ANN, and 4-CA_MC_ANN_SA

In addition to the Pontius-Millones method, in this research, the standard Kappa index, which is also used in many other modeling LUCC studies (Yulianto et al. 2018; Kazemzadeh-Zow et al. 2017; Zhai et al. 2016; Han et al. 2015) was used to measure the accuracy of the model predictions and verify the validation of the modeling. In this approach, the general correction is the proportion of the similar pixels to the total number of pixels which is being compared. Therefore, in order to validate the results of the CA-Markov combinational model, the simulated maps from 2014 were compared with the real map and the Kappa standard coefficient for the results was 0.94. The results of the comparison by the differentiation of the land use classes are shown in Table 8. As illustrated in the table, the maximum of the Kappa coefficient is related to the barren land and water body which have a larger area in comparison with other classes. Also, a visual comparison between simulated map from 2014 after adding/deleting with the simulated map before adding/deleting demonstrates increment of accuracy in the simulated map using adding/deleting approach (Fig. 8).

Table 8 Kappa standard coefficient for each class
Fig. 8
figure 8

Comparison of the simulated map after adding/deleting (top) with the simulated map before adding/deleting (down)

Discussion

In general, the accuracy has always been an important issue in studies whose purpose is to model and predict changes of phenomena or their behaviors including LUCC. This is even more important in modeling and simulation of land use changes (Olmedo et al. 2015) as achieving such the model with high accuracy is a difficult task, and using the suitable modeling approach in order to achieve the goals, is essential (Brown et al. 2013). Cellular automata (CA) has been widely used to simulate spatiotemporal properties of complex systems and environments (Yang et al. 2012; Clancy et al. 2010). Recently, hybrid models in which CA was integrated with other models such as Markov chains, logistic regression, neural network, etc. have been developed to simulate and predict LUCC (Rimal et al. 2018; Yirsaw et al. 2017; Chang-Martinez et al. 2015). Each of these models individually has capabilities and also can act as a complementary of others. More or less, CA-Markov modeling has given promisingly accurate and reliable results in most of their applied fields in the geographic and spatial domain (Ghosh et al. 2017), such a way that it can be said CA-Markov has become the most common method for LUCC simulation (Yulianto et al. 2018). In fact, it can be claimed that the result of modeling using this method is comparable to actual LULC changes (Halmy et al. 2015). Nevertheless, attempts to increase the accuracy of this model and other LUCC modeling approaches are followed by the scientific community (Brown et al. 2013).

In this study, CA-Markov was used to model LUCC in the Qeshm Island. In addition, the ANN was used to simulate the behavior of the pixels due to its ability to model nonlinear relationships in complex systems (Yang et al. 2015; Pijanowski et al. 2014). Although the ANN can determine the locations of the changes, it has low efficiency for determining the amount and type of changes. The empirical and dynamic methods are used to solve this problem. The Markov model can depict the direction of LUCC shifts and predict the future land requirements for land use categories by taking into account the influence of related factors on land use requirements (Han et al. 2015). Therefore, a hybrid model, which combines these three parts, can increase the modeling efficiency. The high accuracy of this hybrid method was mentioned by Zhai et al., but this does not mean that this model is perfect (Zhai et al. 2016) and improving the efficiency of this method is still required.

Variables used for LUCC modeling can help us to understand the causes of change and are also an important part of the simulation (Zhai et al. 2016). Furthermore, identifying and determining the variables affecting the pattern of land use changes is one of the issues that help us ameliorate the performance of the model (Aburas et al. 2016; Mishra et al. 2014). Given that the pattern of land use changes varies from region to region and depends on geographical conditions and characteristics, it is difficult to come up with a list of definite and identical variables. Different set of variables have been employed for modeling LUCC in various studies (Zeng et al. 2015; Chowdhury and Maithani 2014). Previously, efforts have been made to identify the variables that are more appropriate for the modeling process of LUCC (Zhai et al. 2016). For example, in a study by Dubovyk et al., two sets of factors were introduced including firstly, the factors determined by the experts and secondly, the factors that could be selected through the literature (Dubovyk et al. 2011). Aburas et al. (2017) also mentioned that variables could be selected based on experts’ experiences (). In our study, however, we used the adding/deleting approach that is also useful to explore the contribution of different variables to account for the changes in the models. We argue that incorporating such the approach in studies can help us identify the effective or useless variables, and consequently, improve the model calibration.

Analyzing and comparing the simulated with the observed maps for the year 2014 shows that the CA-Markov model is generally reliable for modeling and predicting the LUCC in the Qeshm Island. However, as also discussed by Zhou et al. (2012) in their study, for classes that are highly sensitive to environmental or socioeconomic factors (i.e., the changes in these factors can lead to notable changes in the classes), we may need further information or appropriate incorporations of additional variables in the modeling process. For example, a Built-Up class is affected by management decisions and regional development plans. Thus, we need to incorporate the management policies into the modeling process to obtain more accurate results for this class. While the CA-Markov model implements the contiguity rule to simulate changes in a land use class due to the proximity of the class to a similar class within the scope of the study (Kamusoko et al. 2009) and does not consider socio-economic factors and policies in modeling process. This issue has also been highlighted in a study performed by Jokar Arsanjani et al. (2011).

The strong reliance of the Dense_Vegetation class on the precipitation, which can cause difficulty in the process of simulation and prediction of its changes, is a significant issue in this study. While in areas where precipitation is not considered a limiting factor (for example, in the eastern and southeastern parts of China), it is expected that socio-economic factors play a major role (Shi et al. 2018). This means that, due to the importance and influence of managerial factors and human decisions on land use changes in the region, the results showed in order to achieve a higher accuracy, we need to apply bio-physic, socio-economic and human variables in the modeling process (Guerrero et al. 2018; Jokar Arsanjani et al. 2011). For instance, multi-agent models have recently been developed to simulate land use changes; hence, taking human behavior into the expansion of built-up usage would certainly increase prediction accuracy (Jokar Arsanjani and Kainz 2011; Parker et al. 2003). Despite the problems mentioned above, the present study achieved its goal of identifying effective and disturbing variables using the adding/deleting technique and ultimately improved the accuracy of the results. With taking the achievements of this research into account, using the adding/deleting method can improve the accuracy of “Cellular Automata-Markov Chain-Artificial Neural Network” for land use changes modeling.

Conclusions

We developed an adding/deleting approach in this study as an innovation to improve efficiency of model and then applied it in the Qeshm Island to explore whether it could identify the effective and disturbing variables in the hybrid land change model and improve the accuracy of the simulation. We identified the disturbing variables using the proposed approach and then excluded them from the hybrid model, which ultimately resulting in an increased accuracy of the land use change predictions. The area of the correctly predicted classes increased to 7.2 km2, which was greater than 6.09 km2 without using the proposed approach. Among 17 variables that have been used in the modeling process, “Elevation”, “Distance to Mangrove forest”, and “Distance to the road” were introduced as the positive variables used to simulate all classes. Therefore, the significance of these variables is more than other variables.

The proposed adding/deleting approach and its integration with CA-Markov for screening the effective and disturbing variables can help to improve the performance of model. The findings confirmed the importance of model configuration and the appropriate framework of the effective elements in the modeling process in the study area since these elements can affect modeling outputs in different ways. Particularly, socioeconomic and managerial factors should be seriously taken into consideration when performing LUCC modeling. Due to the climatic conditions in the study area, which is located in a dry climate, the strong dependence of vegetation on annual precipitation is evaluated as a challenge in the modeling process. However, this issue is recommended to be assessed in similar climates. Furthermore, this article is limited to the Qeshm Island in the Persian Gulf and the implementation of the same approach for other regions especially in different climate zones and different countries is highly recommended.