Introduction

Urbanization is the most contemporary phenomenon of urban development, which is also called urban revolution (Turok and McGranahan 2013). According to UN predictions, the urban areas will inhabit more than half (50%) of the rural population by 2050. This phenomenon is one of the major problems in most regions of the world, as chaotic urbanization and lack of good planning have resulted in the expansion of unplanned urban areas (Jafari et al. 2016). Rapid urban expansion, in developing nations in particular, is accompanied by major socio-economic problems (Nguyen 2015). This phenomenon is accomplished by misusing natural resources, which directly disturbs the urban ecosystem and makes the urban environment more susceptible to many enforced risks (Keith et al. 2019; Kharel 2010; Cui et al. 2019).

The rapid urban growth and land use land cover (LULC) transformation led researchers and decision-makers to give much attention to the impacts of rapid urban growth on the ecosystem and livelihood of the people (Hasan et al. 2020; Deribew 2020; Shao et al. 2021; Assefa et al. 2021). Prediction and simulation of urban growth and LULC change patterns have become essential to environmental protection and sustainable urban development (Mishra et al. 2021). Furthermore, the urban landscape’s complex urban system and structure must be clearly understood to forecast future spatiotemporal urban growth dynamics and LULC changes (Rimal et al. 2018; Rana and Sarkar 2021). In the case of urban growth simulation, the growth processes and historical information must be taken into account to accurately determine spatiotemporal changes and interactions (Gómez et al. 2020 and Alaei et al. 2020), so the process of identifying the fundamental driving factors of urban growth and LULC changes can be realized using simulation techniques (Goncalves et al. 2019). Geographic information system (GIS) and remote sensing (RS) play significant roles in understanding the spatiotemporal dynamics of urban growth and LULC patterns, as well as determining the influential driving factors of these complex urban structures (Liping et al. 2018).

The RS and GIS techniques are widely used to assess and monitor spatiotemporal urban growth dynamics and LULC change patterns (Berila and Isufi 2021). Recently, these techniques have proven to be effective tools for supporting planners and policy makers to develop sustainable policies and make accurate planning decisions. RS and GIS provide many advantages to researchers and urban planners. First, it helps to analyze data spatially and offers various options such as modeling (Akintunde 2019). Second, these techniques provide the interpretation of physical data with socio-economic data, and provide an important linkage in the planning processes (Jat et al. 2017). RS and GIS techniques also help visualization of complex spatial patterns and relationships and help change analysis (Aithal and Ramachandra 2016). These techniques are accessible at a low cost to planners and researchers (Rawat and Kumar 2015). Both effectively monitor, evaluate, analyze, map, and control spatial phenomena (Jat et al. 2017; Aswal et al. 2018). Because of these and other advantages, researchers, urban planners, and policy makers heavily depend on RS and GIS techniques to simulate urban growth patterns and future LULC changes.

Currently, different methods and models utilizing RS and GIS techniques are being applied to model urban growth patterns and forecast LULC changes (Liping et al. 2018). Some studies have used cellular automata (CA)-based Markov chain (MC) models for urban growth modeling (Al-sharif and Pradhan 2014; Abutaleb and Ahmed 2016; Altuwaijri et al. 2019; Aburas et al. 2021), while others applied quantitative methods like logistic regression to simulate urban growth and LULC changes (Alsharif and Pradhan 2014; Tripathy and Bandopadhyay 2018; Myagmartseren et al. 2020; Salem et al. 2021). Some studies have used the integration of different dynamic simulation and statistical models or hybrid models (Arsanjani et al. 2012; Hamdy et al. 2016; Mozaffaree et al. 2021). For instance, some studies used the integration of CA-based Markov chain (MC) models with logistic regression model to obtain more realistic and appropriate results. Urban growth modeling based on geospatial technologies has been applied to understand better the complex urban growth processes within a specified time. This enables us to develop effective urban growth and land use policies for sustainable urban development and appropriate resource allocation (Hisham et al. 2019).

Integrating cellular automata (CA) models with geographic information system (GIS) introduces a new way of modeling urban growth and LULC changes in a realistic approach, allowing for new possibilities. A robust methodology for monitoring urban complex systems integrates GIS spatial analytical tools with spatial dynamic models such as CA (Mohammadi et al. 2013; Samat 2009). CA model was developed and proposed as an alternative to a pre-existed traditional spatial models in the mid 1988s because of its simplicity, capacity to do dynamic spatial simulations, and capability for producing high-resolution modeling using GIS and remotely sensed data (Maithani 2010a; Fitawok et al. 2020; de Almeida et al. 2003; Blecic et al. 2004). The traditional models such as the concentric zone model and Lowry and Grain-Lowry model were more static and criticized by researchers for being hyper comprehensive, data hungry, and complicated (Maithani 2010a). The ability of the CA model to simulate and predict urban growth patterns and land use changes mainly relies on the assumptions that the previous changes affect the future through local and regional interactions among different land use types (Aburas et al. 2021). The cellular automata (CA) model is highly influenced by cell size, neighborhood type, and size, which have to be considered significant factors while obtaining the best modeling results (S. Q. Wang et al. 2012). CA model is commonly used to model and simulate the changes, dimensions, and trends in urban growth and land use changes (Sang et al. 2011; Aburas et al. 2017). It can be used to analyze and summarize changes in urban and non-urban lands based on the number of transition area probabilities from one status to another status over a specific period (Silver and Silva 2021). MC cannot simulate changes in spatial trends. But it is an effective model to simulate the extent to which land uses have been changed (Azari et al. 2022).

Previous studies have attempted to project the future urban growth and LULC changes in various parts of the world using CA–Markov modeling and determine the factors aggravating it through the application of multi-criteria evaluation (MCE) and analytic hierarchy process (AHP) techniques. For instance, Jafari et al. (2016) tried to monitor the major land use changes from 1989 to 2013 and forecast the future sprawling of the Hyrcanian region, Iran, for 2023 and 2037 using CA-Markov models. This study revealed that there would be a rapid urban expansion for the target years, with 11,510 ha and 18,320 ha of urban expansion expected, respectively. Tajbakhsh et al. (2016) also examined and modeled urban sprawl and its consequences on the land use land cover changes observed in Mashhad City, Iran. Results of this study on the projected LU maps for the year 2025 indicated that the city’s development will be more intense than in the base years. Keshtkar and Voigt (2016) used CA-Markov coupled with the MC model to investigate spatiotemporal landscape changes. This was done to model future landscape changes in the research area of Central Germany from 2020 to 2050. This study found that the continued illegal growth of urban surfaces and radical land use transformation will drastically decline natural resources.

Though they were few, studies were conducted on urban growth in the study area of Bahir Dar. Research by Wubante et al. (2020), for example, attempted to investigate the ecosystem of expropriated land and urban expansion in Bahir Dar City based on sociological research techniques/surveys. Fitawok et al. (2020) used CA–Markov model to forecast the effect of urban expansion on LULC changes in Bahir Dar and simulated the possible LU changes for 2025, 2034, and 2045. However, these studies have shown several limitations. Firstly, these studies have focused on analyzing driving factors over one specified period. This indicates enough attention has not been paid to long-term spatiotemporal dynamics of urban growth and land use change and its driving factors. Second, these studies have failed to incorporate various proximity and physical factors in their model.

Only few factors such as slope and road were considered main driving factors of urban growth processes. These limited independent variable models will lead to biased results and conclusions. But the present  study has focused on multiple physical and proximity factors to fill the gaps and solve the problems identified in previous studies. For instance, the study was conducted for long-term study periods (1984–2049), consisting of six study periods and 65 years of spatiotemporal urban growth dynamics in Bahir Dar. But studies conducted before gave much attention to socio-economic factors that drive urban sprawl because of lack of quality data, so more investigations were required to verify the presence of rapid urban growth and model the continued growth trends of the city for the coming year. Though severe environmental and socio-economic problems have resulted from rapid and uncontrolled urban growth in Bahir Dar, previous studies lack to provide detail and critical analysis of these problems, so no study focused on critical analysis of urban growth, driving factors, and future scenarios until this study was conducted in Bahir Dar. This study was conducted to fill the gaps not covered by previous research in the city and its surroundings.

Previous works indicate a need to forecast and quantify the location and extent of urban growth and LULC changes over time and space. The study area Bahir Dar is chosen for this study because of its fastest growth in recent decades. The urban growth dynamics of the city have not been well documented and adequately analyzed. In addition to its rapid population growth, the city is the hub of socio-economic and political activities of the region in particular and the country in general. This multi-functional nature of the city results in high population and economic flow, which leads to rapid urban growth. In developing countries like Ethiopia, the socio-economic and political conditions may have affected the urban growth, which further leads to rapid and uncontrolled urban growth of cities. The main problem in Bahir Dar is the rapid transformation of land cover types from non-urban to urban landscape, mainly conversion of agricultural fields and forest land to a built-up environment. This is primarily due to the high demand for housing, expansion of transport networks, and industrial and commercial buildings at the expense of agricultural and forest lands. This affects the environment and socio-economic conditions of the city and its surroundings, so this economic transformation and rapid population growth result in unplanned urbanization and change in land use patterns of Bahir Dar City. This problem needs to be evaluated to design an appropriate urban planning system.

The city of Bahir Dar and its surroundings experienced rapid urban growth and fast LULC transformation in the last three and half decades. As a result of this continuous growth, non-urban lands have been converted into urban lands (impervious surfaces), affecting people’s livelihood and urban ecology. The current study simulates future urban growth patterns and LULC changes through integrated CA–Markov and MC models in Bahir Dar City, Ethiopia. Firstly, LULC maps of the target years (1984, 1994, 2009, and 2019) were prepared using ArcMap and ERDAS Imagine. Secondly, transitional matrixes were computed from LULC maps using the Markov model to simulate significant changes in the land use/land cover patterns. Thirdly, CA-Markov model have been applied to forecast the spatiotemporal distribution of LU from 2019 to 2049 using a transitional matrix and transitional suitability maps. No study has been conducted on the simulation of urban growth in Bahir Dar and its surroundings using an integrated CA-Markov model for long study periods with multiple variables. The use of high-resolution SPOT satellite images as a reference for LULC mapping provides accurate results and proper conclusions. This study could significantly contribute to plan the future urban growth management in Bahir Dar. It can be an instrument for a decision support system for urban planners and policy makers to design urban expansion plans with sustainable habitat development.

Methods and materials

Study area

Bahir Dar is a city in Ethiopia’s northwest region. It is divided into nine sub-cities and 32 rural kebeles, comprising an area of 213.55 km2. The absolute location of the study area lies between 11° 30′ 11″ latitude and 37° 29′ 4″ longitude (Fig. 1). The elevation of the study area is ranging from 1117 to 2008 m above sea level. The two most important factors influencing urban growth and land use land cover changes are population growth and economic development (Habibi and Asadi 2011; Lity 2021).

Fig. 1
figure 1

Study area

Urban growth in Bahir Dar is occurring mainly due to economic development in the city, rapid growth of population in the outskirts of rural areas, and the political situation of the country in rural areas (Mehriar et al. 2020; Turok and McGranahan 2013). The bad political situation and maladministration in the rural regions forced the rural community to migrate from rural to urban areas. In addition to this, economic centrality, land use policy, rural–urban migration, socio-political factors, and the process of industrialization are the drivers of the urban growth and LULC changes in the city (Appelhans 2017; Fitawok et al. 2020). As data taken from Ethiopian Central Statistics Agency (CSA) indicated, the research area had a total population of 54,800 in 1984, which increased to 96,140 in 1994, 221,991 in 2007, and 348,429 in 2017. According to the Bahir Dar city administration, the current projected total population of Bahir Dar is 389,177.

Data sets

The Landsat data which was used for this study has been taken from the USGS website https://earthexplorer.usgs.gov. These data consist of dry season cloud free Landsat images of thematic mapper (TM) for the years 1984, 1994, and 2009 and operational land imager (OLI) (Table 1 and Fig. 2). All the images used for this study have been freely acquired during the dry seasons from Dec. 24 to April 6 for all study years. Furthermore, all the Landsat images and Shuttle Radar Topography Mission digital elevation model (SRTM DEM) with 30-m resolution of the study area were projected in UTM zone 37 N. In addition, the Ethiopian Mapping Agency (EMA) provided high-resolution SPOT 5 and 6/7 images with 1.5 and 5-m resolutions, as well as a topographic map of Bahir Dar for the year 1988 (scale 1: 50, 000) for this study. The administrative boundary vector data covering the research area has been obtained from the Geospatial Center of Bahir Dar University. The required population data for 1984, 1994, 2007, and 2017 have been collected from Ethiopia’s Central Statistics Agency (CSA). Road network and water body data were taken from Google earth, Open Street Map, and USGS websites.

Table 1 Data set of Landsat time series images
Fig. 2
figure 2

Methodological flowchart

Pre-processing and classification of images

The satellite imageries have gone subjected to digital analysis. Image pre-processing and various image enhancement techniques such as layer stacking, band composite, and sub-setting and post-processing have been applied on the processed digital data (Fig. 2). In this regard, both radiometric and geometric corrections have been applied to improve the quality of all acquired images. Furthermore, the projection for the satellite imageries was made to the Universal Transverse Mercator projection system, 37 N zone, and datum of world geodetic system WGS84 to ensure the uniformity of the data set during analysis. Arc map 10.3 and ERDAS imagine 2013 were used to pre-process and analyze the data for this study. To apply geometric correction, digital numbers (DN) were converted into spectral radiance and images were processed for radiometric calibration. There are different image enhancement methods to acquire the required information from images. Histogram modification, filtering, and band compositions are some of the methods. The band composition method have been applied in the present study to distinguish the different category information from the image. The false color composite (FCC) of the Landsat images have been  used to classify and map the LULC maps. Maximum likelihood supervised classification mechanism was used for image classification and thematic map preparation. Li et al. (2014), Ismail (2009), and Lindström et al. (2006) have used maximum likelihood supervised classification technique for their study. Maximum likelihood supervised classification is more effective and can provide better accuracy. The authors did this classification scheme (Table 2) based on the land use land cover classification method developed by Hassan et al. (2016).

Table 2 Land cover classification scheme

The Landsat images downloaded freely from the USGS website have been directly imported to ArcMap for pre-processing, layer stacking, band composite, and sub-setting. These images were exported from ArcMap for further processing. After that, the exported images were imported to ERDAS image for classification. The authors have taken a total of 360 training points from 6 LULC classes; 60 samples per LULC class were taken randomly. High-resolution SPOT images of 2006 and 2016 have been used as a reference to identify land use classes of 2009 and 2019 and Google maps with a prior knowledge of the researcher. Aerial photo of 1982 and topographic maps of 1984 for Bahir Dar have been used for reference and accuracy assessment of the classified land use map. The map of 1994 was classified and validated using Google earth map, with prior knowledge of the researcher and elder people in the study area. The accuracy for each study period was evaluated using quantitative metrics of overall accuracy and kappa statistics. This accuracy assessment has been evaluated and measured using a stratified sampling approach. The overall accuracy of the land cover maps has been presented as 97.14%, 07.45%, 97.30%, and 97.76% for 1984, 1994, 2009, and 2019, respectively (Table 3). The kappa statistics values for reference years were calculated and the results have been presented as 0.92, 0.89, 0.93, and 0.85 for 1984, 1994, 2009, and 2019, respectively.

Table 3 Accuracy assessment results for land use land cover maps

Potential driving factors of urban growth

Driving factors of urban growth attracted increasing attention (Guangjin et al. 2016; Liao et al. 2017; Hamdy et al. 2017; Li et al. 2018). According to the literature, various factors are affecting urban growth processes (hinder or promote) such as physical factors (elevation and slope), social factors (population density and social services), environmental factors, political factors (policies), and connectivity factors (distance from road). In this study, distance from the existing urban area, distance from the international airport, distance from forest land, distance from public services, distance from the city center, distance from river, aspect, slope, road distance, and elevation were considered as factors affecting urban growth processes and land use change.

The influence of urban growth factors varies from place to place. So, to select factors of urban growth, an urban planning expert’s opinion, interviews with elders, and similar studies carried out by earlier researchers have been used. Population density is crucial for urban growth and land use change because it is directly linked to the urban market, rigid demand, and agglomeration capability (Li et al. 2018; Surya et al. 2020). However, in the present study population density has not been considered as the main driving factor due to insufficient data.

Elevation and slope were usually restrictive factors to urban growth, especially for areas with poor natural conditions and low development (Li et al. 2019). Distance to a river affects urban growth in two ways. First, urban growth is restricted by the presence of rivers/lakes. Second, water resources near the city are an advantage and provide waterborne possibilities that promote urban development into urban fringes (Yanping and Zhen 2019). In Bahir Dar, the presence of Lake Tana and Abay/Blue Nile River has a significant role in urban growth and LULC changes. These characteristics of water bodies and its influence on urban growth do not vary over time.

Distance to a city center is also considered one of the proximity factors. This variable impacts urban growth and land use change through location effects (Kasraian et al. 2019). The closer a region is to the city center, the higher its probability of urban growth. The continuous and rapid urban growth pattern and LULC change in Bahir Dar is the main explanation for this phenomenon because areas near the city center have a high advantage in accessing socio-economic resources with low transport costs. The other potential factor of urban growth and LULC change is distance from the main road, which means transportation accessibility factor or variable characterizing traffic distance of the city. Transportation accessibility factors usually promote urban growth and rapid transformation of land covers. Urban growth along the transport lines (main road) is one of the main urban growth patterns. Transport infrastructure in Bahir Dar has been improved in the last 35 years (1984–2019). This promotes rapid urban growth and LULC change for the last three and half decades and will continue in the future; hence, the CA–Markov model results show these trends.

Pair-wise comparisons of the factors affecting urban growth were conducted to obtain the weight and effect of each driving factor on urban growth processes and LULC change. At first, the matrix of pair-wise comparisons of factors to criteria and criteria to each other is formed and criteria scoring to each other and factors to criteria are performed using an expert in Delphi technique and the field of view. The Expert choice software has been used in the hierarchical analysis process method to calculate each factor’s weight. Others would have its influence on rapid urban growth and LULC change. However, the above factors are the main driving forces of rapid urban growth and land use in Bahir Dar.

Markov chain model and CA Markov

The future trends and spatial structure of urban growth have been modeled using a combination of MC and CA. MC is a quantitative simulation model that can describe urban sprawl dynamics and project future urban growth dynamics depending on transition probability matrix (Maithani 2010b). Cellular automata (CA) are a computational method capable of simulating urban growth processes by describing a complex system through a set of simple rules (Wang et al. 2021). It was designed for urban growth to simulate the process of urbanization in a hypothetical region. It consists of rules that describe the spatial interaction of cells and the set of parameters that led to the exploration of different urban forms. CA’s ability to simulate urban growth, change land use, and population expansion has become suitable for simulating complex geographical processes (Falah et al. 2020). MC helps predict future land use and provides quantitative information extracted from time series satellite images, which will be used for future urban planning and resource management. It can indicate and explain the probability of changes of land use classes from one to the other based on the information obtained from time series satellite images of two time intervals. Future simulations were done for the coming 30 years in three periods using the proposed model to forecast future LU change and urban growth dynamics depending on LULC classes taken from satellite images (2019–2029, 2029–2039, and 2039–2049). This can also be expressed (Fan et al. 2008):

$$P\left({x}_{t}= \frac{j}{{x}_{0}}= {I}_{0 }, {X }_{1}= {i}_{ 1},\dots , {X}_{ t-1}= {i}_{ t-1}\right)=P({x}_{t}=\frac{J}{{X}_{t-1}}= {i}_{ t-1})$$
(1)

If a Markov sequence of random variant takes the discrete value a1, …………an, then:

$$P\left(x_n=\frac{a_{in}}{x_{n-1}}=a_{i\;n-1\;}\dots,\;X_1=a_{i\;n-1}\right)=P(x_n=\frac{a_{i\;n}}{a_{i\;n-1}})$$
(2)

where P is the transitional probability matrix, xt is the state of cell x at time t, Xt−1 is the state of cell x at earlier time, and it−1 refers to the state of cell i at earlier time. The sequence xn is named a Markov chain. The CA–Markov model has the advantage of providing a good projection of spatiotemporal LULC changes and urban growth changes in quantity and space (Parsa and Salehi 2016). CA uses a bottom-up approach, making it more appropriate to incorporate temporal and spatial components in given processes (Santéet al. 2010; Rimal et al. 2017). A discrete grid dynamic system characterizes this model with regular divided spatial cells and time progresses in discrete steps. It updates the state of each cell using local rules; the state of each cell at a given time is determined by the state of its own and its neighbors at the previous time step (Torrens 2001). The results of the investigations reported by Liu et al. (2021), Liu and He (2009), Helbich and Leitner (2009), Triantakonstantis and Mountrakis (2012), Halmy et al. (2015), and Waseem et al. (2017) confirm that the integration of MC–CA models, multi-criteria evaluation (MCE), and AHP are the most fundamental decision-making operations and enable the evaluation of group decisions systematically and logically. Previous studies have used cellular automata-based Markov chain model to simulate the future spatiotemporal dynamics of urban growth and land use change pattern. For instance, Samat et al. (2011) used CA model to simulate the land use changes at peri-urban areas of Perai region, Penang State, Malaysia. This study applied the cellular automata modeling using the Land Change Modeler module of IDRISI Selva software. It reveals that all the approaches used in this model are very simple and can easily be implemented to foresee the future spatial pattern of urban growth using different policy approaches.

Study by Zhu et al. (2021) used cellular automata to simulate urban land use change with a constraint irregular space representation in Nanjing City, China. The study applied this model in the same application used by previous research (IDIRISI software) to simulate the land use changes. Mozaffaree et al. (2021) used cellular automata model for simulation of urban expansion in Tallinn, Estonia. All the necessary data were processed in ArcGIS and then the CA modeling was applied in IDRISI Selva Land Change Modeler module. The other study carried out by Falah et al. (2020) used CA and AHP to model urban growth in Qazvin City, Iran. This study detects the extent of urban sprawl using Shannon’s entropy through remote sensing and GIS. Simulation of future growth of the city was done using the IDRISI Selva software.

The LULC maps were input for CA-Markov models. The LULC maps prepared in the ERDAS software were imported to the IDRSI Selva software and land use maps (earlier and later) were processed using the Land Change Modeler module to harmonize the different map elements. The conditional transition images were created using a Markovian transition estimator from the earlier (2009) and later (2019) images/LULC maps. The simulation used CA-Markov models based on Markov transition area files and transition suitability image collections. The land use map of 2019 was used as base year and starting point to predict the urban growth of Bahir Dar for the upcoming 30 years. The general steps implemented in the IDRISI Selva software to apply CA-MC models are the following: (1) construction of CA-Markov suitability maps; (2) generation of the transfer matrix and the state of transitional probability matrix using Markov model; and (3) production of future land use using CA. In this study, the transition probability/potential matrix was derived from the land use conditions in 1984–1994, 1994–200, 2009–2029, and 1984–2019, so the urban growth map of 2019 was used as a reference map to simulate 2029, 2039, and 2049 by computing the matrix transitional areas. The mapping and analysis were conducted using ArcGIS 10.3, ERDAS Imagine 2013, and IDRISI Selva 17.0 (Table 4).

Table 4 AHP evaluation scale (Saaty 1987)

Analytical hierarchy processes

Analytical hierarchy processes (AHP) are multi-criterion decision-making techniques (Saaty 1987). It provides a thorough and rational framework for making decisions and comparing alternatives (Waseem et al. 2017). AHP is utilized to compute the weight of variable maps which pass through fuzzy standardization processes in the IDRISI Selva software using pair-wise comparison of factors involved in urban growth processes (Table 5). It also gives the decision-maker more options (Ozdemir and Saaty 2006). There are steps to AHP modeling: (1) structuring the hierarchy. AHP entails defining the problem from top to bottom, deciding on criteria, and developing alternatives (Saaty 1987). Second (2) is computing decision matrix and pair-wise comparisons. This phase compares a factor at one level to another at the higher level in the hierarchy. The weights of maps used for identifying the influential factors of urban growth and LULC changes are computed using weights given to criteria. The formulas below were used to calculate the weight of the criteria (Mohammadizadeh et al. 2016):

  1. (a)

    Prepare a (nxn) pair-wise comparison matrix for multiple factors. It is proposed that aij = the chosen factor (factor i to factor j). Then, aij = 1/aij was assumed.

  2. (b)

    For this calculation, the matrix values are normalized, and a sum of each column is computed. Each binary comparison element is divided by the total value of the column in which it is located. The normalized comparison matrix values were obtained using these calculation procedures.

  3. (c)

    The mean values of each row in normalized comparison matrix are determined. These averages represent an estimate of relative priorities for the compared elements.

  4. (d)

    Finally, the consistency ratio is checked. Each matrix’s eigenvectors and maximum eigenvalue were computed. Then, the consistency index is then using the procedures below.

    $$CR= {\uplambda }_{\mathrm{max}}-n/n-1$$
    (3)

where λmax represents the eigenvalue of the paired comparison matrix.

Table 5 Pair-wise comparison matrix generated from AHP

The consistency ratio (CR) of the paired comparison matrix was tested as follows:

$$CR=CI/R$$
(4)

where RI is the random consistency index.

A valid CR has a value of 0.10 or 10% or less. A CR value greater than 0.10 indicates the AHP process needs a regenerated pair-wise comparison matrix (Saaty 1987). The pair-wise comparison process was repeatedly done until the CR < 0.1 value was obtained.

Previous studies identified various factors responsible for urban growth in different urban centers. These factors could have a different level of influence over space and time, so to minimize the limitations regarding this variation, the researcher selected the factors in a very wise and careful way. First, the researcher selected 15 factors of urban growth and land use change from previous literatures. Then, interview was conducted with local experts and elder people about the factors. Among the 15 factors, five factors were removed from analysis based on the data availability and context of the study area by urban planning experts. Factors removed from this study were population density, land value, GDP, proximity to railway, and proximity to the nearby satellite towns. In this study, the pair-wise comparison of the relative importance of the variables is compared using Saaty’s 1–9 scale in the IDRISI software (Table 4), so expert knowledge in the area resulted selection of 10 factors for this study. The researcher prepared pair-wise comparison questionnaires for 12 local urban planning experts. Based on the response of these experts, the researcher computed the weights of urban growth factors using AHP in IDRISI, so the relative importance of factors was determined by the values given by experts. Furthermore, the consistency ratio (CR) is used to check the accuracy of the values assigned to the factors by experts. In this study, the researcher applied a 10*10 pair-wise comparison matrix to determine and generate the weight of the selected factors using AHP in the IDRISI software (Table 5).

CA–Markov validation approach

The overall achievement rate was determined by kappa index calculations, which allows for a better understanding of the key determinants of weakness and strength of results. The maps generated for the study are in a high level of agreement when 75% ≤ kappa < 100; they are in a medium level of agreement when 50% ≤ kappa ≤ 75%; and they are in a poor agreement when kappa is below 50% (Pontius 2000; Pontius et al. 2001), so the model was validated to verify the accuracy of the CA–MC simulation model to project the condition of the future LULC of the study area (Omar et al. 2014) following 2019 land use simulations using 1994 and 2009 classified images. Kappa index of agreement (KIA) (Mosammam et al. 2017) includes kappa for location (klocation), kappa for standard (kappastandard), kappa for no information (Kno), and kappa for strata (kappastrata). Mosammam et al. (2017) and Omar et al. (2014) asses the level of agreement between the projected and actual LU maps using the validation module in IDRISI. The validated model was also used to compare the area of the actual and simulated land use maps of the research area. The kappa index values for this model are acceptable (> 75%), so predicted land cover maps of 2029, 2039, and 2949 are valid. As stated by Omar et al. (2014), kappa statistics can be calculated using the following equations:

$$\mathrm{Kappa}\;\mathrm{for}\;\mathrm{no}\;\mathrm{information}=\frac{(M\left(m\right)N\left(n\right))}{P\left(p\right)-N(n)}$$
(5)
$$\mathrm{Kappa}\;\mathrm{for}\;\mathrm{location}=\mathrm K\;\mathrm{location}=\frac{(M\left(m\right)\ast N(n))}{P\left(m\right)-N(n)}$$
(6)
$$\mathrm{Kappa}\;\mathrm{for}\;\mathrm{quantity}=\mathrm K\;\mathrm{quantity}=\frac{(M\left(m\right)\ast H\left(m\right))}{K\left(m\right)-H\left(m\right)}$$
(7)
$$\mathrm{Kappa}\;\mathrm{standard}=\mathrm K\;\mathrm{standard}=\frac{(M\left(m\right)\ast N\left(n\right))}{P\left(p\right)-N\left(n\right)}$$
(8)

where K index refers to kno, Klocation, and kstandard, and no information is defined by N(n), medium grid cell level by M(m), and perfect grid cell level information across the landscape by P(p).

Analysis and result

Land use/cover structure changes

For the study period (1984–2049), the classification of land use (Figs. 3, 45) employed in the model that is proposed consists of six classes, as determined by spatial overlay analysis in a GIS environment: wetlands, water bodies, open space, forest, crop land, and built-up land (Table 2). Built-up (urban) area increased from 243.94 to 2529.98 ha from 1984 to 2019, and urban development has increased by 2285.06 ha in the last 35 years. However, during this period, crop land was decreased by 11.17% from 1984 to 2019. In the same study period, forest land was reduced by 2.69% due to increasing trends of artificial surfaces/constructions (Table 6, 7, 8, 9).

Fig. 3
figure 3

Area of LU/LC classes of Bahir Dar in percentage in 1984, 1994, 2009, 2019, 2029, 2039, and 2049

Fig. 4
figure 4

Land use land cover map of Bahir Dar for the years 1984, 1994, 2009, and 2019

Fig. 5
figure 5

Driving factors for CA–Markov model. a Distance from road. b Slope map. c Distance from city center. d Distance from built-up areas. e Distance from airport. f Elevation. g Distance from public services. h Distance from river. i Aspect. j Distance from forest

Table 6 Percentage changes of the area of land use classes over the study period
Table 7 Transition area matrix between the study year 1984 and 1994
Table 8 Transition area matrix between 1994 and 2009
Table 9 Transition area matrix between 2009 and 2019

Based on the CA–Markov modeling results, the built-up area will continue increasing in its expansion from 2529.98 ha (Table 5) in 1984 to 3983 ha (Table 11). This indicates that the built-up area will increase by 18.56% from 2019 to 2029 at the expense of crop land, forest, and open spaces (Fig. 3), whereas crop land will continue declining because of the rapid growth of built-up surfaces towards these land covers. Forest land will reduce from 1589.39 ha in 2019 (Table 5) to 638 ha in 2029 (Table 11). The figure above (Fig. 3) indicates that built-up will continue to increase in the next 30 years (2029–2049) from 2598.98 to 4998 ha. However, crop land will decline from 14,183.2 to 12,736 ha due to rapid urban growth towards agricultural fields of urban fringes, green areas, and open spaces (Fig. 3).

Evaluating the accuracy and validation of land use maps is the most important aspect of land use classification, land use change detection, and modeling. The classification accuracy for land use land cover maps was calculated and evaluated using statistics like kappa coefficient, producer’s, user’s, and overall accuracy. Overall classification accuracy is the percentage of correctly classified samples of an error matrix. It is computed by dividing the total number of correctly classified samples by the total number of reference samples. The kappa coefficient is a measure of the overall agreement of a matrix. In contrast to the overall accuracy — the ratio of the sum of diagonal values to total number of cell counts in the matrix. The kappa statistics coefficient which ranges from − 1 to + 1 shows the degree of accuracy of the base map/reference data and LULC values of the classified images (Congalton 2001). A kappa coefficient less than zero indicates that there is less agreement; 0–0.2 means slight agreement; 0.2–0.41 shows fair agreement; 0.41–0.60 means moderate agreement; 0.61–0.80 means substantial agreement and 0.81–1.0 perfect agreement. The overall classification accuracy for 1984, 1994, 2009, and 2019 were 97.14%, 97.45%, 97.3%, and 94.76%, respectively, while the kappa statistics values for the stated years were 0.92, 0.89, 0.85, and 0.89, respectively (Table 3).

LULC conversions analysis

In the current study, the conversion of LULC from one class to the other class all over the study periods has been depicted (Tables 7, 89). Persistence is represented by the diagonals of the matrix from the tables, whereas conversions from one category to another are represented by the off-diagonals (Table 7). Between 1984 and 1994, 149.5, 843.9, 584.8, 10.1, and 16 ha of land have been transformed from built-up, forest, open space, water, and wetland, respectively, to crop land, though large amount of crop land was persisted and continued as an agricultural field (Table 7). During this study period, 213.7, 37.7, 17.9, 7.9, and 3 ha of land were converted from crop land, forest, open space, water body, and wetland to built-up. At the same time, 77.5 ha of built-up area persisted and continued as built-up land during this time. As indicated in the table below (Table 7), the conversion of water body (10.1 ha) and built-up (149.5 ha) to crop land was high. There are different reasons responsible for these transformations of water bodies and built-up land. Due to deforestation and overgrazing, water bodies have been changing to dry land in Ethiopia, so this could happen because of this high land use transformation and rapid urban growth. In the case of built-up land, farmers in the surrounding areas/urban fringes are displaced from their dwellings and migrated to Bahir Dar and other small towns due to government plans of redevelopment and provision of land to investors, so built-up land in urban fringes was replaced by large private and government investments. The other reason could be the seasons by which the Landsat images were captured. Dry season images were used for this study, so built-up and crop lands may have similar reflections during harvesting seasons.

Between 1994 and 2009, 122.7 ha, 969.6 ha, 1301 ha, and 436 ha of land were changed from built-up, forest, open space, and wetland to crop land, respectively. In the same period, a large amount of crop land was converted to other land use classes, i.e., 1067.3 ha to built-up, 1030 ha forest, 998.9 ha to open space, and 244.3 ha to wetland (Table 8).

Between 2009 and 2019, a high rate of conversion was observed in built-up and crop land because of rapid urban expansion at the expanse of the other land uses, mainly crop land. During this period, 1122.3 ha, 188.6 ha, 78.6 ha, 7.9 ha, and 55.5 ha of land have been converted from cropland, forest, open space, water body, and wetland to built-up, respectively, even though, built-up was also converted into other land use categories; crop land (511.2 ha), forest (91 ha), open space (44.7 ha), and wetland (102.3 ha).

Determined driving factors for urban growth and LULC changes

Several factors are considered driving forces of rapid urban growth and speedy LULC changes for this research. Distance from the existing urban area, distance from the international airport, distance from forest land, distance from public services, distance from city center, distance from river, aspect, slope, road distance, and elevation were taken into consideration as driving factors; see Fig. 8, processed in ArcMap and IDRISI selva for urban growth projection. These criteria were chosen from the preliminary investigations’ expert conclusions (Lawal et al. 2011; Arsanjani et al. 2012; Hamdy et al. 2016). Here consistency ratio (CR) of the weights specified was determined to ensure their logical consistency, and a result of CR ≤ 0.1 reveals the correctness of the matrix (Table 10).

In the case of weight of variables determining urban expansion in Bahir Dar, distance from road, slope, distance from CBD, distance from built-up, and distance from airport are most effective driving factors with final weight of 0.29, 0.21, 0.15, 0.10, and 0.076, respectively (Table 10). The other variables used in the model, such as elevation, distance from public services, distance from river, and distance from water body and aspect, have insignificant influence over the rapid growth and land use transformation into the other land use categories in Bahir Dar.

Table 10 Fuzzy model standardization of variables and its weights extracted from analytical hierarchy processes (AHP)

Distance from the main road (Addis Ababa to Gondar and Dessie town) plays a significant role in Bahir Dar’s growth. The city has been growing along this road and areas close to this transport line have better accessibility to socio-economic and political services. Urban areas with a distance of 0–1316 m from the main road have the highest chance of urban development. Urban areas with a distance of 1316–3392 m from the main road have medium growth opportunities. However, areas > 3392 m distance from the main road have no probability of growth/are not suitable for development.

The other main driving factor of urban growth and LULC change in Bahir Dar is the slope. Urban areas located in Bahir Dar with a 0% slope have the highest suitability for urban growth and parts of the city with 0–13% have decreasing urban growth suitability. However, areas with > 13% have no suitability for urban growth. This means urban areas with a slope > 13% restricts urban growth or demand a high cost of urban development.

Distance from the center of the city is also a potential factor that affects urban growth and LULC change in Bahir Dar. It is the main attraction point/growth pole of the city and its surroundings. According to this study, areas closer to the city center have the highest likelihood of urban growth than the urban fringes. Accordingly, areas of the city located 0–4614 m have the highest probability of urban growth, or such distance from the city center promotes urban growth. In contrast, areas found from 4614–7674 m have a decreasing probability of urban growth compared to regions of the city closer to the city center. In contradiction, parts of the city located at a distance > 7674 m are not suitable for urban growth.

Based on this study, distance from built-up areas is also the main driving factor of urban growth and land use change in Bahir Dar. Distance of 0–149 m from existing built-up areas promotes urban growth, while distance 149–373 m decreases urban development probability. Areas > 373 m have no probability of urban growth or are not suitable for urban development due to a high distance from existing urban patches.

According to the findings of this study, the other factors selected and included for this study have no significant role in urban growth in Bahir Dar. This is determined based on their weights extracted from the analytical hierarchy processes (AHP). These variables have a weight below 0.1, which is a low value to consider as the main driving forces of urban growth and LULC change in the study area (Table 9).

Markov chain transition matrix

The table (Table 11) reveals the Markov chain probability matrix summary for major LULC transitions in Bahir Dar between 1984 and 1994 and the possible quantity in the projection of future LULC changes observed from different land categories. For instance, Table 10 shows 51.12% transition probability of wetland to wetland is 51.12%, whereas the future transition probability of wetland to forest was 37.03%. The transition probability of crop land to crop land was 69.99%, whereas from crop land to built-up was declined to 2.54%. Between 1994 and 2009, the future transition probability of crop land to crop land is 68.38% and from built-up to built-up is 70.2%. At the same time, the future transition probability of crop land to built-up is 7.2%. In the third period (2009 to 2019), the future transition probability of cropland to cropland was 84.86% and for built-up to built-up is 84.99%. The other LULC classes had less probability remaining as they are. However, the probability of the future transition of crop land to built-up was expected to be 25.4%, followed by forest (18.8%) and open space (12.9) from 1984 to 1994. Table 12 shows 57.4 transition probability of wetland to built-up, followed by crop land to built-up (15.79%) and water body to built-up (12.7%) from 1994 to 2009. In the third study period (2009–2019), forest land has a high transition probability (87.79%) to built-up, followed by crop land (15.7%) and water body (12.7%). In general, the probability matrix tables (Tables 1113) show the growing and decreasing probability of changing land use over time.

Table 11 Markov transition probability matrix calculated using LULC maps of 1984–1994
Table 12 Markov transition probability matrix calculated using LULC maps of 1994–2009
Table 13 Markov transition probability matrix calculated using LULC maps of 2009–2019

Cellular automata-based Markov chain modeling

This type of modeling was used to calculate the area of the anticipated urban growth and LULC types for 2029, 2039, and 2049, as shown in the figures below (Figs. 6, 78). For this study, built-up and cropland showed significant changes compared to the other land use land cover types (Tables 14, 15 and 16). For instance, cropland was decreased from 13,787 ha in 2029 to 13,526 ha in 2039 and 13,556 ha in 2039 to 12,736 ha in 2049, respectively. A continuous increase in the built-up area will be observed in 2029 3938 ha (18.61%), 2039 4214 ha (19.69%), and 2049 4998 ha (23.35%), respectively. Water bodies, on the other hand, will show a nearly consistent proportion from 2029 (1.77%) to 2039 (1.91%) and 2049 (2.01%). This figure will be insignificant for the LULC changes of the study area in the projected years. Built-up areas are predicted to expand at the expense of other LU types, specifically cropland.

Fig. 6
figure 6

Projected 2029

Fig. 7
figure 7

Projected 2039

Fig. 8
figure 8

Projected 2049

Table 14 Projected LULC of Bahir Dar for 2029, 2039, and 2049
Table 15 Kappa index of agreement to ability to specify quantity and location accurately to predict 2019 LULC map
Table 16 Area of projected and actual land use for 2019

Model validation and calibration

It is obvious that quantitative models that are applied to simulate the future condition of natural and artificial land could pass through a validation process to ensure the degree of validity and verify the results obtained from it. Likewise, for validation of the proposed model, the actual (a) and projected (b) (Fig. 9) LULC maps for 2019 were compared and the k coefficient for quantity and location was calculated (Fig. 10 and Table 15). The statistical result obtained in these model validation shows that kno is 0.8816, klocation is 0.8899, klocationstrata is 0.8799, and kstandard is 0.8483, respectively (Table 15). These figures reveal that the quantification and location errors are almost nonexistent or minor. This shows that the simulation can accurately specify both location and quantity. All the k index values are greater than 80%, indicating a strong agreement between the actual and projected LULC maps for 2019. The k statistics further show that the CA-Markov chain simulation was well designed and that the accuracy assessment was sufficiently accurate.

Fig. 9
figure 9

Actual (a) and projected (b) land cover maps of 2019

Fig. 10
figure 10

Validation of the model using actual and projected 2019

The total area for the actual LULC map of 2019 was 21,734 ha and the total area for predicted 2019 from the model is 21,319 ha (Table 16). This result shows the model is valid as the overall error is 1.89%, which is negligible. This is calculated by subtracting the total area of the predicted map 2019 from the total area of the actual map of 2019. This result is then converted to a percentage to determine the overall error of the CA–Markov model applied in this study.

Discussion

This report investigated urban growth dynamics and LULC changes over six time periods, 1984–1994, 1994–2009, 2009–2019, 2019–2029, 2029–2039, and 2039–2049. The results presented in the previous section are of interest not only for Bahir Dar City, Ethiopia but also for a more thoughtful reflection on the future urban growth impacts on the global climate and livelihood of the people and potential factors of urban growth and LULC changes. According to the findings, built-up areas increased by 89.33% between 1984 and 2019, so 2041.1 ha of land has been transformed into artificial surfaces/built-up land during this time. This indicates that urban areas have grown rapidly during the past three and a half decades. Similar studies by Sun et al. (2020) from 2000 to 2017 revealed a high urban growth rate in China’s Yangtze River. Their research shows that an increase in the built-up area, combined with rapid urban growth from the core to the periphery, has resulted in a loss of natural land. The present study shows that forest and crop land were deceased while built-up increased due to high scaled and rapid urban growth in the study period. For instance, forest and crop land lost 574.56 ha and 2384.9 ha of its cover from 1984 to 2019. In the same study period, built-up areas showed an increment in its area coverage, i.e., gaining 2286.04 ha, at the expense of the natural land. A study conducted by Martellozzo et al. (2018) on the impact of urban growth on agriculture and natural land indicated that built-up expansion adversely affects the economy and ecology of Italy. Another study (Huang et al. 2020) conducted on the occupation of crop land on global urban growth investigated that the urban occupied 159,170 km2 of cropland, accounting for 45.9% of the total expanded urban area, and exerted a significant impact on crop land and its productivity. Such deterioration and natural and semi-natural land loss have become a major worry that practically affects Africa’s entire continent (Terfa et al. 2017 and Lasisi et al. 2017), Europe, and Asia.

The findings of this study indicated that crop land had lost a greater percentage of its area than forests between 1984 and 2019. Table 5 reveals that crop land has lost 11.71% of its covers, whereas forests have lost only 2.69% during the same time period. These findings indicated that crop lands are highly vulnerable land use classes strongly affected by rapid urban expansion in Ethiopia. Crop land in the study has been diminishing, and earlier research has cautioned that the loss of crop land and forest might have a significant impact on food security and ecosystem services (Keith et al. 2019; Patra et al. 2018; Wang et al. 2020; Trinder and Liu 2020; Basawaraja et al. 2011; Bolca et al. 2007 and Fazal 2000). Previous studies on urban expansion reported that agricultural land is diminishing because of the rapid horizontal expansion and growth of cities and towns in Ethiopia (Alemineh 2018; Ayele and Tarekegn 2020). Research by Asempah et al. (2021) reported that settlement expands eight times the area covered by settlements between 1990 and 2020, i.e., 7.5 km2 (1.28%) of land covered by settlement was increased by 59.89 km2 (10.33%) in 2020 Wa municipality, Ghana. A similar study conducted in Selangor, Malaysia showed that built-up cover was increased by 608.8 km2, whereas natural vegetation was decreased by 831.8 km2 from 1999 to 2017 (Azari et al. 2022). Though urban forests can provide necessary ecosystem services to the urban landscape, no study has reported the positive correlation of urban growth on forest coverage or urban green areas. This indicated that rapid urban growth and land use change have greatly influenced the urban ecology throughout the world (Zhu et al. 2019; Das and Das 2019; Shao et al. 2021). Urban areas in Africa are growing in population and land cover, posing a threat to the continent’s ecosystems, biodiversity, and forest coverage. However, the concern over biodiversity and forest land conservation, exacerbated by the expanding urbanization, is overshadowed by the ever-worsening social problems (Seto et al. 2017; Toit et al. 2018). According to studies (Deribew 2020 and Dessu et al. 2020), recent urban expansion and infrastructure development, along with a steep reduction in green space/vegetation, have intensified the intensity and rate of urban growth’s impact on the ecosystem and the local climate in the western fringe of Addis Ababa and southwest Ethiopia, respectively.

This study helps to achieve sustainable development goals (SDGs) such as sustainable cities and communities and climate action set by the UN (UN 2018). Rapid urban growth and LULC changes have been affecting the sustainable development of Bahir Dar due to the decline in the forest, wetlands, and open spaces. The growing population has high demand for sanitation, health care, housing, and electricity. Developing countries like Ethiopia cannot afford the cost of these infrastructures and provide basic services to the people, so people are forced to use other options such as clearing green lands and forests to satisfy their demand and basic needs (Asoka et al 2013; Puplampu and Boafo 2021).

Furthermore, the UN also set another goal of sustainable development called climate action (UN 2018). However, the rapid urban growth and land use transformation in Bahir Dar affect the effort to take urgent action to combat climate change and its impact. Urban growth and rapid land use transformation in Bhir Dar are directly linked to the expansion of manufacturing industry decline in forest and open spaces. This has contributed significantly to increasing greenhouse gases and atmospheric temperature, which are directly linked to climate change (Y. Wang et al. 2014; Murshed and Yusuf 2018; Milesi and Churkina 2020), so this study helps planners and decision-makers to have detailed information on the current and future urban growth trends and LULC changes in Bahir Dar and develop an appropriate plan and policy to achieve sustainable development in the future.

This study used CA-based Markov chain models to simulate the future urban growth patterns and LULC changes in Bahir Dar city and its surroundings. Various studies were carried out on simulation of urban growth and LULC change in different parts of the world using this model (Shafizadeh and Helbich 2013; Al-sharif and Pradhan 2014; Abutaleb and Ahmed 2016; Altuwaijri et al. 2019; Cai and Wang 2020; Aburas et al. 2021) with different findings. The results obtained from this model indicated that built-up patches will expand in size by 2029, 2039, and 2049. A total of 934 ha is expected to be added to the built-up areas in this time period (coming 30 years). However, in the same time span (2019–2049), crop land and wetland will lose 1736 and 417 ha, respectively (Table 17). This means that built-up areas in the suburbs will expand to the wetlands and crop lands and these land covers will be transformed rapidly into built-up areas (Table 16 and Figs. 6, 78). A study conducted by Mozaffaree and Oja (2021) on urban growth simulated by integrated cellular automata and agent-based models in Tallinn, Estonia reported that the overall built-up areas will reach 175.24 km2 with an increase of 30.25% in total from 1990 to 2030. This growth of urban surface would be at the expense of other land uses such as forest and croplands, affecting the urban ecology and agricultural productivity of the community in the urban fringes. Siddiqui et al. (2018) carried out a similar study on urban growth dynamics using CA-MC models and logistic regression in Uttar Pradesh, India, indicating that the city is expected to grow manifolds from 53.6 km2 in 1993 to 441.2 km2 in 2023. Studies were also conducted by Jat et al. (2017) and Saadani et al. (2020) on urban growth modeling using CA-MC models. The former research revealed that the built-up area is expected to grow manifolds and is expected to have spontaneous growth of urban pixels with a 70% of probability of getting developed in the upcoming years (2015–2040) in Ajmer City of Rajasthan, India. The latter study indicated that built-up area will increase from 29.9 to 43.8 from 2018 to 2040 in EL Jadida City, Moroccan metropolitan due to rapid urban growth and LULC changes. All these studies indicated that urban growth and land use transformation would continue developing in the coming years at the expense of crop land natural lands. Generally, CA–Markov model results revealed that croplands and forests are expected to decline in the upcoming 30 years. In 2049, these two land cover classes will cover 62.37% of the study area, a decrease of 10.48% from the current distribution. This indicates that there will be a significant level of agricultural land and forest loss and land fragmentation in the study area in the upcoming 30 years (Table 16). The present study has validated using kappa statistics such as Kno, Klocation, Klocationstrata, and Kstandard. The K index values are greater than 80%, indicating the presence of strong agreement between the actual and simulated urban growth maps.

Table 17 Quantity of land cover classes in hectares from 2019 to 2049

The potential factors identified by this study for rapid urban growth and LULC changes in Bahir Dar and its environs are distance from road, slope, distance from CBD, and distance from built-up with its respective weights of 0.29, 0.21, 0.15, and 0.10, respectively. Previous research identified numerous driving factors of urban growth and LULC changes such as physical, proximity, and socio-economic factors (Falah et al. 2020; Hassan and Elhassan 2020; Rana and Sarkar 2021; Liu et al. 2021). For instance, Falah et al. (2020) identified distance from roads, land prices, distance from faults, and distance from the river are the main driving factors of urban growth in Qazvin City. The other study by Mohammadi et al. (2013) reported that distance from highways, roads, residential areas, distance from urban infrastructures, and distance from educational land use were the main factors of urban growth in Isfahan. A similar study shows elevation, slope, soil texture, population density, and distance to commercial area are the main driving factors of urban growth and LULC change (Aburas et al. 2017). These findings indicated that the factors of urban growth and land use change vary over space and time.

Despite the fact that the model employed in this study did an excellent job of simulating the future, there are many uncertainties in the projection of land use classes (Naghibi et al. 2016; Otgonbayar et al. 2018; Li et al. 2020; Chen 2022), which are explained below. First and foremost, it is crucial to notice that the maps which were compared to each other and the actual and projected maps for 2019 differ somehow. Secondly, while the study found that the procedures utilized in the Markov chain model analysis are effective approaches for computing land cover transition probabilities, the procedures were designed with the assumption that transition probabilities do not alter over time (Mozaffaree and Oja 2021). Because the models cannot asses and analyze the changes and processes occurring every time on the landscape structure and LU classes, this causes uncertainty (Memarian et al. 2012). The result, for example, indicates that the transition probabilities from various land covers to wetland and water bodies are high from 1984 to 2019, and the future land changes are modeled using these transition probabilities. However, based on the current distribution, these land covers are expected to decrease dramatically in the years ahead. Lastly, due to unpredictable events such as natural hazards, policy changes, climate changes, and potential uncertainties related to simulator models, simulation of future LULC would be full of uncertainties (Prestele et al. 2016; Ferchichi et al. 2017). The other limitation of CA-based MC model is its contestant-driven feature rather than a preference-led model, so it lacks dimension of human decision to determine the spatiotemporal dynamics of urban growth in the coming years (Mozaffaree and Oja 2021).

This research reveals that integrating GIS and RS with CA–Markov models provides an enhanced understanding of the futures that Bahir Dar’s terrain will encounter. Facing declining crop, forest, and wetland problems, the projected land use maps will be used as an early warning system to predict and understand the future consequences of urban growth, socio-economic and environmental impacts, and rapid transformation of land use classes to artificial intelligence surfaces. Overall, these findings could be used as a road map/strategic plan for urban planning, assisting urban municipalities/local governments in properly understanding the complex landscape structure and developing improved land use policies and balancing urban growth trends with urban environmental conservations allowing for long-term urban development. CA–Markov modeling should be empirically repeated and further comparative research is needed in the future to evaluate whether these projected landscape changes are specific to Bahir Dar City.

Conclusion

This study demonstrates dynamic simulation of urban growth and land use change using integrated cellular automata and markov chain models: a case of Bahir Dar city, Ethiopia, to detect changes in the area by comparing images between 2 years. Based on the CA–Markov models, the probability of changes in Bahir Dar City and its surroundings was predicted. Built-up (urban) area increased from 243.94 to 2529.98 ha, whereas crop land was decreased by 11.17% from 1984 to 2019. Results predicted by CA–Markov model indicate that over the upcoming years, the city’s rapid expansion into its environs, particularly on the western and eastern fronts, will be more intense and faster than on the other fronts. Based on the results of this study, cropland will decrease from 137.87 km2 in 2029 to 135.26 km2 in 2039 and 135.56 km2 in 2039 to 127.36 in 2049 km2. However, a continuous increase in built-up area will be observed in 2029 39.38 km2 (18.61%), 2039 42.14 km2 (19.69%), and 2049 49.98 km2 (23.35%). Distance from road, slope, CBD, distance from built-up, and distance from airport are the most effective driving factors of urban growth in Bahir Dar with a final weight of 0.29, 0.21, 0.15, 0.10, and 0.076 respectively.

Forests and crop lands will primarily transit to urban artificial surfaces as built-up areas expand and eventually promote rapid urbanization and unplanned settlements. These gains in urban patches will result from losses in crop land and forests in 2029, 2039, and 2049, which will cause a reduction in soil productivity and increase flooding. These gradual changes will result in social and economic marginalization, mainly in eastern and western fridges. Therefore, sustainable development of Bahir Dar can be achieved if high priority is given to urban growth management by planners and decision-makers. The CA-based model has the ability to represent non-linear, spatial, and stochastic processes.

Furthermore, the CA-based MC model can model and control complex spatially distributed processes and provide clear insights into local behaviors and global land use/cover change patterns. Moreover, spatial and temporal complexities of land use change can be well represented and simulated using suitable transition rules in CA model. CA–Markov model also has some disadvantages; it cannot handle the top-down processes. The study was conducted based on medium resolution (30 m) satellite images, so this could affect the results related to LULC maps because multiple land use land cover classes were prevalent in single pixels. For instance, water bodies could not be mapped due to the coarser resolution of satellite images. The other limitation of this study is lack of required data related to population density. Population density map is necessary for this study, but due to lack of this necessary data, the variable is excluded from the list of driving factors of urban growth and land use changes. The mapping of urban growth using GIS and RS as well as urban growth and LULC change modeling using CA-Markov models could have significant contribution for future urban growth management in Bahir Dar and can be an instrument for a decision support system for urban planners and policy makers to design urban growth plans with an approach of sustainable habitat development. This study recommends the application of integrated simulation models of CA-Markov and agent-based model (ABM) for the same study area and draw comparisons of results obtained from both models to overcome the CA–Markov barriers for future work.