Introduction

Land use/land cover (LULC) change is one of the main drivers of terrestrial biodiversity loss and altered ecosystem functions and services across the globe (Bateman et al. 2013; Tittensor et al. 2014). Anthropogenic LULC change is continuing to increase in extent and intensity (Marques et al. 2019) and is forecast to remain a major driver influencing terrestrial ecosystems in the future (Sala et al. 2000). Major changes in LULC include agricultural expansion and intensification, urbanisation, industrialisation, deforestation of natural forest and forest planting for timber, undertaken to meet the demands of an increasing population worldwide (Newbold et al. 2015; Song et al. 2018). While most studies focus on recent changes in land use, it is important to set current changes in a historical context (Cousins et al. 2015; Fescenko and Wohlgemuth 2017). Some studies have examined such changes across networks of sample sites with known history (e.g. Redhead et al. 2014). However, because this is restricted to specific locations, broader and complex patterns occurring across landscapes cannot be assessed. Re-constructing historical landscape maps allows LULC change to be examined across large areas, which is important for assessing the degree and type of changes and their spatial distribution. Such analyses can help inform land management decisions and support the implementation of future conservation measures, for example by identifying which areas are at greatest risk of future change.

LULC maps can be produced using a variety of sources, including field survey data, aerial photography and satellite imagery. Satellite images have become increasingly important for obtaining land cover data and are often used to monitor LULC change. Such data can also be used to support the development of LULC models, which aim to detect drivers of historical change and/or predict future changes (Veldkamp and Lambin 2001). There are a wide variety of approaches used to model LULC change (Lu et al. 2004; Noszczyk 2019). LULC models require the identification of the most important changes, such as urban expansion, agricultural intensification, or protection of natural areas, along with geographical predictors for where specific changes are most likely to occur. This may include soil type, topography, the previous LULC and other landscape features such as watercourses or infrastructure. As the importance of and demand for understanding LULC change has increased, a number of modelling software programmes have been developed (Fuchs et al. 2013; Sharp et al. 2016). These tools all employ a similar principle, whereby statistical analysis is used to identify patterns between the current distribution of LULC and environmental covariates.

Owing to data availability and time constraints, many LULC change studies are only able to re-construct one historical landscape. Comparisons in LULC are therefore often performed between two snapshots in time (Reis 2008; Hooftman and Bullock 2012; Cousins et al. 2015), which provides little information on the dynamics of change during the intervening period. More detailed information on trajectories of LULC change can help land managers and conservationists address more specific problems, which would otherwise be difficult to solve with only two time points. For example, determining where certain habitats occurred in the past and at what time period they were lost can be useful for locating areas where ecological restoration could take place or where habitats could be reconnected (Willems 2001). Similarly, variation in biodiversity among apparently similar habitats can be explained by their different land use histories, which can inform conservation management choices (Fescenko and Wohlgemuth 2017).

Our study area is the county of Dorset, a predominantly rural landscape in southern England, which has undergone dramatic land use change, mostly through agricultural intensification, over the twentieth century (Hooftman and Bullock 2012), in common with many regions across Europe. Dorset is an ideal area to examine LULC change, as there is a wealth of environmental datasets for this county, including an extensive botanical survey conducted by Good (1937). Good’s dataset has provided valuable insights into patterns of change in heathland, calcareous grassland and woodland (Keith et al. 2009; Newton et al. 2012; Diaz et al. 2013) and more recently it has enabled the generation of a habitat time-series dataset across Dorset (Ridding et al. 2020). Ridding et al. (2020) determined the habitat type of over 3700 locations that were derived from the original Good survey sites, using contemporary field survey data and spatial datasets, for the years 1930, 1950, 1980, 1990 and 2015. Hooftman and Bullock (2012) created a land use map for Dorset in the 1930s and compared this with the UK Land Cover Map of 2000. They found that 97% of semi-natural grasslands were converted into agriculturally-improved grassland or arable land, as well as a large areas of heathlands and rough grassland. Although the study quantified broad LULC change over time, it was not possible to assess more accurately when these key LULC changes occurred. We aimed to improve on this by producing a time-series of maps spanning the past ca. 80 years in Dorset.

We used a modelling tool and detailed habitat data from Ridding et al. (2020) to inform the model, to generate LULC maps for Dorset in 1950 and 1980. These could then be used to analyse LULC change alongside the existing 1930s land cover map generated by Hooftman and Bullock (2012) and the CEH Land Cover Map 2015 (Rowland et al. 2017a). The aim of our study was to:

  1. (i)

    Assess the accuracy of the modelling method;

  2. (ii)

    Identify the timing of key LULC changes between 1930 and 2015;

  3. (iii)

    Determine the uncertainty associated with the methodology.

Method

Study area

Dorset, southern England, is currently ca. 2653 km2 in area, including the urban areas of Bournemouth, Poole and Christchurch that were added in 1974. Prior to that Dorset was ca. 2500 km2 in area (Hooftman and Bullock 2012). The population more than doubled between 1930 and 2017, from ca. 198,000 to ca. 424, 670 excluding the urban centres of Bournemouth and Poole (Office for National Statistics 2017). Like many regions in Western Europe, Dorset underwent considerable land use change during the twentieth and early twenty-first centuries, through agricultural intensification, afforestation and urbanisation, which led to significant losses of semi-natural habitats and fragmentation of remaining areas (Webb 1990; Hooftman and Bullock 2012).

InVEST scenario generator

To create LULC maps of Dorset in 1950 and 1980, we used the Integrated Valuation of Ecosystem Services and Tradeoffs (InVEST) Rule Based Scenario Generator tool (Sharp et al. 2016) (subsequently “InVEST”). The years 1950 and 1980 were selected based on the availability of detailed habitat data from Ridding et al. (2020), which was required to inform the model. InVEST uses a range of inputs as predictor variables to model land cover change using multi-criteria evaluation methods and overlay analysis (Sharp et al. 2016). Although the model is relatively simple compared to other approaches (Verburg et al. 2002; Liping et al. 2018), it is ideal for modelling LULC change over large areas because it is computationally efficient. The simplicity of the model also makes it easy for the user to incorporate known drivers and constraints, compared to methods such as cellular automata and neural networks (Sharp et al. 2016).

InVEST determines the suitability of individual grid cells for LULC change based on the following inputs; a baseline raster LULC map, a transition matrix table and other optional data including land suitability factors, constraints and override layers (Fig. 1). The transition matrix table provides the quantity of change per LULC, and the likelihood of a particular LULC converting to another LULC. Within this table, LULC types are prioritised using a value between 1 and 9; thus when multiple objectives compete for a single cell, the one with the highest priority wins. A proximity value within the same table controls the assumption that pixels close to a LULC type are more likely to be converted to that cover type if they are suitable. The land suitability factors are physical and environmental variables that are likely to affect the suitability of land for a given LULC type and thus where in the landscape changes are likely to occur. The factors are given a factor weight between the value of 0 and 1, which determines the weight given to the factors vs. the transition matrix (Fig. 1). For example, a weight of 0.3 means that 30% of the final suitability is contributed by the factors, whilst the remaining 70% is attributed to the transition matrix (Sharp et al. 2016). The constraint input within InVEST prevents particular areas of the baseline landscape from changing where there are known factors that limit the likelihood of change. The override function, changes the LULC type of individual grid cells based on the users input, which occurs after the model has run. The following sections describe the data utilised for each model input.

Fig. 1
figure 1

Schematic showing the methodology used to create the 1950 and 1980 LULC maps of Dorset (adapted from Sharp et al. (2016))

Baseline LULC maps

The 1930 adapted Dudley Stamp Map produced by Hooftman and Bullock (2012) was used as the baseline LULC map for the creation of the 1950 Dorset map. The Dudley Stamp Map was created from the 1930s Land Utilisation Survey of Britain, where volunteers mapped LULC on six-inch to the mile Ordnance Survey (OS) maps (Stamp 1931). This baseline map is based on the historic county boundary which we used for the entire map time-series to ensure consistency. The 1930 map did not clearly distinguish broadleaved from coniferous woodland. To address this issue we used Good’s survey of 7575 vegetation stands (Good 1937) to identify areas of coniferous woodland, which is likely to have been planted in this part of the country. The Good stands did not give complete coverage over Dorset, so all remaining patches of woodland that were not surveyed by Good were assumed to be broadleaved. This is likely to be an underestimate of coniferous woodland in Dorset; however records suggest that the coverage of coniferous woodland in southern England during this time was limited (Best and Coppock 1962).

For the creation of the 1980 Dorset map, we used the CEH Land Cover Map 2015 (LCM2015) (Rowland et al. 2017a) as the baseline. The LCM2015 is a parcel-based land cover map for the UK created by classifying satellite data into 21 land classes that are based on the broad habitats defined by Jackson (2000). This method was preferred over using the generated 1950 map, since the LCM2015 is already a validated product, and this avoided using two sequential interpolations to create the 1980 map and the likely propagation of errors. We trialled the alternative approach in preliminary analyses but this gave less accurate results (see Online Resource 1). No acid grassland was identified in the LCM2015 in Dorset, even though this habitat was known to be present at this time (Ridding et al. 2020). This is because small areas of semi-natural habitat are often not detected in the LCM2015, which has a minimum mappable unit of 0.5 ha and is poor at detecting linear features, such as remnant strips of semi-natural grassland (Ridding et al. 2015). To address this and improve the accuracy of the baseline map, we replaced areas that were misclassified in the LCM2015 with acid grassland from Natural England’s Priority Habitats’ Inventory (Natural England 2015) using ESRI ArcGIS v10.4 (© ESRI, Redlands, CA).

The baseline maps and consequently the generated LULC time-series contained 13 aggregated land classes: “broadleaved woodland”, “coniferous woodland”, “arable”, “calcareous grassland”, “acid grassland”, “neutral grassland”, “improved grassland”, “fen, marsh, swamp”, “heathland”, “coastal”, “water”, “urban” and “other”. The other category includes inland rock, which was only mapped for 1980, since this LULC type only occurred in the LCM2015 and not in the adapted Dudley Stamp map for 1930 (Hooftman and Bullock 2012). The baseline maps were converted to 100 m resolution rasters, using a maximum area cell assignment in ArcGIS, and thus this resolution was also used to create the 1950 and 1980 map. The selected resolution, which has been used in other LULC studies (Moulds et al. 2018), was a compromise between capturing detailed LULC change, whilst maintaining a scale at which influential factors, such as soil and slope are likely to impact the InVEST predictions. At finer scales complex factors such as land ownership would likely become important and could not be captured by modelled factors in InVEST. Redhead et al. (2018) found that running the InVEST nutrient model at resolutions finer than 100 m showed only small gains in accuracy compared with the extra running time and large file sizes.

Transition matrix

To determine the amount of LULC change between 1930 and 1950, and between 2015 and 1980, and the likely transitions between different land covers, we utilised data from a survey time-series dataset where habitat type has been assessed at over 3700 sites across Dorset in 1930, 1950, 1980 and 2015 (see Ridding et al. 2020). Subsequently, we refer to this database as the “habitat time-series”. The quantity of LULC change between both 1930 to 1950, and 2015 to 1980 was determined by calculating the percentage change in each LULC type, using the habitat time-series. A transition matrix of LULC change based on counts of changes across the habitat time-series sites was also generated for both time periods, which was subsequently adjusted on a scale of 0–10 to meet the input requirements for InVEST (see Online Resource 2 and 3). Where a particular LULC was not present in the original baseline data, for example improved grassland in 1930, an area change rather than a percentage change was required by InVEST. To calculate this we used the number of improved grassland sites in 1950 from the habitat time-series dataset, as a percentage of the total sites multiplied by the area of Dorset.

Priority values were required for LULCs which increased between 1930 and 1950; arable, coniferous woodland and urban. Priority values rank the LULC type, thus when multiple objectives compete for a single cell, the LULC with the highest priority wins. The literature reveals that there were considerable increases in arable and improved grassland during this period in Britain, and specifically in Dorset (Fuller 1987; Hooftman and Bullock 2012), suggesting that the transitions to these LULCs should be high priority in InVEST, thus we assigned scores of 8 and 7, respectively. During this time the planting of coniferous woodlands also increased rapidly (Best and Coppock 1962), however farming was a higher priority in the British lowlands after the Second World War compared with conifer planting; thus we assigned a priority score of 5 to this LULC type.

For the 1980 map, the priority values were based on change in the opposite direction (2015 to 1980), thus the number of semi-natural habitats (neutral grassland, calcareous grassland, heathland, fen, marsh, swamp habitats) increased, as well as arable and coastal LULCs. The amount of arable land decreases between 1980 and 2015 (Ridding et al. 2020) due to technological advances improving productivity of existing arable land, hence from 2015 to 1980 arable land actually increases. Using the habitat time-series dataset from Ridding et al. (2020), we determined that increasing the number of semi-natural habitats was a greater priority than arable and coastal which only increased by a small percentage during the same period Ridding et al. (2020). Values of 9 and 7, were therefore assigned to semi-natural habitats and arable/coastal respectively. We used 1000 m as the proximity value (where cells close to a LULC type within this distance are more likely to be converted to that cover type if suitable) for the increasing habitats for both the 1950 and 1980 map, as any finer scales are likely to be influenced by more complex factors such as accessibility and land ownership as suggested by Redhead et al. (2020).

Modelled factors

We examined a range of physical and environmental suitability factors for the generation of both the 1950 and 1980 maps which is a requirement for InVEST, including slope, elevation, rainfall, temperature, soil and Agricultural Land Classification (AGL) (see Table 1). These factors were selected based on similar studies in the literature (Verburg and Overmars 2009; Fuchs et al. 2013) and the availability of data for the whole of Dorset across multiple time periods where applicable (e.g. rainfall, temperature).

Table 1 A summary of variables used to model LULC change in Dorset between 1930 and 1950, and 2015 and 1980, with their source, scale/resolution and description

To determine which factors influenced the suitability of the increasing LULC types, we performed logistic regression using sites from the habitat time-series dataset (Ridding et al. 2020) that had remained versus sites which had undergone change for the 1950 and 1980 map. The sample size for habitat time-series sites converting to “urban” between 1930 and 1950 was too small to assess (n = 20) (noting the historical Dorset boundary excludes the large urban areas), and the same was true for coastal sites (n = 17) between 2015 and 1980. Elevation was strongly correlated with average temperature and rainfall (Pearson’s r > 0.6 or < − 0.6), so this was excluded from all models. Logistic regression analyses were performed in R v.3.4.2 (R Core Team 2019).

To determine the most suitable factor weight (factors vs transition matrix, see Fig. 1) we examined three different weights; 0.3, 0.5, 0.7 and evaluated these using the habitat time-series (see section on validation). A weight of 0.5 was selected to understand how equal weighting would perform, whilst 0.3 and 0.7 were arbitrarily selected using the example in Sharp et al. (2016), to represent and test the differences between a high or low weighting for factors versus transition matrix.

Constraints and override

In England the basic type of statutory protection is the designation as a Site of Special Scientific Interest (SSSI), which are areas of land selected for ‘special interest by reason of any of its flora, fauna, or geological or physiographical features’ (JNCC 2015). Although the first SSSIs were not designated until the 1950s (DEFRA 2009), Ridding et al. (2020) found sites which were classified as protected in the habitat time-series were more likely to remain as their original habitat. For this reason we used the extent of SSSI as a constraints layer. To improve the predicted output further, we also evaluated which 100 m cells had remained consistent between 1930 and 1990 for the 1950 map and 2015 and 1950 for the 1980 map. The partial Land Cover Map 1990 covered 83% of Dorset and was created using the same methodology used to make the LCM2015 (Rowland et al. 2017b). We assumed that where the LULC within the 100 m cell matched, in 1930 and 1990 for example, the LULC would have stayed the same in 1950, thus these matching cells were also used as a constraint layer, to prevent LULC change occurring in those locations.

For the generation of the 1980 map we also used the override layer in InVEST (Fig. 1). For this we examined which 100 m cells were consistent in both the generated 1950 map and the revised LCM1990 map (Rowland et al. 2017b), and presumed that this remained the same in 1980. This data was used as an override rather than a constraint, since the LULC type within particular cells may have differed between 1990 and 2015.

Accuracy

To assess the accuracy of the 1950 and 1980 Dorset maps produced using InVEST, we created ten cross-validation datasets from the habitat time-series dataset. In each of the 10 datasets, 75% of the habitat time-series sites were randomly selected for the training dataset, whilst the remaining 25% were used as a test dataset. Since some of the habitat time-series data did not completely match the baseline 1930 and 2015 map (see Ridding et al. 2020), we ensured that the test dataset only contained habitat time-series sites where the LULC in the habitat sites matched the corresponding baseline map, to ensure a fair comparison in the following interpolated map. Each of the ten training datasets were used to determine the percentage change for each LULC type and the significant factors which influenced change, as described above. For each of the ten cross-validation datasets for 1950 and 1980, InVEST was run in Python 2.7.0 one hundred times to account for the random selection of cells for change when all suitability factors and transition likelihoods were equal. A final output for each of the ten cross-validation datasets, was created using the modal LULC type for each cell. Where cells had equal counts of two LULC types no modal LULC was identified, thus these cells remain as “No data”. This occurrence was infrequent, occurring in a mean of just 0.5% of cells per model run (see Table 2). During the process of combining the one hundred rasters using the modal LULC for each cell, if a LULC type did not demonstrate change due to the large number of possible cells where the conversion could occur, meaning none of the changes were evident in the final modal map, we used the cross-validation dataset with the greatest accuracy and the most accurate run within this set to determine where the LULC change should take place.

Table 2 Accuracy, Kappa Index and certainty scores for each of the ten cross-validation datasets for the creation of the 1950 and 1980 LULC map for Dorset

To validate the output for each of the ten datasets, we compared the LULC from the 1950 and 1980 map outputs with the LULC assigned in the corresponding year in the habitat time-series (Ridding et al. 2020) using the test datasets (i.e. the remaining 25%). Accuracy was calculated as the percentage of habitat time-series sites which were consistent between the LULC type in Ridding et al. (2020) and the 1950/1980 LULC output from InVEST. In order to determine the Cohen’s Kappa Index, which measures the inter-rate agreement between two datasets (McHugh 2012), a single LULC type per site is required, so for sites containing multiple LULC types we assigned the LULC from the 1950/1980 InVEST output which had the largest coverage in area within the habitat site.

To produce the final 1950 and 1980 map output, we determined the modal LULC across the ten map outputs produced from the cross-validation datasets. For individual cells with no modal LULC type, we assigned the LULC to the cross-validation dataset output which performed the best, determined using the highest percentage agreement and Kappa Index values. The final 1950 and 1980 map output was validated and averaged across the ten test datasets.

Uncertainty

To map uncertainty associated with the model runs in InVEST, we calculated how many of the one hundred output rasters matched the final modal LULC for each cell, for each of the ten cross-validation datasets, following Redhead et al. (2020). An average certainty value for the whole study area was generated, excluding cells which were included as a constraint or override layer. This is because these cells were not allowed to change in InVEST, thus the one hundred output rasters would always match the final modal output, therefore biasing the overall certainty score. To map and determine uncertainty for the final 1950 and 1980 map output, we averaged across the ten datasets.

Results

Accuracy

The created 1950 and 1980 LULC maps (Fig. 2) showed a strong correspondence with the LULC from the habitat time-series dataset across all of the ten cross-validation model runs and also the final map output for each time point, as indicated by the accuracy values in Table 2. There were also high levels of agreement between the map output and validation dataset, evidenced by the Kappa Index (Table 2), where values between 0.80 and 0.9 indicate a strong level of agreement (McHugh 2012). The lowest Kappa Index recorded overall (0.77), still suggests a good level of agreement between the two datasets. The accuracy and agreement was slightly higher for the final 1950 output compared with that for 1980.

Fig. 2
figure 2

LULC maps of Dorset indicating the 13 LULC types in 1930 (Hooftman and Bullock 2012), 1950, 1980 and 2015 (Rowland et al. 2017a)

Error matrices were also generated for each time point; 1950 and 1980 (Table 3). Many LULC types showed good agreement between the generated map output and the habitat time-series in 1950, including “coastal”, “fen, marsh and swamp” and “broadleaved woodland”. However there was some confusion between semi-natural grasslands and arable/improved grassland. There was also confusion between improved grassland and arable, where more improved grassland sites were classified as arable in the generated 1950 LULC map. Similar patterns were shown in the error matrices for 1980, with LULC types such as “coastal”, “heathland”, “fen, marsh and swamp” and “broadleaved woodland”, being fairly consistent between the two datasets. The classification of calcareous grassland was better for 1980, however there was still confusion between neutral grassland, arable and improved grassland.

Table 3 Error matrices for (a) the final 1950 LULC map compared with the corresponding LULC from the habitat time-series (Ridding et al. 2020) and (b) the final 1980 LULC map compared with the corresponding LULC from the habitat time-series

Timing of LULC change between 1930 and 2015

The landscape underwent significant changes between 1930 and 2015 (Fig. 2). In 1930 the landscape was dominated by semi-natural grasslands (ca. 155, 008 ha) compared with 2015 when Dorset was dominated by improved grassland and arable land (ca. 200, 547 ha) (Table 4). Arable land expanded the greatest by 1950 in a region running south-west to north-east in Dorset (Fig. 2). This area and time period also coincided with the greatest loss of calcareous grassland (− 25,096 ha). The loss of neutral grassland which occupied much of the north and western area was also higher by 1950 (− 57,413 ha). The largest increase in improved grassland however, occurred between 1950 and 1980. Acid grassland and heathland are located in the south-east of the region, but reduced dramatically in area by 2015, with the greatest change occurring by 1980. This was largely due to expansion of coniferous woodland and urbanisation, as well as improved grassland.

Table 4 Area (ha) of each LULC type in Dorset between 1930 and 2015

Modelled factors

A range of physical and environmental variables were found to have a significant influence on the suitability for LULC change between 1930 and 1950, and 2015 and 1980 (Table 5). Many of the variables, including slope, soil texture and soil fertility were consistently significant across all of the ten cross- validation datasets, suggesting they were good predictors of where particular LULC should occur (Table 5). Some variables, including soil texture and fertility, showed no variation for certain LULC types, meaning certain values were strongly associated with particular LULC types. For instance, heathland was consistently found on acidic sandy soils. Consequently models could not converge, which provided a strong justification to include these variables as modelled factors in InVEST.

Table 5 Factors, their direction of change indicated by arrows (↑ = increase, ↓ = decrease) and significance (*p < 0.05, **p < 0.01, ***p < 0.001), included in each of the ten-cross validation sites for (a) increasing the area of coniferous woodland, improved grassland and arable in 1950 compared to 1930, and (b) increasing the area of arable, neutral grassland, calcareous grassland, heathland, coastal and FMS (fen, marsh, swamp) habitats for 1980 compared to 2015

To determine the best factor weight for the modelled factors against the transition matrix for InVEST (Fig. 1), we ran InVEST one hundred times for the first cross-validation dataset using three different weightings (0.3, 0.5 and 0.7) and compared the output LULC map with LULC from the habitat time-series. We determined that a factor weight of 0.5 produced the most accurate results and the highest Kappa Index score (Accuracy: 88%, Kappa Index: 0.84), compared with 0.3 (87%, 0.82) and 0.7 (79%, 0.71), thus this factor weight was used for remaining cross-validation datasets for the 1950 and 1980 output. A factor weight of 0.5 ensures an equal contribution from influential factors and the transition matrix.

Uncertainty

There was greater certainty across the 1980 cross-validation datasets and the final map output compared with 1950 (Table 2). However, datasets from both time periods demonstrated high levels of certainty associated with the InVEST model. In each of the certainty maps (Fig. 3), there were particular regions of uncertainty across Dorset, which overlapped to some degree in both time periods. Much of the uncertainty in the 1950 output was concentrated around the south and east of Dorset. There was some overlap in the southern region in the 1980 output, but this appeared to extend further north. Very few cells across all of the cross-validation datasets for 1950 and 1980 contained “No data” suggesting there were only a small number of cells where no modal LULC was identified (Table 2).

Discussion

Accuracy

The modelling method employed in this study demonstrated high levels of accuracy, with both the 1950 and 1980 LULC maps showing a strong correspondence with the habitat time-series dataset. Although this may be expected given that the rule transitions are based on that dataset, a number of other parameters were determined for InVEST, which were clearly effective in this study. This shows that with just a limited sample of habitat sites, this modelling method can predict and determine historical changes across a landscape.

Fig. 3
figure 3

Certainty maps of Dorset for 1950 and 1980, where light areas show good agreement between the hundred runs and the final modal map, whilst areas in black show greater uncertainty

A significant proportion of the mismatch between the map outputs and the habitat time-series occurred between arable and improved grassland. However, some confusion between these intensive agricultural LULC types might be expected, particularly when modelling from 2015 to 1980, since agricultural systems in the UK often have grass and clover leys incorporated into arable rotations to manage weed problems or to increase soil fertility (AHDB 2018) so the two classes are not necessarily mutually exclusive. This confusion could also be due to a number of social, economic and political issues that we could not model, including changes in pricing and profitability of crops vs. livestock (Zayed 2016). There was also some confusion between some of the semi-natural grasslands and arable, particularly for the 1950 map output (Table 2), which is consistent with other historical land cover modelling (e.g. Fuchs et al. 2013). This highlights the difficulty in predicting such change and is likely to arise because other small scale factors which cannot be captured by InVEST, will also be influencing change, such as land ownership.

Despite the strong correspondence between the map outputs and the habitat time-series, some LULC types which are known to have undergone considerable change between 1930 and 1950, changed by very little or not at all e.g. acid grassland (Table 4). Little heathland was converted in the 1950 map output, but it is known that this habitat experienced dramatic declines across Dorset over that time period (Moore 1962; Webb and Haskins 1980). This is likely to be due to the fact that large areas of heathland were lost to coniferous woodland during this period, which InVEST struggled to predict. This is because the area of coniferous woodland in 1930 was very small to begin with and the modelled factors only assisted in narrowing down the location of change to sandy acidic soils, which corresponded in general to the occurrence of heathland in Dorset, rather than narrowing down to specific 100 m cells within heathland areas. Fen, marsh and swamp and acid grassland were other LULC types which reduced by very little, if at all. This may be because these LULC types were competing with change from other LULC types such as calcareous and neutral grassland, which underwent significant conversion to improved grassland and arable and were a higher priority for change (Online Resource 4). This reflects one of the weaknesses of the InVEST tool, which currently models LULC change based on the percentage change of increasing LULC types only and not those which have shrunk.

Other issues arose due to differences with the baseline maps that were used to model the 1950 and 1980 outputs. For instance, the LULC type, “other”, in this study referred to inland rock, which was classified in the 2015 map (Rowland et al. 2017a) and hence the generated 1980 output, but was absent from the 1930 and 1950 maps. There were also differences in how water was mapped, with rivers included in the 1930 baseline map, but not in 2015. The same was true for woodlands, with different classes for the baseline maps, which may explain why broadleaved woodland did not follow the trends identified in other studies (Hooftman and Bullock 2012; Ridding et al. 2020). Furthermore, the definitions of LULC types varied slightly between the start and end maps for fen, marsh and swamp and coastal. For example, coastal in the 1930s map referred to sand dunes/littoral sediment, whilst in the 2015 map this included categories such as littoral rock, littoral sediment, supra-littoral rock and supra-littoral sediment.

Timing of LULC changes between 1930 and 2015

The modelling method employed in this study has enabled us to identify the timing and spatial patterns of key LULC changes over ca. 85 year period. Overall, we found that 87% of semi-natural habitat was lost in Dorset, with the greatest losses occurring in neutral (99%) and calcareous (97%) grasslands. These results are consistent with other studies in Dorset (Hooftman and Bullock 2012) and across England and Wales (Fuller 1987; Ridding et al. 2015). By creating the time-series of LULC maps, we were able to determine that the greatest loss of calcareous and neutral grassland occurred in the period 1930–1950. This corresponds to the time where arable land increased most across Dorset, which is consistent with the period of agriculture intensification across Europe (Best and Coppock 1962; Stoate et al. 2001). The time-series also enhanced findings from Ridding et al. (2020) by revealing where the changes occurred spatially; largely to the west of the county and along the fertile band of chalk soil running south-west to north-east. The largest increase in improved grassland occurred between 1950 and 1980. Arable land, however decreased after the 1950s. This most likely reflects the shift in farming, whereby fewer fields were required for conversion after advances in mechanisation and chemical applications led to great increases in yield (Stoate et al. 2001). There were also a number of economic and political factors, including falls in prices for agricultural products. Since a number of land covers, for example heathland, did not change by much between 1930 and 1950 using our modelling methodology, it was difficult to identify a more exact time period of loss, however the generation of the 1980 LULC map revealed that by this point there had already been a considerable loss of heathland.

Uncertainty

This study found low levels of uncertainty associated with the methodology, with an average of over 90% of the hundred runs matching with the final 1950 and 1980 map outputs. This means we have confidence in the modelled placement of the majority of LULC types, suggesting the modelled factors and transition tables were useful in narrowing down appropriate locations for certain LULC types to occur. This highlights the importance of having comprehensive data which can be used to inform InVEST, as the habitat time-series dataset (Ridding et al. 2020) did in our study. There was slightly greater certainty for the 1980 map compared with the 1950 map, which may reflect that more significant modelled factors were identified for changes from 2015 to 1980 compared with 1930 to 1950, giving InVEST more information and thus confidence in the placement of increasing LULC types.

There were particular areas in Dorset that demonstrated higher levels of uncertainty, which were generally found around the southern and eastern areas of the region. There were areas of overlap along the southern coast towards the east on both the 1950 and 1980 maps. For 1950 this resulted in large amounts of arable land being predicted in these areas. It is likely that these areas were suitable for arable, including being flatter, having high soil fertility and a lower average temperature (significant modelled factors) and InVEST struggled to decide the exact 100 m × 100 m cells in which to position arable, so when the final modal map was created numerous arable cells were generated.

Conclusion

This study has shown that it is possible to generate a time-series of historical landscapes using a modelling method which involved the use of InVEST and detailed habitat time-series data to inform the model. To our knowledge this is the first time InVEST has been used reconstruct historical landscapes, rather than predict future scenarios (Gibson and Quinn 2017; Sharma et al. 2018). We have shown that the method produced accurate outputs, and demonstrates the importance of obtaining appropriate data to inform the model. The creation of this LULC time-series allowed spatial and temporal changes in LULC to be identified over multiple time periods. This builds on Hooftman and Bullock (2012) by revealing more accurately the timing of change for certain LULC types, for instance the greatest losses in neutral and calcareous grassland occurred in 1950, the period when arable land expanded the most. We also determined a high level of certainty in using the modelling method employed in this study. This is important to assess, but is often overlooked in other LULC prediction studies (Sharma et al. 2018). Although the generated maps are not suitable for performing fine-scale analysis, particularly where high levels of uncertainty were detected, they are however useful for looking at more general patterns at the landscape scale, including habitat fragmentation and changes in ecosystem service delivery. This can be useful for environmental managers and landscape planners for informing future land management plans, as well as conservation strategies such as restoration. The modelling methodology can be used to create historical landscapes in any situation, providing there is sufficient data to inform the InVEST model.

This study however, has also highlighted some of limitations of reconstructing historical LULC maps, even when a region has abundant data, as in this study. This is particularly relevant for LULC types which cover a small area or those which have little information on environmental or physical factors which inform where a LULC type should occur. Increasing the availability of relevant data would improve such mapping approaches. This has also been identified in other studies (Liping et al. 2018; Sharma et al. 2018). For instance, increasing the availability of temporal datasets would be very beneficial for historical mapping, since most data are often static in time, such as accessibility and distance to roads. Furthermore, the indirect factors which are often very influential on LULC change, including political or economic drivers, such as the market for agricultural goods or the introduction of a new policies, are currently not incorporated due to model limitations. The incorporation of such factors into modelling programs such as InVEST and the associated effect on accuracy is required.