Introduction

The estimation of coverage, i.e., the area proportion of a study region covered by a certain target class, is essential to many aspects of environmental monitoring on different spatial scales, ranging from forest cover of countries to lichen cover on rocks or applications in microscopy (Kaiser 1983; Barabesi and Marcheselli 2008). Information on the area of different forest types and their changes over time is important also for Measuring, Reporting, and Verification for REDD+. In context of recent common agricultural policy within the European Community (CAP), the coverage of certain landscape elements like single trees, bushes, or hedgerows in cultivated farmland is needed (European Commission 2011). Estimates of the area of different land cover classes are relevant for nature conservation planning and the management of woody biomass resources in context of renewable energy (Bull 1997; McKendry 2002; Berndes et al. 2003). Depending on the landscape structure and the predominant land use, biomass potentials from trees outside forest are a significant source of raw material. In many regions of the world, biomass density and related carbon stocks per area outside forest are sometimes even higher than in degraded areas that still comply with a forest definition (Kleinn 2000; Bellefontaine et al. 2002; Fischer et al. 2011; Baffetta, Corona and Fattorini 2011a).

The most frequently used information sources for assessments of vegetation area and coverage are remote sensing imagery and field sampling. Frequently, both sources are used together in an integrated approach: while remote sensing imagery allows the production of wall-to-wall maps, probabilistic sampling offers the possibility to derive design-based inference on many ecologically variables. In this study, we focus on sampling techniques applied to aerial photographs. Standard sampling techniques and plot designs as applied for example in forest inventories typically fail for the assessment of woody resources outside the forest, mainly because of the different spatial distribution and relative sparseness of woody vegetation embedded in the agricultural landscape (Kleinn 2000, 2002). As a consequence, the development of new sampling protocols that are adapted towards the recent information need has become a field of research (Ramezani and Holm 2011; Baffetta, Fattorini and Corona 2011b). Depending on the goals, sample surveys cannot only be more cost efficient than complete mapping but also of higher accuracy since data collection can be implemented in a more detailed and careful manner if restricted to smaller areas (Freese 1962; Raj 1968; Corona et al. 2004).

Line intersect sampling, often abbreviated as LIS, is a well-known sampling protocol that has been applied to a wide range of target objects including hedges, roads, forest edge, coarse woody debris in forests, landscape metrics, etc. (Warren and Olsen 1964; Matérn 1964; DeVries 1986; Battles et al. 1996; Ringvall and Ståhl 1999; Corona et al 2004; Esseen et al. 2006; Woodall and Nagel 2006; Affleck 2010; Ramezani and Holm 2011; Kleinn et al. 2011). The literature on line sampling is fairly inconsistent in regard to a clear distinction between line intersect sampling and line intercept sampling, and often both are used synonymously. Even if there are notable differences in how observations are obtained from linear observation units, both alternatives can be analyzed by a unifying framework (Kaiser 1983; Valentine et al. 2001).

Line intersect sampling (LIS) is a field protocol that is often applied such that lines or transects are used to decide about the inclusion of elements into a probabilistic sample. The variable of interest is measured on those elements that are intersected by a transect line, while their design-based inclusion probability depends on the element’s width and orientation (Affleck et al. 2005). If the population parameter of interest is the total length of linear features, like, for example, the length of the forest boundary or roads, no measurement other than simple count of intersections is needed (Matérn 1964; Kleinn et al. 2011). The individual probability of intersection (viz. the respective inclusion probability) is a function of the transect length. A probabilistic estimate of the total length per unit area can then be obtained with the Horvitz–Thompson estimator. For a comprehensive overview of line intersect sampling, see for example Gregoire and Valentine (2008), Kaiser (1983), or Barabesi and Marcheselli (2008).

With line intercept sampling, the intercept length of a line falling into a certain target class (e.g., forest crown cover in a landscape) is the basis to obtain an unbiased estimate of the fraction of this class on the total area of the sampling frame. The estimated fraction of the target class multiplied with the area size of the sampling frame gives an unbiased estimate of its total coverage. It is obvious that sampling does not refer to a discrete population of disjoint elements but to a continuous population in this case (Cochran 1977). Obviously, both approaches of line sampling can be integrated and applied simultaneously, e.g., in order to produce estimates on forest cover and the length of the forest boundary, like frequently done in large area forest inventories.

The purpose of this study is a performance comparison of different sampling schemes based on plots, lines, and clusters of points for estimating coverage of target land-use classes in the study region from aerial photographs. Special focus is laid on the estimation of coverage of different classes of woody vegetation outside forest (WOF).

Methods

Sampling scheme

The population under study is the infinite population of points constituting the study area of the administrative district of Göttingen in Southern Lower Saxony, Germany, with a total size of 1,117.7 km2. Sampling locations are selected from this continuum in accordance with a systematic square sampling grid of 2 × 2 km. Starting point and orientation (North–South) of the grid are aligned to the sampling design of the national forest inventory (BWI) of Germany (BMELV 2007) that has a base grid of 4 × 4 km. While the German NFI selects forest plots only, we extended the observations also to non-forest areas. The resulting sample size (number of sample locations) for the study area is n = 279 (Fig. 1).

Fig. 1
figure 1

Administrative district of Göttingen in Southern Lower Saxony, Germany overlaid with a systematic sampling grid of 2 × 2 km. At each of the 297 sampling locations, a quadratic sample plot of 400 × 400 m was established

A complete cover of digital aerial imagery with a ground resolution of 20 cm was acquired from the land survey administration for the district of Göttingen for the year 2010.

Land cover was classified according to the biotope survey guide of the German Federal Agency of Nature Conservation (BfN 2002). At the highest level of detail, this hierarchical classification scheme distinguishes more than 600 different biotope and land cover classes (LCC). As a clear distinction of all biotopes is not always possible from aerial photographs, many classes were assigned a coarser level of aggregation that was adapted to the target of our study. We used a higher level of detail in the classification of trees outside forest, hedgerows, or groves than for urban areas, water bodies, or agricultural areas (Table 1).

Table 1 Overview of biotopes and land cover classes (LCC) adopted from the BfN-Survey guide (BfN 2002)

The FAO forest definition is widely adopted in many national forest inventories, and it is the definition to which the international forest reporting of FAO refers to. According to this definition, forest land is characterized by a tree crown cover (or equivalent stocking level) of more than 10 % and area of more than 0.5 ha. The trees should be able to reach a minimum height of 5 m at maturity in situ. Further, forest formations must have a minimum width of 20 m.

Plot sampling

At each of the 279 selected sampling points, a quadratic aerial photo plot of 400 × 400 m (16 ha) was established, so that in total an area of 4,464 ha was observed, which is equivalent to a sampling intensity of 3.9 %. All land cover types inside these sample quadrates were delineated in a GIS environment, and the resulting polygons were subsequently classified according to the LCC.

Line intercept sampling

Based on the completely mapped land cover classes inside the sample of aerial photo plots, we performed five different variations of LIS at all sampling locations. The different linear observation units were superimposed onto the fully delineated land cover polygons and the respective land cover classes were assigned to the resulting line intercepts by standard GIS procedures.

As basic approach, a single line of 200 m (L1) and 400 m (L2) length was centered at each sampling location in random orientation (Fig. 2).

Fig. 2
figure 2

Different configuration of observation designs. L1 a single line of 200 m; L2 a single line of 400 m; LC1 a cluster of four lines, each of 100 m; LC2 a cluster of four lines, each of 200 m; C the circumference of a circle with 100 m radius. P is a cluster of the eight start- and endpoints of the line segments in design LC1

Further, we adopted clusters of four line segments, each of 100 m (CL1) and 200 m (LC2) length and random orientation (Fig. 3). As the single line segments within such a cluster are not selected independently, they are treated as one single observation for estimation. As a further design alternative, the circumference line of a circle with radius 100 m was used.

Fig. 3
figure 3

One aerial photo plot of 400 × 400 m (F) showing different landscape elements (left) and delineated land cover classes (LCC coded following BfN 2002) overlain with a cluster of four sampling lines implemented as design LC2 (right)

Clusters of points

In addition to square plots and the different linear observation units, we also tested the efficiency of estimating land cover classes by using clusters of points. Therefore, we classified only the start- and endpoints of the four line segments in design LC1 (Fig. 2) which makes a cluster of eight points on each grid point. Contrary to the continuous scale of observations that result from the line intercepts, this design allows observing a discrete number of nine area proportions only (0/8, 1/8, …, 8/8).

Table 2 gives an overview about the size and shape of applied observation units.

Table 2 Configuration of different observation designs

Estimation

We estimate the coverage of land cover classes (LCC) within a bounded two-dimensional region of known area by means of the sample proportion of these classes observed either as (1) their coverage observed in fixed area sample plots, (2) their proportion intercepted by randomly oriented lines of fixed length, and (3) the proportion of points within a cluster falling into the respective classes. Each selected observation unit produces one observation of coverage for each LCC, taking on values between 0 and 1. Clusters composed of multiple unconnected line segments or points are treated as one observation unit for estimation. It should be noted that our interest is in estimating the total coverage of LCCs only and not in any mean characteristics, e.g., the mean density or size of distinct land cover patches in the area under study.

We consider a continuous population of points p constituting the whole study region A with known horizontal area A. For each land cover class j, we introduce the survey variable y j(p) which is equal to 1 if p belongs to the class j and 0 otherwise. Then the coverage of the class j can be rigorously defined as A j /A.

The line intercept design involves selecting a linear transect of fixed length l with random orientation, centered at a selected sampling location i. The proportion P j covered by the area of class j is estimated from the sample proportion p j by

$$ {\widehat{p}}_{ij}=\frac{l_{ij}}{l_i}, $$

where l ij represents the intercept length of the transect falling in A j . The length l ij may result from multiple partial intercepts whenever a sample line intersects a set A j more than one time. As \( {\widehat{p}}_{ij} \) is an unbiased estimator of the population proportion A j /A, the total area of class j at sample location i is estimated as \( {\widehat{A}}_{ij}={\widehat{p}}_{ij}A \). Alternatively, if the interest is in the relative coverage of A j only, the constant area of A can be ignored. For the clusters of points implemented as design P, the relative share of a land cover is estimated from the proportion of points per cluster falling in the respective class as \( {\widehat{p}}_{ij}={n}_{ij}/8 \).

We adopt a systematic distribution of the sampling locations in our study (Fig. 1). An unbiased estimator for the area of coverage class j from n samples is given by

$$ {\widehat{A}}_j=\frac{1}{n}{\displaystyle \sum {}_{i=1}^n}{\widehat{A}}_{ij}, $$

and the variance is estimated—though not unbiasedly—by:

$$ \widehat{\mathrm{V}}\left({\widehat{A}}_j\right)=\frac{1}{n-1}{\displaystyle \sum {}_{i=1}^n}{\left({\widehat{A}}_j-{\widehat{A}}_{ij}\right)}^2. $$

We apply this simple random sampling estimator to systematic sampling, expecting that it is usually upwards biased and produces a conservative estimate of variance (Gregoire and Valentine 2008). In order to compare the statistical precision resulting from the different observation designs, we estimate the standard error of estimates as \( \operatorname{SE}={\left(\widehat{\mathrm{V}}\left({\widehat{A}}_j\right)/n\right)}^{-\raisebox{1ex}{$1$}\!\left/ \!\raisebox{-1ex}{$2$}\right.} \) and the relative standard error as \( \operatorname{RSE}=\operatorname{SE}/{\widehat{A}}_j\times 100\% \).

If sampling locations are selected unrestrictedly in the study area, there is concern about edge effects as plots, lines, or point clusters might overlap the boundary of the sampling frame (Gregoire 1982; Gregoire and Monkevich 1994; Gregoire and Valentine 2008). While edge correction techniques are relatively straightforward for single lines, corrections for clusters of lines are more complex. However, at the scale of our study, possible edge effects are of less concern and were not considered. Only six aerial photo plots partially overlap the boundary that is a political boundary and not related to any changes of the configuration of landscape elements.

Results

Table 3 summarizes the results of our sampling study for the sample size of n = 279 and all observation designs. As the implemented observation designs are design-unbiased, the differences in estimated coverage are a result of sampling errors. As to be expected, the complete delineation of land cover classes inside the aerial photo plots of 16 ha each (approach F as of Table 2) results in the highest precision of estimated area for all land cover classes. However, the differences in precision of estimates in terms of the estimated relative standard errors (RSE) are minor between the different observation designs. They are minor especially for those classes that cover a relatively large portion of the total landscape. For correct interpretation, it should be noted that the estimated RSE given in Table 3 is a relative error for a proportion.

Table 3 Estimated coverage of land cover classes (LCC) in the study area (in %) for each observation design and constant sample size of n = 279

Taking coverage of forest (resp. forest crown cover) as example, it is estimated to be 35.87 ± 6.8 % (viz. a 95 % confidence interval between 33.4 and 38.3 %) of the total study area based on the complete delineation of aerial photo plots and 35.88 ± 6.8 % using the line cluster LC2. In both cases, the estimated RSE is lower than 7%. For rare classes, e.g., the coverage of single tree crowns in the landscape, the precision of estimates is much lower, as expected. Their total crown coverage in the study area is estimated to be 0.14 % with a RSE of 20.7 % from completely mapped plots and 0.12 % with a RSE of 27.9 % from design LC2. The different performance of observation designs is in general more obvious for those classes that are relatively rare, like single trees or bushes and shrubs (Fig. 4).

Fig. 4
figure 4

Estimated relative standard errors (RSE) of coverage per land cover class (in ascending order) for the different observation designs. The total size of each observation unit is given in the legend. Bars show the estimated coverage of LCCs based on design F

All designs estimate the total coverage of crop fields, and in a forest with RSE lower than 10 %, the differences in precision are very small. Furthermore, all designs estimate the coverage of aggregated woody vegetation outside forest (WOF) with RSE between 10 and 20 %. The line cluster LC2, which has the longest total line length (800 m) of the LIS designs, has the second best overall performance for all relevant classes. In general, the order of RSE follows the observed line length in most cases. If we calculate a mean RSE over all land cover classes, the descending order of overall performance is F, LC2, C, P, LC1, L2, and L1.

Comparing designs L2 and LC1 that are of same total line length (400 m), the difference between a single long line and a cluster of shorter spread out line segments becomes obvious. The line cluster performs better for most classes. Exception is the coverage of single tree crowns, where a single line of same total length (L2) and even a shorter line (L1) results in a higher precision.

The performance of the point cluster (P) is even better than for some of the LIS designs for those classes that have a relatively high coverage. For the estimation of forest cover, the RSE is estimated to be 6.9 %, for that of cropland 5.6 %, and for that of grassland 10.3 %, respectively. The precision of estimates is therefore only marginally lower than with the full delineation F or design LC2 for these classes. However, the point clusters fail in case of rare elements. The coverage of single tree crowns is estimated to be 0 % with design P, as no single tree crown was hit by a point.

Discussion

In our study, we compare a plot design and different configurations of line intercept and point cluster designs applied in high-resolution aerial imagery with constant sample size of n = 279. As the applied estimators yield estimates of variance that are expected to be upwards biased to an unknown extent for systematic sampling, small differences in the performance between the alternative observation designs should not be overinterpreted. The efforts to obtain a single observation \( {\widehat{p}}_{ij} \) are quite different and dependent on the total size of the observation units shown in Table 2. As sampling was performed in aerial photographs only, considerations about possible field implementation of the different observation units were not taken into account. A complete delineation and classification of all land cover polygons in aerial photographs in all quadrates (4,464 ha in total) is a huge effort and very costly. Cutting line segments at their intersections with the different land-use classes in a GIS environment is, depending on the total length of these lines, much faster and straightforward to implement (Ramezani and Holm 2011). The point cluster design P is by far the most basic, fastest, and cheapest approach among the alternatives. Kleinn (1991, 1994) compared line intercept sampling and point clusters of the start- and endpoints of the line segments alone in a simulation study on area estimation. There appear to be few studies comparing the performance of these two alternatives in statistical and practical terms. None of the approaches is free of interpretation errors. Even the full delineation of landscape elements in the 400 m × 400 m is prone to errors. However, an argument for sampling is that such errors occur on a lower scale and that the selective interpretation on smaller areas can be implemented in a more detailed and careful manner (Eysn et al. 2012).

An interesting advantage of the point clusters is that it can be, contrary to the other observation designs, implemented without any GIS expertise. If the points are visualized and overlain on any available imagery with sufficient resolution, the observations can be obtained by simply counting the points per cluster and LCC. In case of our study area, it would be possible to derive the same estimates also based on freely available imagery in virtual globes like, e.g., Google Earth, as high-resolution imagery is available for our study area and the visual discrimination of land cover classes is possible.

One of the most important guiding principles of planning any kind of natural resources inventory is the conformity with the objectives of the study. Constraints lead the planner to a design that offers the best anticipated precision yet satisfying the constraints on time, costs, and logistics. This efficiency is given as the relation between invested resources and resulting precision of estimates. Despite the differences in complexity of observation designs under study, the resulting estimated standard errors (RSE) of the coverage of different land cover classes are minor in many cases. Especially for those classes that cover relatively large proportions of the sampling frame, they seem even negligible. Our results show that an estimate of agricultural crop land for example can be obtained with nearly similar precision from either a time-consuming fully mapped plot design on a total area of 279 × 16 ha (4,464 ha) or a very simple classification of eight points per plot (2,232 points). The same holds for the estimation of forest crown cover and the area of grassland.

Kleinn (1991) compared the precision of estimates for forest area by using either lines as observation units or the start- and endpoints of these lines only. He concluded that the start- and endpoints deliver comparable or even higher precision as observations from the total lines, while the difference in performance was related to the fragmentation of the land cover classes and the length of the line samples relative to this fragmentation. The results of our empirical study lead to similar conclusions for the main land cover classes. However, we must not generalize this observation as it might be dependent on how well the observation units are adapted and optimized towards the special target variables of an observational study and the landscape structure. Our results show that clusters of points (P) might have a huge potential to increase the overall efficiency of a sampling study if they are well adapted to the spatial structure and patch size of the target classes. It is interesting to note that their better performance for some classes is due to assessing less redundant information at each sampling location. Therefore, applying well-adapted clusters of points might be a win-win situation, as uncertainty and costs might be reduced at the same time.

Both line cluster configurations are superior compared to single lines for most classes. The rationale for selecting multiple spread out line segments rather than one single longer line is that they are expected to capture more of the given variability inside each sampling unit, reducing the variability between sample units. However, for single trees, our results show that single longer lines perform better, where the reason might be that the mean distance between these rare objects is quite large.

The circular line C that is a very compact form shows no advantages over straight lines for the estimation of classes that cover a large proportion of the landscape, even if the total length of the perimeter is relatively long. However, as shown in Fig. 4, it is superior to the line clusters for relatively small and dispersed landscape elements.

Conclusions

In general, we can conclude that manual wall-to-wall mapping of landscape elements is neither necessary nor efficient if the goal is to estimate land cover classes that have a relatively high coverage. Probabilistic sampling and design-based inference can be an attractive alternative to derive estimates with relatively high precision if sampling and observation design are well adapted to the target variables. Contrary to model-based remote sensing analysis that needs a lot of expertise, training, and capacity, sampling is easy to implement once a suitable sampling protocol is available. However, in ecological monitoring the less common land cover classes are often of main interest. Therefore, a combination of different approaches, e.g., mapping rare landscape elements like single trees or bushes while using point clusters for larger classes, might increase the overall efficiency. For the special case of rare cover types, adaptive plot designs might be an interesting alternative (Yang et al. 2011).

We further conclude that the good performance of point clusters for classes with relatively high coverage in our and other studies (Kleinn 1991; 1994) should be motivation for further research on this approach. Observations on points can be derived from or combined with any LIS approach and might help to increase the overall efficiency of line sampling in general.