Introduction

The three-dimensional structure of forests is an important indicator of biodiversity and carbon dynamics in terrestrial ecosystems (McElhinny et al. 2005; Fahey et al. 2010). Forests with a variety of structural components likely provide a wide range of habitats and resources for species (McElhinny et al. 2005), resulting in a positive correlation between the structural complexity of forests and biodiversity (Mac Nally et al. 2001; Tanabe et al. 2001). In addition, the structure of forests is an integral part of the global carbon cycle as tree volume and density determine above-ground carbon storage (Houghton et al. 2009) and foliage amounts drive the sequestration of carbon from the atmosphere into the terrestrial biosphere (Schulze et al. 2002).

Of the estimated 861 ± 66 petagrams of carbon stored in forests, 32 % is reported to be stored in the boreal (Pan et al. 2011). In addition to containing a large portion of the world’s forests, the boreal is expected to be among the biomes most impacted by a changing climate (Parry et al. 2007). To accurately forecast how climate change will affect biodiversity and carbon dynamics in boreal ecosystems, we require an improved quantification of the natural and anthropogenic factors that control boreal forest structure and how these factors are changing. Disturbance, site productivity, species composition, and forest management are the main drivers of structure in boreal forests (Spies 1998; Boucher et al. 2006; Boisvenue and Running 2006; Brassard and Chen 2006). In the northern boreal of Canada where most forests are not subject to management activities (Andrew et al. 2012), our knowledge of the impact of these factors on structure is limited by a lack of plots or inventory data (Gillis et al. 2005), preventing a clear understanding of how forest structure will be altered by a changing climate.

Disturbance, principally fire, is the dominant driver of stand age and structure in Canadian boreal forests (Kurz and Apps 1999; Bond-Lamberty et al. 2007; Amiro et al. 2009). The time between fires, known as the fire cycle, increases from west to east in the Canadian boreal and is controlled primarily by climate and the probability of lightning strikes (Brassard and Chen 2006). Approximately 2 million hectares of forests are burned annually in Canada (Stocks et al. 2002), with direct carbon emissions estimated to be an average of 27 Tg of carbon year−1 between 1959 and 1999 in Canada (Amiro et al. 2001). Stand-replacing fires release most of the carbon stored in above-ground biomass to the atmosphere, while the time between fires impacts the accumulation of carbon back into a forest (Kasischke et al. 1995; Amiro et al. 2001). Forest stands generally transition through time from an even- to an uneven-aged structure (Brassard et al. 2008; Larson et al. 2008; Bradford and Kastendick 2010), resulting in stands becoming more structurally complex as time since fire increases. While fire is the dominant disturbance agent in boreal forests, non-stand-replacing disturbances, such as windthrow and insect outbreaks, are also critical to the formation of canopy gaps and lead to more structurally diverse forest stands (Brassard and Chen 2006, Chen and Popadiouk 2002). While localized insect outbreaks play a role in gap formation, regional outbreaks can have significant effects on forest structure and carbon dynamics. For instance, the current mountain pine beetle outbreak in British Columbia killed an estimated 692 million m3 of mature merchantable pine between 1999 and 2010 (Walton 2011), converting the affected forests from a small carbon sink to a large carbon source (Kurz et al. 2008a). Projected range expansion of the mountain pine beetle into the boreal could lead to increased disturbance levels in the boreal through the addition of a new disturbance agent (Safranyik et al. 2010).

Site productivity describes the capacity for growth and development within a stand and plays a critical role in determining forest structure between disturbance events (Boucher et al. 2006). Solar radiation, temperature, water availability, and soil nutrient availability are the basic drivers of productivity; however, foliage amounts and light use efficiency ultimately determine the rate at which carbon can be sequestered into vegetation (Schulze et al. 2002; Running et al. 2004; Boisvenue and Running 2006). Temperature is the main limiting factor to productivity across most of the Canadian boreal, with rates of photosynthesis and decomposition decreasing from the southern to northern boreal in response to decreasing temperature (Churkina and Running 1998). The latitudinal gradient in temperature results in a latitudinal gradient in productivity (Churkina and Running 1998), allowing southern boreal stands to accumulate more biomass between disturbance events than less productive stands further north. Forests have been found to reach an uneven-aged structure faster on higher productivity sites in the boreal (Boucher et al. 2006; Larson et al. 2008), suggesting that southern boreal stands will also become structurally complex sooner than northern boreal stands following a stand-replacing disturbance. In addition, insufficient resources at low productivity sites can restrict maximum tree dimensions, limiting tree size diversity and structural complexity (Boucher et al. 2006).

Thirdly, species composition impacts structure in boreal forests as stand initiating deciduous species are often replaced over time by shade-tolerant coniferous species (Bergeron 2000; Brassard and Chen 2006; Taylor and Chen 2011). Paré and Bergeron (1995) found that total above-ground biomass along a chronosequence in Québec strongly correlated to the presence of Populus tremuloides (trembling aspen) as trembling aspen reached heights unmatched by other boreal species. Therefore, the transition from deciduous to coniferous dominance may be accompanied by a decrease in carbon storage where trembling aspen is in high abundance. The transition from deciduous to coniferous dominance can increase structural complexity with the development of multi-layered and multi-aged canopies, often accompanied by an increase in infestation by spruce budworm (Frelich and Reich 1995; Kneeshaw and Bergeron 1998). Canopy gaps formed by windthrow and insect outbreaks help maintain a deciduous component in older boreal stands (Taylor and Chen 2011), increasing the diversity of tree species and sizes. Older stands consisting purely of late successional conifers can be less structurally diverse than mixedwood stands that maintain a deciduous component (Paré and Bergeron 1995; Brassard et al. 2008), suggesting that structural complexity will not continuously increase with age.

Lastly, active management of forest resources impacts structure with between 700,000 and 1,000,000 ha of forests harvested annually in Canada over the past 20 years (Masek et al. 2011). Clear-cutting is the most common form of harvesting, where contiguous groups of trees are removed and carbon is transferred from above-ground biomass into the forestry sector (Kurz et al. 2009). In most managed forests where sufficient time has elapsed for a second harvest, the rotation time between clear-cuts is shorter than the natural fire cycle, preventing the development of uneven-aged structurally complex systems common in later stages of succession (Bergeron et al. 2004).

Increased fire frequency and intensity (Flannigan et al. 2005) as well as more favorable conditions for insect pests (Carroll et al. 2003; Safranyik et al. 2010) are projected for most Canadian boreal forests, potentially decreasing carbon storage (Thornley and Cannell 2004; Kurz et al. 2008b) and structural complexity (Kneeshaw and Gauthier 2003) in boreal forest ecosystems. Conversely, rising temperatures suggest increased productivity at high latitudes where precipitation is not a limiting factor (Boisvenue and Running 2006), which will likely increase the amount of carbon sequestrated and stored in boreal forests (Denman et al. 2007). While changes in productivity are expected to alter boreal forest structure, the cost and difficulty of collecting inventory data over large areas (Gillis et al. 2005; Wulder et al. 2007; Masek et al. 2011) has limited our quantification of the relationship between productivity and structure. To forecast the coupled effects of changing disturbance regimes and increased productivity on forest structure, the link between productivity and structure in the boreal must be better characterized.

Light Detection and Ranging (Lidar), an active remote sensing technology, provides an opportunity to characterize forest structure over larger spatial scales and at higher sampling frequencies than with conventional field methods (Dubayah and Drake 2000; Lefsky et al. 2002; Lim et al. 2003; Wulder 2008a; Vierling et al. 2011). Lidar systems measure the distance to objects by emitting pulses of near-infrared laser energy and recording the timing and intensity of pulse returns (Wehr and Lohr 1999). The three-dimensional coordinates of objects are derived by coupling these distance measurements with global positioning systems and an inertial measurement unit (Wehr and Lohr 1999). When millions of Lidar pulses are emitted over forest canopies (e.g., >1 pulse/m2), discrete return Lidar systems ultimately produce a cloud of points describing the structure of forest stands (Wehr and Lohr 1999; Lim et al. 2003). Most structural information in a Lidar point cloud can be summarized into three basic attributes: canopy height, canopy cover, and stand structural complexity (Lefsky et al. 2005; Kane et al. 2010b). Canopy height and cover can be estimated directly from a point cloud (Wulder et al. 2008a), while structural complexity can be inferred by the variation in point height (Zimble et al. 2003).

In this paper, we investigate how forest structure across the Canadian boreal forest, as measured by Lidar remote sensing, relates to forest productivity, one of the key, yet poorly quantified, drivers of structure. We summarize the structure of forests using Lidar measures of canopy height, canopy cover, and structural complexity for nearly 25,000 km of airborne Lidar data across the boreal, and compare these attributes to gross primary production (GPP) estimates from the MODerate-resolution Imaging Spectroradiometer (MODIS) and climate variables for six boreal ecozones. To reduce the impact of recent disturbance and management on the observed structure, we use information on land cover, fire history, anthropogenic change, and the presence of roads to restrict our study to mature unmanaged forest stands. Once stratified, we assess the relationship between MODIS GPP and Lidar-derived forest structure metrics across Canadian boreal forests.

Methods

Data sources

Lidar data

In the summer of 2010, the Canadian Forest Service (CFS) working with Applied Geomatics Research Group and the Canadian Consortium for LiDAR Environmental Applications Research (C-CLEAR) acquired 34 transects of small-footprint discrete return airborne Lidar data, spanning from Newfoundland in the east to the Yukon in the west (Wulder et al. 2012). The 34 transects totaled 24,286 km in length with a minimum swath width of 400 m and a nominal pulse density of approximately 2.8 returns/m2 (Fig. 1a). Data were collected using an Optech ALTM 3100 discrete return sensor between the altitudes of 450 – 1,900 m with a fixed scan angle of 15° and a pulse repetition frequency of 70 kHz for most transects (Wulder et al. 2012). The average transect length was approximately 700 km, largely determined by the location of suitable airports (for survey details, see Hopkinson et al. 2011). Customized software tools were developed to pre-process the long transects of Lidar point data, including the classification of points into ground and non-ground returns (Hopkinson et al. 2011).

Fig. 1
figure 1

a Path of 34 small-footprint Lidar transects flown by CFS in 2010. b Average annual MODIS GPP from 2001 to 2010. c Percent of each 1-km MODIS cell classified as forest by the EOSD. d Presence or absence of fire, roads, or anthropogenic change within each 1-km MODIS cell. e Selected mature unmanaged MODIS cells shaded by Lidar-derived canopy cover. f Number of MODIS cells selected for analysis within each boreal ecozone

The Lidar dataset, which contains over 18 billion discrete return points, was divided into 25- by 25-m plots, and a suite of Lidar metrics was calculated for each plot in FUSION (McGaughey 2012), a free software package produced by the US Forest Service for generating forest relevant metrics from Lidar data (Wulder et al. 2012). Lidar metrics describe the distribution and density of Lidar points within a point cloud, allowing plot-level point clouds to be summarized into relatively few structural metrics. Lefsky et al. (2005) and Kane et al. (2010b) found that most structural information in Lidar data could be summarized with a small set of metrics that describe the height, canopy cover, and structural complexity of forests. As a result of these findings, the 95th height percentile (canopy height), percentage of Lidar first returns above 2 m (canopy cover), and the coefficient of variation of return height (structural complexity) were selected as forest structure indicators.

Stand height

Height percentiles describe the cumulative height distribution of Lidar returns and correlate strongly to plot-level inventory attributes such as mean tree height, dominant tree height, and stand volume (Wulder et al. 2008a). In Norway, Næsset (2004) explained 77–92 % of the variation in Lorey’s mean tree height using only height percentiles, while Wulder et al. (2012) explained 83 % of the variation in Lorey’s mean height in the Canadian boreal with the 95th height percentile alone. Here, the 95th height percentile was selected over the maximum return height or 99th height percentile as these latter metrics can provide unrepresentative estimates of stand height in the presence of physical (e.g., birds, power lines) or atmospheric anomalies (Magnussen and Boudewyn 1998; Kane et al. 2010b). The 95th percentile was calculated using only first returns above 2 m.

Canopy cover

Vegetation cover within any vertical position of a canopy can be estimated by calculating the ratio of Lidar pulses intercepted by a canopy layer to the total number of returns that entered the layer with well-established accuracy (Wulder et al. 2008). Andersen (2009) used the percentage of first returns above 2 m to assess canopy closer in boreal Alaska, while Solberg et al. (2006) used the percentage of returns above 1 m to assess insect defoliation in Norway. Likewise, Morsdorf et al (2006) found a strong correspondence (R 2 = 0.73) between Lidar and hemispherical photograph derived estimates of vegetation cover in Swiss mountain pine forests. Canopy cover was calculated as the ratio of first returns above 2 m to the total number of first returns, which conforms closely to most field definitions of canopy cover (Jennings et al. 1999; USDA Forest Service 2003).

Structural complexity

Lastly, the coefficient of variation (CV) of return height was selected as an indicator of stand structural complexity as variability in return height within a forest canopy will relate to the variability of structural elements within the canopy. Zimble et al. (2003) found that single-story canopies had a lower CV of return height than diverse multistory canopies in central Idaho forests. The standard deviation of return height tends to increase with stand height regardless of stand complexity (Kane et al. 2010a), making the CV a more useful index for comparing complexity across varying stand heights. While the 95th height percentile and cover above 2 m relate directly to easily measured components of a stand, the CV of return height serves only as an indicator of structural complexity, as complexity is difficult to define in the field. Therefore, the results should be interpreted with care. The CV was also calculated using first returns above 2 m.

These plot-level metrics, in addition to the other standard metrics calculated in FUSION, were stored in a PostgreSQL database (http://www.postgresql.org/; see Wulder et al. 2012 for a complete list of calculated metrics). From the over 18 billion Lidar points collected during the national transects campaign, Lidar metrics were generated for more than 17 million 25- by 25-m plots.

MODIS GPP

The MODIS GPP algorithm provides 8-day estimates of GPP globally at 1-km spatial resolution. Derived following the principles of Monteith (1972), GPP is determined for each 1-km cell as a function of the absorbed photosynthetically active radiation and the light-use efficiency (LUE) of vegetation:

$$ GPP = {{\varepsilon }_{{\max }}} \times 0.45 \times SWrad \times FPAR \times fVPD \times f{{T}_{{\min }}} $$

where ε max is the maximum LUE; SWrad is the incident short-wave solar radiation, multiplied by 0.45 to derive photosynthetically active radiation (PAR); FPAR is the fraction of incident PAR that is absorbed by vegetation; and fVPD and fTmin are reductions in LUE from high vapor pressure deficits (VPD) that lead to water stress in plants and low temperatures that limit plant function (Zhao and Running 2010).

The algorithm defines ε max by vegetation type according to the MODIS Land Cover Type product (MOD12Q1, Friedl et al. 2010). Daily meteorological data are used to calculate minimum daily temperature (T min), VPD, and SWrad (Zhao and Running 2010). FPAR is determined using the 1-km MODIS FPAR product (MOD15A2, Myneni et al. 2011), which is computed from atmospherically corrected MODIS surface reflectances.

The MODIS GPP algorithm has been implemented in NASA’s MOD17 product to provide 8-day and annual estimates of GPP from 2000 to 2011 (Running et al. 2004). Heinsch et al. (2006) showed that annual MODIS GPP (MOD17A3) had a relatively strong correlation to annual flux tower estimates of GPP across North America (r = 0.859 ± 0.173), but overestimated the tower estimates at most sites (relative error = 24 %). A re-processed version of MOD17A3, which addresses cloud and aerosol contamination issues (Zhao and Running 2010), was obtained for this analysis (available at: ftp://ftp.ntsg.umt.edu/pub/MODIS/Mirror/MOD17_Science_2010/). As inter-annual variability and temporal trends exist within these data (Zhao and Running 2010), GPP estimates from a single year are likely unrepresentative of long-term forest productivity. Therefore, the annual GPP products were compiled into a 10-year average (2001–2010), serving as a long-term estimate of productivity to relate to the Lidar-derived structural metrics. All processing in this analysis was then performed on the 1-km MODIS sinusoidal grid.

Climate data

Minimum annual temperature (MAT) and total annual precipitation (TAP) data for North America were obtained from the Pacific Climate Impacts Consortium (http://pacificclimate.org/tools-and-data/datasets). These climate datasets were derived at 32-km spatial resolution from 1979 to 2010 by the National Centers for Environmental Prediction North American Regional Reanalysis project (Mesinger et al. 2006). A natural neighbor interpolation approach was used to produce annual maps of MAT and TAP on the 1-km MODIS sinusoidal grid. The annual maps were averaged together to derive average MAT and TAP for 1979–2010.

Additional datasets

Land cover was obtained from the Earth Observation for Sustainable Development of Forests (EOSD) program led by the CFS (Wulder et al. 2008b). The EOSD is a 25-m spatial resolution land cover classification of the forested ecozones of Canada derived from Landsat-7 Enhanced Thematic Mapper Plus (ETM+) images (circa 2000) and consists of 23 land cover classes, including 9 forest classes (coniferous, mixedwood, and broadleaf/dense, open and sparse). These nine forest classes were used to estimate the forested percentage of each 1-km MODIS cell (Fig. 1c). In addition, the 500-m MODIS Land Cover Type product (MOD12Q1, Friedl et al. 2010) was obtained from 2001 to 2010 to compare against the EOSD classification. All classes matching the EOSD definition of a forest (i.e., >10 % tree covered) according to the University of Maryland classification scheme (Friedl et al. 2010) were selected and used to calculate the forested percentage of each 1-km cell in each year.

Fire, road, and anthropogenic disturbance layers were used to identify 1-km MODIS cells that potentially contained recent disturbances (Fig. 1d). The 2010 Canadian National Forest Database (CNFDB, Canadian Forest Service 2010) is a collection of fire polygons recorded by provincial and territorial fire management agencies and Parks Canada. While fire records in the CNFDB date back to 1917 in British Columbia, the oldest recorded fire to intersect a CFS Lidar transect was in 1941. The methods for recording fires have changed with time and vary by agency, ranging from sketches of fire boundaries to the interpretation of aerial photography and the classification of satellite imagery.

The 2010 Road Network File is a compilation of all Canadian roads recorded in Statistics Canada’s National Geographic Database (Statistics Canada 2010). In this analysis, the Road Network File acts as an indicator of forest management: if a 1-km MODIS cell contains a road, then the forests within that cell are potentially managed. Logging roads that provide access to managed forests from existing roads may be absent from the Road Network File. Therefore, a 1-km cell was flagged as containing a road if one existed in a neighboring cell (3 × 3 cell window).

Lastly, Global Forest Watch Canada analyzed Landsat data (30-m spatial resolution) to produce anthropogenic change maps for areas in Nova Scotia (Cheng and Lee 2009), Saskatchewan and Manitoba (Stanojevic et al. 2006a), Ontario (Cheng and Lee 2008), Québec (Stanojevic et al. 2006b), and British Columbia (Lee and Gysbers 2008). The major anthropogenic changes identified and mapped in these studies include development, clear-cutting, road construction, agricultural clearing, reservoir construction, and petroleum and natural gas exploration (Stanojevic et al. 2006b). The areas mapped by Global Forest Watch Canada do not cover the entire boreal, placing more importance on the Road Network File to identify potentially managed and anthropogenic disturbed forests. By combining the CNFBD, the Road Network File, and the Global Forest Watch Canada’s anthropogenic change layers, we are identifying, to the best of our ability, MODIS cells that contain recorded disturbance events or potentially managed forests.

Following the Canadian ecozone framework (Ecological Stratification Working Group 1995) and the Brandt definition of the boreal (Brandt 2009), six boreal ecozones were sampled and studied in this analysis (Fig. 1f). Because the Taiga and Boreal Shield ecozones are large and span a wide range of climatic and ecosystem conditions, both were split into east and west compartments (Stinson et al. 2011).

Selection of mature unmanaged forest cells

Indicators of canopy cover, canopy height, and structural complexity were derived for each 1-km MODIS cell by averaging together the plot level (25 × 25 m) Lidar metrics within each cell. Only Lidar plots classified as forest by the EOSD and meeting the structural definition of a forest according to the 2005 Global Forest Resources Assessment [height (95th percentile) > 5 m, canopy cover (percent cover above 2 m) > 10 %] were used to calculate the 1-km cell averages (Food and Agriculture Organization 2006). A “spatial uniqueness” test was performed on the Lidar plots to insure that no area was double counted in the MODIS cell averages where flight lines crossed. Lidar plots with a 95th height percentile above 50 m were assumed to be erroneous and were therefore removed prior to the calculation of the MODIS cell averages. In total, only 591 of 9.4 million forested Lidar plots had a 95th height percentile above 50 m.

MODIS cells containing less than 100 forested Lidar plots were removed from the analysis, in addition to cells where less than 75 % of the Lidar plots were forested. MODIS cells that were less than 75 % forested according to the EOSD (Fig. 1c) were also removed as the GPP estimate could become unrepresentative of the forested portion of the cell with the presence of additional land covers. To remove the effects of disturbance and management on forest structure, cells that contained a fire, anthropogenic change, or a road were also removed (Fig. 1d).

Given that vegetation type is a critical input to the MODIS GPP algorithm, misclassifications in the MODIS Land Cover Type product could result in less reliable GPP estimates (Zhao et al. 2005). Therefore, cells that were less than 75 % forested in any year (2001–2010) according to the MODIS Land Cover Type product were also removed as discrepancies between EOSD and MODIS land cover could signify incorrect vegetation inputs to the GPP calculation.

Averaging the 25- by 25-m plot metrics up to 1 km and applying this set of rules allowed for a direct comparison between Lidar structural metrics and MODIS GPP for mature unmanaged stands. Figure 1e shows the distribution of the 5675 MODIS cells that meet this set of rules (shaded by percent cover above 2 m), while Fig. 1f shows the number of cells that fall within each boreal ecozone. The Boreal Shield East is of particular interest in this study because of its large sample size (1,809) and large latitudinal gradient in GPP (Fig. 1b). The calculation of MODIS cell averages and the stratification of mature unmanaged stands were performed in R (R Development Core Team 2009).

Investigating the relationship between Lidar-derived structure and MODIS GPP

The relationship between Lidar-derived structural metrics and satellite-derived GPP was assessed using Pearson’s correlation coefficient and the modified t test proposed by Clifford et al. (1989) and altered by Dutilleul (1993). In the presence of positive spatial autocorrelation, a standard t test is unfit for assessing the significance of a correlation coefficient as each sample does not constitute a full degree of freedom (Clifford et al. 1989). The modified t test adjusts the degrees of freedom by calculating an “effective sample size” that is inversely proportional to the extent of spatial autocorrelation in each variable (full details can be found in Dutilleul 1993). To assess the extent of spatial autocorrelation, the distances between all pairs of points are divided into k distance strata, and spatial autocorrelation is assessed for both variables in each strata. The specification of k impacts the calculation of the effective sample size as shorter distance intervals (i.e., larger value of k) will result in a higher calculated spatial autocorrelation (Fortin 1999) and a lower effective sample size. When relating wildfire and forest regeneration in Canadian boreal forests, Fortin and Payette (2002) found that the effective sample size increased as k decreased (i.e., larger distance interval), but decreasing k did not affect the acceptance or rejection of the null hypothesis. To assess the effect of k in this analysis, four distance intervals were tested in each ecozone: 5, 10, 20, and 40 km. The modified t test was calculated for each ecozone using the Dutilleul (1993) modification in Pattern Analysis, Spatial Statistics, and Geographic Exiegesis, a freely available spatial analysis software package (Rosenberg and Anderson 2011). In addition to testing the relationship between Lidar-derived structure and MODIS GPP, the relationship between structure and the climate variables (i.e., MAT and TAP) was also assessed. Finally, linear regressions were developed in R to assess the slope of the relationships. We do not communicate the results of the analysis of the Taiga Shield West due to the small sample size in this ecozone (38 MODIS cells remained following stratification).

Results

Canopy cover

Table 1 presents the correlation coefficients (r), slopes, and modified t test results for the relationship between percent cover above 2 m and MODIS GPP, as well as both climate variables, using a distance interval of 10 km for the calculation of effective sample size. The number of strata (k) needed for a distance interval of 10 km varied from 84 in the Hudson Plains and Boreal Cordillera to 257 in the Boreal Shield East. The effective sample sizes were significantly smaller than the original sample sizes in all ecozones. While the Boreal Shield East has the most MODIS cells (1,809), the effective sample sizes in the Boreal Shield East are among the smallest, with values between 11 and 13. The Boreal Shield West, Boreal Plains, and Boreal Cordillera had the largest effective sample sizes, with each ecozone averaging >50. The effective sample size increased as the distance interval increased from 5 to 40 km (results not shown); however, this had no effect on the acceptance or rejection of the null hypothesis (α = 0.05). The level of significance did vary (i.e., from p < 0.05 to 0.01 or p < 0.01 to 0.001) in several cases when the distance interval was changed.

Table 1 The correlation coefficients, slopes, and modified t test results for the relationship between percent cover above 2 m (X) and MODIS GPP, MAT, and TAP (Y)

Figure 2 displays the relationship between percent cover above 2 m and MODIS GPP for each sampled ecozone as a series of boxplots. To investigate the differences between forest types, Fig. 2a displays the relationship in the Boreal Shield East as a scatterplot, with points shaded by the dominant (>50 %) forest type within the cell according to the EOSD land cover classification. A statistically significant (α = 0.05) correlation between Lidar-derived canopy cover and MODIS GPP was found in all but the Hudson Plains, with the strongest link occurring in the Boreal Shield East (r = 0.74, p < 0.01, Fig. 2b). The correlation was weakest in the Boreal Shield West (r = 0.27, p < 0.05, Fig. 2c) and the Boreal Plains (r = 0.44, p < 0.01, Fig. 2d), both of which have a narrow sampled range in GPP. The link was strong between Lidar-derived canopy cover and MODIS GPP in the Boreal Cordillera (r = 0.58, p < 0.001, Fig. 2e) and the Taiga Plains (r = 0.70, p < 0.01, Fig. 2g), but the slope was shallower than in the Boreal Shield East. The sampled range in GPP was larger in the Boreal Shield East than in other ecozones, with a mean GPP value of less than 0.6 kg C m−2 year−1 for the 20–30 % cover group, increasing to over 1.0 kg C m−2 year−1 for the 80–90 % cover group. Markedly more stands had a canopy cover >90 % in the Boreal Shield East than in other ecozones, and these stands had the highest mean GPP of all sampled cover groups (≈1.1 kg C m−2 year−1).

Fig. 2
figure 2

Relationship between percent cover above 2 m and MODIS GPP for a Boreal Shield East (scatterplot), shaded by dominant forest type, b Boreal Shield East (boxplot), c Boreal Shield West, d Boreal Plains, e Boreal Cordillera, f Taiga Shield East, g Taiga Plains, and h Hudson Plains. The number above each bin corresponds to the number of samples within the bin

Figure 2a reveals a distinct separation between coniferous, mixedwood, and broadleaf-dominated stands in the Boreal Shield East. Broadleaf-dominated stands had the highest canopy cover (generally >80 %) and the highest GPP (generally 1.0–1.3 kg C m−2 year−1). Mixedwood stands had high GPP (generally 0.9–1.2 kg C m−2 year−1), however, a wider range in canopy cover as most stands are concentrated between 50 and 95 % cover. Coniferous stands had the largest sampled ranges in both canopy cover and GPP, with the majority of stands having a cover between 20 and 85 % and GPP values between 0.3 and 1.0 kg C m−2 year−1. A positive trend between Lidar-derived canopy cover and MODIS GPP is clearly apparent within coniferous stands, while no trend is apparent within broadleaf or mixedwood stands.

Figure 3 provides insight to the drivers of GPP by displaying the relationship between canopy cover and MAT. The relationship between Lidar-derived canopy cover and MAT was only statistically significant in the Boreal Shield East (r = 0.68, p < 0.05, Fig. 3b), where the sampled range of MAT was highest, and the Boreal Cordillera (r = 0.58, p < 0.01, Fig. 3e). Similarly to GPP, the 90–100 % cover group in the Boreal Shield East had a higher mean MAT than any other sampled cover group across all ecozones. The relationship between Lidar-derived canopy cover and TAP was not significant in any ecozone.

Fig. 3
figure 3

Relationship between percent cover above 2 m and MAT for a Boreal Shield East (scatterplot), shaded by dominant forest type, b Boreal Shield East (boxplot), c Boreal Shield West, d Boreal Plains, e Boreal Cordillera, f Taiga Shield East, g Taiga Plains, and h Hudson Plains. The number above each bin corresponds to the number of samples within the bin

Canopy height

Table 2 presents the correlation coefficients, slopes, and modified t test results for the relationship between the 95th height percentile and MODIS GPP using a distance interval of 10 km, while Fig. 4 displays the relationship as a series of boxplots. The effective sample sizes were relatively similar to Table 1, with the exception of a large increase in the Boreal Cordillera (54.38 to 100.19) and a large decrease in the Boreal Shield West (57.92 to 26.45). Similarly to Lidar-derived cover, changing the distance interval had no effect on the acceptance or rejection of the null hypothesis (α = 0.05). The level of significance did change from p < 0.01 to 0.001 when the distance was increased from 5 to 10 km in the Boreal Cordillera. Correlation coefficients were significant in the Boreal Shield East (r = 0.49, p < 0.05, Fig. 4b) and West (r = 0.47, p < 0.05, Fig. 4c), the Boreal Cordillera (r = 0.33, p < 0.001, Fig. 4e), Taiga Plains (r = 0.59, p < 0.05, Fig. 4g), and Taiga Shield East (r = 0.45, p < 0.05, Fig. 4f). With the exception of the Boreal Shield West and Hudson Plains, the correlation coefficients between Lidar-derived canopy height and MODIS GPP were lower in each ecozone than for Lidar-derived canopy cover. The majority of stands were concentrated into relatively few height bins compared to canopy cover, with nearly 75 % of the stands in the Boreal Shield East (Fig. 4b) between 9 and 15 m. The Taiga Plains (Fig. 4g) contained the tallest stands, while few tall stands were sampled in the Taiga Shield East (Fig. 4f) or Hudson Plains (Fig. 5h). Approximately 4 % of stands in the Boreal Shield East had a 95th height percentile above 18 m. Most of these regionally tall stands in the Boreal Shield East are dominated by broadleaf and mixedwood forest types, with coniferous stands reaching a maximum Lidar-derived height near 18 m (Fig. 4a). Compared to the link between Lidar-derived canopy cover and MODIS GPP, the link between the 95th height percentile and MODIS GPP is not as linear, which is apparent by comparing the Boreal Shield East scatterplots (Fig. 2a vs. Fig. 4a). The most notable trend in Fig. 4a is that the maximum sampled height derived from Lidar increases as GPP increases.

Table 2 The correlation coefficients, slopes, and modified t test results for the relationship between the 95th height percentile (X) and MODIS GPP (Y)
Fig. 4
figure 4

Relationships between 95th height percentile and MODIS GPP for a Boreal Shield East (scatterplot), shaded by dominant forest type, b Boreal Shield East (boxplot), c Boreal Shield West, d Boreal Plains, e Boreal Cordillera, f Taiga Shield East, g Taiga Plains, and h Hudson Plains. The number above each bin corresponds to the number of samples within the bin

Fig. 5
figure 5

a Relationship between the CV of return height and MODIS GPP in the Boreal Shield East, shaded by a dominant forest type, b the 95th height percentile (coniferous-dominated stands only)

Structural complexity

The relationship between the CV of return heights and MODIS GPP is more complex than percent cover above 2 m or the 95th height percentile. Figure 5a displays the relationship between the CV and MODIS GPP in the Boreal Shield East, with points shaded according to dominant forest type. At low levels of MODIS GPP, the range of sampled CV values was narrow and centered near 0.4. As GPP increases, the range of sampled CV values became wider but remained centered near 0.4. Broadleaf-dominated stands generally had the lowest CV, while mixedwood and coniferous stands had a larger range in CV than broadleaf stands.

Figure 5b displays the relationship between the CV of return heights and MODIS GPP for coniferous cells shaded by the 95th height percentile. Short stands tended to have lower CV values than taller stands with similar GPP, and the CV of short stands decreased slightly as GPP increased. Taller stands had a wider range of CV values than short stands, and the maximum sampled CV for tall stands increased as GPP increased.

Discussion

Canopy cover

Strong links between Lidar-derived canopy cover and satellite-derived GPP in the boreal are anticipated for two reasons: (1) more productive sites can support a higher density of trees with more dense canopies, and (2) canopy cover relates to the amount of foliage, which is a key driver of productivity (Schulze et al. 2002). The strength of the relationship between Lidar-derived canopy cover and MODIS GPP across ecozones largely depended on the sampled range of MAT as temperature is the main climatic driver of productivity in Canadian boreal forests (Churkina and Running 1998; Boisvenue and Running 2006). The largest gradient in MAT occurred in the Boreal Shield East, with cold temperatures limiting the productivity and observed stand density in northern coniferous forests compared to southern broadleaf forests. The observed differences in Lidar-derived canopy cover between forest types in the Boreal Shield East are likely caused by this strong latitudinal gradient as forest type transitions along with temperature from broadleaf-dominated stands in the south to coniferous-dominated stands in the north. To investigate the differences in structure across forest types, forest stands under similar site conditions would need to be isolated to remove this latitudinal effect.

While temperature is a main limiting factor of productivity in the Canadian boreal, productivity is fundamentally restricted by the amount of foliage that is absorbing solar radiation (Schulze et al. 2002). The MODIS GPP algorithm accounts for the amount of absorbed solar radiation with the MODIS FPAR product, explaining why MODIS GPP correlates more closely to Lidar-derived canopy cover within most ecozones than temperature data alone. In addition, as the MODIS FPAR product is essentially measuring foliage amounts, FPAR relates directly to canopy cover.

GPP can vary between stands with similar canopy cover values if differences exist in LUE (e.g., differences in temperature or VPD), incident PAR, or the fraction of the canopy that is composed of foliage (Fig. 6a). Productivity can also vary as a function of stand age and successional stage as younger stands are often more productive than older stands (Ryan et al. 1997).

Fig. 6
figure 6

Schematic representations of the observed relationships in the Boreal Shield East between MODIS GPP and a percent cover above 2 m, b 95th height percentile, and c CV of return height

The lack of statistically significant relationships between Lidar-derived canopy cover and TAP is in agreement with past studies showing temperature, not precipitation, to be the primary factor limiting growth in the boreal (Churkina and Running 1998). If precipitation does play a role in determining canopy cover in the boreal, it would be obscured by the strong latitudinal effects in this analysis.

Canopy height

The correlations between the 95th height percentile and MODIS GPP highlight the importance of successional stage and stand age in shaping structure in Canadian boreal forests. Stand-replacing disturbances are the main determinant of age, and therefore height, in the boreal (Kurz and Apps 1999; Bond-Lamberty et al. 2007; Amiro et al. 2009), explaining why Lidar-derived stand height was not linked as strongly to MODIS GPP as Lidar-derived canopy cover in all but the Boreal Shield West and Hudson Plains. Productivity affects stand height by determining the rate of growth between disturbances and restricting growth in the prolonged absence of disturbance in low productivity sites (Boucher et al. 2006). The relationship between the 95th height percentile and MODIS GPP in the Boreal Shield East likely provides insights to the effects of both productivity and age on forest structure (Fig. 6b). We expect that relatively young stands were sampled across a wide range of MODIS GPP, explaining why the minimum sampled stand height remained relatively constant as GPP increased. Alternatively, the maximum sampled 95th height percentile increased along with MODIS GPP as we expect growth to be faster and less restricted by resources at high levels of GPP. Therefore, stands can become taller in the prolonged absence of disturbance on more productive sites. It should be noted that stand height will not continuously increase with time since stand-replacing disturbance as the transition from deciduous to coniferous dominance and non-stand-replacing disturbances can reduce stand height (Paré and Bergeron 1995; Brassard et al. 2008), and stands will not grow indefinitely.

The finding that most coniferous stands reach a maximum 95th height percentile around 18 m in the Boreal Shield East corresponds well to other studies of forest structure in the Boreal Shield (Paré and Bergeron 1995; Brassard et al. 2008). Higher productivity in the southern portion of the ecozone and tall broadleaf species such as trembling aspen allow mixedwoods and broadleaf stands to grow taller than sampled coniferous stands (Paré and Bergeron 1995; Brassard et al. 2008).

Structural complexity

Successional stage and age also play an important role in determining the structural complexity of forests. We expect that the range of sampled CV values became wider in the Boreal Shield East as GPP increased because of several competing factors (Fig. 6c). First, we expect that fewer young stands were sampled in cells with low GPP compared to cells with high GPP as growth rates are likely slower where GPP is low, requiring more time for stands to reach 5 m in height (i.e., the minimum height considered in this analysis). The inclusion of younger stands at higher levels of GPP could explain why the CV of return height in short stands decreases as GPP increases. Canopy gaps, uneven-aged structure and less dense vegetation in mature, low productivity stands will generally result in more complex forest structures than in young, highly productive stands. Alternatively, maximum tree dimensions are less restricted on highly productive sites, and stands can reach an uneven-aged structure faster (Boucher et al. 2006; Larson et al. 2008). Therefore, we expect that mature forest stands will be more structurally complex on high productivity sites than low productivity sites. As a result, the differences between the structural complexities of young and mature stands appear to become greater as GPP increases.

The spherical shape of broadleaf crowns and the greater height of the sampled broadleaf stands in the Boreal Shield East result in generally low CV values for broadleaf-dominated stands. The presence of multi-aged and multi-species canopies in mixedwood stands is the expected cause of higher CV values for many mixedwood stands compared to broadleaf stands. There was a wider range in sampled height for mixedwood stands than broadleaf stands, which we expect represents various stages of succession, resulting in a wider range of sampled CV values for mixedwood stands.

Considerations

Several factors must be considered when analyzing the results of this analysis. We compared a 10-year average of MODIS GPP, which acts as a long-term indicator of forest productivity, to a single snapshot in time of forest structure from airborne Lidar data. As most sampled stands are older than 10 years and have varying disturbance histories, productivity over the most recent 10 years would only reflect part of the observed stand structure. To better quantify the relationship between productivity and forest structure, we must account for disturbance history, successional stage, and stand age. To do so, we attempted to restrict this study to mature unmanaged forests; however, the presence of short stands in highly productive forests suggests that we were unable to remove all young stands from the analysis. While management activity is low in most northern boreal forests (Andrew et al. 2012), natural disturbance is a fundamental component of the ecosystem, yet it is infeasible to monitor in its entirety. As a result, we accounted for time since disturbance using height as an indicator of age within stands of similar GPP.

Additionally, it must be noted that the swath width of the products generated from the Lidar transects (400 m) was narrower than a single MODIS cell (1 km), preventing the structure across entire MODIS cells from being measured. The average MODIS cell in this analysis contained 461 Lidar plots, which accounts for approximately 29 % of the area of a single MODIS cell. Therefore, the forest stands sampled with Lidar may not accurately represent the productivity of an entire cell in all instances. This should not be a major issue as we removed any MODIS cell that was less than 75 % forested. We assume that variations in productivity are minimal within each 1-km cell, which may not always be the case as differences in nutrient and water availability as well as varying species and stages of succession may be occurring within a single cell.

Finally, we must consider the small sample size in low productivity forests of the Boreal Shield East when analyzing these results. At low levels of GPP (i.e., <0.6 kg C m−2 year−1) in the Boreal Shield East, we reported shorter stands and a narrower range of CV values. However, only 5.7 % of the sampled MODIS cells in the ecozone had a GPP value <0.6 kg C m−2 year−1. It is possible that with increased sampling in low productivity forests, we would find taller stands or a wider range in structural complexity.

Conclusions

Lidar is an invaluable source of data for studying forest structure that allows for an improved characterization of the relationship between productivity and structure over large areas. By measuring forest structure with Lidar data along gradients of productivity in the Canadian boreal, we found a strong link between satellite-derived GPP estimates and boreal forest structure. While the relationship was strong between MODIS GPP and percent cover above 2 m, the weaker relationships to the 95th height percentile and the CV of return height emphasize the importance of stand age in determining the structure of boreal forests. Our results suggest that projected increases in productivity at high latitudes could lead to increases in canopy cover, but changes in habitats, resource availability, and carbon storage could also largely depend on changes in disturbance regimes as disturbance largely controls stand age in boreal forests. Incorporating disturbance history in Lidar studies of structure is therefore critical to improve our understanding of current forest structure and how structure will be altered by a changing climate.