Introduction

Accurately estimating forest biomass is essential to understanding how plant growth is relate to the commercial production of wood fiber (Morgan and Moss 1985), energy transfer (David et al. 2006), nutrient flows (Rapp et al. 1999), and the global carbon cycle (Houghton 2007; Bombelli et al. 2009). The characterization of variability and trends in forest biomass at local to national scales is needed for monitoring the sources and sinks in the global carbon cycle (Ganguly et al. 2012). Therefore, the study of aboveground biomass in rapidly changing environments has been a central theme in ecological and hydrological research (Dong et al. 2011; Wang et al. 2014). Shrublands form a major component of vegetation in alpine ecosystems worldwide and are a widely distributed biome type in China (Hu et al. 2006). Shrublands cover 19 % of the Qinghai-Tibet Plateau and approximately 20 % of China (Hou 2001; Hu et al. 2006). Recent research using repeated photography, long-term ecological monitoring, and dendrochronology have documented shrub expansion in some cold regions, which could cause major changes in biodiversity and result in increased shrub cover and biomass in recent decades (Tape et al. 2006;  Myers-Smith 2011). Rapid climatic warming has allowed the rapid increasing of shrublands in China over the past two decades (Fang et al. 2007). Piao et al. (2009) identified shrub increasing as the most uncertain factor contributing to carbon sinks of forest ecosystems in China. A need exists to improve the accuracy of the estimation of alpine shrub biomass because biomass data provide valuable information for explaining the functional role of shrub ecosystems as part of the carbon and hydrological balances in the alpine regions of China (Yashiro et al. 2011; Liu et al. 2014).

Compared with arboreal vegetation, shrubs have often been neglected during biomass estimation research. Shrubs come in a wide variety of forms, both single and multiple stemmed. Thus, accurate estimation of shrub biomass has proved to be relatively difficult owing to the mixture of species typically present in some shrubland ecosystems (Sah et al. 2004). A large amount of research has delved into biomass estimation of individual shrub species in the some temperate and tropical regions using allometric equations (Murray and Jacobson 1982; Hierro et al. 2000; Paton et al. 2002; Návar et al. 2004). However, little research has explored the development of an accurate method to assess shrub biomass in alpine ecosystems (Elzein et al. 2011).

Traditionally, researchers have performed biomass estimation in forest ecosystems using direct destructive or extractive methods, or using indirect methods such as allometric equations (Keller et al. 2001). Undoubtedly, direct methods provide the most accurate biomass estimates; however, they are time-consuming and laborious (Sironen et al. 2003; Case and Hall 2008; Liu and Westman 2009). Allometric equations use mathematical models to estimate vegetative biomass based on various morphological variables such as basal diameter of stems and branches, canopy area, and H (Návar et al. 2004; Elzein et al. 2011). Indirect methods have been developed to estimate some dimensional variables by fitting specific equations for individual plant species or community models (Murray and Jacobson 1982; Návar et al. 2004). Therefore, researchers have developed allometric equations designed for use with specific to forest types (Basuki et al. 2009; Xiang et al. 2011) and shrublands ecosystems (Murray and Jacobson 1982; Uso et al. 1997; Elzein et al. 2011; Hierro et al. 2000; Návar et al. 2004; Foroughbakhch et al. 2005; Sah et al. 2004). Predictive mathematical models have normally been tested using simple linear or multiple regression models (Murray and Jacobson 1982). The relationship between biomass and measured variables has usually been determined using a nonlinear function (Foroughbakhch et al. 2005). Different types of variables and functions have been used for predictive model equations (Návar et al. 2004). Singh et al. (2011) estimated shrub aboveground biomass using a destructive harvesting method in the central Himalayan region of Terai; Luo et al. (2002) estimated biomass and natural vegetation productivity based on forest survey data and field sampling of the Tibetan Plateau. These results indicated that allometric equations provide useful tools that can be used to estimate biomass for shrub ecosystems in cold regions, because they offer a non-destructive and time-efficient method of estimating biomass (Uso et al. 1997; Návar et al. 2004).

The Heihe River originates from the Qilian Mountains, scatters itself across the landscape in the middle reaches oasis region, and disappears into the desert lower reaches. Located between 97.24–102.10°E and 37.41–42.42°N, the Heihe River forms the second largest inland river in Northwestern China and covers about 130,000 km2 (Chen et al. 2007). Well-developed vegetation thrives at elevations from 1,600 to 5,500 m a.s.l. in the mountainous catchment, where the annual average precipitation is about 400 mm. Shrublands cover 1,352 km2, or about 13.7 % of the upper reaches area, in the mountain region of Heihe River Basin (Hou 2001). Developing accurate allometric equations is essential for shrub biomass estimation in this region for two reasons. First, alpine shrubs form the dominant vegetation type in the alpine valleys; this ecosystem has rather extreme conditions making it difficult for arbor species to survive; therefore, shrubs play an important role in water conservation and in the carbon cycle of the mountain environment. Second, the coverage area and average patch area of shrubs have increased by 6.8 and 49 %, respectively, over the past 15 years in a process recently described as “shrubification” in this region (Liu et al. 2011). However, biomass estimation using allometric equations for individual shrub species in the Heihe River Basin has received little attention (Zhang and Han 2008).

This study primarily attempts to establish correlation and regression relationships between aboveground biomass and easily measured parameters (H: height, D: basal diameter, C: crown area, and D 2 H: basal diameter squared × total H) of the alpine shrubs in the upper reaches of Heihe River Basin in Northwestern China. It also attempts to develop general allometric equations designed specifically for the four most common shrub species found here.

Materials and methods

Site description

The study area, located in the Hulu watershed, covers 23.1 km2 in the Qilian Alpine Ecology and Hydrology Research Station within the Qilian Mountains, Qinghai Province, China (38°12′–38°17′N, 99°49′–99°54′E; Fig. 1). The elevation of the Hulu watershed varies from 2,960 to 4,800 m a.s.l; the average annual temperature is about 0.2 °C, with a low mean of −18.4 °C in January and a high reaching 19.0 °C in August. The mean annual precipitation is 495.1 mm, with 85 % falling from June to October, based on 4 years of manual and electronic rain gauge data collected from 2008 to 2012 (Liu et al. 2014).

Fig. 1
figure 1

Locations of the sampling sites in the Hulu Watershed of the Heihe River Basin, Qinghai Province, China

The growing season for vascular plants lasts from early June to mid-September, with vegetation in the Hulu watershed including tree Picea crassifolia and Sabina przewalskii, alpine shrubs and meadows. Shrubs occur mainly on shady, semi-shady, and semi-sunny slopes in the Qilian Mountains. The most dominant shrub species, Salix cupularis, Salix oritrepha, Potentilla fruticosa, Caragana jubata, and Hippophae rhamnoides, occur mainly from 3,000 m to 3,700 m in the Qilian Mountains (Wang et al. 2002). The dominant herbs include Polygonum viviparum, Poa pratensis, Stipa capillata, and Stellera chamaejasme. Mountain drab, chestnut, and subalpine shrub meadow soils cover most of the Hulu watershed (Liu et al. 2014).

Shrub sampling

While considering the characteristics of the distributions of alpine shrubs, four shrubs species, S. cupularis, P. fruticosa, C. jubata, and S. oritrepha, were selected for analysis. Three 10 m × 10 m quadrates were established for each shrub species. Twenty shrubs per quadrate were randomly selected as sample individuals. Therefore, 60 shrubs per species were used to calculate correlation and regression relationships. The shrubs were monitored in mid-September 2012 during the peak season of increasing biomass, when biophysical parameters such as foliage biomass and canopy cover are relatively stable. The elevation, aspect, slope angle, and shrub density of the plots were documented, and the density was obtained by quadrat method (Xiang et al. 2011). The shrubs were harvested using the selective destruction technique (Lodhiyal and Lodhiyal 1997). Prior to harvest, the following data were measured for all sampled shrubs using a digital caliper (XL.10-160, Beijing Zhuochuan Co., Beijing, China): (1) basal diameter (D: diameter at 10 cm above ground, cm); (2) height (H: height from ground level for the tallest point in the canopy, cm); and (3) crown area (C: crown width × crown length, measured at right angles, cm2).

In the laboratory, the fresh weight of all aboveground sections of shrubs was determined to the nearest 0.01 g using a balance (MP21001; Shanghai Hengping Co., Shanghai, China). The samples were then placed in envelopes and dried for 48 h in an oven at 80 °C to a constant weight. Finally, at room temperature, the dry weights of the samples were recorded as the biomass.

Data analysis

First, prior to establishing the allometric equations, scatter plots between the dependent and independent variables were used to determine whether the relationships were linear. Scatter plots were also used to determine the types of relationships existing between shrub biomass and each of the measured and calculated variables. Second, several allometric relationships between the dependent and independent variables were subsequently tested. The independent variables, represented by X, included D, H, D 2 H, and C. Aboveground shrub biomass was the dependent variable, W. The following six models were used in this study: (1) W = a + bX; (2) W = a + bX 1 + cX 2; (3) W = a + b ln X; (4) W = a + bX + cX 2; (5) W = aX b; and (6) ln (W) = ln (a) + bX.

Correlation and regression analyses between total aboveground biomass and field measured parameters (D, H, D 2 H, and C) were conducted using the SPSS 16.0 statistical package. The regression model with the highest R 2 and the lowest root mean square error (RMSE) with a significant fit (P < 0.05) was selected as the best model (Davison and Hinkley 1997). However, the results of R 2 statistical analysis could not be directly compared among equations with untransformed and log-transformed independent variables (Draper and Smith 1998). To compare such equations, a Fit Index (FI) that had a value of unity for an optimum fit similar to R 2 was calculated (Schlaegel 1981). The FI was based on the residuals in the measured units (Brand and Smith 1985; Crow and Schlaegel 1988) based on Eq. (1):

$${\text{FI}} = 1 - \frac{{\sum {(Y_{i} - \hat{Y}_{i} )^{2} } }}{{\sum {(Y_{i} - \bar{Y})^{2} } }},$$
(1)

where \(Y_{i}\) is the ith observed value, \(\hat{Y}_{i}\) is the ith predicted value for \(Y_{i}\), and \(\bar{Y}\) is the mean observed value of \(Y_{i}\). If the FI calculated for a logarithmic model was closer to the R 2 for a linear model, the former was preferred, given that the logarithmic transformation increased the statistical validity of the analysis by homogenizing the variance over the range of sample data. The logarithmic model was suitable for data in which variance increased with an increase in the magnitude of the observation (Sah et al. 2004).

When more than one model presented a similarly good fit with the data, the regression equation with the fewest parameters was selected as the best model. Although logarithmic models are often used to predict shrub biomass, logarithmic transformations could sometimes result in biased biomass estimates. Therefore, when a logarithmic model was selected as the best model, the model was transformed into the corresponding nonlinear model.

Results

Correlation relationships between aboveground biomass and variables

To determine the shrub biomass and develop the allometric equations, the independent variables had to be determined. The variables of four shrub species in this study exhibited no homogeneous distribution between different quadrats (Fig. 2).

Fig. 2
figure 2

Dependent and independent variables of the shrubs (Mean ± SD). D basal diameter (at 10 cm above the ground), H height from ground level to the tallest point in the canopy, C crown area (crown width × crown length, measured at a right angle), W aboveground biomass

Table 1 illustrates the correlation between W and D, H, D 2 H, and C for four shrubs. The correlation coefficient values indicated that a positive correlation existed among all dependent variables and independent variables. The correlation coefficient R values varied from 0.560 for H in S. cupularis to 0.977 for D 2 H in S. oritrepha among species-specific correlations. The measured parameters allowed an effective estimation of shrub biomass. However, the correlations of D, D 2 H, and C with W were significantly higher than that of H with W. Moreover, the indicators of multicollinearity [tolerance, variance inflation factor (VIF)] were statistically examined: the three variables D, D 2 H, and C were correlated. These correlations suggested that a combination of these three variables could provide the best regression model for biomass estimation. Compared with the other variables, the correlation between D 2 H and W was higher for C. jubata, S. cupularis, and S. oritrepha. As shown in the following model, the best parameters for biomass estimation using an allometric equation were determined based on the correlation results (Table 2).

Table 1 Correlation coefficient between dependent and independent variables
Table 2 Allometric equations for aboveground biomass for the four shrubs species

The scatter plot diagrams between each dependent and independent variable could allow advance determination of the expected sign of the coefficient of a particular variable in the regression model (Fig. 3). Despite showing the evident correlations between dependent and independent variables, an effective model could not be developed by employing all measured variables simultaneously because of the multicollinearity among measured independent variables. The tolerances were 0.001, 0.018, and 0.021 and the VIFs were 1.23, 1.54, and 3.15 for D, D 2 H, and C, respectively. Therefore, these three variables could not be included in an equation at the same time.

Fig. 3
figure 3

The scatter plot diagrams between each dependent and independent variable. ad represent data for P. fruticosa, C. jubata, S. cupularis, and S. oritrepha, respectively. D basal diameter (at 10 cm above the ground), H height from ground level to the tallest point in the canopy, C crown area (crown width × crown length, measured at a right angle)

Models that include all measured variables could result in an unexpected negative sign on the coefficient, subsequently giving rise to high values of RMSE. Therefore, to avoid this issue, stepwise regression was used to establish the allometric equations of shrub biomass. This caused these models to improve significantly; they had the fewest parameters with the highest R 2 and FIs values as well as the lowest RMSE values.

Developing allometric equations

The biomass equations based on the variables measured in the field proved an excellent fit for the estimated biomass (Table 2). All of the species-specific regression equations relating biomass to the measured variables were statistically significant (P < 0.01) with equations proposed for each species able to satisfy the requirement of the equations. No self-correlation or heteroscedasticity was observed in the distribution of residues. Notably, all equations had an R 2 higher than 0.95 and an FI higher than 0.96. R 2 values varied from 0.95 for C. jubata and S. cupularis to 0.97 for S. oritrepha among species-specific best-fit models (Table 2). However, the forms of the best-fit model differed among shrub species (Table 2). The power equations were the ideal models for C. jubata and S. cupularis, having the highest R 2 and FI and the lowest RMSE. Exponential equations were chosen as the ideal models for S. oritrepha. Linear models showed the best fit for P. fruticosa (Table 2).

Species-specific allometric equations used different variables and combinations of variables to estimate biomass. When the R 2, FI, and RMSE of the regression equations were compared, D 2 H was found to be the best parameter for predicting C. jubata and S. cupularis biomass. In contrast, equations with C as the independent variable were the best for estimating S. oritrepha biomass. Moreover, H and C were the best parameters for estimating P. fruticosa biomass (Table 2).

Discussion

Direct harvesting techniques for the estimation of shrub biomass are labor intensive and time-consuming. Researchers commonly use allometric equations as a nondestructive alternative, in which biomass is estimated based on the easily measured parameters of shrubs (Návar et al. 2004; Elzein et al. 2011). Mathematical forecasting models for the biomass productivity and carbon cycling in forest ecosystem research could provide an estimation tool and data in support of large-scale models. The relationships between aboveground biomass and measured variables were determined for the four most dominant alpine shrubs in the upper reaches of Heihe River Basin (Wang et al. 2002). The measured variables showed a typical linear correlation with dependent variables, and allometric equations were obtained to estimate the aboveground biomass of the four alpine shrubs. Generally, aboveground biomass is best determined through the analysis of D, H, C, and other variables; these types of variables are typically used as independent variables in shrub biomass regression equations (Murray and Jacobson 1982). However, some researchers continue to establish biomass equations for individual shrub species by relating biomass variables to total shrub characteristics. Stem diameter (D) alone or in combination with H is the best predictor of biomass for some shrub species (Haase and Haase 1995; Paton et al. 2002; Liu et al. 2006; He et al. 2011). In this study, D 2 H provided the most accurate prediction of aboveground biomass in two shrub species, C. jubata and S. cupularis, which was consistent with previous research in other alpine regions (Elzein et al. 2011). The canopy morphology of shrubs serves as an important factor influencing prediction accuracy of model and variables selection. The canopies of C. jubata and S. cupularis form elliptical cones, and height and basal diameter can replace the distribution of branches and leaves during biomass estimation. Zhang and Han (2008) estimated the aboveground biomass of shrubs in the Eastern Qilian Mountains and developed a model with a similar type of equations. However, their number of sampled individual shrubs and the distribution of the canopy size were undocumented.

Previous studies have used crown area to estimate shrub biomass (Peek 1970; Sah et al. 2004), particularly for low-growing shrubs with a multiple-stemmed growth form, which was not amenable to the measurement of individual diameters (Elzein et al. 2011). Harrington (1979) found that the weakest relationships and highest estimation errors were obtained when D was used as the independent variable in small multi-stemmed shrubs. Elzein et al. (2011) used crown area and mean height to develop equations for shrub biomass estimation in the western Alps. In this study, C could be used to accurately predict biomass for the multi-stemmed shrub species S. oritrepha (Table 2). Some studies have not commonly included canopy height into biomass regression equations (Peek 1970; Harrington 1979; Murray and Jacobson 1982; Zhang and Han 2008). However, the addition of height in the models developed for the shrubs was reasonable although the correlation between the height and biomass was lower than that of the other three variables (Tables 1 and 2). Halpern et al. (1996) reported that models considering crown area did not explain variations associated with extension growth, although variation associated with lateral branching was explained. Therefore, the addition of H increased the precision in estimating biomass for P. fruticosa in this study.

In contrast to previous studies using allometric equations to estimate alpine shrub biomass, the present study applied the stepwise regression and used R 2, FI, and RMSE to evaluate the accuracy of the models. In this study, the power, linear, and exponential models provided the most appropriate equations for estimating and describing the relationships between measured variables and aboveground biomass. The best-fit models proposed in the results were consistent with those reported in previous studies of shrubs in the Gannan Grasslands (Zhang and Han 2008). That study estimated biomass using linear regression models to determine the relationships between shrub biomass and crown width. Power and linear equations have commonly been used to model shrub biomasses in subtropical regions (Cai et al. 2006; Zeng et al. 2010), such as the Jinggang and Balangshan Mountains of China (Lin et al. 2010; Liu et al. 2006, respectively), western Andalusia of Spain (Oyonarte and Cerrillo 2003), and northeastern Mexico (Návar et al. 2004). Li et al. (2006) assessed the accumulated stem biomass of P. fruticosa using a linear regression equation. Shrub biomasses in the western Alps were also estimated using logarithmic equations (Elzein et al. 2011). The best-fit equation for S. oritrepha in the current study was a logarithmic equation. However, the equation was transformed to a nonlinear form.

Plant functional types (PFTs) were different from species-specific methods in biomass estimation models (Ustin and Gamon 2010). PFTs were derived from traits based on species morphology, physiology, and/or life history, depending on the aims and scale of the research. Some traits could be measured in the field, and others required more detailed laboratory measurement and experimentation (Duckworth et al. 2000). PFTs bridged the gap between plant physiology and both community and ecosystem processes, thus providing a powerful tool in carbon and biomass research (Díaz and Cabido 1997). PFTs were more important than specific species in modeling the estimating biomass at the ecosystem scale (Manzoni et al. 2011). However, for each PFT, a number of key parameters needed to be defined, such as fecundity, competitiveness, and resorption. The value of each parameter is determined or inferred from observable characteristics such as plant height and leaf area (Lavorel et al. 2007). PFTs will be most useful if the same grouping of species describes a response to, and effects on, multiple environmental factors and ecosystem processes (Chapin et al. 1996). In alpine regions, the microclimate induced by topographical change is very large at the watershed scale. Thus, the parameters of PFTs varied with the changes in elevation and slope (Spadavecchia et al. 2008). The biomass of PFTs varied along a climatic gradient because of the heterogeneous nature of the environment (Liu et al. 2014). In this study, the specific species method was selected to estimate the biomass of shrubs rather than using PFTs because this method is more suitable for accurately modeling shrub biomass in this heterogeneous mountainous area. Species-specific equations could provide the essential data and method needed for estimating biomass of PFTs in a large-scale biomass model. Leaf area index (LAI) is an important variable in many ecological and environmental applications and is usually used in biomass estimation models at the ecosystem scale (Luo et al. 2002; Ganguly et al. 2012). An allometric relationship has been observed between leaf area and stem diameter, crown dimensions, and biomass (Spadavecchia et al. 2008). Specific leaf area (SLA) has been used as an indicator of the relative growth rate (RGR), which represented the productivity and biomass of an ecosystem (Díaz and Cabido 1997). In that study, RGR represented the productivity and biomass of ecosystem while height of the crown base affected SLA. Generally, the LAI increased from the tree top downward to the crown base and decreased significantly with increasing tree size (Seidel et al. 2011). Ganguly et al. (2012) presented an approach based on remote sensing for mapping live forest aboveground biomass, using a model that calculated LAI derived from remote sensing and canopy maximum height. Bartelink (1997) found a strong linear relationship between leaf area and crown area. Therefore, some parameters used in allometric equations for estimating biomass could allow the calculation of a LAI. The species-specific and PFT estimation methods could be combined with environmental variables in future research. The combination will allow researchers to overcome the biases in biomass estimation caused by inconsistencies in field inventories and offers a high-resolution and up-scale approach for forest carbon monitoring and productivity estimation in larger scale ecosystem models.

Conclusions

Researchers have recognized the usefulness of allometric equations for decades, but few equations have been developed for alpine shrubs. The equations presented in this work could be useful in quantifying aboveground biomass in the upper reaches area of the Heihe River Basin. Empirical models related to measured parameters were proposed for alpine shrubs. These models could be applied in biomass estimation based on the strong correlations between each variable and aboveground biomass. The proposed models may also be applicable to other alpine shrubland ecosystems by re-determining the coefficient for the statistical relationships between biomass and the measured variables. Ideally, all the proposed equations should be tested with larger data sets encompassing ecological contexts that were not considered in the present study, including gradients in elevation and slope. Such equations will provide useful data for studies estimating aboveground biomass and allow researchers to test the contribution of shrubs to overall productivity of an alpine ecosystem. The equations provided here could also significantly improve the accuracy of biomass estimation and allow researchers to develop methods designed to increase carbon sequestration in alpine ecosystems. However, care should be taken in applying the allometric models developed in this study to other sites when the structure and size of shrubs is unknown. For example, Zhang and Han (2008) found a shrub height of more than 2.1 m resulted in a loss of model accuracy. The models should also undergo further tests to determine their accuracy for estimating shrub biomass in other alpine regions.