Introduction

Climate change is one of the most pressing global environmental issues of the twenty-first century (Liu et al. 2018). Deforestation and forest degradation are two major anthropogenic drivers of increased greenhouse gas (GHG) emissions (Kaisa et al. 2017; Bayen et al. 2020). The current global greenhouse gas emissions are estimated to be 10 Pg (1 Pg = 1012 kg) of carbon (C) per year (Amundson and Biardeau 2018). Climate change, in turn, affects many developing countries that rely on climate-sensitive sectors like rainfall-dependent and traditional agricultural practices (Matewos 2019). For instance, it increased species extinction (Trisos et al. 2020), food shortages (Ayinu et al. 2022), disease outbreaks (Orke and Li 2022), and reduced crop yields (Ginbo 2022).

Agroforestry systems are among the land uses that have been suggested as a global solution to increase the effectiveness of land use while minimizing negative environmental effects and economic risks for farmers (Mey and Gore 2021). The system is suggested to synergize climate change adaptation and mitigation (Dhyani et al. 2020), and recognized as a GHG mitigation strategy under the Kyoto Protocol (Semere et al. 2022). Agroforestry systems cover over one billion ha, supporting about 560 million people in developing countries (Shi et al. 2018). The systems sequester 1.1–2.2 Pg C annually over a 50-year period globally (Dixon 1995). Besides, the conversion of 630 million hectares of unproductive cropland and grassland to agroforestry could add about 586,000 Mg C per year by 2040 (Verchot et al. 2007).

Smallholder farmers in Ethiopia have also developed site-specific agroforestry practices that have high potential for climate change mitigation. Homegardens, parklands, and woodlots are among the most common agroforestry practices in the country for a long time (Tsedeke et al. 2021). In the central part of Ethiopia, where this study was conducted, the local people have been engaged in various agroforestry practices. The majority of the smallholder farmers in the study region integrate trees with crops for shade, soil improvement, and fuelwood. They also commonly maintain a variety of perennial plant species in and around their homegardens and woodlots for a range of products and services.

Due to a lack of appropriate biomass estimation models, the Ethiopian National Forest Monitoring and Assessment Program did not take into account trees in agricultural landscapes when assessing carbon stocks in the past. Also various researchers adopted pantropical generalized biomass equations developed for natural forests (e.g., Chave et al. 2014) to calculate the carbon stock of trees in agricultural landscapes (e.g., Gebremeskel et al. 2021; Chemeda et al. 2022). Biomass models developed for natural forests may not be suitable for agroforestry (Tumwebaze et al. 2013), because tree allometry varies with tree architecture and silvicultural management (Zhang et al. 2020). Moreover, the application of existing pantropical biomass equations derived from a large dataset may not assure optimal precision of biomass estimation at small scales due to site- and species-specific characteristics (Ngomanda et al. 2014; Djomo et al. 2016; Paul et al. 2016). In that case, species-specific allometric equations may be more effective in determining the biomass and carbon stocks of the species than regional and pantropical equations (Mahmood et al. 2020a, b).

To date, only a few allometric models for predicting biomass in Ethiopian agroforestry have been developed for Coffea arabica and Ensete vetricosum (Negash et al. 2013a, b) and trees scattered on rangeland (Feyisa et al. 2018). A recent review of biomass models by Sebrala et al. (2022) found out that the allometric biomass equation for native tree and shrub species has received very little attention in Ethiopia. This has made carbon accounting very difficult for several perennial plant species in agroforestry systems nationally (Rosenstock et al. 2019).

So, it was hypothesized that the derived species-specific and mixed species AGB allometric model of native perennial plant species growing in agricultural landscape can give an accurate estimation compared to the frequently used pantropical and regional allometric models. Thus, the present study aimed at (1) developing species-specific and mixed species allometric biomass models for estimating AGB of native perennial plant species in agricultural landscapes as well as (2) evaluating the performance of some previously developed pantropical and regional models for estimating AGB in agricultural landscapes.

Materials and methods

Study site description

The study was conducted on the smallholder agricultural landscape in Toke Kutaye, Dire Enchini, and Liban Jawi districts of West Shewa zone, which is located in Central Ethiopia (Fig. 1). West Shewa is geographically located between 8° 17′ and 9° 56′ N and 37° 17′ and 38° 45′ E. The zone has three agroecological zones namely; highland (Dega) ranging from an elevation of 2000 m to 3500 masl, midland (Woina dega) from 1600 to 2000 masl; and lowland (Qola) below 1600 masl. The mean annual rainfall and monthly temperature were 1569 mm and 22 °C, respectively. A range of soil types are found, but the dominant soil types are vertisols and loams. Rain-fed agriculture with a mixed farming system consisting of annual crop production and livestock rearing is the major livelihood source for inhabitants in the study area. The most commonly grown crops in the area include barley (Hordeum vulgare), teff (Eragrostis teff), wheat (Triticum aestivum), maize (Zea mays), and enset (Ensete ventricosum). Livestock production involves mixtures of indigenous and crossbred cattle raised under semi-zero grazing systems. The smallholder farmers also use a variety of agroforestry systems, such as home gardens, parklands, and woodlots. Homegardens are multi-story techniques that mimic the structure and species diversity of a forest, including a variety of trees, annual crops, enset, highland bamboo, and other species (Eyasu et al. 2020). Another sort of agroforestry practice is parkland, which make up a significant portion of the land use in the eastern and northern parts of Ethiopia (Endale et al. 2016; Woldu et al. 2020). Parklands are made of carefully selected trees and shrubs from the forest that have been cleared for cropping, and their regeneration is is assisted in the agricultural landscape (Tadesse et al. 2019). Woodlots are another typical agroforestry practices in which tree species are planted on productive or degraded areas in order to provide fuel wood and construction materials (Manaye et al. 2021).

Fig. 1
figure 1

Map of the study area

Sampling and biomass determination

Data were collected through two phase sampling. For the first phase, forest inventories were performed using a total of 243 sample plots. Accordingly, 81 (20 × 20 m) quadrats were laid in homegardens and 81 (10 × 10 m) quadrats in woodlots. In contrast, 81 (100 × 50 m) quadrats were used in parklands. Visual estimation was used to divide farms into grid points, and then random selection was done to place the quadrat. On the sampled plots, trees were measured for their diameter at breast height (DBH), total height (ht), the average crown's diameter (cd), and the species type identified. A caliper or diameter tape was used to measure tree diameters, while tree height was measured using a hypsometer. Additionally, the maximum and minimum crown diameters were measured with a measuring tape in order to determine the average crown diameter (cd).

For the second phase, six perennial plant species, namely: Acacia abyssinica Hochst., Croton macrostachyus Del., Faidherbia albida Del., Albizia schimperiana Oliv., Erythrina brucei Schweinf., and Oldeania alpina (K. Schum), were chosen for destructive sampling based on forest inventory result that consider their importance value index (IVI) (Pothong et al. 2022) (Supplementary material 1). To maintain representativeness of the sample (Moussa and Mahamane 2018) and minimize errors caused by the predominance of one diameter class over another (Sileshi 2014), individual trees were stratified into seven DBH classes and proportionally chosen over these diameter classes (Supplementary material 2). The inclusion of an unequal number of small-diameter trees compared to large-diameter trees in the sampling increases heteroscedacity (Sileshi 2014). However, Oldeania alpina culms were categorized into three age classes: age class 1, one- and two-year-old culms; age class 2, three- and four-year-old culms; and age class 3, five- and six-year-old culms (Abebe et al. 2023) using the culm’s morphological features (Embaye et al. 2005). Besides, during destructive sampling, efforts were made in the selection to represent altitudinal variation (Asrat et al. 2020a). A total of 95 perennial plant individuals (13 for each species) were randomly chosen from DBH classes to minimize the cost and negative ecosystem impacts of harvesting large number of trees (Duncanson et al. 2015). However, for Oldeania alpina, 30 individuals were chosen from age classes since 30–50 sample culms are recommended to develop site-specific allometric models (Huy and Long 2019). Before felling the perennial plant, ecological information such as slope, altitude, and coordinates were all recorded. Moreover, dendrometric variables including the diameter at breast height (DBH), the diameter of the stump's height (dsh), the total height (ht), and the average crown's diameter (cd) (Tesfaye et al. 2016), and the age of Oldeania alpina, were recorded for each perennial plant species (Table 1).

Table 1 Statistical summary for biometric variables and aboveground biomass of the sampled perennial plant species in agricultural landscapes of central Ethiopia

Once the measurements were done, the selected trees were felled (at 0.3 m above the ground level) and processed accordingly. Following the removal of branches and leaves, the felled trees were divided into distinct biomass components, such as stems, branches (large branches with mid-diameter ≥ 10 cm, medium branches size with mid-diameter < 10 cm and > 5 cm and small branches with mid-diameter ≤ 5 cm and ≥ 2 cm) and leaves. Stem refers to the main shoot from the ground to the top of apical meristems up to 2 cm in diameter for highland bamboo. The cut-off diameter of the stems of other trees was set at 2.5 cm because this is the minimum size used for commercial purposes in many areas where wood fuel is scarce (Giday et al. 2013). The remaining part was considered as branch biomass. The leaves’ part contained all twigs, branches < 2 cm, flowers, fruits, seeds, and leaves. Each component was weighed separately in the field for its fresh weight using a hanging balance. For each tree, three disks were taken from the stem and branches (large, medium, and small). For determining the leaves' dry weight, 250-g aliquots of each tree's leaves were obtained (Daba and Soromessa 2019). The sub-samples were placed in an airtight plastic bag and brought to a laboratory, where the green volumes were determined using the water displacement method (Liu et al. 2023) after the bark was removed. Disks that were too large for the volumetric devices were immersed in a larger container, and the green volume was calculated based on the volume of water displaced from the container (Tetemke et al. 2019). The wood's bulk density (wbd) for each disk was determined as the ratio of oven-dry mass to green volume. Then, the stem and branch subsamples were dried at 105 °C, while the leaves and twigs subsamples were dried in an oven at 70 °C for 48 h. After drying, the three subsamples were averaged within each component.

The biomass of each tree section (stem, branch, and leaf) was determined by multiplying the total fresh weight of the section by the respective ratio of oven dry weight to fresh weight. The stump, (i.e., the portion of the stem up to 30 cm above the ground) was also considered component of the merchantable stem section for all trees. We computed the stump volume (V) using the formula V = πd2L, where π is 3.14, d is the stump mid-diameter in m, and L is the length of each stump in m (Magdaline et al. 2020). Later, the biomass of the stump was calculated using volume and wood basic density. The total dry weights of the stem, branch, and tied bundle of twigs and leaves were as added up to determine the tree's total aboveground biomass.

Data analysis

All the data analysis and graphical representation in this study were performed using the R programming language (R Core Team 2020). Multicollinearity among independent variables was tested by applying the variance inflation factor (VIF) using the “car” package, (Zhao et al. 2019; Rahman et al. 2021), and no multicollinearity was found among the independent variables included (VIF < 2.56) (Table 3). Pearson correlation tests were applied to determine the degree of relationship between AGB and dendrometric parameters (Moussa and Mahamane 2018). We therefore, tested weighted nonlinear least squares regression by using the ‘nls’ function in the R software because it performed better than log-linear models (Huynh et al. 2021). Weighted non-linear was used to account for heteroscedasticity in residuals (Huy et al. 2019a). Huynh et al. (2022) showed that weighted non-linear models also had a lower Furnival’s Index (FI) than log-linear models. Moreover, the variance is assumed to be proportional to the square of the mean of the biomass in this study, as recommended by Aabeyir et al. (2020). When the Q-ratio (i.e., the ratio between the parameter estimates of DBH and parameter estimates of ht, in a separate variable model) is equal to 2.0 (between 1.5 and 2.5), it is common to include (DBH)2 × ht as a single input in the regression to address collinearity problems (Dutcă et al. 2019). Due to the low collinearity between DBH and ht in this study and the fact that similar biomass models previously developed in Ethiopia produced Q-ratios outside the range of 1.5–2.5 (Tetemke et al. 2019; Asrat et al. 2020b), a separate predictor approach was adopted for modeling.

Accordingly, eight biomass models (Eqs. 18) using non-linear regression equations were tested for multi-species models while the four models without the wbd were fitted/tested for species-specific cases. DBH is the most frequently used variable for predicting biomass. Other variables, such as ht, wbd, and average crown diameter (cd), have also often been combined with DBH in allometric models in previous studies (Asrat et al. 2020b; Dao et al. 2021). Because adding ht as a predictor variable to DBH accounts for variation in AGB among trees with the same diameter value (Mensah et al. 2016) and reduces relative error (Mukuralinda et al. 2021). Wood density also affects how much biomass is allocated to wood and foliage because trees with low wood densities devote more resources to photosynthetic processes that promote the development of foliage (King et al. 2006). Moreover, including average crown diameter (cd) also improved predictions compared to using a single variable alone (Dimobe et al. 2019; Flores-Hernández et al. 2020; Romero et al. 2022). The following general nonlinear model forms for prediction of biomass were fitted:

$${\mathrm{Model }1:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}$$
(1)
$${\mathrm{Model }2:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{ht})}^{\mathrm{c}}$$
(2)
$${\mathrm{Model }3:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{cd})}^{\mathrm{c}}$$
(3)
$${\mathrm{Model }4:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{wbd})}^{\mathrm{c}}$$
(4)
$${\mathrm{Model }5:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{ht})}^{\mathrm{c}}{\times (\mathrm{wbd})}^{\mathrm{d}}$$
(5)
$${\mathrm{Model }6:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{ht})}^{\mathrm{c}}{\times (\mathrm{cd})}^{\mathrm{d}}$$
(6)
$${\mathrm{Model }7:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{wbd})}^{\mathrm{c}}{\times (\mathrm{cd})}^{\mathrm{d}}$$
(7)
$${\mathrm{Model }8:\mathrm{Y}=\mathrm{a}\times (\mathrm{DBH})}^{\mathrm{b}}{\times (\mathrm{ht})}^{\mathrm{c}}{\times (\mathrm{wbd})}^{\mathrm{d}}{\times (\mathrm{cd})}^{\mathrm{e}}$$
(8)

where Y represents the biomass of the a tree, a, b, c, d and e are the estimated parameters of the fitted models, DBH is the diameter at breast height (cm), ht is the tree height (m), cd is average crown diameter (m), and wbd is the wood density (gcm−3) of a given tree.

Model evaluation, selection, and comparison

Four fit statistics were combined to choose the best candidate models: AIC (Akaike Information Criterion); pseudo-R2; RMSE (Root Mean Square Error); and MAE (mean absolute error) (Ubuy et al. 2018b; Asrat et al. 2020b). The best models have a high pseudo-R2 and the lowest AIC, RMSE, and MAE calculated using the caret function in the R software.

For the cross validation of the equations, the root mean square error (RMSE) and mean prediction error (MPE) were computed using the leave one out cross variation (LOOCV) approach. This approach leaves one observation for validation and the remaining n − 1 observations for training the model. The excluded observation is predicted, and the error is calculated. The procedure is repeated n time until every observation has been left out and predicted. The RMSE and MPE are computed using the n errors. These statistical parameters were calculated as follows:

$$\mathrm{RMSE}=\sqrt{\mathrm{MSE}=} \sqrt{\frac{\sum_{\mathrm{i}=1}^{\mathrm{n}}{\left({\mathrm{y}}_{\mathrm{i}}-{\widehat{\mathrm{y}}}_{\mathrm{i}}\right)}^{2}}{\mathrm{n}}},\mathrm{ RMSE\%}= \frac{\mathrm{RMSE}}{\overline{\mathrm{Y}} } \times 100$$
(9)
$$\mathrm{MPE}=\frac{\sum_{\mathrm{i}=1}^{\mathrm{n}}\left({\mathrm{y}}_{\mathrm{i}}-{\widehat{\mathrm{y}}}_{\mathrm{i}}\right)}{\mathrm{n}},\mathrm{ MPE\%}= \frac{\mathrm{MPE}}{\overline{\mathrm{Y}} } \times 100$$
(10)
$$\mathrm{AIC}=\mathrm{nlog}\left(\frac{{\sum_{\mathrm{i}=1}^{\mathrm{n}}\left({\widehat{\mathrm{y}}}_{\mathrm{i}}-{\mathrm{y}}_{\mathrm{i}}\right)}^{2}}{\mathrm{n}}\right)+2\mathrm{p}$$
(11)
$$\mathrm{Pseudo}-{\mathrm{R}}^{2}=1-\frac{\mathrm{SSR}}{\mathrm{CSST}}=1-\frac{\mathrm{SSR}}{ \sum_{\mathrm{i}=1}^{\mathrm{n}}{(\mathrm{yi}-\overline{\mathrm{Y} })}^{2}}$$
(12)

where SSR is the sum of squared residuals, CSST is the corrected sum of squares (∑yi-Ῡ)2, p is the number of parameters in the model, yi is the observed AGB, \({\widehat{y}}_{i}\) is predicted AGB, n is the total number of observations, and \(\overline{Y }\) is mean of observed AGB. With the use of our dataset, we compared several previously developed species-specific and generic models for aboveground biomass in agroforestry and natural forests (Table 2). First, some species-specific models from Ethiopia (Giday et al. 2013; Mulatu and Fetene 2013; Abebe et al. 2023) and Niger (Moussa and Mahamane 2018) were tested. Then three relevant generic models from the natural forests of Ethiopia (Ubuy et al. 2018b; Tetemke et al. 2019; Asrat et al. 2020b) and other sub-Saharan African countries (Kuyah et al. 2012), as well as the pan-tropical model developed by Chave et al. (2014) were tested. The model developed by Chave et al. (2014) was chosen since it is extensively used in tropical regions. Three generic models for natural forests that were developed using multiple variables as in the current study were employed because there was a lack of a general and species-specific model for trees in agricultural landscapes, and there is high possibilities that such models will be used. These models were developed by Tetemke et al. (2019), Ubuy et al. (2018b), and Asrat et al. (2020b) using samples of trees from Ethiopia. The models developed by Giday et al. (2013) for Acacia abyssinica and by Abebe et al. (2023) and Mulatu and Fetene (2013) for Oldeania alpina were the most useful ones we found in the literature utilizing samples of trees from Ethiopia. The generic models developed by Kuyah et al. (2012) and species-specific ones developed by Moussa and Mahamane (2018) for Faidherbia albida were also the most pertinent ones we discovered in the literature because they were developed for tree species that were grown on agricultural landscapes. To look for significant differences between observed and predicted biomass levels, the paired t-test was employed.

Table 2 Previously published species-specific and mixed species biomass allometric model tested using the data set from this study

Results

Species-specific and mixed-species allometric equations to estimate AGB

The aboveground biomass (AGB) model using only DBH as the predictor variable (Model 1) performed best for two of the species (Faidherbia albida and Oldeania alpina) and was optional for other species (Table 3). The fitting statistics of this model (Model 1) were typically good, with pseudo-R2 varying between 85 and 99% across all species and mixed species. Moreover, the model developed for Faidherbia albida had the highest pseudo-R2 (0.99), the lowest RMSE (59.3), and the lowest AIC (138.77) with a significant (p < 0.01) regression model. This model somewhat underpredicted the aboveground biomass for Faidherbia albida and 1–2-year-old highland bamboo (Oldeania alpina) by 0.57% and 0.3%, respectively. However, it overpredicted the aboveground biomass for 3–4, and 5–6-year-old highland bamboo (Oldeania alpina) by 0.15% and 0.05%, respectively.

Table 3 List of fitted aboveground biomass (AGB) allometric models for studied perennial plant species and mixed-species in agricultural landscapes of central Ethiopia

For Erythrina brucei and Albizia schimperiana, a multiple-variable model that included DBH and ht (model 2) was shown to have a lower AIC than the models using DBH as the single variable (model 1), but they had a comparable pseudo-R2. Adding ht to DBH (Model 2) as a compound variable was also the best-performing model that significantly improved the statistical fits for AGB of Croton macrostachyus when compared to the model based on DBH as a single predictor.

An allometric model including DBH and cd (Model 3) as predictors fitted well with our data for the prediction of AGB with a high pseudo-R2 (0.99) and low AIC (135.94) and RMSE (40.8) for Acacia abyssinica. The model that takes into account DBH and ht resulted in an improved pseudo-R2, but had a higher AIC and RMSE compared with model 3 (Table 3). A multiple-variable model (Model 8) that includes DBH, ht, wbd, and cd had the highest pseudo-R2 value (0.95) and the lowest AIC (780.03) for mixed species that developed by using the full dataset (excluding Oldeania alpina). Hence, Model 8 was selected as the best-fit allometric biomass model for the prediction of the AGB for mixed species.

Performance of the present models compared to previously published models

The best-fit model of Oldeania alpina showed the lowest value for the percentage of residual mean square error (RMSE kg) and model prediction error (MPE%) compared to the species-specific allometric biomass models (Mulatu and Fetene 2013; Abebe et al. 2023). The paired t-test indicated that our best-fit AGB model and observed aboveground biomass were statistically identical for this species (Table 4). The graphical presentation of the 1:1 line also supported that our best-fit AGB model for Oldeania alpina was similar to the observed AGB (Fig. 2). The biomass model of Mulatu and Fetene (2013) significantly overpredicted the AGB for 1–2, 3–4, and 5–6 years old highland bamboo, while the biomass model of Abebe et al. (2021) significantly underpredicted the AGB for 1–2, 3–4 and 5–6 years old highland bamboo (Table 4).

Table 4 Comparison of species-specific and mixed species model with previously published allometric models
Fig. 2
figure 2

The relationship between the observed and the predicted AGB for six studied species and mixed-species model. The dashed line represents the 1:1 line or reference line, whereas the black solid line represents the fitted line

The residual mean square error (RMSE%) and model prediction error percentage (MPE%) values of these best-fit models showed high precision of predictability for Faidherbia albida and Acacia abyssinica over species-specific models developed by Moussa and Mahamane (2018) and Giday et al. (2013), respectively, and generic models (Kuyah et al. 2012; Chave et al. 2014; Ubuy et al. 2018b; Tetemke et al. 2019; Asrat et al. 2020b). Nevertheless, among the generic models, the equation developed by Asrat et al. (2020b) for AGB based on 63 trees with dbh ≥ 5 cm in the Dry Afromontane forests of south-central Ethiopia ranked 2nd lowest in RMSE (%) and MPE (%) for these two species. The paired t-test differences between the observed and predicted AGB were not statistically significant for those two species. The scatter plot diagram also depicted the points close to the line of best fit which confirms that the models provided a good fit to the data for both Faidherbia albida and Acacia abyssinica (Fig. 2). However, the generic models of Chave et al. (2014), Ubuy et al. (2018b), Asrat et al. (2020b), and Tetemke et al. (2019) significantly overpredicted the AGB, but the biomass of the two species was significantly underestimated by Kuyah et al. (2012). Species-specific models developed by Moussa and Mahamane (2018) for Faidherbia albida significantly overpredicted the AGB, while Giday et al. (2013) underpredicted biomass for Acacia abyssinica.

The best-ranked models of Erythrina brucei (Model 2) and Albizia schimperiana (Model 4) showed the lowest RMSE (%) and MPE (%) compared to the generic models (Kuyah et al. 2012; Chave et al. 2014; Ubuy et al. 2018b; Tetemke et al. 2019; Asrat et al. 2020b). The observed biomass of Erythrina brucei and Albizia schimperiana was not significantly different from the biomass predicted by our allometric models (p > 0.05). However, it was significantly different from the biomass predicted from the allometric models developed by Kuyah et al. (2012), Chave et al. (2014), Asrat et al. (2020b), and Tetemke et al. (2019). The allometric models developed by Ubuy et al. (2018b) and Tetemke et al. (2019) overpredicted the AGB for those two species. In contrast, when the allometric models developed by Kuyah et al. (2012), Chave et al. (2014), and Asrat et al. (2020b) were applied, the biomass was underpredicted as indicated by the higher MPE value (Table 4).

Our best-fit model (Model 5), overpredicted the AGB of Croton macrostachyus by 0.08%, a small value when compared with other equations that have been generated to predict aboveground biomass in Ethiopia and tropical forests in general (Table 4). The results became more conspicuous in the graphical presentation of the goodness of fit 1:1 line, indicating the capability of our best-fit model 5 to predict the AGB accurately and precisely (Fig. 2). Application of these previously developed models to our dataset underpredicted or overpredicted according to the models generated for each region. For example, equations developed by Kuyah et al. (2012); Asrat et al. (2020b); and Ubuy et al. (2018b) significantly underpredicted biomass. In contrast, when the allometric models developed by Tetemke et al. (2019); and Chave et al. (2014) were applied, the biomass was significantly overpredicted by 40.95% and 12.40%, respectively (Table 4).

The residual mean square error (RMSE%) and the model prediction error (MPE%) of the best-fit model (model 8) for mixed-species were lower than those of generic (Asrat et al. 2020b; Kuyah et al. 2012; Tetemke et al. 2019) models. Nevertheless, among the generic models, Asrat et al. (2020b) ranked the 2nd lowest RMSE (%) and MPE (%) (Table 4). The 1:1 line shown in Fig. 2 indicates that the closer the values are to the 1–1 line, the lower the bias of the prediction. The biomass predicted by Chave et al. (2014) was comparable with observed AGB. However, the biomass model of Ubuy et al. (2018b) and Tetemke et al. (2019) significantly overpredicted the AGB, while biomass model of Kuyah et al. (2012) and Asrat et al. (2020b) significantly underpredicted the biomass (Table 4).

Discussion

Species-specific and mixed-species allometric models to predict AGB

Power allometric models were used to predict the AGB of six native perennial plant species grown on agricultural landscape in central Ethiopia. These type of models are preferred over polynomial and logarithmic equations because they are commonly used, simple, and practical for predicting the biomass of many woody species (Ou and Boussim 2020). Our species-specific biomass models outperformed over mixed-species models. This is consistent with the findings of Nyamukuru et al. (2023). Similarly, multiple-variable models performed better than single-variable models in both species-specific and multi-species models. Our results are in agreement with those of Nyamukuru et al. (2023) for trees and shrubs biomass estimation in the African savanna ecosystems who found that multiple-variables models performed better than single variable models.

For Oldeania alpina, the most accurate allometric biomass model used DBH as the only explanatory variable. This is in line with finding of Inoue et al. (2019) that DBH to be the most accurate predictor to explain the biomass variability for square bamboo in western Japan. A study carried out in the Bobiri forest reserve in Ghana also showed that DBH alone explains variations in the biomass distributions among biomass components, age classes, and total biomass of bamboo (Amoah et al. 2019). Similar to that, a study in southwestern Ethiopia on Oldeania alpina (Yebeyen et al. 2022; Abebe et al. 2023) supported the notion that allometric models based solely on DBH would increase the value of the predicted biomass. In this study, adding tree height to equations for highland bamboo did not improve the model’s performance. This showed that biomass equations using the DBH variable are easy and take less time to measure. Dense highland bamboo woodlots in the study area can also cover bamboo plants tree tops, making it challenging to measure heights in the field. Similar to this, Nfornkah et al. (2021) also revealed that the dense culms of bamboo made it hard to measure height nondestructively, making the use of the height variable model for bamboo problematic from the start. Roxburgh et al. (2015) and Mensah et al. (2017) also found that the inclusion of tree height did not result any significant improvements over the simple power model, proving that species affects how much extra predictors would enhance biomass models.

The allometric models developed with DBH as an explanatory variable were also the best performing models that provide a best prediction of AGB for Faidherbia albida. For this species, AGB estimates using DBH alone led to lower AIC and RMSE and increased accuracy. This agrees with authors (Beedy et al. 2016; Moussa and Mahamane 2018), who revealed DBH is the accurate predictor of AGB of Faidherbia albida in Malawi and Niger, respectively. The findings suggest that DBH change substantially accounts for the variability of tree biomass in agricultural landscape. The facts that tree diameter is the simplest variable to measure in the field and had a significant correlation with AGB. For ease of model application and validation, it is advised to use fewer explanatory variables (Sileshi 2014). Additionally, we found that compared to using DBH alone, adding tree height data did not significantly improve biomass prediction. It is also related to Faidherbia albida trees' tendency to branching, which leads to more radial development than apical growth. The insignificant improvement of height on AGB prediction could also be due to the process of pollarding and prunning branches in parkland agroforestry at intervals of three to four years to increase light availability for understory crops. Moreover, this study found that the inclusion of crown diameter did not improve the model fit for Faidherbia albida. The size of the tree affects the strength of the association between crown diameter and AGB; it is weaker in trees with small crowns (Dimobe et al. 2018). Faidherbia albida has a less variable geometry of the canopy, possibly due to the inherent plasticity of its canopy to interplant disturbances from pollarding and prunning branches in parkland agroforestry.The findings of this study show that models with multiple variables (DBH and ht) had better predictive ability for AGB in Albizia schimperiana, Croton macrostachyus, and Erythrina brucei than the DBH alone model. Including tree height as a predictor variable in AGB models is usually recommended as it can improve model fit, robustness, increased efficiency, and lower model prediction error for fruit tree species in Bangladesh than the commonly used pantropical and regional models (Saha et al. 2021). Many researchers believe that including height in models will lessen model site specificity (Dutcă et al. 2018; Dutcă 2019).

Recently, crown dimensions are essential for improving tree biomass estimates and simplifying AGB estimation (Tetemke et al. 2019; Loubota Panzou et al. 2021; Jucker et al. 2022). In this study, combing cd with DBH improved model fit by 3% and reduced the relative error by 5.4% for AGB of Acacia abyssinica. This could be because the branching patterns of the studied tree species shared approximately 40% of the total AGB. This result is in line with recent study on tree allometry in the dry afromontane forest of Ethiopia where crown width improved model's prediction (Asrat et al. 2020b). Similar findings were made by Dao et al. (2021) for Mangifera indica in parklands of the Sudanian zone of Burkina Faso.

The findings indicate that, for mixed species, the predictor variables may account for 85–95% of the variance in the dependent variable. For mixed species, DBH alone as a single predictor explained over 85% of the variation in AGB and had a relative error of 19.52%. However, adding tree height as an additional predictor reduced the relative error for AGB by 3.01% and improved model fit by 5% for mixed species. Similar observations have been reported on species equations (Mukuralinda et al. 2021; Mensah et al. 2017) and regional mixed-species equations (Teshome et al. 2022) that tree height data improves biomass prediction when compared to equations that use DBH as the only predictor variable.

However, the inclusion of crown diameter did not improve the fit of the model for mixed species. This suggests that the correlation between crown diameter and AGB is size-dependent, with the association being weaker in smaller-crowned trees. The characteristics of pooled tree data may be used to determine tree allometry. For instance, Faidherbia albida has a less variable geometry of the canopy, probably as a result of disturbances from pollarding and prunning branches at intervals of three to four years, which lower the AGB prediction of crown diameter in mixed species. A similar trend was observed in Burkina Faso (Mukuralinda et al. 2021) where the inclusion of crown diameter did not improve the model fits for Terminalia laxiflora.Some studies indicate that using wood bulk density as an independent variable in allometric models can significantly increase the accuracy of the biomass model in tropical forests (Huy et al. 2019b), whereas other studies found that doing so had no significant effect on the model's performance (Zhao et al. 2019; Mahmood et al. 2020a). The complexity of field data gathering and reliance on additional laboratory analysis prevent wood bulk density from being utilized frequently. Some researchers rely on datasets like the Global Wood Density Database (Khan et al. 2020). Also, Ethiopia's Forest Reference Level report for the REDD + implementation used the wood density database rather than directly determining this statistic (Ubuy et al. 2018a). This signifies the importance of inclusion of basic wood density to be included in biomass models as an explanatory variable, which also helps lower model uncertainty (Mukuralinda et al. 2021).

Our results show that the addition of wbd to DBH alone (model 4) reduced the relative error for AGB by 6.28% and improved model fit by 7% for mixed species. Other studies recommend wood density as an additional predictor variable that can be used to improve biomass prediction(Mukuralinda et al. 2021; Zhu et al. 2021; Ganamé et al. 2021).Moreover, the model 5 with three predictors (DBH, ht, and wbd) reduced the relative error for AGB by 7.85% and improved model fit by 10% for mixed species. Similar observation has been reported in species-specific model (Daba and Soromessa 2019; Abich et al. 2021) and mixed-species equations (Ganamé et al. 2021) where inclusion of DBH, ht, and wbd data improves biomass prediction when compared to equations that use DBH as the only predictor variable.

Multiple variables models combining DBH, ht, wbd, and cd as predictors reduced the relative error for AGB by 8.4%, improved model fit by 10%, and perform the best fitness among all of the models for mixed species. Our results agree with reports on regional mixed-species equations (Asrat et al. 2020b; Tetemke et al. 2019) that DBH, ht, wbd, and cd data improve biomass prediction when compared to other models for mixed species. This may be due to the fact that pooling biomass data to develop mixed species models increases the sample size and results in stable regression parameters and variance estimates (Xiang et al. 2016). Moreover, multiple-predictor models were more effective than single-predictor models (Smith et al. 2021). Our finding is most likely due to the different morphologies of the studied species, meaning that the inclusion of DBH, ht, wbd, and cd accounted for variation in the AGB induced by difference in tree architectures.

Performance of the present models compared to previously published models

Generally, the models developed for this study performed better and had a smaller residual mean square error (RMSE%) and the model prediction error (MPE%) since there were more sampled trees in the study. Applying allometric equations with small sample numbers will lead to a bias of up to 70% (Duncanson et al. 2015). In the current study, we employed 95 destructively sampled perennial plant individuals to develop biomass models. Taking into account Ethiopia's restrictions on destructive sampling and the costs associated with biomass assessment in general, the number of sample trees used in this study was greater than previously reported general and species-specific models. Furthermore, the inclusion of a large tree in the sample for this study may improve the precision of the biomass prediction. In contrast to the current models, which were built for trees on agricultural landscape, many of the earlier models were developed for natural forests, with the exception of Moussa and Mahamane (2018) and Kuyah et al. (2012). Because tree biomass allometric models vary by site (Ducey 2012). The allometry of trees is a result of the interaction between endogenous growth processes and exogenous constraints exerted by the environment. The interactions between individuals change tree architectures, including canopy area, branching, and stem form, which determine biomass allocation (Yang et al. 2019). Moreover, characteristics of species and their silvicultural management may be an explanation for the lower prediction of regional and pan-tropical generic allometric models when they are applied to the existing data.

Conclusion

This study has developed species-specific and multi-species biomass allometric models to provide accurate estimates of AGB for native perennial plant species in the agricultural landscape of central Ethiopia. The models with only DBH as a predictor were the best single predictors of AGB for Oldeania alpina and Faidherbia albida, and options for other species. Multiple variable models combining DBH-ht exhibited the highest predictive capacity for AGB in Erythrina brucei, Albizia schimperiana, and Croton macrostachyus, whereas the combination of DBH-cd and DBH-ht-wbd-cd best predicted the AGB of Acacia abyssinica and mixed species, respectively. Species-specific and mixed-species models showed the best predictive capacity for AGB compared to other frequently used regional and pan-tropical models. The findings of the study suggest that mixed-species AGB models will be used when species-specific allometric models are not available at a given site. The allometric models can be used as the basis for reporting and verifying biomass and carbon stocks in the agricultural landscape for carbon crediting schemes such as REDD + and CDM. However, caution must be taken when applying the models to other datasets since their applicability is limited to the range of DBH, land use type, and biophysical conditions of the study area.