Introduction

Hyrcanian forests are the only commercial forests where temperate broad-leaved tree species exist today in Iran. Covering an area of approximately 1.85 million ha, the Hyrcanian forests account for approximately 15% of Iranian forests and 1.1% of the country’s total area. These forests range from sea level up to an elevation of 2800 m and comprise various forest types with no less than 80 woody species (trees and shrubs) (Ahmadi et al. 2020). These forests thus are of significant concern to forest planners and policy makers charged with maintaining the resilience of this important forest ecosystem to support and benefit human livelihoods in the region. Whereas economic planning without consideration for sustaining renewable natural resources can lead to their degradation that can ultimately contribute to the extinction of nations and civilizations (Hatfield and Prueger 2015), decision-makers are no longer solely concerned about economics and do increasingly take a broader, more holistic management perspective (Vanclay 1994).

In forestry, decision-making is the essence of a management plan that includes prescribed activities, a schedule, and control measures aimed at achieving the stated management objectives in a forest area (Bayat et al. 2013). Because multiple forest management objectives often conflict at the landscape scale, a spatial approach to planning uses a modeling technique that adapts spatially explicit stand management objectives to minimize the conflicts at the landscape scale (Baskent and Keles 2005). In addition to high-quality inventory and survey data that describe the current forest condition, both planners and managers thus require a description of the desired future conditions and acceptable alternative future states. To obtain the most reliable information on the likely future state of a forest, planners rely on forecasts from growth and product models that are widely employed in forest management (Bettinger et al. 2005; Weiskittel et al. 2011). The biometrical and mathematical representation of biological processes leads to a growth and yield model. The conversion of growth models into computer programs for forecasting and scenario computation leads to the creation of growth simulators. A model is therefore always the precondition for the development of a simulator, but in contrast, the development of a model does not always lead to a simulator (Pretzsch et al. 2002; Álvarez-González et al. 2009). The development of an even-aged forest stand can be predicted using a stand-level model (Gadow and Hui 1999), while a more detailed approach is often needed for mixed-species or uneven-aged stands (Weiskittel et al. 2011).

Growth and yield models predict the dynamics of forests, including future forest growth and products that enable the study of alternative management options/scenarios and have thus become a key element in sustainable forest management planning (Burkhart 1990; Vanclay 1994; Bayat et al. 2013). For example, sub-models for individual tree diameter/height growth, survival, and ingrowth enabled the predictions of stand density and diameter distributions needed to guide the management of uneven-aged forests in Finland (Pukkala et al. 2009; Pukkala and Kellomäki 2012). Further, harvest intensities were based on height, survival, and growth of the regeneration in uneven mixed softwood stands in Finland to ensure sufficient survival and growth of small spruces (Picea spp.), intolerant birch (Betula spp.) and pine (Pinus spp.) to maintain the species composition of the current selection stands (Eerikainen et al. 2014). Finally, mixed-effects regression models were recently developed to project the distance-independent individual tree growth and yield for uneven-aged forests in northern Iran managed under selection systems (Kalbi et al. 2019). In addition to the mentioned studies, other researches such as Ahmadi et al. (2016, 2020) have developed additional tree-level statistical models for the Hyrcanian forests.

The use of models for better forest management is one of the main factors that justify growth modeling. The developed models predict the growth rate and the factors affecting it, which allows for a better information of the ecosystem. In addition, the future state of the forest can be predicted with these models (Pretzsch et al. 2015; Pretzsch 2009). Accordingly, a correct understanding of the structure of the forest is an essential fact for the optimal management of forest resources. For example, the beech species (Fagus orientalis Lipsky) examined in this analysis covers about one-third of the area and proportionally the standing volume in mixed and relatively pure Hyrcanian forests. Most beech forests are now managed with a new approach to forestry. Accordingly, quantifying forest growth and determining influential factors is one of the most important goals of forest management.

To more fully understand the dynamics of forests, it is often not sufficient to solely model the overall growth of forest stands, but commonly more detailed individual-based modeling approaches are needed that effectively discriminate among several growth components (Vanclay 1994). This is especially true to support uneven-aged forest management in mixed natural stands, and the following set of individual tree growth models is essential: diameter/height growth, tree mortality or survival, and estimates of ingrowth (Trasobares et al. 2004; Bayat et al. 2013). The models can be developed using a variety of statistical techniques, which are further explained below.

The mixed-effects model is a parametric method for addressing hierarchical data. Examples of grouped data are individuals within spatially explicit subgroups, such as sample trees within forest sample plots; observations of a split-plot or a block in a controlled experiment; and repeated measurements of any subjects, such as plants or permanent sample plots. It is quite common that the groups of the data represent only a sample from a population of groups, while general interest is often in making the inference about the entire population of groups. In those situations, the group effects are naturally treated as random variables, which leads to the mixed-effects model, also called mixed or random-effects regression. These mixed models have been extensively used and have now become the standard for growth and yield modeling (Weiskittel et al. 2011). Additional benefits of random group effects over the fixed effects are that the number of observations per group can be small; even one observation per group may be enough for satisfactory estimation accuracy, provided some groups have several observations. As groups get larger, the estimated group effects from the mixed-effects model approach those of the fixed-effects model. Mixed-effects models also enable modeling of variability between groups using fixed predictors that are defined at the group level. Furthermore, prediction is also possible for groups that were not present in the modeling data (Mehtätalo and Lappi 2020). For growth modeling, use of species as hierarchical level in a mixed model has been shown to result in more robust predictions (Kuehne et al. 2020).

In addition to mixed-effects regression techniques, artificial neural network (ANN) is a form of artificial intelligence (AI) and is used for a wide range of complex multivariate problems in various fields, including communication memory, optimization, prediction, diagnosis, control, and forestry (Nandy et al. 2017; Da Rocha et al. 2018; Vieira et al. 2018; Zhu et al. 2018; Zhou et al. 2019). ANNs form a subset of AI whose structure and function generally mimic the human brain and consist of a number of simple structural components with complex connections, analogous to neurons in the brain (Strobl and Forte 2007;). ANNs have recently gained increased traction in forestry with applications in the mapping of forest structure (Ingram et al. 2005), predicting site index (Aertsen et al. 2010), estimating forest growth and yield (Ashraf et al. 2013), modeling tree diameter and height (Reis et al. 2016; Vieira et al. 2018), determining volume and biomass of individual trees (Pereira et al. 2019), predicting the probability of tree survival (Bayat et al. 2019a), and estimation of tree heights (Bayat et al. 2020).

Few studies have investigated how well traditional parametric mixed-effects compare to AI/ANN-based growth and yield models. A recent study comparing the estimation of volume and biomass of individual trees of different species in the Brazilian central savanna derived using both approaches clearly demonstrated the great potential of AI techniques compared to mixed models and underscored the utility of ANN-based models to handle manifold information to accurately predict tree-level attributes (Pereira et al. 2019). However, nonparametric methods like ANN may have important limitations for application in a regional forest growth and yield modeling context, namely inability to extrapolate beyond the fitting data, potential inconsistency with biological expectations, and more difficult transferability to other potential users. To our knowledge, no comprehensive study has compared the tree-level growth and yield results from parametric mixed-effects regression models to those from ANNs to date, particularly in uneven-aged and mixed forests like those in the Hyrcanian forests of Iran.

ANNs used in this study consisted of two architectures that are commonly used for solving difficult engineering problems (Walling et al. 2001; Zhu et al. 2007; Bayat et al. 2020): the feedforward multilayer perceptron (MLP) and the radial base function (RBF) networks. Both methods give robust estimates (Reis et al. 2018; Van Dao et al. 2020) and are able to approximate a variety of nonlinear functions (Benali et al. 2019). The number of layers and neurons in each hidden layer is of great importance in designing the network. The number of neurons in the input and output layers is equal to the number of input and output variables under investigation. The challenging task is to determine the number of hidden layers and the number of neurons in each hidden layer. The ANNs in this study were trained with back-propagation algorithms that used 70% of input–output data records for model training and the remainder for model evaluation (Benali et al. 2019).

To manage mixed uneven-aged forests with the selection system, reliable predictive models of diameter/height growth, survival/mortality rates of trees, and ingrowth of regeneration are needed to guide harvest intensities and residual basal areas to ensure a sustainable development of trees through the diameter distribution. In this study, we have two primary objectives, which are: (1) parameterize various individual tree growth submodels using mixed models and ANN and (2) determine which approach provided the best model accuracy based on a comprehensive model assessment. The primary goal was an accurate and appropriate method for predicting diameter/height growth, survival /mortality rates of trees, and ingrowth of regeneration for the most important and economically valuable species in the forests of northern Iran.

Materials and methods

Study area

This study was carried out in the Jojadeh district, which is one of five sections covered by forestry development projects in the Farim forest. The Jojadeh section is located on the southern part of the Sari, the capital city of the Mazandaran province on the slopes of the northern Alborz Mountains in the Dodangeh district (Fig. 1). The section covers an area of 3550.2 ha with minimum and maximum altitudes of 782 and 1750 m, respectively. The climate of this forest is classified as humid with an annual rainfall of 832.9 mm and an average annual temperature of 11.2 °C. This forest is composed of an uneven-aged and mixed forest, and in most parts has at least three stories. The primary species of this section are Oriental beech (Fagus orientalis Lipsky), chestnut-leaved oak (Quercus castaneifolia Coss. ex J.Gay), Persian maple (Acer velutinum Boiss.), common hornbeam (Carpinus betulus L.) and Caucasian alder (Alnus subcordata C.A.Mey.) (Hamidi et al. 2019). These primary species often occur together in a variety of mixtures and forest types throughout the region.

Fig. 1
figure 1

Overview of the study area in northern Iran and approximate location of plots used in this analysis. Variations in color indicate different elevations (m, AMSL; see legend). (Color figure online)

Study method

The data used for modeling were obtained from two enumeration periods with fixed sample plots. In 2003, 313 circular permanent sample plots (0.1 ha) were systematically laid out using an inventory grid (200 m × 150 m) in the Jojadeh district. After locating the plot centers in the field, the slope of each plot was determined, and the radius of the sample plot was slope corrected. In each plot, the center was permanently monumented and each tree within the plot received a number that was painted at breast height (1.3 m). The DGPS device (RTK model with an accuracy of approximately 1–5 cm (RMS)) was used to record the position of all sample plots (Bettinger et al. 2019; Hamidi et al. 2020). The DBH of all live trees with a diameter > 12.5 cm was measured using a caliper and recorded by species. The basal area (BA) and basal area larger (BAL) are two important competitive factors that were used in this study, which are obtained from the following formulas, respectively.

$${\text{BA}} = \frac{\pi }{4}{\text{DBH}}^{2}$$
(1)

where BA is basal area in m2, DBH is diameter, and π = 3.14.

$${\text{BAL}} = \frac{\pi }{4}\mathop \sum \limits_{j = 1}^{n} \left( {{\text{TF}}_{j} .{\text{DBH}}_{j}^{2} } \right)$$
(2)

where DBHj > DBHi (i.e., all trees larger than subject tree i), DBH is measured in meters and TFj is a tree factor, i.e., the number of trees represented by jth tree in a hectare derived from area (Pokharel and Dech 2012).

In order to record physiographic factors, in each sample plot, first the slope was recorded using the Sunto slope gauge and the aspect determined with a compass, while the altitude and its geographical position were assessed with the GPS device. In order for the trees that were measured at the beginning of the period to be measured again at the end of the period and to determine the number of trees that have exceeded the count limit, the trees in the sample plots in the forest must be marked in some way. One method of marking the center of the sample plot and trees that have exceeded the count limit is to mark the location of the diameter measurement against the breast in the first period, which was done in the research area. Each tree was marked with a specific symbol and recorded in the relevant forms. At the end of the period, the sample plots were recovered and the measurements were calculated as at the beginning of the period. Measurements were re-taken after 10 years in 2013, and the mortality of any previously tallied tree and ingrowth of trees beyond the diameter threshold were noted. Finally, the amount of forest growth models was calculated in region forest.

Data

A total of 4832 trees of which 2878 were Oriental beech, 838 common hornbeam, 187 oak, 417 Caucasian alder, 80 Persian maple, and 432 Other trees were predominantly Acer cappadocicum, Tilia spp., Ulmus minor, Ulmus glabra, Fraxinus excelsior, and Zelkova carpinifolia. All of these observations across the five primary species were available for modeling diameter growth. Height was modeled using 626 pairs of diameter and height, while a total of 5852 and 1340 trees were used for survival and regeneration modeling, respectively.

Due to the uneven structure of most stands in this forest, the thickest and closest tree to the center of the plot were selected for measurement, which was accomplished using a Sunto clinometer and calculated from the following formula (Eq. 3):

$$H = d \, \left[ {\tan \,\alpha {-}\left( {\tan \,\beta } \right)} \right]$$
(3)

where H: the height of the tree (m), d: the horizontal distance to the tree (m), tan α and tan β: The slope of the tip and the bend of the tree, respectively, are relative to the horizontal surface (%).

Modeling approaches

For modeling the 10-year diameter increment (DBH2013-DBH2003) in this study, the following parameters were included in the model: (1) tree size (DBH); (2) competition [total basal area (BA) and basal area of trees larger than the subject tree (BAL) as well as its transformation, ln(BAL)]. BAL has been suggested as a proxy for the ability of trees to compete for light (Schwinning and Weiner 1998). Other studies have posited that tree competitive position, determined using BAL or modifications of BAL, may be the strongest predictors of diameter growth in both even-aged (Adame et al. 2008; Uzoh and Oliver 2008) and uneven-aged stands (Pukkala et al. 2009); and (3) physiographic factors including slope, aspect, altitude and transformation of these variables with natural log-transformed diameter growth as the dependent variable (Vanclay 1994), linear mixed-effect regression was used to model the diameter increment following Lhotkaa and Loewenstein (2011) that incorporated species as the random variable (Kuehne et al. 2020).

Due to errors in tree height measurements associated with the first inventory, only a static total height model associated with the second inventory could be developed. The static height models were developed based on the relationship between tree height and DBH from 626 trees (i.e., the largest diameter and nearest tree to the center in each sample plot). Twenty alternative nonlinear DBH to height model types were fitted and evaluated in this study from which 11 models were two-parameter and nine were three-parameter models.

The mortality model was based on the number of trees that survived or died between 2003 and 2013 and modeled by species using logistic regression. The explanatory variables evaluated were tree size (DBH and different derivatives) and competition factors (BA, BAL, N); only significant variables (p < 0.05) were retained in the final model (Eq. 4):

$$P_{i} = \frac{1}{{1 + e^{{ - \left[ {b\left( 0 \right) + b\left( 1 \right) \times x\left( 1 \right) + b\left( 2 \right) \times x\left( 2 \right) + \cdots + b\left( n \right) \times x\left( n \right)} \right]}} }}$$
(4)

where Pi represents the possibility of tree mortality for threshold i, b0bn are parameters to be estimated and x(1)x(n) are descriptive variables.

The regeneration/ingrowth model used the number of trees that exceeded the diameter threshold of 12.5 cm DBH as the response variable and that was regressed against stand total basal area. Based on the data availability and desired use in future projections, some additional explanatory variables for predicting the number of ingrowth trees per hectare were selected: (1) total basal area (m2 ha–1); (2) number of trees per ha; (3) average DBH of each plot; (4) physiographic factors; and (5) transformations of basal area (m2 ha–1). These were all used in a Poisson generalized linear model (GLM) (Eq. 5).

$$\ln \left( {{\text{IN}}} \right) = \beta_{0} + \beta_{1} x_{1} + \beta_{2} x_{2} + \cdots + \beta_{n} x_{n}$$
(5)

The systematic linear predictor is a multiplication of a parameter vector β and an explanatory variable design matrix X (Li et al. 2011). All computations and modeling were done employing the R software using the lme and nlme packages (Pinheiro et al. 2020) similar to Bayat et al. (2020).

Artificial neural network (ANN)

Prior to training, all input and output data were normalized (Nagy et al. 2002). The ANN structure was iteratively altered by changing the number of hidden nodes. The structure that produced the least amount of error (difference between modeled and observed values) was selected for further testing at a later stage. All calculations were analyzed in STATISTICA software.

Model evaluation

Common criteria for assessing the goodness of fit of model predictions are the coefficient of determination (R2), the root-mean-squared error (RMSE), the relative RMSE, BIAS, and relative BIAS (Lumbres et al. 2016; Bayat et al. 2019b). An important advantage of the relative RMSE (in %) is that it enables comparisons among predictions produced by different models (Pulido-Calvo et al. 2007; Janizadeh et al. 2019). So to the above evaluation metrics, the mortality was evaluated with the Nagelkerke R-square criteria, Chi-square (Hann et al. 2003), Hosmer–Lemeshow test (Padilla et al. 2010) and ROC subsurface levels (Lei et al. 2004).

$${\text{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {{\text{est}}_{i} - {\text{obs}}_{i} } \right)^{2} }}{n}}$$
(6)
$${\text{Relative}}\,{\text{RMSE}}\, = 100\,*\,\left( {{\text{RMSE/mean}}\,{\text{observation}}\,{\text{value}}} \right)$$
(7)
$${\text{BIAS}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {{\text{est}}_{i} - {\text{obs}}_{i} } \right)}}{n}$$
(8)
$${\text{Relative}}\,{\text{BIAS}} = 100*\left( {{\text{BIAS/mean}}\,{\text{observation}}\,{\text{value}}} \right)$$
(9)

where esti and obsi are the ith estimate and observation, respectively, and n is the number of observations. For evaluation, we performed cross-validation by randomly selecting 70% of the data as the training dataset and 30% as the evaluation dataset.

Results

The data covered a wide range of conditions typical for the region with the descriptive statistics given in Table 1.

Table1 Descriptive statistics of data used in the growth model

Individual tree diameter growth model

The diameter growth of most trees ranged between 0 and 8 cm in the 10-year period. A linear mixed-effect model was employed for modeling of the diameter growth, and the best performing model is presented in Eq. 10:

$$\begin{aligned} {\text{Id}} & = - 1.64 \, + u_{j} + 0.03\left( {{\text{DBH}}} \right) - 0.00015\left( {{\text{DBH}}^{2} } \right) + 0.0032\left( {{\text{BA}}} \right) - 0.010\left( {{\text{BAL}}} \right) - 0.00016\left( {{\text{DBH}}*{\text{BAL}}} \right) \\ & \quad + \,0.00014\left( {{\text{Slope}}} \right) - 0.03\,\sin \left( {{\text{Aspect}}} \right) + 0.14\,\cos \left( {{\text{Aspect}}} \right) + e_{ij} \\ \end{aligned}$$
(10)

where Id is the diameter growth in a 10-year period; uj ~ N(0,σu2) represents the random effect species; DBH is the diameter at breast height (cm); DBH squared (cm), BA is the stand basal area (m2 ha−1); BAL is the basal area of larger trees in the plot when compared to the subject tree (m2 ha−1). The variance and standard deviation for the random factor of species were 1.79 and 2.83, respectively. Results also showed that the correlation between growth rate and BAL was negative, i.e., the diameter growth decreased with an increase in this index. The RMSE, bias, and R2 were 1.11, 0.12, and 0.20, respectively.

Results of five MLP- and RBF-based ANNs showed that the RBF-based models gave better results and the superior MLP-based model resulted in an RMSE of 0.06 cm, indicating that the predicted average diameter growth is close to the measured diameter growth (Table 2). In Table 2, only the models with high accuracy are given, while all ANNs fitted are given in the Supplemental Materials (Table S1 and S2). Also, Fig. 2 shows the relationship between the actual and predicted diameter growth for all species by the parametric (mixed-effect regression) and nonparametric methods (ANN).

Table 2 Characteristics of the superior MLP-based ANNs for individual tree diameter growth and associated goodness-of-fit criteria for model performance
Fig. 2
figure 2

Graphical presentation of measured and predicted tree diameter growth by the parametric [mixed-effect regression (red line)] and nonparametric methods [ANN (black line)]. (Color figure online)

Individual tree height growth model

The following individual tree data were used to fit tree height models using tree diameter as the main predictor (Table 3). The best obtained height model is shown in Eq. 11:

$$H_{ij} = \left( {1.3 + 65.52/0.029d_{ij}^{ - 0.55} + 0.024 \, \left( {{\text{BA}}} \right) + 0.027 \, \left( {{\text{BAL}}} \right)} \right) + e_{ij}$$
(11)

where H is the tree height and dij is the diameter at the breast height of the tree, BA is the stand basal area (m2 ha−1); BAL is the basal area of the larger trees in the plots (m2 ha−1) and e ~ N (0,σu2) represents the residuals. In addition, modeling was performed for all species separately, but a better model was achieved when all species were combined and species was treated as a random factor. Residual standard error, RMSE, RMSE %, bias, bias %, and R2 were 4.10 m, 4.91 m, 18.45, 0.15 m, 0.27%, and 73%, respectively (Fig. 3).

Table 3 Descriptive statistics of trees’ DBH (cm) and height (m) by species in the study area
Fig. 3
figure 3

Regression model of diameter-height by the parametric [mixed-effect regression (red line)] and nonparametric methods [ANN (black line)]. (Color figure online)

Of the ANN models evaluated for tree height, the MLP-based ANN that contained seven hidden layers was the superior model that resulted in an RMSE of 4.81 m (Table 4). In Table 4, only the models with high accuracy are given, while the results of all ANN are given in the Supplemental Materials (Table S3 and S4). However, predictions using the best ANN and mixed model were nearly identical across the range of tree sizes examined (Fig. 3).

Table 4 Characteristics of the superior MLP-based ANNs for individual tree height and associated goodness-of-fit criteria for model performance

Individual tree survival growth (mortality) model

In the mortality model, we considered the survival of trees to be a one and dead trees are a zero with a binary logistic regression used to estimate the survival model. Equation 12 is the individual tree survival growth (mortality) model

$$Y = \frac{1}{{1 + \exp^{{\left( {4 + 0.17\,{\text{DBH}} - 8.01\,{\text{BA}} - 1.10\,{\text{BAL}}} \right)}} }}$$
(12)

The results of the neural network show that the MLP-based network of tree survival had an RMSE of 0.28 and a bias of 0.011 and performed better than the RBF-based network (Table 5, Fig. 4). In Table 5, only the models with high accuracy are presented, while all of neural network determined are given in the Supplemental Materials (Table S5 and S6).

Table 5 Characteristics of the superior RBF and MLP-based ANNs for individual tree survival and associated goodness-of-fit criteria for model training
Fig. 4
figure 4

Model of survival by the parametric [mixed-effect regression (red line)] and nonparametric methods [ANN (black line)] with observed probabilities by 5 cm diameter class

Evaluation results of survival model are given in Table 6, which suggest that the ANN was slightly better in terms of ROC and R2. The better performance achieved for the ANN was primarily at the ends of the distribution in terms of smaller and larger trees (Fig. 4).

Table 6 Evaluation results of survival model by modeling approach

Ingrowth model

The average rate of ingrowth (number of trees that passed the diameter threshold of 12.5 cm) over a 10-year period was 42.8 ± 6.4 (mean ± SD) trees per ha, which was unequally distributed among the different species (Table 7). The average diameter of ingrowth trees in the 10-year period was 17.1 ± 2.21 cm. Equation 13 shows the ingrowth model:

$$\ln \left( {{\text{IN}}} \right) = 10.70 - 0.35 \, \left( {{\text{mDBH}}} \right) + 0.47\left( {{\text{BA}}} \right) - 0.0043\left( {{\text{BA}}^{2} } \right) + 0.01 \, \left( {{\text{BA}}_{{{\text{Fagus}}}} } \right)$$
(13)

where IN is the ingrowth of a 10-year period (number of trees that passed the diameter class of 12.5 cm) mDBH is the stand-level mean diameter at breast height (cm), BA is the total basal area (m2 ha−1), BA2 is the squared basal area (m2 ha−1) of each plot, and BAFagus is the basal area (m2 ha−1) of Fagus species.

Table 7 Average number of trees passed the diameter threshold of 12.5 cm over a 10-year period

The results of the ANN show that the MLP-based network of ingrowth had an RMSE of 0.54 trees per ha, a bias of 0.027 trees per ha, and performed better than RBF-based network (Tables 8; Fig. 5). In Table 8, only the models with high accuracy are given, while the performance of all determined ANNs is provided in the Supplemental Materials (Table S7 and S8). However, predictions of ingrowth using both ANN and mixed-effects were relatively similar across a range of basal areas (Fig. 5).

Table 8 Characteristics of the superior MLP-based ANNs for the number of ingrowth trees per ha that exceed the DBH threshold of 12.5 cm over 10 years and associated goodness-of-fit criteria for model predictions
Fig. 5
figure 5

Predictions of the mixed-effects ingrowth model [Eq. 11; number of trees passed the diameter class of 12.5 cm) (red line)] and nonparametric methods [ANN (black line)] over total stand-level basal area (m2 ha−1) with observed values

Comparison of modeling approach

A general comparison of the R2 and RMSEs from the parametric (mixed-effects) and nonparametric (ANN) prediction methods indicated that the ANN resulted in the best performance in this analysis (Table 9). For R2, the % improvement by ANN ranged from 3.1 to 157.1%, while % improvement for RMSE was between 10.0 and 94.9%. The largest improvements were primarily observed for the diameter increment model. However, as noted above, many of the predictions using either ANN or mixed-effects were generally consistent across the range of the data and both well aligned with observed trends.

Table 9 Comparison of various model fit statistics for parametric (mixed-effect regression) and nonparametric methods (ANN) by equation

Discussion

Evaluation of the structure and dynamics of natural forests helps to determine the possibility of optimal forest management in achieving a desirable structure so that the implementation of appropriate silviculture operations in the stands under management can help to preserve the biological diversity and sustainability of forests (Ashraf et al. 2013; Zhao et al. 2019). Forest structure and composition are very imperative factors for assessing forest health and sustainability. Hyrcanian forests also have a variety of functions that require an integrated and comprehensive management such as wood production, biodiversity, aesthetics, recreation and habitat. It appears that uneven-aged and mixed species management is an efficient and efficacious way of attaining these varying goals (Bourque and Bayat 2015; Yang et al. 2019). However, lack of suitable decision-support tools for the proper implementation of uneven management in mixed broad-leaved forests has delayed the scientific and efficient implementation of this technique (Bourque et al. 2019).

In this study, some models have been presented that can be employed in forest management. The models for uneven-aged, mixed forest presented in this study include individual tree diameter growth, height, and survival (mortality) as well as ingrowth. According to the results, the individual tree diameter growth had a robust performance across the species examined (Eq. 10). Results of a sensitivity analysis indicated that the most important independent variables used in the best model for diameter growth was the logarithm of DBH. Stand basal area and BAL were used to account for competition. BAL is the most common tool for quantifying one-sided competition because it is an absolute value, a simple calculation, and correlates well with the growth rate (Vanclay 1994; Weiskittel et al. 2011; Kweon and Comeau 2019; Kuehne et al. 2020).

Results also showed that increased competition greatly reduced diameter growth across the species examined. In addition, the ANN based on the multilayer perceptron network (MLP) provided a better fit to the data than the RBF-based ANN (Tab 2; Fig. 3). The results of the ANN sensitivity analysis showed in both cases that DBH and stand basal area were the most influential variables whose effects were further modified by physiographic factors in the form of altitude, slope, and aspect. The parametric model for diameter growth indicated that highest growth was observed on northwest aspects (300–340°), but declined as the slope steepened. Weiskittel et al. (2007) found a similar influence of altitude, slope, and aspect on individual tree diameter growth of species in the US Pacific Northwest.

In this study, the static technique was employed for modeling height growth. The descriptive statistics of trees’ DBH and height indicated that observations were selected from a wide range of values (12.5–138 cm for DBH and 8.0–40.5 m for height; Table 3). This suggests that nearly the full range of tree diameter and height for this area were considered. In the height model, species had no effect on the model, which was obtained when the model was fitted for all species separately and the difference was not statistically significant. The developed mixed effect regression height models from our work expressed tree height as a function of DBH and the best model we produced resulted in a RMSE value equal to 18.5%, which was noticeably lower in the present study than reported by Trasobares et al. (2004), where it ranged from 21 to 24%. This high accuracy is important as height models are needed for predicting tree volume. For the ANN method, the R2 and RMSE % values were lower than the parametric method, which were 0.79 and 18.2%, respectively. The MLP network had higher accuracy than RBF network. In addition, the important variable in this model included diameter, basal area, and basal area in larger trees.

The mortality of individual trees is an important event in the development of a forest stand and has significant applications in forest growth and product modeling (Bourque et al. 2019). In determining the survival model (mortality), despite the predictive difficulties of mortality at the stand level, there are many equations for estimating it at the tree level, and the logistic regression was used in this study similar to several previous studies (e.g., Hann et al. 2003; Monserud and Sterba 1999). Guan and Gertner (1991) in their assessment of using ANN and the logistic function showed the former was an effective technique for tree mortality modeling, which other studies have also found (King et al. 2000; Metcalf et al. 2009). However, when the model is used on independent data, the difference between logistic regression and other fitting methods may be less obvious. As a result, logistic regression is generally appropriate due to its simplicity, robustness, and easy application for many practical purposes of mortality modeling (Yagi and Primicerio 2014).

Results of the developed survival model showed that trees with a diameter of 20 to 150 cm had the highest survival (Fig. 4). With an increase in competition (BAL), tree survival rate decreased rapidly. According to the analysis, beech trees also had a lower survival probability than others. Assessment and validation of tree mortality models were difficult because of its discontinuous and discrete nature (Hann et al. 2006). According to our results, ANN was able to better estimate survival probability and to identify the predictors that contribute the most. ANN uses nonlinear network connections and allows an analysis that explores the efficacy of all input variables at the same time, which may lead to an improved quality of the results. In this study, the validation of models was determined using Nagelkerke R2 and its value was 0.40 (Table 6). According to studies, this parameter represents a good fit of the model if the value is between 0.2 and 0.4 (Wilson and Oliver 2005). In this study, the validity of the model was further assessed using Hosmer–Lemeshow test and its value was 0.092. Given the fact that the value of this parameter is greater than 0.05, the model obtained is valid (Pukkala and Kellomäki 2012). In addition, the other criterion was ROC, which was 0.81 and indicates relatively robust predictions. The correct prediction of the survival model was 92% if 0.5 was used as a threshold for survival. In this model, the diameter and basal area and their transformations were among the factors that had a high impact on the model, respectively, while the physiographic factors in this model had no effect.

Models of ingrowth predict the development of trees from the sapling stage instead of predicting seedling development or trying to evaluate the variety of factors that influence the process of forest stand rejuvenation (Ma et al. 2019). One of the benefits of this approach is that the time needed for establishment of regeneration in the stand after the reduction of stand density due to harvesting allows it to be more realistic (Gould et al. 20062007). Regarding the results, the ingrowth model was generally reasonable and was more reliable than other models (Fig. 5). The ANN showed that MLP approach had highest accuracy and lowest bias (Table 8). Ingrowth increased with the basal area of stand, which is a similar finding to results of Trasobares et al. (2004) who showed that the basal area had a major role in regeneration modeling. In this study, the ANN trial-and-error procedure showed that depending on the input variables, four to ten hidden layers resulted in the best performance. However, the number of neurons on the hidden layer varied based on the number of input variables. Some researchers have suggested that the number of neurons on the hidden layer should be more than twice that of the input neurons (Liu et al. 2013; Zhu et al. 2018). Although other researchers found that the number of hidden layers and their neurons is totally dependent on the size of the data, which always leads to a trial and error procedure or application of optimization techniques (Karaboga and Kaya 2018).

Considering the potential of ANN in forest measurements and modeling, studies like Vendruscolo et al. (2017), Çatal and Saplioğlu (2018), Reis et al. (2018), Sanquetta et al. (2018) and Vieira et al. (2018) have found that they generally perform better than regression models, while they can be estimated with higher speeds and result in more precise models. Thereby, in order to minimize the errors in growth models estimation in uneven-aged forest, such as the Hyrcanian forest, the use of neuron network techniques is very important, (Aertsen et al. 2010; Breiman 2001; Cano et al. 2017; Görgens et al. 2015; Ozçelik et al. 2013, 2010; Reis et al. 2018; Siminski 2017; Simões and Shaw 2007). In particular, ANNs represent powerful methods for data analysis that can enhance estimation skill (Xu et al. 2014). However, it is important to recognize many potential limitations of nonparametric methods like ANN as they may over fit the data, cannot extrapolate beyond the available training data, and do not guarantee biologically plausible behavior. Consequently, the assessment and evaluation of long-term growth projections from both parametric and nonparametric is needed. This is important as improved model fit statistics like R2 and RMSE does not necessitate better long-term (25 + years) growth projections (e.g., Russell et al. 2011), which underscores the need and importance of robust model forms that might be challenging for nonparametric approaches to ensure.

Conclusion

Growth and yield models for mixed and uneven forests have a very important role in sustainable management. Today, a logical solution cannot be achieved in many planning and policy formulation processes without modeling and simulation. Consequently, this study was aimed at providing a growth model for the primary species in the Hyrcanian forest in order to determine the best condition and the most important factors for their growth that can be used in forest management. In this study, tree-level diameter growth, height, and survival as well as stand-level ingrowth were developed using mixed effects and artificial neural networks. Individual tree growth models are needed in order to determined suitable management regimes for the uneven- aged and mixed Hyrcanian forests.

Results showed the MLP model was more accurate than RBF and we found the ANN approach relatively easy to use and powerful as they significantly outperformed the traditional approaches for predicting diameter growth, height, survival, and ingrowth. Overall, results indicated that growth and yield model performance was consistent with expectations, and that the general fit to the validation data was acceptable. The results also indicated that the models performed relatively well in terms of amount of the variation explained (36–99%). Therefore, they seem quite suitable for predicting tree growth and survival in uneven-aged hardwood stand types in the region where the study area was situated.

According to the general findings, ANNs were capable of accurately and robustly predicting the different tree-level variables needed for effective forest modeling. Therefore, ANN can likely be considered a very promising alternative technique when sufficient data are available for analysis. It is suggested that the artificial neural network method be used in other forest areas to maximize the available accuracy in future studies. In addition, it is important to compare long-term projections using ANNs with actual field data to ensure robust model behavior and performance that is well aligned with biological expectations. Ultimately, the best approach for growth and yield modeling might be the hybridization of parametric and nonparametric methods like ANN that can fully capitalize on the advantages of each method. In general, long-term data collected in the forest with permanent sample plots are one of the most important sources for evaluating growth and yield models. It is suggested that these models be developed for other forests using permanent sample plot data.