Introduction

Larix gmelinii var. principis-rupprechtii (Mayr.) Pilger is an important native tree species in mountainous regions of North China, and can be used as a large diameter timber species or as pulp for paper making. It has broad development and application prospects. Provenance or family selection is the foundation of tree breeding and is of significance for providing suitable germplasm for specific areas (Pan et al. 2018). Regional trials or multi-site trials can be used to study genotype by environment (G × E) interactions. By analyzing the adaptability and genetic stability of each germplasm, suitable material for an individual site or for multi-sites can be selected (Lin 2019; Zheng et al. 2019) Based on multi-year observations of trial stands, genetic parameters for growth traits can be estimated. They are critical for designing optimal breeding strategies and for shortening the breeding cycle. In the past, provenance or family tests were often based on single year and single-site experiments, single year and multi-site experiments, or multi-year and single site experiments, and have been reported for species such as Pinus elliottii Engelmann (Zhang et al. 2017), L. kaempferi (Lamb.) Carrière (Pan et al. 2018), L. gmelinii var. principis-rupprechtii (Zheng et al. 2019), L. olgensis Henry (Zhang et al. 2020), Cunninghamia lanceolata (Wu et al. 2019) and P. tabuliformis Carrière (Li et al. 2015). Multi-year and multi-site tests can make more accurate and reliable assessments of provenance or family and site, the G × E interactions, and the identification of early age selection (Xu and Li 2015). Presently, most multi-site and multi-year studies have focused on agricultural crops (Badu-Apraku et al. 2013; Kivuva et al. 2014; Laurie and Booyse 2015; Laurie et al. 2015; Dia et al. 2016). For tree species, due to their long growth cycle, there have been only a few studies about G × E interaction or early selection based on multi-site and multi-year experiments (Bian et al. 2014; Diao et al. 2016). Such studies based on multi-site and multi-year tests have not been reported for L. gmelinii var. principis-rupprechtii.

In order to decide the appropriate breeding strategies, four trial plantations of 25 L. gmelinii var. principis-rupprechtii families were established in Hebei province. The study objectives were to: (1) describe variations in genetic parameters for growth traits with age; (2) determine the most appropriate age for early selection; (3) select families with high yield and good cross-sites stability; and, (4) select sites with high discriminating ability, strong representativeness, and good cross-age stability. These findings help in promoting the genetic improvement of L. gmelinii var. principis-rupprechtii.

Materials and methods

Sample plots and data collection

Four family trials of L. gmelinii var. principis-rupprechtii were carried out in 2003 at four sites in Hebei province: Zhazi, Weichang (CZ); Mayinggou, Chicheng (CC); Liutiaogou, Guyuan (GY); and Yudaokou, Weichang (YDK). The environmental conditions are detailed in Table 1. A randomized complete block design of 25 half-sib families was used in each trial with 10-tree plots and three replicates. In 2019, one average tree from each family was felled to obtain stem analysis data, i.e., growth data of 300 trees at 5 − 17 years of age, including inside-bark diameter at breast height (DBH), tree height, and inside-bark stem volume.

Table 1 Environmental conditions for each test site

Statistical analysis

Family models of each trait (DBH, height and volume) for multiple sites are listed in Naranjo et al. (2012) and Ling et al. (2021) (Eq. 1). They were mixed linear models based on individual ages. The ASReml 4.0 package of R software was used (Gilmour et al. 2016).

$$ Y_{ijkl} = \mu + S_{i} + R_{i\left( j \right)} + F_{k} + SF_{ik} + e_{ijkl} $$
(1)

where, Yijkl is observed value of family k in block j at site i; μ is the overall mean; Si is the fixed effect of the ith site; Ri(j) is the fixed effect of the jth block within the ith site; Fk is the random effect of the kth family; SFik is the random interaction effect of the ith site and the kth family; and eijkl is the random residual effect.

Based on the above-mentioned family models, the family variance components (Vf), the family × site variance components (Vf*l), and the ratio of Vf*l to Vf (Vf*l/Vf) of DBH, height and volume were estimated and analyzed for each age. Significance of variance was tested using the likelihood ratio test (Diao et al. 2016; Lai et al. 2017). Individual-tree heritability (hi2), family heritability (hf 2) and variation coefficients at phenotypic, environmental, and genetic levels (PCV, ECV and GCV) of each trait were calculated with Eqs. 2 − 6 (Ren et al. 2010; Li et al. 2015; Pan et al. 2018), followed by analyzing their trends with tree age.

$$ h_{i}^{2} = 4\sigma_{f}^{2} /\left( {\sigma_{e}^{2} + \sigma_{f}^{2} } \right) $$
(2)
$$ h_{f}^{2} = \sigma_{f}^{2} /\left( {\sigma_{e}^{2} /r + \sigma_{f}^{2} } \right) $$
(3)
$$ PCV = SD_{P} /\overline{X} $$
(4)
$$ ECV = SD_{E} /\overline{X} $$
(5)
$$ GCV = SD_{G} /\overline{X} $$
(6)

where, hi2 is individual-tree heritability, hf 2 family heritability, σf 2variance component of family, σe 2 variance component of environment, r the number of repetitions, PCV phenotypic variation coefficient (%), ECV environmental variation coefficient (%), GCV genetic variation coefficient (%), SDP phenotypic standard deviation, SDE environmental standard deviation, SDG genetic standard deviation, and \(\overline{X}\) the mean of the measured trait.

Wricke (1962) eco valence (Ei) and the main genotype effect plus genotype × environment interaction effect (GGE) biplot were used to evaluate the stability of families. For traits with higher heritability and variation coefficient, Ei values for each family were calculated. A family with lower Ei is stable. The breeding values of families for traits with higher heritability and higher variation coefficient were then estimated using the best linear unbiased prediction (BLUP) at ages with greater values of Vf*l/Vf (i.e., stronger G × E interaction according to Lin et al. 2013). The BLUP data obtained by adding the breeding values and the overall means were used in GGE biplot analysis done by the R software package, GGEBiplotGUI (Yan 2010). To evaluate the yield and stability of families under different environments (i.e., groups of site × age) and the discriminating ability, representativeness and between age stability of the sites, three views of GGE biplots for each trait were analyzed; they were: a “Which-Win-Where” view showing grouping of environments and the best family for each environment, a “Discriminating Ability vs. Representativeness” view showing discriminating ability and representativeness of environments, and a “Mean vs. Stability” view showing mean and stability of families.

Genetic (RG) and phenotypic correlations (RP) between growth traits (i.e., DBH, height and volume) at early ages and volume at age 17 were calculated with Eqs. 7‒8 (Wu et al. 2019). Efficiency of early selection (E) was estimated with Eq. 9 (Wu et al. 2007). Based on the RG, RP and E, the appropriate early selection traits and selection age can be determined.

$$ R_{G} = Cov_{G12} /\sqrt {\sigma_{G1}^{2} \times \sigma_{G2}^{2} } $$
(7)
$$ R_{P} = Cov_{P12} /\sqrt {\sigma_{P1}^{2} \times \sigma_{P2}^{2} } $$
(8)
$$ E = \left( {i_{E} \times R_{G} \times h_{E} } \right)/\left( {i_{L} \times h_{L} } \right) $$
(9)

where, RG is genetic correlation, CovG12 is the genetic covariance between traits (DBH, height or volume) in early years with the volume at age 17 σG1 2 genetic variance for traits in early years, σG2 2 is genetic variance for the volume at age 17, RP is phenotypic correlation, CovP12 is the phenotypic covariance between traits in early years with the volume at age 17, σP12 is phenotypic variance for traits in early years, σP22 is phenotypic variance for the volume at age 17, E is efficiency of early selection, iE and iL are the selection intensity in the early and late ages (iE = iL in this calculation), and hE and hL are the square root of the heritability in the early and late ages.

Results

Estimates of variance components for DBH, height and volume at each age

The p value of site effect and the estimates of variance components for DBH, height and volume at each individual age are shown in Table 2. Site was significant for all the traits for ages between 5 and 17 years (p < 0.001). For DBH, the family effect was significant at most ages (p < 0.05), except for a marginal significance at age 14 (p = 0.058), and the family × site interaction was significant (p < 0.05) after age 13. For height, the family × site interaction was significant (p < 0.05) at most ages except for age 6, and the family effect was not significant from age 5 to 17 (p > 0.1). For volume, the family effect was significant (p < 0.05) after age 6, and the family × site interaction was significant (p < 0.05) after age 11.

Table 2 Estimates of variance components for various traits at each individual age

The ratios of Vf*l to Vf (i.e., Vf*l/Vf) for DBH and volume were larger for ages 13–17, indicating the stronger G × E interactions for DBH and volume at these ages compared with other ages (Fig. 1). The Vf*l/Vf for height changed from 3.67 to 1 778 811, which was much larger than the values for DBH and volume (Fig. 1).

Fig. 1
figure 1

Age trends of Vf*l /Vf values

Heritability and variation coefficient for each trait

The family heritability for DBH, volume and height varied from 0.40 to 0.55, 0.29 to 0.54 and 0.00 to 0.33, respectively, and the corresponding individual-tree heritability estimates varied from 0.22 to 0.51, 0.13 to 0.49 and 0.00 to 0.22, respectively (Fig. 2). Family heritability was higher than the individual tree heritability estimates for all traits. The family and individual tree heritability estimates of DBH and volume were greater than those of height.

Fig. 2
figure 2

The effects of tree age on individual tree and family heritability estimates for DBH, height and volume

The phenotypic variation coefficients for DBH, volume and height varied from 9.0% to 65.8%, 19.6% to 85.0% and 7.6% to 25.9%, respectively. The corresponding environmental variation coefficients varied from 8.4% to 63.9%, 18.4% to 83.6% and 7.4% to 25.9%, respectively. The corresponding genetic variation coefficients varied from 2.6% to 15.6%, 6.8% to 17.4% and 0.01% to 1.8%, respectively (Fig. 3). The three types of variation coefficients for each trait decreased with age and gradually stabilized. For all traits, the phenotypic variation coefficients had the largest values, followed by the environmental variation coefficients, and genetic variation coefficients were smallest. For each type of variation coefficients, volume had the highest variation coefficients, followed by DBH and height.

Fig. 3
figure 3

Age trends of genetic, environmental, and phenotypic variation coefficients for DBH, height and volume

Selection and evaluation of families and sites

Given the higher heritability and higher variation coefficients of DBH and volume compared to height (Figs. 2 and 3), the ecovalences (Ei) of DBH and volume at age 17 for each family were calculated and ranked in order to evaluate the genetic stability of each family (Table 3). Among the top 12 families with high stability, the mean DBH values of families 66, 68, 82, 76, 111, 96 and 78 were higher than the mean DBH of all the other families at all sites (9.9 cm); similarly, the mean volumes of the above-mentioned families (except for family 68) were higher than the mean volume of all the other families at all sites (0.04 m3). Hence, families 66, 68, 82, 76, 111, 96 and 78 were preliminarily judged as high yielding and with above average stability. Family 100 ranked 13th among 25 families, with a mean DBH greater than 9.9 cm and a mean volume equal to 0.04 m3. Therefore, family 100 could be considered as a high yielding and moderately stable family. Among the 12 families ranked after family 100, the mean DBH values of families 42, 72, 79, 77 and 56 were greater than 9.9 cm, and the mean volume of families 72 and 79 was greater than (or equal to) 0.04 m3. Therefore, families 72 and 79 could also be judged as high yielding and with below average stability.

Table 3 Ecovalence (Ei) of DBH and volume at age 17 for each family

For DBH and volume, GGE biplots containing 20 environment combinations (i.e., site × age. Sites = CZ, CC, GY and YDK; ages = 13, 14, 15, 16 and 17) were made based on DBH and the volume BLUP data.

For DBH, the interaction between families and environment is shown in Fig. 4a. The biplot was divided into seven sectors in polygon view, and the vertex families had the best performance in each sector. The 5 site × age combinations of site YDK were in the same group, with family 78 as a possible candidate, and the 15 site × age combinations of sites CZ, CC and GY belonged to the other group, with family 111 performing best. The grouping results of the sites were consistent across ages. Discriminating ability and representativeness of each site × age combination are shown in Fig. 5a. The solid line with an arrow represents the average environment axis (AEA), and the length of the dotted line between the site × age combination and the origin represents the discriminating ability of an environment combination. The angle between the environment combination vector and the AEA indicates the representativeness of environment combination. The smaller angle means the stronger representativeness (Ling et al. 2021). The first two principal components accounted for 76.5% of total G × E interaction. Site CZ had the highest representativeness, followed by sites GY and CC, and site YDK had the lowest representativeness. The representativeness of the sites was consistent across ages.

Fig. 4
figure 4

Site groupings and the best family for each site; a= DBH (Scaled = 0, Centered = G + GE, SVP = Symmetrical); b volume (Scaled = Std Deviation, Centered = G + GE, SVP = Symmetrical)

Fig. 5
figure 5

Discriminating ability and representativeness of the sites; a DBH (Scaled = 0, Centered = G + GE, SVP = Column Metric Preserving); b volume (Scaled = Std Deviation, Centered = G + GE, SVP = Column Metric Preserving)

At ages 15, 16 and 17, site YDK had the highest discriminating ability, followed by sites GY and CZ, and site CC had the lowest discriminating ability. At ages 13 and 14, site YDK had the highest discriminating ability, followed by sites GY and CC, and site CZ had the lowest discriminating ability. For sites YDK, GY and CC, they had the highest discriminating ability at age 17, followed by ages 16, 15 and 14, and they had the lowest discriminating ability at age 13. For site CZ, it had the highest discriminating ability at age 16, followed by ages 17, 15 and 14, and it had the lowest discriminating ability at age 13. Overall, site GY had the large discriminating ability, high representativeness and good cross-age stability, thus was the most suitable site to effectively select the high-yielding and stable families (Fig. 5a).

Yield performance and stability of the families are shown in Fig. 6a. The solid line with an arrow represents the average environment axis (AEA). The vertical solid line to the AEA through the origin denotes the overall mean DBH. The DBH of the families on the left of the vertical solid line was lower than the overall mean, whereas the DBH of the families on the right of the solid line was higher than the overall mean. The vertical dotted lines to the AEA represent the average DBH and the stability of the DBH of each family across all environment combinations. The longer the vertical dotted line, the less stable the family (Ling et al. 2021). Families 111, 76, 82, 96, 66, and 77 were high-yielding and highly stable; families 78, 56 and 68 were high-yielding with above average stability; families 72 and 79 were high-yielding with below average stability; and families 42 and 100 were high-yielding and unstable (Fig. 6a).

Fig. 6
figure 6

Mean and stability of the families; a DBH (Scaled = 0, Centered = G + GE, SVP = Row Metric Preserving); b volume (Scaled = Std Deviation, Centered = G + GE, SVP = Row Metric Preserving)

For volume, the 20 site × age combinations of four sites were in the same group, with family 111 performing the best, followed by family 78. The grouping results of the sites were basically consistent across ages (Fig. 4b). The first two principal components accounted for 77.4% of total G × E interaction. Site GY had the highest representativeness, followed by site YDK, and sites CC and CZ had the lower representativeness. The representativeness of the sites was roughly consistent across ages (Fig. 5b). Overall, site GY had high representativeness and good cross-age stability, thus was the optimum test site. Families 111, 78, 82, 76, 100, 66 and 49 were high-yielding and highly stable, and families 96, 72 and 79 were high-yielding and had average stability (Fig. 6b).

Early selection of families

The genetic correlations (RG) between DBH in early ages and volume at age 17 (v17) started to be significantly positive from age 6 (p < 0.05) and varied from 0.76 to 0.99 (Table 4). The RG between the volume at early ages and the v17 began to be significantly positive from age 5 (p < 0.05) and varied from 0.90 to 0.99 (Table 4). For height, only the RG between the height at a few ages (9, 12, 16 and 17) and the v17 were significant (p < 0.05). The phenotypic correlations (RP) between the traits at early ages and the v17 started to be significantly (p < 0.01) positive (0.31 − 0.82) from age 5 for DBH, age 3 (p < 0.01) for height (0.21 − 0.62) and age 5 (p < 0.05) for volume (0.35 − 0.99) (Table 4). The RG and RP between the traits at early ages and the v17 gradually increased with age. From age 7, most of the RG and RP between the traits in early ages and the v17 exceeded 0.5 (Table 4). The early selection efficiencies (E) of height were lower than those of DBH and volume at all ages (Fig. 7). The E of DBH and volume increased quickly from ages 5 to 7, and after age 7 were kept in the narrow ranges of 0.82 to 0.99 and 0.88 to 0.99, respectively, i.e., the E exceeded the critical value of early selection (0.8) (Zhang et al. 2013) from age 7. Hence, the appropriate early selection age of Larix gmelinii var. principis-rupprechtii was seven years for DBH and volume.

Table 4 Correlation estimates (± standard errors) between traits (DBH, height and volume) at early ages and volume at age 17 (v17)
Fig. 7
figure 7

Early selection efficiency (E) of each trait at different ages

Discussion

Traits used for family evaluation

Traits with higher heritability can be more stably passed on to the next generation and are less affected by the environment. Traits with larger variation coefficients are more suitable for family evaluation. Based on traits with high variation and high heritability and using multi-age and multi-site data, effective analysis can be carried out at ages when the family × location interactions are intense in order to select high-yielding stable families across multiple sites and ages, to select the most suitable site for family selection, and to propose a suitable family selection strategy for different sites.

DBH and volume of the families evaluated at four sites had moderate heritability values from 0.43 to 0.55 after age 6, similar to those of DBH (0.45) and volume (0.43) for half-sib families according to Xing et al. (2019), and also close to the heritability ranges of DBH (0.49 to 0.58) and volume (0.55 to 0.65) for L. kaempferi (Lamb.) Carr. (Diao et al. 2016). Family heritability values were higher than those of the individual trees for DBH, volume and height, which is consistent with Diao et al. (2016) and Lai et al. (2017) and indicates that higher genetic gains may be achieved with family selection in L. gmelinii var. principis-rupprechtii. The fact that the genetic, environmental, and phenotypic variation coefficients for each trait decreased with age and gradually remained stable, is consistent with reports on C. lanceolata and L. kaempferi ( Sun et al. 2004; Diao et al. 2016; Wu et al. 2019), but in contrast with another study on L. kaempferi (Pan et al. 2018). For all traits, the phenotypic variation coefficients had the largest values, followed by the environmental variation coefficients and genetic variation coefficients, indicating that the environmental impacts on the three traits were greater than the genetic impacts.

Heritability at each age was lower for height than DBH and volume, which agrees with Yin et al. (2019) but in contrast with Diao et al. (2016) and Lai et al. (2017). Because the three variation coefficients of volume and DBH were higher than those of height at each age indicates that volume and DBH were more sensitive to environmental influences and showed greater variation among families. This is supported by studies by Sun et al. (2004), Diao et al. (2016) and Yin et al. (2019). Therefore, volume and DBH had higher, more stable genetic characteristics, greater variation, and higher selection potential than height in this study. Hence, volume and DBH were the suitable traits for family selection of L. gmelinii var. principis-rupprechtii, consistent with research results on P. massoniana Lamb. and L. kaempferi (Yang et al. 2010; Bai et al. 2012; Pan et al. 2018). As volume is a function of DBH and height, and particularly dependent on diameter, DBH should be the most suitable and convenient trait for family evaluation.

It is should be noted that the heritability and genetic variation coefficients of height were close to zero over many years, and Vf*l/Vf ratios were higher for height than for those of DBH and volume (Figs. 1, 2, 3), which is related to the insignificant effects of from age 5 to 17 for height (p > 0.1) (Table 2). The 25 families in this study were second-generation superior half-sib families, which may account for the insignificant family effects on height. In addition, the growth data at different ages in this study were obtained by analyzing the stems of harvested trees, and only one average tree of each family was felled in each replicate. This might have affected the accuracy of the results to some extent. In future research, it will be important to increase the number of harvested trees in each plot to obtain more stem analysis data, or to measure growth data yearly in the field, which could reduce the environmental impact and improve the accuracy of genetic parameter estimation.

Different breeding goals require different evaluation and selection methods. Given that L. gmelinii var. principis-rupprechtii is an important timber species, growth traits are key considerations in breeding. Volume and DBH traits were used to evaluate the families, and a relatively ideal selection effect was achieved. In addition, form and wood properties are also important aspects in breeding of the species. In order to achieve the maximum possible gain in all traits simultaneously, multi-trait selection based on selection index can be used (Carreras et al. 2017). Therefore, more precise selection based on various traits should be done in the future.

Early selection

When volume at rotation age is the target trait, some studies indicate that height was the best early selection trait due to its higher heritability, larger juvenile-to-mature correlation, and lower effect due to spacing compared to other traits (Balocchi et al. 1993). In contrast, some research showed that DBH was the more efficient early selection trait (Ma et al. 2000; Sun et al. 2004), and other studies suggested that selection based on both height and DBH was most reliable in early selection (Lai et al. 2014; Diao et al. 2016). In this study, volume at age 17 (v17) was the target trait, and the results indicate that DBH was the more effective early selection trait than height and volume. Compared to total height, DBH at various ages had higher heritability, stronger correlation with v17, and larger early selection efficiency. Moreover, DBH is more easily measured than height. These results agree with the findings of Ma et al. (2000) and Sun et al. (2004). Similarly, to the heritability of height, the fact that the early selection efficiency of height was close to zero before age 14 was also due to the insignificant family effect of height. These results were obtained under the assumption that the stand density of the four sites was similar. In addition, because the growth data of young and middle-aged stands were used to determine early selection age and no data on older plantations were available, more accurate estimates incorporating data from mature stands should be done in future.

Similar studies have been carried out on larch. Considering 15 years as the mature age for pulpwood for L. kaempferi, Sun et al. (2004) found that age 6 was the most suitable early family selection age for DBH. Diao et al. (2016) considered that the optimum selection age for this species in Hubei and Liaoning was 5 years for DBH and four for height. Lai et al. (2014) suggested that the optimum selection age for L. kaempferi was 5 years for DBH. Based on data of 13-year-old hybrid larch, Miao et al. (2018) indicated that 6 years was the optimum age for DBH. Zhang et al. (1996) studied a 30-year-old L. olgensis plantation and considered that 10 years was the most suitable selection age. The present study notes that early selection of L. gmelinii var. principis-rupprechtii can be carried out at age 7, which is close to the results of the above studies. From age 7, the genetic correlation between DBH at an early age, and volume at age 17 was high (above 0.86, Table 4), indicating that early-to-late correlation of growth traits was controlled mainly by genetic factors, further supporting the effectiveness of early selection (Ma et al. 2000; Sun et al. 2004). With the increase in age, phenotypic and genetic correlations generally gradually increased, indicating that the effect of early selection increased with age, which is consistent with the results of Wu et al. (2019).

Cross-site analysis

Both single-site and cross-site analyses can be carried out in multi-site family trials to reveal age-related trends in genetic parameters. The linear mixed model with the family effect and family × block interaction effect as random effects and with the block effect as a fixed effect, is usually used for single-site analysis. The same model with the additional factors of site (fixed effect) and family × site interaction (random effect) is used for cross-site analysis (Diao et al. 2016; Yin et al. 2019; Ling et al. 2021). Both analyses were carried out in this study, and the results show that the genetic parameters of growth traits from cross-site analyses were more stable and reliable than those from single-site analyses. This is consistent with the findings by Weng et al. (2007). Therefore, all the results presented in this study were based on the multi-site analysis model with a view to developing more reliable strategies for tree breeding and early selection.

Conclusion

Based on the stem analysis data of 25 half-sib families of L. gmelinii var. principis-rupprechtii 5 − 17 years of age at four test sites, the variance components, genetic parameters, and correlations between growth traits at early ages and volume at age 17 were calculated, and genotype main effect plus genotype-environmental interaction effect (GGE) biplots based on best linear unbiased prediction data drawn. DBH was determined to be the best trait for L. gmelinii var. principis-rupprechtii early selection, and the optimum time was 7 years. Volume and DBH were more suitable traits than height for family selection. The first two principal components in genotype × environment biplots accounted for 76.5% and 77.4% of the total interaction for DBH and volume, respectively. Site GY was the most suitable site because of its stronger discriminating ability, representativeness and stability across all ages. Families 66, 76, 82 and 111 were high-yielding and highly stable, families 78 and 96 were high-yielding and above average stability, and families 72 and 79 were high-yielding and below average stability. This study can support decision-making for the selection of L. gmelinii var. principis-rupprechtii in northern Hebei and provides an example for the application of multi-year and multi-site data in tree breeding.