Introduction

Under global warming scenarios (IPCC 2007), warmer and shorter winters will occur in many regions, increasing the risk of late spring and early autumn frosts. How these new conditions would affect productivity, quality and distribution of species is a question under debate (e.g. Lindner et al. 2010), as the response to cold is affected by the sensitivity of the species (Sutinen et al. 1992; Sakai and Larcher 1987).

Cold hardiness, i.e. the ability of plants to withstand freezing temperatures without undergoing significant damage, displays a large level of genetic variation in forest trees both among and within populations. Under common garden conditions, some populations harden in autumn more rapidly than others, depending on their origin (Díaz et al. 2009; Weng and Parker 2008), or deharden differently in spring in response to late winter and/or early spring climate conditions (Díaz et al. 2009). Differences in hardening/dehardening have been observed also between families (e.g. Darychuk et al. 2012) or clones within populations (e.g. Anekonda et al. 2000), showing the importance of cold hardiness as a selective factor in forest trees. The implications of such differences are being considered both in assisted migration programs, breeding programs or transfer guidelines for genetic materials in order to increase productivity or adaptability (e.g. Kremer et al. 2011; O’Neill et al. 2001). In most of these applications, reliable screening methods suitable for large numbers of genotypes are a bottleneck to progress in genome-wide selection programs under different climatic scenarios (Neale and Kremer 2011). The efficiency of these methods could vary depending on the used material (populations, families, clones) from different species.

Cold hardiness can be assessed by examining the freezing damages after natural frost events in field trials, but this method poses limitations related to uncontrolled conditions and lack or repeatability which in turn leads to low statistical precision for determining differences of cold hardiness among genotypes. A better solution is to subject tissue samples to different freezing temperatures under controlled conditions and to evaluate the freezing damages in those samples (Burr et al. 1990).

Several alternative screening methods for cold hardiness are available (see Burr et al. (2001) and Calkins and Swanson (1990) for a review). Among them, visual scoring (V), relative conductivity (C) and fluorometry (F) of plant tissue are the most used methods. Visual scoring is an efficient and fast method widely used in conifers (e.g. L’Hirondelle et al. 2006; Anekonda and Adams 2000; Aitken and Adams 1997). The tissue is allowed to develop symptoms of damage for several days after freezing before scoring the damages in discrete classes. The relative conductivity method measures the concentration of electrolytes leaking from the plant tissues after freezing, providing objective and quantitative results 2 days after the freezing treatment. The relative conductivity method requires small amounts of tissue and has been used for measuring cold hardiness in many conifers (Climent et al. 2009; Royo et al. 2003; Burr et al. 2001; Ryyppö et al. 1998; Colombo 1997; McKay 1994; Sutinen et al. 1992). The fluorometry method determines the efficiency of the photosystem II of the plant tissues (as the ratio of variable fluorescence to maximal fluorescence: calculated as F v/F m) and has been found to be efficient in detecting freezing damage in conifers (Corcuera et al. 2011; Peguero-Pina et al. 2008; Perks et al. 2001, 2004; Binder et al. 1996; Binder and Fielder 1996a; Lindgren and Hällgren 1993).

Pinus pinaster (maritime pine) is one of the most important forest species of the western Mediterranean Basin and the Atlantic coastal region of southern Europe, both for its economic and ecological value. The species displays large levels of intraspecific variation in morphological (Alía et al. 1995) and physiological parameters related mainly to drought tolerance (Aranda et al. 2010; Correia et al. 2008) or frost tolerance (Bouvarel 1960; Corcuera et al. 2011; Illy 1966; Le Tacon et al. 1994). Implications of the intraspecific differences in cold hardiness are of high concern for the use of reproductive material of the species in breeding or afforestation programs, as shown by the prohibition of some Iberian origins to be used in certain regions of France (European Commission 2005). Therefore, maritime pine is a good model species to test different screening methods for cold hardiness and to test the variability at different genetic levels.

In this work, we study the cold hardiness of P. pinaster at different genetic levels (families and populations), in different plant organs (needles, buds and stems), with different environmental conditions (years and seasons) and using alternative methods. The objectives of this study were (1) to detect differences in cold hardiness among maritime pine populations, (2) to compare three methods (visual scoring, relative conductivity and fluorometry) for assessing cold hardiness in maritime pine and (3) to evaluate the response of different organs to the freezing experiments.

Materials and methods

Plant material and experimental site

Six populations of P. pinaster covering the range of distribution (34° to 44° N and 1° to 9° W, and at 0 to 1,600 m above sea level (a.s.l)) and with contrasting climatic conditions of origin were chosen. Three of them come from the Atlantic coast (Sergude, SERG; Leiría, LEIR; and Mimizán, MIMI), while the other three come from a Mediterranean are with continental climate (Bayubas, BAYU; Cuellar, CUEL; and Tamrabta, TAMR; Fig. 1).

Fig. 1
figure 1

Location of the six P. pinaster populations sampled in the study and the experimental site (EUFORGEN 2009)

The saplings (4-year-old trees) from the six populations were located in a provenance-progeny test established in Ourense (42° 14′ N, 7° 56′ W and 460 m a.s.l., Galicia, Spain). Different artificial freezing experiments were carried out in spring and autumn over two consecutive years, 2009 and 2010, with different amount of saplings, freezing temperatures and assessed variables as detailed in Table 1. For each freezing temperature, we employed one sapling per family (9 families from each population) in the 2009 experiments and four saplings per family in the 2010 experiments (9 families from MIMI, 11 from LEIR, 12 from BAYU, 13 from TAMR, 14 from CUEL and 16 from SERG). Twigs were taken from the last lateral growth of well-exposed branches, corresponding to branches developed during the fourth growing season (collected in early spring—23 March 2009—before the growth of new shoots), the fifth growing season (collected at two times, in autumn 2009 (28 October) and in spring 2010 (23 April) in an early phenological stage of the new growth before developing new needles) and the sixth growing season (collected in autumn 2010 (5 November)). For each freezing experiment, all twigs were in a similar phenological phase. In the 2009 spring experiment, almost all twigs had elongating buds (stage 2 in a six-level scale), while in the 2010 spring experiment, almost all twigs had elongating internodes, a more advanced stage (stage 3 in a six-level scale). In both autumn experiments, all twigs had well-developed terminal buds.

Table 1 Summary information of the collected samples: assessment year, season, collecting date, plant age (years), number of populations (P), number of families per population (F(P)), total number of families (F), number of plants, number of twigs, number of assessed temperatures (T) and the assessed variables (V)

Climatic conditions in the provenance-progeny test were colder in 2010 than in 2009, 1.7 °C colder on average during the 2–3 months preceding the freezing experiments. Moreover, there were much more freezing days in 2010 than in 2009 (21 versus 9 at the Gandarela weather station, 42° 17′N, 7° 97′W, 623 m, the closest station to the provenance-progeny test).

Artificial freezing experiments

Freeze-thaw treatments

For the 2009 spring experiment, the samples were randomly inserted in trays with vermiculite (Díaz et al. 2009). Two twigs were used in each freezing temperature, one for the visual scoring method and the other for both relative conductivity and fluorometry methods. In the 2009 autumn experiment and 2010 spring and autumn experiments, the samples were wrapped in a lightly moistened tissue and then in an aluminium foil (Anekonda and Adams 2000). One twig of each sapling was used for all the three screening methods.

Samples were placed in a programmable freezing chamber and exposed to the predetermined experimental temperatures: T9 (−9 ± 0.4 °C), T12 (−12 ± 0.4 °C), T15 (−15 ± 0.4 °C) and T17 (−17 ± 0.4 °C) in year 2009 and T6 (−6 ± 0.4 °C), T11 (−11 ± 0.4 °C), T14 (−14 ± 0.8 °C), T16 (−16 ± 1.0 °C) and T19 (−19 ± 0.4 °C) in year 2010. In 2009, all temperatures were tested during the same day. Plants were cooled down at the rate of 2 °C per hour, and after 30 min at each selected temperature, a group of samples was transferred into a refrigerator at 4 °C for 48 h to facilitate slow thawing. In 2010 experiments, each temperature was tested in 1 day. Twigs were cooled down at 2 °C per hour and stayed during 3 h at each selected temperature, and then the temperature was raised at a rate of 5 °C per hour until it reached 4 °C. After the freeze-thaw treatments, damage was assessed by the three methods: visual scoring, relative conductivity and fluorometry; but different measurements were performed depending on the organ (subscript N for needles, S for stems and B for buds), and season (spring or autumn) or year (2009, 2010), as detailed in Table 1.

Visual scoring

Samples remained 10 days at the greenhouse (at 20–25 °C and 90 % relative humidity maintained by a fog system) to allow visible signs of freezing damage to develop (Sakai and Larcher 1987). Visual scoring in needles (V N) was done using a scale of 0 to 6 depending on the percentage of the foliar area damaged (0 = 0 %, no damage; 1 = 1–20 %; 2 = 21–40 %; 3 = 41–60 %; 4 = 61–80 %; 5 = 81–99 %; and 6 = 100 % of foliar area damaged). Visual scoring in stems (V S) and visual scoring in buds (V B) were done after slicing them longitudinally to determine the extent of damage on a scale of 0 to 3 (0 = 0 %, no damage; 1 = 1–33 %; 2 = 34–66 %; and 3 = 67–100 %, completely damaged).

Relative conductivity

Eight pieces of 1 cm length from eight randomly chosen needles, two stem pieces of 3 cm and the half of the bud were transferred to test tubes, separately for each organ, with 20 ml of distilled water for needles and stem and with 10 ml for buds. The tubes were capped and were left for 18 h at room temperature. Following other studies (Bower and Aitken 2006; Hannerz et al. 1999), the tubes were placed on a shaker for 1 h at room temperature to speed up and stabilize electrolyte diffusion from damaged tissues, and then the initial conductivity (C 1) was measured with an electrical conductivity meter (HI 2300, Hanna Instruments, S.L.). The tubes were then oven-heated to 85 °C for 90 min to kill any possible surviving cells and ensure complete electrolyte leakage (Guardia et al. 2013; Repo et al. 2008). Samples were left during 18 h at room temperature; they were shaken for 1 h, and the electrical conductivity was measured again to yield the maximum conductivity (C 2). The percent relative conductivity (C) was calculated for each organ (C N, C S and C B for needles, stems and buds, respectively) as C = 100 × C 1/C 2 (Luoranen et al. 2004).

Fluorometry

Measurements were made in needles (F N) and in twig phloem (F S). First, twigs were left to adapt to darkness for at least 30 min. Then, measurements on needles were done at the top, middle and bottom of a group of needles per each tree. Afterwards, a 1-cm basal piece of the twig stem was detached, and the next 2-cm stem section of the stem was collected. The section was cut longitudinally, the xylem was removed and the remaining tissue was divided into two to fit the leaf clip holder. The minimum (F 0) and maximum (F m) chlorophyll fluorescence were measured to calculate the variable chlorophyll fluorescence parameter, F v, as F v = F mF 0 (Genty et al. 1989). The dark-acclimated, maximum potential photosystem II (PSII) efficiency was calculated as F v/F m (Peguero-Pina et al. 2008). High values of F v/F m reveal undamaged tissue, while low values are indicative of freezing damage. To compare outcomes between methods, this variable was expressed as a percentage of the maximum damage as follows:

$$ F=100-\left(100\times \frac{\raisebox{1ex}{$F\mathrm{v}$}\!\left/ \!\raisebox{-1ex}{$F\mathrm{m}$}\right.}{\raisebox{1ex}{$F\mathrm{v}\mathrm{c}$}\!\left/ \!\raisebox{-1ex}{$F\mathrm{m}\mathrm{c}$}\right.}\right) $$

where F is the percentage of damage estimated by the measurement of the fluorescence, F v/F m is the maximum potential PSII efficiency of the sample and F vc/F mc is the maximum potential PSII efficiency of the control. In the first experiment, in spring 2009, fluorometry measurements were done after 1, 3 and 7 days at greenhouse conditions (20 °C and 90 % relative humidity maintained by a fog system). Readings of fluorometry on the 3 days were highly correlated (data not shown), confirming that damages were irreversible (L’Hirondelle et al. 2006); therefore, we decided to use the results on day 3 for this first experiment and to measure fluorometry only after 3 days in the next experiments. Fluorescence measurements were made by means of a pulse amplitude-modulated fluorometer (MINI PAM, Walz, Effeltrich, Germany) equipped with a 2030-B leaf clip holder, featuring an integrated micro-quantum sensor and a thermocouple.

Statistical analysis

Freezing damage was evaluated separately for each organ, method, temperature, season and year.

The population mean (μ) and coefficient of variation (CV) were estimated for 54 individuals (2009 experiments) and for 300 individuals (2010 experiments).

Different mixed models were used for traits assessed each year due to their different datasets, with population and family as random effects and field block as fixed effect. For 2009 dataset, damages were analysed by the following mixed model:

$$ {X}_{ij}=\mu +{P_i}_{+}{\varepsilon}_{ij} $$

where X ij is the damage value of the j th tree (j = 1 to 54), μ is the overall mean, P i is the effect of population (i = 1 to 6) and ε ij is the experimental error.

For 2010 dataset, we used the following mixed model:

$$ {X}_{ijkl}=\mu +{P}_i+F{(P)_j}_{(i)}+{B}_k+{\varepsilon}_{ijkl} $$

where X ijk is the damage value of the l th tree (l = 1 to 4) from the j th family within the i th population and k th block, μ is the overall mean, P i is the effect of the i th population (i = 1 to 6), F(P) j(i) is the effect of the j th family within the i th population (j = 1–9, 1–11, 1–12, 1–13, 1–14 or 1–16), B k is the effect of the field block (k = 1 to 4) and ε ijk is the experimental error.

Variance components were estimated by the restricted maximum likelihood (REML) method, assuming a normal distribution of the random effects. Several covariance structures were tested to model the residuals, and a first-order autoregressive (AR(1)) structure was selected by the Bayesian information criterion (BIC) (Littell et al. 2006). The significance of variance components was tested using log-likelihood ratio tests. We included population as a random effect to make inference at species level and to obtain an unbiased estimate of heritability and genetic population differentiation (Lamy et al. 2011; Wilson 2008).

The BIC was chosen to compare the fit statistics between methods in spring 2009 and between methods in autumn 2009, as models were the same within each season and year (Littell et al. 2006).

Pooled narrow-sense heritability over populations (h 2) was estimated after removing the population effect, since natural selection seems to occur within populations (Lamy et al. 2011):

$$ {h}^2=\frac{\sigma_A^2}{\sigma_{F(P)}^2+{\sigma}_{\varepsilon}^2}=\frac{4\cdot {\sigma}_{F(P)}^2}{\sigma_{F(P)}^2+{\sigma}_{\varepsilon}^2} $$

where σ 2 A is the within-population additive variance, σ 2 F(P) is the family within population variance and σ 2 ε is the residual variance. In our study, σ 2 A was estimated by σ 2 A  = 4σ 2 F(P) assuming that trees from the same family were half-sibs (open-pollinated seeds). The standard error of the heritabilities was estimated as follows (Visscher 1998):

$$ {\mathrm{SE}}_{h^2}=4\times \sqrt{\frac{2{\left(1-\frac{h^2}{4}\right)}^2{\left[\left(1+\left(s-1\right)\frac{h^2}{4}\right)\right]}^2}{s\left(s-1\right)\left(f-1\right)}} $$

where s is the number of offspring per family (4), and f is the number of families (75).

The estimate of genetic differentiation among populations, Q st, was calculated as described by Wright (1951):

$$ {Q}_{\mathrm{st}}=\frac{\sigma_P^2}{\left({\sigma}_P^2+2{\sigma}_A^2\right)} $$

The best linear unbiased predictors (BLUP) were obtained for population random effects. The Spearman correlations were carried out on individual and population mean bases to explore the relationships between methods, organs and seasons. All statistical analyses were done with the SAS System (SAS 9.1, SAS Institute Inc. 2004).

Results

Cold hardiness variation among and within populations

In spring, we did not find significant differences in cold hardiness among populations for any of the methods and plant organs, neither for any sampling year (Tables 2 and 3, Fig. 2). By contrast, we found highly significant differences among populations in autumn, especially in 2010 when we also increased the assessed number of trees and families per population. Populations from central Iberian Peninsula (BAYU and CUEL) showed lower levels of damage than those from coastal Iberian populations (LEIR and SERG) (see Fig. 2 for needles assessed by fluorometry).

Table 2 Population mean (μ), coefficient of variation (CV), variance components of the random effects (σ p 2 and σ e 2 are the variance components for population and error, respectively) and the Bayesian information criterion (BIC) for freezing damage (%) with different methods and plant organs at different temperatures in 2009
Table 3 Population mean (μ); coefficient of variation (CV); variance components (%) of the random effects σ p 2, σ F(p) 2, and σ e 2 (variance components for population, family nested to population and error, respectively); the individual heritability (h 2) ± standard error (s.e.); and the quantitative population differentiation (Q st) for freezing damage (%) with different methods and plant organs at different temperatures in spring and autumn 2010
Fig. 2
figure 2

Freezing damage per population assessed by fluorometry in needles under different temperatures in spring and autumn in 2009 and 2010. BLUP estimate ± standard errors are presented. Different letters indicate significant (p < 0.05) differences among populations. n.s. is not significant

Additive genetic variation within populations was significant in both the spring and autumn 2010 experiments, especially for needles. The estimated pooled narrow-sense heritability (h 2) for cold hardiness was high in the two seasons (h 2 > 0.6). We obtained estimates higher than 1 in 6 out of the 19 estimates (3 by visual scoring in needles, 2 by visual scoring in stems and 1 by the fluorometry method in needles) (Table 3). The genetic differentiation (Q st) among populations was very low in spring (0.02–0.05) for all screening methods and varied between 0.03 and 0.66 in autumn, depending on the organ, method and experimental temperature (Table 3).

Comparison of screening methods

We compared the three methods assessing needles in year 2009. Damage curves showed that, in general, the methods performed similarly, especially fluorometry and relative conductivity. In spring, visual scoring gave the highest freezing damage levels, whereas in autumn, the comparison was not that clear; hence, the ranking depended on the studied temperature (Fig. 3a, b). In general, fluorometry was more stable across temperatures, since the intrapopulation variability in needles, determined by the CV, had the lowest values (for needles, between 87 and 117 in spring 2009 and between 79 and 107 in autumn 2009; Table 2). Regarding the BIC, in spring 2009, relative conductivity had the lowest values, whereas in autumn 2009, the lowest values were with fluorometry (Table 2).

Fig. 3
figure 3

Mean freezing damage (%, n = 54 in 2009, and n = 300 in 2010) in needles at different temperatures and methods. Black square, visual scoring method; black triangle, relative conductivity method; multiplication symbol, fluorometry method. Error bars represent standard errors. Different letters show statistically significant (p < 0.05) differences. Letters were depicted only when methods were statistically different

In year 2010, we compared visual scoring and fluorometry in needles, obtaining more freezing damages with visual scoring in spring and with fluorometry in autumn (Fig. 3c, d). In spring 2010, the lowest BIC values were with fluorometry, while in autumn, it depended on the considered temperature, being fluorometry the method giving the lowest values in most cases (data not shown).

Correlations between methods assessing needles for the temperatures which caused levels of freezing damage among 30–70 % are shown in Table 4. These correlations were always significant for individual data, and considering population means, they were highly significant in year 2010 between visual scoring and fluorometry.

Table 4 Correlation between screening methods for needles and the experimental temperature which produced 30–70 % of damage: visual scoring in needles (V N), relative conductivity in needles (C N) and fluorometry in needles (F N)

We compared methods using the other two organs—stems and buds—and the best correlation was achieved with stems in autumn 2010, at T19, between visual scoring and fluorometry (r = 0.83 (p < 0.05) for population means; data not shown).

Comparison of freezing damages in different plant organs

Freezing damages differed among organs and years in the spring experiments. Needles were the most damaged organ in 2009 (V and C, Table 2); by contrast, in 2010, when the collection was done 1 month later and the twigs were exposed to the selected temperatures for a longer period of time, buds were the most damaged organ, and needles and stems had similar damage levels (Table 3).

In autumn 2009, buds showed slightly higher levels of damage than needles and the stems were the least damaged organ (Table 2). In autumn 2010, buds and needles showed similar damage levels, and stems were again the least damaged organ (Table 3).

Correlation inferences between organs for the temperatures which caused levels of freezing damage among 30–70 % are shown in Table 5. Individual correlations between plant organs were significant and moderate, both in spring and autumn of years 2009 and 2010. At the population level, we found highly significant correlations between needles and stems in autumn of years 2009 and 2010 with relative conductivity and visual scoring methods. However, in spring of both years, correlations between organs for population means were weak.

Table 5 Correlation between organs—needles (N), stems (S) and buds (B)—by screening method and the experimental temperature which produced 30–70 % of damage

Discussion

Genetic variation of cold hardiness

We observed a high variability in additive variance for cold hardiness among populations in maritime pine, in accordance with the previous knowledge on the adaptive genetic structure of this species (Aranda et al. 2010; González-Martínez et al. 2002). Moreover, genetic differentiation for cold hardiness among populations was much higher in autumn than in spring, in consonance with results of various studies in other pine species (Bower and Aitken 2006; Jonsson et al. 1986; Nilsson 2001). The nonsignificant population differentiation in cold hardiness observed in spring could suggest no differences in timing of dormancy release (Morgenstern 1996), but considering the contrasting origins of the assessed populations, we believe that it is hard to work out the precise moment when spring assessment can be more discriminant between populations, since deacclimation occurs rapidly (Bower and Aitken 2006; Kalberer et al. 2006).

The populations evaluated in this study can be grouped according to their sensitivity to freezing damage in autumn. The least cold-hardy populations were the Iberian Atlantic coastal populations (SERG and LEIR), and the most cold-hardy, the Iberian continental populations (BAYU and CUEL). Population ranking was highly consistent when freezing experiments were applied in different years, plant organs and methods, and results are also concordant with previous works (Corcuera et al. 2011). These differences likely reflect adaptation to their different source environments, through directional selection driven by a faster decrease in minimum daily temperatures during autumn in the high-elevation continental areas (O’Neill et al. 2001). Likewise, these differences mimic those encountered among pine species related to their thermal niche (Climent et al. 2009).

The significant variation observed among families within populations suggests that there is a sufficient additive genetic variation for selection by cold hardiness within populations (Salmela et al. 2011). The family component of variation was similar for spring and autumn sampling, suggesting that cold hardiness in maritime pine is under strong genetic control.

The family effect was nonsignificant in 12 out of the 31 combinations of temperature and screening method, and therefore, it was not possible to estimate an additive genetic variance. For the rest of the 19 cases, we got very high values of heritability (higher than 0.6) both in spring and autumn, obtained by considering all the 75 families as belonging to the same population by removing the population effect; for 6 out of the 19 estimates, the value was higher than 1. These results are affected by the level of damage; as to make good statistical inferences, we should take into account only the intermediate levels of damage (30–70 %) (Anekonda and Adams 2000). Considering this range, only one of the cases presented heritability greater than 1 (visual scoring method in needles, at a temperature of −19 °C and collected in autumn 2010), where the standard error of the estimation is high. The subjective estimation in the visual scoring method could have affected this overestimation (indicated by the discrepancy with the more reliable fluorometry method). We can discard an overestimation based on a higher correlation paternity in maritime pine that would result in a combination of half-sib and full-sib seedlings as deduced by sampling in the natural populations of the species (Gaspar et al. 2011), similar to the sampling performed when establishing provenance-progeny tests. Also, we can discard an environmental effect based on individuals of the same family sharing the same plot (e.g. Dutkowski et al. 2002; Lamy et al. 2013), as we have corrected in our model this environmental correlation of the individuals. We cannot discard the effect of the reduced number of individuals per family, which resulted in high standard errors in the estimation, but the high values of the genetic correlation indicate the reliability of our results in accordance with other studies (Aitken and Adams 1997; Persson et al. 2010). Also, the heritability values could be affected by some population effect (Corcuera et al. 2010) that, in our case, can be linked to a greater additive variance in some of the populations (e.g. TAMR, CUEL; see Supplementary Fig. S1). To what extent this greater variability is caused by some evolutionary effects—linked to different demographic and adaptive processes in these populations (Grivet et al. 2011)—or to a confounding effect with the size of the plant or to differences in the families environmental harshening would need a deeper study.

Comparison of screening methods

When working on needles, the three methods compared (visual scoring, relative conductivity and fluorometry) were well correlated between them (L’Hirondelle et al. 2006).

In the literature, visual scores have been found to be strongly correlated (r 2 > 0.90) with damage assessed quantitatively using relative conductivity (Shortt et al. 1996) and fluorometry methods (Binder et al. 1996). In our study, although in some cases visual scores could overestimate the level of damage compared to relative conductivity and fluorometry, all methods were strongly correlated in most cases, especially when the three analyses were done on the same twig (autumn 2009). Also, the sample preparation system used in spring 2009 could have led to odd results, such as the unexpected percentage of damage recorded, lower at temperature T15 than at temperature T12, with fluorometry and relative conductivity.

In general, visual scoring was less precise than relative conductivity and fluorometry methods; it is subjected to further error depending on the skill and experience of the scorer, and it depends on the timing of the examination, since if it is too early, it could not discriminate lethal from nonlethal damage (Deans et al. 1995). If it is too late, as it seems to have occurred in some cases in the present work, it could lead to overestimation of the damage level. To our opinion, these drawbacks overpass the possible advantages of this method, namely its simplicity (Anekonda and Adams 2000).

Relative conductivity improved consistency and yielded statistically better results than visual scoring in the present work, both for detecting differences among populations and for heritability estimates. However, the relative conductivity method was more labour intensive than the other two studied methods (Jensen and Deans 2004).

Accordingly to our results, fluorometry is the most recommendable method, since it was the fastest among the three tested methods (L’Hirondelle et al. 2006; Perks et al. 2004), and it was more objective than the visual scoring and more reliable than both the visual and the conductivity methods for assessing cold hardiness, in consonance with other published works (Binder and Fielder 1996b; Weng and Parker (2008). In our study, we performed the fluorometry 3 days after the freeze-thaw treatment in order to best accommodate the different tests, but this method can provide results within a few hours after the freezing treatment, enabling high-throughput screening of many genotypes in multilocation field trials. In such a way, high-throughput phenotyping platforms (HTPP) could be particularly useful for obtaining detailed measurements of plant characteristics that collectively provide reliable estimates of phenotypic traits.

Effect of plant organ sampling

Needles are the most assessed organ in freezing damage studies in conifers, but the importance of assessing different organs and tissues has been recognized by several authors (Berrang and Steiner 1986; Burr et al. 1990; Sakai and Weiser 1973). In the present study, stems were the most cold-hardy organ in both seasons, fitting in with the results of other authors (Corcuera et al. 2011). The ranking in cold hardiness in spring for buds and needles seems to depend mainly on bud phenology. In year 2009, twigs were collected at the beginning of the spring (81 growing degree days at the 22nd of March) with the buds in a moderately swollen stage but still not burst, and in that occasion, buds were less damaged than needles. In spring 2010, twigs were collected 1 month later (186 growing degree days at the 22nd of April), with buds showing a more advanced phenological stage, and buds were more damaged than needles. In autumn, some authors found that buds of other conifer seedlings were considerably less cold-hardy than needles (Burr et al. 1990; O’Neill et al. 2001), but in our study, cold hardiness between needles and buds was similar, with buds showing slightly less hardiness than needles.

In our analysis of maritime pine, freezing damage in needles and stems was strongly correlated in autumn; besides, correlations among needles and buds were significant in autumn 2010. Correlations between organs in spring were weaker, contrarily to findings by Aitken and Adams (1996, 1997) in Douglas fir. We hypothesize that the weaker correlation between organs in spring is caused by a different response of each organ to the cumulative effects of chilling and heat sum during the dehardening process. On the other hand, in autumn, all organs were developing during the previous months, and therefore, they should be at a similar hardening status, justifying the stronger correlation between them at this season.

The moderately strong correlations of freezing damage levels between organs in autumn indicate that selecting for cold hardiness based on a single organ can be acceptable for minimizing phenotyping costs (O’Neill et al. 2001). Following O’Neill et al. (2001), the choice of the best organ to score should consider (a) ease of measurement, (b) heritability, (c) correlations with other organs and (d) correlations with the economic impact of frost damage in the field. Although occasional needle freezing damage is not expected to significantly impact survival, growth rate or stem form, practical simplicity, heritability and correlations with other organs make needles highly recommendable for evaluating cold hardiness in autumn in maritime pine. Further investigation is needed in spring cold hardiness evaluations, especially in buds.

Conclusions

We have shown the importance of the sampling date and year in the estimation of genetic variation of cold hardiness in maritime pine, as well as the effect of the screening method used and the analysed organ.

These experiments have demonstrated an important genetic variation among P. pinaster populations in autumn. Within-population genetic variation was always significant in both seasons. Thus, there is much potential for improving spring and autumn cold hardiness in maritime pine through selection and breeding.

Based on our results, we recommend the use of the fluorometry method for cold hardiness assessment in P. pinaster. It is the fastest method of the three tested, more objective than visual scoring and as reliable as the visual scoring and relative conductivity methods for predicting field performance. These characteristics allow fluorometry to be used in high-throughput phenotyping systems, improving the precision of selection and being a useful tool in modelling for predicting genotypic performance in different climate scenarios.

We suggest using needles to evaluate cold hardiness in maritime pine in autumn, since needles are the easiest organ for testing, and needle freezing damage levels in autumn were well correlated to those in stem and buds. More investigation is needed in spring cold hardiness evaluations, especially in buds.