Introduction

The correlation between climate and vegetation growth is attracting increasing interest, especially in the light of the fast climate changes that take place both worldwide (Scholze et al. 2006; Costinot et al. 2016) and in Italy (Gentilucci et al. 2019a, 2020). Changes in trends of mean temperature (Tm) took place at the Northern Hemisphere scale during the period 1850–2000 (i.e. years 1901, 1914, 1942, 1963, and 1975) (Matyasovszky 2011); abrupt changes in Tm have occurred also in the years 825, 1296, 1387, 1656, 1749, and 1883 according to Jones and Wigley (2010a, b). However, abrupt changes may somehow differ at the scale of smaller areas. The variability in climate change trends influences the response of tree growth (Andreu et al. 2007; Ponocná et al. 2016; Schippers et al. 2015). At the local level, the impacts of these changes can reflect on tree growth in different ways (Wilson and Elling 2004; Gallucci and Urbinati 2009; Carrer et al. 2010). In mountain environments, the dominant climate signal tends to be more pronounced at high-elevation sites with extreme conditions (Rolland et al. 1998). For example, changes in the relationships between climate variables and tree-ring growth in silver fir have been detected in the Pyrenees (Macias et al. 2006), in the Lower Bavarian region (Wilson and Elling 2004), in southern-eastern France (Lebourgeois and Mérian 2011), and in Italy (Gentilesca and Todaro 2008; D’Aprile et al. 2012) during the twentieth century. At the regional level, master series of Tm may be used to investigate tree-ring growth if similarity in Tm trends between sites is high. Otherwise, low similarity in trends of Tm series among sites over time can reflect on differences in forest tree growth at the site level. This suggests verifying the presence of temperature variations at the single site level to check whether tree growth response differs among sites and/or from the master series of the climate variable. In this study, the presence of long-term trends in seasonal Tm and monthly Tm series in the Tuscan Apennine Alps (Middle Italy) among sites during the twentieth century, their associations, and variations in similarity of both the seasonal and monthly Tm series are investigated. The aim is to understand whether seasonal and monthly Tm trends can be grouped and effectively represented by master series or whether they vary to an extent that addresses to analyse potential effects of Tm changes on tree growth at the site level. In this study, all statistical computations were undertaken by using two different software, and the results were compared to verify if they are similar.

Methods

Regional setting and dataset

The Tuscany Region (Central Italy) shows high variability in climate patterns due to its diversified topography. This region is located on the Tyrrhenian side of Central Italy and stretches over 22,987.04 km2 with a population of 3,733,897 people. Its landscape is 67% hilly with small flat areas covering only 8% of the region, with mountain massifs reaching 25% of the total surface. This study focuses on the mountainous part of the region, the Apennine Alps, where large forests grow. In this area, the traditional climate pattern is normally Mediterranean montane (Maracchi et al. 2005) with relatively mild summer, and rainfall tends to provide moisture enough to minimize or even avoid drought. Winter is cold and frequently snowy, and the permanence of snow varies from weeks to months; climate change modifies this ‘traditional’ pattern. Four montane forest sites near to the tops of the Apennine Alps were selected in places close to the meteorological stations. The uniqueness of those forests, which is due to their historical, religious, cultural, landscape, touristic, and bio-ecological features, explains why the administrations that succeeded over time installed meteorological stations at Abetone (ABE) and La Verna (LAV) at the beginning of the twentieth century and at the end of the nineteenth century at Camaldoli (CAM) and Vallombrosa (VAL) (Fig. 1).

Fig. 1
figure 1

Location of the meteorological stations on tops of the Tuscan Apennine Alps. A is ABE (1340 m a.s.l.), C is CAM (1120 m a.s.l.), L is LAV (1111 m a.s.l.), and V is VAL (955 m a.s.l.)

The data of the four weather stations functional to the analysis were extracted from both the Annals of the former ex-Hydrographic Office of Pisa, Ministero dei Lavori Pubblici (Ministry of Public Works) and the Settore Idrologico e Geologico Regionale (Regional Hydrological and Geological Sector; S.I.R.) of the Tuscan Regional Government. The rainfall and temperature data of CAM were provided by the former Research Centre of Silviculture in Arezzo, Tuscany (today CREA-Research Centre for Forestry and Wood). Data published both in the Annals and the S.I.R. are official and validated; they are regarded as a reliable source of climate data records. The time series of climate data used for statistical analysis start in 1928 (Table 1).

Table 1 Periods of climate data available of the meteorological stations in the Tuscan Apennine Alps

The statistics

At the regional level, time series of Tm may be utilized to investigate tree-ring growth if similarity in Tm trends among sites is high. Otherwise, low similarity in trends of Tm series over time among sites may cause differences in forest tree growth at the site level. Therefore, the analysis of similarity in trends of Tm among sites is essential to assess whether there are differentiated response in tree growth among sites. Thus, the stationarity in similarity of trends in Tm in the study area has been tested. At first, homogeneity of seasonal Tm data was explored in all the Tm series through normality, similarity, and equality of variance tests in the distributions of Tm data by using both parametric and nonparametric tests (Kolmogorov-Smirnov test, Lilliefors test, Shapiro-Wilk test, Anderson-Darling test, Chi-square test, Student’s test, one-way Kruskal-Wallis test, Bartlett’s test, and Levene’s test) to check whether the seasonal Tm series are statistically homogeneous at all the study sites during the twentieth century. Descriptive analysis of seasonal and monthly Tm series was applied to all sites; the average of the maximum Tm, the average of the minimum Tm, Tm (°C), and the standard deviation (SD) of Tm at the seasonal and monthly level were calculated. The patterns and variability in the distribution of the residual error in trends of seasonal Tm were also analysed by linear regression modelling to explore the amount of variability, which corresponds to the real meteorological events, contained in each series of seasonal Tm in the study area. In this case, the seasonal Tm of all study sites was used as a dependent variable with respect to the seasonal Tm series at each site as independent variables. The nonparametric Mann-Kendall trend test (M-K) was used to verify if any trend occurs in the seasonal Tm series (Dawood 2017). This test has been frequently used in climatological research to verify the presence of trends in series of seasonal and monthly climate variables (Crisci et al. 2002; Önöz and Bayazit 2003; Brunetti et al. 2006; Bartolini et al. 2008; Toreti and Desiato 2008; Mavromatis and Stathis 2011). The possible presence of periods with similar length in the seasonal Tm series at all the study sites was tested through spectral (Fourier) analysis. Potential effects of variability in seasonal Tm and/or monthly Tm on tree growth are expected to be similar among sites. In that case, different trends in seasonal Tm or monthly Tm between sites can affect the growth response of trees in different ways. Hence, the level of similarity between seasonal Tm trends across all study sites was estimated by applying both matrix correlation (Pearson, Spearman, and Kendall) (Santos and Leite 2009) and agglomerative hierarchical clustering (AHC) (Bartolini et al. 2008). However, because of the underlying rationale, matrix correlation and AHC are statistics that show average levels of similarity between Tm series, but they do not provide information on its variability over time, which may influence tree growth in ways otherwise unknown. A common statistical technique used to reduce the influence of high-frequency variability in time series is moving averages (or running means), which smooth high-frequency variability and highlight low-frequency variability (or enhance long-term trends). Thus, it was assessed whether the similarity in trends of seasonal and monthly Tm among sites is stationary during the twentieth century by applying Pearson’s correlations of moving averages (Biond 1997; Kristoufek 2014) to all the Tm series. In other words, to verify if and how the similarity in the trends of Tm is stationary during the twentieth century, the moving averages of Pearson’s r were applied between the paired series of monthly Tm, where the length of the ‘windows’ of the moving averages was defined by the results of the spectral (Fourier) analysis.

The theory of the statistics mentioned above is easily accessible online and in manuals, books, and texts. Here, the main statistics applied are described in the next paragraphs. Computations were made by using the programs Statistica (StatSoft) and XL-Stat (Addinsoft).

The Mann-Kendall trend test

The Mann-Kendall test is used to determine whether a time series has a monotonic upward or downward trend. It does not require that the data be normally distributed or linear. It does require that there is no autocorrelation. The null hypothesis for this test is that there is no trend, and the alternative hypothesis is that there is a trend in the two-sided test or that there is an upward trend (or downward trend) in the one-sided test. For the time series x1, ..., xn, the M-K test uses the following statistic:

$$ S={\sum}_{i=1}^{n-1}{\sum}_{j=k+1}^n\mathit{\operatorname{sign}}\left( xj- xi\right) $$

Note that if S > 0, then later observations in the time series tend to be larger than those that appear earlier in the time series, while the reverse is true if S < 0.

The variance of S is given by

\( \mathit{\operatorname{var}}=\left(1/18\right)\Big[n\ \left(n-1\right)\ \left(2n+5\right)\hbox{--} {\sum}_t{f}_t\ \left({f}_t-1\right)\left(2{f}_t+5\right) \) ]

where t varies over the set of tied ranks and ft is the number of times (i.e. frequency) that the rank t appears.

The M-K test uses the following test statistic:

$$ z=\left\{\begin{array}{c}\frac{S-1}{se},\kern1em S>0\\ {}0,\kern3em S=0\\ {}\frac{S+1}{se},\kern1em S<0\end{array}\right. $$

where se is the square root of var. If there is no monotonic trend (the null hypothesis), then for time series with more than 10 elements, zN(0, 1), i.e. z has a standard normal distribution.

In addition to the classical Mann-Kendall test, the seasonal Mann-Kendall was calculated. It takes into account the seasonality of the series, in this case 12 months, in order not to find out if there is a trend in the overall series, but if from one month of January to another, and from one month of February to another, and so on, there is a trend.

The Mann-Kendall statistic for the gth season is calculated as:

$$ S={\sum}_{i=1}^{n-1}{\sum}_{j=i+1}^n\mathit{\operatorname{sign}}\left({X}_{jg}-{X}_{ig}\right),\kern0.75em g=1,2,\dots, m\ (12) $$

According to Hirsch et al. (1982), the seasonal Mann-Kendall statistic, Sˆ, for the entire series is calculated according to:

$$ \hat{S}={\sum}_{g=1}^m{S}_g $$

Matrix correlation

Matrix correlation measures the relationship between two or more variables. Coefficients have a range of −1 to 1, where −1 is the perfect negative correlation while +1 is the perfect positive correlation. Matrix correlation is also called ‘multiple correlation coefficient’. The most common correlation coefficient is Pearson’s correlation coefficient, which compares two interval variables or ratio variables. But there are many others, depending on the type of data you want to correlate. The following table shows some of the common choices for correlation coefficients:

 

Quantitative

Ordinal

Nominal

Quantitative

Pearson

Biserial

Point biserial

Ordinal

Biserial

Spearman rho/tetrachoric

Rank biserial

Nominal

Point biserial

Rank biserial

Phi, Goodman and Kruskal’s lambda

In this study, we used the Wilk’s approach, which consists in driving the stochastic weather generator with serially independent and spatially correlated random numbers. Each set of serially independent random numbers is then used to generate precipitation occurrence and amounts at a particular station. In essence, Wilks approach concentrates on the bi-variate description of the precipitation field at given rain gauges, expressed by the correlation matrices of both precipitation occurrence and amounts at weather stations.

Therefore, let n be the number of precipitation recording stations and m the length of the synthetic precipitation time series to be generated.

Multiple linear regression

Multiple linear regression attempts to model the relationship between two or more explanatory variables and a response variable by fitting a linear equation to observed data. Every value of the independent variable x is associated with a value of the dependent variable y. The population regression line for p explanatory variables x1, x2, ... , xp is defined to be μy 0 + β1x1 + β2x2 + ... + βpxp. This line describes how the mean response μy changes with the explanatory variables. The observed values for y vary about their means μy and are assumed to have the same standard deviation σ. The fitted values b0, b1, ..., bp estimate the parameters 0, β1, ... + βp of the population regression line.

Since the observed values for y vary about their means μy, the multiple regression model includes a term for this variation. In words, the conceptual model is expressed as Data = Fit + Residual, where the fit term represents the expression 0 + β1x1 + β2x2 + ... + βpxp. The Residual term represents the deviations of the observed values y from their means μy, which are normally distributed with mean 0 and variance σ. The notation for the model deviations is ε. Formally, the model for multiple linear regression, given n observations, is yi = 0 + β1xi1 + β2xi2 + ... + βpxip + εi for i = 1,2, … n. In the least-squares model, the best-fitting line for the observed data is calculated by minimizing the sum of the squares of the vertical deviations from each data point to the line (if a point lies on the fitted line exactly, then its vertical deviation is 0). Because the deviations are first squared, then summed, there are no cancellations between positive and negative values. The least-squares estimates b0, b1, ... bp are usually computed by statistical software.

The values fit by the equation b0 + b1xi1 + ... + bpxip are denoted ȳi, and the residuals ei are equal to yi - ȳi, the difference between the observed and fitted values. The sum of the residuals is equal to zero.

The variance σ2 may be estimated by s2 = ∑ei2/ (n - p -1), also known as the mean-squared error (or MSE).The estimate of the standard error s is the square root of the MSE. Linear regression is, without doubt, one of the most frequently used statistical methods. A distinction is usually made between simple regression (with only one explanatory variable) and multiple regression (several explanatory variables) although the overall concept and calculation methods are identical. The linear regression hypotheses are as follows: the errors ei follow the same normal distribution Ν(Ο,s) and are independent. The way the model with this hypothesis added is written means that, within the framework of the linear regression model, the γi are the expression of random variables with mean μι and variance s2, where

$$ {\mu}_i={\beta}_0+{\sum^p}_{j= 1}\ {\beta}_j{x}_{ij} $$

The estimator of the β coefficients and of their covariance matrix is given by:

$$ \hat{\beta}={\left({X}^tX\right)}^{- 1}\ {X}^tY\mathrm{And}\ Var\ \left(\hat{\beta}\right)={\hat{\sigma}}^2\ {\left({X}^tX\right)}^{- 1} $$

Conceptually, TmAV (TmJ + TmF + ….. + TmD) = β0 + βJTmJ + βFTmF + … + βDTmD + ε)

where TmAV is the average of monthly Tms as dependent variable

TmJ + TmF + ….. + TmD are monthly mean temperature of January (J), February (F), …, December (D) as explanatory variables

β0 = y-intercept (constant term)

βJ, F, …, D = slope coefficients for each explanatory variable

ϵ = the model’s error term (also known as the residuals)

Agglomerative hierarchical clustering

Agglomerative hierarchical clustering (AHC) is a classification method which allows to work from the dissimilarities between the objects to be grouped together. A type of dissimilarity can be chosen which is suited to the subject studied and the nature of the data. One of the results is the dendrogram which shows the progressive grouping of the data. It is then possible to gain an idea of a suitable number of classes into which the data can be grouped. The disadvantage of this method is that it is slow. Furthermore, the dendrogram can become unreadable if too much data is used. Agglomerative hierarchical clustering (AHC) is an iterative classification method whose principle is simple. The process starts by calculating the dissimilarity between the N objects. Then two objects which when clustered together minimize a given agglomeration criterion are clustered together thus creating a class comprising these two objects. Then the dissimilarity between this class and the N-2 other objects is calculated using the agglomeration criterion. The two objects or classes of objects whose clustering together minimizes the agglomeration criterion are then clustered together. This process continues until all the objects have been clustered. These successive clustering operations produce a binary clustering tree (dendrogram), whose root is the class that contains all the observations. This dendrogram represents a hierarchy of partitions. It is then possible to choose a partition by truncating the tree at a given level, the level depending upon either user-defined constraints (the user knows how many classes are to be obtained) or more objective criteria. The proximity between two objects is measured by measuring at what point they are similar (similarity) or dissimilar (dissimilarity). The similarity coefficients are co-occurrence, cosine, covariance (n-1), covariance (n), dice coefficient (also known as Sorensen coefficient), general similarity, Gower coefficient, inertia, Jaccard coefficient, Kendall correlation coefficient, Kulczynski coefficient, Ochiai coefficient, Pearson’s correlation coefficient, Pearson Phi, percent agreement, Rogers and Tanimoto coefficient, Sokal and Michener coefficient (or simple matching coefficient), Sokal and Sneath coefficient, and Spearman correlation coefficient. The dissimilarity coefficients are Bhattacharya’s distance, Bray and Curtis’ distance, Canberra’s distance, Chebyshev distance, Chi2 distance, Chi2 metric, chord distance, squared chord distance, dice coefficient, Euclidian distance, geodesic distance, Jaccard coefficient, Kendall dissimilarity, Kulczynski coefficient, Mahalanobis distance, Manhattan distance, Ochiai coefficient, Pearson’s dissimilarity, Pearson’s Phi, general dissimilarity, Rogers and Tanimoto coefficient, Sokal and Michener coefficient, Sokal and Sneath coefficient, Sokal and Sneath coefficient (2), and Spearman dissimilarity. The quality of an agglomerative hierarchical clustering can be assessed by the cophenetic correlation coefficient. It uses a measure of distance called the cophenetic distance that can be estimated on the dendrogram obtained from a classification. The cophenetic distance separating 2 observations is given by the height of the dendrogram at which the 2 observations belong to the same cluster. Finally, the cophenetic correlation is estimated as the Pearson correlation coefficient between the dissimilarity matrix used for the AHC and the cophenetic distance matrix. The closer to 1 the correlation, the better the quality of the clustering. To calculate the dissimilarity between two groups of objects A and B, different strategies are possible, for example, Simple linkage: the dissimilarity between A and B is the dissimilarity between the object of A and the object of B that are the most similar. Agglomeration using simple linkage tends to contract the data space and to flatten the levels of each step in the dendrogram. As the dissimilarity between two elements of A and of B is sufficient to link A and B, this criterion can lead to very long clusters (chaining effect) while they are not homogeneous. Complete linkage: the dissimilarity between A and B is the largest dissimilarity between an object of A and an object of B. Agglomeration using complete linkage tends to dilate the data space and to produce compact clusters. Unweighted pair-group average linkage: the dissimilarity between A and B is the average of the dissimilarities between the objects of A and the objects of B. Agglomeration using unweighted pair-group average linkage is a good compromise between the two preceding criteria and provides a fair representation of the data space properties. Weighted pair-group average linkage: the average dissimilarity between the objects of A and of B is calculated as the sum of the weighted dissimilarities, so that equal weights are assigned to both groups. As with unweighted pair-group average linkage, this criterion provides a fairly good representation of the data space properties. Flexible linkage: this criterion uses a â parameter that varies between −1 and +1; this can generate a family of agglomeration criteria. For â = 0, the criterion is weighted pair-group average linkage. When â is near to 1, chain-like clusters result, but as â decreases and becomes negative, more and more dilatation is obtained. Ward’s method: this method aggregates two groups so that within-group inertia increases as little as possible to keep the clusters homogeneous. This criterion, proposed by Ward Jr (1963), can only be used in cases with quadratic distances, i.e. cases of Euclidian distance and Chi-square distance.

Spectral Fourier analysis

Fourier analysis is a method of defining periodic waveform s in terms of trigonometric functions. Many waveforms consist of energy at a fundamental frequency and also at harmonic frequencies (multiples of the fundamental). The relative proportions of energy in the fundamental and the harmonics determines the shape of the wave. The wave function (usually amplitude, frequency, or phase versus time) can be expressed as of a sum of sine and cosine functions called a Fourier series, uniquely defined by constants known as Fourier coefficients. If these coefficients are represented by a, a 1, a 2, a 3, ..., a n, ... and b 1, b 2, b 3, ..., b n, ..., then the Fourier series F ( x ), where x is an independent variable (usually time), has the following form:

$$ F\ \left(\ x\ \right)=a/2+{a}_1\ \cos\ x+{b}_1\ \sin\ x+{a}_2\ \cos\ 2\ x+{b}_2\ \sin\ 2\ x+\dots +{a}_n\ \cos\ nx+{b}_n\ \sin\ nx+\dots $$

In Fourier analysis, the objective is to calculate coefficients a, a 1, a 2, a 3, ..., a n and b 1, b 2, b 3, ..., b n up to the largest possible value of n . The greater the value of n (i.e. the more terms in the series whose coefficients can be determined), the more accurate is the Fourier-series representation of the waveform. Spectral analysis allows to discover underlying periodicities. First, data are transformed from time domain to frequency domain. The technical details of spectral Fourier analysis go beyond the scope of this study notes, but there are plenty of works in the literature. In brief, the covariance of the time series can be represented by a function known as the spectral density. The spectral density can be estimated using on object known as a periodogram, which is the squared correlation between our time series and sine/cosine waves at the different frequencies spanned by the series. For large numbers, the periodogram is approximately independent for distinct frequencies. This independence can be improved—as can the visual quality and interpretability of the plot—by smoothing the periodogram using a kernel smoother, which is frequently some kind of weighted of moving mean. In this study, spectral analysis is used to identify the presence of regular periodicities in the chronologies of the seasonal Tm series at all the study sites. Cyclicity is a factor of variability that may reduce the possibility to detect trends and is taken into account when trends in series of climate variables are investigated. Spectral (Fourier) analysis was applied to all the seasonal Tm series to verify whether periods with similar length occur among sites given the potential presence of cycles in seasonal Tm. If one or more cycles in the Tm trends were found in the study sites, they would indicate an objective span of years—or more spans—by which to set the moving averages to smooth the Tm series. This is important because it shows whether, when, and to what extent similarity in seasonal and monthly Tm series varies among sites in the study area, especially when climate/tree-growth relationships are investigated. So, smoothing of all Tm series by moving averages was used to enhance medium-long-term variability and reduce the influence of high-frequency variability, which is usually due to factors other than climate.

Pearson’s r correlation of moving means

In statistics, a moving average (rolling average or running average) is a calculation to analyse data points by creating a series of averages of different subsets of the full dataset. It is also called a moving mean (MM)[1] or rolling mean and is a type of finite impulse response filter. Given a series of numbers and a fixed subset size, the first element of the moving average is obtained by taking the average of the initial fixed subset of the number series. Then the subset is modified by ‘shifting forward’, that is, excluding the first number of the series and including the next value in the subset. A moving average is commonly used with time series data to smooth out short-term fluctuations and highlight longer-term trends or cycles. The threshold between short-term and long-term depends on the application, and the parameters of the moving average will be set accordingly. Mathematically, a moving average is a type of convolution, and so it can be viewed as an example of a low-pass filter used in signal processing. Viewed simplistically it can be regarded as smoothing the data. The period selected (lag) depends on the type of movement of interest, such as short, intermediate, or long-term. If the data used are not centred around the mean, a simple moving average lags behind the latest datum point by half the sample width. A simple moving average can also be disproportionately influenced by old datum points dropping out or new data coming in. One characteristic of the simple moving average is that if the data have a periodic fluctuation, then applying a simple moving average of that period will eliminate that variation (the average always containing one complete cycle). However, a perfectly regular cycle is rarely encountered. For a number of applications, it is advantageous to avoid the shifting induced by using only ‘past’ data. In this case, a central moving average can be computed, using data equally spaced on either side of the point in the series where the mean is calculated. Thus, simple moving average (or running means) is a common statistical technique used to reduce the influence of high-frequency variability in time series, which smooth high-frequency variability to highlight low-frequency variability or enhance long-term trends. In this study, moving averages are featured by lags of years that work as temporal windows over chronologies; normally they smooth time series by yearly shifts. Although the lag of moving average can be chosen subjectively, more objective results are expected when the moving average lags over the length of periods or cycles that have been identified objectively. Therefore, moving means of the Pearson’s r between paired series of monthly Tm were used to verify whether and how their similarity is stationary during the twentieth century. For example, simple moving average was calculated over lags of 15 years (7 years + 1 year + 7 years) for each year (central moving average) of the monthly Tm series, which produced a correspondent series of r values each averaged from 7 years before to 7 years after (TA7). The Pearson’s r correlation was computed between the paired series of each monthly TA7 against the other monthly TA7s.

Results

Exploratory analysis

A simple visual approach shows that the time series of annual Tm at all the study sites (Fig. 2) are discontinuous in some periods and can vary in similarity of patterns among sites such as ABE in the 1960s and CAM and VAL from the late 1910s to the early 1940s.

Fig. 2
figure 2

Annual Tm series at all sites in the study area. Gaps are missing data. ABE is blue, CAM is green, LAV is black, and VAL is red

Annual Tm at the study sites shows high inter-annual variability and non-stationary trends that increase from the mid-1950s onward and grow even faster from the 1980s until the late 1990s–early 2000s at all the study sites (Fig. 2). This pattern has been observed at the northern hemisphere scale also (Brunetti et al. 2000a, b; Matyasovszky 2011). At first sight, in the study area, correlation between annual Tm series is not stationary during the twentieth century, and the time series show the presence of changes in trends every of 6–7 years, which could influence tree growth. In other words, trends in annual Tm seem to differ among sites in the study area in some periods, and fast changes in the slope of trends occur at relatively regular intervals. However, annual Tm may be of little help in investigating effects of temperature on tree growth, which develops mainly at the seasonal and monthly levels. Table 2 shows the seasonal averages of temperatures in the meteorological stations investigated; ABE is the coldest site, and VAL is the hottest. The example of Fig. 3 shows that summer Tm and winter Tm decrease from the late 1870s and increase after the mid-1950s. In both seasons, the period from the late nineteenth century to the mid-twentieth century shows some cooling (Table 3).

Table 2 Maximum Tm, minimum Tm, Tm, and standard deviation (SD) of the winter, spring, summer, and autumn temperature series (°C) at all study sites.
Fig. 3
figure 3

Winter Tm (blue line) and summer Tm (green line) in the study area as averages from the four study sites; relative 7 years moving averages (red lines) and long-term means (magenta lines).

Table 3 Monthly Tm (°C), SD, maximum Tm, and minimum Tm at all sites in the study area

Matrix correlation

In this research, Pearson, Spearman, and Kendall matrix correlation tests of seasonal Tm series were applied to check for possible differences in the level of association among the seasonal Tm series. Different types of coefficients were used on the assumption that the similarity of the results in different tests strengthens the validity of the correlations between matrices. Table 4 and Table 5 summarize the results of the correlation tests; the association of ABE’s seasonal Tm with those at CAM, LAV, and VAL (Table 4) is frequently weaker than the associations among CAM, LAV, and VAL (Table 5).

Table 4 Values of Pearson, Spearman, and Kendall coefficients (r, ρ, τ) in the correlation of the seasonal Tm series at ABE with those at CAM, LAV, and VAL
Table 5 Values of Pearson, Spearman, and Kendall coefficients (r, ρ, τ) in the correlation of the seasonal Tm series among CAM, LAV, and VAL

In general, the statistical significance (p-values <0.0001) is good. The level of association between CAM, LAV, and VAL is lower in summer and autumn and higher in winter. In summer, ABE shows the lower level of association with CAM, LAV, and VAL. In spring, CAM, LAV, and VAL show strong association in Tm, and even ABE is more like their Tm. In autumn and spring, the correlation of ABE’s Tm with Tm at CAM, LAV, and VAL is moderately good (Table 4). Overall results show that the association in seasonal Tm series varies with season and site, in particular at ABE. This suggests that the seasonal Tm series are likely to have different patterns or trends that vary with site and/or time in some periods during the twentieth century. At the level of monthly Tm series, matrix correlation shows that the level of association between the series varies with month and site from moderately good to high (Table 6). In all months, the correlation coefficients of the monthly Tm series between ABE and CAM, LAV, and VAL are lower than the correlation coefficients between CAM, LAV, and VAL. Similarity in the monthly Tm series among sites is lower in summer and higher in winter except December at ABE, which shows lower correlation with CAM-LAV-VAL. The interval between the values of the correlation coefficients is greatest in June, July, and September. For example, the correlation coefficients (r) between CAM, LAV, and VAL vary from 0.65 to 0.94 in June and between 0.84 and 0.87 in December. All this variability may derive from differences in trends of monthly Tm that could influence the response in tree growth in different ways.

Table 6 Upper and lower values of Pearson’s r in the correlations of the paired series of monthly Tm between ABE and CAM-LAV-VAL and between CAM, LAV, and VAL

Agglomerative hierarchical clustering

Similarity-based grouping of meteorological stations in seasonal and monthly Tm patterns was assessed by agglomerative hierarchical clustering (AHC). The AHC results confirm the presence of a moderately higher similarity in the seasonal Tm series to CAM, LAV and VAL, being the seasonal Tm of ABE a relatively separate cluster from CAM, LAV, and VAL in all seasons (Table 7).

Table 7 Grouping of monthly Tm series in the study area as shown by AHC. Decrease in the order of cluster corresponds to reduction in similarity among sites. Coloration indicates similar level of hierarchical clustering

It can be noted that:

  • CAM and LAV are the most similar only in 3 months (Apr, Jun, and Jul) out of twelve despite they have similar elevation and are at short distance (ca. 13 km).

  • CAM and VAL are the less ‘statistically’ distant in 6 months out of 12.

  • CAM’s similarity is intermediate between ABE and LAV-VAL in 3 months (Jan, Feb, and Sep) out of twelve.

Thus, clustering of monthly Tm varies somehow with month among sites in the study area. This phenomenon suggests the presence of differences in the variability of trends during the twentieth century. Therefore, verifying if similarity in trends of monthly Tm among sites is stationary during time is a necessary and important step to ascertain as it can reflect on tree growth.

Linear regression modelling

In this context, linear regression analysis of seasonal Tm averaged from all sites in the study area (dependent variable) against the seasonal Tm series of each site (independent variables) has contributed a further insight into data distribution. The patterns of residual errors have shown the presence of different types of data distribution among sites, where the regression models fit well and are statistically significant. Since the residual errors and their distribution represent the real seasonal Tm events that influence climate/growth relationships, they may retain amounts of variability that could reflect on tree growth by ways that are not shown by the regression model. Therefore, the values and possibly trends modelled by linear regression may not show differences in the relationships between climate and tree growth between forest-level sites, to the extent that this type of information is held back by residual errors.

The Mann-Kendall trend tests

The presence of trends in the seasonal Tm series was tested in all sites through the Mann-Kendall (M-K) trend test; in general, no trends were detected. For example, Table 8 shows the results of M-K relative to summer Tm and autumn Tm at ABE and VAL.

Table 8 Results of M-K test of summer Tm and autumn Tm at ABE and VAL

The statistical normality of the Tm series was tested; however, the results of different types of tests are controversial. The M-K test, which is a non-parametric test, was used to assess the presence of trends in the historical series of monthly Tm. The series tested start in the year 1928 at all sites except LAV, which shows a relatively short series of data (Table 1); their Tm, SD, maximum Tm, and minimum Tm are shown in Table 2. As the Tm series at CAM and VAL go back respectively to the years 1885 and 1872 (long-term series), these data have been used to verify whether any change in trends of monthly Tm occur before the late 1920s by applying the M-K test to all the monthly Tm series at the study sites. The M-K test does not detect any trend in autumn Tm at ABE (Table 8). However, autumn Tm is 7.0°C in the period 1928–1960 and 7.3°C in the period 1971–1998; normally, autumn Tm (Table 2) is greater than 6.2°C and less than 9.2°C except in the 1960s, when a warmer period (10.1°C) occurs. These results suggest that variability in monthly Tm in different months at different elevation may find trees in different ecophysiological stages, and therefore, it can have different influence on tree growth among sites.

Spectral (Fourier) analysis

The presence of recurrences in the historical series was analysed by performing periodograms and spectral density of the seasonal Tm chronologies. The analysis has shown the presence of periods of (2–4), 6–7, 9–11, 14–16, 18–19, 22–23, 27–28, 34–36, 45, 55, 67–70, and 110–112 years of length in all the study sites. In particular, the periods (2–3), 6–7, 22–23, 34–36, 67–70, and 110 years result the most frequent in the seasonal Tm series (Table 9). It was also noted that the harmonic mean of all the periodogram values varies little among sites in all seasons, and the 7-year period, which derives from a harmonic mean that varies between 5.5 and 7.5, is the most frequent interval between all of them. This suggests that the 7-year period approximates the harmonic mean of the peak periodicities over the entire series of seasonal Tm in the study area, which is a sub-dividend of longer periodicities that occur with regularity.

Table 9 Results of the spectral analysis relating to the summer Tm series at CAM and the winter Tm series at VAL

Moving averages

Graphs were elaborated by calculating the moving average over 7 years (TA7) of all seasons to first identify possible cyclical periods. For example, Fig. 4 shows negative trends in winter Tm at CAM and VAL from the late 1910s to the late 1940s; in this period, winter Tm is −0.12 °C at CAM (SD is 1.4) and 2.4 °C at VAL (SD is 1.1). The Tm data start in 1928 at ABE; winter Tm is −1.2°C (SD is 1.4) in the period 1928–1940; and −0.6 °C (SD is 1.1) at ABE, 1.3 °C at CAM (SD is 1.3), and 2.4 °C at VAL (SD is 1.3) in the period 1980–1998. The time series at LAV is short, and no long-term trend could be detected, but it can be noted that the LAV time series is close to CAM.

Fig. 4
figure 4

Winter TA7 at all sites during the twentieth century at all sites. Blue is ABE, green is CAM, black is LAV, and red is VAL. Gaps in the 1940s are missing data.

The trend in winter TA7 at ABE differs from those at CAM, LAV, and VAL, which are more similar each to the other (Fig. 4). In spring, Tm at ABE (Fig. 5) is normally higher than 2.0°C and smaller than 6.2°C except in the 1960s (spring Tm is 7.1°C) and in the year 1930 (spring Tm is 1.6°C). As in winter, spring Tm in the 1960s is higher than in all the previous and following decades observed, it decreases during the 1970s and increases after the mid-1980s. In the late 1970s–mid 1980s, a trough in spring Tm occurs at ABE while a peak emerges in the same years at the other study sites. The TA7 series in spring (Fig. 5) show trends that differ from those in winter TA7 (Fig. 4). For example, spring TA7 shows a peak at all the study sites in the 1960s, while winter TA7 shows troughs in the same period at the same sites except ABE, which peaks instead. Spring Tm from the early 1910s to the mid-1920s increases at VAL and decreases at CAM in spring (Fig. 5) although these trends seem more similar in winter.

Fig. 5
figure 5

Spring TA7 at all sites during the twentieth century at all sites. Blue is ABE, green is CAM, black is LAV, and red is VAL. Gaps in the 1940s are missing data

At ABE, summer Tm is never greater than 12°C or smaller than 17.3°C in the span of time observed. In winter Tm and in spring Tm, the 1960s is the warmest period in summer at ABE; before the 1960s, summer Tm at ABE is lower (14.0°C) than after the 1960s (15.3°C). Indeed it decreases going back from the 1960s and increases after the late 1970s. Although the M-K test does not show trends in summer Tm at ABE (Table 8), the pronounced peak during the 1960s is likely to alter or confound the result.

At VAL, summer Tm decreases from the late 1800s to the mid-1910s, while a warm period takes place from the late 1910s to the early 1930s (Fig. 6). Summer Tm at VAL is normally greater than 15°C and less than 19°C but reaches temperature greater than 20.5°C in some years (i.e. 1928, 1952, and 2003). In the period 1990–2006, summer Tm is 18.3°C, similar to summer Tm in the 1920s–1930s; however, this value is mainly due to the summer Tm of the year 2003 (21.8°C). Although summer Tm at VAL increases from the early 1980s and 5 years in the period 1995–2006 show summer Tm higher than upper SD (18.6°C), there had been other periods with summer Tm higher than upper SD. For example, 5 years with summer Tm greater than 18.6°C were noted in the period 1872–1881 and five years in the decade 1921–1930. Warmer summer Tm occurs frequently before the 1990s–2000s, such as in the years 1928 (20.5°C), 1947 (20.0°C), and 1952 (20.0°C). However, results of the M-K test relative to the presence of trends in the summer Tm series indicate the absence of trends at VAL (Table 8). In summer, CAM shows a trend in Tm that decreases from the end of the 1800s to the late 1920s and turns into positive that starts in the 1930s. The presence of a trend is confirmed by the M-K (p-value <0.004) test. Summer Tm at CAM shows a hot period in the 1960s (16.5 °C), but this is not the hottest period in the series like at ABE. For example, summer Tm at CAM is 16.7°C in the 1940s and 17.3°C in the 1980s. At LAV, summer Tm increases from the mid-1970s after a moderate trough; however, the data series at LAV is short to detect long-term trends. The M-K test suggests that trends in summer Tm are not present at LAV, once the trends in the summer Tm series are smoothed by 7-year moving averages, CAM and VAL show similar trends before the late 1920s. As the TA7 trends at ABE and VAL miss data from the 1940s to the early 1950s, comparison with the other study sites could not be done over this period. In the mid-1960s, a pronounced hot period occurs at ABE and, quite at a lower extent, at CAM. From the late 1960s to the 1970s, summer TA7 at ABE is declining, at CAM it decreases at first and becomes positive in the early 1970s, while at LAV and VAL, it is slightly rising or flat and becomes negative in the second half of the 1970s. However, summer TA7 trends become positive at all study sites after the early 1980s (Fig. 6). Trends in autumn TA7 are positive at ABE and CAM, and negative at LAV and VAL from the 1950s to the mid-1970s (Fig. 7). A peak period is observed in the 1960s at all sites. It is pronounced at ABE, moderate at CAM, and small at LAV and VAL. After the mid-1980s, trends in autumn TA7 are negative also at ABE and CAM, while VAL’s trend is only slightly negative. Trends in TA7 at LAV and VAL are similar from the 1960s to the early 1980s. In autumn, Tm at CAM (Fig. 7) is 9.1°C from the mid-1880s to the 1920s; it is slightly lower (9.0°C) than the long-term mean (9.3°C) from the early 1920s to the early 1950s and increases moderately after the early 1950s (9.7°C in the period 1953–1996). Autumn Tm normally ranges from greater than 7 to less than 11.7°C as lower and upper values, respectively, instead a warmer period takes place in the 1960s at CAM but it is not as pronounced as at ABE; CAM’s autumn Tm (10.3°C) in the 1960s is higher than in the previous periods, followed by a decrease. In the 1980s, autumn Tm is 10.5°C. The M-K (p-value <0.001) test suggests the presence of a trend in autumn Tm at CAM. The short time series (33 years) at LAV shows that autumn Tm is greater than 8.2°C and less than 12.1°C, while in the 1960s (10.8°C) are warmer than the following years (10.1°C from the year 1971 to the year 1989). At CAM and VAL, the 7-year moving averages trends of the fall Tm series are relatively similar before the mid-1930s and from the late 1950s to late 1980s (Fig. 7). At VAL, autumn Tm is normally greater than 9.0°C and less than 12.5°C except in few years; in fact the long-term mean is 10.6°C (1872–2006). The warmest autumnal period occurs in the 1920s (11.4°C), while the 1970s (9.8°C) and the period 1905–1915 (10.1°C) are the coldest; autumn is smaller than 9.0°C in the years 1884, 1912, 1915, and 1974, while it is higher than 12.5°C in the years 1886, 1898, and 1932.

Fig. 6
figure 6

Summer TA7 during the twentieth century at all study sites. Blue is ABE, green is CAM, black is LAV, and red is VAL. Gaps in the 1940s are missing data.

Fig. 7
figure 7

Autumn TA7 at all sites. Blue is ABE, green is CAM, black is LAV, and red is VAL. Gaps in the 1940s are missing data

The sharp increase and decrease observed in autumn Tm at ABE respectively in the early 1960s and in the 1970s rise the question whether some artificial change such as temporary reallocation of the meteorological station and/or change of the type of device or else, may have occurred. However, it can be noted that:

  • Autumn Tm peaks in the same years also in CAM, although it is not as pronounced as in ABE, and slightly in VAL. This is likely to indicate that the warmer period in the 1960s is not a local feature at ABE.

  • Two series of meteorological data of ABE (Boscolungo) are shown in the Annals from the early 1920s to the late 1930s; one is Boscolungo Town, and the other is Boscolungo Forest. Values of monthly Tm in the ‘Town’ series are normally 1–3°C higher than those in the ‘Forest’ series. If a change in the temperature data was caused artificially (e.g. displacement of a weather station), the data series would show it.

Pearson’s r of moving averages

Pearson’s r correlation coefficients of 7-year moving averages between paired series of monthly Tm was finally used to test if the associations in monthly Tm series are stationary during the twentieth century among sites in the study area. As shown in the examples of Fig. 8, similarity in trends of monthly Tm is non-stationary during the 20th in the Tuscan Apennine Alps; variations in monthly Tm do not necessarily occur in the same months at all sites. In particular, the analysis reveals that there are periods when trends in monthly Tm show dissimilarity among the study sites. This variability can reflect on tree growth and unlikely can be highlighted by master series of climate variables.

Fig. 8
figure 8

Variability of the Pearson’s r in the correlation of 7-year moving averages of monthly Tm in February (a), August (b), and October (c) between paired sites in the study area during the twentieth century. ABE-CAM is shown by the blue line, ABE-VAL is magenta, ABE-LAV is green, CAM-VAL is red, LAV-VAL is brown, and CAM-LAV is dark grey.

Summary of results

Descriptive analysis of monthly Tm shows differences in trends that may influence the response of tree growth among sites in the study area. The AHC of monthly Tm series shows that the associations of monthly MT series among sites vary with month and/or site. However, this type of test does not show if and how similarity in trends varies over time, which is a key point to understand if relationships with tree growth are stationary over time and/or among sites. It has been recalled that changes in trends or differences in the seasonal Tm patterns among sites that may occur in some periods are likely to influence tree growth in unpredicted ways among sites. This phenomenon, which takes place even between sites at short distance and similar elevation such as CAM and LAV, highlights that master series of seasonal Tm can be of little utility to analyse climate/tree-ring relationships in case if the objective of the analysis is to verify the response of tree growth to variations in Tm at the forest site level, at least in the Tuscan Apennine Alps. Still, the seasonal level may not show or highlight what occurs at the monthly level that can influence the tree growth response. Results of the analysis of trends in seasonal Tm and monthly Tm during the twentieth century at the study sites in the Tuscan Apennine Alps are summarized below:

• Distributions of data are not homogenous among sites.

• Standard deviation of Tm is pronounced in all seasons and months and varies irregularly over time.

• Inter-annual variability of Tm is high in all seasons and months.

• Some peaks and troughs occur in different periods among sites and more markedly at ABE in comparison to the other study sites. The ABE seasonal and monthly Tm series form a cluster different from that formed by CAM, LAV, and VAL.

• Clustering within the group CAM-LAV-VAL varies also with season. The similarity in the seasonal Tm series of the CAM-LAV-VAL cluster is lower in summer; this season VAL shows a marked dissimilarity between CAM and LAV.

• The similarity in seasonal and monthly Tm trends is not stationary between study sites during the twentieth century in the Tuscan Apennines, even when the sites are short distances and have similar elevations such as CAM and LAV.

• The similarity varies from high to small or even negative values of the correlation coefficients over time. In other words, trends in monthly Tm can differ greatly and even be opposite over time between sites.

Although Tm at the regional level can show some prevailing tendency (i.e. warming), it can be noted that:

• Regional or higher scale trends in Tm smooth variability at the local level, which may have relevant effects on tree growth and health, however.

• Master series of seasonal Tm do not show variations that occur at the monthly level, which are important to identify possible effects in tree growth instead.

• In the Tuscan Apennines, non-stationary similarity in trends of seasonal Tm and monthly Tm among sites is present even between sites at short distance.

Discussion

At the annual scale, a general increase of temperature trend is observed in the mid-1950s that accelerate even faster from the 1980s. Summer Tm and especially winter MT decrease from the late 1870s and turn into an increase from the mid-1950s. This resembles the trends observed during the twentieth century in the Mediterranean area and Italy (Orlandini et al. 2009; Brunetti et al. 2000a, b; Camuffo et al. 2010; Gentilucci et al. 2019a). In the study area, seasonal trends in Tm confirm the occurrence of warming detected throughout South Europe and in the Mediterranean area and Italy (Toreti and Desiato 2008; Gentilucci et al. 2019b). However, months with increasing trends in Tm can differ among sites. For example, in the study area, the clustering of monthly Tm trends by season shows that winter warming occurs only at CAM, although on a regional and higher scale, winter is indicated as the most warming season. Reduction in both the maximum annual Tm and the minimum annual Tm was also observed in some other areas of Tuscany (Maracchi et al. 1998); the period from the late nineteenth century to the mid-twentieth century is cooling in the study area. Increases of summer Tm at all the study sites are also observed at the Tuscan regional scale (Bartolini et al. 2008). In the study area, warming occurs in spring and autumn at CAM and slightly at VAL during the period observed. This variability supports the thesis that differences in trends of seasonal and monthly Tm can either stimulate or reduce tree growth among sites and/or at different elevation. This may reflect on strategies and/or planning to adapt forest management and biodiversity conservation to changing climate conditions (Bolte et al. 2009; Borghetti et al. 2012; Jandl et al. 2019). For example, seasonal variability in Tm suggests that the length of summer and/or winter may extend or shorten over time; this can affect the length of the growing season and/or the growth rate of trees in different ways between sites and/or at different altitudes. In some tree species, a higher summer Tm can promote tree growth at higher elevations and reduce it at lower elevations or even cause a depressing effect on growth at both higher and lower elevations.

Conclusions

This study helps to understand the temperature variability in the Tuscan Apennines (Bartolini et al. 2008; Toreti and Desiato 2008; Scorzini and Leopardi 2019) and provides a more peculiar approach to the evaluation of climate-growth relationships of trees for adaptive (sustainable) forest planning and management. Moreover, the results promote the need for more detailed information on local and regional climate patterns of climate change (Solomon et al. 2007; IPCC 2007, 2014, 2018) and on temperature trends in seasons other than summer in Tuscany (Centro Italy) (Bartolini et al. 2008). In fact, the monthly level is a key factor in gaining a more accurate understanding of the actual influence of climate variability and change on tree growth. A relevant result of this study is the identification of a recurring average period of 6–7 years in the Tm series, for which the smoothing trends of the seasonal and monthly Tm over time tend to coincide with a periodicity that results to be a dividend of various longer periodicities. At the preliminary level, the results seem to support what Dutilleul and Till (2011) observed on Cedrus atlantica in Morocco. In this light, the analysis of climate-growth relationships of trees is likely to provide more accurate results that can be used to improve forest planning and management. For example, the identification of a recurring period makes it possible to better highlight the trends of the Tm series by reducing the influences of global circulation or solar activity and to better identify periods where the growth trend is positive or negative. Indeed, the observed 6–7-year period can divide both the lengths of the Tm periods in the study area and the AMO, Hale, NAO, and LNC cycles, where the Atlantic multidecadal oscillation (AMO) has an approximate cycle of 70 years, the Hale cycle is 22 years, the North Atlantic Oscillation (NAO) shows a period of 18–19 years, and the lunar nodal cycle (LNC) is 18.6 years (Yndestad 2006; Dutilleul and Till 2011). Therefore, this analysis contributes to understanding the weight of local variability in relation to the influence of Tm on the responses of forest growth to climate variability in the Mediterranean area. Actually the relationships between climate and forests are a key point to develop adaptive sustainable forest management. For example, it contributes to identify periods when silvicultural interventions can harvest more or less wood mass on the basis of the rate of growth ongoing (D’Aprile et al. 2015). Another relevant use of this approach in forest planning and management is the possibility to adapt the objectives and methods of management in accordance with trends of climate variables expected in the medium term, which provides relevant help in keeping management within the carrying capacity of the forest ecosystem.