Introduction

Nowadays, there is a high consensus on the possible increase of the average temperature of the planet in upcoming decades. However, such a trend is less evident in precipitation. A probable decline in precipitation and an increase in the frequency and magnitude of droughts have been predicted for the Mediterranean basin even though the uncertainty in predictions about precipitation trends is still high (IPCC 2013).

Several studies analyzing the instrumental data from Spain suggest that drought severity has increased over the past five decades (Vicente-Serrano et al. 2011, 2014). However, the relative short observational data make it difficult to determine changes in drought severity (Redmond 2002) resulting in low confidence in drought trends worldwide (Seneviratne et al. 2012). Despite of major recent efforts, knowledge of droughts affecting the Iberian Peninsula is severely limited due to the fact that most of the historical instrumental climatic records do not begin until the 1950s (Gonzalez-Hidalgo et al. 2011).

The temperature and precipitation dynamics of central and northern Europe are particularly well known due to the paleoclimatic reconstructions of the last millennium (Büntgen et al. 2005; Pauling et al. 2006; Büntgen et al. 2011; Esper et al. 2012 for N-Europe). In southern Europe, recent efforts have increased knowledge on temperature dynamics particularly in the western Mediterranean region (Creus and Puigdefábregas 1982; Büntgen et al. 2008; Dorado-Liñan et al. 2014). However, paleoclimatic reconstructions of precipitation and drought variability are less frequent (see Rodrigo et al. 1999 and Esper et al. 2014 as exceptions).

In Spain, precipitation has recently been considered as the most important climate element directly affecting human society (water availability, human consumption, political and social stability), economic activities (location of dams, water planning, irrigation, demand by industry), and natural systems (water stress, fires, erosion) (de Castro et al. 2005; Randall et al. 2007 in IPCC 2007). Improving knowledge on how drought frequency and intensity have changed over the last few centuries is, therefore, critically important to evaluate the recent trends, validate future scenarios, and adapt to projected climate change (IPCC 2007, 2013).

Drought is a complex phenomenon, which has led to several indices being defined for its study. The Palmer index of drought severity (PDSI) is based on recent precipitation and temperature in terms of a supply and demand model of soil moisture (Palmer 1965). Some of the problems of the PDSI were solved by development of the self-calibrated PDSI (sc-PDSI) (Wells et al. 2004), which makes it spatially comparable and reports extreme wet and dry events. However, the main problem of the PDSI is related to its fixed scale (between 9 and 12 months), and an autoregressive characteristics whereby index values are affected by the conditions up to 4 years in the past (Guttman 1998). Three reconstructions of the PDSI have been developed for Europe (Esper et al. 2007; Nicault et al. 2008; Esper et al. 2014). The standardized precipitation index (SPI) is based on monthly precipitation data and the cumulative probability of a given rainfall event occurring at a station (McKee et al. 1993). This inbuilt memory supports estimates of the cumulative effects of regional drought. Two SPI reconstructions have been developed for Europe (in Turkey, Touchan et al. 2005; in Romania, Levanic et al. 2013). Finally, there is the standardized precipitation and evapotranspiration index (SPEI), which is based on the same terms as the SPI but includes a temperature component in terms of evapotranspiration in the equation (Vicente-Serrano et al. 2010). Both SPI and SPEI have been recognized as effective indices for identifying dry and pluvial periods in the Mediterranean pine forests (Pasho et al. 2011; Camarero et al. 2013).

The aim of this work is to develop for the first time a dendroclimatic reconstruction of the SPI index using a dense multi-species dendrochronological network from the Iberian Range. Through the SPI reconstruction, we identify and discuss the main drought events occurring in the study area over the last 318 years.

Materials and methods

Site description

We compiled a tree ring network from 21 different locations in the eastern Iberian Range of the Iberian Peninsula (Table 1). The Iberian Range is a southeast oriented mountain system between the Ebro depression and the central plateau. It includes a sequence of hills and depressions with diverse lithology, often isolated, or linked through plateaus. Most of the 21 sites used in this study are from higher-elevation sites, since these are the areas where forests have been least exploited and the oldest trees are located (Fig. 1). The altitude of the sampling sites ranges from 1100 to 2000 meters above sea level (masl), with a mean of 1600 m. The higher-elevation trees belong to the oro-Mediterranean bioclimatic belt characterized by large seasonal temperature fluctuations including frequent frosts in winter (ocassionally reaching −20 °C) and heat exceeding 30 °C during the dry summer period (Fig. 2, data from the Spanish Meteorological Agency). The mean annual precipitation is 520 mm and reaches a maximum during spring and a minimum during winter. It is also worth noting that there are frequent storms during the months of June, July, and August.

Table 1 Tree ring site characteristics
Fig. 1
figure 1

Map showing the tree ring study sites (circles) and city of Teruel in the Iberian Range

Fig. 2
figure 2

Climate data. a Climate diagram of the study area made from 30 meteorogical stations for the period 1950–2010. b Annual temperature, and c precipitation from 1950–2010

Forest composition in the Iberian Range is dominated by pinaceaes, and their wide distribution is determined by plasticity and adaptation to various bioclimatic conditions. Therefore, at lower altitudes, Pinus halepensis, associated with typical Mediterranean conditions, are found, while with increasing altitude, Pinus nigra and Pinus sylvestris represent the dominant tree species. The highest elevations are covered by Pinus uncinata reaching its southern distribution limits in the Iberian Range.

Tree ring chronology development

Overall, a total of 336 samples and 45.648 growth rings from five pine species (P. sylvestris, P. uncinata, P. nigra, P. halepensis, and Pinus pinaster) were used for the development of a regional ring width chronology. The samples originate from three different sources (see Table 1 for an overview) constituting the largest dendrochronological database for the Iberian Range. The most recent data including 184 samples from nine sites was collected during field campaigns in 2012 and 2013, i.e., the outermost rings are 2011 and 2012, respectively. Old dominant and co-dominant trees with healthy trunks with no sign of human intervention (e.g., resin collection) or geomorphological processes were selected at each site. We extracted two core samples from each tree using at breast height (1.3 m) perpendicular to the slope to prevent compression wood. On steep slopes (>15°), the samples were taken at a greater height up to 2 m above ground. Cores were air-dried and glued onto wooden holders and subsequently sanded to ease growth ring identification (Stokes and Smiley 1968). The cores were scanned and synchronized using the CoRecorder software (Larsson 2012) (Cybis Dendrochronology 2014) to identify the position and exact dating of each ring, and the width measured, at 0.01-mm precision, using a LINTAB table (Rinn 2005). The cross-dating among trees was quality-controlled using the COFECHA program (Holmes 1983).

An additional set of 101 samples from eight sites was downloaded from the International Tree Ring Data Bank (ITRDB, http://www.ncdc.noaa.gov/data-access/paleoclimatology-data/datasets/tree-ring). These data were developed by K. Richter and collaborators in the late 1970s and early 1980s, i.e., the outermost rings range from 1977 to 1984. Another four sites including 51 samples were provided by the project CLI96-1862, i.e., the outermost rings range from 1988 to 1993 (Creus et al. 1992; Genova et al. 1993; Manrique and Fdez. Cancio 2000; Saz 2003).

In order to assess coherence among sampling sites and support averaging tree ring data at the regional scale, a matrix correlation was developed for the common period (1940–1977) and for the full-length period of each site. Then, a mean regional chronology was developed.

For the preservation of interannual to multi-decadal scale variability and in order to eliminate the biological age trend in the radial growth, the 336 individual tree ring width series were standardized using the dplR standardization package (Bunn 2008), executed in Rstudio (R Development Core Team 2014). Each ring width series was fitted with a cubic spline with a 50 % frequency response cutoff at 67 % of the series length (Cook et al. 1990). A first-order autoregressive model of the residuals and a bi-weight robust estimation of the mean (Cook and Peters 1997) were applied to assemble the regional residual chronology (TRIres) for studying the influence of climate on tree growth and reconstructing the main driver at interannual to multi-decadal frequencies.

Chronology confidence was assessed using the express population signal (EPS), and the interseries correlation (Rbar) (Wigley et al. 1984). EPS provides an estimate of how closely a mean chronology based on a finite number of trees matches its hypothetically perfect chronology (Cook et al. 1990). Values equal to or above 0.85 are considered to ensure that a chronology is suitable for climate reconstruction (Wigley et al. 1984). Rbar estimates the common variance among ring width series averaged in a chronology.

Climatic data, calibration, and climate reconstruction

Monthly temperature and precipitation instrumental data (provided by AEMET) from 30 stations within a maximum distance of 50 km, and spanning 1951–2010, were used to calibrate the tree ring data. In addition, gridded instrumental data from CRU TS v.3.22, (1901–2012 period, 0.5° × 0.5° resolution) were used for comparative purposes (Harris et al. 2014). Due to the size of the study area, the average of the three closest grid points was used to construct a regional time series.

Using both station and gridded climate data, we calculated the standardize precipitation index (SPI) and the standardize precipitation-evapotranspiration index (SPEI). The SPI (McKee et al. 1993) is calculated using monthly precipitation as input data while the SPEI (Vicente-Serrano et al. 2010) uses the monthly difference between precipitation and the potential evapotranspiration. Both indexes were calculated at different time scales from 1 to 24 months for the common period 1951–2010. For calibration, we used the residual chronology (TRIres) to minimize biological memory effects in tree ring width (TRW). Therefore, the high-frequency variability contained in the residual series is expected to be related only with year-to-year climate variability. TRIres was calibrated against the SPI and SPEI for 1 to 24 months using the station-based and gridded climate data.

To evaluate the accuracy of the model used for climate reconstruction, the dataset was split into two equally long periods for calibration and verification (Fritts 1976). These periods were 1951–1980 and 1981–2010. The procedure was then repeated with reversed periods. The consistency of the linear model was tested using the Pearson’s correlation coefficient (r), reduction of error (RE), mean square error (MSE), and sign test. RE provides a highly sensitive measure of the reliability of a reconstruction (Akkemik et al. 2005); it ranges from +1, indicating perfect agreement, to minus infinity. Commonly, positive RE values are interpreted as there is some skill in the reconstruction (Fritts et al. 1990). The MSE is an estimate of the difference between the modeled and the measured.

Sign test compares the number of agreeing and disagreeing interval trends, from year to year, between the observed and reconstructed series (Fritts et al. 1990; Cufar et al. 2008).

A linear regression model was used to transfer the residual chronology into a drought index. To identify and evaluate extreme events, we adopted an approach detailed in Akkemik et al. (2005), in which extreme deviations are identified considering standard deviation thresholds. We normalized the reconstruction time series and define extreme dry and wet years considering ≥±1.76 standard deviations (Türkes 1996; Akkemik et al. 2005). An 11-year running mean was applied to identify periodic changes in the reconstruction. The magnitude and spatial extent of the climate signal were assessed considering the gridded CRU TS v.3.22 over the Iberian Peninsula.

Results

Chronology

The correlation matrix with all the sites (Fig. 3) shows the high correlation between sites and species. The high intersite correlations reaching 0.46 from 1940 to 1977 and 0.42 over the full overlapping period justified the development of a 511-year chronology for the eastern Iberian Range covering the 1503–2013 time period (Fig. 4). The highest correlation is found between the highest elevation sites for both P. uncinata (PRU and 495) and P. sylvestris (PRS and PER) whereas the lowest correlation is found between the lowest P. halepensis site (PDH) and the highest P. sylvestris site (PER). However, the high mean correlation between sites suggests a common regional climate signal. Therefore, the chronology is based on 336 TRW series of five different Pinus spp. The average age of the trees analyzed is 134 years, with a minimum of 20 years and a maximum of 489 years. The mean sensitivity was 0.153, and the first-order autocorrelation for the TRIres was −0.01. The correlation coefficient (Rbar) between trees is 0.29 whereas the correlation within trees equals 0.73. The variance explained by the first principal component is 33.58 % showing that a substantial fraction variability may be due to a single factor. The signal-to-noise ratio reaches 28.09, and EPS exceeds 0.85 after 1694, confining the reconstruction period to 318 years until 2012.

Fig. 3
figure 3

Correlation of the sampling sites sorted by elevation. Top right shows the correlations calculated over the 1940–1977 common period. Bottom left shows the correlations over the full periods of overlap between pairs of chronologies

Fig. 4
figure 4

TRIres chronology (in green); the residual chronologies of the 21 sites used (in gray); number of samples and EPS (computed over 30-y window lagged by 15 years) threshold in 1694 (Color figure online)

SPI and SPEI signals

The correlations between the residual chronology (TRIres) and the SPI and SPEI indexes from 1 to 24 months from both the regional instrumental series and the CRU regional series for the period 1951–2010 are shown in the Fig. 5. Even though the signal followed a similar pattern with both climate sources, the regional instrumental series always showed higher correlation coefficients (Fig. 5). The most consistent signal was observed for the 10–14-month period centered on July and August (r > 0.6) for the SPI correlated with the instrumental climate data (Fig. 5b). Correlation coefficients higher than 0.6 are also observed for other particular months and time spans, such as August of SPI4 (r > 0.6). According to this information, we opted for the SPI12July reconstruction, since this period provides valuable information on a time interval covering a period similar to the full hydrological year and has the highest correlation values (r = 0.63).

Fig. 5
figure 5

Correlation between the SPI and SPEI indices for 1 to 24 months and the regional residual chronology. Results for a, c CRU data and b, d for station data, both over the period 1951–2010. Correlation values higher than 0.25 are significant at p < 0.05

The relationship between the 12-month SPI and the chronology was consistent throughout the eastern Iberian Range (Fig. 6). Correlation values higher than 0.4 extend toward the interior mountain ranges, whereas low correlation values were found in the Ebro depression in the north and in the southern plateau in the southwest.

Fig. 6
figure 6

Spatial correlations for composite TRW chronology with gridded SPI12July data. Correlation values are significant at p < 0.05

The reliability of the model was confirmed by the high correlation (0.63) and adjusted correlation coefficients (0.40). The reduction of error (RE) is positive in both calibration and verification periods, indicating that there is skill in the reconstruction (Fritts et al. 1990). MSE and sing test have similar values to those presented in other SPI reconstructions (for instance, Levanic et al. 2013)

The two models obtained for the calibration periods 1951–1980 and 1981–2010 proved to be significantly effective and valid for the final reconstruction (Table 2 and Fig. 7). For the final lineal model, we used the full 1951–2010 period with the station-based SPI12 data (Eq. 1) to develop a SPI reconstruction reaching back to 1694:

Table 2 Calibration/verification statistics SPI12July reconstruction
Fig. 7
figure 7

Calibration and verification results of the station-based SPI12July reconstruction

$$ \mathrm{S}\mathrm{P}\mathrm{I}{12}_{\mathrm{July}}=4.9381*TR{I}_{\mathrm{res}}\hbox{--} 5.1132\left({r^2}_{\mathrm{adj}}=0.40;p<0.01\right) $$
(1)

SPI reconstruction

According to reconstruction presented here, the 12 months preceding July 2012 were the driest period with respect to the past 318 years in the Iberian Rang (Table 3). In order to investigate extreme drought and pluvial events, we considered 1.76 standard deviation positive and negative thresholds to identify 36 positive and 28 negative outliers since AD 1694 (Fig. 8). The eighteenth century had a higher recurrence of extreme dry events, whereas it was also the century with the fewest pluvial years. The nineteenth century shows fewer years with extreme events, while the twentieth century contains 38 % of all extreme events. Moreover, 6 of the 10 driest reconstructed years occurred within the last eight decades, and 9 of the 10 wettest reconstructed years occurred over the last 100 years. Concerning the 11-year moving mean, we identified seven dry periods in order of severity (1798–1809, 1961–1972, 1744–1755, 1871–1882, 1981–1992, 1701–1712, and 1771–1782). Accordingly, we identified five wet periods in order of intensity (1949–1960, 1756–1766, 1971–1982, 1883–1894, and 1810–1821). The fact that the dry and wet periods are not temporally clustered is to some degree expected after removing low-frequency trends when applying the spline detrending and considering the residual chronology.

Table 3 Reconstructed extreme dry and wet years and SPI12July values (in brackets)
Fig. 8
figure 8

SPI12July (IRDIR) reconstruction since AD 1694 for the Iberian Range. Dashed lines indicate ±1.76 standard deviations used to identify extreme events. Orange shading indicates 11-year-long dry periods (Color figure online)

Discussion

Paleoclimatic studies show that droughts are a frequent phenomenon throughout the Mediterranean region with vast environmental and socio-economic consequences (Martín-Vide and Vallvé 1995; Rodrigo et al. 1999). Detailed information on these events over the past centuries, particularly during the Little Ice Age (LIA), is scarce, however. Although there are some references regarding long-term precipitation changes (Nesje and Dahl 2003), most studies focused on temperature variability (Büntgen et al. 2013).

Based on 336 TRW series of 21 sites with a high coherence between species and elevation, we developed a 318-year SPI reconstruction representative for the central region of Spain. The reconstruction is the only record in southern Europe targeting SPI, a drought index integrating monthly precipitation data and the cumulative probability of a given rainfall event occurring at a station (McKee et al. 1993) and useful to characterize extreme dry and wet events. The main statistics used to verify the accuracy of the reconstruction present values similar to existing SPI reconstructions developed for Turkey (Touchan et al. 2005) and Romania (Levanic et al. 2013). For example, the variance explained by the first eigenvector is 33.58 %, while in Turkey, it is 42.75 % and in Romania 44.96 %. The lower percentage of explained variance by the first eigenvector in our composite chronology might be explained by the fact that we used five different Pinus species, while in the other reconstructions, only one species was used, Juniperus excelsa and P. nigra, respectively. The signal-to-noise ratio of 28.09 is higher than that observed by Levanic et al. 2013 (23.03), however.

The difference between the correlation coefficient (Rbar) between trees (0.29) and the correlation within trees (0.73) is typical for datasets integrating various sites. The mean correlation between sites is 0.46 (Fig. 3) although is higher for the sites above 1800 masl which are P. uncinata, P. sylvestris, and P. nigra. Nevertheless, the mean chronology captures the regional climate signal very well which highlights the advantages of regional averages (Briffa et al. 1998).

The raw chronology was longer than the final reconstruction; however, due to low sample replication, the EPS value drops below 0.85 in the early years of the chronologies (1503–1693). Therefore, the reconstruction was developed for the period 1694–2012. In order to reconstruct droughts for the whole LIA event, more cores of old living trees should be sampled. Additionally, as suggested by Esper et al. (2014), combining the living-tree samples with tree ring width measurements from historical structures would permit extending the reconstruction back in time, perhaps even over the last millennium.

The growth limitations of the Pinus sp. in the study area are expressed by the high positive correlation of the TRIres chronology with July SPI integrating drought conditions over 12 months (SPI12July). A similar metric was considered by Levanic et al. (2013) in Eastern Europe where the August SPI integrating 3 preceding months was reconstructed. In general, similar responses to late summer precipitation and annual precipitation were detected for other species in the Mediterranean basin (Touchan et al. 2007; Martin-Benito et al. 2010; de Luis et al. 2013).

The high correlation with precipitation is remarkable, since in other mountain areas temperature is usually the limiting factor and is commonly used for reconstruction. For instance, in the Alps, TRW has been used to reconstruct temperature (Büntgen et al. 2005), while for the Pyrenees, MXD is needed to get skillful records for a temperature reconstruction (Büntgen et al. 2008). Further south of the IP and with lower-elevation sites than in the Alps and the Pyrenees, we find a strong drought signal.

Our chronology also returned a good correlation with the SPI values on shorter time scales (August SPI for 4 months) and showed the strong influence that late spring and summer precipitation in the study area has on growth of Pinus sp. However, the SPI12July correlation is more consistent and not only is reporting information on the complete hydrological year but also is supported by the heavy influence exerted by nutrient storage in the previous year on the following year’s growth (Levanic et al. 2013).

With regard to the split calibration/verification approach, we are aware of the limitations of testing the chronology with only 60 years of instrumental data. The general distribution of meteorological observatories in Spain did not begin until the mid-twentieth century, however (Gonzalez-Hidalgo et al. 2011). Thus, even though the CRU dataset covers the 1901–2012 period, for the first 50 years, there were too few instrumental stations to represent the specific climate conditions of the study area. Despite these limitations, the correlation with the calibration period (r 2 = 0.40) is similar to other precipitation and SPI reconstructions (r 2 = 0.46, Akkemik et al. 2005; r 2 = 0.34, Levanic et al. 2013) using a longer time span for calibration.

Throughout this SPI12July reconstruction (IRDIR), we were able to identify droughts and pluvial events over the past 318 years. Drought events were most severe in 2012, 1931, 1879, 1741, 1803, 1965, 1726, 1967, 1986, 1994, and 1725. Some of these years (1803, 1879, 1965, and 1994) are also identified by Genova (2012) as years with severe droughts in Spain. In contrast, extremely wet events occurred in 1952, 1976, 1997, 2011, 1977, 1788, 1914, 1959, 2010, and 1964. Although there is overlap in some extreme dry and wet years with other drought reconstructions developed from other regions of Europe (Akkemik et al. 2005 for 1725, 1726,1797; Esper et al. 2007 for 1803, 1808; Buntgen et al. 2010 for 1959, 1967; Levanic et al. 2013 for 1725, 1914), what is truly remarkable is the consistency of the frequency of extreme events. While in the eighteenth century there was a high frequency of extreme events, in the nineteenth century, the frequency decreased considerably making a very quiet century in terms of droughts which might seem to be related to the end of the LIA (Jones and Bradley, 1992). In the northeast of Spain, the end of this episode does not seem to be related to a temperature increase, but to a decrease in the interannual variability (Saz 2003) and a reduced frequency of extreme drought events (Manrique and Fernández 2000), at least compared to what has been observed in the previous centuries, particularly the sixteenth and seventeenth. However, the frequency of the extreme events increases again during the twentieth century, particularly over the most recent 80 years, which is consistent with the last IPCC report (2013) linking anthropogenic activities with global climate change.

A larger number of extreme events are associated with catastrophic historical and cultural changes over the past 318 years. For instance, the great drought of 1725 (SPI of −1.63) is known as El año sin cosecha (the year without a harvest) in Monegros (Alberola 1996). Monegros is located in the north of the study region, which used to be an important bread basket for the Mediterranean population. The year 1725 was also reported to have been an extremely dry year with a serious famine and poverty in Romania (Levanic et al. 2013). Furthermore, Touchan et al. (2007) described 1725 as “the year with major drought in Anatolia and Syria.” The period 1749–1753 was very poor in terms of wheat harvest due to droughts having enormous socio-economic impact.

There are more episodes affecting harvests such as the great famine events during the first decade of the nineteenth century, which is reflected in the IRDIR with two dry extreme events (1803, SPI of −2.20, and 1808, SPI of −1.53). It is well known as the decade of nineteenth-century famines (Hambres decimonónicas). In 1803, thousands of people died in northeast Spain due to the lack of wheat brought about by poor harvests. Vicente-Serrano and Cuadrat-Prats (2007) indicated that 1803 was the year with the second highest number of rogations in the nineteenth century in Teruel and Zaragoza. At the end of that century, there was another period of severe famine caused by wheat shortages driving the price up. The year 2012 is described by the Spanish Meteorological Agency (AEMET) as an extremely dry and warm year all over Spain, and one which had socio-economic consequences.

The spatial representation of the IRDIR serves to characterize the climate of inland continental areas of the Iberian Peninsula. The IRDIR is consistent throughout the north and south plateau with a high correlation (r > 0.50); however, it is not so representative of the great depressions such as the Ebro Valley. The same pattern was presented by Esper et al. (2014), although the PDSI index and Juniperus sp. were used. These patterns allow us to identify a biogeographical region covering the entire Iberian Range as well as the central system and the north and south plateau. The area showing high correlations is mostly above 1000 masl (Fig. 6). However, in the main depressions such as the Ebro, the Duero, or the Tajo valleys, the IRDIR is not representative. Climate conditions affecting other high-altitude areas of Spain, such as the Pyrenees and Sierra Nevada, differ in terms of temperature and precipitation; therefore, the IRDIR does not represent these areas either.