Introduction

It is increasingly recognized that high-latitude regions will be highly responsive and sensitive to future climate warming (Overpeck et al. 1997; Smol et al. 2005; IPCC 2007; Smol and Douglas 2007). In the Arctic, the increases in surface temperature and precipitation that are projected to occur as a result of global warming will not only affect ecosystems, soil carbon storage, surface hydrology and sea-ice extent, but could also further magnify the effects of global climate change due to feedback mechanisms (Oechel et al. 1993; Johannessen et al. 1995; Kattenberg et al. 1996; IPCC 2007). For example, changes in Arctic snow and ice cover extent could have dramatic effects on high-latitude albedo, further increasing temperatures (Bonan et al. 1992; Foley et al. 1994; MacDonald et al. 1998; Chapin et al. 2000).

Meteorological station data for the period 1961–1990 suggest that portions of the Arctic have warmed in recent decades (Chapman and Walsh 1995; Serreze et al. 2000; Moritz et al. 2002; Bonsal and Prowse 2003). However, temperature changes and vegetation response in recent decades have not been spatially uniform. For example, portions of the eastern Arctic, particularly the Foxe Basin, have experienced negligible warming and possibly cooling (Serreze et al. 2000; Saulnier-Talbot and Pienitz 2001; Ponader et al. 2002). In addition, multi-year to multi-decadal patterns of temperature and precipitation variability have been observed in instrumental climate records from the Arctic and adjacent regions (Serreze et al. 2000), whereas paleoclimatic records suggest that even lower frequency modes of variability operating on millennial timescales may be a persistent feature of the Arctic system (Szeicz and MacDonald 2001; Hu et al. 2003). These observations suggest that there are a number of modes of Arctic climate variability that function on varying time scales and can serve to accentuate or dampen the warming that might be anticipated due to global climate change. It is clear that a more detailed understanding of the temporal and spatial patterns of past and present Arctic climate change and the response of this ecosystem to this change is required before the accuracy of modeling scenarios of the future can be evaluated. Unfortunately, observational records of arctic climate and environmental changes are too short (generally <100 years) to document the full range of the periodicity and amplitude of modes of Arctic climate variability.

One means of providing such information is to produce quantitative paleoclimate records. However, paleoclimate records spanning more than a 100 years remain sparse for the Arctic, particularly in the central Canadian Arctic and the adjacent Arctic archipelago. As part of an effort to develop quantitative paleoclimate records for this region, we developed a modern calibration data set for the interpretation of down-core midge remains. Previous research reported the sensitivity and rapid response of paleolimnological proxies such as diatoms and chironomids to recent and long-term climate change (MacDonald et al. 1993; Douglas et al. 1994; Smol and Douglas 2007; Hu et al. 2006). However, until relatively recently (Gajewski et al. 2005; Barley et al. 2006; Walker and MacDonald 1995), only a few studies were undertaken that described chironomid distributions and the relationship between midges and the contemporaneous environment in the central Canadian Arctic. It is vital that we continue to refine our knowledge of chironomid ecology and biogeography to understand better the environmental optima and tolerances of specific midge taxa. Doing so will likely enable more meaningful interpretation of subfossil midge stratigraphies and will increase the robustness of midge-based quantitative paleoclimate reconstructions.

In this paper, we describe the distribution of midges in a suite of 88 lakes spanning boreal forest, forest tundra and tundra in the central Canadian Arctic, we identify environmental variables that account for a statistically significant amount of variance in midge distribution, and we develop a quantitative inference model for average July air temperature based on a high-resolution gridded air temperature dataset. The application of this model to fossil chironomid assemblages preserved in late-Quaternary lake sediment will provide estimates of past air-temperature variability in this region and may help identify the influence of warming phases upon Arctic environments and peoples in the past.

Study region

Sediment and water samples were collected from a suite of lakes crossing strong climate and vegetation boundaries in the central Canadian Arctic. The study area (Fig. 1) includes Victoria Island and adjacent continental Nunavut, extending south to the northern boreal forest ecotone, and spanning the Picea treeline, shrub tundra, and mid-Arctic tundra ecozones. The sites on Victoria Island are located on late-Wisconsin glacial deposits consisting primarily of till draped over pre-Quaternary carbonate bedrock (Sharpe 1993). The mid-Arctic ecozone on Victoria Island consists of undulating lowlands, extensive drumlin ridges, patterned soils and deep, continuous permafrost. The mainland portion of the transect, located in Canadian Shield’s Slave Province, is underlain by Archean granite, gneisses and metamorphosed sedimentary and volcanic rocks (Padgham and Fyson 1992). This portion of the study transect is located in the low Arctic ecozone, which is modified by permafrost processes, resulting in hummocky ground features.

Fig. 1
figure 1

Study area, location of the 77 central Canadian Arctic lakes incorporated within the training set, and location of climate data grid points. A schematic of some lakes surrounded by their relative buffer and selected data grid points is also provided

The study transect spanned five vegetation zones: mid-Arctic tundra, low Arctic tundra, forest-tundra and boreal forest. Boreal forest is distinguished by Picea mariana, P. glauca, Pinus banksiana and Abies balsamea (Ritchie 1984). The forest-tundra transition zone is characterized by P. mariana, P. glauca and Larix laricina (Sirois 1992). Low Arctic tundra is typified by shrub forms of Salix and Betula, as well as Alnus. The mid-Arctic tundra zone is characterized by nearly continuous cover of either dwarf shrub heath tundra, or cottongrass-dwarf shrub heath tundra (tussock tundra), consisting of Ericaceae, Empetraceae, Dryas and dwarf Salix. Cambridge Bay, Nunavut, located in the northern portion of the training set has a mean annual temperature of −14.9°C and a mean July temperature of 8.0°C. Yellowknife, located at the southern extent of the training set, has a mean annual temperature of −5.2°C and a mean July temperature of 16.5°C. Mean annual precipitation varies with latitude; Cambridge Bay receives 141 mm and Yellowknife receives 267 mm. We attempted to maximize the temperature gradient in the calibration set by selecting lakes located along a transect that crossed perpendicularly the mean July temperature isotherms in the region.

Methods

Field sampling

Sediment and water samples for 60 of the lakes, VI 1–60, were collected between August 6th, 2007 and August 11th, 2004. Sediment and water samples for the remaining lakes, TK 1–36, were collected between August 7th, 1996 and August 19th, 1996 (see Table 1 for lake coordinates). Replicate cores were recovered from the approximate centre of each lake using a Glew (1991) mini-corer deployed either from the pontoon of a Bell 206 helicopter (2004) or from an inflatable Zodiac (1996). The top (0–1) cm interval of each replicate sediment core was extruded immediately, stored in a Whirl-Pak® and kept cool and in the dark until shipped back to the lab. During surface sediment collection, physical variables including surface-water temperature, specific conductance, pH and depth were measured. Epilimnetic water samples were also collected 0.5 m below the water surface in pre-cleaned polyethylene bottles and immediately treated in the field following the protocols outlined in the Analytic Methods Manual (Environment Canada 1996a, b). Untreated water samples were collected in one 125-ml glass bottle to analyse total phosphorus and two 125-ml polyethylene bottles for major ions and trace metals analyses. Water samples were filtered for Chl a on a 4.7-cm-diameter GF/F filter, and total phosphorus and total nutrients on a 4.7-cm- and 0.45-cm-diameter cellulose-acetate filter, respectively. Each lake in the training set was characterized by a suite of 23 variables including: maximum depth, summer surface-water temperature (SSWT), average July air temperature (AJAT), specific conductivity, pH, dissolved organic carbon (DOC), dissolved inorganic carbon (DIC), total organic carbon (TOC), total nitrogen-unfiltered (TN-UF), total phosphorus-unfiltered (TP-UF), chlorophyll a (CHLA), Cl, SO4, Ca, Mg, K, SiO2, total Kjeldahl nitrogen (TKN), Al, Ba, Fe, Li and Sr. Trace metal analysis was based on the dissolved and extracted portion of the water samples. Water chemistry analyses were carried out by the National Laboratory for Environmental Testing (NLET) at the Canadian Centre for Inland Waters in Burlington, Canada. Lakes which did not have water chemistry or sediment samples available (TK 1–5, TK-35) were removed from the calibration set and not included in further analyses.

Table 1 Environmental, limnological and chemical data for the 88 lakes sampled in the calibration set

Laboratory analyses

Chironomid analysis followed standard procedures, as outlined by Walker (2001). A minimum of 2 ml (range: 2–10 ml) of sediment was deflocculated in an 8% KOH solution and heated at 30°C for 30 min. The sediment slurry was sieved through a 95-μm mesh and the material retained on the sieve was rinsed with distilled water and washed into a beaker. The resulting solution was sorted in a Bogorov plankton counting tray under a Wild 5× dissection microscope at 50× magnification, and subfossil chironomid remains were removed with forceps. The subfossil specimens were permanently mounted on slides in Permount or Entellan for identification. Identification of the mounted remains was typically made at 400× and was based predominantly on Simpson and Bode (1980), Cranston (1982), Oliver and Roussel (1983), Wiederholm (1983), Walker (1988), Larocque and Rolland (2006), Brooks et al. (2007) and an extensive reference collection of Arctic subfossil chironomid remains housed at the Department of Geography, The Ohio State University (USA). The calibration set discussed in this paper was independently analyzed by two of the authors (Porinchu and Rolland) and systematically harmonized based on the authors’ knowledge of Holarctic subfossil midge taxonomy. Increasingly, chironomid researchers have been dividing the subfossil remains of Heterotrissocladius spp. into finer taxonomic groupings, such as H. marcidus, H. subpilosus, H. maeaeri, and H. grimshawi (Brooks et al. 2007); however, we conservatively chose to merge all the Heterotrissocladius spp. into a single taxonomic entity. Diagnosis for Zalutschia sp. A is available in Brooks et al. (2007) where it is referred to as Zalutschia sp. B; diagnosis for Zalutschia sp. B is available in Barley et al. (2006). Appendix A depicts a number of morpho-types encountered in the sediment and grouped together as C. oliveri-type. At least 45 head capsules (mean = 67.5) were enumerated and identified from the surface sediment (0–1 cm) from each of the calibration lakes with the exception of VI-04, VI-21, VI-31 and VI-40 (40–45 head capsules) and TK-17 and VI-23 from which 38.5 and 39.5 head capsules were identified, respectively. A sum of 45–50 head capsules has been shown to provide relatively consistent estimates of SSWT (Heiri and Lotter 2001). Five lakes (VI-30, TK-22, TK-25, TK-26, TK-28) were removed from further analyses due to low head capsule recovery. One lake, TK-12, was removed due to poor subfossil specimen preservation.

Gridded climate data

In this study, the closest weather stations were Yellowknife (62°27′ N; 114°24′ W), Cambridge Bay (68°07′ N; 105°03′ W) and Kugluktuk (67°49′ N; 115°05′ W), however, only Cambridge Bay is situated on our transect. Because of the paucity of meteorological data, the use of regression to relate latitude and temperature will not provide meaningful estimates of air temperature. However, as already applied by Barley et al. (2006), the use of a high-resolution climate data set compiled by New et al. (2002) provided mean July air temperature for each site. New et al. (2002) used the climatic normal from 1961 to 1990 to develop a 10′ latitude/longitude data set of mean monthly surface climate over global land areas. A subsample of 8,633 points covering the area between 60°N–75°N and 100°W–120°W was extracted from this climate data set. These points, characterized by a latitude (x axis), longitude (y axis) and mean July air temperature for 1961–1990 (z axis) were used in ArcGIS to create a grid in which the sampled lakes were also plotted (Fig. 1). The temperature variations along this grid were represented as a color gradient surface made of 26 classes, each one representing a 0.5°C range. For each lake, a buffer was created to select the grid points that were within the limit of a radius of 8.25 km around the lake (Fig. 1). The radius size was carefully chosen to select at least two points for each lake. The selected grid points were then linked to their mean July air temperature and an average July air temperature for each buffer was calculated. These values were then related to their respective lakes.

Statistical analyses

Non-limnological variables such as vegetation type, latitude and longitude were removed from all subsequent analyses. Forms of indirect gradient analysis, principal components analysis (PCA) and detrended correspondence analysis (DCA), were implemented to assess whether the limnological or biological characteristics, respectively, of specific lakes in the training set should be considered outliers. Lakes were identified as outliers and removed from further analyses if their scores on the first two axes of a PCA of the environmental data and a DCA of midge data were greater than one standard deviation of the mean sample score for each respective axis (Birks 1995).

The ordination analyses (DCA, RDA and partial RDAs) were based on all taxa present in the training set lakes and the square-root transformed relative abundance data. A DCA of the midge relative abundance data, with rare taxa down-weighted and detrending-by-segments, was undertaken to identify the degree of compositional turnover in the midge distribution (Birks 1998). The length of the gradient in the distribution data was used to identify whether linear or unimodal approaches were suitable for direct gradient analyses. Linear approaches are appropriate when the gradient length captured by the assemblage data is short, i.e. <2 S.D. units. The gradient lengths of the DCA axis 1 and 2 were 2.13 and 1.95 S.D. units, respectively, indicating the use of RDA for constrained ordinations (Birks 1995). A series of RDAs constrained to individual environmental (predictor) variables (n = 23) with Monte Carlo permutations (499) was undertaken to identify those environmental variables that could account for a statistically significant amount of variance (P ≤ 0.05) in the midge assemblages. Of the remaining variables, those that were highly co-linear were removed one at a time until all variables had variance inflation factors (VIFs) < 10×. Forward selection, with Monte Carlo permutation tests (499 unrestricted permutations), was used to identify a minimal subset of the remaining environmental variables (n = 16) that could account for a statistically significant and large amount of variance present in the midge assemblage data. The amount of variance that AJAT or SSWT could account for, independent of the other variables identified in the forward selection procedure, was assessed using variance partitioning (Borcard et al. 1992), which was implemented through a series of partial RDAs.

Midge-based inference models for AJAT and SSWT were developed using a number of techniques including: weighted-averaging, weighted-averaging-partial least squares and partial least squares. The robustness of the transfer functions was evaluated through the use of jackknifing (Birks 1998). Model performance was assessed based on three criteria: (1) root mean square error of prediction (RMSEP), (2) maximum bias of the model, and (3) the minimum number of components (Birks 1998), incorporated in the model. Following Jones and Juggins (1995), Lotter et al. (1998) and Porinchu et al. (2002), samples were considered outliers and not included in the final model(s) if the absolute residual (predicted AJAT or SSWT—‘observed’ AJAT or SSWT) was greater than one standard deviation of AJAT or SSWT, respectively. The program C2 (Juggins 2003) was used for inference model development. A number of different statistical approaches such as weighted averaging (WA), weighted averaging-partial least squares (WA-PLS) and partial least squares (PLS) were used to model the relationship between AJAT/SSWT and midge abundances. Lakes with high absolute residuals, determined by a 1-component WA-PLS model, were removed from the AJAT and SSWT inference models prior to calculation of the performance statistics.

Results

Screening of the environmental data (PCA) and midge assemblage data (DCA) indicated that TK-24, which is characterized by unusual water chemistry and midge community composition, is an outlier. The midge community in this lake is atypical with high relative abundances of Procladius, Corynoneura and Cricotopus. Additionally, the water chemistry of this lake is unusual with high trace-metal concentrations (Sr, Ba), conductivity, DOC and TN-UF (see Table 1 for a summary of the values of major environmental variables).

The biogeographical distribution (arranged by latitude) and relative abundances of the major midge taxa are illustrated in Figs. 2 and 3. Lakes on Victoria Island are characterized by chironomid communities dominated by Tanytarsus spp., Chironomus anthracinus-type, Corynocera oliveri-type, Psectrocladius sordidellus-type and, surprisingly, Dicrotendipes, which is generally considered a thermophilous taxon. Chironomid communities found in the northern mainland Arctic tundra zone are distinguished by relatively high abundances of Limnophyes and Zalutschia sp. A and the near absence of Corynocera oliveri-type and Micropsectra. The southern mainland Arctic tundra sites are notable for their high proportion of Corynocera ambigua-type, Heterotrissocladius, Micropsectra, Zalutschia sp. B and the near absence of Chironomus anthracinus-type. The forest-tundra zone is distinguished by Heterotanytarsus, whereas taxa such as Microtendipes, Polypedilum and Dicrotendipes, typically considered warm thermophiles, are most abundant and commonly found in the lakes located in boreal forest.

Fig. 2
figure 2

Relationship between water (field measurements) and average July air temperature (derived from high-resolution gridded climate data)

Fig. 3
figure 3

Relative abundance of chironomids in the training set lakes. Lakes are listed in order of decreasing latitude (°N). Lake id numbers are linked to id numbers in Table 1

The results of the RDA restricted to individual predictor variables (n = 23) indicated that DOC and SiO2 did not explain a statistically significant amount of variance in the midge assemblages. Of the remaining variables, DIC, Ca, Mg and TKN had high variance inflation factors (VIFs) and were removed sequentially until the VIFs of all remaining variables were below 10× (n = 16). As indicated by Fig. 3, SSWT co-varied with AJAT, and was therefore removed in order to keep AJAT in the analysis as an explanatory variable. The eigenvalues of axis 1 and 2 of a RDA limited to the remaining 15 variables (maximum depth, AJAT, specific conductivity, pH, TOC, TN-UF, TP-UF, CHLA, Cl, SO4, K, Al, Ba, Fe, Li and Sr), were 0.226 and 0.045, respectively (see Table 2). These 15 predictor variables explained 40.6% of the variance present in the chironomid abundance data. The forward selection procedure identified six variables which could capture a large and statistically significant fraction of the overall variance present in the midge data: maximum depth, pH, AJAT, TN-UF, Cl and Al. A RDA constrained to these six variables accounted for 75.8% of the variance captured by the full suite of 15 predictor variables. The eigenvalues for the first two RDA axes constrained to the six forward selected variables were 18.4 and 4.5, respectively. These two axes captured 22.9% of the variance in the midge abundance data and 77.6% of the variance in the taxa-environment relationship. The forward selected variables summarized the relationship between midge distributions and the measured environmental variables well, and therefore, were used to construct the RDA bi-plots (Fig. 4).

Fig. 4
figure 4

(a) Redundancy analysis (RDA) of taxa. The arrows represent the vectors of the six environmental variables explaining the distribution of chironomids in the 77 lakes. The numbers refer to the taxon id# presented in Table 6. (b) RDA analysis of the 77 lakes classified by vegetation

The canonical coefficients, intra-set correlations and approximate t-tests indicate that maximum depth, TN-UF and pH are strongly correlated with RDA axis 1, and that AJAT and pH are strongly correlated with RDA axis 2 (Table 3). Although depth does account for the largest amount of variance in this dataset, the primary purpose in developing this training set is to assess the relationship between midge distributions and SSWT/AJAT, therefore, we focus on these variables for the remaining analyses. The ratio of λ1 to λ2 in a RDA restricted to AJAT as the sole explanatory variable is low (0.245); however, this low ratio is due to the large amount of variance captured on RDA axis 2 by AJAT (Fig. 4). Partial RDAs indicate that AJAT captures 3.6% of the variance in the midge dataset, independent of the effects of the other forward selected variables (Table 4).

Table 2 Ratios of the eigenvalues of the first (constrained) RDA axis to the eigenvalues of the second (unconstrained) RDA axis
Table 3 Canonical coefficients, intra-set correlations and approximate t-test values for the four predictor variables identified in forward selection for the first two RDA axes
Table 4 Summary of partial RDAs based on chironomid assemblages from the 77-lake training set

The RDA bi-plots primarily separate the deep, nitrogen-limited, southern mainland arctic tundra sites from the shallow, nutrient-rich Victoria Island and northern mainland arctic tundra sites (Fig. 4). In addition, the majority of boreal forest and forest-tundra lakes load high on the AJAT axis, distinguishing them from the majority of the arctic tundra sites. Midge taxa associated with the deep, nitrogen-limited lakes include: Abiskomyia, Sergentia, Stictochironomus, Protanypus and Ablabesmyia. Taxa associated with the well-buffered, nutrient-rich lakes include: Cladotanytarsus mancus-group, Dicrotendipes and Cladopelma. RDA axis 2, which primarily captures AJAT, indicates taxa such as Microtendipes, Polypedilum, Pagastiella and Stempellina/Zavrelia are associated with the warmest calibration set lakes, whereas taxa such as Pseudodiamesa, Hydrobaenus/Oliveridia, Sergentia and Stictochironomus are most abundant in the coldest lakes.

The inference model developed for AJAT consisted of 50 taxa and was based on 77 lakes, whereas the inference model developed for SSWT consisted of 50 taxa and was based on 75 lakes. Totals of 11 and 13 lakes were removed from the AJAT and SSWT inference models, respectively, due to high residuals. The performance statistics for the various inference models are presented in Table 5. A 2-component WA-PLS model provides the most robust performance statistics for AJAT, with an r 2jack  = 0.77, RMSEP = 1.03°C and a maximum bias of 1.37°C. The beta coefficients used in the 2-component WA-PLS inference model for AJAT are presented in Table 6. Plots of the cross-validated inferred AJAT against observed AJAT and the residuals from the 2-component WA-PLS inference model are depicted in Fig. 5, with no trend apparent in the residuals (negative trend r 2 = 0.22, P < 0.0001). The best model, (Birks 1998), for SSWT, based on a 1-component WA-PLS approach, provides an r 2jack  = 0.75, an RMSEP = 1.39°C and maximum bias of 2.33°C (Table 7).

Table 5 Performance statistics for the different models relating average July air temperature to chironomid variance
Table 6 Values for all taxa of training set for: taxon occurrence (percent of lakes in which taxon was present), taxon id#, the minimum (AJAT Min) and maximum (AJAT Max) associated with beta coefficient values based on square-root transformed data (Beta_C2_X transformed) and untransformed data (Beta_C2_X not transformed) used in the 2 component WA-PLS model
Fig. 5
figure 5

(a) Relationship between observed average July air temperature (derived from high-resolution gridded climate data) and inferred average July air temperature. The solid line represents the 1:1 relationship. (b) Observed average July air temperature versus residuals

Table 7 Comparison of the average July air temperature and water temperature chironomid-based inference models with published Canadian chironomid-based training sets

Discussion

The development and expansion of training sets in the western Arctic (Barley et al. 2006) and the eastern Arctic (Larocque et al. 2006) has further focused attention on the identification and distribution of Corynocera species in the circum-Arctic region. Barley et al. (2006) discuss the difficulty associated with consistently identifying C. oliveri-type, which is characterized by a raised mentum, a diagnostic characteristic shared by Tanytarsus lugens-type, and the presence of a large surface tooth on the mandible. Barley et al. (2006) describe the occurrence of chironomid subfossils possessing C. oliveri-type mandibles in the western Arctic, but note that there are a number of types each with distinct mentum morphologies. Our results are in agreement with the Barley et al. (2006) assessment; there appear to be a number of Corynocera oliveri morpho-types in our training set. These Corynocera oliveri morpho-types all possess a large surface tooth on the mandible and a raised mentum, but each morpho-type has a distinct median tooth complex (see Appendix 1). The biogeography of Corynocera is also interesting. For example, the limited presence of C. ambigua-type in the Canadian Arctic archipelago has been used to support the suggestion that the modern distribution of Corynocera ambigua-type in the Canadian Arctic reflects the existence of a Beringian refugium for this taxon (Barley et al. 2006). However, in our training set, C. ambigua-type is present in 10 of the 30 lakes we sampled on Victoria Island and, in addition, it comprises over 20% of the chironomid community in six of the lakes. These results suggest that the biogeographical history of this taxon in the Canadian Arctic may be more complex than previously thought.

Surprisingly, Dicrotendipes was also present at high levels (30–40%) in three lakes on Victoria Island. Dicrotendipes is generally considered a thermophilous taxon and is more commonly associated with sites located south of treeline (Oliver and Roussel 1983; Walker and MacDonald 1995; Porinchu and Cwynar 2000; Larocque et al. 2006). The Victoria Island lakes in which Dicrotendipes is most abundant are between 0.8 and 1.2 m deep; it is likely that depth and amount of littoral habitat are important determinants of Dicrotendipes abundances. These results could also suggest changes in the distribution of this taxon due to recent regional warming on Victoria Island. A strong negative correlation exists between Dicrotendipes and C. oliveri-type in the northern training set lakes, suggesting a possible inter-specific link between the abundances of these two taxa. Also of note is the bi-modal distribution of Sergentia, with high relative abundances in both the northern and southern portion of the training set. A similar pattern has been reported from northern Quebec and Labrador and may be due to the presence of two distinct Sergentia types (Larocque et al. 2006). However, in our dataset the high abundance of Sergentia in the southern boreal forest lakes appears to be a function of lake depth, with high abundances of Sergentia occurring in the two deepest lakes in the training set.

The majority of calibration sets that have been developed in the Arctic and sub-Arctic regions, to date, have identified depth and various measures of temperature (AJAT, SSWT) and lake productivity (O2 concentration, total P, sediment organic content and Chl a) as being strongly correlated with midge distribution (e.g. Barley et al. 2006; Brodersen and Anderson 2002; Brooks et al. 2001; Larocque et al. 2001; Olander et al. 1999; Quinlan et al. 1998). Because the relationship between midges, depth, temperature and productivity has been discussed in great detail in the paleolimnological literature, it will not be repeated here. The identification of pH as a predictor variable, while not commonly reported, is not unexpected. Researchers have identified the impacts of natural and anthropogenic acidification using midge remains (Johnson and McNeil 1988; Brodin 1990; Velle et al. 2005) and a number of lake-pH classification schemes have been developed that rely on midge community composition (Johnson et al. 1990; Halvorsen et al. 2001; Bitusik et al. 2006). In addition, Barley et al. (2006) found that pH accounted for a statistically significant amount of variance in midge community composition in their calibration set. Owing to variations in underlying geology, our dataset spans a large pH gradient (3.69), with Victoria Island sites well buffered due to the presence of carbonate marine and glacial deposits, whereas most of the northern mainland sites are poorly buffered due to their location on the Canadian Shield. Therefore, it is not surprising that pH has a statistically significant relationship with midge distributions. The midge-based AJAT model compares favorably to previously developed inference models from the Canadian Arctic (see Table 7). The AJAT inference model presented in this paper, based on 77 lakes, has the lowest RMSEPjack and maximum bias relative to other regional training sets (Barley et al. 2006; Francis et al. 2006; Larocque et al. 2006). The range of AJAT captured by this model is less than the previously published models, which accounts for the slightly greater, but comparable value of ‘RMSEP as a percentage of the AJAT range.’ The SSWT, based on 75 lakes, was less robust than the AJAT inference model; however, it captured a larger SWT range and it had the lowest RMSEP and maximum bias relative to the existing SWT inference models (Walker et al. 1997; Francis et al. 2006; Larocque et al. 2006).

A number of studies in northern temperate and sub-arctic regions that used calibration sets and multivariate statistical techniques have demonstrated that mean summer air temperature and/or summer surface-water temperature have a statistically significant relationship with the distribution of chironomids. This has led to the development of many chironomid-based inference models for temperature (Lotter et al. 1997; Walker et al. 1997; Olander et al. 1999; Larocque et al. 2001, 2006; Brooks and Birks 2001; Barley et al. 2006; Francis et al. 2006). Researchers have applied these models to sub-fossil chironomid assemblages preserved in lake sediments to reconstruct quantitatively changes in past surface-water temperature in Atlantic Canada, the northeastern United States and the western United States during the late-glacial and Holocene (Walker et al. 1991; Levesque et al. 1993, 1997; Cwynar and Levesque 1995; Potito et al. 2006) and reconstruct past air temperature in Arctic Canada and western Europe during the late-glacial and the early Holocene (Lotter et al. 1999; Brooks and Birks 2001; Heiri and Lotter 2005; Velle et al. 2005; Rolland et al. in press). In some cases, intra- and inter-regional comparison of model performance through the merging or the expansion of datasets has been attempted (Porinchu et al. 2007; Lotter et al. 1999; Barley et al. 2006). With the inclusion of the lakes in this calibration set, four major training sets, incorporating over 300 lakes, exist for the circum-Canadian Arctic. Therefore, it should be possible to begin systematically assessing the larger-scale distributions and biogeographic relationships of arctic midge taxa and midge communities. Such a project should be pursued.

Increasingly, midge training sets are being used to develop quantitative inference models for air temperature, rather than mean summer/July surface water temperature, the variable most commonly used in earlier calibration studies (Walker et al. 1991; Olander et al. 1997; Lotter et al. 1998). From an ecological standpoint, there are a number of reasons midge community composition may be influenced by ambient air temperature. For example, most midges possess limited dispersal ability and, therefore, thermal conditions during adult emergence can greatly alter the reproductive success (fitness) of specific taxa, which in turn may influence midge community composition (Armitage et al. 1995). The value of using air temperature is two-fold: gridded climate data of sufficient spatial resolution will provide more representative estimates of site-specific thermal conditions than spot measurements of surface-water temperature, and the interpretation of air temperature is more straightforward because surface-water temperature is a function of multiple variables including lake depth, elevation/latitude and air temperature. However, the relationship between SSWT and midge communities is more direct and immediate. Water temperature can directly affect chironomid communities by influencing the rate of egg and larval development and the timing of eclosion (Armitage et al. 1995).

The performance of AJAT or SSWT inference models will be influenced greatly by the range of lake depths present in the training set. Lake depth, which influences lake water temperature, has been shown to strongly influence midge distributions in the circum-Arctic region (Olander et al. 1999; Porinchu and Cwynar, 2000; Larocque et al. 2001, Barley et al. 2006). Application of these models to down-core assemblages, therefore, should take into account the depth of the lake for which the reconstruction is being developed. For example, it may be more appropriate to remove deep lakes (i.e. >10 m) from an inference model if the model is being applied to chironomid stratigraphies recovered from shallow, Arctic ponds. Additionally, it is imperative that training sets incorporating sediment collected in the 21st century use the most recent climate normal period (1971–2000). The Arctic has been dramatically warming in recent decades. Calibrating subfossil midge distributions with the 1961–1990 Climate Normal will likely systematically underestimate Holocene reconstructions, although trends that are identified will still be meaningful.

Global climate models project significant increases in surface temperatures by 2100 AD. Although this change represents a global average, it is clear that differing responses will occur at regional and sub-regional scales. The recent Arctic Climate Impact Assessment (ACIA 2005) and IPCC report (IPCC 2007) indicate that the Arctic has, and will continue to warm over the next century. Recent rates of circum-Arctic climate and environmental change make it clear that improving our understanding of the temporal and spatial patterns of Arctic climate change in the past and the response of Arctic systems, particularly vegetation, to climatic change will be critical to assess the accuracy of future modeling scenarios. For example, the spatial and temporal imprint of the Holocene Thermal Event (HTM), a period of prolonged warming in the Arctic, may provide useful comparative data and potential analogues for anticipated future warming. In addition, a number of abrupt climate perturbations, such as the 9.2 k and 8.2 k events, have been well documented in Greenland ice cores (Alley et al. 1997) and the eastern Canadian Arctic archipelago (Miller et al. 2005), yet little evidence of these events has been discovered in the central Canadian Arctic (Seppa et al. 2003). Application of the quantitative midge-based inference model developed in this paper to subfossil midge assemblages extracted from Holocene sediment cores recovered from the central Canadian Arctic will provide the first quantitative estimates of the thermal conditions that existed during this period. When this approach is combined with pollen analyses, it will improve our understanding of vegetation response to climate change in this region.