Introduction

Although daily weather and seasonal to inter-annual climatic variability influence mosquito vector biology and risk of vector-borne disease, this information is not readily employed in disease control programs. Rather, such programs rely typically on trap-based surveillance (e.g., Reisen et al. 1999; White 2006; Krockel et al. 2006). Although traps provide a cost-effective means of establishing relative mosquito abundance, the resulting count data are highly variable in space and affected by non-biological day-to-day changes in trapping efficiency. Because of this, long-term historical count records provide useful benchmarks only if trap locations (and the environment adjacent to static trap locations) remain constant through the record. Similarly, count data alone provide little insight into potential changes in relative mosquito abundance under altered climate conditions.

From a practical standpoint, the collection and identification of trap data is time-intensive, diminishing the utility of data in operational control and public health advisories. This is further complicated by the fact that the traps provide only a snapshot of relative mosquito abundance at specific locations, rather than a broader (e.g., county-scale) measure of mosquito prevalence. This disconnect in spatial scale is one possible reason why climatic information is not used routinely as a part of vector management or public health awareness strategies. Although related, the local scale environmental factors that drive catch at a particular trap are often not reflective of the broader regional climate conditions that influence relative mosquito abundance over a wider area.

Several models have been developed to describe population dynamics of mosquito disease vectors using climate data. Some, but not all, models include the effect of temperature, which drives mosquito development, and rainfall, which provides availability of breeding sites for mosquitoes. However, rainfall may also reduce larval survival through flushing effects (Koenraadt and Harrington 2008) and can reduce trap counts (DeGaetano 2005) used for estimating mosquito populations. Several detailed studies have been developed for specific vectors. For example, Focks et al. (1993a) developed a detailed model to describe the life history of the dengue and yellow fever vector, Aedes aegypti. Ahumada et al. (2004) developed a matrix model based on temperature and rainfall for the avian malaria vector in Hawaii, Culex quinquefasciatus, and successfully simulated field results. Shone et al. (2006) were able to describe populations of the saltmarsh mosquito Aedes sollicitans in Maryland using time-dependent Poisson regression models. Unfortunately, however, many models describing vector population dynamics contain assumptions about the basic biology and ecology of mosquitoes that are difficult to validate in field settings.

Furthermore, a significant number of models for vector-borne infections do not take into account the variation in vector population dynamics that is a key driver of transmission (Esteva and Vargas 1999; Derouich et al. 2003; Ngwa 2004). Despite the value of models for conceptualizing the relative role of different parameters in the vector-borne disease cycle, few models have been developed that are employed on a regular basis in vector-borne disease surveillance activities (Focks et al. 1993a, b, 1995; Eisenberg et al. 1995; Ahumada et al. 2004). Wonham et al. (2004) describe an epidemiological model for West Nile virus (WNV) with direct applicability to mosquito control.

Given that abiotic and environmental factors can vary greatly at small geographical scales, developing models that are adaptable to different environments is critical. In addition, there are large gaps in the understanding of how climate and environmental change influence mosquito vectors and ultimately vector-borne disease transmission. There is a critical need to fill knowledge gaps for these systems, both in the United States (Ebi et al. 2006) and globally.

As a first step towards improving knowledge of climate and vector-borne disease patterns, two models to simulate the activity of Culex pipiens and Culex restuans arbovirus vectors in the Northeastern US were developed and tuned with laboratory and field data. WNV has become an endemic infection in the US since its introduction in the New York metropolitan area (CDC 1999a, b). WNV introduction, and now endemic transmission, in the US provide an opportunity to develop models that will be useful and adaptable to other existing arboviruses and future introductions of novel mosquito-borne pathogens into the region.

The goals of this study were (1) to understand how climate and environment affect mosquito populations at the microhabitat level; (2) to develop methods to scale data from the microhabitat level to a broader (several square kilometer) area using regional weather observations; and (3) to develop a prototype for a reliable predictive model that can be adapted for mosquito control and public health professionals in a variety of settings. Ultimately, the incorporation of a pathogen component to the model is planned. The initial focus was two Culex species (Cx. pipiens and Cx. restuans) that may be important in local transmission of vector-borne diseases that involve avian amplification cycles. Long-term data sets of adult abundance were utilized, and larval development with temperature and precipitation in local habitats was investigated by sampling, identifying and staging larvae with return to the field habitats.

Materials and methods

Mosquito collection data

Historical mosquito collection data were available for model development from New Jersey Light trap collections from mosquito abatement districts with the following locations and years: Site 1, North Brunswick, Middlesex County, New Jersey (40.4°N,74.5°W) 1990–1993, 1997, 1999, 2001 and Site 2, near Cambridge, Dorchester County, Maryland (38.56°N, 76.07°W ) 1975,1977,1982–1985. A New Jersey Light trap at site 1 was operated from 1984 to 2004, and at site 2 from 1958 to 1989. At site 1, 1990–2001 data and at site 2, 1974–1977 and 1980–1986 data were used for this study based on the completeness of data in individual years. Traps were run from dusk to dawn Monday–Friday from 1 June to 15 September.

These data highlight the spatial variations in trap catch that may occur across a county-size area. In Fig. 1, 7-day running mean trap counts from the North Brunswick site are plotted with coincident data from two other traps within Middlesex County. The Carteret trap is located approximately 32 km to the northeast of the North Brunswick trap. The Edison trap is at an intermediate location, approximately 13 km to the northeast of the North Brunswick trap. Using 1993 as a representative example, the between-trap correlation is low, ranging from about 0.55 for Carteret and the other two sites to only about 0.33 for North Brunswick and Edison. Nonetheless, together the three traps reflect the general county-wide trends in mosquito abundance, such as the rapid increase in catch in late May, a broad early-summer peak in capture, and a prolonged mid- to late-summer period of relatively low capture. In addition, finer temporal scale features such as secondary count peaks around 15 June and mid-September, and local count minima in early June and later in September are well represented by all traps.

Fig. 1
figure 1

Comparison of Culex capture at three trapping locations in Middlesex County, New Jersey (NJ): thin solid line North Brunswick, dashed line Carteret, dotted line Edison. The average capture across the three traps is given by the heavy solid line

In addition, side-by-side NJ light trap and Centers for Disease Control (CDC) trap collections were conducted at a local field site to further validate model simulations in an independent geographical location and relate data from the two types of surveillance traps. The site was located at a farm in Freeville, New York (42° 31′ 52″ N, 76° 19′ 41″ W). Traps were operated on alternate days with collections from the light traps made on Tuesday and Thursday, and the CDC collections made on Wednesday and Friday. Figure 2 shows the total weekly counts from these traps. Like Site 1, the side-by-side trap counts are not strongly correlated (r = 0.41, omitting the first week, which is uncharacteristically low for the CDC trap). However, the general seasonal pattern of mid-July and mid-August peaks and a sharp late-August (early-September) decline is reflected by both traps, albeit with a lag of as much as a week.

Fig. 2
figure 2

Total weekly capture during 2006 at co-located Centers for Disease Control (CDC) (thin solid line) and NJ light (dashed line) traps. The heavy solid line shows the average capture. Mechanical problems affected both traps in early August

Mosquitoes collected in traps were identified to genus and, in some cases, to species by mosquito abatement district personnel for historical data sets. Routinely, Culex species were not differentiated, necessitating the grouping of species, because it can be difficult to reliably identify and differentiate Cx. pipiens and Cx. restuans adults (Harrington and Poulson 2008). The counts of grouped species are typically used in operational mosquito control and surveillance programs, with the prevalence of Cx. pipiens relative to Cx. restuans influenced to some extent by degree day accumulation (Kunkel et al. 2006). For sites 1 and 2, overall Culex counts represented Culex pipiens, Cx. restuans and Culex salinarius. Mosquitoes in more recent collections from New York State were identified to species as larvae and to genus as adults in our laboratory using standard published keys (Means 1987; Andreadis et al. 2005). Total Culex capture for historical data sets were averaged over a 7-day period (3 days prior and 3 days after and including current day) to smooth abundance parameters and account for daily sampling inconsistencies.

Meteorological and local scale environmental data

Daily average temperature, rainfall and evaporation near each trap location were obtained from the National Oceanic and Atmospheric Administration (NOAA) Northeast Regional Climate Center. In addition, hourly air temperature at the New York collection site was gathered at the microhabitat level using HOBO data loggers (Onset® Computer Corporation, Bourne, MA). Hourly daylight was computed for each location using standard astronomical equations (e.g., Walraven 1978).

Model development and structure

The models were developed to encompass both immature and adult stages of mosquitoes by separating the life cycle into two distinct stages: immatures (eggs–pupal eclosion) and adults (both males and females from eclosion to death) as shown in Fig. 3.

Fig. 3
figure 3

Schematic diagrams showing variables and the relationship between variables of a development rate and b degree day (DD) models

Two discrete models were developed to simulate the temporal dynamics of Culex mosquito populations. Both models used a temperature-dependent function to estimate mosquito development and survival. The degree day (DD) model used the accumulated thermal units above the zero development threshold for Culex (see explanation below) to determine the days from egg to adult, while the development rate model used a temperature-dependent function to estimate the percentage of immature mosquitoes to eclose on each day. Both models used the functions of “Survival Rate” and “Egg Laying Rate” described below. The following difference equations were used to calculate the population numbers of mosquito immature life stages and adults on each day.

  1. 1.

    Degree day (DD) model

    $$ {N_I}(t) = {N_I}\left( {t - 1} \right) * {\delta_I}\left( {t - 1} \right) + {N_A}\left( {t - 1} \right) * \beta \left( {t - 1} \right) - {N_E}\left( {t - {t_{DD}}} \right) * {\delta_{EA}}\left( {t - {t_{DD}}} \right) $$
    (1)
    $$ {N_A}(t) = {N_A}\left( {t - 1} \right) * {\delta_A}\left( {t - 1} \right) + {N_E}\left( {t - {t_{DD}}} \right) * {\delta_{EA}}\left( {t - {t_{DD}}} \right) * \left[ {1 - {F_d}\left( {t - 1} \right)} \right] $$
    (2)
    $$ {N_E}\left( {t - {t_{DD}}} \right) = \beta \left( {t - {t_{DD}}} \right) * {N_A}\left( {t - {t_{DD}}} \right) $$
    (3)
  1. 2.

    Development rate model

    $$ {N_I}(t) = {N_I}\left( {t - 1} \right) * {\delta_I}\left( {t - 1} \right) + {N_A}\left( {t - 1} \right) * \beta \left( {t - 1} \right) - {N_I}\left( {t - 1} \right) * {\rho_{Dev}} \left( {t - 1} \right) $$
    (4)
    $$ {N_A}(t) = {N_A}\left( {t - 1} \right) * {\delta_A}\left( {t - 1} \right) + {N_I}\left( {t - 1} \right) * {\rho_{Dev}}\left( {t - 1} \right) * \left[ {1 - {F_d}\left( {t - 1} \right)} \right] $$
    (5)

    Where (for both models):

    NI :

    Number of immatures

    NA :

    Number of adults

    NE :

    Number of eggs laid

    δI :

    Survival rate of immatures

    δA :

    Survival rate of adults

    δEA :

    Survival rate from egg to adults

    β:

    Egg laying rate

    ρ Dev :

    Development rate

    F d :

    Diapausing fraction of the newly emerged adults

    t :

    Time (days)

    tDD :

    Time (days) to accumulate required number of DD.

All rates are daily and per capita. The survival rates used in the above equations are related to the mortality rates depicted in Fig. 3 by the equation δ = 1−mortality rate.

The simulation was started on 1 April and ended on 31 October for each year over the 7-year period at site 1 and the 6-year period at site 2. The mosquito population was initialized every 1 April with a minimum number of 100 eggs and 10 adults.

Functions and parameters

Development rate

Temperature has a major effect on insect development; consequently, the thermal requirements of development are often used as a basis for predicting population dynamics (Wagner et al. 1991). Development rates, ρ Dev, for local strains of Cx. pipiens and Cx. restuans were measured in the laboratory under different constant temperatures (10, 15, 20, 25, 30, 32 and 34°C) (Tables 1, 2). In this model, the reciprocal of development time determined from the laboratory was applied as a rate (Wagner et al. 1991). Different functions that have been used to fit development rate results for insects were evaluated and the Sharpe & DeMichele equation (Eq. 6) with four parameters (A, HA, HH, TH) provided the best fit (Fig. 4) [see Rueda et al. (1990) for details].

Table 1 Development time and rate of the New York strain Culex pipiens females from egg hatch to adult eclosion
Table 2 Development time and rate of the New York strain Culex restuans females from egg hatch to adult eclosion
Fig. 4
figure 4

Comparison of laboratory development rates for local strains of Cx. pipiens (dotted line) and Cx. restuans (dashed line) with model estimated rates (solid line)

$$ {\rho_{Dev}} = {\hbox{A}} * \frac{\text{K}}{{{298}{.15}}} * \frac{{{ \exp }\left[ {\frac{\text{HA}}{{{1}{.987}}} * \left( {\frac{{1}}{{{298}{.15}}}{ - }\frac{{1}}{\text{K}}} \right)} \right]}}{{{\hbox{1 + exp}}\left[ {\frac{\text{HH}}{{{1}{.987}}} * \left( {\frac{{1}}{\hbox{TH}}{ - }\frac{{1}}{\hbox{K}}} \right)} \right]}} $$
(6)

Based on the laboratory data A = 0.25; HA = 28094; HH = 35362 and TH = 298.60 and, K is the air temperature in units of Kelvin. Equation 6 was applied directly (without further fitting) in the population model.

Degree day conversions

Degree day criteria for mosquito development from egg to adult was determined based on development results from laboratory studies with Cx. pipiens (Table 1) and Cx. restuans (Table 2). Based on these studies, a basic minimum development threshold of 10°C was identified. Degree day development time was determined by taking the summation of the difference between the hourly temperature and 10°C threshold divided by 24 (Gerade et al. 2004). Cx. pipiens and Cx. restuans exhibit distinctly different seasonality where they co-occur in North America (Kunkel et al. 2006). Seven years of larval surveillance indicates that Cx. restuans is generally abundant in Central New York State from June to July while Cx. pipiens is generally abundant from July to August (L. Harrington et al., unpublished data). In our study, the estimate of DD development time for the earlier species, Cx. restuans, provided better model performance for both species.

Survival rate

Daily survival rate was estimated using Eq. 7.

$$ \delta (t) = {S_{op}} * \exp {\left[ { - \frac{{T(t) - {T_{op}}}}{{Va{r_{Sur}}}}} \right]^2} $$
(7)

Where:

T(t) :

Average daily temperature

T op :

Optimal temperature for survival

S op :

Survival rate at T op

Var Sur :

Variance of function

The approximate shape of the survival rate function relative to temperature resembled the Gaussian function. This is a common functional form for mosquito survival (e.g., Otero et al., 2006) and was supported qualitatively by field and laboratory trials, The same equation was used to estimate δ I and δ A (above); however, S op, T op and Var sur for immature life stages and adults were different. The survival rate for immature cohorts was dependent on the average temperature over the cohort development period (egg laying to adult eclosion). In the models t represented 1 day. The boundaries for the rate parameterization were estimated by comparing the numbers of larvae that developed from the 1st to 4th instar larvae in the field study.

Moisture index

A moisture variable, M(t) was developed to take into account the cumulative impact of rainfall and evaporation on larval habitats. This index was created by summing the daily difference between precipitation, P, and evaporation, E, (mm) over the preceding 7 days.

$$ M(t) = \sum\limits_{D = t - 6}^t {P(D) - E(D)} $$
(8)

Egg laying rate

Rainfall can have two principal impacts on mosquito population dynamics: (1) the increased near-surface humidity associated with rainfall may enhance mosquito flight activity and host-seeking behavior, and (2) rainfall can alter the abundance and type of aquatic habitats available to the mosquito for deposition of eggs and the subsequent development of immature stages (Shaman and Day 2007).

The “Egg Laying Rate” function was dependent on the moisture index [(M), Fig. 5] and took a form similar to that of the survival rate. In principle, we assumed there was a positive correlation between egg laying rate and M.

$$ \beta (t) = {{\hbox{R}}_{\rm{Base egg}}} + \frac{{{{\text{E}}_{{ \max }}}}}{{1 + \exp \left[ { - \frac{{\left( {{\text{M}}\left( {\text{t}} \right) - {{\text{E}}_{\rm{mean}}}} \right)}}{{{{\hbox{E}}_{\rm{var}}}}}} \right]}} $$
(9)

Where:

E max :

Maximum egg laying rate above baseline

E mean :

Value at which the moisture index produces 50% of E max

E var :

Variance of function

R Base egg :

Minimum constant fecundity rate

Fig. 5
figure 5

Correspondence between a observed moisture index, and b calculated egg laying rate for a representative season

RBase egg was included because we assumed that some oviposition activity occurred even when the moisture index was low (i.e., during periods with minimal or no rainfall). Our assumption of the egg laying rate value was based on laboratory observations within a reasonable range.

Diapause by photoperiod

Initially, the model did not include the influence of photoperiod and diapause. As a consequence, the simulated mosquito population always peaked at the end of the actual mosquito season. Based on work reported by Spielman (2001) with Cx. pipiens in the Northeast US, an estimate of the percent of diapausing adults was added as a function of decreasing hours of daylight (H D).

$$ Diapau\sin g{ }fraction = - {\hbox{k}} * {H_D}(t) + b $$
(10)

The effect of photoperiod started in mid-August when H D dropped below 13 h. The parameter b was set at 15 times k to constrain the fraction at zero for 15 h daylight as shown by Spielman (2001).

Parameterization

Estimates of the function parameters were based on the most relevant information from the literature, and estimates of adult and immature survival from field and laboratory trials for New York strains of Culex species. However, under natural conditions, these values for the model survival, egg-laying and diapausing parameters might be suboptimal. Additionally, some parameters, such as egg laying rate as a function of moisture index, were impossible to determine experimentally. In order to obtain a more accurate model, the parameter space was explored to calibrate all functions with the exception of development rate, which was based solely on laboratory data (Table 3). This calibration was conducted by maximizing the correlation r between adult population from the model output and adult capture from traps using multiple years of data from the two sites.

Table 3 Parameter space search estimates for parameters used in the model along with their upper and lower limits, and the best values. DD Degree day

Model simulations were run on the Earth and Atmospheric Sciences server at Cornell University (Linux AMD Opteron 64 bit). Using site-specific climatic data, the output of each run consisted of simulated adult mosquito population numbers over multiple years. For each run, a different set of parameter values were obtained by a scheduling algorithm where each parameter was scanned by intervals within the boundary. Table 3 presents the parameter and boundaries for both the development rate and degree day models. The highest correlation r and its associated parameters were recorded and retained during the simulation (Tables 3, 4).

Table 4 Model performance for development rate model and DD models based on comparison of simulated abundance and observed trap capture

Assessment of model performance

Capture data provide only relative measures of within-season mosquito population variation as indicated by Figs. 1 and 2. However, unequivocal data are not available. As a consequence, model performance was assessed based on the correlation r between adult population numbers from the model output and the adult capture from traps using multiple years of data. Wong et al. (2008) used a similar approach in conjunction with a genetic algorithm (GA) to obtain an objective fix on the position of tropical storm eyes. Correlation also formed the basis for tuning a GA to predict rainfall intensity (Tan et al. 2008). In both cases the GA was similar to the scheduling algorithm used here.

This approach was also necessary given the large differences in magnitude between trap counts and the total population simulated by the model. The difference in magnitude varied by site, with the simulation results as much as 50 times higher in magnitude than trap counts at site 2 (Maryland) and 20 times higher at site 1 (New Jersey). Such discrepancies are not unexpected, Bell et al. (2005) found Mosquito Magnet traps captured up to 30 times more mosquitoes than did nearby NJ light traps. Similar between-trap differences were noted by Meeraus et al. (2008). Cooperband and Carde (2006) reported trapping efficiencies of less than 10% in wind tunnel experiments using a known number of released Cx. quinquefasciatus.

In addition to the correlation between 7-day average capture and simulated abundance, model performance was also assessed based on the timing of peaks in the simulation and catch series. Peaks represent critical points of mosquito abundance, which may be associated with elevated disease transmission risk. Consequently, correspondence between the timing of observed and simulated peaks is an important measurement for determining the accuracy of the model and its potential for practical application. Prominent peaks above a threshold values were selected from the simulation series as described below (Eisenberg et al. 1995).

Definition of a peak

A peak appears at time t when:

$$ {\hbox{N}}\left( {{\hbox{t - 5}}} \right){\hbox{ \ldots, N}}\left( {{\hbox{t - 1}}} \right){\hbox{ < N}}\left( {\hbox{t}} \right){\hbox{ > N}}\left( {{\hbox{t - 1}}} \right){\hbox{ \ldots, N}}\left( {{\hbox{t + 5}}} \right){\hbox{ and N}}\left( {\hbox{t}} \right) > Peak{ }Threshold $$
(11)

where N(t) is the mosquito population at time t and the Peak Threshold is 70% of the maximum number of adults in the population over the entire season. An 11-day interval (5 days before and 5 days after t) was selected in order to identify the most prominent peaks over time. Shorter intervals failed to identify the occurrence of independent secondary peaks.

Timing of the two highest peaks

Over each year, the timing of the 1st and 2nd highest peaks in the adult simulation and capture series were compared (Table 4). Secondary peaks that were too close to the primary peak were not considered. The distance for excluding such peaks was set at both 10 and 20 days. These criteria assured that the 2nd highest peak was separated from the primary peak by at least 10 or 20 days.

Model simulation

To check the reliability of the model, two different approaches were used. First, a 6-year subset of data from site 1 was selected to conduct the parameter space search. After obtaining the best model, the adult mosquito population was simulated for the year that was excluded from the subset and compared it to the actual trap data (Table 5). In the second approach the best model from site 1 parameterized over all 7 years was applied to simulate mosquito capture during a different year at an independent location (Freeville Farm, NY).

Table 5 Results of correlation between simulated population in year n and capture based on a 6-year simulation with year n excluded for site 1

Results

Model performance

Table 4 presents correlation coefficients and differences in the timing of peaks between the simulations and mosquito trap data. These comparisons were made for both the development rate and DD models for both sites. In general, the basic patterns of seasonal abundance and mosquito population peaks were correctly simulated, as was the timing of commencement and last appearance of mosquitoes based on the first and last captures. Details of the performance for each model type are presented below. Optimal values for each parameter are presented in Table 3.

Development rate model

New Jersey (Site 1)

The best model for site 1 simulates adult populations with average r values of 0.6728 over 7 years (Fig. 6a), where r is the correlation between adult population numbers from model simulation and actual adult capture data from surveillance traps (Table 4). Correlation of this magnitude agrees well with the between-trap correlation for sites in Middlesex County.

Fig. 6
figure 6

Comparison of simulated mosquito abundance (solid line) and capture (dotted line) at a North Brunswick, NJ (Site 1), and b near Cambridge Maryland (Site 2) based on the development rate model

This model also simulated population peaks within 5 days (Table 4). Simulated population peaks for this site were well matched with the actual trap data in both timing and relative amplitude (except for 1991) (Fig. 6a). The years with the highest simulated abundance (1990, 1992, 2001) matched the years in which capture was the highest. Similarly, lower capture in 1993 and 1997 was reflected by lower simulated abundance. Despite relatively high capture in 1991 compared to the other years, the simulated abundance in that year was the lowest. Perhaps due to the low moisture index, the 1991data did not accurately represent the true availability of mosquito breeding sites.

Maryland (Site 2)

The correlation of the site 2 model with the capture data was lower (Fig. 6b) than site 1. The average r value was 0.5797, which again is typical of the correlation in capture from separate nearby traps.

The timing of simulated population peaks matched the trap data well, but was not as close a match as for site 1. There was also generally good agreement in the year-to-year variations in relative abundance and the first and last seasonal appearances of adults. Relatively low capture in 1977, 1983 and 1985 was reflected by low simulated abundance. Higher capture in 1982 and 1984 also corresponded with relatively high simulations. However, the high simulated abundance in 1977 was associated with relatively low capture.

Degree day model

New Jersey (Site 1)

The DD model for Site 1 did not perform as well as the development rate model (Fig. 7a). The average correlation (r) for this model was 0.5347, still in line with the between-trap values for observed data (Table 4). Unlike with the development rate model, with the DD model, simulated peaks early in the season matched the timing of peaks in the trap data well, but late peaks did not match (Fig. 7a). The DD model simulations also did not reflect the observed year-to-year variations in capture, or the start and, particularly, the end of seasonal mosquito collections as well as did the development rate model.

Fig. 7
figure 7

Comparison of simulated mosquito abundance (solid line) and capture (dotted line) at a North Brunswick, NJ (Site 1), and b near Cambridge Maryland (Site 2) based on the DD model

Maryland (Site 2)

As with site 1, the development rate model for site 2 showed a better correlation with the trap data than the DD model (r = 0.5175). This model tended to simulate population peaks weeks earlier than actual capture data (Table 4) and did not provide a good simulation of the interannual variations in capture, or the start and end of adult mosquito activity (Fig. 7b).

Independent validation

Results for the development rate model at site 1 using the approach where each year was dropped from model optimization and subsequently simulated are presented in Table 5. A high correlation between the trap data and simulated year was obtained using this approach, but the strength of this correlation varied depending on which year was dropped from optimization. Nonetheless, the correlation between the simulations and capture was of a similar magnitude to that of capture at sites within the county.

In the second approach, the model for site 1 was applied to simulate mosquito abundance at the independent Freeville, NY site. Local weather data collected at the site were used in the model. Co-located CDC trap and NJ trap data were compared with the simulated adult populations. The capture data were smoothed by averaging over 7 days to match the averaging used in the simulations. The model predicted population dynamics well, with correlations comparable to the development sites (r = 0.70 for simulated versus maximum capture from the CDC and NJ traps) and reasonable timing of peaks (Fig. 8). Figure 2 highlights the difficulties associated with the available trap data, as the day-to-day correlation between the two co-located traps was low during many periods.

Fig. 8
figure 8

Comparison of development model simulation (dotted line) and capture (solid line) for a CDC trap, and b NJ trap capture at Freeville, NY during 2006

Discussion

Two different models to simulate Culex mosquito vector populations in the northeastern US were developed. The models focused on the development of Northeast strains of Cx. pipiens and Cx. restuans mosquitoes. Model parameters were optimized using long-term data sets from locations in the northeastern US. These models perform well in simulating the timing of Culex mosquito populations and the pattern of within-season mosquito abundance as indicated by capture data. However, the model simulations of abundance consistently exceed the capture counts. This is not unexpected given limited trap capture efficiencies. Correlation between simulated abundance and capture, however, agreed with that of capture between traps located across a county-size area (e.g., traps separated by tens of kilometers). The nuances of site-specific meteorological and environmental conditions make it difficult to simulate abundance at the microhabitat level. However, at a larger spatial scale that is better represented by available operational meteorological data, key components of the annual cycle of mosquito abundance such as early summer onset, fall decline and within season abundance peaks and lulls are simulated by the model.

These models differ from existing models based on the inclusion of variables such as the moisture index, which takes into account the effect of water deficit, precipitation and evaporation on mosquito populations. A function to express egg production included a baseline egg production rate that could be adjusted upward in response to an increase in the moisture index. The addition of this function, especially with the baseline egg production rate, improved earlier versions of model significantly, as did the inclusion of a function for the impact of diapause. Unlike most published models, which are based on relatively short (e.g., 3-year) development samples (Ahumada et al. 2004; Shone et al. 2006), up to 7 years of data were used to tune the model parameters developed in this study.

Assessment of development rate and DD models indicated that the development rate model performed better. When applied to simulate mosquito populations at a new site in 2006, the development rate model obtained high predictive power. Better results were obtained when simulating the beginning of the mosquito season and overall correlation with capture using the development time estimates from laboratory studies of local geographic stains of Cx. restuans rather than the average time from both Cx. pipiens and Cx. restuans. This was most likely due to greater early season abundance of Cx. restuans in the Northeast than Cx. pipiens. This difference in activity highlights the importance of optimizing model parameters for relevant mosquito species and geographic strains. Unfortunately this was not possible given the constraint that these two species were grouped in the available long-term capture.

In the New York and New Jersey sites, Cx. pipiens and Cx. restuans are the predominant Culex species. Two other species, Cx. territans and Cx. salinarius, are present, but rarely collected, in NJ traps or CDC traps in the region. In Maryland, Cx. pipiens and Cx. salinarius were the dominant species in the collection data. This led to problems with application of the model to the more southerly location.

Although the DD model was expected to be more adaptable to other regions than the development rate model, it did not perform as well. This was especially apparent when applying the model developed for one site (for example, site 1) to another site (such as site 2). This lack of good correlation may have been due to geographic differences in temperature-dependent development of local vector species. In addition, the presence of Cx. salinarius at the Maryland site may have contributed to the lack of fit with this data. Cx. salinarius is present, but rare, in our collections from Central New York. In addition, there can be significant bias and variation in the composition of mosquito collections based on the type of trap and spatial location of traps within different habitats (Brown et al. 2008).

The development rate model for predicting adult Culex mosquito populations in the Northeast US is robust. It was capable of simulating population trends in terms of the timing of abundance peaks and the weekly variations of capture data in each of 7 years characterized by different climatological conditions. The model was also able to simulate the relative year-to-year variations in the capture data. It was difficult to directly compare simulated abundance with the actual population, as the relationship between the available capture data and total abundance is complex and not readily obtainable.

Further development of this model will include incorporation of vector-borne pathogen parameters for estimating risk and the development of user-friendly interfaces for local scale adaptation. In addition, this model may be useful for conceptually understanding the relative impact that climate change may have on mosquito habitats and the need for scaling data from regional areas to the level of mosquito microhabitats.