Introduction

Seasonal depletion of dissolved oxygen (O2) from coastal waters is a widespread phenomenon that appears to be growing globally (Díaz and Rosenberg 2008; Rabalais and Gilbert 2009). There is mounting evidence that eutrophication is contributing to the expansion of occurrence, intensity, and duration of hypoxic (“hypoxia” = O2 < 62.5 μM, 2 mg l−1, 30 % saturation) conditions in coastal waters worldwide (Díaz and Rosenberg 2008; Kemp et al. 2009). Short- and long-term patterns and trends in climatic forcing, however, also exert control over O2 in bottom waters (Justíc et al. 2005; Scully 2010; Wilson et al. 2008). Considering the potential impacts of hypoxia on the behavior, growth, and mortality of many marine fish and invertebrates (Brady et al. 2009; Vaquer-Sunyer and Duarte 2008); predator–prey interactions and food web structures (Decker et al. 2004; Nestlerode and Diaz 1998); and biogeochemical processes (Conley et al. 2002; Kemp et al. 1990; Testa and Kemp 2012), this phenomenon has received increasing attention in recent decades.

It is well known that O2 concentrations are gradually depleted from deeper waters of Chesapeake Bay from late winter until early summer, resulting in the hypoxic and anoxic conditions that persist from mid-June–September (Hagy et al. 2004). O2 concentrations usually reach their seasonal peak in January–February, when solubility is high, vertical mixing strong, and O2 consuming processes are temperature-limited (Taft et al. 1980). Beginning in February and March, O2 concentrations in deeper water (>10 m) tend to follow a relatively linear decline until June at rates ranging from ~1 to 5 mmol O2m−3 day−1 (Boynton and Kemp 2000). Such rapid O2 declines lead to the onset of hypoxic and anoxic conditions as early as April and June, respectively, near the head of the hypoxic zone (39°N, −76.3°W). As the summer progresses, hypoxic conditions expand southward, reaching as far south as 37.5°N by July.

Previous studies have suggested that the rate of spring O2 decline and the onset of hypoxia are highly variable from year to year (Boynton and Kemp 2000; Hagy et al. 2004). It has been suggested that such high interannual variability in the seasonal development of O2-depleted bottom waters is driven by a suite of biological and physical variables, including the magnitude of the spring diatom bloom, spring water temperature, and winter–spring river flow and wind conditions (Boynton and Kemp 2000; Hagy et al. 2004; Officer et al. 1984; Scully 2010). The response of O2 depletion to these various drivers is almost certainly nonlinear. For example, elevated winter–spring river flow should favor O2 depletion by increasing vertical stratification and reducing vertical mixing (Boicourt 1992), and by elevating phytoplankton growth (Malone et al. 1988). Higher river flow will, however, limit O2 depletion by enhancing landward, longitudinal inputs of O2 (Kemp et al. 1992; Kuo et al. 1991) and pushing spring bloom phytoplankton biomass to seaward regions of the Bay (e.g., Hagy et al. 2005).

Despite the high variability and potential importance of hypoxia initialization in spring, most of the hypoxia research in Chesapeake Bay has focused on the summer (June–August) period when the volume of hypoxia is at its seasonal peak (Hagy et al. 2004; Murphy et al. 2011; Scavia et al. 2006). In part, this summer focus is due to the availability of historical O2 data (1950–1980, Hagy et al. 2004) and an emphasis on summer hypoxic effects on living resources (Breitburg 2002). Consequently, the spatial and temporal patterns in the development of hypoxia during spring and the processes that control these patterns are poorly understood. This is an important gap in understanding for a number of reasons, including the fact that previous investigators have suggested that the spring bloom is the primary source of organic matter driving respiration and summer hypoxia (e.g., Malone 1987; Pomeroy et al. 2006). Although previous studies have addressed questions on the magnitude and controls on spring O2 depletion (Boynton and Kemp 2000; Hagy et al. 2004; Taft et al. 1980), a comprehensive analysis of spatial and temporal patterns has heretofore been lacking.

The objective of this study was to utilize a large and spatially and temporally resolved dataset for O2 concentrations (and other key variables) to understand the dynamics of seasonal hypoxia development in Chesapeake Bay. We used concentration time series along the Bay axis to compute rates of water column O2 uptake, the date of hypoxia initiation, and the volume of hypoxia. Regional and seasonal variations in these processes were also examined. Variations in these rates and dates were related to a suite of physical and biological variables and to summer hypoxia extent, providing a basis for understanding key controls over space and time.

Methods

Vertical Profiles of Concentrations

Vertical profiles of O2, water temperature, salinity, and chlorophyll a were obtained from the Chesapeake Bay Program Water Quality database for the 1985–2009 period (http://www.chesapeakebay.net/data_waterquality.aspx), with profiles collected at 20 stations located along the estuary’s central channel (Fig. 1, Table 1). Measurements of O2, temperature, and salinity were generally made at 1-m depth intervals, while measurements of chlorophyll a were made at 4–5 depths for each station (2–10 m intervals). Profiles were generally sampled monthly between November and March and bimonthly between April and October.

Fig. 1
figure 1

Map of Chesapeake Bay with bathymetry (see “Depth” key) and major tributaries included (also note horizontal “Distance” key). Circles indicate the location of monitoring stations, where the “CB” prefix to each number have been omitted. Open square indicates the location of station LE1.2, where continuous observations of dissolved O2 were made. Asterisk indicates the location of the Patuxent River Naval Air Station (PNAS)

Table 1 Monitoring station characteristics and hypoxia metrics in the modern (1985–2009) Chesapeake Bay Program-MD Department of Natural Resources dataset

Climatic Data

Daily Susquehanna River flow into the Chesapeake Bay was obtained from the USGS Chesapeake Bay River Input Monitoring Program website for the years 1985 to 2009 (http://va.water.usgs.gov/chesbay/RIMP/). Wind direction and speed were collected from the Patuxent Naval Air Station (PNAS) near the mouth of the Patuxent River in Maryland (Fig. 1). The PNAS data were used because they represent the most centrally located site relative to the main stem of the bay. The wind direction was calculated from the hourly north and east wind components after the data are filtered with a 36-h low-pass filter. In computing average wind speeds, only those records where wind blew with a speed greater than 2 m s−1 were used. Wind directions were categorized into eight compass directions (e.g., Murphy et al. 2011).

Stratification

We computed pycnocline depth for each region as the vertical position in the water column (depth (meter), z) where the square of the Brunt–Väisälä frequency (N   2) was at its maximum value (e.g., Pond and Pickard 1983) and that value was used as an index of pycnocline strength, where

$$ {N}^2(z)=\frac{g}{\sigma_z}\cdot \frac{\partial \sigma }{\partial z} $$

and g = gravitational constant, 9.81 m s−2, σ z is the water density at depth (in kilogram per cubic meter), and \( \frac{\partial \sigma }{\partial z} \) is the density gradient at depth z, which was calculated using a 2-m window around z. Density was computed from profiles of temperature and salinity data (Fofonoff 1985) at 1-m depth intervals using the Chesapeake Bay Program Water Quality database.

Interpolations

We interpolated spatial distributions for O2, water temperature, salinity, and chlorophyll a concentrations to a 2D length–depth grid using ordinary kriging (Murphy et al. 2010; Murphy et al. 2011). The statistical package R (R Development Core Team 2009) with the geoR package (Ribeiro and Diggle 2009) was used for all interpolations (Murphy et al. 2011). The resulting 2D distributions were assumed to be constant laterally at a given depth and organized to correspond to tabulated cross-sectional volumes (Cronin and Pritchard 1975). For O2, interpolated concentration data were multiplied by the cross-sectional volumes to compute “hypoxic volumes” for all available profile sets in the years 1985–2009 by summing the volume of all cells with an O2 concentration <62.5 μM.

O2 Depletion Metrics

The first day of the year where bottom water O2 concentrations fall below 62.5 μM provides the most straightforward index of the tendency for hypoxia to occur in a given year. This date was calculated by (1) averaging bottom water O2 concentrations for each sampling date, (2) interpolating temporally through time to extrapolate fortnightly and monthly data to a daily concentrations using shape-preserving piecewise cubic interpolations (i.e., the interpolation is monotonic when data are monotonic and no artificial maxima or minima are generated; Fritsch and Carlson 1980), and (3) calculating the day where interpolated O2 fell below 62.5 μM (Fig. 2). Hypoxia onset dates were calculated for each station in each year from 1985 to 2009.

Fig. 2
figure 2

Seasonal cycle of bottom water O2 saturation and concentration at CB5.1 (Fig. 1) in 2004 and illustration of how rate water column O2 depletion and date of hypoxia onset were derived from the time series data (see text)

Fig. 3
figure 3

Time series of bottom water O2 concentration at two stations near the upper (CB3.3C) and lower ends of the Chesapeake Bay hypoxic zone over 1985–1995. Gray shaded area indicates the hypoxia concentration threshold (62.5 μM)

The rate of O2 depletion was calculated as the slope of a linear temporal decline in O2 concentration (non-interpolated) between March and May for each year between 1985 and 2009. Because maximum depth varies by station (Table 1), the O2 concentrations used were from the deepest depth sampled at a particular station, which does not generally vary by more than 1 m. For each year and station, the slope was calculated by a linear regression model fit to the March–May time series of averaged O2 concentrations versus time (Fig. 2). The slope represents a daily O2 depletion rate, and it reflects the effects on O2 of various biological, physical, and chemical mechanisms. We made the same computations on time series of O2 deficit (O2Saturation − O2Observed) to examine the impacts of changes in solubility (driven primarily by temperature) on our depletion rates (data not shown); these slopes were highly correlated with the slopes from the observations (r > 0.90). For a few years, the O2 time series from March to May was not linear (Fig. 3). This nonlinearity was most often the result of a single, short-term (daily to weekly) increase in concentration due to landward advection or vertical mixing of high-O2 water (e.g., Fig. 4). Correlation coefficients (r values) for the linear model fits to observed concentrations generally exceeded 0.90 at all stations/sites.

Fig. 4
figure 4

Comparison of 6-week time series of bottom water O2 (16 m deep) measured continuously (YSI data sonde every 15 min, small gray lines and circles) and measured fortnightly (14 days, open circles/dashed line) at a station in the lower Patuxent River estuary in the spring of 2004 (Station LE1.2, 38.38°N, −76.51°W, see Fig. 1)

Data used in this analysis, which are based on routine measurements at 2–4 week intervals, are likely to miss some short-term (i.e., hourly–daily) dynamics in O2 concentrations. Continuous dissolved O2 measurements (sampling at 15-min intervals) were made from a fixed buoy deployed adjacent to a routine monitoring station in the lower Patuxent estuary (LE1.2) during spring of 2004 (Figs. 1 and 4; Alliance for Coastal Technologies (ACT), http://www.act-us.info/). Comparisons of these two datasets reveal how different scales of O2 variability are captured by the two sampling intervals. Rate of O2 depletion (ROD) and date of hypoxia onset (DHO) calculated from these two datasets (i.e., LE1.2 and ACT) over the April to May period are similar, despite the fact that several multi-day periods of high variability in O2 occurred between the fortnightly samples (where the fortnightly samples are similar to those used for the majority of calculations in this study; Fig. 4). High-frequency data (15-min) were not available for the mainstem Bay, where we focused our study.

Statistical Analysis

We investigated the potential for multiple interacting controls on the DHO and the ROD using simple linear regression and multiple linear regression models (SASv9.2, PROC REG). The independent variables used to construct the models included mean January to May Susquehanna River flow, March–May wind speed and direction, January to April bottom water chlorophyll a, March to May water temperature, and April and May pycnocline strength (N 2) and vertical position (depth of maximum N 2). The chosen periods of aggregation for river flow and chlorophyll a include 1–2 months prior to the time of the O2 depletion calculations, as these drivers may have delayed effects, while wind and stratification metrics, which have more immediate impacts on O2, were aggregated during the period of analysis. Three multiple linear regression approaches were used, including r 2 selection of all predictor sets, forward selection, and backward selection. Selection of optimal models was based on maximizing the adjusted r 2 of the model and minimizing the Mallow’s CP and Akaike information criterion. We tested for normality of residuals using Shapiro–Wilk’s W and box and normal probability plots. Multicollinearity of predictor variables was examined via variance inflation, condition indexes, and eigenvalues of predictor variables. We used correlation analysis to compare estimates of DHO and ROD to estimated volumes of hypoxia made for latter-month (the second of two monthly cruises) data for May to August cruises.

Results

Temporal and Spatial Patterns in O2

Bottom water-dissolved O2 varies seasonally in Chesapeake Bay, with maximum values in February and minimum concentrations typically occurring between May and August (Fig. 3). In most summers, anoxic concentrations occur in upper and middle Bay stations (CB3.3C to CB5.2), while hypoxic or near-hypoxic concentrations occur in the lower Bay (e.g., CB6.2) and anoxia is rare (Fig. 3). Time series of bottom water O2 at the northern and southern ends of the hypoxic zone stations illustrate consistent seasonal patterns with moderate interannual variability in O2 minima and episodic O2 replenishment during summer in some years (e.g., 1988 in Fig. 3).

The 25-year (1985–2009) monthly mean (March–July) spatial distributions in dissolved O2 reveal the development of hypoxic water over the March–May period, as well and the seaward spread of low-O2 water along the Bay axis during spring (Fig. 5a). Parallel time/space patterns in the variance around these mean conditions are also revealing (Fig. 5b). O2 first begins to decline in bottom water within the upper Bay between 200 and 300 km from the Atlantic Ocean in March–April (Fig. 5a). From this location, O2 further declines through June and the location of the low-O2 water mass appears to migrate seaward over the course of spring, where it is generally retained below 10–12 m depth (Fig. 5a). The standard deviation in O2 around these 25-year means also reveals both spatial and seasonal shifts in O2 variability. In March and April, O2 is most variable in the upper Bay where O2 is initially depleted and where vertical gradients in O2 concentration are strong (Fig. 5b). In contrast, O2 variability is highest in the middle of the water column in May and June (in the vicinity of the pycnocline) throughout most of the central channel of the Bay (Fig. 5b). The standard deviation reaches a seasonal and regional peak near the mid-Bay pycnocline in June, where vertical gradients in O2 are also at seasonal peaks.

Fig. 5
figure 5

Two-dimensional isopleths depicting distributions of dissolved O2 concentration with depth and distance along the Bay axis over the years 1985–2009 for 4 months (March, April, May, June) for the 25-year mean values (a) and standard deviations around the means (b)

Hypoxia Onset (DHO) and O2 Depletion Rates (ROD)

The estimated DHO was highly variable from year to year at a given station, but the spatial pattern was consistent. For example, at station CB3.3C near the Bay Bridge (Fig. 1), the date that O2 fell below 62.5 mM ranged from April 14 to May 31 (46 days) between 1985 and 2009, but hypoxia always initiated here before it arrived at other stations further south (Fig. 6a). Each year from 1985 to 2009, hypoxia developed in the most landward bottom waters of the Bay (near CB3.3C in Fig. 1), but in only half of those 25 years did hypoxia develop as far south as the mouth of the Rappahannock River (CB6.1). After initiating at station CB3.3C, hypoxia gradually developed at seaward stations over the ensuing 4 months, reaching the Rappahannock River by late June/early July (Fig. 6a). The median time it took hypoxia to expand from CB3.3C to CB5.5 was 58 days, although in some years, the travel time was in excess of 3 months. The latest DHO recorded was on September 17, 1991 at station CB6.2. Although the spatial pattern in DHO was relatively unchanged in high- versus low-flow years, differences between high and low Susquehanna River flows were greater north of the Potomac River (CB3.3C to CB5.2) than south (CB5.3 to CB6.2; Fig. 7). When DHO data were divided into two groups (five highest and five lowest Susquehanna River flow years), hypoxia initiated 15–25 days earlier in the years of highest flow than in the lowest flow years, while DHO was significantly earlier in the high-flow years at stations CB3.3C–CB4.3C and CB5.3 (p < 0.05, Mann–Whitney U test; Fig. 7a).

Fig. 6
figure 6

Box plots of hypoxia onset day (a) and water column O2 depletion rate (b) at 12 stations along the Chesapeake Bay axis for the years 1985 to 2009 (see Fig. 1)

Fig. 7
figure 7

Patterns of date of hypoxia onset (a) and rate of water column O2 depletion (b) shown as mean values (± standard errors) for the 5 years of highest (open circles) and 5 years of lowest (closed circles) winter–spring Susquehanna River flow. Asterisks between lines indicate significant differences between high-flow and low-flow groups

The water column O2 depletion rates computed in this study were comparable to those made in previous studies using a similar approach in Chesapeake Bay at select locations (Table 2). Overall, interannual variability in the ROD was relatively greater than that of hypoxia initiation, but the spatial patterns and flow responses were similar. Whereas DHO was earlier in landward regions of the Bay, O2 depletion rates were higher in this region (Fig. 6). Median water column O2 depletion rates were similar north of the Potomac River (CB3.3C to CB5.2) but declined in seaward regions and ROD at stations CB3.3–CB5.1 was significantly higher than CB6.1–CB6.2 (p < 0.05, ANOVA with Scheffe test). O2 depletion rates were generally insensitive to Susquehanna River flow in the upper (CB3.3.C–CB4.1C) and lower Bay stations (CB5.3–CB6.4), but rates were elevated (by 30 %) in high-flow years in middle Bay regions (CB4.2C–CB5.2; Fig. 7b). No significant differences were detected between the means when O2 depletion rates were divided into two groups based on Susquehanna River flow (p > 0.05, Mann–Whitney U test; Fig. 7b), although p values were less than 0.2 at stations CB4.2C to CB5.2. The highest ROD recorded was 5.31 mmol m−3 day−1 at station CB5.4 in 2003.

Table 2 Comparisons of rates of water column O2 depletion from this study with previously estimated rates using several approaches

Winter–spring chlorophyll a concentrations explained more variability in DHO and ROD than physical variables in upper and middle Bay regions (Tables 3 and 4). Although Susquehanna River flow was significantly correlated with DHO and ROD during several winter–spring periods (Tables 3 and 4), we sought explanatory variables that were more proximal to biological or physical processes. Bottom water chlorophyll a concentrations averaged over the January to April period were significantly and negatively correlated to DHO (r = −0.47 to −0.73, p < 0.05; CB4.1C–CB5.2) and positively related to ROD (r = 0.43 to 0.51, p < 0.05; CB4.2C–CB5.2). In contrast, the Brunt–Väisälä frequency was significantly and negatively correlated to DHO in only lower Bay stations (r = −0.43 to −0.78, p < 0.05; CB5.2–CB5.4) and was positively correlated to ROD at select lower Bay stations (Table 4).

Table 3 Table of correlation coefficients (r, top value) and associated p values (bottom value) for correlations between the date of hypoxia onset (DHO) and potential controlling variables, including Susquehanna River flow, April–May Brunt–Väisälä frequency, and bottom chlorophyll a concentration
Table 4 Table of correlation coefficients (r, top value) and associated p values (bottom value) for correlations between the March to May water column O2 depletion rate (ROD) and potential controlling variables, including Susquehanna River flow, Brunt–Väisälä frequency, water temperature, and bottom chlorophyll a concentration

Multiple linear regressions resulted in improved predictability of DHO and ROD at only 2 of the 12 stations examined (CB4.3C and CB5.2), given the variables we included in the analysis (e.g., Fig. 8). For example, at station CB5.2, both January to April bottom water chlorophyll a and April to May mean Brunt–Väisälä frequency (maximum value in the water column) were significant predictors of DHO (Fig. 8). For the majority of stations, however, only a single predictor variable explained a significant fraction of variability in the O2 depletion metrics (Tables 3 and 4). Mean wind speed, wind direction, water temperature, and pycnocline depth were not significantly correlated (p > 0.1) to interannual variability in DHO or ROD.

Fig. 8
figure 8

Correlations of a January to April bottom water phytoplankton biomass (chlorophyll a) and b April to May mean stratification strength (Brunt–Väisälä frequency, N 2) with date of hypoxia onset at station CB5.2 (see Fig. 1). c Observed date of hypoxia onset plotted against predictions from a multiple linear regression with chlorophyll a and N2

Winter–Spring O2 Depletion and Summer Hypoxic Volume

One of the motivating questions in this effort was to understand the relationship between winter–spring O2 depletion and summer hypoxic volume. That is, does the onset and spatial distribution of hypoxia in spring predetermine and thus predict the summer condition? We calculated the correlation between DHO and ROD and Bay-wide hypoxic volumes computed for May, June, and July (Fig. 9). DHO and ROD were weakly correlated with hypoxic volume during the mid- and late-summer (p < 0.05; late July and August; August data not shown) period, but were significantly correlated to May, and to a lesser extent, June hypoxic volumes (p < 0.05; Fig. 9). DHO and ROD at CB4.3C to CB5.4 were most strongly correlated to May and June hypoxic volumes, while these metrics at upper and lower Bay stations correlated weakly with hypoxic volume during May and June (Fig. 9).

Fig. 9
figure 9

Correlation coefficient (r) for relationships between hypoxic volume (during the latter half of May, June, and July) and a hypoxia onset day and b rates of water column O2 depletion as they vary along the main Chesapeake Bay. Asterisks near the circles indicate significant correlations

Winter–Spring O2 Depletion in 2012

In September of 2011, precipitation associated with Tropical Storm Lee (hereafter TS Lee) resulted in extraordinarily high Susquehanna River flow and suspended sediment loading rates to Chesapeake Bay. The peak Susquehanna flow following Tropical Storm Lee was 22,031 m3 s−1 (Cheng et al. 2013), which was the largest flow since Tropical Storm Agnes in June 1972 (~32,000 m3 s−1) and is about 18 times the average annual flow. Flow resulting from TS Lee was estimated to scour 3.6 × 109 kg of sediment from the lower Susquehanna River reservoirs (http://chesapeake.usgs.gov/featuresedimentscourconowingo.html) delivering this and other material to Chesapeake Bay. During the subsequent winter–spring (2012), hypoxia onset in the upper Bay occurred earlier than previously recorded in the available monitoring data (Fig. 10), as water column O2 concentrations were at record-low levels (Fig. 11b, c). This record DHO (April 6) was much earlier than expected from January to April mean bottom water chlorophyll a at stations CB3.3C and CB4.1C, given patterns from the previous decades (Fig. 10). Such early-season depression of O2 may have been driven by respiration of organic matter delivered during the storm and retained in the upper Bay over the winter. In addition, record-high bottom water temperatures were observed in the winter–spring of 2012 (Fig. 11a), and April particulate organic carbon (POC) and chlorophyll a concentrations (Fig. 11d) were 1.5–2 times the long-term mean at these stations. Stratification strength was also relatively high during the winter of 2011–2012, where maximum N 2 at CB4.1C was 0.015 and 0.01 s−2 in February and March, respectively, which is near the long-term (1985–2009) maxima for these months.

Fig. 10
figure 10

Correlations between January to April bottom water chlorophyll a and the date of hypoxia onset (DHO) at stations a CB3.3C and b CB4.1C for the 1985–2009 period (open circles) and for the year 2012 (closed square)

Fig. 11
figure 11

Seasonal cycles of monthly mean water quality conditions in the bottom waters over 1985–2009 at station CB3.3C in Chesapeake Bay (see Fig. 1): for a temperature; b oxygen solubility; c O2 concentration; and d particulate organic carbon (POC). Mean values are shown as a solid line, the shaded area in the top three panels shows the range (i.e., 25-year maximum and minimum value for each respective month), and the shaded area in the bottom panel (i.e., POC) is the 25-year standard deviation (SD was used because maximum values were highly variable). Filled circles are observations from 2012

Discussion

Spatial and Temporal Patterns of O2 Depletion

Although ROD, DHO, and seasonal O2 concentration minima varied spatially in Chesapeake Bay, the pattern of decline in bottom water O2 concentrations over the winter–spring period was generally consistent across stations along the Bay’s longitudinal axis. This pattern is well-described and reflects the combined effects of elevated temperature and associated solubility declines, respiration of spring bloom-generated algal material, and reduced ventilation caused by winter–spring river flows (Boynton and Kemp 2000; Hagy et al. 2004; Taft et al. 1980). A striking regional pattern in O2 decline is the consistent early development of anoxia in the upper Bay stations, but only occasional and short-lived hypoxia in the lower Bay stations (Fig. 3). This results from autochthonous and allocthonous organic material that accumulates in the upper and middle Bay regions in spring (Hagy et al. 2005; Zimmerman and Canuel 2001). Additionally, gravitational circulation limits the oxygenation of upper and middle Bay bottom water via two mechanisms, including (1) reduced exchange with oxygenated surface waters vertical mixing and lateral advection/mixing (Malone et al. 1986; Scully 2010) and/or (2) limited O2 inputs via landward longitudinal advection (Kemp et al. 1992; Kuo et al. 1991). Thus, spatial patterns in winter–spring O2 depletion are driven by a combination of biological and physical processes.

Although these declines in O2 concentration were apparently gradual when analyzing monthly to fortnightly data, higher frequency measurements reveal that this general decline may be occasionally interrupted by brief events of both rapid depletion and ventilation (Fig. 4). Diel variability in these data suggests tidal-mixing effects (Fig. 4), but day-to-week variability is driven by wind-mixing events. For example, O2 increases from April 28 to May 4 (Fig. 4) are associated with parallel salinity decreases, indicating downward mixing of high-O2 and low-salinity water. Although it is likely that such high frequency variability also characterizes the time series in the main channel of Chesapeake Bay (Boicourt 1992; Breitburg 1990), limited data exist to sufficiently examine this variability. At this particular station and sampling period (i.e., Patuxent River in 2004), computations using the high-frequency data (DHO = May 21, ROD = 6.05 mmol m−3 day−1) and fortnightly observations (DHO = May 22, ROD =5.24 mmol m−3 day−1) are comparable, suggesting that fortnightly data reasonably reflect broad regional and seasonal patterns of O2 declines.

Longer term variability in the distribution of O2 reveals seasonally shifting controls on O2 consumption. During March and April in the upper region of Chesapeake Bay where O2 depletion first occurs, the standard deviation of the 1985–2009 mean is highest in deeper water (Fig. 5b), where accumulations of chlorophyll a commonly develop (data not shown) just seaward of the limit of salt intrusion (Sanford et al. 2001). This region is also within the estuarine turbidity maximum, where organic matter and phytoplankton may be concentrated (Lee et al. 2012). In contrast, variability during May and especially June is highest at mid-depth in the middle region of Chesapeake Bay. Such variability may be due to fluctuations in pycnocline location, which commonly occurs around 10 m depth and is characterized by strong gradients in O2 (Murphy et al. 2011). Alternatively, thin layers and/or oxic/anoxic interfaces at or near the pycnocline have been found to be hot spots for particle aggregation and microbial metabolism (Durham and Stocker 2012), potentially driving O2 variation via oxic respiration or chemautotrophy associated with sulfide or ammonium oxidation (Casamayor et al. 2001). This spatial pattern in the primary controlling mechanisms of O2 variability (i.e., organic matter availability in the upper Bay, stratification strength in the lower Bay) is consistent with the statistical analysis of interannual variability in DHO and ROD (Tables 3 and 4).

Interannual Variability in O2 Depletion

Interannual variation in freshwater input is an important driver of biogeochemical processes in estuarine ecosystems, including O2 depletion. Although Susquehanna River flow is statistically the strongest predictor of DHO and ROD at some stations (CB3.3C–CB5.1; Tables 3 and 4), flow impacts both the biology and physics of Chesapeake Bay, and these effects cannot be satisfactorily isolated with simple statistics. The effect of flow on interannual variability in the development and spatial–temporal extent of O2-depleted waters is a result of flow impacts on a suite of biological and physical variables, including the magnitude of the spring diatom bloom, spring water temperature, and associated vertical and horizontal O2 transport (e.g., Hagy et al. 2004; Officer et al. 1984; Taft et al. 1980). For example, elevated winter–spring river flow should favor O2 depletion by increasing stratification strength (Boicourt 1992) and elevating phytoplankton biomass (Boynton and Kemp 2000; Malone et al. 1988). Elevated flow may, however, increase O2 concentrations via enhanced landward, longitudinal O2 transport (Kemp et al. 1992). High flow also tends to push the location of the spring bloom seaward (e.g., Hagy et al. 2005), which should increase ROD in lower Bay regions. ROD was indeed elevated in mid-Bay regions during high-flow years (Fig. 7), suggesting that the O2-depleting effects of elevated organic material and stratification strength may outweigh the O2-replenishment effects of enhanced longitudinal advection, at least in the mid-Bay region. Either way, strong correlations between river flow and DHO/ROD reveal that future changes in precipitation will likely impact the phenology of hypoxia development in Chesapeake Bay.

To avoid interpretation problems associated with the co-variability of freshwater flow and with various biological and physical processes, we focused our analysis on variables that directly represent the functional effect of flow within the estuary (phytoplankton biomass and stratification strength). Winter–spring O2 depletion metrics appear to be driven primarily by phytoplankton-derived organic matter in the upper and middle regions of the Bay. DHO and ROD were most strongly linked to chlorophyll a in bottom water, where phytoplankton biomass tends to accumulate during the winter–spring period (data not shown). Statistical analyses from this study (Tables 3 and 4), as well as previous studies of O2 dynamics in Chesapeake Bay (e.g., Boynton and Kemp 2000) support this assertion. Substantial phytoplankton biomass accumulations associated with the winter–spring bloom (Harding and Perry 1997; Malone et al. 1988) are associated with low grazing rates (White and Roman 1992) and high deposition rates of fresh organic material (Kemp et al. 1999). As temperature begins to rise in spring, high POC concentrations result in elevated respiration and rapid rates of O2 uptake (Malone 1987; Sampou and Kemp 1994). Indeed, O2 depletion rates in Chesapeake Bay were significantly related to chlorophyll a deposition rates derived from sediment traps over the course of several years (Boynton and Kemp 2000), and biomarker studies indicate that sediment organic matter is most labile during the spring bloom (Zimmerman and Canuel 2001). Thus, there is strong evidence for the association of winter–spring O2 declines with the bottom water biomass of phytoplankton over a large section of Chesapeake Bay.

Although the relationships between bottom water chlorophyll a and both ROD and DHO are strong in several regions of the Bay, we do not presume that physical transport effects are small. Several investigators have illustrated the importance of stratification, lateral advection and mixing, and longitudinal advection as important controls on O2 distribution in Chesapeake Bay (Goodrich et al. 1987; Li and Li 2011; Sanford and Boicourt 1990; Scully 2010). Correlation analysis reveals that stratification strength is the strongest predictor of hypoxia onset and O2 depletion at several stations in the middle and lower Bay, and the highest variability in summer O2 concentrations occurs within or adjacent to the pycnocline (Fig. 5b). Stratification strength may be a larger contributor to ROD and DHO in more seaward reaches of the Bay because phytoplankton biomass (Harding and Perry 1997) and deposition (Hagy et al. 2005) tend to be lower in this region, thus limiting the biological influence on O2 depletion. Physical mechanisms may also be of greater importance in summer, when stratification strength reaches its seasonal maxima and vertical gradients of O2 are stronger, at several middle and lower Bay stations (Murphy et al. 2011). Consequently, associations of vertical mixing with hypoxia may be stronger in the lower Bay, where hypoxia onset occurs later in the year when stratification strength is at or near its peak (e.g., June, Fig. 6) (Table 3).

Although there has been limited use of hydrographic data to estimate rates of oxygen consumption in estuaries, there is a long history of such analyses in lake and open ocean research. In lakes, the hypolimnetic oxygen deficit (or hypolimnetic oxygen depletion rate) has long been used as a metric of productivity for evaluating trophic status and for inter-system comparisons (e.g., Burns et al. 2005; Matthews and Effler 2006; Rosa and Burns 1987). In open ocean research, apparent oxygen utilization has been used to estimate the O2 consumption due to biochemical processes relative to a preformed value, which is often the solubility O2 concentration (e.g., Garcia et al. 2005; Pytkowicz 1971). Both metrics utilize relatively easy-to-collect hydrographic data to derive rates and indices of biogeochemical O2 uptake, similar to the ROD methods reported in this paper. Interestingly, long-term (1970–2003) estimates of hypolimnetic volume-corrected oxygen depletion in Lake Erie range from 2.4 to 4.9 mmol O2 m−3 day−1 (Burns et al. 2005), which are less variable, but comparable to ROD reported in this study (Table 2). Although the reason for higher variation in Chesapeake Bay ROD is unclear, this comparison reveals similarities in the rate in which deep-water O2 is depleted in enriched aquatic ecosystems, and emphasizes the utility of such approaches across ecosystems.

Winter–Spring O2 Depletion and Summer Hypoxic Volume

Our metrics of spring O2 depletion rates and dates generally do not correlate with the volume of hypoxic bottom water in the subsequent summer. The only significant correlations that we did find were between late spring (May 1–June 15) hypoxic volumes and DHO/ROD in mid-Bay regions. This suggests that the influence of the winter–spring bloom on hypoxic volume weakens substantially as summer progresses and the spring bloom decomposes. Mid-to-late summer hypoxic volumes were, however, not correlated with DHO and ROD for most Bay regions analyzed, and summer hypoxic volume appears to be a dynamic feature that varies with summer phytoplankton production (Testa 2013) and climatic conditions (Feng et al. 2012; O’Donnell et al. 2008; Scully 2010). One important implication of this observed disconnect between spring and summer O2 conditions is that, in contrast to previous inferences that the spring bloom fuels summer hypoxia (e.g., Malone 1987; Pomeroy et al. 2006), it appears that a large spring bloom does not guarantee large summer hypoxic volumes. This conclusion is supported by various calculations and modeling experiments, which suggest that labile organic matter from spring phytoplankton is largely depleted by summer and that contemporaneous phytoplankton production is necessary for sustained summer hypoxic conditions in Chesapeake Bay (Newell et al. 2007; Testa 2013).

The fact that we did find significant relationships between winter–spring flow and DHO/ROD, but found no such relationship between DHO/ROD and summer hypoxic volume might seem inconsistent with previous reports. For example, strong statistical relationships have been reported for winter–spring flow (and/or TN load) versus summer hypoxic volume (Hagy et al. 2004; Lee et al. 2013; Scavia et al. 2006). Although we did find significant correlations between spring O2 decline (DHO/ROD) and late spring hypoxia, we found no relationship with summer hypoxia. These findings indicate that there is a diminishing connection between spring O2 consumption and the volume of hypoxic water that can be maintained over the course of the summer. Previous studies that have found strong relationships between winter–spring flow (or nutrient loading) and summer hypoxia used temporally integrated measures of hypoxia, including means over the month of July (Hagy et al. 2004; Scavia et al. 2006) or warm-season means (Lee et al. 2013). In contrast, our analysis examined correlations between DHO/ROD and cruise-specific hypoxic volumes computed from data collected at 1–2 week intervals, and recent analyses have revealed that volume and temporal patterns of hypoxia vary systematically from late spring to late summer (Murphy et al. 2011). Thus, we emphasize that the relationships between winter–spring flow (or nutrient loading) and summer hypoxic volume are complex and that further analyses are needed to resolve details of the associated spatial and temporal dynamics of O2 depletion in Chesapeake Bay.

Winter–Spring O2 Depletion in 2012

The onset of hypoxia in upper Chesapeake Bay in 2012 (April 6) occurred 8 days before the earliest onset dates from the previous 26 years (Fig. 10). The April 6 hypoxia onset was also much earlier than expected from January to April bottom water chlorophyll a at stations CB3.3C and CB4.1C, given patterns from the previous decades (Fig. 10). This seemingly unusual pattern in 2012 presents a useful case study for understanding the controls on winter–spring O2 depletion. The passing of Tropical Storm Lee over much of the Susquehanna River watershed in September of 2011 resulted in extraordinary levels of freshwater flow and the highest suspended sediment loads to Chesapeake Bay recorded in 30 years (Hirsch 2012). Although the lability and character of this material is uncertain at this time, modeling studies suggest that large amounts of clay and silt material were deposited to sediments in much of the upper Bay (Cheng et al. 2013). Previous investigators have speculated that organic material deposited to sediments in autumn might remain intact in sediments over the winter (due to low temperatures) and be respired in the following spring (e.g., Boynton and Kemp 2000; Taft et al. 1980).

Observations of bottom water-dissolved O2 at several stations in the upper and middle Bay were at record low levels in the winter–spring of 2012. Although water temperature was historically high during these months (2.1 °C above long-term maximum), associated declines in solubility do not explain the observed low O2 concentrations (Fig. 11a, b). Stratification strength approached the 26-year maxima at stations CB3.3C and CB4.1 during this period (0.01–0.015 s−2), suggesting that high stratification due to low wind stress during March (data not shown) could have led to reduced O2. An alternative (or additional) explanation is that respiration of elevated levels of organic matter was responsible. Indeed, POC and chlorophyll a concentrations in the bottom water at stations CB3.3C and CB4.1C were 1.5 to 2 times the long-term mean in April 2012 (e.g., Fig. 11). Such large pools of POC at these stations, in addition to elevated temperatures to enhance respiration rates (Fig. 11a; Sampou and Kemp 1994), suggest that water column respiration was elevated during winter–spring 2012.

An alternative, yet difficult to assess contributor to reduced O2 concentrations in 2012 is elevated sediment O2 demand (SOD) resulting from carry-over of organic matter inputs from the previous fall. Previous modeling of sediment biogeochemistry in Chesapeake Bay suggested that labile organic material deposition from the previous year can accumulate and carryover to the following spring (Brady et al. 2013). We applied a two-layer sediment flux model (Brady et al. 2013; Testa et al. 2013) to examine the seasonal response of sediments to a large September depositional event, where a month-long pulse of POM (equivalent to 30 % of the total annual POM flux) was simulated. Although this simulation may be conservative relative to the overall delivery of particulate material to the Bay (Cheng et al. 2013), it reflects the fact that the some of the material delivered during the storm is recalcitrant and that much of it was delivered to sediments north of CB3.3C. Four total simulations were run, including (1) a control, where fall POM deposition was equivalent to the long-term mean, (2) the month-long, elevated POM flux representing TS Lee, (3) the elevated POM flux with temperatures elevated by 2.5 °C in the following January to April period, and (4) simulation #3 with dissolved O2 reduced to hypoxic conditions (47 μM O2) beginning on April 1 of the following year (Fig. 12). Runs #3 and #4 are consistent with observed water column conditions in 2012. Organic matter diagenesis rates for all organic matter pools were set to 0.01 day−1, which is consistent with moderately labile organic matter (Burdige 1991; Westrich and Berner 1984).

Fig. 12
figure 12

Time series of sediment O2 demand (SOD) from July to July in a simulation experiment with a two-layer sediment biogeochemical model (Brady et al. 2013) at station CB3.3C in Chesapeake Bay. The black line represents the “control” experiment, based on average conditions, the large dashed line represents the model forced with a month-long elevation of POM deposition representing the impact of Tropical Storm Lee in September 2011, the small dashed line in black represents the TS Lee experiment with January to April temperatures elevated by 2.5 °C, and the gray dotted line represents the TS Lee + Temperature experiment with hypoxia beginning on April 1

The simulations suggest that a depositional event associated with a large autumn depositional event, combined with high observed water temperatures, could have doubled SOD in the following winter–spring period, from 15 to 30 mmol O2 m−2 day−1 (Fig. 12). The effect of the observed early onset of hypoxia in the beginning of April reduced SOD by roughly 30 %, resulting in SOD comparable to that predicted in the control run after April 1 (Fig. 12). However, SOD likely only contributes 10 % of total sub-pycnocline O2 demand at station CB3.3C, given a ~20 m aphotic water column with water column respiration rates of 15 mmol O2 m−3 day−1 (Sampou and Kemp 1994) and SOD rates of 30 mmol O2 m−3 day−1. SOD may have been a larger contributor to respiration and O2 decline, however, in more northerly stations (north of CB3.3C), which tend to be shallow (5–15 m) and received the largest loads of POM from TS Lee (Cheng et al. 2013).

Thus, we conclude that the unusually early hypoxia onset in upper Chesapeake Bay in 2012 was likely due to the synergistic effects of high stratification, elevated temperature, elevated POC availability in early April, and a carryover of organic material from the fall of 2011. To more realistically address this hypothesis, however, analyses should involve coupled physical–biological model simulations to separate effects of physical circulation from water column and benthic biogeochemical processes. Such a model would compute both water column and sediment responses to the loads from TS Lee if coupled to a sediment transport model (Cheng et al. 2013) and would allow a more quantitative assessment of how climatic changes (e.g., river flow, temperature) and the biogeochemical processes they influence (e.g., spring bloom timing and magnitude) influence the rate of O2 depletion and the phenology of hypoxia development over several decades.

Conclusions

The present study provides a novel synthesis of water column O2 depletion rates and the date of hypoxia onset for 25 years in Chesapeake Bay, describing regional and seasonal patterns of O2 depletion over the winter–spring transition. We found clear spatial patterns in both DHO and ROD along the central axis of Chesapeake Bay, despite substantial interannual variability. These analyses illustrates that bottom water, winter–spring chlorophyll a was the strongest single predictor of O2 depletion rates and hypoxia onset dates, illustrating the importance of spring bloom-derived organic material in fueling the respiration that initiates hypoxic conditions. Stratification was an important control in seaward Bay regions. The absence of significant correlation between winter–spring O2 depletion and summer hypoxic volumes suggests that summer phytoplankton production is needed to sustain summer hypoxic volumes and that climatic variations in later spring and summer are strong controls on hypoxic volume. A record early initiation of hypoxia in the upper Bay in 2012 may have been caused by extraordinary winter–spring climatic and biological conditions and the carryover of large organic matter loads associated with a tropical storm in September 2011. This indicates the potential for both proximal and remote controls on hypoxia initiation. Thus, the phenology of hypoxia initiation in Chesapeake Bay may shift with future changes in winter precipitation and temperature patters, as well as tropical storm activity during late summer and fall.