Introduction

Groundwater in fractured-rock aquifers flows through networks of fractures that reflect the structural history of the bedrock. Fracture networks in horizontally layered sedimentary rock typically consist of orthogonal sets of fractures that are either aligned with or cut across the bedding. Groundwater flow in fractured sedimentary rock has been simulated successfully at the kilometer scale with equivalent porous-media (EPM) models using effective hydraulic properties that represent the primary flow paths through the networks (e.g. Yager 1996; Scanlon et al. 2003; Davis and Katz 2007). Anisotropy in sedimentary-rock aquifers that results from preferential flow along bedding can be incorporated in EPM models by appropriate representation of the bedrock structure (e.g. Senior and Goode 1999; Yager et al. 2009; Yager and Ratcliffe 2010; Tiedeman et al. 2010).

Many studies of transport of dissolved tracers through fractured-rock aquifers have focused on the dual-domain nature of these flow systems, whereby tracer concentrations in water flowing though the mobile domain are affected by diffusive exchange with concentrations in a surrounding immobile domain (Zheng and Bennett 2002). Typically, the mobile and immobile domains are assumed to represent fractures and primary pore space within the rock matrix, respectively, although it has been recognized that in some fracture networks a significant portion of the immobile domain can consist of fractures that are poorly connected to the network (Shapiro 2001). Other fracture networks can include preferential paths along intersections with fault zones or within conduits that have been enlarged through dissolution. In general, fractured-rock aquifers can be considered as multi-domain flow systems that are hydraulically connected in a hierarchical fashion. Karst aquifers can be considered as a distinct type of multiple-domain flow system in that groundwater flows through conduits that are hydraulically connected to a pervasive fracture network. These aquifers are often characterized as having quick flow through conduits and slower diffuse flow through the fracture network (e.g., Schuster and White 1971; Jones 1991). The relative proportions of conduit and diffuse flows through karst aquifers are determined by extent and degree of interconnections within the conduit network.

A primary challenge for groundwater-flow simulations of fractured-rock aquifers is the representation of multiple conductivity domains. Several computer codes have been constructed to simulate dual-domain solute transport through discrete fractures within a porous matrix that comprises a fractured-rock aquifer (e.g. Therrien and Sudicky 1996). Such codes are typically applied in site studies at the 100-m scale where a detailed geologic characterization supports the explicit specification of the spacing, orientation and extents of individual fractures within the fracture network. There are some kilometer-scale applications of discrete-fracture models where mining or proposed storage of radioactive waste has necessitated extensive mapping of the fracture network (Beaudoin et al. 2006; Blessent et al. 2011). In most kilometer-scale studies, however, it is not possible to adequately delineate fractures within the network in order to construct an accurate discrete-fracture model, and the computational requirements of such a model would render it difficult to calibrate. Groundwater flow through karst aquifers with conduit-dominated flow has been adequately simulated at the kilometer scale using EPM models with sufficient spatial resolution to delineate high permeability paths that corresponded to mapped conduits (Worthington 2009; Lindgren et al. 2011). Wu et al. (2009) simulated groundwater flow at the kilometer scale through a karst aquifer using a dual-domain, discrete-fracture model in an area in China where data from coal mining allowed a detailed delineation of conduits. In the multi-kilometer scale study presented here, the groundwater flow model was based on detailed geologic mapping and was designed to represent anisotropy in the flow system caused by bedrock structure and the influence of fault and lineament zones. Mapped faults and lineaments were represented as high conductivity zones within a porous media matrix that incorporated anisotropy along bedding planes. The model domain (912 km2) is large enough to simulate groundwater flow on a scale that provides information relevant to groundwater managers, but is small enough to represent some of the geologic complexity that controls the flow of water and transport of environmental tracers to springs.

Environmental tracer concentrations (for example tritium, carbon-14 and chlorofluorocarbons) can provide information regarding the apparent age and age distribution of groundwater in samples from wells and springs and are useful as observations for constraining estimates of parameter values in groundwater flow models (Sanford et al. 2004; Solomon et al. 2010; Sanford 2011; Eberts et al. 2012). When using tracer data for model calibration, it is important to recognize that measured concentrations represent mixtures of waters that have traveled along different flow paths through the aquifer (Bethke and Johnson 2008). The “apparent age” associated with a tracer therefore represents a flow-weighted average age of the sampled water mixture (Weissmann et al. 2002). Consequently, measured tracer concentrations should be used as model observations rather than estimated apparent ages (Shapiro 2011). In some cases, the sampled water can be approximated as a binary mixture of two end members (for example, younger and older water). In such cases, the age and proportions of each end member can be estimated by calibrating binary-dilution models to tracer concentrations if several types of tracers are measured (Long and Putnam 2009; Eberts et al. 2012). An alternative interpretative approach involves the simulation of flow paths through the aquifer and the computation of tracer concentrations using numerical models with advective transport (Troldberg et al. 2008; Eberts et al. 2012) or solute transport models that represent dispersion (Weissmann et al. 2002; Bethke and Johnson 2008). As a consequence of the uncertainty regarding the actual tracer flow-paths and velocities, differences in simulated and observed tracer concentrations are likely to be greater than differences in simulated and observed head and flow data (Sanford 2011).

Interpretation of environmental tracer concentrations in fractured-rock aquifers can be complicated by the potential for mixing between mobile and immobile domains within the flow system, especially in settings where groundwater velocity can range over several orders of magnitude. For example, Neumann et al. (2008) discuss the possible effects of diffusive exchange between mobile and immobile domains on tritium (3H) and helium-3 (3He) concentrations, but their results indicate that such effects generally are most significant for waters associated with the mid-1960s bomb pulse to about 1980. Shapiro (2011) recommends incorporating as much detail as possible in representing the geologic structure that forms the framework for the flow system. The combined interpretation of environmental tracers with different atmospheric histories or those affected by different processes (for example, conservative tracers and radioactive tracers) can provide multiple lines of evidence that constrain estimates of parameter values in models of fractured-rock aquifers (Cook et al. 2005).

Few studies have interpreted environmental tracer concentrations at the kilometer scale in fractured-rock aquifers. Shapiro (2001) used a one-dimensional (1D) transport model to estimate the rate of exchange of 3H and CFC-12 between high- and low-permeability fractures in crystalline rock. Cook et al. (2005) used a discrete-fracture model to represent idealized vertical and horizontal fractures in fractured sedimentary rock in the Clare Valley Australia to simulate depth profiles of CFCs and carbon-14 concentrations and estimate groundwater age. Lindgren et al. (2011) calibrated an EPM model with advective transport to tritium concentrations in the Edwards aquifer in Texas USA and delineated recharge areas and transit times to a public-supply well. Two studies have used lumped parameter models of karst aquifers at the kilometer scale to analyze tracer concentrations. Long and Putnam (2009) analyzed CFCs and tritium concentrations in groundwater pumped from the Madison aquifer in South Dakota USA to estimate the quick and diffuse flow fractions in a bimodal age distribution within the contributing area to a well. Einsiedl et al. (2010) used tritium data in spring discharge to estimate a mean transit time (4.6 years) for diffuse flow within the Franconian aquifer in Germany.

In this study, several types of environmental tracer data were collected in spring discharge from a folded and fractured karst aquifer in the Shenandoah Valley USA in which groundwater is dominated by diffuse flow through the fracture network. Groundwater age distributions were estimated from the concentration data using a binary-dilution model (IAEA 2006; Solomon et al. 2010), and an EPM groundwater flow model with advective transport through particle tracking. The purpose of this study was to demonstrate the utility of environmental tracer data as an aid in the calibration of a numerical model of groundwater flow in folded and fractured karst, an application that has not been previously reported in the literature.

Hydrogeologic setting

Stratigraphy and structure

The watershed of Opequon Creek (referred to as the Opequon watershed herein) occupies the northwest corner of the Shenandoah Valley (Fig. 1), part of a near-contiguous karst terrain called the Great Valley that extends from the state of Alabama to New York, across a distance of approximately 1,500 km (Trapp and Horn 1997; Weary 2008). The Shenandoah Valley is underlain by both carbonate and siliciclastic rocks composed of Paleozoic marine sediments that form a northeast–southwest trending synclinorium. The carbonate rocks of the Shenandoah Valley are bound by two major fault systems, the Blue Ridge thrust on the eastern edge, and the North Mountain thrust on the western edge. The carbonate rocks are exposed along the valley flanks, with younger siliciclastic rocks forming the central portion of the valley along the axial trend of the regional fold structure (Fig. 1). The carbonate rocks comprise a sequence of more than 3,000 m of Cambrian and Ordovician platform deposits (Butts 1940) and are nearly all composed of mixed beds of limestone and dolomite (Figs. 2 and 3). The Ordovician Martinsburg Formation, a siliciclastic unit of shale and interbedded sandstone and siltstone (Dean et al. 1987, 1990), overlies the carbonate rocks. The epikarst in the Opequon watershed, the zone between carbonate bedrock and overlying unconsolidated soil and regolith, has an irregular thickness (Fig. 4) ranging from zero to 30 m (Kozar et al. 2007). Further discussion of the bedrock stratigraphy is included in the electronic supplementary material (ESM).

Fig. 1
figure 1

Location and bedrock geology of the Shenandoah Valley

Fig. 2
figure 2

Bedrock geology of the Opequon watershed showing locations of faults, lineaments and springs

Fig. 3
figure 3

Geologic section A–A’ showing rock units and faults within rocks underlying the Opequon watershed

Fig. 4
figure 4

Photo showing vertical conduits within the epikarst exposed in a quarry south of Martinsburg, West Virginia

Faults and lineaments

The present structure of sedimentary rocks in the Shenandoah is largely a consequence of the late Paleozoic to early Mesozoic (∼320–220 Ma) Alleghanian orogeny (Hatcher et al. 1989). The rock strata were folded with the axes of the anticlines and synclines oriented northeast to southwest, with two prominent sets of joints perpendicular to bedding (Jones and Deike 1981). Longitudinal joints are parallel to the strike of the northeast-trending fold axes and cross joints are perpendicular to the fold axes. Smaller cross faults that cut across bedrock strike are prevalent in the Opequon watershed (Figs. 2 and 3) and can provide pathways for fluids to rise to the surface at springs (Hobba et al. 1979; Perry et al. 1979; Kozar et al. 2007; McCoy and Kozar 2008; Nelms and Moberg 2010). A Light Detection and Ranging (LiDAR) survey of the Opequon watershed conducted by the USGS in March 2011 detected lineaments throughout the basin. These lineaments were digitized, and combined with previous lineament mapping (McCoy et al. 2005a, b). The largest, most prominent lineaments (Fig. 2) were also assumed to provide pathways for groundwater flow in the numerical model discussed further on.

Hydrology

Fractures in the Shenandoah Valley are aligned along bedding planes, joints and cleavage planes and provide a three-dimensional (3D) permeable network for a regional flow system (Kozar et al. 2007). The fracture network is intersected locally by faults that offset bedding planes and can act as barriers to horizontal flow perpendicular to the fault zone and conduits for vertical flow. Spring locations have been correlated with fault zones in carbonate rocks in the Shenandoah Valley (Harlow et al. 2005; Doctor et al. 2009). Upward flow gradients occur where cross faults and joints intersect either bedding planes or longitudinal faults (Jones and Deike 1981; McCoy and Kozar 2008).

Karst features such as sinkholes, springs, and small caves are widely distributed throughout the basin. Despite the occurrence of karst, the groundwater flow system is dominated by diffusive flow (Jones and Deike 1981; Wright 1990; Jones 1991). The perennial discharge at larger springs is artesian and the water is saturated or nearly saturated with respect to calcite, as evidenced by surficial deposits of marl, tufa and travertine that are common downstream of springs (Hubbard et al. 1985; Doctor et al. 2009). Dye-trace studies conducted up-gradient of large perennial springs often yield long travel times (weeks to months), high tracer dilutions, and divergent flow (Jones and Deike 1981; Jones 1991, 1997; Kozar et al. 2007). A quantitative dye-trace from a sinking stream to Fay Spring (spring 6, Fig. 2) in the Opequon watershed yielded a groundwater velocity greater than 500 m day−1, but with overall dye recovery of only 12 %, indicating that recent recharge mixes with a large proportion of slow-moving groundwater before emerging at the spring (Doctor et al. 2011).

Groundwater recharge

Recharge to groundwater in carbonate and siliciclastic rocks in the Shenandoah Valley occurs as infiltration from the land surface through the epikarst and weathered bedrock. Sinkholes concentrate surface runoff and convey it directly to groundwater, so recharge is unevenly distributed across the land surface. Recharge in the Shenandoah Valley has been estimated from base flow at 20 stream flow-gaging stations (Nelms et al. 1997) and corroborated by model calibration (Kozar and Weary 2009). Base flow is linearly related to the proportions of carbonate and siliciclastic rock in the gaged watersheds; the average recharge rates for each rock type are 24.7 and 13.9 cm year−1, respectively (Yager et al. 2009). In the Opequon watershed, stream flow has been measured for more than 60 years at two gaging stations and at least 5 years at two other stations (Fig. 2). The median base flow (recharge), estimated from stream-flow records, derived from both rock types in each sub-basin is presented in Table 1. Median recharge to the Opequon watershed is estimated to be 20.6 cm year−1 (Fig. 5a).

Table 1 Recharge rates for carbonate and siliciclastic rocks estimated for sub-basins within the Opequon watershed
Fig. 5
figure 5

Hydrographs showing a base flow estimated from stream flow at gage on Opequon Creek near Martinsburg, and b water levels in well 46W175 just east of the Opequon watershed (gage and well locations shown in Fig. 2)

Groundwater discharge

Groundwater in carbonate rock in the Opequon watershed discharges to numerous springs; the median discharge of 51 springs is 1,900 m3 day−1 and discharges range from 120 to 35,000 m3 day−1 (Fig. 6). Many of the springs are along faults or lineaments or at stratigraphic contacts (Fig. 2). Spring discharge accounts for 60 to 90 % of stream flow (Harlow et al. 2005), and most streams draining carbonate rocks are gaining rather than losing. Total spring discharge from the Opequon watershed is estimated to be 257,000 m3 day−1. In contrast, groundwater pumped by municipal and industrial wells totaled 19,800 m3 day−1 in 2001, or less than 4 % of the estimated base flow. Groundwater in siliciclastic rocks discharges to stream channels that are oriented generally parallel to cross joints and perpendicular to the strike of the valley.

Fig. 6
figure 6

Spring-discharge rates from carbonate rocks within the Opequon watershed

Circulation of groundwater through conduits and fractures in the Shenandoah Valley can exceed depths of 300 m, as evidenced by open voids intersected by a small number of deep, high-yielding wells (Cady 1936). Temperature measurements of discharge from nine springs in the Opequon watershed ranged between 10 and 16 °C, and the mean temperature (13 °C) is about 1 °C higher than the mean annual, ambient air temperature in Winchester, Virginia. A study of heat flow in folded and faulted carbonate rocks 200 km southwest of the Opequon watershed (Fig. 1) estimated the geothermal gradient below about 300 m depth to be 9 °C km−1 (Perry et al. 1979). Thus, spring temperatures elevated above ambient can be explained as an increase in temperature by 1 °C for each 100 m depth. The spring temperature data in the Opequon watershed indicates that groundwater discharged from the springs likely originated from depths of at least 100 m.

Hydraulic conductivity

Estimates of hydraulic conductivity for the six geologic formations underlying the Opequon watershed range from 0.3 to 3.2 m day−1 (see Table 1 in electronic supplementary material,  ESM), based on the results of hydraulic tests conducted in West Virginia (Kozar and Weary 2009) and assuming an aquifer thickness of 40 m, the median open-interval length of wells in the Opequon watershed. The median hydraulic conductivity from hydraulic tests conducted in wells located in fault zones was 70 m day−1. Estimates of transmissivity (T) for these formations are also available from the specific capacity of wells drilled in the Shenandoah Valley in Virginia and West Virginia. The median T values for the six formations range from 2.2 to 130 m2 day−1 with a large variation in values for each formation (see Figure 1 in  ESM). Median open-interval lengths and well depths were 40 and 60 m, respectively. Regression analyses indicated that the log T values are inversely related to well depth for all formations except the Martinsburg Formation, although there is wide scatter in the data.

Environmental tracers

Concentrations of environmental tracers were measured in discharge from 20 springs (Fig. 2) to obtain information on the groundwater age distribution in the Opequon watershed. Sampling periods were selected when spring discharge was mainly derived from groundwater and unaffected by storm events. The measurements include tritium (3H), dissolved neon (Ne), dissolved helium-4 (4He), the 3He/4He isotope ratio, chlorofluorocarbons (CFC-11, CFC-12, and CFC-113), and dissolved major gases, nitrogen (N2) and argon (Ar). A discussion of sample collection and analysis is included in the ESM. Discussion of the 4He analyses is also included in the ESM.

Environmental tracers and apparent groundwater age

Combinations of environmental tracers were used to identify samples of spring discharge that are mainly young (less than 60 years old) or mixtures of young and old (more than 60 years old). Dissolved major gases were used to estimate recharge temperature and excess air to correct tracer concentrations for use in age interpretation, as discussed in the ESM. The selected environmental tracer data are summarized in Table 2 along with field parameters and information on sample identification. Interpreted recharge temperature, quantities of excess air, apparent age, mixing fractions and estimated age of the young fraction are summarized in Table 3.

Table 2 Selected field parameters, recharge temperature, excess air, and environmental tracer concentrations measured in spring discharge in the Opequon watershed
Table 3 Tritium, tritiogenic 3He, initial tritium, terrigenic 4He, 3H/3He age, fractions of young water calculated from 3H and CFC-113 data, and terrigenic 4He in old fraction of spring discharge in the Opequon watershed

Tritium and chlorofluorocarbons in precipitation

Tritium data from springs in the Opequon watershed were related to a smoothed function of tritium in precipitation that was at 6-month intervals from correlation to tritium records in the US (International Atomic Energy Agency 2012). Pre-1953 values were estimated by correlations to long-term records following Michel (1989), and a constant value of 7.5 TU was assumed for the period 2006 through 2011. The tritium input function in precipitation was decay-adjusted to the year 2005, which represents the median year of sampling, for comparison with the collected data (see Figure 2 in ESM).

Atmospheric concentrations of CFCs (see Figure 2 in ESM) are available for download (US Department of Energy 2012; US Department of Commerce 2012; US Geological Survey 2012). Although CFCs can be degraded in anaerobic environments, water from most of the springs in the Opequon watershed is aerobic (Table 2). The presence of additional CFCs from non-atmospheric sources prevented the use of CFC-11 and CFC-12 concentrations in age interpretation in spring discharge in the Opequon watershed, where most CFC concentrations exceed 100 % of modern atmospheric concentrations (Table 2). In contrast, the CFC-113 concentrations are less than modern atmospheric concentrations for 12 of the 20 samples with a median value of 94 %. CFC apparent ages were calculated by relating the CFC concentration in the sample (expressed in pptv) to the historic atmospheric concentration (see IAEA 2006 for details). The average CFC-113 apparent age of the 12 valid CFC-113 samples is 16.5 ± 3.5 years (Table 3; see Figure 3 ESM). Only spring 1 appears nearly concordant in regard to CFC apparent ages, with ages of 28.1, 20.1, and 21.6 years from CFC-11, CFC-12, and CFC-113, respectively (Table 3).

Initial tritium and tritium-3He age

The apparent tritium-3He (3H/3He) age of unmixed samples is based on the radioactive decay of tritium in precipitation to the noble gas 3He. The 3H/3He age (ττ) in years is:

$$ \tau =\frac{1}{\lambda } \ln \left(1+\frac{{}^3\mathrm{H}{\mathrm{e}}_{\mathrm{tri}}}{{}^3\mathrm{H}_{\mathrm{m}}}\right), $$
(1)

where the decay constant λ = ln 2/12.32 = 0.05626 year−1 (12.32 years is the half-life of tritium), 3 H m is the measured amount of tritium in the sample in tritium units (TU), and 3 He tri is the amount of tritiogenic 3He in the sample produced from tritium decay, in TU. The value of 3 He tri can be calculated from the measured values of 4He, Ne, and the 3He/4He isotope ratio of the sample (Schlosser et al. 1988, 1989). In an unmixed sample the apparent 3H/3He age is the time required for the initial tritium (3 H o=3 H m+3 He tri) to decay to the measured tritium in the sample, 3 H m. The relatively low amounts of terrigenic 4He and excess air in the samples, combined with little evidence for gas exchange following recharge, indicate that most of the apparent ages derived from the 3 H m/3 H o ratio in this study are reliable within 1 to 2 years. Most samples contained an excess of terrigenic 4He requiring correction for terrigenic 3He in age interpretation, as indicated in Table 3.

In the case of binary dilutions of young and old water, the apparent 3H/3He age computed with Eq. (1) is a very good approximation of the age of the young water in the mixture, if the fraction of young water is relatively large compared to that of old water and the concentration of 3 He tri in the old water is small. Both of these conditions are met in spring samples from the Opequon watershed (see ESM). The median apparent 3H/3He age of all samples computed with Eq. (1) is 4.4 years (Table 3). Figure 7a shows the decayed tritium input curve (DTIC) as a function of the 3 H m/3Ho ratio for unmixed (piston flow) samples. The upper horizontal axis indicates the initial recharge date for unmixed samples. In binary dilutions the 3 H m/3 H o ratio varies along a curve from 0 (old) to 1.0 (modern), as a function of age. Two binary-dilution mixing lines are shown in Fig. 7a: one for dilution of water recharged in 2005 (zero age) and one for dilution of water recharged in the year 1998 (7 year-old tritium-bearing water). Most of the samples plot below the DTIC and between these two mixing lines, and appear to be mixtures of young, tritium-bearing water with ages between 0 and 7 years and older water that contains little or no tritium. The sample from spring 1 appears to be unmixed, however, and plots along the DTIC with a 3H/3He age of about 20 years.

Fig. 7
figure 7

Graphs showing a Tritium measured in spring discharge (3 H m) and tritium-in-precipitation decayed to the year 2005, as a function of sample age and as a function of the tritium/initial tritium ratio (3 H m/3 H o), and b CFC-113 measured in spring discharge and atmospheric CFC-113 at recharge date, as estimated from 3H/3He age. The upper horizontal axis (a) is the sample age relative to the year 2005 an unmixed (piston flow) sample. The curved lines (a) represent dilution of water recharged in 2005 and 1998 by old water, the fraction of which increases along the mixing line as it approaches the origin. The old water fraction is assumed to contain 2 TU of 3 H tri. Evidence for dilution with old water also is seen in the CFC-113 data (b) as most samples plot to the left of the piston flow line, toward CFC-113 concentrations lower than expected according to the 3H/3He age

A second graph (Fig. 7b) compares CFC-113 concentrations (in pptv) in the spring samples to atmospheric CFC-113 concentrations corresponding to the apparent 3H/3He recharge date of each sample. Unmixed samples not affected by dilution (piston flow) would plot along the 1:1 line. Spring 1 has the oldest recharge date (about 1985) with the lowest measured CFC-113 concentration and only a small amount of dilution with old water (Fig. 7b). Most of the samples have recharge dates more recent than the year 2000 and plot to the left of the piston-flow line, indicating that they have been diluted with old water. The apparent ages from CFC-113 concentrations and the 3H/3He ratio are compared (see Figure 3 in ESM) for samples with valid CFC-113 concentrations (those samples with less than 100 % modern CFC-113) and those with valid apparent 3H/3He ages (Tables 2 and 3). The apparent CFC-113 ages are all older than the apparent 3H/3He ages (Table 3), suggesting that the spring discharges can be approximated as binary mixtures of young and old water.

Mixing fractions of young and old water

If mixtures of young and old (tracer-free) water are approximated with a binary-dilution model and the 3 He tri concentration of old water is neglected (see ESM) the mixing fractions of young and old water (f young and f old) can be estimated using the apparent 3H/3He age (and date of recharge) of young water in the sample (Eq. 1). The fraction of young water based on tritium (f young( 3 H)) is calculated from the initial tritium in the sample (3 H o) and the amount of tritium in precipitation at the time of recharge (3 H precip), based on the apparent 3H/3He age and the DTIC (see Figure 2 ESM), as:

$$ {f}_{\mathrm{young}\left({}^3\mathrm{H}\right)}=\frac{{}^3H_{\mathrm{o}}}{{}^3H_{\mathrm{precip}}}. $$
(2)

Although 3 H o can be measured within about ±0.3 TU, the amount of 3H in precipitation at time of recharge probably is not known better than about ±10 % and can vary widely with both seasonal and individual precipitation events.

For the 12 samples that contain valid CFC-113 concentrations (Table 3), the fraction of young water (f young(CFC-113)) was calculated using the amount of CFC-113 in the atmosphere at the time of recharge (CFC-113 atm), where the recharge date was defined by the apparent 3H/3He age (Eq. 1):

$$ {f}_{\mathrm{young}\left(\mathrm{CFC}-113\right)}=\frac{ CFC-113}{ CFC-{113}_{\mathrm{atm}}}. $$
(3)

The accuracy of the measured CFC-113 concentrations is about 2 % and the atmospheric concentrations of CFC-113 in North America air are known within a similar precision, although some samples could contain small amounts of non-atmospheric CFC-113 (making the apparent age appear younger).

The fractions of young water for 10 samples computed from both the 3H and CFC-113 data (Table 3) are compared in Fig. 8. The average young-water fraction obtained from the 3H data is 0.82 ± 0.13 (17 samples), while the average young-water fraction obtained from the CFC-113 data is 0.88 ± 0.07 (13 samples). The fractions of young water estimated from the 3H data for three springs (3, 14 and 21) are lower than those estimated from the CFC-113 data (Fig. 8). This difference could be due to either excess CFC-113 in the samples or over-estimation of the amount of tritium in precipitation at time of recharge. The error bars shown in Fig. 8 were computed accounting for a 10 % uncertainty in tritium in precipitation at time of recharge, and 2 % uncertainty in CFC-113 measured concentration. Uncertainty in the measured tritium and in the atmospheric mixing ratio of CFC-113 were judged smaller and not included in the error analysis. Given the uncertainty in reconstructing the tritium concentration in precipitation compared to uncertainty in CFC-113 concentrations, the fractions of young water computed from the CFC-113 data are assumed to be more accurate than those based on the 3H data. The fractions of young water computed from 3H data, however, were used for the eight springs for which the CFC-113 concentrations were not valid (Table 3).

Fig. 8
figure 8

Comparison of fractions of young water based on 3H (Eq. 2) and on CFC-113 (Eq. 3). Error bars for estimated fractions based on 3H are shown

Overall, the fraction of young water for the 20 springs in the Opequon watershed ranges from 0.70 to 0.99 with an average value of 0.86 ± 0.08. The error associated with neglecting the 3 He tri-old concentration in the binary-dilution model is small, as discussed in the ESM. Binary-dilution model estimates of mixing fractions computed with CFC-113 data are more reliable than those based on 3H data because the atmospheric mixing ratio of CFC-113 has been nearly constant for the past 15 years, but could contain an error less than 5 % for young-water fractions more than 0.4 (see Figure 4a in ESM). Estimates of mixing fractions based on 3H data could be over-estimated by less than 5 % for mixtures that contain young-water fractions more than 0.6 and 2 TU of 3Hetri-old (see Figure 4b in ESM). If, in the extreme case, the old water contains 5 TU of 3 He tri-old, the approximation is still very good for mixtures that contain young-water fractions greater than 0.8, which is the case for nearly all the computed mixtures (Table 3).

Numerical model

Groundwater flow in the Opequon watershed was simulated as steady-state with a 3D finite-difference model using MODFLOW (Harbaugh et al. 2000) to represent a period in April 2001 when a long-term well hydrograph indicates that water levels were near 12-year median values (Fig. 5b). The model represents a 500-m thickness of carbonate and siliciclastic rock and overlying epikarst. The geometry of the flow model is based on a 3D geologic model that contains six bedrock units that are intersected by mapped faults and lineaments.

Model design

The model domain covers an area of 915 km2 and includes the watersheds of Opequon Creek and Harlan Run, with the exception of a 50-km2 area in the Opequon watershed west of the model domain underlain predominantly by siliciclastic rock (Fig. 9). The model domain is discretized using a regular grid of 100-m square cells that are small enough to resolve fault zones in the carbonate rock and perennial stream channels within the siliciclastic rock. The grid is oriented N 23° E, parallel to the generalized strike of the sedimentary rocks. The top surface of the model corresponds to land surface based on a 10-m digital elevation model (DEM) and the bottom surface is specified as 500 m below land surface. The rock mass is divided into 6 layers: two upper layers that are each 25-m thick and four lower layers of increasing thickness from 50 to 200 m (Fig. 10b). The resulting grid contains 542,334 active cells. The top model layer represents either siliciclastic rock, or the epikarst in areas underlain by carbonate rocks.

Fig. 9
figure 9

Model domain and boundaries showing distribution of carbonate and siliciclastic rocks within gaged sub-basins within the Opequon watershed

Fig. 10
figure 10

Representation of rock units and fault zones in model of Opequon watershed. a Plan view, and b section A–A’′.

Model boundary conditions

The lateral boundaries of the model domain generally coincide with watershed boundaries and are, therefore, assumed to be no-flow. The generalized water-table surface of the northern part of the Shenandoah Valley supports the assumption that the groundwater and watershed divides overlap, although the position of the groundwater divide probably moves seasonally in response to changes in recharge. The northern boundary of the model domain represents discharge to the Potomac River through seepage faces using a head-dependent flow boundary (drain cells) assigned an elevation of 110 m (Fig. 9). The hydraulic conductivity value that limits discharge through this boundary was estimated through model calibration. Underflow into the model domain from the portion of the Opequon watershed excluded from the domain is represented as a specified flow along the western model boundary and was estimated through model calibration. Underflow out of the model domain toward a pumping center near a quarry operation is represented as a specified flow along the southern model boundary based on measured pumping rates.

The top model boundary represents recharge as a specified flow using the recharge rates for carbonate and siliciclastic rocks presented earlier for each sub-basin in Table 1. Total recharge to the model domain is 509,000 m3 day−1, based on median recharge to the Opequon watershed (20.6 cm−1 year). The bottom model boundary represents vertical flow into and out of the model domain. The depth of the sedimentary basin beneath the Opequon watershed exceeds 4 km (Fig. 3) and it was not practical to represent the entire bedrock thickness. A regional groundwater-flow model of the Shenandoah Valley of Yager et al. (2009) indicates downward flow beneath recharge areas within the Opequon watershed and upward flow beneath tributary stream channels. This pattern of flow was represented in the Opequon model by specifying downward flow through the bottom model boundary within carbonate rock and upward flow along fault zones. The downward and upward flow rates were estimated during model calibration.

Groundwater discharge to 92 springs in carbonate rocks and perennial stream channels in siliciclastic rocks is represented by head-dependent flow boundaries (drain cells). The springs are assumed to discharge water from the upper 125 m of the carbonate rocks, based on measured temperatures of spring water presented earlier, and are represented by a stack of three drain cells in model layers 2–4 (Fig. 9). Each spring elevation was assumed to lie 2 m below the simulated water table in the model cell that contained the spring. Streambed hydraulic-conductivity values (K str) that limit discharge to springs and streams were estimated through model calibration. Pumping from five municipal and industrial wells (19,800 m3 day−1) is represented by specified flow boundaries.

Hydraulic conductivity

Hydraulic conductivity in fractured sedimentary rock can be represented by a 3D hydraulic conductivity tensor (herein referred to as the conductivity tensor) that is oriented with the directions of maximum and medium hydraulic conductivity parallel to the strike and dip of bedding, respectively, and the minimum hydraulic conductivity direction perpendicular to bedding (e.g., Yager et al. 2009). The conductivity tensors defined by the MODFLOW grid that represent the Opequon watershed vary two-dimensionally in the horizontal direction, but are oriented vertically. As a result, the representation of the actual bedrock structure in the Opequon watershed in the model is approximate.

Six rock units and the epikarst overlying carbonate rocks were assigned separate values of hydraulic conductivity that were estimated through model calibration (Table 4; Fig. 10). Bedding in the siliciclastic Martinsburg Formation that underlies the center of the watershed was assumed to be horizontal, while bedding in the five carbonate units located along the east and west flanks of the watershed was assumed to be vertical. This approximation is justified by the generally large dip of carbonate rock units in the study area (Fig. 3). The direction of maximum hydraulic conductivity (K max) in all six rock units was oriented parallel to the generalized strike of the valley (N 23° E) along columns of the model grid (y-direction). In the Martinsburg Formation the direction of medium hydraulic conductivity was specified as perpendicular to strike (along model rows, x-direction), so the K max:K med anisotropy was represented as horizontal anisotropy (K y:K x). The direction of minimum hydraulic conductivity in the Martinsburg Formation (perpendicular to bedding) was vertical (z-direction), so the K max:K min anisotropy was represented as vertical anisotropy (K y:K z).

Table 4 Parameter values estimated and specified in nonlinear regression

The direction of medium hydraulic conductivity in the six carbonate units was specified as vertical (parallel to the dip of the bedding), and the K max:K med anisotropy was represented as vertical anisotropy (K y:K z). The direction of minimum hydraulic conductivity in the carbonate units was horizontal, so the K max:K min anisotropy was represented as horizontal anisotropy (K y:K x). The direction of maximum hydraulic conductivity in the epikarst was assumed to be vertical and the magnitudes of K med and K min were assumed to differ by a factor of four, so K z:K y = 4 · K z:K x . The following power function was applied to decrease K max in the carbonate units, fault zones and lineaments with increasing depth below land surface (as indicated by specific capacity data):

$$ {K}_{\mathrm{depth}}={K}_{\max }{10}^{-\uplambda \mathrm{d}}, $$
(4)

where K depth is the maximum hydraulic conductivity [L T−1] at depth d [L] below a threshold depth D and λ is a decay coefficient [L−1]. The threshold depth was assumed to be 150 m, which is greater than the median well depth and corresponded to the bottom of layer 4. The λ value was estimated through model calibration.

Mapped faults and lineaments generally intersect the carbonate rocks and were classified as longitudinal faults (sub-parallel to bedding strike) or cross faults (perpendicular to bedding strike). Fault zones that correspond to the mapped faults and lineaments are assumed to be 100 m wide and oriented vertically. The conductivity tensors defined for the fault zones using the Model-Layer Variable-Direction Horizontal Anisotropy (LVDA) package in MODFLOW (Anderman et al. 2002) are oriented such that the directions of K max are parallel to the mapped strikes. The direction of medium hydraulic conductivity was specified as vertical and the magnitudes of K max and K med were assumed equal, so K max:K med = 1. The directions of K min were horizontal and perpendicular to the mapped strikes.

Advective transport

Groundwater flow paths and advective travel times were calculated using MODPATH (Pollock 1994) to delineate areas contributing recharge to springs and compute distributions of travel times of recharge through the model domain to springs. Separate porosity values (n) assigned to the epikarst, siliciclastic rocks, carbonate rocks and the fault zones were estimated through model calibration. A power function similar to Eq. (4) was applied to decrease the porosity values with increasing depth in all units with a decay coefficient λ and a threshold depth of 150 m.

Particles were tracked backward from the faces of drain cells that represented spring discharge to inflow boundaries within the model domain. Travel time within the drain cell was assumed to be negligible. The number of particles assigned to each drain-cell face was based on the proportion of the total simulated spring discharge that flowed through the face, so that each particle represented approximately the same quantity of flow. About 2,000 particles were tracked from each spring, but the actual number of particles per face (np face) was varied so the starting positions could be configured as a square array on each cell face. The fraction of the total simulated discharge associated with the pth particle (X p) was calculated as:

$$ {X}_{\mathrm{p}}=\frac{ Inflo{w}_{\mathrm{face}}/n{p}_{\mathrm{face}}}{ Inflo{w}_{\mathrm{tot}}{}_{\mathrm{al}}}, $$
(5)

where Inflow face is the volume of flow through the cell face, and Inflow total is the total inflow into the drain cells representing the spring.

The program MODPATH-OBS (Hanson et al. 2012) was used to compute concentrations of tritium, CFC-113 and 4He in spring discharge. The concentration-histories of tritium and CFC-113 were specified in recharge and tracked along particle flow paths to springs using travel times computed by MODPATH. The average tracer concentration for all particles that discharge at drain cells representing a spring was computed as a flow-weighted average concentration in the simulated discharge. Upward flow along fault zones through the bottom model boundary was assumed to represent water older than 60 years and was assigned zero concentrations of tritium and CFC-113. Tritium and CFC-113 concentrations derived from recharge were, therefore, diluted by mixing with particles that originated from the bottom model boundary.

Simulated concentrations of tritium (C tritium) were decreased along flow paths to account for radioactive decay of tritium to 3He with an equation describing first-order decay:

$$ {C}_{\mathrm{t}\mathrm{ritium}}={\displaystyle \sum_1^{np}{X}_{\mathrm{p}}\left(R{C}_{{\mathrm{rt}}_{\mathrm{p}}}{e}^{-\mathrm{k}{\mathrm{t}}_{\mathrm{p}}}\right)} $$
(6)

where np is the number of particles associated with the drain cells representing a spring, RC is the tritium concentration in recharge at the recharge time (rt p) (calculated as the sample time minus the travel time) of the pth particle, k is the decay rate, and t p is the travel time of the pth particle. The 3He concentration associated with each particle was calculated from the amount of tritium decay along the particle’s flow path.

The simulated 4He concentration along each flow path (C He-4) was assumed to accumulate following an equation that describes zero-order growth:

$$ {C}_{\mathrm{He}-4}={\displaystyle \sum_1^{np}{X}_{\mathrm{p}}\left(R{C}_{{\mathrm{rt}}_{\mathrm{p}}}+{k}_{{\mathrm{t}}_{\mathrm{p}}}\right)} $$
(7)

where k is the accumulation rate. The initial value of C He-4 at the bottom model boundary and the accumulation rate were estimated through model calibration.

Model calibration

The groundwater flow model was calibrated to estimate model parameters through nonlinear regression with UCODE (Poeter and Hill 1998; Poeter et al. 2005) using measured groundwater levels in 485 wells and discharges at 2 streamflow-gaging stations and 51 springs in the Opequon watershed. The regression also included 60 concentration observations derived from environmental tracer data collected at 20 springs. The estimated parameters included K max of six bedrock units, epikarst and fault zones, four anisotropy ratios (K max:K med and K max:K min) of carbonate rocks, and four K str values of stream and spring boundaries (Table 4). Four porosity values of carbonate rocks and fault zones, and the two values associated with accumulation of 4He (Equation 5 in ESM) were also estimated by regression. A discussion of the observations used in model calibration and figures showing the spatial distribution of residuals (observed minus simulated values) are included in the ESM (see Figure 6 in ESM).

Estimated parameters

A total of 40 parameters are defined in the groundwater-flow model and 21 were optimized through nonlinear regression (Table 4). The remainder were specified or estimated through regressions that decreased model error, but did not converge to optimum values. All of the optimized parameters are well-estimated and the coefficients of variation for most estimates are less than 10 %. Seven values of K max were estimated for the bedrock units and fault zones. The K max values were lowest (0.32 m day−1) for the Martinsburg Formation and ranged from 2.3 to 19.9 m day−1for carbonate rocks. The K max values for the bedrock units in the bottom two model layers (70 % of the model domain) were reduced by factors of 3 and 6, respectively, using Eq. (4) with a specified depth coefficient (λ) that was estimated by trial and error. The K max values for cross and longitudinal fault zones were 19.2 and 2.8 m day−1, respectively. Data are not available to either support or contradict the difference in these model-derived estimates for the K max values of fault zones. Field observations indicate that zones of rising groundwater occur more frequently along cross-strike features, however, as indicated by the presence of marl in stream channels parallel to cross-strike faults.

Optimum estimates of four anisotropy ratios were obtained through regression for the carbonate rocks (Table 4) including the K max:K med anisotropy (strike-parallel to dip-parallel conductivity) and three K max:K min anisotropy values (strike-parallel to cross-bedding conductivity). The K max:K min anisotropy ratios ranged from 9:1 for the Conococheague Formation to about 20:1 for the other carbonate units, which is comparable to the ratio obtained through calibration of a groundwater-flow model of the Shenandoah Valley (17:1) by Yager et al. (2009). Other anisotropy ratios in the Opequon model were specified. The K max:K min anisotropy values for the epikarst (K z:K xy) and fault zones (strike-parallel to strike-perpendicular conductivity) were 20:1 and 5:1, respectively. The K max:K med (strike-parallel to strike-perpendicular conductivity) and K max:K min (strike-parallel to cross-bedding conductivity) anisotropy ratios for the Martinsburg Formation were specified as 2:1 and 10:1, respectively.

Streambed hydraulic-conductivity values (K str) that range from 0.01 to 0.1 m day−1 were estimated for the streams in the Martinsburg Formation, springs in the carbonate rocks and seepage faces along the Potomac River (Table 4). The simulated groundwater discharge to the Potomac River from the Opequon watershed is 66,700 m3 day−1 or 10.7 % of the total outflow, and compares favorably with the flow estimated by the Shenandoah Valley groundwater model (70,000 m3 day−1) of Yager et al. (2009). Estimated inflow along the western model boundary (14,400 m3 day−1) accounts for 2.5 % of the total inflow to the model domain. Simulated downward flow from carbonate rocks and upward flow along fault zones through the bottom model boundary each have the same magnitude (98,800 m3 day−1) and account for 15.9 % of the flow through the model domain (see groundwater budget in Figure 6 of ESM). The remaining inflow to the model domain (509,000 m3 day−1) is recharge, which was specified in the regression.

Three porosity values that ranged from 0.4 to 1.6 % were estimated by regression (Table 4). The largest porosity was estimated for epikarst, and the smallest porosity was estimated for the fault zones and was poorly estimated. The porosity values in the bottom two model layers were decreased using Eq. (4) and the depth coefficient λ, but were assigned a minimum value of 0.1 %. The estimated range of porosity values is similar to that estimated for the Edwards aquifer in Texas (0.1–1.3 %) by Lindgren et al. (2011). Optimum estimates of the concentration of 4He entering the bottom model boundary and the 4He accumulation rate were 4.9 × 10−8 ccSTP g−1 and 2.8 × 10−10 ccSTP g−1 year−1, respectively. Using these values and Equation 5 of ESM, the simulated age of old water entering the model domain is 180 years, although due to the expected variability in 4He accumulation rates and transit times along flow paths through the deeper parts of the aquifer (not simulated) the actual age range of old water is probably large. The 4He results are discussed in the ESM.

Model fit

Comparison of observed with simulated heads and flows indicates a reasonable model fit with little bias (Fig. 11a,b). The standard error in head is 12.6 m, about 8 % of the 168-m measurement range, and most heads are within 25 m of the observed values. In comparison, water-level fluctuations more than 25 m have been observed in some wells, so the scatter in residuals is partly the result of using single measurements from the long-term database in the steady-state simulation. The flow observations span 4 orders of magnitude and the median spring discharge (2,400 m3 day−1) is accurately simulated (2,500 m3 day−1). Spring discharges greater than 10,000 m3 day−1are under-predicted, however, including the largest discharge (35,200 m3 day−1) recorded at spring 14 (Fig. 11b). The total measured spring discharge (217,600 m3 day−1) is also under-predicted by about 30 %. This discrepancy suggests that the largest springs are better connected to the fracture network than most springs, possibly as a result of increased dissolution along conduits that convey groundwater to the springs, or that fault zones are more numerous or extensive than represented in the model.

Fig. 11
figure 11

Plots of simulated and observed data used in model calibration: a heads, b flows, c 3H concentration, d 3H/3He age, e CFC-113 concentration, and f percent of old water in discharge

There is more scatter in the concentration data than in either the head or flow data, partly because the relatively few measurements represent single samples from each spring. Discharges vary in both quantity and in tracer concentration at some springs, so repeated sampling (beyond the scope of this study) would be required to ensure that the data are representative of the assumed steady-state condition. The regression results fit the head, flow and concentration observations reasonably well, but there are outliers that the model simulation cannot match.

Simulated 3H concentrations are generally larger than those observed, with the exception of spring 1 where the largest 3H concentration measured is under-predicted (Fig. 11c). Most of the simulated 3H/3He ages agree reasonably well with the apparent 3H/3He ages (Eq. 1), with the median simulated age (5.7 years) being slightly older than the median apparent 3H/3He age (4.4 years). Although the median ages are similar, large differences in simulated and apparent ages were found for springs 1 and 16 (Fig. 11d). The simulated age for spring 1 is younger than estimated with Eq. (1) because less 3He was produced during the short residence time in the numerical simulation. The simulated results for spring 16 are reversed, with more 3He produced during a long residence time, yielding older ages than estimated by Eq. (1).

Both springs 1 and 16 are in the southeastern part of the model domain where the watershed boundary is close to Opequon Creek. Inspection of the simulated flow paths that discharge to spring 16 indicates that the longest flow paths originate in the Martinsburg Formation and account for most of the simulated 3He concentration at the spring. Groundwater in the Martinsburg Formation generally has concentrations of sodium and chloride that are greater than 30 and 60 mg L−1, respectively, while the water discharged by spring 16 is typical of carbonate springs in the area with less than 5 mg L−1 of sodium and 10 mg L−1 of chloride (L.N. Plummer, US Geological Survey, unpublished data, 2012). It is unlikely, therefore, that groundwater in the Martinsburg Formation is discharged at spring 16, as simulated by the numerical model. One explanation for the discrepancy in simulated ages at both springs is that the actual groundwater divide lies further to the southeast in this area than assumed. Extending the model domain to represent this condition would provide a larger recharge area up gradient of these springs, thus preventing Martinsburg water from reaching spring 16 and creating longer flow paths to spring 1.

The observed CFC-113 concentrations are matched with slight bias, although there is scatter in the data (Fig. 11e). The lowest CFC-113 concentration (measured at spring 1) is over-predicted, however, indicating less dilution with older water in the model simulation. If longer groundwater flow paths to spring 1 were simulated (by extending the model boundary to the southeast as described in the preceding), the model simulation would better match both the 3H and CFC-113 concentrations.

Comparison of the percentages of old water in spring discharges estimated by the binary-dilution model with those simulated by the numerical model yields a poor match (Fig. 11f). The simulated percent of old water for spring 1 is larger than that estimated by the binary-dilution model, which is consistent with the conclusion that actual flow paths to the spring are longer than simulated (and would account for the larger 3He values and the smaller 3H values that are observed in spring discharge). The average simulated percent of old water (7.6 %) for the other springs is about one half that estimated from the binary-dilution model from concentrations of CFC-113 and tritium (14.2 %), and simulated discharges for 6 springs contain little or no old water. Although the percentages of old water calculated by the binary-dilution model could be over-estimated (see ESM) for the 6 springs using the 3H data (including the two largest percentages, springs 8 and 9), this would only partly explain the difference in simulated results.

Discussion

Incorporating complexity in karst groundwater models

Representation of bedrock structure in the Opequon groundwater model results in a simulation that depicts anisotropic flow that is expected in folded and fractured karst terrain. Simulated flow paths converge to springs and fault zones enhance vertical flow and provide shortcuts across the bedding of carbonate rocks. The representation of these features requires detailed mapping to locate springs and develop models of geologic structure. The flow paths and travel times produced by the model provide a reasonable match to observed heads, spring discharges and concentrations of environmental tracers, and reflect the salient characteristics of the flow system.

Flow paths to springs delineated by backward particle tracking result in a complex pattern, especially in areas where springs are numerous (Fig. 12). The simulated source areas of the sampled springs reflect the effects of anisotropy and fault zones on the flow system. The bedrock structure is less complex on the west flank of the watershed and the bedding dip is shallower than on the east flank, resulting in more extensive outcrops of bedrock units with a more open pattern of folding. There is also a lower density of mapped faults on the west flank than on the east flank. The flow paths are longer and more uniform on the west flank than on the east flank. The longest flow paths (15 km) are simulated for spring 8 on the west flank and the shortest paths (less than 3 km) are simulated for spring 13 on the east flank.

Fig. 12
figure 12

Simulated source areas to sampled springs

Simulated groundwater transit times through the model domain also reflect the effects of anisotropy and fault zones (Fig. 13). The longest transit times (more than 20 years) are simulated for the Martinsburg Formation from groundwater mounds that form between the numerous tributaries of Opequon Creek. The shortest transit times in carbonate rocks (less than 2 years) are simulated in the vicinity of springs that are located along faults. Many stagnation zones with transit times ranging from 10 to 20 years are simulated in areas that contain few springs or faults. The mean simulated transit-times through carbonate rocks and the Martinsburg Formation are 6.7 years and 10.1 years, respectively. These transit times should be considered as conservative estimates of the flow system, since multiple groundwater ages ranging from a few days to several decades exist within the system, and mix together to provide the mean simulated transit-times.

Fig. 13
figure 13

Simulated groundwater transit time through model domain

Utility of environmental tracers in model simulations

The binary dilution model of environmental tracer data indicates that groundwater discharged from springs in the Opequon watershed can be approximated as a binary mixture of young water containing 3H and CFC-113, and old (tracer free) water that dilutes the environmental tracer concentrations. Numerical simulation of this condition with advective transport requires specification of upward flow of old water along fault zones at the bottom of the model domain, otherwise it is not possible to match observed 3H and CFC-113 concentrations and 3H/3He ages with a single set of porosity values. Results of the binary-dilution model were instrumental in refining the initial model design because the importance of upward flow within the flow system became apparent only after these results were available.

The uncertainty in parameter estimates obtained through model calibration was greatly reduced by incorporating concentration observations derived from environmental tracers in the nonlinear regression. Including the concentration observations decreased the average value of the coefficient of variation associated with the hydraulic parameters (hydraulic conductivities and flows) from 32 to 8 %, and allowed the estimation of one additional parameter (Table 4). This reduction in uncertainty was the result of lower covariance among the parameters with the inclusion of concentration observations. The consideration of concentration observations also suggests further refinement of the model design. For example, the inability of numerical simulations to reproduce the observed 3H and CFC-113 data at springs 1 and 16 indicates that the watershed boundary probably does not coincide with the groundwater divide upgradient of these springs, and that additional areas to the southeast also contribute additional groundwater flow to the watershed.

Model simulations allow the computation of tracer concentrations using flow-weighted averages of concentrations derived from multiple flow paths, rather than simple binary mixtures of young and old waters. The age composition of simulated spring discharge reveals that the travel times along flow paths discharging to springs generally range from less than 1 to 25 years (Fig. 14). Most of the simulated travel times are less than 10 years. The total volume of water within the model domain is about 1.5 × 109 m3, based on porosity values estimated using the concentration observations (Table 4). This volume is equivalent to 7 years of recharge, which is consistent with the mean transit time through the carbonate rocks (6.7 years). About 44 % of the volume of stored water is within the upper 150 m of carbonate rock and 9 % is stored within the epikarst.

Fig. 14
figure 14

Histograms showing the distribution of travel times to selected springs: a spring 3, b spring 12, c spring 14 and d spring 17

Limitations of model results

The Opequon groundwater-flow model is designed to simulate the diffuse flow system within the watershed that sustains perennial discharge to springs. The median spring flow is well represented, an indication that the model achieves this goal. Flow to the largest springs is under-predicted, however, either because high permeability paths created by convergent flow to these springs are not represented in the model, and/or high-conductivity zones (faults and lineaments) are more numerous or extensive than those mapped. Conduit flow through discrete karst features is not simulated, so rapid transport of tracers (for example, during dye tracing tests) or of contaminants from the land surface is not represented by the model. Although rapid movement of dye tracers with velocities in excess of 500 m day−1 occurs in this system, the loss of mass during dye tracing tests (as much as 90 %, e.g., Kozar et al. 2007; Doctor et al. 2011) indicates the relative importance of the diffuse flow system in the Opequon watershed.

The provenance of the old water in the Opequon watershed is uncertain. The numerical model under-predicts the percentage of old water in spring discharge relative to that estimated by the binary-dilution model, implying that either (1) deep high-conductivity faults are more numerous and extensive than presently mapped (a conclusion that is consistent with the under-prediction of simulated spring discharge), or (2) an additional source of old water exists within the aquifer that is not accounted for in the model (for example, old water residing in immobile zones within the carbonate rocks). An alternative 1D simulation was conducted based on equations presented in Neumann et al. (2008) to assess this possibility (see ESM). The 1D simulation represented (1) the transport of 3H and 3He along a flow path in the mobile zone, (2) decay of 3H to 3He in the mobile and immobile zones, and (3) the exchange of 3H and 3He between the mobile and immobile zones. The 1D simulation indicates that the exchange of young water with old water in the immobile zone cannot account for both the dilution of 3H and the apparent 3H/3He age observed in spring discharge, assuming that the model-derived estimate of effective porosity for carbonate rock (0.9 %) is correct. Exchange between mobile and immobile water could yield the observed dilution and apparent age, but only with mobile porosity values less than 0.04 %, which would yield transit times that are more than 20 times faster than those predicted with the numerical model. The 1D simulation results are not constrained by observations of head, flow and hydraulic properties, as in the numerical model, and are, therefore, unreasonable.

A refined simulation of groundwater flow in the Opequon watershed could be constructed using a solute transport model that represents both the exchange of water between mobile and immobile domains, and upward flow of water along fault zones. In such a model, the mobile domain would represent faster flow through the connected fracture network, while the immobile domain would represent slower flow through flooded cavities and poorly connected fractures where old (tracer free) water could reside. This approach could theoretically be extended to include additional domains such as conduits and the bedrock matrix, if data were available to warrant their inclusion. Presumably, this approach would better represent the observed percentage of old water in spring discharge. Such a model would be computationally demanding, however, and require the specification of additional parameters (for example transfer coefficients between mobile and immobile domains) whose values are unknown.

The Opequon model is designed to represent a steady-state flow condition in a system that is highly dynamic. It is likely that transient flow resulting from changes in recharge could produce changes in flow paths that would affect transport of the environmental tracers examined in this study. Water levels in wells can range more than 25 m, and spring discharge can vary by an order of magnitude, causing some springs to dry up when water levels fall (G.E. Harlow, US Geological Survey, unpublished data, 2012). Such changes in flow conditions would alter flow paths to springs and, therefore, modify the mixing proportions of young and old water. For example, a higher proportion of old water is observed in some springs in the Shenandoah Valley during dry conditions when water levels are lower (Nelms and Moberg 2010). Representation of transient conditions using observed changes in heads, flows and concentrations may better estimate advective mixing within the aquifer under changing flow regimes and is the subject of an ongoing study.

The geologic structure in the Opequon watershed is reasonably well represented in the model, but an important component is missing. The dip of bedding in carbonate rocks is assumed to be vertical, but actually varies from near horizontal to vertical, and differs on the east and west flanks of the watershed. It is not possible at the present time to incorporate this feature in a MODFLOW-based model that covers a domain as large as the Opequon watershed. Alternatively, a finite-element model (such as SUTRA, see Yager et al. 2009) could be employed, but at the expense of the particle-tracking capabilities that were used in this study. In addition, detailed structure that is revealed by geologic mapping (such as the deformation of bedding that wraps around the nose of folded anticlines) is not represented in the model. A more detailed groundwater flow model could be constructed if a more refined geologic model were available, and would produce more complicated flow paths than those produced by the model discussed herein. It is important to recognize that the modeling effort described here illustrates that simulated mass balance and transit times based on environmental tracer data can be adequately described using an EPM approach, yet the individual flow paths simulated by the model do not necessarily predict actual local-scale movement of contaminants through this karst aquifer to individual springs.

Conclusions

Equivalent porous-media (EPM) models that simulate flow and advective transport such as MODFLOW and MODPATH, can be used to reproduce environmental tracer concentrations observed in spring discharge from karst aquifers that are dominated by diffuse flow. In this study, the flow system in a 915-km2 folded and fractured karst aquifer in the Opequon watershed of the Shenandoah Valley USA was represented using an EPM model that includes anisotropy caused by bedrock structure and the influence of fault zones and springs. Groundwater in the karst aquifer flows mainly through fractured zones oriented parallel to bedding and sustains spring discharge with water that is nearly saturated with respect to calcite. The environmental-tracer data show that spring discharge can be approximated as a binary mixture of young water with a median age of 4.4 years and old (tracer-free) water. The numerical model produces a flow field in which younger water from recent recharge travels through a complex set of shallow flow paths and mixes with older water upwelling from deeper flow paths along fault zones. Tracer concentrations in simulated water mixtures discharged at model boundaries are comparable to measured concentrations of tritium, helium-3, CFC-113 and helium-4 measured in spring discharge.

Including environmental tracer concentrations in the model calibration reduced the uncertainty in estimated parameter values by a factor of four and revealed shortcomings in the initial model design. Analysis of the environmental tracer data using a binary-dilution model indicated the significance of vertical flow to and from the model domain, which had not previously been considered. Additional discrepancies between measured and simulated concentrations at two springs suggest that the actual groundwater divide is located southeastward of the specified model boundary. The inclusion of environmental tracer data allowed estimation of effective porosity values for the aquifer system and provided the information required to estimate the volume of water within the model domain (1.5 × 109 m3, equivalent to about 7 years of annual recharge). The inclusion of multiple environmental-tracer concentrations in the model calibration constrained the estimated parameter values within a reasonable range.

For the Opequon watershed, the simulated median age of young water discharged from 20 springs (5.7 years) is slightly older than the apparent 3H/3He age (4.4 years). The simulated fraction of old water in spring discharge is 0.076, which is about one half the fraction of old water estimated by a binary-dilution model derived from measured 3H and CFC-113 data. This discrepancy suggests the presence of more numerous and/or extensive zones of high hydraulic conductivity (faults and lineaments) than are presently mapped. The estimated porosity values of epikarst, carbonate rock and fault zones range from 1.5 to 0.4 % and yield a mean transit time of 6.7 years through the karst aquifer. The calibrated (K max:K min) anisotropy of carbonate rocks is about 20:1 and compares favorably with the value obtained through calibration of a regional groundwater-flow model of the Shenandoah Valley (17:1).