Introduction

Structural features such as folds, faults and fracture networks, have the potential to significantly influence regional-scale fluid flow. As a result, hydraulic properties derived from site-scale investigations may not provide an accurate representation of bulk properties when extrapolated to a regional scale. Characterizing the permeability at a regional scale where fracture distributions are heterogeneous, can be aided by defining hydrostructural domains. Such an approach uses regional-scale structural data to divide aquifers into structural domains (Ohlmacher 1999; Martin and Tannant 2004), which are delineated by changes in the geometry, orientation or distribution of fold and fault structures and associated fabric elements, with the premise that each domain is characterized by a unique set of hydraulic properties. The term ‘hydrostructural domain’, which has not been widely adopted within the literature is, in some respects, synonymous with ‘structural domains’, but has the explicit connotation of affecting the hydraulic properties of aquifers. The term ‘hydrostratigraphic unit’ is similarly used to represent differences in the hydraulic properties in porous (sedimentary) rocks. At a regional scale, defining hydrostructural domains on the basis of relative-fracture network permeability offers a means of characterizing aquifers at a regional scale where fracture distributions are heterogeneous (e.g., Turkey Creek Watershed, Colorado, Caine and Tomusiak 2003; Äspö Hard Rock Laboratory, Sweden, Winberg et al. 2003). The use of a hydrostructural domain approach should result, therefore, in more accurate characterization and quantification of regional groundwater resources for fractured rock aquifers.

Mackie (2002) used bedrock geology and mesoscopic structural features to divide the Gulf Islands, British Columbia, Canada into three hydraulically distinct hydrostructural domains. These domains are, in part, lithology-dependent and reflect changes in fracture intensity within the rock mass. They include: ‘highly’ fractured interbedded mudstone and sandstone (IBMS-SS) with fracture spacing <10 cm and an average fracture intensity of 8.0, ‘less’ fractured sandstone (LFSS) with fracture spacing >1 m and average fracture intensity of 1.0, and fault and fracture zones (FZ) with fracture spacing of <5 cm and average fracture intensities of 9.0 (Fig. 1). Fracture intensity is defined as the number of fractures per unit scanline length (L–1). The fracture measurements indicate that permeability likely increases closer to fault and fracture zones and within the highly fractured interbedded mudstone and sandstone domain.

Fig. 1
figure 1

Hydrostructural domain conceptual model for the southern Gulf Islands showing in order of decreasing relative potential permeability (k): fault and fracture zone domain (FZ), highly fractured interbedded mudstone and sandstone domain (IBMS-SS), and less fractured sandstone dominant domain (LFSS). Estimated fracture spacing for each domain is shown as well as typical dimensions at the outcrop scale. Figure not to scale (modified from Denny et al. 2007)

Previous work on Mayne Island, one of the four southern Gulf Islands, demonstrated that a hydrostructural domain model approach can be used to characterize the bulk hydraulic properties of the fractured rock aquifer at a small scale (Surrette and Allen 2007). In that work, the potential porosity and relative potential permeability for the island were estimated using a hybrid discrete fracture network-equivalent porous medium (DFN-EPM) modeling approach. The model results showed that the most permeable units on Mayne Island were the IBMS-SS and FZ hydrostructural domains with relative potential permeabilities on the order of 10–13 m2. This value is an order of magnitude greater than the value of 10–14 m2 obtained for the LFSS domain. These results suggest that the IBMS-SS domains are likely the dominant water transmitting aquifers on Mayne Island, a conclusion supported by drilling and geophysical observations (Allen et al. 2002). However, although these rock units are pervasive throughout the Gulf Islands, the estimates obtained using the DFN-EPM model for Mayne Island may not necessarily be representative of the regional nature of bulk hydraulic parameters throughout the Gulf Islands.

In this paper, a hydrostructural domain approach is used to characterize and quantify the regional nature of bulk hydraulic parameters for Galiano, Mayne, Pender and Saturna Islands, all members of the Gulf Islands (Fig. 2). The aim of the study is to test whether or not the hydrostructural domain approach implemented on Mayne Island can be equally applied across the Gulf Islands region, and to determine whether any regional trends in the hydraulic properties can be related to the regional scale structure. To do this, the hydraulic properties of each domain, specifically relative potential transmissivity and permeability, were derived using stochastically generated fracture data and DFN-EPM modelling. The heterogeneity of the fractured rock is interpreted within the context of the defined hydrostructural domains, and the bulk hydraulic properties for each island are estimated. Model-derived relative potential transmissivity values are compared to estimates of transmissivity from pumping tests at selected sites. The pattern of relative potential transmissivity and permeability across the four islands are then identified using cluster analysis. These patterns enable an assessment of the relationship between large-scale structural features and bulk hydraulic properties in the Gulf Islands.

Fig. 2
figure 2

Location map and regional setting of the southern Gulf Islands in this study

Regional context

The Gulf Islands are located within the Strait of Georgia on the southwest coast of British Columbia. The study focuses on four of the southernmost outer Gulf Islands: Saturna Island, Pender Island, Mayne Island and Galiano Island (Fig. 2). The islands range in area from approximately 20 to 60 km2 with elevations up to 400 m above sea level. The topography of the islands is controlled primarily by NW–SE trending ridges and valleys.

Regional scale groundwater characterization is important in the Gulf Islands, as groundwater is generally the sole source of potable water for domestic and agricultural use in a region with very low recharge rates and fractured, low-yielding bedrock aquifers. Groundwater is derived primarily from the fractured sedimentary rocks of the Late Cretaceous Nanaimo Group (∼91−66 Ma). The Nanaimo Group is divided into eleven formations, which comprise mainly alternating sequences of interbedded sandstone-dominant and mudstone (or siltstone)-dominant formations (Table 1). The contacts between the formations are typically transitional and are usually characterized by the presence of interbeds. While massive sandstone units are common, massive mudstone units are not; mudstone is typically interbedded with sandstone. The primary porosity of the Nanaimo Group is low and is considered to be of minor importance in the storage and transport of groundwater (Dakin et al. 1983; England 1990). As a result, permeability is thought to be derived primarily from fractures.

Table 1 Lithofacies descriptions of the Nanaimo Group strata examined in this study (geological descriptions summarized from Mustard 1994) and hydrostructural domain designations

Four major tectonic features are present in the sedimentary rocks of the Nanaimo Group and these provide the secondary porosity necessary for groundwater movement. From oldest to youngest, these structural features include: (1) northwest trending, northeast dipping thrust faults associated with compression by a southwest vergent thrust system (England and Calon 1991; Mustard 1994; Journeay and Morrison 1999), (2) northwest-trending buckle folds and associated northeast-vergent thrust faults resulting from southeast-directed extension (Journeay and Morrison 1999), (3) shallow dipping features representing bedding plane partings, which are suggested to have formed during uplift and/or isostatic rebound after deglaciation (Mackie 2002) and, (4) northeast trending fault and fracture zones as well as bedding perpendicular jointing, which are present at a local scale (Mustard 1994; Mackie 2002).

The hydrostructural domains defined by Mackie (2002) were defined on the basis of fracture intensity measured in the sandstone (LFSS), the interbedded mudstone and sandstone (IBMS-SS), and in association with faults and fracture zones (FZ) as illustrated in Fig. 1.

Methods

Pumping test data

A total of 29 pumping tests were selected from a previous study to obtain field-based transmissivity values for the aquifers on three of the southern Gulf Islands (Allen et al. 2003). Only well sites in close spatial proximity to fracture data collection sites were selected to provide a reasonable comparison between field-based and simulated aquifer properties. The 29 well sites are situated on three of the four Gulf Islands in this study; the majority of the sites (22 sites) are located on Galiano Island (Fig. 3). Pumping test data were not available for Pender Island. Twenty-seven of the pumping tests were conducted in wells completed in the sandstone-dominant LFSS domain, and only two were completed in the mudstone-dominant IBMS-SS domain. Many of the wells tested had been drilled close to fracture zones within the LFSS in an effort to increase well yield. As a result, there may be a bias in the pumping test data towards the LFSS domain as well as potentially inflated hydraulic properties relating to fracture zones.

Fig. 3
figure 3figure 3

Simplified geologic maps for a Galiano Island, b, Mayne Island, c Saturna Island, and d Pender Island showing the sandstone dominant (LFSS) and mudstone dominant (IBMS-SS) hydrostructural domains of the Nanaimo Group. Fracture data collection sites are shown (2004–2005 signifies fracture data collected for this study (e.g., G7-8); fracture data collected in a previous study are denoted as 2002 and identified with an asterisk, e.g., 148*) with corresponding equal-area projections of poles to fractures (n number of measurements; contour interval is 2σ). Water well sites used for pumping test analysis are also shown. No pumping test data were available for Pender Island

Hydraulic data obtained from the pumping tests were analyzed and reported by Allen et al. (2003) using the derivative method to identify periods of radial flow (Bourdet et al. 1984; Allen 1999). Transmissivity values, calculated from both the pumping and recovery data using the Theis and Theis Recovery Methods (Theis 1935; Kruseman and de Ridder 1994), yielded consistent values of transmissivity. Therefore, only the transmissivity values calculated from the pumping period data were used for this study.

Fracture data

Using Mackie’s (2002) conceptual model for the southern Gulf Islands, representative exposures for each hydrostructural domain present in the study area were selected through field reconnaissance. A total of 87 outcrops comprise the composite dataset (Fig. 3). Fracture orientation, intensity, aperture and trace length were sampled using a scanline technique. In this technique, a graduated tape measure is stretched across an outcrop face at near right angles to major fracture sets to avoid scanline orientation bias (Caine and Tomusiak 2003). Orientation bias can result in complete sub-horizontal fracture sets being missed in the sampling. In this study, exposures with at least two near-orthogonal faces were targeted to capture fractures that were sub-parallel to one of the outcrop faces. This method further reduces scanline orientation bias as fractures sub-parallel to one face will be recorded on the second face (Terzaghi 1965; Baecher 1983). The average scanline length was 30 m, but many were considerably smaller, particularly in the finer grained materials.

Drill core data were not available for this study. Although drill cores provide a solution to measuring subsurface fracture sets, logistical restraints often result in surface fracture expression being the only viable option for estimating subsurface densities (e.g., Caine and Tomusiak 2003; Neuman 2005). Thus, fracture measurements taken at the surface were used as a proxy for the subsurface. In adopting this proxy it is recognized that fracture sets measured on outcrop surfaces do not completely characterize their distribution in three dimensions, especially in the subsurface (Mauldon et al. 2001). Fractures that appeared to be closed at the surface (due to mineralization or simply because they had no observable aperture) were measured, but were not included in the analysis.

A Watson-Williams’ test-of-means for spherical data was conducted on the composite dataset to compare fracture orientation measurements from the three hydrostructural domains in order to determine equivalency of fracture orientation within each domain, and to distinguish the different domains. The method, as proposed by Mardia (1972), converts azimuth and dip fracture measurements to polar coordinates for the calculation of a resultant vector. The resultant length of that vector is a measure of the concentration around a mean direction, if one exists. The calculated resultant lengths are used in an F-approximation to test if the mean fracture orientations of the two data sets are statistically different or not (Mardia 1972). The null hypothesis for the F-test states that the mean directions from two samples are not significantly different, and this hypothesis is rejected when the calculated F-statistic is greater than the critical value for a desired level of significance. Critical values at the 95% confidence interval were obtained from significance tables published by Fisher et al. (1987). Using this confidence interval, the Watson-Williams’ test-of-means results showed that fractures within the same hydrostructural domain tend to have similar fracture orientation, and that these orientations are statistically different from those in other hydrostructural domains on the same island. An interesting finding is that, while the hydrostructural domains had originally been defined based on fracture intensity, the statistics suggest that fracture orientation can also be used to distinguish between domains.

Model construction

The FracSys module of the FracMan package was used for a statistical analysis of the fracture data (Dershowitz et al. 1998). A workflow for the modeling process is shown in Fig. 4. The FracSys algorithm, encompassing a series of sub-algorithms, uses a forward modeling approach to determine the statistical distributions for fracture orientation and trace length that are a ‘best-fit’ of the field data (Dershowitz 1995).

Fig. 4
figure 4

Flowchart outlining the DFN-EPM modeling process. Best-fit probability density functions are determined for fracture orientation data (a), as well as for the fracture trace length estimates (b). Fracture sets are simulated within a DFN model domain (c). An EPM grid is applied to the DFN model (d). Flow is applied according to a simple geographic coordinate system (E). NS north–south; EW east–west; TB top to bottom

For fracture orientation, an algorithm assigns fracture data to sets and determines the probability density functions that best describe those sets. The premise is that the sets should be groups of fractures with similar properties (Dershowitz et al. 1996). Fracture sets can be distinguished by a number of parameters including orientation, mineralization species, relative ages, length and morphology (Caine and Tomusiak 2003). In this study, fracture orientation was the sole parameter used for set designation. Weighting factors of other fracture characteristics such as mineralization, were set to zero. A soft sector convergence method was specified, which increases the probability of the assignment of fractures to sets with mean poles in the same sector. A sector is defined by the midpoint between set mean pole vectors (Dershowitz et al. 1998).

An initial approximation of the orientation distribution that best describes each fracture set, and the mean direction and dispersion factors for each of the distributions were defined in terms of a mean pole vector estimated through parametric analysis using SpheriStat (Pangaea Scientific 1998). The initial guess of the best-fit probability density function for each fracture set was determined through visual inspection of the symmetry of the fracture data on equal area Schmidt nets and also a trial and error approach. Five types of probability distributions for directional data are possible: the univariate and bivariate Fisher distribution, the bivariate normal distribution, the bivariate Bingham distribution, and the spherical Dirac delta constant distribution.

The distribution of properties for the fractures assigned to each set is calculated, and then fractures are re-assigned to sets according to probabilistic weights proportional to their similarity to other fractures in the set. The properties of the sets are then recalculated, and the process repeated iteratively until the set assignment is optimized. Optimization is achieved when the goodness-of-fit between the measured and simulated field observations is statistically significant using two-sample Kolmogorov-Smirnov (K-S) test. A K-S test determines whether two underlying one-dimensional probability distributions differ, or whether an underlying probability distribution differs from a hypothesized distribution, in either case based on finite samples. The two-sample K-S test is one of the most useful and general nonparametric methods for comparing two samples, as it is sensitive to differences in both location and shape of the empirical cumulative distribution functions of the two samples. A statistical significance of 0.1 (90% confidence) was sought, although not always achieved, as some data were difficult to fit to such a high significance level. The results indicate that fracture set orientations are typically best-fit with Fisher (Fisher 1953; Davis 2002) and Bivariate Bingham (Dershowitz et al. 1998) probability distribution functions. K-S test results range from 32.5 to 100%, with an average of 80.2%.

For fracture length, an algorithm is used to determine the distribution of fracture radii that gives the best match to the fracture trace length data collected in the field. The sampling process is simulated by recording fracture sizes encountered within a specified survey window size; window size is defined by the scanline length multiplied by the transverse length (Dershowitz et al. 1998). The method accounts for fracture censoring, truncation and sampling bias by using a simulated sampling method, and similar to the orientation algorithm, requires an assumed distribution of the fracture radii to begin the simulation process. The probability distributions for scalar data supported by the algorithm are the uniform, exponential, power law, normal, lognormal, gamma and Dirac delta constant distributions. An automated search for the best-fit match between the simulated sample distribution and the actual data is done using a simulated annealing optimization algorithm (Dershowitz et al. 1998), which generates five realizations of the assumed distribution for each iteration in an effort to improve the match between the simulated and observed distributions. The simulations are repeated until a satisfactory match is obtained, as determined by the K-S statistic. Similar to above, a statistical significance of 0.1 was considered acceptable. The majority of the fracture trace length data are log-normally distributed; however, some of the data are better approximated by exponential, power law and constant distributions. K-S test results range from 51.7% to 100% with an average of 84.1%.

Discrete fracture network (DFN) models were constructed for each fracture data collection site on the four islands using FracWorks XP (Golder Associates Inc. 2004). Each DFN model domain was constructed as a 20 × 20 × 20-m cube, and populated with fractures according to the orientation and trace length probability distributions (Fig. 4). The model domain size was determined based largely on computational efficiency, as solutions could not be obtained for larger domains in the more densely fractured materials. For most of the sites, the domain size corresponds with the scanline mapping scale, and likely approximates the representative elementary volume (REV). However, for some simulations, a 100 × 100 × 100-m cube would have been ideally used based on maximum fracture lengths of about 50 m measured in some outcrops.

The DFN models were then calibrated to field fracture data using a simulated scanline within the model domain. The simulated scanline maintains the same orientation (trend/plunge) as the field, thereby removing the need to correct for fractures not perpendicular to the scanline, while maintaining the same fracture spacing distribution. During this process, the number of fractures per unit scanline length is first converted to volumetric fracture intensity, resulting in a fracture surface area per unit volume. A simulated scanline is then generated, and the fracture set intensity along the simulated scanline compared to field values. The modeled fracture set is regenerated until the simulated fracture intensity matches the observed fracture set intensity with a relative error less than 10%. Ultimately, the best-fit DFN model for each site was identified.

In conducting a DFN analysis, fracture network geometry and fracture transmissivity distribution are the key parameters (NRC 1996). A number of assumptions had to be made in respect of modelling parameters simply because insufficient data were available from the subsurface (i.e., core data and packer tests) to adequately define these parameters. This is a common limitation of outcrop-based studies. However, the values used were within the ranges used in other studies and provide a baseline for comparison of potential permeability values across the region.

The simulated fractures were modeled as smooth, parallel-walled, hexagonal fractures with no elongation (an aspect ratio of one). This simplification was required as fracture characteristics estimated at surface do not necessarily provide a good estimate of conditions at depth due to the effects of weathering. Fracture spacing was described by an enhanced Baecher model (Dershowitz et al. 1998) following a Poisson process, as field observations suggested that the fractures were randomly distributed with no clear tendency towards clustering. Fracture set termination percentages, calculated from field observations, were honoured statistically in the DFN models. A uniform fracture aperture of 100 μm (0.0001 m) was assumed for all fractures in the model, even though applying a uniform fracture aperture is not thought to accurately represent natural fractures (Liu 2005; Neuman 2005). The value selected is nonetheless consistent with estimates of fracture aperture used in other outcrop-based studies (e.g., Caine and Tomusiak 2003). Furthermore, employing a uniform aperture eliminates one variable and facilitates a comparison of the hydrostructural domains based solely on orientation and fracture length data. In a sense, this provides a standardized approach.

Transmissivity values for individual fractures, typically acquired through packer testing, were not available for this study as domestic wells were inaccessible. Therefore, a constant transmissivity value of 6.24 × 10–7 m2/s, calculated using the cubic law (e.g., Snow 1968) using the assigned aperture, was assumed for all the fractures in the model.

The best-fit DFN model for each site was transformed into a 0.2 × 0.2 × 0.2-m MODFLOW grid (1 million grid cells) that contains cell-specific hydraulic properties calculated using the Oda Analysis process (Fig. 4) (Oda 1985). Grid spacing was selected as a compromise between a reasonable grid refinement and processing capacity. The MODFLOW platform GMS (EMRL 2005) was then used for the continuum modeling. A custom code was written to transform the FracMan output file into a format that GMS recognizes.

Surrette and Allen (2007) tested the significance of the two assumptions expected to most likely influence the results (i.e., fracture aperture and EPM grid size) using a factorial analysis approach fashioned after Starzec and Andersson (2002). They show that the chosen fracture aperture and equivalent porous medium-grid-spacing end members, as well as their interactions, do not significantly affect the difference between relative potential permeability of the model used and the sensitivity analysis models (representing variations in fracture aperture and grid size). Thus, the relative potential permeability estimates, obtained using the model variables, provide reasonably accurate approximations.

Parameter estimation

Estimates of relative potential fracture network permeability require flow simulations through the fracture networks. To do this, steady-state flow was simulated between two opposing faces of the hybrid discrete fracture network-equivalent porous medium (DFN-EPM) model. Boundary conditions were assigned according to a simple geographic coordinate system after Caine and Tomusiak (2003). An arbitrary, uniform hydraulic head gradient of 0.01 m/m was applied to the model using specified head boundaries at opposing cube faces in each of the principal geographic directions (Fig. 4). Zero flux conditions were specified on the remaining four model faces. This approach results in three flow simulations for each model (N–S, E–W, and top to bottom). In conducting flow simulations in this fashion, an attempt is made to standardize the flow field so as to determine if there are relative differences in the potential permeability between the different domains, and identify any trends that might exist at the regional scale. Any differences in relative potential permeability will accentuate the variability in fracture orientation (hence the term ‘relative’) throughout the region. To determine the actual permeability tensor would require construction of a flow field for an infinite number of directions. This problem was tackled for one island in the study area (Surrette and Allen 2007).

The volumetric fluid flux is computed along each specified head boundary, and then used to calculate the equivalent bulk potential permeability for each model domain in each direction using Darcy’s Law:

$$k_{{\text{p}}} = \frac{\mu }{{\rho g}}\frac{Q}{{IA}}$$
(1)

where Q is the simulated volumetric flow rate (L3/T), I is a specified hydraulic gradient (dimensionless), A is a specified cross sectional area (L2) across which Q flows, ρ is fluid density (M/L3), g is the acceleration due to gravity (L/T2), and μ is the dynamic fluid viscosity (M/LT). In this study, the values of relative potential permeability are dependent on the aperture and transmissivity distributions assigned to each fracture in the DFN model, and were not constrained by site-specific hydraulic data.

The estimation of relative potential transmissivity for DFN-EPM models situated close to hydraulic test well sites allows for a comparison between field-based and model-simulated transmissivity values. Potential transmissivity is estimated from hydraulic conductivity (K), which is related to potential permeability (Eq. 1) obtained from the DFN-EPM model results:

$$K = k_{{\text{p}}} {\left( {\frac{{\rho g}}{\mu }} \right)}$$
(2)

where k p is the simulated potential permeability (L2), ρ is fluid density (M/L3), g is the acceleration due to gravity (L/T2), and μ is the dynamic fluid viscosity (M/LT) (Fetter 1994). The calculated hydraulic conductivity is, in turn, used to estimate the model potential transmissivity (T p):

$$T_{{\text{p}}} = Kb$$
(3)

where K is the hydraulic conductivity calculated from the hybrid DFN-EPM model (L/T) and b is the saturated thickness of the aquifer (L). The value of b for bedrock aquifers is assumed to be equal to the length of the open interval (i.e., the depth of the well minus the depth of casing). For this study, the open interval of a well nearest to a model site was used in the calculation of potential transmissivity for that model.

Cluster analysis

Cluster analysis (CA) is a classification method that is used to arrange a set of observations within a population into homogeneous ‘like’ groups (Davis 2002). The aim of CA is to minimize differences between observations within a cluster and maximize differences between observations contained in other clusters. Hierarchical CA achieves this by defining a distance measure (similarity measure) between every pair of data points. A squared Euclidean distance measure of similarity is used in this study, which gives the straight-line distance between two observations:

$$d_{{{\text{x,y}}}} = {\sum\limits_{{\text{i}} = 1}^n {{\left( {x_{{\text{i}}} - y_{{\text{i}}} } \right)}} }{}^{2}$$
(4)

where d x,y is the sum of the squared distances between object x and object y. A large distance signifies dissimilarity between two observations. In agglomerative hierarchical CA, each observation is initially considered as a separate cluster. Two observations with the smallest distance measured and, thus, the highest similarity are combined, and the distance is recalculated. This process is reiterated until all observations have been agglomerated into a cluster.

Ward’s method (Ward 1963) was used in this study. This agglomerative hierarchical method determines the order of clustering of groups containing more than one observation. Ward’s method iteratively joins two clusters that minimize the sum of squares within a cluster, while maximizing the sum of squares between clusters. The sum of squares within a cluster is defined as:

$$SS_{{\text{k}}} = {\sum {{\left( {x_{{{\text{ik}}}} - \overline{x} _{{\text{k}}} } \right)}{}^{2}} }$$
(5)

where SS k is the sum of squares within a cluster k, \(\overline{x} _{{\text{k}}} \) is the centroid of a cluster k, and x ik is an individual object within the cluster (Davis 2002). The sum of squares between clusters 1 and 2 is:

$$SS_{{1,2}} = {\sum {{\left( {\overline{x} _{1} - \overline{x} _{2} } \right)}} }{}^{2}$$
(6)

As a result, Ward’s clustering joins clusters that maximize between-group variability, while keeping the with-in group variability to a minimum. The relative distances between the clusters are displayed on a dendrogram plot where the number of ‘like’ groups is visually identified by breaks in the dendrogram.

Cluster analysis was used in this study to provide a less subjective way of separating model results (e.g., relative potential permeability) into relatively homogeneous subgroups. Following clustering of the model data, breaks in the dendrograms were visually identified and the number of ‘like’ clusters determined. The spatial distributions of the ‘like’ clusters were then mapped in ArcGIS 8.0.

Results and discussion

Pumping test data

The pumping test analysis results show that measured transmissivity values vary spatially throughout the Gulf Islands. Maximum values were obtained on Saturna Island and minimum values on southeast Galiano Island (Table 2). The transmissivity values from four well sites located on the southeast and northwest ends of Saturna Island within the LFSS domain range from 7.2 × 10–5 to 1.5 × 10–5 m2/s with a geometric mean of 4.3 × 10–5 m2/s. The transmissivity values from three pumping tests conducted on northwest Mayne Island within the LFSS domain range from 2.5 × 10–5 to 1.2 × 10–5 m2/s with a geometric mean of 1.9 × 10–5 m2/s. On Galiano Island, twenty wells are completed in the sandstone-dominant LFSS domain and have transmissivity values ranging from 4.3 × 10–5 to 3.4 × 10–7 m2/s with a geometric mean of 3.9 × 10–6 m2/s. Only two pumping tests were available within the mudstone-dominant IBMS-SS domain at the southeast end of Galiano Island. Transmissivity values were 3.6 × 10–5 and 2.0 × 10–5 m2/s with a geometric mean of 2.7 × 10–5 m2/s.

Table 2 Transmissivity values derived from the analysis of pumping test data using the Theis method (after Allen et al. 2003)

To better interpret the pumping test transmissivity results, a cluster analysis (CA) was conducted. Four clusters were identified (Fig. 5a). The range in transmissivity is not significant, with geometric mean values ranging from 2 × 10–6 to 7 × 10–5 m2/s (Table 3). Mapping the transmissivity values identified in each cluster suggests a weak spatial trend of increasing transmissivity to the southeast from Galiano Island to Saturna Island (Fig. 6). Deviations from this relationship are interpreted to be the consequence of elevated transmissivity resulting from localized second-order fracture zones and geologic contacts. These high values are derived from pumping tests conducted in domestic and sub-division wells that were drilled preferentially close to fracture zones in an effort to increase well yield. For example, well sites on mid-Galiano Island have relatively ‘high’ transmissivity associated with a mapped northeast trending fault zone (Fig. 6). Because of this drilling bias, some of the measured transmissivity results are inflated. As a result, not only do the values shed ambiguity on the regional trend, but they may not be representative of the LFSS domain overall.

Fig. 5
figure 5figure 5figure 5

Cluster analysis dendrograms showing the division of the clustered data into three clusters a pumping test transmissivity, b potential transmissivity, and c potential permeability. Statistically, the values in each cluster are similar

Fig. 6
figure 6

Spatial distribution of the clustered field-based transmissivity data with a clustered data points in relation to hydrostructural domains, and b clustered data points highlighted to show spatial pattern overlain on shaded relief map

Table 3 Cluster analysis results for the pumping test transmissivity values

The occurrence of elevated transmissivity within fault zones is consistent with the regional stress model proposed for the Gulf Islands by Journeay and Morrison (1999), which suggests that northeast trending faults lie within a zone of extension. Consequently, these fractures likely have open apertures and, therefore, increased measured transmissivity. In fact, higher well yields are commonly reported to occur in the vicinity of these northwest-trending structures, hence the targeted drilling into fracture zones.

Potential transmissivity

Estimates of relative potential transmissivity calculated from 12 DFN-EPM models situated close to pumping test well sites range from 7.78 × 10–6 to 1.74 × 10–4 m2/s (Table 4). The relative potential transmissivity values estimated from the three model sites on the southeast and northwest ends of Saturna Island (LFSS domain) gave a geometric mean of 4.6 × 10–5 m2/s. The two models situated within the LFSS domain on northwest Mayne Island yielded a geometric mean transmissivity value of 1.1 × 10–5 m2/s, and a single model within the FZ domain, also on northwest Mayne Island, gave a geometric mean transmissivity of 7.58 × 10–5 m2/s. On Galiano Island, the three models situated in the LFSS domain yielded a geometric mean transmissivity of 8.6 × 10–6 m2/s. A geometric mean transmissivity of 1.3 × 10–4 m2/s was calculated for the two models located in the IBMS-SS domain on southeast Galiano, and 1.74 × 10–4 m2/s for a model in the FZ domain, also on southeast Galiano. The general trend of the estimated model transmissivity values is an average increase in transmissivity from northwest to southeast, emulating the subtle trend identified from pumping test analysis.

Table 4 Model simulated potential transmissivity results

A cluster analysis on the DFN-EPM modeled data also identifies a pattern of southeastward increasing relative potential transmissivity values (Fig. 5b). The analysis identified three clusters with geometric mean transmissivity values of 1 × 10–4, 4 × 10–5 and 9 × 10–6 m2/s, respectively (Table 5). Although a map of the spatial distribution of these values is not well defined, the underlying trend does reflect the pumping test results and also shows a subtle pattern of increasing transmissivity to the southeast along the chain of islands (Fig. 7). Of the 12 model sites for which relative potential transmissivity estimates were obtained, two are located within the FZ domain and two within the IBMS-SS domain on Galiano Island (Fig. 7). Therefore, the DFN-EPM simulated relative potential transmissivity values are likely biased towards these highly fractured domains. Despite these biases, the similarity in magnitude and spatial distribution of the measured aquifer transmissivities indicates that the DFN-EPM models are representative of the aquifers of the southern Gulf Islands at a regional scale.

Fig. 7
figure 7

Spatial distribution of the clustered model-derived potential transmissivity data with a clustered data points in relation to hydrostructural domains, and b clustered data points highlighted to show spatial pattern overlain on shaded relief map

Table 5 Cluster analysis results for the model-derived potential transmissivity values

Potential permeability

Due to the low number of sites used to estimate relative potential transmissivity from DFN models (12 situated near sites where pumping tests were conducted), the entire Gulf Islands fracture database was analyzed to provide a more comprehensive view of the results.

Estimates of relative potential permeability throughout the Gulf Islands indicate an overall increase in value to the southeast from Galiano Island to Saturna Island (see Table 8 in Supplementary electronic material). The permeability of the LFSS domain increases to the southeast of Galiano Island, with domain geometric means increasing from 2.2 × 10–14 to 9.6 × 10–14 m2 (Table 6). The distribution and degree of permeability within the IBMS-SS and FZ domains are comparable, and show an increasing trend in permeability from Mayne Island to Saturna Island. The permeability of the IBMS-SS domain increases from 1.1 × 10–13 m2 on Mayne Island to 2.5 × 10–13 m2 on Pender Island (Table 6). The mudstone-dominant sites on Saturna Island were not included in the IBMS-SS analysis as the Saturna sites are in close proximity to a major fault zone and, therefore, were included within the FZ domain. The FZ domain potential permeability also increases to the southeast from 1.2 × 10–13 to 3.8 × 10–13 m2 along the axis of the island chain (Table 6).

Table 6 Summary of potential permeability results for the hybrid DFN-EPM models

Cluster analysis on all of the relative potential permeability estimates identified four clusters (Fig. 5c). The four clusters range in potential permeability from 3 × 10–14 to 9 × 10–13 m2 (Table 7). The spatial pattern of the potential permeability clusters is better defined compared to that of potential transmissivity simply due to the larger database used in the analysis.

Table 7 Cluster analysis results for the potential permeability values

The southeastward increase in the magnitude of relative potential permeability is interpreted to be synchronous with an increase in island scale fracturing. This pattern is likely related to the greater proportion of the IBMS-SS hydrostructural domain on Saturna Island relative to the dominance of the LFSS domain on northwest Galiano Island. It is further suggested that the increase may be partially controlled by structures that are more aligned in the extensional domain of regional crustal stresses along the active plate margin, possibly resulting in open and/or wider fracture apertures. As indicated earlier, fractures that appeared to be closed at the surface (either due to mineralization or simply because they had no observable aperture) were not included in the analysis. Thus, increases in relative potential permeability should reflect the greater percentage of open fractures included in the models.

Potential fracture network porosity (n) was also calculated directly from the DFN model results as the ratio of the fracture void volume obtained by the model. It is a measure of the volume of fracture void space per unit volume of rock for the simulated geometric fracture network (Caine and Tomusiak 2003). Potential porosity provides a measure of the amount of groundwater storage that is potentially available within the fracture network. Unlike relative potential permeability, potential porosity does not require flow simulations through the discrete fracture networks. As a result, potential porosity is not influenced by artificially high values that can be the result of model flow direction paralleling the mean fracture orientation (i.e., N–S, E–W and top to bottom).

The simulated values of potential porosity for the Gulf Islands range over three orders of magnitude; from 0.004 to 0.2% for an assumed fracture aperture of 100 μm (see Table 8 in Supplementary electronic material). Similar to the spatial pattern identified from the pumping test and model-derived transmissivity data, the island scale mean potential porosity of the LFSS domain decreases toward the southeast, from 0.01% on Galiano Island to 0.03% on Saturna Island. Both the IBMS-SS and FZ domains also show the same trend as the LFSS domain, with values increasing from 0.04 to 0.06% and 0.04 to 0.13%, respectively.

Relative potential permeability estimates do not appear to be an artefact of the geographic coordinate flow directions used in this study (Fig. 8). However, fractures that are oriented parallel to a model-induced flow direction do show increased permeability in that direction. For example, the identified fault zone on Saturna Island (Fig. 8) trends approximately NE–SW and also shows the greatest permeability in this direction. Although the relative magnitude of permeability changes slightly between the three flow directions, the same general relationship remains the same.

Fig. 8
figure 8

Graphs showing changes in model permeability for each of the geographic coordinate flow directions a N–S, b E–W, and c top to bottom, with respect to the topographically driven groundwater flow angle relative to north

Measured and modelled values of potential permeability appear to increase with proximity to the hingeline of a regional fault propagation fold, and with proximity to a younger system of north and northeast-trending high-angle brittle faults and associated fracture networks. The sub-linear zone of high permeability identified in the cluster analysis trends approximately NE–SW, along the axis of the fault propagation fold structure (see for example Fig. 9). As fracture densities are known to increase in the hinge zones of fold structures, (NRC 1996), it is reasonable to expect that the highest porosity and permeability would also be found here. As a result, the spatial patterns observed in this study suggest that the regional fracturing and stress regime associated with the folding may have had a significant influence on permeability values in the Gulf Islands. This influence can be seen in decreasing permeability values observed progressively down the limbs of the fold, away from the antiformal hinge zone (Fig. 9). More detailed fracture measurements are needed to confirm this relationship.

Fig. 9
figure 9

Spatial distribution of the clustered potential permeability data with a clustered data points in relation to hydrostructural domains; and b clustered data points highlighted to show spatial pattern overlain on shaded relief map

Conclusions

Results of this study demonstrate that a hydrostructural domain approach can be useful for characterizing and quantifying regional scale relative potential transmissivity and permeability patterns using discrete fracture network-equivalent porous medium (DFN-EPM) modeling. Cluster analysis on the transmissivity values derived from pumping tests identified four clusters with geometric means of 7 × 10–5, 4 × 10–5, 2 × 10–5, and 2 × 10–6 m2/s. Elevated transmissivity values are detected near localized second-order fracture zones and geologic contacts. Cluster analysis on model-derived relative potential transmissivity values, at the 12 sites where pumping tests had been conducted, identified three clusters with geometric mean transmissivity values of 1 × 10–4, 4 × 10–5, and 9 × 10–6 m2/s. Overall, the model-derived potential transmissivity values compare well with the transmissivity values obtained from pumping tests. In both modelled and observed data, there is a weak, but consistent, pattern of increasing transmissivity towards the southeast along the chain of islands, from Galiano Island to Saturna Island.

Cluster analysis on relative potential permeability values obtained from a larger dataset for the region identified four clusters with geometric means of 9 × 10–13, 4 × 10–13, 2 × 10–13, and 3 × 10–14 m2. The general trend is an increase in relative potential permeability toward the southeast, emulating the trends identified in the site-specific analyses. There is also a spatial correlation between increased model-derived relative potential permeability values and proximity to the hinge of a large-scale antiformal fault-propagation fold structure running parallel to the islands and superimposed high-angle brittle fault structures. The similarity in hydraulic properties derived from direct field measurement and modeling serves to validate the hydrostructural domain approach to quantifying hydraulic properties. As a result, the method used in this paper may have applications to other areas where groundwater resources in fractured rock aquifers are of interest.