Introduction

Excess nutrient contamination released to water ways in the USA continues to be a concern for drinking water and aquatic ecosystems because it triggers eutrophication, which fosters the growth of toxic algal blooms, hypoxia in water bodies, and the loss of aquatic biodiversity (Dubrovsky et al. 2010; Jessen et al. 2015; Lopez et al. 2008). For example, non-point source nutrient export from agricultural, urban, and industrial landscapes has led to the creation of a hypoxic, or “dead zone” in the Gulf of Mexico associated with the outflow of the Mississippi River delta (Dubrovsky et al. 2010; Howarth 2008; HTF 2008; Turner et al. 2006; Turner et al. 2012). Given the increased intensity and amount of rainfall predicted to occur from future climate change in the upper Mississippi basin, nutrient contamination of waterways, particularly from nitrogen, will likely increase and require reduction strategies that are difficult to meet without economic hardship (Sinha et al. 2017; Wu and Tanaka 2016). The scale and economic difficulties of nutrient reduction have led researchers to identify critical source areas (CSAs) that are prone to produce excessive nutrients per unit area of watershed, which allows effective prioritization of manageable clean-up targets (Ghebremichael et al. 2013; Giri et al. 2016; Qiu 2009). CSAs are generally classified as areas with a high likelihood to produce large amounts of runoff and sediment due to climate, slope, collection area, low soil permeability, agricultural or urban land cover, and high erosion potential (Giri et al. 2016; Huang et al. 2015). It has been found that CSAs can cover as little as 5% of a watershed’s area, but yield as much as a fifth of total nutrient loads (White et al. 2009).

Central Kentucky can be considered a CSA due to the prevalence of low permeability soils, developed and agricultural land cover, and relatively fast groundwater flow due to fractures, bedding planes, and karst (Crain and Martin 2009; Currens and Graham 1993; Husic et al. 2017; Ray et al. 1994; Wolock et al. 2004). This region discharges into the Ohio River, which is responsible for approximately 41% of total nitrogen load from the Mississippi River system (HTF 2008). Central Kentucky likely contributes to this amount significantly, for example, the Kentucky River at Lock 2 has been measured to contribute 11,339 ± 913 metric tons of total nitrogen and 1088 ± 274 metric tons of total phosphorus (Crain and Martin 2009). Because higher order rivers, such as the Mississippi River, have been found to have low nutrient retention and act more as a nutrient conduit, current research suggests that nutrient reduction strategies should be targeted within headwaters where retention may occur (Gomi et al. 2002; Loken et al. 2018; Peterson et al. 2001). Nutrient reduction strategies in this region could lead to impactful improvement in water quality on small to large scales, but more data defining the timing and processes of nutrient loading from headwaters is needed in order to inform these efforts.

The role of groundwater on the spatiotemporal distribution of nutrient retention and export in small upland catchments has been underscored recently in addition to the controls of variable discharge and channel-sediment interactions (Bernal et al. 2015; Burrows et al. 2018; Siebers et al. 2016; von Schiller et al. 2017). Small upland catchments have variable surface flows depending on climate, connection to groundwater, and geologic properties causing many to be intermittent or ephemeral with spatially heterogeneous hydrologic connectivity (Costigan et al. 2016; von Schiller et al. 2017). Nutrients such as dissolved phosphorous are correlated strongly with stream discharge during storm events that can account for over 70% of phosphorous export despite the fact that storm flow only occurs ~ 10% of the time due to the mobilization of sediment from channels, tile drains, and soils (Gentry et al. 2007; Pionke et al. 1999; Pionke et al. 1996; Royer et al. 2006). Despite the strong connection between nutrient export and rainfall-runoff relationships, groundwater can contribute to the nutrient export in small upland catchments as a result of both seasonal fluctuations in regional gradient and transient activation of shallow perched systems (Zimmer and McGlynn 2017, 2018). Longitudinal trends in nutrient concentration along streams can be a result of dynamic groundwater-surface water interaction and in-stream processes, where groundwater often controls nutrient transport during dry periods by sustaining overland flows, adding nutrients, diluting surface water nutrient loads, and contributing carbon (Bernal et al. 2015; Burrows et al. 2018). Nitrogen export from upland watersheds has been shown to be sensitive to groundwater patterns such as seasonal seep dynamics, riparian residence times, and lithology (Butturini et al. 2003; O’Driscoll and DeWalle 2010; Valett et al. 1996). Agricultural nitrogen in shallow groundwaters has made base flow a potentially dominant mechanism for nitrogen export, where base flow periods can account for ~ 60% of nitrate export (Burow et al. 2010; Pionke et al. 1996). Understanding how groundwater with elevated nutrients controls base flow exports in small upland catchments could help to focus ongoing nutrient management issues in central Kentucky (McAlister et al. 2010; Carey et al. 1994; Kentucky River Watershed Watch 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017).

For mixed land cover (MLC) catchments, the addition of anthropogenic point and non-point sources along with complex groundwater and runoff processes makes it difficult to identify critical nutrient sources as a number of processes combine to produce the overall nutrient export signal (Coulter et al. 2004; Lee et al. 2016). Agricultural land cover often may account for the majority of nutrient export due to agrochemical use, animal husbandry, farm practices, and tile drainage, which add large amounts of nitrogen, phosphorous, and sediment to water ways (Brisbois et al. 2008; Ford et al. 2018; Nguyen et al. 1998; Vadas et al. 2015; Withers and Hodgkinson 2009). Developed land cover also contributes significant levels of nutrients and sediments via deposition from vehicles, household and lawn operations, construction activities, discharges from pipes and gutters, and sewage effluents (Fork et al. 2018; Hobbie et al. 2017; Lee et al. 2016). When agricultural and developed land covers are combined in MLC catchments, the impact of nutrients in water ways becomes exacerbated. Specifically, the continuous nutrient contributions from agriculture overloads the stream retention capacity and episodic runoff events from developed land cover increase nutrient levels further and reduce the lag time between pollution source emissions and peak stream concentration (Chen et al. 2018). Additionally, the loading of streams with nutrients and contaminants in MLC watersheds can be spatiotemporally dynamic given the distribution patterns in anthropogenic land cover and stream flow dynamics, where variable source/sink and dilution processes can lead to unexpectedly patchy distributions of pollution (Hubbart et al. 2017; Shi et al. 2017). The threshold for such impacts from development added to mixed catchments is low, where coverages of less than 1% of catchment area can lead to detectable water quality changes (Mehaffey et al. 2005). Because MLC watersheds in Kentucky export 34% more total P and 53% more total N compared with undeveloped watersheds (Crain and Martin 2009), nutrient reduction in this region could be achieved by identifying sensitive regions in watersheds in order to prioritize development in locations where impacts are likely to be less severe (Martin-Mikle et al. 2015).

The overarching goal of this research was to examine the seasonal nutrient export of a small MLC upland catchment in central Kentucky with an emphasis on how the interaction between discharge from developed agricultural land cover and groundwater leads to base flow export of dissolved constituents. The approach was to spatially and temporally sample a small-scale representative catchment that (1) contained mixed developed and agricultural land covers; (2) was not classified as a first-order stream or higher; and (3) directly discharged to a higher order stream. This approach was chosen because it was suspected that these regions were sensitive, critical source areas of nutrients due to the spatial coverage of this type of catchment, the occurrence of thin low permeability soils, and the well-developed channels that quickly export water past the riparian zone of larger order streams. The aim of quantifying this small-scale system was to better understand how nutrients may be managed in the region and to provide primary data to inform the selection of targeted management practices.

Methods

Site description

Sampling sites were located in a small, intermittent, MLC catchment (0.5 km2) that feeds into the Muddy Creek watershed in east-central Kentucky (Fig. 1). The Muddy Creek watershed is a 4th order basin (176 km2) that ultimately feeds into the Kentucky River, part of the Ohio and Mississippi drainage systems. This region lies in the eastern portion of the Kentucky Bluegrass, characterized by rolling hills, interbedded limestone and shale bedrock, and average yearly rainfall of 1100 mm and average yearly temperature of 13 °C (NRCS 2006). The hydrologic landscape region (HLR) for this area was classified as either low permeability soil with permeable bedrock or low permeability soil with low permeability bedrock (Wolock et al. 2004). In these HLRs, the low permeability of soil leads to higher amounts of runoff with a patchy distribution of springs where permeable bedrock occurs. As typical of streams draining agricultural and developed lands, the Muddy Creek watershed exhibits NO3 loading and elevated Escherichia coli levels (McAlister et al. 2010; LaSage et al. 2006; Kentucky River Watershed Watch 2010, 2011, 2012, 2013, 2014, 2015, 2016, 2017).

Fig. 1
figure 1

(a) Inset map of Kentucky with the Kentucky River watershed shaded black and the study site marked with a white point (coordinates of W1: 37.718202, − 84.151826). (b) The research watershed (black outline) showing sampling stations (black dots) at outfalls (O), tiles (T), springs (SPR), open water (L), stream channels (C), and tributaries (B). Muddy Creek is outlined in blue and flows towards the east in this map as indicated by the arrow. The channels are highlighted in blue. The white dot represents the watershed outlet (W1)

As predicted by the local HLR, the field site contains thin (< 1 m), low permeable clay silt to silty loam soils. Two major drainage channels flow from west to east and north to south in the catchment and meet up at the outlet facing southeast (Figs. 1 and 2a). The channels generally lack riparian zones and were steeply incised in some locations by 1–2 m. Land cover consisted of cattle pasture and cultivated crops with some developed regions (Fig. 2b). Developed areas mainly consisted of roads, a dairy complex, a swine barn, office facilities, farm vehicle facilities, and storage silos. The catchment’s underlying bedrock consisted of the Crab Orchard Formation, Boyle Dolomite, and New Albany Shale (Fig. 2c; Greene 1968; KGS 2020). An expanded regional geologic map shows the relationship of the site to the surrounding area (Supplemental Figure 1). These units were a mixture of shale, limestone, and dolostone. The majority of the watershed, including the upland areas, was immediately underlain by the New Albany Shale (Fig. 2c). The contact between this unit and the underlying Boyle Dolomite was a generally impermeable clay unit that often caused groundwater to discharge at the boundary between these two units, forming springs (Supplemental Figure 2).

Fig. 2
figure 2

(a) Digital elevation model of the research site with 5 m contours. (b) Land cover map for the study site with legend below. (c) Geologic map of the research site showing the extent of the Crab Orchard Shale (Scb), Boyle Dolomite (Db), and New Albany Shale (MDna). Map after Greene (1968); see Supplemental Figure 1 for geologic map of the lower Muddy Creek watershed

Sampling

The sampling sites for the MLC catchment in this study were chosen to account for different land uses and water types in order to observe their impact on water chemistry in resultant in-channel mixtures. These included major springs, outfalls, confluences, tile drains, and the watershed outlet (Figs. 1b and 2b). Springs represented groundwater sources into the catchment or were immediately outside the catchment (Fig. 1b). Springs outside the catchment were sampled to assess local variability in spring chemistry and to observe if inter-watershed connections could exist in the subsurface. Groundwater springs were identified by sight and confirmed with temperature measurements. Outfalls included point sources where pipe discharges fed into the main channel. These included a culvert draining from the developed region of the catchment into the main channel (O1), the outlet of a tile-drained agricultural field located underneath a road into the main channel (O2), and a smaller tile drain outlet in a pasture field (T1). Characteristic reaches included sampling sites following the main drainage channel with regular sampling intervals along the stream and confluences (Fig. 1b). A man-made pond inside the watershed (L1), isolated from the main channel system, was also sampled for comparison to channel waters.

The main channel of the watershed begins at O1, courses to O2, then onward to C1, C2, and C3 through C6 until arriving at W1 and emptying into Muddy Creek. The main channel is joined by several tributaries. B1 is the first tributary joining the main channel just upstream of C4. The second tributary at B2 connects with the main channel upstream of C5. The most easterly tributary joins the main channel upstream of C6. Sampling sites T1, SPR 2, and B3 are situated on the same tributary channel (Fig. 1).

Collection of water samples from the sites occurred monthly between May 2017 and September 2018, usually on fair days with little or no storm-water flow. In the field, sampling sites were analyzed for pH, temperature, and electrical conductivity (EC) with a YSI ProDSS probe, calibrated before every sampling day. Water samples were directly collected with 60-ml syringes and filtered through a 0.45 μm filter. Samples were then stored on ice in a cooler for no longer than 8 h and then transferred to a refrigerator until analysis, which occurred within 1 or 2 days for dissolved nutrients.

Chemical analyses

Dissolved major ions (chloride, Cl; sulfate, SO42−; sodium, Na+; potassium, K+; calcium, Ca2+; and magnesium, Mg2+) were measured via ion chromatography using a Metrohm 930 Compact IC Flex system (Metrohm 2020; EPA 2007). Dissolved nutrients were measured via UV-Vis spectrophotometry using a Thermo Scientific Evolution 201 UV-Visible Spectrophotometer. Nitrate (NO3) was measured using NitraVer5 powder packets (Hach 1997), based on the cadmium reduction method (Eaton et al. 2005b). Ammonium (NH4+) was measured via the phenol hypochlorite method (Eaton et al. 2005a as modified by Gieskes et al. 1991). Phosphate (PO43−) was measured via the ascorbic acid method (Eaton et al. 2005c as modified by Gieskes et al. 1991).

Multivariate analyses

Principal component analysis (PCA) and exploratory factor analysis (FA) were performed on the resulting water sample dataset to identify data patterns that describe how different water sources affect main channel water chemistry. PCA and FA are widely used tools in water quality studies to understand hydrological/geochemical processes in watersheds (Inamdar et al. 2012; Le et al. 2017; Lee et al. 2016; Love et al. 2004; Olsen et al. 2012; Pejman et al. 2009). These statistical methods reduce the dimensionality of datasets with many variables to produce principal components (PCs) and factors. Principal components are uncorrelated variables that capture the majority of dataset variance, whereas factors are latent variables that capture correlations among observed variables (Everitt and Hothorn 2011). Given an adequate set of variables, the PCs and factors should reflect natural processes behind chemical evolution of water originating from different hydrological compartments such as groundwater or tile drains. In order to reconcile PCA and FA results with the original data as an accuracy check, background spatiotemporal trends were analyzed independently prior statistical analysis to improve interpretation (Olsen et al. 2012; Le et al. 2017).

PCA was performed using the FactoMineR package in R on the correlation matrix of 190 individual samples from August 2017 to August 2018 using nine variables (Ca2+, Mg2+, Na+, K+, Cl, SO42−, NO3, NH4+, and PO43) (Lê et al. 2008). Concentration values below detection limits were assigned as 0.0 values in the dataset. Data from May 2018 was excluded from the PCA because anion data (Cl and SO42−) was not measured. PCA loading plots and bi-plot graphics were generated via the FactoExtra package in R (Kassambara and Mundt 2017).

FA was performed using the factanal() function of the core R program on the same raw matrix of 190 individual samples from August 2017 to August 2018 described above (Everitt and Hothorn 2011; R Core Team 2019). The number of optimal factors that best describes the data was determined using the nFactors package in R (Raiche and Magis 2020). Factors were considered significant if their eigenvalue was close to or greater than 1 (Love et al. 2004).

PHREEQC and Piper diagrams

Piper diagrams were constructed in order to characterize geochemical end members based on major ion composition that could be compared with statistical results (Appelo and Postma 2004). In order to construct Piper diagrams, alkalinity data were required. The bicarbonate ion concentration was not measured as part of the ion chromatography suite, because the method utilized a bicarbonate/carbonate eluent. Instead, bicarbonate/alkalinity was measured with a Hach digital titration method for select samples to determine normal ranges at the field site (Hach 2013). Because bicarbonate was determined to be the dominant anion by one to three orders of magnitude across the field site, the bicarbonate concentration was estimated for all samples using PHREEQC (Parkhurst and Appelo 2013). Each sample was simulated as a solution in PHREEQC using all observed parameters as inputs. Bicarbonate was added until the charge balance percent error of each solution fell below 1%. These solution outputs were taken as reasonable estimations given that bicarbonate was shown to be the dominant anion and that simulated values fell within observed ranges. An average concentration of major anions and cations was calculated for each site and plotted on Piper diagrams using the hydrogeo package in R (English 2017). Finally, a simple mixing model was conducted between identified end members using the mix function in PHREEQC in order to examine if channel water chemistry followed a simple mixing pathway.

Regional spatial analysis

In order to explore the possible impacts of small-scale processes on larger scale systems, spatial analysis was conducted to quantify the occurrence and land cover of watersheds similar to this study’s in the local 4th order basin of Muddy Creek. Similar catchments were identified in the watershed by delineating all catchments that met the following criteria: the valley was not classified as a first-order stream or higher, the valley contained a well-developed channel, and the outlet directly discharged to Muddy Creek without first passing through a first-order stream. Valleys with well-developed channels were identified along Muddy Creek using the Kentucky State Single Zone Digital Elevation Model (KGN 2018). First-order and higher stream valleys were then subtracted from these data using the National Hydrography Flowline data, which maps perennial streams (Strahler 1957; USGS 2002). The coverage of the remaining valleys was delineated by using the hydrology toolbox in ArcGIS, which uses DEM data to calculate watershed coverages from user-specified outlets (ESRI 2018). The total coverage of watersheds that met these criteria were then compared to the total watershed area. Additionally, the land cover of these catchments was quantified and compared to the land cover distribution of the whole Muddy Creek watershed. Land cover was quantified by extracting the Muddy Creek watershed area and similar catchment coverages from the 2011 National Land Cover Dataset (USGS 2014). The distribution of land cover between the 4th order basin and similar catchments as well as the distribution between all similar catchments were compared.

Results

Dissolved salts, nutrients, and physiochemical parameters

Major cations varied dependent on their source water (Fig. 3a–d). Sampling site O1 was a major base flow source of Ca2+, Mg2+, and K+. Sites C1–C6 and W1 lie along the main channel originating near site O1 and show a declining average concentration of these cations moving downstream. The outlet of the watershed (W1) had elevated levels of these cations compared with sites not connected to O1. Springs and the tile drain showed relatively lower concentrations of Ca2+ and Mg2+, although variation in Mg2+ tended to be greater. Tributary sites (B1, B2) and the man-made lake (L1) had the lowest concentrations of Ca2+ and Mg2+, again with more variation in Mg2+. Na+ tended to have low variation at all sites, but O1 still had the highest average concentration. There was no clear dilution pattern moving downstream for Na+ with only B2 showing relatively low concentrations. K+ had a spatial concentration pattern similar to Ca2+ and Mg2+; however, groundwater and tributary sites tended to have extremely low concentrations. Only the sites O1 and O2 appeared to behave as sources of K+ that were then detected downstream and exported at the outlet W1.

Fig. 3
figure 3

Box and whisker plots showing statistics of the twelve measured parameters at each sampling site. Arithmetic mean (gray X’s) and median (gray lines) values are shown within black boxes bounded by the 25th and 75th percentile values; lines extend to the 10th and 90th percentile values with outliers shown by crosses (+). Concentration units for solutes are in milligrams per liter (mg/L) with electrical conductivity (EC) in units of micro-Siemens per centimeter (μS/cm). Sites are listed as generally progressing down channel to the watershed outflow (W1)

The anions Cl and SO42− had nearly opposite spatial distribution with respect to each other, but both were elevated at the watershed outlet (W1). Concentrations of Cl were similar to cation distributions, with O1 as the major source, and downstream sites exhibited progressively lower Cl concentrations (Fig. 3e). Site O2 also sometimes had higher levels of Cl. Conversely, SO42− tended to increase downstream with the highest concentration observed at tributary sites (Fig. 3f). The main channel begins to show increasing amounts of SO42− after the confluence with B1. Site W1 had Cl levels that were elevated relative to tributary sites and SO42− levels that were elevated relative to O1, indicating that both sources were important in the chemistry of water exported from the catchment.

Dissolved PO43− and NH4+ had comparably similar spatial patterns to cations and chloride, whereas NO3 had higher concentration in groundwater and tile sources (Fig. 3g–i). O1 and O2 had the highest base flow PO43− and were diluted in downstream stations (C1–C6). One difference PO43− displayed from this pattern, however, was that the watershed outlet (W1) had relatively low levels. Even a large outlier of PO43− concentration observed at O1 and O2 did not elevate W1 levels much. PO43− was also elevated on one occasion in SPR 1. NH4+ concentrations were highest at O1 and elevated at W1 compared with tributary sites. The concentration of NH4+ was clearly elevated at O1 relative to other nutrients. There also was one occasion when NH4+ was high at SPR 1, showing the highest concentration in all spring water samples. Outside the study watershed, SPR 3 had the highest and most variable NO3 concentrations. Sites T1 and SPR 2 have the highest NO3 concentrations of sites connected to W1 and generally the outlet had elevated levels compared with the main channel and B1–B2.

General water physiochemical measurements (EC, pH, temperature) exhibited differences between outfalls, surface water, and groundwater (Fig. 3j–l). Typically, outfalls (O1 and O2) had high electrical conductivity, showing that they were the primary sources of dissolved ions (Fig. 3j). Dissolved ions decreased in-channel waters moving downstream. Springs, tributary sites, and T1 had lower pH values (closer to 7) than other water sources, while L1 had relatively high pH values > 8 (Fig. 3k). Temperature values indicated that groundwater had lower yearly variation in temperature than surface water (Fig. 3l).

Monthly observations demonstrated that the water characteristics of sampling sites at base flow have seasonal and other temporal patterns that impact the watershed outlet W1 (Fig. 4). The outfall O1 had a strong seasonal pattern with dissolved ions (EC, Ca2+, Mg2+, K+, Cl, and NH4+). These dissolved ions increased to a maximum that was several times larger than background concentrations around September and October 2017 and gradually decreased to lower levels in the spring and early summer of 2018. W1 was influenced by these high concentrations in the upper watershed, having higher concentrations during this same time period. Na+ was also generally highest at O1, but the overall trend during the sampling interval was a slight decrease. SO42− had a more variable pattern with the highest concentrations at sites influenced by groundwater contributions, especially during the latter part of 2017. The variable pattern in SO42− at these sites was reflected in higher relative concentrations at the watershed outlet W1. PO43− concentrations were generally lowest in August, and remained below approximately 2 mg/L at B3, W1, and within groundwater. PO43− concentration was typically elevated at O1 under base flow conditions but spiked to about 15 mg/L on one occasion in January 2017, likely due to a recent storm event. NO3 concentrations were highest in groundwater and at B3 and T1. Spring parameters did not show a seasonal pattern, yet were variable over the sampling period. Sites T1 and B3 exhibited seasonal NO3 increases in the winter and decreases in the summer. This seasonal pattern was also observed at W1 where NO3 concentrations peak in March and NH4+ peaks in the summer due to discharge from O1 (Fig. 5a). PO43− at W1 did not show a clear seasonal pattern when compared with nitrogen (Fig. 5b).

Fig. 4
figure 4

Time plots of twelve measured parameters at O1 (black squares), SPR 3 (white triangles), W1 (black circles), and B3 (white squares) from August 2017 to August 2018

Fig. 5
figure 5

(a) Column plots of N-NH4+, N-NO3, and (b) P-PO43− concentrations in milligrams per liter (mg/L) at site W1 during the sampling period, which indicated nutrient levels exported from the study site over time

Multivariate analyses

PCA identified three principal components (PCs) greater than the average PC percentage, capturing 82.9% of total dataset variation (Fig. 6a). PC1, PC2, and PC3 captured 57.4%, 14.8%, and 10.5% of dataset variation, respectively (Fig. 6a). Higher PC variable loadings indicate greater variance captured by the PC. Hence, resulting PC loading significance was evaluated by considering loadings from 1 > 0.75, 0.75–0.50, and 0.50–0.30 as “strong,” “moderate,” and “weak,” respectively, following the classification by Liu et al. (2003) for interpretation. PC1 had a strong and positive correlation with parameters Ca2+, Mg2+, K+, Cl, and NH4+, as well as a moderate correlation for parameters Na+ and PO43− (Fig. 6b). PC2 was strongly and positively correlated with SO42− (Fig. 6c). PC3 was strongly and positively correlated with NO3 (Fig. 6d).

Fig. 6
figure 6

(a) Scree plot of resulting principal components (PCs). The horizontal line shows the mean percent variance, which was used to select the first three principal components for interpretation. Variable loading plots for (b) PC1, (c) PC2, and (d) PC3. The x-axis of these plots shows the direction and strength of parameter correlation

A PC score plot for all sources of water indicated samples from O1 and the main channel typically ranged from large, positive PC1 scores to large, negative PC1 scores (Fig. 7a). The largest PC1 scores occurred at O1. Main channel samples formed a similarly shaped cluster to O1, but often exhibited lower PC1 scores. W1 had a smaller range of PC1 values compared with the main channel sites. Overall, these three groups exhibited the largest scores on September 2017. The lowest PC1 score for main channel and W1 samples overlapped with the tight, negative score ranges exhibited by contributing tributaries (sites B1, B2, B3), local springs (SPR 1, SPR 2, SPR 3), and the farm pond (L1).

Fig. 7
figure 7

(a) PC scores of PC1 vs PC2 for sampling stations. Stations were divided into the farm pond (L1), groundwater (SPR1, SPR2, SPR3, T1), main channel (O2, C1, C2, C3, C4, C5, C6), site O1, tributaries (B1, B2), and site W1. The range of PC scores for each group is traced in a colored polygon. (b) PC scores of PC1 vs PC3 for sampling stations. (c) Simplified polygon traces of O1, tributaries, and W1 for PC1 vs PC2 plot. Tributary B3 is plotted separately to compare the points to groundwater sites. (d) Simplified polygon traces of O1, tributaries, and W1 for PC1 vs PC3 plot. (e) W1 sample PC score labeled by sampling month for PC1 vs PC2 plot. (f) W1 sample PC score labeled by sampling month for PC1 vs PC3 plot

Groundwater exhibited a tight PC2 range with most samples having positive scores (Fig. 7a). Tributaries showed a more variable PC2 cluster, which surrounded the groundwater cluster. The farm pond had a tight, negative PC2 cluster. Tributaries, groundwater, and W1 have similar PC2 score ranges. Main channel PC2 scores were similar to W1 scores, but some samples exhibited negative PC2 scores below the range of the farm pond sample cluster. PC2 ranges at W1 fell between the tributary and main channel scores.

Tributaries formed a tight cluster of PC3 scores along the PC1 axis, separate from the groundwater cluster (Fig. 7b). The farm pond cluster had the tightest cluster of PC3 scores about the PC1 axis, within the tributary group. The groundwater cluster had the highest PC3 scores and partially overlapped with the W1 cluster. Tributaries and W1 had similar PC3 ranges, but proximal sites within the main channel and O1 samples exhibited greater PC3 score ranges. Groundwater, tributary, main channel, and W1 clusters primarily intersected along the negative PC1 axis.

Simplified PCA plots showed a gradational axis between tributaries, groundwater, and O1 for PC1 with W1 samples clustering between these groups (Fig. 7c–d). Tributary B3 scores are more similar to groundwater samples compared with sites B1–B2 (Fig. 7c–d). Along the PC1 axis, W1 typically had the lowest scores during the winter months, and the highest PC1 scores occurred during September 2017 (Fig. 7e–f). The lowest W1 PC1 scores plotted within the groundwater cluster from February–June 2018 (Fig. 7e–f). W1 samples plotted within the O1 sample cluster during September 2017 and for several months throughout the fall. W1 had a negative PC2 score during September 2017 (Fig. 7e). After this time, the remainder of the sampling campaign showed a generally counterclockwise movement of W1 PC1 and PC2 scores around the origin (Fig. 7e). For PC3, W1 had negative PC3 scores within the range of the tributary cluster most of the time, remaining relatively constant over the duration of the sampling year (Fig. 7f). W1 exhibits the highest PC3 scores in the spring, with the lowest score in January 2018.

Three factors were identified for FA each having an eigenvalue close to 1 or greater (Table 1). These three factors capture 68.5% of the total dataset variation with factors 1, 2, and 3 (F1, F2, F3) capturing 44.3%, 12.6%, and 11.6% of dataset variation, respectively (Table 1). F1 had a strong positive correlation with Ca2+, Mg2+, and K+, as well as a moderate correlation with Cl, NH4+, Na+, and PO43−. F2 had a moderate positive correlation with Cl, NH4+, and Na+, as well as a negative correlation with NO3. F3 had a moderate positive correlation with PO43− and a negative correlation with SO42− (Table 1).

Table 1 Details of the factor analysis conducted in R with the nFactor package. Factors with dashes indicate no significant correlation

F1 has a comparable pattern to PC1, showing the largest positive values for O1 and channel sites connected to O1 and smaller negative values for tributary and groundwater samples (Fig. 8a). Generally, O1 has the highest F1 scores and the largest range, while channel water was related the O1 distribution with lower scores (Fig. 8a). W1 had lower F1 scores and a smaller range compared with the main channel sites. Groundwater sites closely clustered around an F1 score of zero while tributaries (B1–B2) and L1 tended to have negative scores (Fig. 8a).

Fig. 8
figure 8

(a) Factor analysis (FA) scores of F1 vs F2 for sampling stations. Stations were divided into the farm pond (L1), groundwater (SPR1, SPR2, SPR3, T1), main channel (O2, C1, C2, C3, C4, C5, C6), site O1, tributaries (B1, B2), and site W1. The range of FA scores for each group is traced in a colored polygon. (b) FA scores of F1 vs F3 for sampling stations. (c) Simplified polygon traces of O1, tributaries, groundwater, and W1 for F1 vs F2 plot. Tributary B3 is plotted separately to compare the points to groundwater sites. (d) Simplified polygon traces of O1, tributaries, groundwater, and W1 for F1 vs F3 plot. (e) W1 sample FA score labeled by sampling month for F1 vs F2 plot. (f) W1 sample FA score labeled by sampling month for F1 vs F3 plot

Groundwater had consistently negative F2 scores, which created a separate cluster of data from the near zero F2 scores of tributaries B1 and B2 (Fig. 8a). The farm pond (L1) consistently had positive F2 scores that form a tight cluster separate from most other groups when plotted against F1 (Fig. 8a). Main channel water and O1 have the highest F2 ranges, with a few main channel points plotting higher than O1 (Fig. 8a). These points correspond to the December 2017 data, when site O2 may have been more influential due to higher flow. F2 scores for W1 cluster between the main channel, tributary, and groundwater data ranges (Fig. 8a).

Most of the F3 scores for tributaries B1–B2 plotted in the low positive range, while groundwater sites plotted in the low negative range (Fig. 8b). When tributaries and groundwater F3 scores are plotted against F1 scores they form mostly separate data clusters (Fig. 8b). O1 has the largest range of F3 scores with the highest score in September 2017 (Fig. 8b). When comparing F3 against F1 for site O1, the cluster of points overlaps the groundwater cluster during the late summer and early fall of 2018 (Fig. 8b). The main channel and W1 sites form data clusters within the O1, tributary, and groundwater data clusters (Fig. 8b).

Simplified FA plots showed near separate clusters of data for tributaries, groundwater, and O1 (Fig. 8c–d). Site W1 plots between these data clusters with most overlap occurring with O1 and groundwater data clusters (Fig. 8c–d). Tributary B3 occurs downstream from the groundwater sources of T1 and SPR2 plots mostly within the groundwater cluster as opposed to the separate cluster shown for B1 and B2 (Fig. 8c–d). W1 samples tended to have positive F1 scores in the fall and winter and negative F1 scores in spring and summer (Fig. 8e). F2 and F3 scores for W1 erratically oscillate between positive and negative values over the 12-month sampling interval without an apparent seasonal pattern (Fig. 8e–f).

PHREEQC and Piper diagrams

Averaged major ion distribution on a Piper diagram indicated that all samples plot closely as Ca2+-Mg2+-HCO3 type water (Supplemental Figure 3). When the three major groups identified by PCA and FA are plotted, main channel waters plot between them in order of upstream to downstream (Fig. 9). The O1 water plots as a marginally lower Mg2+ and lower SO42− end member. Starting from O1, the channels C1–C3 plot in a straight sequential line towards the tributary end member, enriched in Mg2+, Na+, K+, and SO42− (Fig. 9). C4–C6 samples move in a sequential straight line towards the groundwater end member, depleted in Na+, K+, and SO42− relative to tributaries (Fig. 9). There was some variation in anion distribution among the groundwater end member, although the cations plotted similarly (Fig. 9). Typically, SPR 1 and SPR 3 were enriched in SO42− and Cl compared with SPR2, B3, and T1.

Fig. 9
figure 9

Piper diagram showing the major ion percentages calculated from average concentrations of all collected data for tributaries (B1, B2), the main channel sites (C1, C2, C3, C4, C5, C6), groundwater (SPR1, SPR2, SPR3, T1, B3), and site O1. (b) Inset magnifying the major cations in the red box. Note the channel site values move first towards the tributary values and then towards groundwater after the confluence with B3

A simple binary and ternary mixing model between these three end members indicated that channel water patterns could be reproduced with mixing alone (Fig. 10). The proportions of each end member in the mixing models are displayed in Supplemental Table 1. Binary mixing between O1 and tributary waters reasonably simulated the pattern observed for main channel sites C1–C3. A mixture with 40% O1 water and 60% tributary water reproduces the average ion distributions of sites C3–C4. From this point, a ternary mixture of increasing groundwater end member contributions moved average sample data in the same direction as observed in Fig. 9 (Fig. 10). A mixture of about 20% O1, 50% tributary, and 30% groundwater approximates the average ion distribution of water at site C6.

Fig. 10
figure 10

Piper diagrams showing the major ion percentages of simulated mixtures between the three end members found in the catchment. The black diamonds simulate binary mixing between tributaries B1–B2 with water from O1. Green triangles simulate ternary mixing with the additional groundwater end member. (b) Inset magnifying the major cations in the red box. Note that ternary mixing can approximate the average channel concentration pattern found in Fig. 9

Spatial analysis

Spatial analysis indicated that approximately 10% of the 4th order Muddy Creek watershed was covered by catchments similar to the study site. These catchments vary in size between 0.5 and 0.02 km2 and were approximately evenly spaced across the entire basin (Supplemental Figure 4). The catchment in this study represents one of the larger catchments (0.5 km2) as the median watershed size was 0.05 km2 for the 189 catchments identified.

Comparisons of land cover between the 4th order Muddy Creek watershed and the 189 identified catchments imply potentially large impacts from agriculture and development. Both the 4th order basin and identified catchments were roughly similar overall with respect to land cover (Supplemental Figure 5). The most prevalent land cover categories were pasture, forest, and open development. Identified catchments were more likely covered by cultivated crops and pasture, and less likely to have forested or developed land cover. Identified catchments were also likely to have some kind of development although most fell below 10% total land coverage (Fig. 11a). In contrast, agricultural land cover (pasture or cultivation) in identified catchments tended to be > 50% of land coverage with most catchments exhibiting some amount of agricultural land cover (Fig. 11b). Also, most catchments had some forested coverage, which tended to fall below 50% total coverage (Fig. 11c). Almost all of the 189 catchments contained some mixture of agricultural and forested coverage, while ~ 50% of all catchments were classified as undeveloped (Fig. 11d).

Fig. 11
figure 11

(a) Distribution of developed land cover for identified catchments in the Muddy Creek watershed with similar characteristics to the study site. (b) Distribution of agricultural land cover for identified catchments. (c) Distribution of forested land cover for identified catchments. (d) Comparison of the presence or absence of land cover types in identified catchments

Discussion

Identification of water sources

PCA, FA, and Piper diagrams independently suggest three water sources control stream channel chemistry. PC1 and F1 both identified water sourced from the more developed portions of the study area, associated with site O1 (Figs. 6, 7, and 8, Table 1). O1 consistently had higher concentrations of salts, NH4+, and PO43− skewing PC1 and F1 loadings to larger positive values (Figs. 7 and 8). C1–C6 samples exhibit successively diminishing PC1 and F1 values, reflecting dilution of O1 by downstream addition of groundwater and surface water. PC2 and F3 both identified groundwater as a SO42−-rich source associated with springs, tributaries, and tile drains (Figs. 7 and 8). Data clusters range from unaltered groundwater (high SO42−) to modified surface waters based on the PC1 vs PC2 plot (Fig. 7a). Tributary water overlapped the groundwater cluster, reflecting the influence of groundwater at B1 and B2 (Fig.7a). F3 also indicated an inverse relationship between SO42− and PO43− (Table 1). This separated groundwater from O1 and showed the gradual decline in PO43− and increase in SO42− in the channel waters as these two sources interact. PC3 and F2 both identified tributaries B1–B2 as distinct from other groundwater, primarily due to lower NO3 concentrations (Figs. 7 and 8). Groundwater and tributaries form distinct positive and negative clusters in PC3 (Fig. 7b). Additionally, F2 shows an inverse relationship between K+-NH4+-Cl-rich water and NO3-rich water (Table 1). This inverse relationship differentiated B1–B2 from other groundwater due to the lower Cl and NO3 in tributary water (Fig. 8a). Moreover, Piper diagrams support PCA and FA interpretation that three source water end members control main channel water chemistry. Channel samples plot between B1–B2, groundwater, and O1 clusters, suggesting sequential ternary mixing between the three end members, which was supported by mixing models (Figs. 9 and 10). Samples from O2 and L1 were considered to have little influence on the main channel because they plot far from the channel water and mixing models did not require their input to reproduce the average patterns. O2 may have had little influence due to much lower flow volumes relative to O1. L1 was a clay lined farm pond that may have had low connectivity with the catchment.

Base flow nutrient export

SPR1, SPR2, SPR3, T1, and B3 were considered to be representative of a groundwater end member associated with bedding plane flow. These five sites had similar Ca2+, Mg2+, Cl, SO42−, EC, and pH, as well as relatively stable temperature ranges (Figs. 3 and 4) and tight data clusters on PC and FA plots (Figs. 7 and 8). These also plot closely together in the high Mg2+-Ca2+-HCO3 corner of the Piper diagram as expected for water in contact with dolomite and limestone (Fig. 9). The three springs all occur in association with the Boyle Dolomite (Supplemental Figure 1). Bedding plane flow likely controls these sites because an impermeable blue-gray clay layer outcrops in association with this unit (Supplemental Figure 2). Water moving through the area collects on top of this impermeable boundary and flows along its bedding plane until it either intersects a stream valley or emerges as a spring. We infer the controls on where a spring emerges may be related to preferential flow due to minor fractures and heterogeneity of the rock units. Support for this idea included the prevalence of thinly bedded fractured sedimentary rocks in the region and the noted anion variation between springs. Groundwater of the study site was high in nitrate concentration as identified in PC3 and F2 (Fig. 6d, Table 1). High NO3 values reported in regional groundwater have been attributed to agricultural fertilizer applications (Carey et al. 1994). Local springs in the study site are likely loaded with NO3 from proximal agricultural infiltration collecting above the clay layer (Figs. 3h and 4h).

Tributaries B1 and B2 exhibit similar chemical compositions to groundwater, although with slightly depressed Mg2+, Cl, NO3, and EC values and higher proportions of Na+, K+, and SO42− (Fig. 9). For PC1 and PC2, the similarities in overall ion concentration and SO42− cause the tributaries to overlap with the groundwater cluster (Fig. 7a, c). For PC3, B1 and B2 plot as a nearby, but separate cluster from groundwater, likely due to a difference in NO3 concentration (Fig. 7b, d). FA analysis also distinguished tributaries as similar, but produced separate clusters for both F2 and F3 due to higher relative correlations for ions such as Cl compared with PC2 and PC3 (Fig. 8). This suggested that these tributaries were likely sourced from and/or strongly influenced by local groundwater, which was consistent with field observations that many tributaries flowing into Muddy Creek arose from one or more springs. Because B1 and B2 are topographically higher in the research catchment, they begin in the New Albany Shale and do not cut as deeply into the Boyle Dolomite (Fig. 2, Supplemental Figure 1). Hence, B1 and B2 represent a different source water end member than springs associated with the blue-gray clay layer. The overlying New Albany Shale has a high sulfide mineral content, particularly pyrite (Nuttall 2013). Furthermore, weathering of this unit has been found to generate high SO42− concentrations and trigger groundwater acidification in local springs due to dissolution of sulfides (Anderson 2008). Therefore, high SO42− concentrations and lower pH of tributary samples were likely due to interaction with the New Albany Shale. B1–B2 water and groundwater likely represent a continuum between waters influenced only by the New Albany shale and waters with increasing influence from the Boyle Dolomite. This continuum probably causes lower pH at T1 compared with groundwater and generates spatial and temporal variations in spring chemistries.

Sampling site O1 was the source of the largest dissolved salt concentrations, N as NH4+, and P as PO43− in the watershed channel. The O1 site converges downstream with O2 and overwhelms site C1 chemistry with the O1 chemistry signature, which persists at downstream sites (Figs. 3 and 4). Moreover, this O1 signature was attenuated downstream between C1 and C6, suggesting down-channel dilution of salts by groundwater-influenced surface water. This was supported by progressive down-channel SO42− and NO3 enrichment and buffering of pH to higher values in C1–C6. As a result, water at W1 can be considered, in its simplest form, a mixture between groundwater derived from the New Albany Shale (B1–B2), groundwater derived from the Boyle Dolomite (springs), and O1 end members (Figs. 9 and 10).

Declining salt and nutrient concentrations at O1 after October 2017 (Fig. 4) can be attributed to gradual flushing of the agricultural tiles and pipeline complex (O1) that drains the developed region of the catchment. As the O1 end member contribution declines in the channel during this transition, W1 samples begin to move towards the groundwater end member along rough mixing lines observed on PCA, FA, and Piper diagrams (Figs. 7, 8, and 9). Furthermore, NO3 concentrations at W1 surpass those of O1 during January–March 2018, suggesting the primary source of nutrient influx during periods with less O1 influence was NO3-laden groundwater. Groundwater was likely receiving nitrogen from agricultural waste and fertilizer infiltration. Drainage from O1 may reflect farm operations during the growing season, discharging NH4+, PO43−, and major ions, which dominate W1 nutrient output in summer. Dilution of O1 in the summer by groundwater was likely hindered by evapotranspiration and nitrate uptake (O’Driscoll and DeWalle 2010). In winter, when growing season operations and nitrogen uptake by plants cease, O1 discharge was no longer enriched with ions, so groundwater dominated the export of NO3 (Fig. 6). In summary, drainage from O1 exports PO43−, NH4+, and other ions to W1 with less dilution from groundwater in the summer, and groundwater adds larger amounts of NO3 and SO42− to dominate chemical export to W1 in winter.

The high-resolution, spatial sampling occurring monthly in this study illustrated that the developed land cover region of the farm draining through O1 seemed to control solute exports into Muddy Creek during the summer. While there was evidence of dilution from groundwater, the export of NH4+, Cl, and other ions remained elevated and additional NO3 was contributed to waters sampled at the outlet, W1. Aside from dilution, there was evidence that PO43− retention occurred due to concentrations rapidly diminishing from O1–C3 without significant dilution from tributaries. During base flow, the catchment likely retained PO43− while exporting nitrogen as NO3 and NH4+. Despite tilled fields and pasture along the channels, low concentrations of nutrients at site O2 suggested little direct transport of nutrients from these potential sources at base flow. These results are significant because they show that these small systems are sensitive to development, and nitrogen discharge from developed areas can directly enter higher order streams at elevated concentrations, especially in the case of NH4+. The overall impact of this individual, small catchment on Muddy Creek water quality was likely small, but the cumulative effect of similar, multiple watersheds may combine to significantly degrade water quality in higher order streams, perhaps causing problems such as chloride toxicity and eutrophication (Causse et al. 2015; Hubbart et al. 2017).

Broader impacts

Spatial and land cover analysis imply that catchments similar to that of the study site might be significant contributors of nitrogen species to larger streams and downstream river systems. Although this study catchment was larger than most of the identified catchments in the Muddy Creek watershed, processes within these catchments are likely similar, promoting entry of NO3 and NH4+ into higher order streams. These small catchments are individually easy to dismiss because a single one accounts for less than 1% of the water volume in the main channel at base flow. However, when one considers that these catchments cover ~ 10% of this particular basin, the combined impact of their outflows becomes more significant. Moreover, these catchments are sensitive to development and agriculture, and show limited capacity to buffer water quality. In this case, the developed region of this small watershed produced the dominant geochemical signal and overall water quality was compromised, especially for dissolved nitrogen species. This shows that even a small amount of development in these types of catchments will likely have a larger than anticipated impact on water quality. Also, considering that ~ 50% of all of the 189 sub-first-order watersheds have some amount of development (Fig. 11) suggests that this may be a common process impacting water quality in the region. Because particular types or specific uses of developed areas were not distinguished, results also suggest that further research into the impact of developing sub-first-order catchments is required. Additionally, almost all of the catchments show a large amount of agricultural use, likely contributing to the degradation of water quality. The impact of agriculture observed in this research was the addition of NO3 to groundwater, which was released by springs under base flow conditions. These findings show that it may be possible to improve base flow water quality by controlling development in these types of catchments, while agricultural impacts may require regional scale management of groundwater resources.

Limitations

While this work shows the impact of mixing between groundwater and outfall discharge on base flow exports of nutrients in a small MLC stream, there are numerous factors to consider in continuing research. First, the monthly sampling did not include storm flows and therefore neglects well-established export processes of PO43−, other dissolved constituents, and sediment. Storm flows generally export larger amounts of nutrients within significantly higher flow volumes. Second, further research should ideally include more conclusive chemical fingerprinting of water sources, as presence of multiple sources contributing various dissolved constituents over varied seasonal conditions makes it difficult to unambiguously pinpoint specific sources. Third, this study focused on fine-scale sampling of one small catchment, so it is important to compare these results to other such catchments to fully assess the conclusions.

Conclusion

The headwater, mixed land cover catchment in this study exhibited complex, fine-scale groundwater and surface water interactions that produced seasonal changes in nutrient and salt exports. Even small amounts of development controlled base flow nutrient levels and export of other dissolved constituents from the catchment outlet. Local groundwater dilution in the main watershed channel did not completely dilute nutrients sourced from developed regions. During the winter, NO3 was the dominant nutrient exported by watershed base flow and was primarily sourced from groundwater. During the growing season, NH4+ export from an outfall that drains developed land cover overshadowed NO3 export from local groundwater. These conclusions highlight that small amounts of development can have a large impact. Spatial analysis showed that catchments similar to the study site comprised 10% of the total Muddy Creek watershed, containing various levels of development that could have a significant impact on water quality. Future nutrient management strategies in this region could reduce base flow nitrogen loading of higher order streams by limiting development in small headwater catchments and regionally managing agricultural nitrogen applications that seep into groundwater.