Introduction

The Gulf killifish, Fundulus grandis, is a vital component of saltmarsh ecosystems along the US Gulf of Mexico coast. This fish is one of the most abundant nekton species found in saltmarshes and provides an important prey resource for many piscivores, including commercially valuable fishes (Rozas and Reed 1993; Rozas and Zimmerman 2000). F. grandis is thought to be very similar to its Atlantic coast congener Fundulus heteroclitus (Mummuichog) sharing a recent ancestor in the family Fundulidae (Whitehead 2010) with some evidence of hybridization when ranges overlap (Gonzalez et al. 2009). These two species also exhibit similar spawning (Kneib and Stiven 1978; Greeley and MacGregor 1983) and feeding behaviors (Kneib and Stiven 1978; Rozas and LaSalle 1990) and both are saltmarsh residents that exhibit strong site fidelity (Lotrich 1975; Teo and Able 2003; Able et al. 2006; Nelson et al. 2014). This marsh residency and strong site fidelity is the reason why F. heteroclitus (Courtenay et al. 2002; Finley et al. 2009; Dibble and Meyerson 2012, 2014) and recently F. grandis (Whitehead et al. 2012) have been used as indicator species to assess environmental impacts and changes in saltmarshes.

Otolith chemistry has been used for the assessment of past environmental conditions experienced by a fish (Lowe et al. 2009, 2011; Farmer et al. 2013), stock discrimination (Campana et al. 2000; Sohn et al. 2005; Niklitschek et al. 2010), retrospective tracking of fish movements (Walther et al. 2011; Albuquerque et al. 2012; Farmer et al. 2013), and identification of nursery habitat (Thorrold et al. 1998; Gillanders and Kingsford 2003; Vasconcelos et al. 2007; Tanner et al. 2011). This is because otoliths are metabolically inert and therefore not reworked throughout life as bone can be, and they provide records of fish growth and incorporate trace chemical compositions from the environment (Campana and Neilson 1985; Campana 1999; Campana and Thorrold 2001; Sturrock et al. 2012). These trace chemicals are incorporated into the otolith through substitution with Ca in the CaCO3 aragonite matrix (Doubleday et al. 2014) and come from surrounding water (Farrell and Campana 1996; Walther and Thorrold 2006) and/or diet items (Kennedy et al. 2000; Buckel et al. 2004).

The utility of otolith microchemistry as a tool depends strongly on environmental conditions and, specifically, chemical gradients, in which a fish lives. Significant differences in otolith microchemical signatures have been found in fishes collected from different areas within estuarine (Vasconcelos et al. 2007; Bradbury et al. 2008; Tanner et al. 2011), marine (Ashford and Jones 2007; Longmore et al. 2010; Ferguson et al. 2011), and freshwater systems (Brazner et al. 2004; Zeigler and Whitledge 2010; Wolff et al. 2012). Otolith microchemistry is particularly effective in estuarine systems, due to varying amounts of chemical inputs among watersheds (Oros et al. 2007; Reay 2009; Statham 2012; Wang et al. 2012), variable salinity, and variation in freshwater input (Rooker et al. 2004; Lowe et al. 2011). Because otolith microchemical signatures can provide a record of environmental conditions experienced throughout the life of a fish, otolith microchemistry can be a useful tool in environmental impact studies. Elements (Na, Mg, Cr, and Sr) found in crude oil from the 2002 Prestige oil spill in northwestern Spain, were incorporated in otoliths of benthic juvenile Turbot (Scophthalmus maximus) in the laboratory (Morales-Nin et al. 2007).

The Deepwater Horizon oil spill (referred to hereafter as DHOS) in the Gulf of Mexico was the largest oil spill in U.S. history (Abbriano et al. 2011) and possibly the largest offshore spill to have ever occurred (Camilli et al. 2010). It released an estimated 4 million barrels of Louisiana crude oil into the northern Gulf of Mexico (NGOM) during summer 2010 (Camilli et al. 2010). Initial studies on environmental impacts of the DHOS spill suggested that nearshore (Fodrie and Heck 2011; Moody et al. 2013) and offshore fishes (Rooker et al. 2013; Szedlmayer and Mudrak 2014) have been relatively resilient to adverse effects of oiling in the NGOM. However, detection of small genetic differences in F. grandis from Barataria Bay, LA were presumed to be an effect of the DHOS oil (Whitehead et al. 2012). An independent marker such as elements associated with oil being identified in fish otoliths would allow potential impacts, such as the timing or perhaps the magnitude of exposure, to be more clearly assessed. Analysis of DHOS oil using inductively coupled plasma mass spectrometry (ICP-MS) found Al, V, Cr, Co, Ni, Cu, Zn, Se, and Sn in the oil and a lack of heavier metals suggesting that it was a lighter oil (Grosser et al. 2012). Additionally, Se, Ni, and V have been found in other crude oils and refinery effluents (Miekeley et al. 2005; De Almeida et al. 2009; Ellis et al. 2011).

The objectives of this study were to determine whether (1) the elemental composition of F. grandis otoliths across the broader northern Gulf of Mexico could be used to discriminate the source of fish on both small and large spatial scales and (2) oil exposure could be detected using the chemical compositions of F. grandis otoliths, which was accomplished by comparing the otoliths of fish collected from paired oiled and non-oiled marshes.

Methods

Field Collection

Collection of F. grandis occurred at a total of 10 saltmarshes across the northern Gulf of Mexico (Fig. 1). The amount of DHOS oiling at sites was assessed using the NOAA Environmental Response Management Application (ERMA) (gomex.erma.noaa.gov) and oiled sites were paired with non-oiled ones. Louisiana sites were located in Barataria Bay near Grand Isle, LA, and included one that was heavily oiled from the DHOS oil spill (Barataria A, BA) and one that did not receive DHOS oil (Barataria B, BB). Sites in Mississippi were both near Pascagoula. The refinery site (RF) was along a dredge ditch adjacent to an oil refinery (assumed to have been exposed regularly to non-DHOS oil and refinery effluents) and the Crooked Creek site (CK) was a control site in the Grand Bay National Wildlife Refuge (and not regularly exposed to oil). Three sites were located in Alabama, west of Mobile Bay: Fowl River (FR), Fowl B (FB), and Fowl A (FA), and FA received light oiling during the DHOS spill. Two more Alabama sites, Wolf Bay (WF) and Perdido (PD), were located east of Mobile Bay, near Orange Beach, AL. The Perdido site also received light oiling during the DHOS spill. The site in Florida was in Hogtown Bayou (HT), which is part of the larger Choctawhatchee Bay and was not oiled.

Fig. 1
figure 1

Sample sites in the northern Gulf of Mexico. Boxes on the map at the top of the figure correspond to sample areas that are indicated by numbers below the boxes. Enlarged panels of each sample area are shown and denoted with the corresponding number and name of the area in the bottom right of each panel. Within each panel, sites are labeled and denoted with an oiled or non-oiled symbol

Fish were collected in fall 2012, winter 2013, spring 2013, and summer 2013 from all sites using minnow traps baited with cut menhaden (Brevoortia sp.). At each site, eight traps were placed in saltmarsh habitat and left for a soak time of 1 to 1.5 h. All F. grandis in traps were euthanized, stored on ice, and returned to the lab. During each fish collection, water temperature and salinity were measured using a salinity-conductivity meter (Yellow Springs Instruments model 30). Water samples for chemical analysis were collected during the winter, spring, and summer sampling events. Each water sample was filtered through a 0.45-μm filter, preserved with 125 μL nitric acid, stored in 125 ml acid washed polypropylene bottles, and refrigerated until processing.

Lab Processing

Fish were either processed immediately after returning from the field or frozen for later processing. Up to 20 F. grandis were selected from each site for otolith analysis. These 20 fish consisted of four fish from each of five different size ranges (50 to 59, 60 to 69, 70 to 79, 80 to 89, 90 to 100+ mm total length [TL]), if present. If all size ranges were not represented, then the numbers selected were chosen equally from represented size classes to reach a total of 20 individuals per location. Lengths and weights of the selected fish were recorded and their sagittal otoliths were removed. Otoliths were then rinsed in 30 % H2O2 to clean off any residual tissue, followed by triple distilled ultra-filtered water (DIUF) and stored dry in vials until preparation for microchemistry.

For each fish, one of the sagittal otoliths was prepared for microchemistry analysis. First, the otolith was re-washed in 30 % H2O2 and rinsed with DIUF. The otolith was then inspected under a dissecting scope to look for remaining tissue, and if present, it was removed with forceps and the otolith was rinsed again with DIUF. After cleaning, otoliths were polished until a smooth surface was achieved using progressively finer grit 3 M® lapping film (grit sizes 30 to 0.3 μm). After polishing, otoliths were re-rinsed with DIUF and mounted to petrographic slides using Weldbond® glue and stored until microchemical analysis.

Otolith Microchemistry

Otolith microchemical analyses were conducted at the University of Windsor, Great Lakes Institute for Environmental Research (GLIER), Windsor, Ontario, Canada. Trace metal compositions of otoliths from F. grandis were quantified using a laser ablation inductively coupled plasma mass spectrometry (LA-ICP-MS) system comprising a high energy, pulsed, ultrafast femtosecond laser (Integra-C® by Quantronix®, East Setauket, New York) and a Thermo-Electron® X Series-II® ICP-MS. Straight line laser ablations were conducted using a 20-μm diameter laser beam and extended from one edge to the other, across the otolith core. The average element concentration for this entire ablation track was defined as whole otolith analysis, given that this track encompasses the entire life of the fish twice and this term (whole otolith analysis) will be used throughout the rest of the paper. Ablation speed for all otoliths was set at 5 μm/s using a computer-controlled stage. Ablated material was carried to the ICP-MS using Ar gas. The laser energy was 27 μJ, measured after a 2.5-mm beam-constricting aperture, pulse frequency was 100 Hz, pulse width was 130 fs, and wavelength was 785 nm. The ICP-MS was operated with 1200 W RF power, a nebulizer gas flow rate of 1.01 L/min, coolant gas rate of 14 L/min, and a ICP sampling depth of 120 mm. The signal intensities in integrated counts/second (icps) for 33 isotopes were measured and were used to quantify the concentrations of 24 elements (Supplemental Table 1). The signal intensities of multiple isotopes for some elements (e.g., Ca, Ni, Zn, Se, Sr, Sn, Ba, and Pb) were measured for internal standardization and interference correction purposes. Given the speed of the ablation and the number of isotopes measured, concentrations were output roughly every 0.4 second of the ablation.

Two runs ablating synthetic borosilicate glass standard reference material (National Institute of Standards and Technology [NIST] 610) with known concentrations of the elements of interest were analyzed both before and after approximately every 16 otolith samples to provide external calibration standards, calculate the limits of detection (LOD) for each element, and to correct for instrumental and Ar carrier gas background levels (Longerich et al. 1996; Ludsin et al. 2006). For an element to be included in the data comparative analysis, it had to be detected above the LOD in at least 50 % of the samples. Estimated instrument precision and relative standard deviation (RSD %) of each element was obtained from a validation study (Shaheen et al. 2008), which measured performance of the LA-ICP-MS system used in our study. For otolith analyses, Ca was used as the internal calibration standard and was assigned a constant concentration of 400,432 ppm (the stoichiometric abundance of Ca in aragonite (CaCO3)), to correct for differences in mass of ablated material (Campana et al. 1994) and to convert raw isotope signal intensity measurements (icps) to concentration units (ppm) (Ludden et al. 1995; Longerich et al. 1996; Ludsin et al. 2006).

Water samples were analyzed using solution-mode ICP-MS, after a 100-fold dilution with an internal standard solution of 1 % HNO3 containing internal standards spikes of Be, In, and Tl at concentrations of 10, 1, and 2 ppb, respectively, to correct for instrumental mass bias and drift, as well as matrix effects on an individual sample basis (Lowe et al. 2009, 2011).

Statistical Analysis

Fish Length and Condition

A two-way ANOVA with site, collection season, and the site × season interaction was used to determine whether fish lengths differed across sites and seasons. This ANOVA was run using type-III sums of squares and was checked for violation of assumptions using Shapiro-Wilks and Levene’s tests.

Since standard weight equations have not been developed for F. grandis, relative condition was compared across sites, seasons, and lengths using a two-way ANOVA with type-III sums of squares. Relative condition is the ratio of an individual weight to the expected weight of the fish as determined by the length-weight regression for all fish in the study (Neumann et al. 2012). The main effects of site and season and the covariate length, as well as all two-way interactions, were included in the ANOVA. Differences between sites and seasons were compared using Tukey pairwise comparisons, obtained from the most parsimonious model, which was chosen by removing nonsignificant main effects (main effects were only removed if a significant interaction did not include the main effect) and interactions from the model until only significant ones remained. This ANOVA was also checked for violation of assumptions using Shapiro-Wilks and Levene’s tests.

Whole Otolith Analysis

To determine if elemental concentrations differed among sites and between paired oiled and non-oiled sites for whole otolith averages (average element concentration for the entire ablation), a two-way ANOVA with site, collection season, a covariate fish length (TL), and all possible two-way interactions were included in the first model. The most parsimonious model was chosen by removing nonsignificant (p ≥ 0.05) main effects and interactions and re-running the ANOVA until only significant variables remained. If an interaction was significant, but a main effect included in that interaction was not, then the main effect was retained in the final model. All tests were run using type-III sums of squares (Speed et al. 1978; Shaw and Mitchellolds 1993) and Tukey pairwise comparisons were performed when significant site or season effects were shown. Each final ANOVA was tested for violation of assumptions using the Shapiro-Wilks and Levene’s tests. Although the majority of ANOVAs failed these tests and log data transformations did not help, the ANOVA test is robust to violations in normality and variance and has greater power than the similar nonparametric Kruskal-Wallis or rank-sum approaches (Brownie and Boos 1994; Kahn and Rayner 2003); therefore, ANOVA was used.

Elements that showed significant differences between sites in the Tukey tests were used in a linear discriminant function analysis (LDFA) with leave one out cross validation, to determine if fish could be correctly classified to site of collection (prior probabilities set at 0.1 for each site given the ten sites) and then region of collection (prior probabilities set at 0.25 for each of the four regions) based on otolith chemical signatures. Before LDFA was performed, all chemical data were scaled using z-scores. Using the same scaled data from the LDFA, a hierarchical cluster analysis was performed based on a Euclidean distance matrix and Ward’s method of cluster classification (Ward 1963; Wishart 1969). This cluster analysis was performed for all sites to see how sites were grouped and to see if classification of fish to states or some other regional scheme was most appropriate.

Otolith Edge and Core Analysis

To compare elemental concentrations in the cores versus the edges of otoliths, the laser ablation time-resolved analyses were subdivided into core and edge components for a subsample of 200 otoliths. This subsample consisted of one fish from each size class (if present) for each site and season (n = 50/season, total n = 200). If all size classes were not represented in a collection, then the five fish were chosen from size classes that were present. The time-resolved ICP-MS signal for a standard distance of 113 μm was chosen to represent the edge and core signatures. This represents a distance equal to 10 % of the ablation when averaged across all of the smallest (50 to 59 mm TL) fish in the edge/core subsample. Therefore, each edge chemical signature was measured by averaging data obtained for all ICP-MS time slices (~57) over the 113 μm at the end of the ablation, and each core chemical signature was measured by averaging data obtained for all ICP-MS time slices (~57) over the 113 μm straddling the otolith core, where the center of the core was at 56.5 μm, such that an equal number of data points was used on either side of the core. Otolith cores were defined as the small, opaque section in the approximate center of the otoliths.

Split-plot ANOVA (SP-ANOVA) with individual fish as the whole plot and otolith region as the split plot was performed to see if otolith region (core and edge) chemical signatures differed within otoliths. The interactions of season, site, and length with otolith region were used in the SP-ANOVA to determine if differences between the core and edge were consistent across these effects. The most parsimonious model was chosen by removing nonsignificant (p ≥ 0.05) interactions or main effects, with the exception of otolith region and re-running the model until only significant variables remained. These split-plot ANOVAs were run using type-III sums of squares.

Differences in detectable element (>50 % LOD) concentrations among sites and seasons at both cores and edges of otoliths were analyzed with two-way ANOVAs, using type-III sums of squares. The main effects of site, season, and the site × season interaction were used in the first model, and if the interaction was significant (p ≤ 0.05), then no main effects could be removed and this was considered to be the best model. If the interaction was not significant, the remaining significant main effects were used in the final model. If either site or season was shown to be significant, then Tukey tests were performed to identify differences among sites and seasons. ANOVAs for this section were checked for violations of assumptions using Shapiro-Wilks and Levene’s tests, and once again, the majority of tests failed; however, we continued to use ANOVAs because of their robustness.

Salinity and Temperature

Differences in salinity and water temperature across sites and seasons were analyzed using a one-way ANOVA. If the site or season ANOVA was significant, a Tukey honest significant difference (HSD) test was used to find differences among sites and/or seasons. All ANOVAs were tested for assumptions using Shapiro-Wilks and Levene’s tests.

Water Elemental Concentrations

One-way ANOVAs were used to compare water sample elemental concentrations among sites and seasons. If either the site or season ANOVA was significant, a Tukey HSD test was used to identify differences among sites and seasons, and ANOVAs were checked for violation of assumptions. For an element to be included in the evaluation of water chemistry, it had to be above LOD in >50 % of otolith samples and >50 % of water samples. All elemental abundances in water samples were converted into molar ratios with calcium (Ca) prior to analysis and are reported in element (mmol) per Ca (mol) water.

Results

Field Collections

Gulf killifish were successfully captured from every sample site during each collection event. It was not always possible to collect four fish in each of the five size classes (≤59, 60 to 69, 70 to 79, 80 to 89, ≥90 mm TL) as per design. In total, 18 to 20 otoliths were obtained from each site on each sample date for otolith microchemistry (OMC) analysis, except for winter samples from BB and FR, which had 11 and 13 otoliths, respectively. This resulted in a final sample size of 773 otoliths (180 to 200 per season) for OMC analysis, which were used for whole otolith comparisons. Average fish length did not differ significantly among sites (ANOVA: F 9, 733 = 1.330, p = 0.217), but captured fish were significantly smaller in winter than any other season (ANOVA: F 3, 733 = 5.491, p < 0.001). Also, the site × season interaction was significant (ANOVA: F 27, 733 = 4.069, p < 0.001), due to differences among sites that were not consistent between seasons.

Relative Condition

Gulf killifish relative condition differed across sites (ANOVA: F 9, 729 = 9.108, p < 0.001) and seasons (ANOVA: F 3, 729 = 10.304, p < 0.001). Both the site × season (ANOVA: F 27, 729 = 6.377, p < 0.001) and season × length (ANOVA: F 3, 729 = 2.796, p = 0.040) interaction terms were significant, but length alone did not have a significant effect on relative condition (ANOVA: F 1, 729 = 0.331, p = 0.565, Fig. 2a–c). Paired oiled and non-oiled sites did not differ in relative condition, except that the oiled PD site had higher relative conditions than WF. Relative condition was highest at BB and BA, and RF did not differ from these sites (Fig. 2a). Also, relative condition was highest in summer (Fig. 2b).

Fig. 2
figure 2

F. grandis relative condition (mean [±1 SE]) across sites, with letters above bars corresponding to significant differences among sites based on the results of Tukey pairwise comparisons (a). Mean relative condition across sites by seasons, with letters beside seasons in the legend corresponding to significant differences among seasons based on the results of Tukey pairwise comparisons (b) and mean relative condition across fish length classes by seasons (c). Lines connecting points in bc help to visualize patterns and separate seasons and length classes; they do not depict relationships between sites or length classes

Otolith Differences Among Sites

Detectable Elements

Out of the 33 isotopes that were measured using the ICP-MS, 20 (Supplemental Table 1) met the threshold criterion (>50 % LOD) for inclusion in OMC comparisons. For elements where more than one isotope met the inclusion criterion (Zn, Sr, Sn, Ba, Pb), the isotopes were regressed against one another, and if regressions varied too far from the 1:1 slope line (R 2 < 0.90), then both isotopes were considered in OMC comparisons. Only isotopes of Sr (R 2 = 0.361) and Zn (R 2 = 0.878) differed from the 1:1 relationship. The number of samples above LOD for 67Zn was less than 66Zn, which resulted in observations of zero for 67Zn but not 66Zn when concentrations were low, causing deviation from the 1:1 line. Therefore, 66Zn was the only Zn isotope used in OMC comparisons. Both isotopes of Sr were used given the relatively low correlation between 88Sr and 86Sr. The possible cause(s) of this lack of correlation between Sr isotopes (e.g., LOD, systematic interference effects), however, is not readily apparent. This resulted in use of 13 isotopes of 12 elements in the OMC statistical analysis (Table 1).

Table 1 Isotopes of elements whose concentrations were above the limit of detection (LOD) in at least 50 % of otoliths and that were used in statistical analysis. The total number of samples that were above detection limit for each element is shown by the # > LOD row, and the percentage of samples this number equals is shown in the % > LOD row

Concentration Comparisons Among Sites

Entire otolith average concentrations of Mn, Fe, Rb, Sr, Ba, and U differed across sites. Whole otolith average concentrations of Mn differed across sites (ANOVA: F 9, 720 = 8.841, p < 0.001) and collection seasons (ANOVA: F 3, 720 = 7.995, p < 0.001), but length alone did not have a significant effect (ANOVA: F 1, 720, p = 0.060). All three interaction terms were significant for Mn, site × length (ANOVA: F 9, 720 = 5.223, p < 0.001), site × season (ANOVA: F 27, 720 = 3.11, p < 0.001), and length × season (ANOVA: F 3, 720 = 8.357, p < 0.001) (Fig. 3a–d). Paired oiled and non-oiled sites did not differ in Mn concentrations, except FA (light oiling) was higher than FB (non-oiled), but concentrations were similar to the non-oiled FR site located farther up the watershed. FR and FA also had the highest Mn concentrations of any site, while FB and BB composed the next highest group (Fig. 3a). Despite these statistical interactions, the patterns of Mn concentrations were consistent across sites (Fig. 3b–d). Core Mn concentrations differed across sites (ANOVA: F 9, 190 = 6.856, p < 0.001) and the concentration patterns were similar to those seen in whole otolith analysis, while no site differences were observed in otolith edges. Manganese core concentrations were also higher than otolith edge concentrations (SP-ANOVA: F 1, 175 = 156.53, p < 0.001) across all sites and seasons.

Fig. 3
figure 3

Mean (±1 SE) whole otolith Mn concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith Mn concentrations across sites by season, where letters beside season in the legend correspond to significant differences among seasons, based on Tukey pairwise comparisons (b). Mean whole otolith Mn concentrations across sites by fish length class (c) and across fish length classes by season (d). Lines connecting points in bd help to visualize patterns and separate seasons and lengths; they do not depict relationships between sites or length classes

Whole otolith concentrations of iron differed across sites (ANOVA: F 9, 729 = 21.812, p < 0.001) and seasons (ANOVA: F 3, 729 = 9.421, p < 0.001), but length (ANOVA: F 1, 729 = 3.708, p = 0.055) alone was not related. The interactions of site × season (ANOVA: F 27, 729 = 18.979, p < 0.001) and season × length (ANOVA: F 3, 729 = 4.140, p = 0.006) were both significant as well (Fig. 4a–c). The FB site (non-oiled) had significantly higher mean Fe concentrations than other sites, with FA having the second highest concentrations (Fig. 4a). Besides this, no other differences were observed between paired oiled and non-oiled sites. Differences among sites in both otolith cores (ANOVA: F 9, 160 = 5.532, p < 0.001) and edges (ANOVA: F 9, 160 = 4.750, p < 0.001), as well as seasons (core-ANOVA: F 3, 160 = 5.069, p = 0.002; edge-ANOVA: F 3, 160 = 3.438, p = 0.02), were similar to those seen in whole otolith comparisons and the interactions between site × season (core-ANOVA: F 27, 160 = 6.823, p < 0.001; edge-ANOVA: F 27, 160 = 6.070, p < 0.001) were similar as well.

Fig. 4
figure 4

Mean (±1 SE) whole otolith Fe concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith Fe concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons based on Tukey pairwise comparisons (b). Mean whole otolith Fe concentrations across fish length classes for each season (c). Lines connecting points in bc help to visualize patterns and separate seasons; they do not depict relationships between sites or length classes

The Florida site, Hogtown Bayou (HT), had significantly lower mean Rb concentrations for entire otoliths than FB and WF, while Rb concentrations were similar to one another at other sites (ANOVA: F 9, 733 = 21.812, p < 0.001) (Fig. 5). Also, no difference in Rb was observed between any paired oiled and non-oiled sites. Mean Rb whole otolith concentrations were less in the fall than the three other seasons (ANOVA: F 3, 733 = 9.421, p < 0.001), and the interaction between these two main effects (ANOVA: F 27, 733, p < 0.001) was significant (Fig. 5b). Rubidium did not exhibit significant differences among sites in either otolith core or edge comparisons.

Fig. 5
figure 5

Mean (±1 SE) whole otolith Rb concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith Rb concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons based on Tukey pairwise comparisons (b). Lines connecting points in b help to visualize patterns and separate seasons; they do not depict relationships between sites

Whole otolith 86Sr concentrations differed among sites (ANOVA: F 9, 720 = 3.425, p < 0.001) and seasons (ANOVA: F 3, 720 = 41.1627, p < 0.001) being highest in the fall (Fig. 6a, b). All interaction terms, site × season (ANOVA: F 27, 720 = 5.586, p < 0.001), site × length (ANOVA: F 9, 720 = 2.276, p = 0.016), and season × length (ANOVA: F 3, 720 = 20.524, p < 0.001), were significant, but the covariate length was not (ANOVA: F 1, 720 = 0.192, p = 0.661). BB had the lowest mean 86Sr out of all sites, followed by BA and FR, which did not differ. CK and PD had the highest mean 86Sr concentrations and also did not differ (Fig. 6a). Although present, significant interactions did not affect overall patterns across sites (Fig. 6b–d). Core and edge concentration differences among sites (core-ANOVA: F 9, 187 = 10.738, p < 0.001, edge-ANOVA: F 9, 160 = 7.623, p < 0.001) and seasons (core-ANOVA: F 3, 187 = 34.438, p < 0.001, edge-ANOVA: F 3, 160 = 8.162, p < 0.001) were similar to whole otolith comparisons, and the interaction of site × season (ANOVA: F 27, 160 = 2.635, p < 0.001) was only significant for otolith edge comparisons.

Fig. 6
figure 6

Mean (±1 SE) whole otolith 86Sr concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith 86Sr concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons based on Tukey pairwise comparisons (b). Mean whole otolith 86Sr concentrations across sites for each fish length class (c) and across fish length classes for each season (d). Lines connecting points in bd help to visualize patterns and separate seasons and lengths; they do not depict relationships between sites or length classes

Concentrations of 88Sr across whole otoliths also differed among sites (ANOVA: F 9, 720 = 2.774, p = 0.003) and seasons (ANOVA: F 3, 720 = 49.674, p < 0.001) (Table 4, Fig. 7a, b). Length (ANOVA: F 1, 720 = 0.400, p = 0.527) alone was not significant, but the interactions of site × length (ANOVA: F 9, 720 = 2.045, p = 0.032) and season × length (ANOVA: F 3, 720 = 20.515, p < 0.001) were. The interaction of site × season (ANOVA: F 27, 720 = 8.031, p < 0.001) was also significant. Similar to 86Sr, BB had the lowest mean 88Sr concentration followed by BA, which did not differ from FR. CK and PD had the highest 88Sr concentrations for all sites and they did not differ (Fig. 7a). Winter samples had a lower mean 88Sr concentration than other seasons, fall and spring did not differ, and summer had the highest concentration (Fig. 7b). General patterns of concentrations across sites were consistent (Fig. 7b, c). Differences in 88Sr concentrations across sites in both cores (ANOVA: F 9, 160 = 5.239, p < 0.001) and edges (ANOVA: F 9, 160 = 7.779, p < 0.001) of otoliths were significant and similar to whole otolith patterns. Seasonal differences were only significant in edge comparisons (ANOVA: F 3, 160 = 15.364, p < 0.001), but the season × site interaction was significant (core-ANOVA: F 27, 160 = 2.041, p = 0.004; edge-ANOVA: F 27, 160 = 3.477, p < 0.001) in both core and edge analysis. Although both 86Sr and 88Sr differed between paired oiled and non-oiled sites, differences were not consistently in the same direction; rather, Sr concentration patterns followed salinity trends (Figs. 6 and 7).

Fig. 7
figure 7

Mean (±1 SE) whole otolith 88Sr concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith 88Sr concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons based on Tukey pairwise comparisons (b). Mean whole otolith 88Sr concentrations across sites for each fish length class (c) and across fish length classes for each season (d). Lines connecting points in bd help to visualize patterns and separate seasons and lengths; they do not depict relationships between sites or length classes

Concentrations of barium across the whole otolith differed across sites (ANOVA: F 9, 729 = 227.113, p < 0.001) and seasons (ANOVA: F 3, 729 = 39.786, p < 0.001) and were negatively related to fish length (ANOVA: F 1, 729 = 7.595, p = 0.006) (Fig. 8a–c). The interactions of site × season (ANOVA: F 27, 729 = 12.417, p < 0.001) and season × length (ANOVA: F 3, 729 = 7.935, p < 0.001) were also both significant. Barium concentrations were highest at the Louisiana sites (BB, BA) and FR had the closest mean concentration to these sites (Fig. 8a). Also, barium concentrations generally decreased with fish length for each collection (Fig. 8c). Although paired oiled and non-oiled sites differed in Ba concentrations, oiled sites were not always higher than non-oiled ones. The patterns of Ba concentrations in otolith cores and edges were similar to those seen in whole otoliths and were also different across sites (core-ANOVA: F 9, 160 = 24.760, p < 0.001; edge-ANOVA: F 9, 160 = 66.413, p < 0.001) and seasons (core-ANOVA: F 3, 160 = 6.644, p < 0.001; edge-ANOVA: F 3, 160 = 31.187, p < 0.001), with a significant site × season interaction (core-ANOVA: F 27, 160 = 2.146, p = 0.002; edge-ANOVA: F 27, 160 = 4.808, p < 0.001). The PD site exhibited higher Ba in cores than edges, and the within otolith comparison predicted that otolith cores had higher concentrations than edges (SP-ANOVA: F 1, 175 = 17.65, p < 0.001). However, this was not always the case at all sites and seasons, leading to significant otolith region × site (SP-ANOVA: F 18, 175 = 49.660, p < 0.001) and region × season interactions (SP-ANOVA: F 6, 175 = 9.61, p < 0.001).

Fig. 8
figure 8

Mean (±1 SE) whole otolith Ba concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith Ba concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons (b). Mean whole otolith Ba concentrations across fish length classes for each season (c). Lines connecting points in bc help to visualize patterns and separate seasons and lengths; they do not depict relationships between sites or length classes

Uranium concentrations in otoliths were higher at BA than HT (ANOVA: F 9, 733 = 2.776, p = 0.003), but paired oiled and non-oiled sites did not differ (Fig. 9a). Fall and winter samples had higher concentrations than spring and summer samples (ANOVA: F 3, 733 = 7.206, p < 0.001) and the significant site × season interaction (ANOVA: F 27, 733 = 1.920, p = 0.004) was driven by different patterns across sites during the fall and winter seasons (Fig. 9b). Also, U was below detection limits in the cores and edges of otoliths, so it was not considered in those comparisons.

Fig. 9
figure 9

Mean (±1 SE) whole otolith U concentrations (ppm) across sites, where bars with different letters indicate significant differences among sites based on Tukey pairwise comparisons (a). Mean whole otolith U concentrations across sites by seasons, where letters beside season in the legend correspond to significant differences among seasons based on Tukey pairwise comparisons (b). Lines connecting points in b help to visualize patterns and separate seasons; they do not depict relationships between sites

Discrimination Among Sites and Regions

Concentrations of Mn, Fe, Rb, Sr, Ba, and U in otoliths were used in the LDFA because the concentrations of these elements differed significantly among sites. This LDFA analysis, where each site was used as a group in the function, assigned fish to the correct collection sites with a relatively low degree of accuracy. Among sites, FR had the highest correct classification percentage (86 %), followed by BB (68 %) and BA (64 %). Also, when misclassifications for BB and BA occurred, these two sites were classified as the other most of the time. Classification was poorest at CK (13 %) and HT (14 %), followed by PD, WF, FA, and FB (all around 40 %, Table 2). The elements that contributed most strongly to the discrimination functions were Ba, Sr, Fe, and Mn (decreasing importance, respectively, Table 3) and sites that were misclassified to one another had similar concentrations of these elements. The hierarchical cluster analysis also had similar results to the LDFA, with BB and BA forming a discrete cluster and FR, FA, and FB being grouped together (Fig. 10). Sites in Mississippi, on the east side of Alabama, and in Florida were clustered without geographical distinction.

Table 2 Results of the linear discriminant function analysis (LDFA) using each site as a group in the analysis (prior probability of being placed into each group = 10 %). Percentage of correct classification of F. grandis to site is shown by the italicized diagonal, and rows correspond to the actual collection site, while columns are the site assigned by the LDFA
Table 3 Linear discriminant variables generated by the LDFA (LD-1–LD-7) with all sites as groups, and the importance of each discriminant variable in classifying fish to sites, the proportion of trace column. The value for each element in a discriminant variable column corresponds to the effect each has on a discriminator. The larger the absolute value of each element, the greater the effect
Fig. 10
figure 10

Hierarchical clustering of individual sites, where the y-axis is distance between clusters based on Ward’s method of clustering

To relate broader regional patterns that might be related to major watershed differences, sites were grouped into four collection regions and another LDFA was performed. These four regions consisted of Louisiana sites (BB, BA), MS sites (RF, CK), Alabama sites on the west side of Mobile Bay (FB, FA, and FR), and Alabama sites on the east side of Mobile Bay combined with the Florida site (WF, PD, HT). Classification rates for these collection regions were better than for individual sites, especially in the LA sampling region (97 % classification success, Table 4). The MS and AL east + FL regions were misclassified to each other, resulting in the low classification percentages for these groups (46 and 58 %, respectively), and the AL west grouping (67 % successful) was misclassified to both MS and AL east + FL 15 % of the time. The main elements contributing to discrimination were similar as in the LDFA above with Ba having the most influence followed by Sr and Mn, respectively (Table 5).

Table 4 Results of the linear discriminant function analysis (LDFA) using regional site clusters as each group in the analysis (prior probability of being placed into each group = 25 %). Percentage of correct classification of F. grandis to region is shown by the italicized diagonal, and rows correspond to the actual collection region, while columns are the region assigned by the LDFA
Table 5 Linear discriminant variables generated by the LDFA (LD-1–LD-3) with regional sites as groups, and the importance of each discriminant variable in classifying fish to regions, the proportion of trace column. The value for each element in a discriminant variable column corresponds to the effect each has on a discriminator variable. The larger the absolute value for each element, the greater the effect

Salinity and Temperature

Each site experienced varying salinities and temperatures throughout the study. Salinity differed significantly across sites (ANOVA: F 9, 30 = 7.530, p < 0.001), but not seasons (ANOVA: F 3, 36 = 1.796, p = 0.165, Fig. 11), and temperature differed significantly across seasons (ANOVA: F 3, 36 = 42.810, p < 0.001), but not sites (ANOVA: F 9, 30 = 0.227, p < 0.001).

Fig. 11
figure 11

Mean (±1 SE) salinity (ppt) across sites, where letters above bars correspond to significant differences among sites based on Tukey pairwise comparisons

Water Chemistry

Only four of the elements that differed across sites in otolith analysis (Mn, Fe, Sr, and Ba) were also present above the analytical LOD at least 50 % of the time in water samples, and of these, all but Fe were detectable in 100 % of samples. Concentrations of manganese (Mn/Ca) in water samples differed across sites (ANOVA: F 9, 20 = 4.086, p = 0.004), but not seasons (ANOVA: F 2, 27 = 0.212, p = 0.810). The concentration of Mn was highest in FR, which is similar to otolith Mn patterns. However, unlike otolith concentrations, water samples from FA did not have higher Mn concentrations (Fig. 12a). Iron (Fe/Ca) concentrations in water samples did not differ across sites (ANOVA: F 9, 20 = 1.519, p = 0.208) or seasons (ANOVA: F 2, 27 = 1.071, p = 0.357). Strontium (Sr/Ca) concentrations in water samples did not vary across sites (ANOVA: F 9, 20 = 1.497, p = 0.216), but concentrations were higher in summer than in spring, and concentrations in winter did not differ from either spring or summer (ANOVA: F 2, 27 = 4.692, p = 0.018) (Fig. 12b). Barium (Ba/Ca) differed across sites (ANOVA: F 9, 20 = 10.32, p < 0.001), but not seasons (ANOVA: F 2, 27 = 0.3663, p = 0.697), and concentrations were highest in water samples from BB, which did not differ from FA. FA was also similar to BA, FB, and FR (Fig. 12c).

Fig. 12
figure 12

Water chemistry ratios of Mn/Ca (a) and Ba/Ca (c) across sites and Sr/Ca across seasons (b). Letters above bars indicate significant different among sites (a, c) and seasons (b) based on Tukey pairwise comparisons

Discussion

Oil Indicators

Other than carbon, hydrogen, and oxygen (the elements that make up the majority of hydrocarbons), several elements have been identified in crude and weathered DHOS oil (Mg, Al, V, Cr, Co, Ni, Cu, Zn, As, Se, Sn, and Pb) (Grosser et al. 2012; Liu et al. 2012), refinery effluents (Se) (Miekeley et al. 2005; De Almeida et al. 2009), and other crude oils (Ni and V) (Ellis et al. 2011). In addition, several elements (Na, Mg, and Cr) have been shown to be incorporated into fish otoliths when exposed to oil (Morales-Nin et al. 2007). In our study, many of these elements were either below detection limits (V, Cr, Ni, and Se) or did not vary among sites (Mg, Al, Cu, Zn, Sn, Pb). Given this and the absence of significant differences between oiled versus non-oiled sites in concentration patterns for elements that did vary across sites, we found no detectable indication of impact by metals as a result of oil exposure from the DHOS oil or refinery processes in F. grandis otoliths.

We expected that otoliths from F. grandis collected at the oiled site in Barataria Bay (BA) Louisiana would likely have a trace metal signature that reflected oil exposure, given that this site received heavy amounts of oiling (gomex.erma.noaa.gov/erma.html). Also, weathered DHOS oil in marshes has a higher concentration of metals than crude oil (Liu et al. 2012), and visible oil was still present in this area of marsh as late as October 2013 (gomex.erma.noaa.gov/erma.html, 25-Mar-14 SCAT Oiling Ground Observations). Our fish collections did not start until 2012, but given that F. grandis do not exhibit large-scale movements (Nelson et al. 2014) and assuming they exhibit similar age and growth patterns (50 to 90 mm TL = 1- to 4-year-old fish) to F. heteroclitus (Fritz and Garside 1975; Dibble and Meyerson 2012), then all fish from this site should have been exposed to some oil throughout their lives.

Otoliths from F. grandis collected at the refinery site (RF) were also expected to exhibit chemical signatures indicative of oil exposure, because this site is located in a borrow ditch beside an oil refinery, where oil exposure is likely to be occurring continuously. Not finding elements in otoliths that were associated with oil exposure in AL sites (FA, PD) was not surprising, given that the amount of oiling in these marshes was light and patchy. Also, a similar study conducted in close proximity to our killifish collection sites in Alabama did not find an effect of DHOS oil on trace element or stable isotope uptake into oyster shells (Carmichael et al. 2012).

In contrast to our results, Morales-Nin et al. (2007) were able to detect an oil signature in Turbot (S. maximus) otoliths, perhaps because in their experiment oil was added directly to food sources; therefore, the level of contamination in the lab experiment was likely much higher than the levels experienced by fish in a natural environment. The differences in incorporation could also be based on physiological differences between Gulf killifish and turbot. Even species within a family have been shown to incorporate elements into the otolith differently (Rooker et al. 2004).

Another possible explanation as to why fish that were exposed to oil did not exhibit elevated levels of oil-related metals in their otoliths is the physiological regulation of trace elements and the pathway of otolith incorporation. It has been well documented that elements similar to Ca readily assimilate into the otolith aragonite matrix (e.g., Sr and Ba) through a substitution with Ca. This is because they are not bound elsewhere in the body and can easily pass from the water into the blood and then the endolymph (Sturrock et al. 2012). Therefore, these elements and resulting element/Ca ratios in the otolith exhibit similar concentration patterns to the surrounding water element/Ca ratios (Walther and Limburg 2012). Other trace metals, including many associated with oil (e.g., Zn, Cu, Pb, Se), can be bound to blood proteins and readily excreted through the liver or other pathways (Campana 1999; Sturrock et al. 2012), resulting in incorporation concentrations which could be much lower than ambient water compositions.

Differences Among Sites

Although differences among sites in whole otolith elemental and isotopic composition that could be attributed to oil exposure were not seen, Mn, Fe, Rb, Sr, Ba, and U concentrations differed significantly among sites. Manganese, Sr, and Ba contributed most to the distinction among both sites and regions in the linear discriminant function analyses, and paired sites within estuaries expressed similar concentrations of these elements. Differences in Sr, Ba, and Mn have also been used in many other studies (Thorrold et al. 1998; Campana et al. 2000; Gillanders and Kingsford 2003; Vasconcelos et al. 2007; Tanner et al. 2011) to classify fish to collection areas.

The regional LDFA was more successful and also had classification accuracies similar to other studies (Thorrold et al. 1998; Vasconcelos et al. 2007; Tanner et al. 2011), but the reason for the low percentage of classification accuracy in both the MS and AL east + FL grouping is that these regional groups exhibited similar otolith Sr, Ba, and Mn concentrations. Perhaps the addition of δ13C and δ18O stable isotope analysis, as well as high precision heavy isotope analysis, to our study could have improved the correct classification to these regions, given that the addition of these stable isotopes was able to improve the classification percentages of Thorrold et al. (1998) and Tanner et al. (2011).

Manganese concentrations were highest at the Fowl River site, followed by Fowl A and Fowl B, which is the main reason for the Alabama west grouping being identified in the cluster analysis and the regional LDFA. We expected that patterns of Mn concentrations in water samples would be similar to otolith concentrations, and while this was the case at one site (FR), correlation was low at the other two west Alabama sites (FA and FB). One plausible explanation for these discrepancies between water samples and otoliths as well as the high Mn concentrations in otoliths is the redox chemistry of Mn. When water is hypoxic, more free dissolved Mn is released into the water column and fish exposed to hypoxic waters have exhibited increased otolith Mn concentrations (Limburg et al. 2014). These conditions are probable for FR because low O2 and hypoxic conditions were observed in bottom waters near this site as part of another study, during the fall in both 2012 (1.72–2.75 mg/L) and 2013 (3.08 mg/L). Perhaps short-lived hypoxic events are also occurring at the FA and FB sites, although they have not been observed, or increased dissolved Mn concentrations could periodically be flowing down river to these sites. Also, given that our water sampling only occurred during each fish collection trip, there is a good possibility that it did not capture increased Mn events at the FA and FB sites.

Otolith core concentrations of Mn were also higher than edges at every site. This higher concentration of Mn in the otolith core has been documented previously in other species (Morales-Nin et al. 2005; Ruttenberg et al. 2005; Melancon et al. 2008) and is the reason why whole otolith Mn concentrations differed across sites, even though no site differences were found in otolith edges. Mn may prove valuable as a natural tag for estuarine nursery habitat for waters on the west side of Alabama, given that the otolith core would be the part of the structure analyzed to determine nursery habitat use.

86Sr and 88Sr were the second and third best discriminators among sites, respectively, with concentration patterns across sites being similar for both isotopes, as well as being similar to salinity patterns. This was expected given that a positive correlation between otolith Sr and salinity has been found previously (Bath et al. 2000; Lowe et al. 2009; Albuquerque et al. 2012). In addition, these published relationships show that otolith Sr concentrations approach asymptotic levels at around 15 ppt, which explains why Sr concentrations in the otolith were similar between PD and CK, even though salinities were higher at PD.

Barium was the best discriminator among sites and regions in the LDFAs, because Ba concentrations at the Louisiana sites (BB, BA) were much higher than all other sites. The primary source of Ba in estuaries has been shown to be riverine suspended particulate matter (SPM) (Hanor and Chan 1977; Coffey et al. 1997) and the Mississippi River is a very large drainage containing high SPM loads, resulting in high concentrations of Ba (Hanor and Chan 1977; Coffey et al. 1997). This occurred even when salinities around 15 ppt were experienced in the fall at the BA site, demonstrating that Ba concentration in otoliths is not purely a function of salinity. Given that fish collected in Louisiana had much higher otolith concentrations of Ba than at other sites, this could prove to be a reliable tag for nursery area discrimination or stock discrimination for members of a species that use estuarine waters close to the Mississippi River.

Conclusions

Mn, Sr, and Ba were the best discriminators among sites and the use of these elements led to a reasonably successful regional classification, particularly for the Louisiana sites. Mn may be useful as an otolith chemical tag to identify nursery habitat use in estuarine waters on the west side of Alabama, especially proximal to Fowl River. Also, Ba may prove to be a reliable otolith chemical tag for estuarine waters near the Mississippi River, providing a useful tool for nursery habitat identification and stock discrimination.

Otolith chemical signatures between paired oiled and non-oiled sites did not differ in trace elements likely associated with oil exposure, and the relative condition of F. grandis also did not differ between paired sites. Given previous published evidence that fish otoliths incorporate elements specific to oil exposure in the lab (Morales-Nin et al. 2007), our results support a minimal effect of the DHOS spill on F. grandis, a trend that has also been observed for other nearshore fauna (Fodrie and Heck 2011; Carmichael et al. 2012; Moody et al. 2013) in the northern Gulf of Mexico.