INTRODUCTION

In recent decades, the relationship between environmental quality and geochemistry (the chemical composition of the Earth) has received considerable attention from researchers and experts. However, there are still numerous problems regarding the accuracy of such studies [6]. With the advancement of more precise study methods and the increase of accuracy, not only can the origin, transport, and other ancient characteristics of sediments and various rocks be determined, but also the state of environmental pollution in the study area can be evaluated with much higher precision [14]. Quaternary sediments are among the environments where various pollutants are abundant. These contaminants have both natural and anthropogenic origins, and the need to study them closely has always been recognized [20]. Man-made pollutants that play a major role in environmental pollution include agricultural pollutants, petroleum residues, urban and rural runoff, industrial pollutants, and other pollutants that result directly from human activities [8]. Heavy metals are of particular concern because of their diversity and widespread distribution, stability and nondegradability, potential toxicity or deficiency, ability to enter the food chain, and biological and physiological effects in general [20]. Soils and sediments contaminated with heavy metals possess the potential to contaminate other environmental components such as water sources, air, and agricultural products, making their assessment crucial and vital [7]. In recent years, numerous studies have been conducted on the chemistry of sediments and pollutants to determine their degree of pollution. Most of these studies have focused on identifying human- induced factors. What distinguishes this research from other studies is the consideration of a crucial point: This study considers that sediment composition is strongly influenced by the geochemical and mineralogical parameters of the source rocks, resulting in a wide variety of geochemical compositions Love and Zumberge [8].

In this research, which is considered a model study for the study of soil pollution in other parts of the world, the chemical composition of surface sediments is investigated. First, a general understanding of the region is established through geological investigations to identify sources and learn about the general geological, tectonic, and mineralogical structure of the area. Then, geochemical data will be used to study the origin of the sediments and the distribution of various elements in the study area. Finally, by studying the geochemistry of heavy metals and their bioavailability, a better assessment of their environmental impact can be made. The importance of geochemical studies in the study area is emphasised by the fact that the study area includes several densely populated villages and is considered part of the active agricultural and ecological zones of Iran, making it a potentially ideal model for similar studies around the world. In addition, this study outlines the methodology for a detailed geochemical study of sediments and explains how these studies correlate with statistical data for conducting similar studies.

GEOGRAPHICAL AND GEOLOGICAL LOCATION

The studied area is located in the south of Iran, in the Khuzestan province, and in the vicinity of the city of Gotvand. Overall, the geology of the study area in Khuzestan Province is characterized by a variety of landforms, including mountains, hills, plateaus, and plains, which have been formed over millions of years by tectonic forces, erosion, and depositional processes [4]. The geological formations of the Khuzestan area, which is part of the folded and thrust Zagros, were formed under different sedimentary and tectonic conditions; therefore, the differences in facies are important in terms of time and place [19]. Based on the factors of lithology, tectonics and age of the geological formations, the rock outcrops of the province can be divided into the following two tectonostratigraphic units [3]: A: marine sequences of Jurassic-Cretaceous age, which have facies of the young Tethys basin. The aforementioned series are generally exposed in the core of the anticlines and in a general northwest-southeast direction; B: Sequences associated with alpine orogeny, Cenozoic, and whose features represent deposits accumulated in a southward retreating sea. The sequence in question, which may be marine or non-marine, includes the following units [9]: first, marine series of Eocene age (Pabde Formation) and marine limestones of Oligocene and Miocene age (Asmari Formation), which form the reservoir rocks of the province’s petroleum reserves. Second, the Miocene clastic evaporite units known as the Fars Group were formed at the same time as the end of the Alpine orogeny and in retreating marine environments. The evaporite series at the base of this complex (Gachsaran Formation) are the oil deposits of Khuzestan Province. Thirdly, the detrital units after the orogeny are known as Bakhtiari Conglomerate, which represents a relatively strong erosion after the end of the Alpine orogeny. Finally, there are the Quaternary debris deposits, which are the result of weathering of various formations [2]. Apart from the Khuzestan plain, which is relatively quiet and has no seismic force, the seismic force is high in the mountainous regions of the province, especially around Behbahan, which is sometimes associated with damage and destruction [4]. Figure 1 shows the geologic location of the area studied the geologic map of the area studied along with the names and lithology of the various formations.

Fig. 1.
figure 1

Geographical and geological location of the study area along with the geographical location of the recorded samples (red circles).

RESEARCH METHODOLOGY

The research method and its implementation play a crucial role in the outcome of the data. In this study, which was conducted as a global model research, the following steps were performed sequentially:

—Office studies: these studies include all preliminary research conducted prior to fieldwork. The most important aspects of these studies include the collection and analysis of books, maps, and relevant articles, as well as the use of various aerial photographs of the study area (southwestern Iran‒Gotvand Plain). The duration of these studies was over a period of four months.

—Preliminary field studies: this phase of the study includes various field investigations. Extensive knowledge of the geologic features of the area, including sedimentologic, tectonic, and mineralogic features, was acquired during this phase. The importance of these aspects to the transport and deposition of the various sediments was evaluated. A map of ideal sampling sites was then developed. These studies took place over a period of three months.

—Sampling: in this phase of the study, we collected 38 sediment samples containing silt, clay, and fine sand particles from different regions of the study area (Table 1). Our sampling strategy aimed to cover the entire study area, from near-mountainous areas to lowlands. At each sampling location, one kilogram of soil was collected manually with a hand auger at a depth of 20 centimeters below the surface. This depth was chosen to avoid surface contamination. After collection, samples were carefully labeled using sample bags. These labeled samples were then sent to the Central Laboratory of the Geological Survey of Iran for geochemical analysis. The analysis was performed using inductively coupled plasma mass spectrometry (ICP-MS) and X-ray fluorescence (XRF).

Table 1. List of samples collected in the study area. These samples are labeled BK (for samples near the Bakhtiari Formation) and G (for samples near the Gachsaran Evaporative Formation)

—Laboratory studies: This section of the study focuses on analysing and testing the results of the geochemical investigations. Data were processed and statistically analysed using SPSS software. This included evaluating the distribution of data through indices and statistical tests, performing correlation analyses, and identifying the sources of elements through cluster and factor analyses. SPSS (Statistical Package for the Social Sciences) is a software used for statistical analysis [5]. These experiments were conducted in the Central Laboratory of the Geological Organisation of Iran and in the Central Laboratory of the Department of Physical Geography of the University of Coimbra, Portugal.

RESULTS AND DISCUSSION

Results of Geochemical Measurements

One of the most effective methods in geochemical studies is the comparison of elemental composition between different sections in terms of the abundance of trace elements and oxides present in the samples [12]. Essentially, the abundance of various elements in each environment reflects the conditions of formation and the nature of the constituent environment, indicating the source of induced pollution [10]. Results obtained from the study of trace elements in the study area, considered as a global model, indicate that the most abundant trace element in the study areas is titanium (Ti). Following this element, the most abundant elements, in order, are strontium (Sr), phosphorus (P), and manganese (Mn) (Table 2). The results obtained from the study of oxides in the study area reveal that the most abundant oxides in the study area are, in order, calcium oxide (CaO) and silicon dioxide (SiO2). These data are based on the average percentage of oxides analysed in 31 samples of silts and clays (based on the predominant lithology) collected in the study area. The results obtained from the study of oxides in the study area reveal that the most abundant oxides in the study area are, in order, calcium oxide (CaO) and silicon dioxide (SiO2) (Table 3 and Fig. 2). A comparison of oxide values in Fig. 3 demonstrates contrasting trends in SiO2 and CaO variations in the study area, which could potentially be attributed to the presence of rock fragments within these sediments. Their presence might lead to an increase in the percentage of CaO and a relative decrease in SiO2 [12]. It is suggested that if similar results are observed in other parts of the world, petrographic studies should be conducted to determine the percentage of rock fragments.

Table 2. Measurement results of different elements in the study area
Table 3. The results of measuring oxides by XRF device in terms of percentage
Fig. 2.
figure 2

The average percentage of oxides in the samples collected from the studied area.

Fig. 3.
figure 3

Comparison of the changes of CaO versus SiO2 in the studied area, which shows the reverse trend of these results.

Particle size analysis of sediment samples can play a significant role in analyzing various data, especially in determining the sources of different contaminants in soil samples [20]. In this study, collected samples were categorized based on their probable sources into two groups: samples related to alluvial fans in the northern part of the area, originating from volcanic deposits (Late Miocene Bakhtyari Formation), and samples associated with alluvial fans in the eastern part of the area, likely originating from evaporite deposits (Early Miocene Gachsaran Formation).

Table 4. Granulation analysis results of some alluvial sediments

The classification of different samples revealed that sedimentary and lithological samples in the northern part of the area are coarse-grained with weak rounding and poor sorting. Gradually, as distance increases from mountainous regions towards the plain, these samples transform from coarse to medium-sized particles and eventually to fine particles. The higher abundance of coarse-grained particles among finer-grained particles in the northern area indicates two important points. Firstly, the transporting floods carrying these sediments in the northern part of the area had higher velocity (not necessarily higher discharge), and secondly, the sediments were transported from environments with both humid and arid climates and under the dominance of physical degradation (Fig. 4). Samples from the eastern part, due to the high solubility of gypsum particles, initially appear as coarse-grained particles and eventually, after a distance of about one kilometer, completely transform into fine-grained particles.

Fig. 4.
figure 4

Geological characteristics of the studied region. (a) The sedimentary facies and lithology in the upstream section originating from Bakhtiari formations in the northern part of the area. (b) Poor sedimentary facies and lithological variations in the upstream section sourced from Gachsaran formations in the eastern part of the area. (c) Distant outcrop from the downstream section in the eastern part of the area, facing southeast.

In this study, for the geochemical and statistical analysis of elements, determining the relationship between major and trace elements, as well as achieving a specific genetic pattern, assessing the elemental contamination of the region, and identifying the sources of contamination in terms of their potential to generate dust, analyzing toxic and heavy element anomalies, and identifying the origins of elements in the plain, statistical analyses of major, trace, and rare elements were conducted. In geochemical studies, various factors such as genetic factors can establish connections between variables. The degree of this dependence can be indicated by correlation coefficients [20]. In this method, by using regression and other statistical parameters, it is possible to identify the controlling factors and processes of sedimentation and to determine the role of natural and anthropogenic (man-made) factors in the distribution of sediments [13]. Before determining the correlation coefficient, one must first ensure the normality of the data distribution in order to use the appropriate coefficient based on that. As can be seen in Table 5, except Ba, Na, Sr, and Pr elements, other elements have a normal distribution. The distribution of these four elements was converted to normal using the logarithm method. Considering the normal distribution of elements in the sediment samples, the correlation between the study parameters was examined using the Pearson correlation coefficient. The results indicate a correlation among the investigated elements. The reason behind this is the lack of diversity in pollution sources in the study area (Table 6). In the cluster analysis based on the studied elements and sampling stations, an initial elimination of cases with low correlations was performed, leaving out elements that exhibited weak relationships for subsequent analyses. Following the dendrogram resulting from the cluster analysis, post the exclusion of elements with minimal or absent correlations, which amounted to a total of 8 elements, the sampling stations were also categorized into two main groups based on the anomalies and element behaviors. Specifically, sampling stations 4, 20, 5, 2, 1, 3, 7, 11, and 12 were grouped together, sharing common characteristics, potentially indicating exposure to similar anomalies or contaminations, or possibly originating from the same sedimentary environment. Furthermore, the second cluster reveals the presence of two subsets with high intercorrelations. The first subset encompasses stations 8, 10, 6, and 14, while the second subset involves stations 18, 22, 17, 19, 21, 15, 23, 16, 38, 27, 33, 31, 37, 13, and 25 (Figs. 5 and 6). In these figures, the numbers represent the identifiers of the sampling sites analyzed in this study, whose characteristics were examined and classified accordingly. Numbers 1 through 19 correspond to samples BK1 through BK19, and numbers 20 through 38 correspond to samples G1 through G19.

Table 5. The results of the K-S test on the studied sample
Table 6. Results of correlation coefficients of studied elements The results of the K-S test on the studied sample
Fig. 5.
figure 5

Dendrogram resulting from cluster analysis for all studied elements: (a) before eliminating elements with low correlations, (b) after eliminating elements with low correlations.

Fig. 6.
figure 6

Dendrogram resulting from cluster analysis for all studied stations: (a) before eliminating stations with low correlations, (b) after eliminating stations with low correlations.

Based on these data, it is possible to infer the origin and genesis of each of the elements; such that elements exhibiting positive correlations are discernible by their terrestrial origin. In the Dendrogram analysis, three main clusters can be identified. Elements Mn, Ni, Ce, Y, Al, Sm, K, La, and Dy form one cluster, elements V, Er, Zr, Co, and Mg form another cluster, and elements Ti, Th, Nd, and Gd form a subcluster within the third cluster. Additionally, elements Fe, Cr, Nb, Tl, Sc, and Ta form another subcluster within the third cluster. The determination of the number of clusters mentioned is based on the genetic behavior of the elements. It appears that clusters containing elements with established terrestrial origins, such as Fe, might signify a terrestrial source for those elements. The grouping of elements V, Er, Zr, Co, and Mg in one cluster, and Mn, Ni, Ce, Y, Al, Sm, K, La, and Dy in another, could potentially indicate a human-related origin for these elements; although the possibility of dual origins remains plausible in this context. Factor analysis involves three stages: data suitability assessment for analysis, factor extraction, and interpretation of results. The Kaiser-Meyer-Olkin (KMO) coefficient and Bartlett’s Test of Sphericity are suitable tools to ensure the data’s appropriateness for this analysis. Low KMO coefficient values (less than 0.5) indicate that the correlation between parameters cannot be explained by other parameters [5]. The KMO coefficient ranges between one and zero. If the KMO value falls between 0.5 and 0.69, cautious use of the analysis is possible, and if it exceeds 0.7, factor analysis can be employed for data decomposition and analysis [15]. The outcomes of this analysis are presented in Table 7, indicating the suitability of the data for factor analysis.

Table 7. KMO and Bartlett test results for the studied samples

Based on factor analysis, elements with high correlation coefficients have been identified as a single factor. As depicted in Table 8 and Fig. 7, seven factors influence the distribution of elements in the study area, where approximately 50% of the variations are attributed to the first factor. The remaining six factors collectively contribute to less than 30% of  the variations. According to this analysis, all measured elements except Na and Pr have high factor loadings (above 0.5) in Factor 1 (Table 9). Consi-dering the  regional context, Factor 1 could potentially signify a terrestrial origin for elements present in sediments, likely derived from the erosion of various source rocks and transported into the basin through river systems [5].

Table 8. The table resulting from the total variance of the studied samples
Fig. 7.
figure 7

Scree diagram of the studied samples.

Table 9. Factor loadings of the studied variables

The presence of various rivers, especially the Karun River, is a potential source for the mentioned elements. In Factor 2, elements Na and Mg exhibit the highest factor loadings. Evaporite minerals, particularly in the eastern part of the region (Gachsaran Formation), could potentially serve as the source for these elements. Elements Zr, V, and Er contribute the most to Factor 3’s factor loadings. In Factor 4, elements Sc and Ta, in Factor 5, elements Pr and V, and in Factors 6 and 7, respectively, Cr and V possess the highest factor loadings. Each of the Factors 2 to 7 indicates a distinct origin contributing to the creation of anomalies. It is conceivable that the presence of evaporites and upstream contaminations from various sources could lead to increased anomaly creation and the emergence of diverse factor loadings. The results of factor analysis further validate the findings of the cluster analysis.

Analysis of Environmental Indices Numerous environmental indices are available to assess pollution levels in soil samples. In this study, a variety of indices, including the Geoaccumulation Index, Enrichment Factor, Pollution Load Index, Ecological Risk Index, Pollution Load, and Contamination Degree, were employed for evaluating soil contamination levels. Various methods of studying environmental pollution using different indices yield fairly similar results for the studied samples. Studies conducted using the Geoaccumulation Index (Igeo) indicated that the investigated samples exhibited a status ranging from uncontaminated to moderately contaminated for most elements, except for elements such as Mg, Cr, Pr, Ni, Sr, and Cd, which were in the uncontaminated to moderately contaminated range. Based on the Enrichment Factor (EF) factor, the studied samples showed varying percentages of enrichment levels from no enrichment to moderate enrichment for certain elements including Ti, P, Mn, V, Cr, Ni, Cu, Y, Co, Sc, Nb, Gd, Er, and Pr. Elements Sr, Mg, and Cd in some samples fell within the moderately enriched range. The investigation of Contamination Factors (CF) for the studied samples revealed that elements including Na, Mg, Mn, Cr, and Pr fall within the moderate range for some samples. For elements Sr and Cd, certain samples exhibited values within a significant range. The assessment of pollution present in the area using the Nemerow Integrated Pollution Index (NIPI) indicated that these samples exhibit low pollution levels for elements like Mg, Cr, Ni, and Pr, while showing high pollution levels for elements Sr and Cd (Table 10). The examination of Modified Degree of Contamination (mCd) values for the studied samples indicated that the studied samples are devoid of significant ecological risk (Table 11). It is worth mentioning that agricultural activities in this region could potentially elevate this risk in the future. In addition to the employed indices for determining pollution and assessing the hazardous nature of the study area, two additional indices, namely Pollution Load Index (PLI) and Potential Ecological Risk (PER), were used to gauge the pollution level of the area. According to these two indices, the studied samples exhibit no considerable pollution (Table 11).

Table 10. Environmental indicators (Igeo, EF, Cf, NIPI)
Table 11. Environmental indicators (PER, PLI, mCd)

The summary of results obtained from various environmental indices indicates that the studied samples exhibit contamination for elements Mg, Sr, Cr, Ni, Pr, and Cd based on the four indices Igeo, EF, CF, and NIPI. Among these elements, the two heavy metals, Cadmium (Cd) and Strontium (Sr), show the highest levels of contamination according to these indices. Studies related to the soil quality of the plain demonstrate that the concentration of strontium significantly increases in areas with gypsum and saline units (Gachsaran Formation). Therefore, it is inferred that the origin of this heavy metal could be both natural and due to transported alluvium from the Gachsaran Formation towards the plain areas. Among the influential factors in soil contamination by other elements, various anthropogenic sources such as agriculture and the utilization of different elements as drilling mud components in multiple wellbore sections being drilled in the upstream section of the studied region can be mentioned.

SOURCE IDENTIFICATION OF AREA SEDIMENTS

Source identification of various sediments plays a significant role in determining the origins of multiple pollutants transported to the area through water or wind currents [17]. In these studies, various methods have been employed for precise source identification of sediments to trace the origins of different pollutants. These methods can serve as templates for similar studies in other parts of the world, which will be further discussed in the following sections.

ANALYSIS OF MAIN ELEMENT RESULTS BASED ON THE ALUMINUM SCALE:

The SiO2/Al2O3 ratio serves as a key indicator for determining the origin of sediments and identifying sediment sources [13]. When comparing different oxides with Al2O3 to identify sediment sources, a positive correlation is observed with Fe2O3, K2O, MgO, SiO2, and TiO2, while an inverse relationship is found with CaO (Fig. 8). The positive relationship between SiO2 and Al2O3 can be attributed to the presence of these elements in quartz. In sedimentary rocks, a SiO2/Al2O3 ratio greater than 5 to 6 indicates a high degree of maturity of the sedimentary rock [15]. This positive correlation may indicate the presence of feldspars, mica and clay minerals [17].

Fig. 8.
figure 8

The trend between aluminum oxide and some of the other studied oxides in the samples.

Examination of the variations of TiO2 and Al2O3 shows a positive correlation between these two oxides (Fig. 8). Titanium (Ti) is mainly concentrated in phyllosilicates and serves as a reliable indicator for interpreting the origin of sedimentary rocks because of its limited mobility and translocation during depositional processes compared to other trace elements [5, 6]. These analyses are of great value for further interpretations related to the origin of sediments.

GEOCHEMICAL CLASSIFICATION OF SEDIMENTS:

The classification of arkosic and lithic arenites and consequently determining the primary provenance of the sandstones present in soil samples is carried out using the log (Na2O/K2O) versus log (SiO2/Al2O3) diagram, which is based on chemical maturity indices [11]. The plotting of the results of major element analysis onto Fig. 9 revealed that most of the sandstone samples in the study area possess an arkosic composition. This arkosic composition aligns with the sandstone composition found in the Bakhtiari and Aghajari formations (in the northern part of the region) and could indicate the primary provenance for these sediments.

Fig. 9.
figure 9

Geochemical classification of sediments in the study area based on Pettijohn’s classification (1987).

PALEOCLIMATIC STUDIES

Plotting SiO2 against (Al2O3 + K2O + Na2O) provides insight into the paleoclimatic conditions of the source area and consequently, the determination of the primary origin of the studied sediments [17]. Based on this diagram (Fig. 10), the paleoclimatic conditions of the source area during sediment deposition ranged from humid to semi-arid. This information allows for tracing the primary origin of sediments and the sourceless pollutants in the sedimentary deposits. As previously mentioned, identifying the primary source of sediments plays a crucial role in geochemical analysis for pollution source identification. Given the current warm and dry climate of the study area, these paleoclimatic conditions suggest that the transported sediments were likely carried by river flows from the upper parts of the watershed known as the “Sannadaj-Sirjan Zone” to the study area.

Fig. 10.
figure 10

Diagram depicting the paleoclimatic conditions in the study area according to the chart by Suttner and Dutta (1986).

OURCE IDENTIFICATION OF SEDIMENTS USING THE INTEGRATION OF SEDIMENTARY AND GEOCHEMICAL DATA

The composition of sandstones changes during diagenesis and metamorphism [13, 14], so geochemical studies are crucial for identifying the sources of detrital particles and evaluating the degree of pollution in sedimentary rocks [12]. The size of sediment particles in the region is a major factor affecting dust composition [10]. Sand particles have lower rare earth concentrations than clays [18].

The current source of sand particles in the study area are late Neogene sediments with strong weathering, mainly sandstone and conglomerate types, which are widespread in the Karun River basin [1]. The contribution of the Bakhtiari Conglomerate Formation to fluvial sedimentation is minimal. Clays derived primarily from rock units with highrates of weathering and sedimentation, such as several marl layers in the upstream Gachsaran and Mishan formations, significantly affect the behavior of rare earths in feldspars. Higher enrichment of light rare earth elements indicates an increased clay-to-sand ratio, as clay minerals have a high capacity to adsorb toxic elements [12].

CONCLUSION

The research conducted has produced numerous results that can be considered a study template for other regions around the world. In the following section, a summary of the results obtained is presented:

—The positive correlation of each element with the source indicator suggests its geologic origin, while the positive correlation with the bioavailability and watershed indicator elements suggests an anthropogenic source for those elements.

—In analysing the dendrogram of the elements, three main clusters can be identified. It seems that elements with geological origin such as Fe are grouped in one cluster, which is probably related to a geological source for these elements. Elements V, Er, Zr, Co, and Mg form one cluster, while elements Mn, Ni, Ce, Y, Al, Sm, K, La, and Dy form another cluster. This could indicate different anthropogenic sources for the mentioned elements.

—Factor analysis showed that seven factors influence the distribution of elements in the studied area, with the most important changes associated with the first factor. According to the factor analysis, all measured elements except Na and Pr are associated with factor 1, which has a high factor loading (above 0.5). Considering the conditions in the region, the presence of various rivers, especially the Karun River, is the probable source of these elements. In the second factor, Na and Mg have the highest factor loading. Evaporite minerals, especially in the eastern part of the region, could be the source of these elements.

—Based on the results obtained from various pollution indices, it can be concluded that the studied samples exhibit moderate contamination for elements including Mg, Sr, Cr, Ni, Pr, and Cd according to the four indices Igeo, EF, Cf, and NIPI. Neglecting environmental factors could contribute to increased soil contamination by these elements in the study area.

—Integration of sedimentological and geochemical data for sediment provenance analysis revealed that the current origin for the sand particles in the studied area is likely Neogene deposits, specifically the Aghajari and Bakhtiari formations, characterized by significant erosion and high sedimentation rates. These deposits predominantly consist of sandstone and conglomerate types and are extensively distributed in the southwestern part of Iran’s Karun watershed. However, the contribution of the Bakhtiari Conglomerate Formation to the alluvial sediments transported to the rivers and carried downstream is limited. The probable source for the studied sediments, which are part of rock units with substantial erosion and high sedimentation rates, is likely the numerous marl layers within the Gachsaran and Mishan formations. These formations are situated in the upstream sections of the watershed.