Introduction

The exploitation of mining sites has become over time a very important economic activity as the industrialization of companies has taken place. With a continuous increase in the demand for metals, mining activity has begun to afect the environment more seriously; with great changes in the landscape and the volumes of rocks, it concerns (Alloway and Ayres 1997; Custer 2003).

Trace metal contamination of soils is a hazard to water, living species and human health. Trace metal concentrations (TMs) are related to the nature of the parent rock and to weathering processes (Baize 2009). According to Dère (2006), the migration of TMs is controlled by several factors: the metal itself, the physico-chemical characteristics of the soil (pH and EC), the intrinsic properties of each horizon (texture, structure, cohesion, etc.), the water flows and their dynamics. Therefore, high levels of trace metal concentrations can be found after migration around metal mines due to the dumping and dispersion of mine waste in the surrounding soils, crops and rivers. They may eventually pose a potential health risk to residents near mining areas (Jung 2001). Trace metals are bio-persistent, disrupt ecosystems, deteriorate soils, surface waters, forests, crops and accumulate in the food chain (Gouzy and Ducos 2008; Jaishankar et al. 2014).

The objective of mining is the extraction of materials consisting of a set of minerals: ores. But this extraction is associated with other activities such as the processing of minerals leading to the production of tailings. It is these tailings that, initially in geochemical equilibrium at depth, are exposed to a new environment, namely the soil, surface water and air. Contact with surface water, which is often less ionically charged, can have an impact on the balance of dissolution and precipitation (Sigg et al. 1992). The oxidation of sulphides causes extreme acidification of the environment and thus of the leach water from these tailings: this is the typical Acid Mine Drainage (AMD) of mine tailings (Edraki et al. 2005; Lottermoser and Ashley, 2005). His acidification has important consequences since it favours the desorption of certain heavy metals or metalloids (Chromium, Copper, Nickel, Arsenic, Lead, etc.) in more mobile and toxic forms, and produces highly polluted water which in turn affects the soil.

Geostatistical analysis consists of predicting an unknown variable using a property measured at a given position and time (Yasrebi et al. 2009). Geostatistics allows for consistent analysis of data, uncertainties and errors in the data, and spatial structuring of grades (Malherbe and Rouil 2003). This analysis in turn allows the production of concentration maps of the studied parameters (Florine et al. 2019). Based on the assumption of estimating an unknown variable at any position, several techniques have been developed to predict the spatial variability of soil properties, such as ordinary kriging (OK), inverse distance weighting (IDW), artificial neural networks and soil transfer functions (Zhao et al. 2010; Veronesi et al., 2014). In recent years, OK has been widely used by many authors for the preparation of spatial variability maps of trace metals and soil physicochemical parameters (Behera et al. 2016; Behera and Shukla 2015; Tripathi et al. 2015). In practice, the first step is to carry out an exploratory analysis of the data (location map, descriptive statistics, histogram, etc.) in order to identify general trends, anomalous data and delineate areas with different properties (GeoSiPol 2005; Mathieu et al. 2014). The next step, which consists of characterizing the spatial variability of the physicochemical properties of the soils, in particular the extent or correlation distance and the anisotropy of the environment, helps to guide the choice of the interpolation method. The geostatistical approach used to map the physicochemical parameters (pH and EC) and the trace metals (Cr, Ni, Cu, As and Pb) spatial variabilities, in this study includes four phases: effective analysis of the data, which consists of determining the statistical characteristics necessary to describe the pollution of the soils; interpolation of the levels of pollutants by the kriging method based on realistic maps of the spatial dispersion of pollutants in soils; estimation of contaminated areas and determination of critical thresholds. Geostatistical approach appears as an effective tool to help managers of polluted sites and soils in possible clean-up projects (Deraisme and Bobbia 2003; GeoSiPol 2005; Jeannée and De Fouquet 2000; Jeannée, 2005; Kumar et al. 2016).

The study of spatial variability of physico-chemical parameters (pH and EC) and trace metals (Cr, Ni, Cu, As and Pb) of a former site requires the creation of pollutant concentration maps at the level of the study area (Bhuniaa et al. 2018; Khan et al. 2019). It is necessary to spatialise the pollutant levels obtained at a few sampling points on the site, thanks to laboratory analyses carried out after a rigorous sampling campaign. Several authors (Leenaers et al. 1989; Scholz et al. 1999; Jeannée and De Fouquet 2003; Demougeot et al. 2009; Garcia et al. 2019) have used geostatistical techniques in the study of soil quality to map and estimate the concentrations of pollutants present at the site and regional scale. The geostatistical approach also makes it possible to integrate other information relevant to the mapping of the pollutant: concentrations from physicochemical modelling, land use, and measurements of other pollutants with similar behaviour (Bobbia et al. 2001).

Numerous studies have evaluated sol pollution using traditional methods. These approaches include: quality guidelines (SQGs), statistical multivariate techniques, calculated pollution indices and/or ecological risk assessments (Ayiwouo et al. 2022; Chen et al. 2021).

In the present study, a site sensitivity analysis demonstrates the value of in-depth geostatistical modeling. This modeling entails a thorough exploratory data examination, which is necessary for any estimating technique. In order to account for the heterogeneity of the distribution of the concentrations on the site, the aim is to detect the spatial variability of the concentrations and quantify it using a variogram. The various variogram models necessary for simple kriging interpolation are chosen using the cross-validation technique.

The main objective of this study is to assess the spatial variability of physico-chemical parameters (pH and EC) and trace metals (Cr, Ni, Cu, As and Pb) in the abandoned gold mining district of Bindiba (East Cameroon) and to predict their values at uns-ampled locations.

Materials and methods

Study area, sampling and analytical procedures

The study area is located about 12 km from the locality of Bindiba, in the district of Garoua-Boulai, Lom and Djerem department, East Region of Cameroon (Fig. 1). It covers an area of about 3092.34 km2. The study area has an equatorial Guinean climate. With around 2.1 months of extreme heat and four months of extreme cold, there are four seasons of differing lengths (PCD, 2012). The temperature ranges from 14 to 35 °C all year long. The South Cameroon plateau, which has an altitude range of 650–900 m, accounts for the majority of the relief in the Eastern Region of Cameroon (PCD 2012). Although gold mining has significant economic benefits, it also degrades the quality of the local soils, which leads to the unregulated dumping of mining waste on the site, which has a significant negative impact on the ecosystem.

Fig. 1
figure 1

Map of the study area and locations of the sampling sites

The sampling map is presented in Fig. 1.

It was vital to optimize sampling and analysis to assure the quality of the data, so the sample sites were chosen based on the mining density. During the course of this study's first site reconnaissance, acid mine drainage was discovered at a few different locations on the site. Thus, there were two soil sampling campaigns. The first was conducted during the rainy season, specifically in August 2020, and resulted in the collection of 33 samples. During this campaign, sampling was done at intervals that were, on average, 100 m apart. In May 2021, a total of 56 samples were collected with an average spacing of 50 m as part of the second campaign, which intended to enhance sampling density. The differences in pitch between the two sample campaigns are caused by the anthropogenic (houses, deep trenches, lakes, embankments, etc.) and natural (vegetation, mountains, rivers, etc.) obstructions present in the study area. To prevent the sample technique from affecting the statistics, all these step changes were implemented. A total of 89 soil samples were collected during the study (Fig. 1).

As recommended by IAEA (2004), a systematic sampling was used as the sampling strategy since it enables for the determination of average pollutant concentrations as well as the parameters pH and EC in various cells. A volume of 500 g per sample was collected, preserved in polythene bags, and brought to the lab for additional examination after samples were manually taken at depths ranging from 0 to 10 cm.

The analysis of the physic-chemical parameters consisted in determining the pH and EC using a pH meter and a multi-parameter apparatus of the brand HANNA HI991300-HI991301 previously calibrated. The method consisted of introducing the sediment sample (6 ± 1 g) into a jar containing distilled water (15 mL) (AFNOR NF X-31-103, 1998). The sample was then stirred in the jar for 20 min and then left to stand for the decantation of the solid elements for 30 min. After complete settling, pH and EC measurements were taken. For each sample, the electrode was rinsed with distilled water and dried, then the operation was repeated for each sample.The pH and EC were analyzed at the Laboratory of Environment and Eco-materials of the School of Geology and Mining Engineering of the University of Ngaoundéré, Cameroon.

The trace metals analyzed in the soil samples were chromium (Cr), nickel (Ni), copper (Cu), arsenic (As), and lead (Pb). Their concentrations were determined in mg kg−1 using the Skyray Instrument EDX Pocket III X-ray fluorescence spectrometer with a detection limit of 0.001–0.01%. The analysis was carried out at the laboratory of the Framework of Support for the Promotion of Mining Handicrafts “(CAPAM)” located in Yaounde, Cameroon.

Geostatistical analysis

The geostatistical analysis consisted in three step. The first step was the exploratory analysis of data, the second was the varigraphic analysis (VA) and the third, production of prediction maps with kriging interpolation (Chiles et al. 2009).

Exploratory analysis

The base of any geostatistical analysis is the exploratory data analysis (Rivoirard 2003). It is a crucial phase in comprehending a dataset and enables first-class information base cleaning (positioning, copying errors, etc.), followed by the development of each quantity's primary statistical characteristics as well as the connections between them.

The exploratory analysis, which follows the numerous sampling phases with their flaws and faults, entails a statistical study of the concentrations of the pollutants being tracked, including correlations between various elements and the statistical distribution of concentrations. By characterizing the histograms, it is possible to identify outliers or suspicious values at this phase. Additionally, it enables the visualization of the distribution curve and perhaps the detection of many modes. We can choose the best interpolation approach by using the exploratory analysis to better understand the spatial organization of the information (Jeannée and De Fouquet 2003).

Variographic analysis

The variogram (or semi-variogram) is a geostatistical tool that allows the quantification of spatial variability as a function of distance. It is defined experimentally as the average over the field studied of the half-square of the difference between the values measured at two points, as a function of the distance and orientation separating these two points. A variogram (or semi-variogram) is used to measure the spatial variability of a regionalised variable and provides the input parameters for spatial interpolation of the kriging (Webster and Olivier 2006). It can be expressed as follows.

$$ \gamma \left( h \right) = \frac{1}{2N\left( h \right)}\mathop \sum \limits_{i = 1}^{N\left( h \right)} N\left( h \right)[Z\left( {x_{i} + h} \right) - Z(x_{i} )]^{2} $$
(1)

where N(h) is the number of pairs of points distant from h; \([Z\left( {x_{i} + h} \right) - Z\left( {x_{i} } \right]^{2}\) the squared deviation of the contents at points x and x + h.

The VA is the first step for obtaining the kriging weights \({\uplambda }_{\mathrm{i}}\). The so-called semivariogram \(\upgamma (\mathrm{h})\) represents the average variance between the observations separated by a certain distance, and describes the structure of the spatial variability of the investigated variable (Chambers et al. 2000). The semivariogram is calculated by using Eq. (2) (Journel and Huijbregts 1978):

$$ \gamma \left( h \right) = \frac{1}{2N\left( h \right)} \mathop \sum \limits_{i = 1}^{N\left( h \right)} \left( {Z\left( {x_{i} + h} \right) - Z(x_{i)} } \right)^{2} , $$
(2)

where N is the number of pairs of sample points separated by the distance h.

The semivariogram, however, often does not provide information for all possible distances. Therefore, a semivariance model must be fitted to the semivariogram (such as a stable, exponential, or spherical model). The type of model is selected based on the properties of the data. We basically follow the rule that the sum of the products of the lag size and the number of lags in the semivariogram should be approximately half the distance between all points (Verfaillie et al. 2006).

In geostatistical modeling, sampled data are considered as the result of a random process. Therefore, there is always some likelihood or uncertainty attached to the predictions. In order to do this, the Kriging system is defined by Eq. (3):

$$\widehat{{Z}_{1} }\left({\mathrm{S}}_{0}\right)= {\mu }_{1}+{\varepsilon }_{1}\left({S}_{0}\right),$$
(3)

where \(\widehat{{Z}_{1}}\) is the estimated variable at location \({\mathrm{S}}_{0}\) decomposed into the deterministic trend \({\mu }_{1}\) and the random, auto-correlated error \({\varepsilon }_{1}\) at location \({S}_{0}\).

The best-fitting model was selected for each soil parameter with root mean square error (RMSE).

$$ {\text{RMSE}} = \sqrt{\frac{1}{N}} \mathop \sum \limits_{i = 1}^{N} [Z(x_{i} ) - \overline{Z}\left( {x_{i} } \right)]^{2} $$
(4)

where \(\mathrm{Z}\left({\mathrm{x}}_{\mathrm{i}}\right)\) is the predicted value from cross-validation; \(\overline{Z }({x}_{i})\) the observed value.

Interpolation by the kriging method

As the most precise spatial interpolation technique from a statistical standpoint, kriging enables a linear estimation based on statistical quantities of spatialized data (Baillargeon 2005). It is a single-variable geostatistical method that minimizes the estimation variance determined using the variogram while computing, interpreting, and modeling the variance as a function of the distance between the data. Kriging, in contrast to other methods, is preceded by an investigation of the spatial variability of trace metal concentrations and physicochemical properties at the site and is followed by a computation of the related estimate's error (Wu et al. 2010). The estimated concentration z of a given pollutant at a point \({x}_{0}\), denoted \({Z}^{*}\left({x}_{0}\right)\) is obtained by linear combination of n concentrations at the measurement points:

$${Z}^{*}\left({x}_{0}\right)=\sum_{i=1}^{n}{\lambda }_{i}{Z}^{*}\left({x}_{i}\right).$$
(5)

The choice of coefficients \({\lambda }_{i}\) called kriging weights makes it possible to differentiate kriging interpolation from other classical interpolation techniques (inverse of distances, nearest neighbour, etc.). \({\lambda }_{i}\) depends on the distances between the data and the target \({x}_{0}\), the distances separating the data from each other (taking into account the grouping of data, in particular at the operation site) and the spatial variability of the soil parameters.

The ordinary kriging, which is appropriate for the analysis of polluted sites and soils (Khan et al. 2019; Gaida et al. 2019), was used to create the prediction maps while also taking into account the spatial variability of the studied parameters. These maps offer helpful visual representations of spatial variability in the field and can be used to summarize and portray soil characteristics where it is possible to identify natural hazards (Goodchild et al. 1993). Any interpolated map can be matched with a precision map using this estimator. ISATIS-neo software was utilized to carry out the geostatistical study.

Accuracy assessment

The effectiveness of spatial variability maps has been evaluated using the cross-validation method (Delfiner 1976; Parker et al. 1979; David 1987). This method entails eliminating the data one at a time, then estimating the remaining data by kriging from the deleted ones (Chilès and Delfiner 1999). It also enables the evaluation of the magnitude of the point estimate's mistakes. The selection of the various models in this study was made following the cross-validation procedure.For this, three evaluation indices were used: the mean absolute error (MAE) and the root mean square error (MSE) measure the accuracy of the prediction, while the prediction quality (G) measures the effectiveness of the prediction (Utset et al. 2000). The experimental histogram of standardised errors (error divided by the standard deviation of the kriging) can be used to validate the assumption of normality of estimation errors. The error is calculated as the difference between the measured concentration and its estimate (De Fouquet et al. 2004). The MAE is a measure of the sum of the residuals.

$$ MAE = \frac{1}{N}\left[\kern-0.15em\left[ {\mathop \sum \limits_{i = 1}^{N} Z(x_{i} ) - \hat{Z}(x_{i} )} \right]\kern-0.15em\right] $$
(6)

where \(\widehat{Z}\) is the predicted value at location i. Smaller MAE values indicate less error. However, the MAE measurement, does not reveal the magnitude of the error that can occur at any point:

$$ MSE = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} [Z(x_{i} ) - \hat{Z}\left( {x_{i} } \right)]^{2} $$
(7)

The squared elevation of the difference at any point gives an indication of the magnitude, e.g. the smaller the MSE values the more precise the point-by-point estimate. The effectiveness of the prediction compared to that which could have been obtained using the sample mean is given by the measure of G, (Schloeder et al. 2001; Reza et al. 2018).

$$ G = \left[ {1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} [Z(x_{i} ) - \hat{Z}\left( {x_{i} } \right)]^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} [Z(x_{i} ) - \overline{Z}]^{2} }}} \right] \times 100 $$
(8)

where \(\overline{Z }\) is the sample mean. G is one of the methods used for the accuracy of interpolated maps. (Tesfahunegn et al. 2011). G values are used to check the reliability of the interpolated maps of the concentrations of the studied soil properties. Positive G values indicate that the map obtained by interpolating data from the samples is more accurate than an average. Negative and close-to-zero G values indicate that the average predicts the values at unsampled locations as accurately as or even better than the sampling estimates (Parfitt et al. 2009).

Results

Statistical analysis

The spatial variation and descriptive statistics of the studied parameters are presented in Fig. 1 and Table 1

Table 1 Descriptive statistics for selected chemical soil parameters

Spatial variation of physicochemical parameters

In Table 1, the soil pH in the study area was acidic to near neutral (5.01–6.19), with an average value of 5.58. EC values ranged from 7 to 47 µS/cm, with a mean value of 20.94 µS/cm.

Spatial variation of trace metals

The concentrations of Pb and As ranged from 0.00 to 53.27 mg kg−1 and 0.00 to 46.583 mg kg−1, with a mean concentration of 6.17 and 6.01 respectively. The concentrations of Cr, Cu and Ni ranged from 22.15 to 442.44 mg kg−1, 9.25 to 360.37 mg kg−1, and 1.28 to 320.86 mg kg−1, with a mean concentration of 122.1, 169.41, and 192.07 respectively. The CV, was low for all variables, except for Pb and As and the highest CV were observed for Pb (1.95) and As (1.78), while the lowest CV for Cu (0.37). The concentrations of trace metals in the soil decreased in the following order: Cr > Ni > Cu > Pb > As.

Histograms

The distribution of data sets (histograms) is presented in Fig. 2.

Fig. 2
figure 2

Histograms of studied parameters: a pH; b Log pH; c EC; d Log EC; e Cr; f Log (Cr); g Ni; h Log Ni; i Cu; j Log Cu; k As; l Log As; m Pb and n Log10 Pb

In Fig. 2, the data's histograms revealed that the various parameters were distributed in a skewed unimodal fashion. In order to comprehend the distribution of the data, a transformation was typically used because the data were severely skewed. The issue of zero concentration values of the components Pb and As was resolved by using a decimal log transformation and adding the constant 1.

Geostatistical modeling

Variographic analysis

The interest of geostatistic consists in particular in taking into account the spatial continuity (structure) of the variable that one wishes to map. Figure 3 shows the spatial distribution of the parameters studied as a function of their concentrations. The characteristics of the semivariogram models shown in Fig. 4 are listed in Table 2. These characteristics make it possible to analyze the spatial variability of the studied parameters; in particular, the thresholds, the RMSE, the nugget effect's amplitude, the ranges, and ultimately the nugget/sill ratio are given careful consideration. The ranges (distances of spatial dependence) of studied parameters were 236.58, 321.78, 396.66, 327.06, 739.55, 271.80, 297.02 m for pH, EC, Cr, Ni, Cu, As and Pb, respectively. Two variogram models were fitted, the exponential model for Ni and Cu with the lowest RMSEs of 1.34 and 2.28, respectively and the spherical model for pH, EC, Cr, As and Pb, with the lowest RMSEs of 0.27, 15.52, 1.27, 17.88 and 12.33, respectively.

Fig. 3
figure 3

Sampling points

Fig. 4
figure 4

Omnidirectional variograms of the concentrations and their adjustments: a pH; b EC; c Cr; d Ni; e Cu; f As and g Pb

Table 2 Characteristics of variogram models of studied parameters

Figure 4 shows the experimental variogram of the studied parameters.

The characteristics of the variograms is presented in Table 2.

Cross-validation

The cross-validation results are shown in Table 3 and Fig. 5.

Table 3 Statistics of the cross validation errors
Fig. 5
figure 5

Histogram of the errors of cross validation

The performance of the ordinary kriging methods was determined by cross-validation using, MSE and MAE. The ordinary kriging methods were validated at each sample site by nearby samples using the cross-validation technique developed for testing the variogram model, which then compares approximations with actual values.

Kriged interpolation

Ordinary kriging interpolation was used in this study to obtain the prediction maps. Figure 6 and 7 showed respectively the spatial distribution of physicochemical parameters (pH and EC) and the distribution of trace metals (Cr, Ni, Cu, As and Pb) using ordinary kriging interpolation. In Fig. 6, pH values ranged from 5.40 to 5.90, and the predict values of EC ranged from 15.81 to 31.72 µS/cm. According to trace metals, the range of variation obtained with ordinary kriging interpolation are 70.55–231.31, 40.65–295.12, 24.03–306.57, 2.56–39.14, 0.10–19.45 mg/kg.

Fig. 6
figure 6

Kriged maps of physicochemical parameters: a pH; b EC

Fig. 7
figure 7

Kriged maps of a Cr; b Ni; c Cu; d As and e Pb

The pH ranged from 5.55 to 5.70 (Fig. 6a).

Discussion

Physicochemical parameter pH and EC concentration in soil

The stability of the soil pH leads to the stability of the dissolved trace metal fractions, an increase leads to their decrease (Ciesielski et al. 2007). The EC varies with the concentration of dissolved salts (Bohn et al. 1985). The high EC and pH generally increases with the salt concentration (Seatz and Peterson 1965). In addition, soil EC could be related to other soil properties such as water holding capacity, depth of soil humus cover. The pH values observed across the site indicated an acidic pH (5.58–6.19). This low pH may be a sign of acid mine drainage (AMD) and could be caused, in part, by pyrite (FeS2) alteration that was observed at the gold mining site. According to the EC values observed in Table 1, all the sampled points had low EC values. The EC is conditioned by the geological composition of the study area and varies from medium to high in areas where dolomites predominate (Iavazzo et al. 2012). For pH and EC, Table 1 displays lower values for skewness. This may be related to the inherent properties of variables, mining operations, sampling strategies, and sampling size.

Trace elements in soil

Soil acidity favours the mobility of trace metals Cr, Ni, Cu, As and Pb, notably by dissolving metal salts or destroying the retention phase (Serpaud et al. 1994; Lion 2004). Wide variations in As and Pb displayed values from extremely low to zero that might be regarded as outliers. These abnormally low soil test results could represent a natural or management-induced variance in the study area's soils rather than always being an outlier. Therefore, the presence of outliers in the dataset can alter the structure and characteristics of the variogram. According to Barnett and Lewis (1994), outliers can lead to a phase shift that defies geostatistical theory and cause the variogram to be irregular (Armstrong and Boufassa 1988). To prevent outliers from having a negative impact on the variograms, the outliers for soil Pb and As were changed to the value 1. A number less than 1 implies that the data are distributed normally, and the skewness parameter shows the data's divergence from normalcy (Reza et al. 2018). Considering using a logarithmic transformation was done for skewness coefficients greater than 1 (Webster and Oliver 2008). As and Pb had high skwness values compared to the others (Table 1), which proves that they are irregularly distributed. Various sources of contamination could be responsible for this anomalous distribution. Additionally, the median of each soil attribute was lower than the mean, showing that the sample value was not significantly affected by the anomalous data (Brejda et al. 2000; Cambardella et al. 1994; Cambardella and Karlen 1999; Durdevic et al. 2019; Emadi et al. 2008; Young et al. 1999). With the exception of Pb and As, all soil characteristics had relatively modest standard deviations when compared to their mean values. This might be explained by the fact that the data range contains a lot of zero values.

The highest and lowest Cr concentrations were observed at sampling points S11 and S16 respectively (Fig. 7a). The majority of the sampling points showed values above the WHO recommended value, implying that the soils at the gold mining site were contaminated with Cr.

In Fig. 7b, sample points E19 and E05 had the highest and lowest Ni concentrations, respectively. E05 was located upstream of the gold mining site, and E19 was a sampling point in the gold mining area. According to the SQG, Ni concentrations were higher than those set by WHO guidelines and AFNOR U 44-041. This shows that the soils at the site were contaminated with Ni. The slow percolation of water through the soil allowing the chlorite to dissolve may be the cause of the high Ni concentrations in the soil.

Low concentrations of Cu were found upstream of the site, while concentrations above the AFNOR U 44-041 standard and WHO guidelines were found in the gold mining area. Sample points S09 and S23 had the highest and lowest Cu concentrations, respectively (Fig. 7c). Mineral weathering and soil leaching favour high Cu concentrations. Copper is found in abundant and reasonable amounts in the earth's crust (Wedepohl 1995).

The average concentrations of the trace metals Cr, Ni and Cu were higher (Table 1) than those recommended by the World Health Organisation WHO (Goria et al. 2010) and the AFNOR U 44-041 standard (Villanneau et al. 2008). These results indicated, according to the SQG, that the soil was polluted by Cr, Ni and Cu.

As a result, for concentrations like Cr, Cu, Pb, and As, there was a clear distinction between the readability of the concentration and log histograms. The distribution of the pH, EC, and trace metal histograms was favorably skewed.

Spatial variability of pH, EC and trace metals

The variogram (Fig. 4) describes the spatial continuity and regularity of the phenomenon (Webster and Olivier 2006). The numbers indicated the number of pairs of points for each stage of the experimental variogram. The variograms were fitted with the model as well as the parameters nugget effect, sill and range. The isotropic variograms were chosen since the anisotropic variograms did not demonstrate variations in the spatial dependence on direction. While Ni and Cu displayed an exponential pattern, pH, EC, Cr, As, and Pb displayed a spherical pattern. All of the parameter variograms displayed a nugget effect. The variogram threshold is a measure of the values' dispersion and is less susceptible to the size of the field and the asymmetry of the experimental variance data (Matheron 1970).

When compared to the size of the study area, the various indicators indicated a medium range of structure. This structuring indicated minimal variability at close distances and stationarity at this scale. In contrast to the regular variation of the concentrations of the various parameters, the variance of the ranges was minimal throughout the whole study area. The division that was adopted limits the fluctuation in the geographical structure of the concentrations and enables the best possible monitoring of their behavior across the whole study area.

There was a significant discrepancy between the nugget effect values of trace metals and physico-chemical parameters, as shown by the nugget effect values (Table 2), including at least two sources of variability. The sampling error (collection, handling, and measurement), spatial variability for distances below the minimum distances between experimental points, and random and intrinsic variability are likely to blame for the positive nugget seen in all the analyzed parameters (De Fouquet et al. 2004).

The classification of the spatial dependence of the soil properties was done by the nugget/sill ratio. According to Cambardella et al. (1994), the nugget/sill ratio are classified into three classes: Strong (< 25%); Moderate (25–75%) and Random (> 75%). The nugget/sill ratio showed a strong spatial autocorrelation for Cu (0.06), a moderate autocorrelation for pH, EC, Cr, Ni, As and Pb with values of 0.65, 0.67, 0.62, 0.27, 0.57 and 0.27 respectively. Strong and moderate autocorrelation of trace metals and physico-chemical parameters could be attributed to human activities, soil formations and other external factors such as erosion, leaching. The nugget/sill ratio has suggested that structural factors, such as climate, topography and other natural factors, play an important role in spatial variability (Venteris et al. 2014).

Cross-validation

The results of Fig. 5 and Table 3 revealed that while EC, Cr, Ni, and Cu had relatively high MAEs and MSEs, pH, As, and Pb had relatively low MAEs and MSEs. The G-values were greater than 0 for all the variables, demonstrating that using the variogram parameters for spatial prediction was preferable than assuming that each unsampled location's property value would equal the average of the observed value. This shows that the variogram parameters that were created by fitting the experimental variogram values were enough for representing the spatial variation of pH, EC, Cr, Ni, Cu, As, and Pb. Cross-validation fndings verifed that the variogram parameters created by ftting the experimental variogram values were accurate and consistent.

Interpolation maps

These acidic pH levels may be the result of mining operations and the beginning of acid mine drainage. The northern part of the study area was where the low pH was most prevalent. The EC prediction map (Fig. 6b) revealed that soil salinity was at its lowest in the study area's western portion and at its highest in the study area's northern (low elevation) and majority of center part. Variations in erosion instability, water runoff, and micro-topography may be responsible for this greatest capacity for change over short distances (Behera and Shukla 2015).

High concentrations were observed on the Cr prediction map (Fig. 7a) in the part area's south and southwest. Slight Cr concentrations were found in the center of the study area, and these concentrations moved eastward. The Ni prediction map (Fig. 7b) displayed an uneven distribution of Ni concentrations throughout the whole study area. In the northern part of the study area, there were high Ni concentrations found. According to the Cu prediction map (Fig. 7c), Cu concentrations were seen practically everywhere in the area, with the highest concentrations found in the northern, southern, and eastern regions. Significant As concentrations were found to the north and south-east of the study area (Fig. 7d). Finally, the Pb prediction map (Fig. 7e) displayed average Pb concentrations in the study area's north and east. Mining activities with the extraction works that promote the natural weathering of trace metals in the surrounding rocks and minerals of the study region can be considered the primary cause of pollution.

Conclusions

This paper investigated the spatial variability of pH, EC and trace metals in the soils of the abandoned mining district of Bindiba (East Cameroon) using geostatistical approach. The soils of the mining area were acidic to near neutral, slightly conductible and polluted by trace metals. The variographic analysis revealed that the spatial variability was described by the spherical model for pH, EC, Cr, As and Pb and the exponential model for Ni and Cu. The high spatial dependence of pH, EC and trace metals was observed as a considerable autocorrelation between the variables indicating that these trace metals were controlled by external factors such as mining activity, rocks and minerals weathering and internal factors such as soil composition. Cross-validation was used to choose variogram models for basic kriging, which made it possible to create accurate maps of pH, EC, and trace metal concentrations in un-sampled locations. According to the modeling of the spatial structure, The nugget/sill ratio showed a strong spatial autocorrelation for Cu (nugget/sill ratio < 25%), and moderate autocorrelation for pH, EC, Cr, Ni, As and Pb (nugget/sill ratio 25–75%). Therefore, compared to direct measurement, which involves time and costs, the geostatistical approach to assessing soil contamination in un-sampled locations appears to be a precise and reliable tool for accurate prediction. It could also help in decision-making for developing a better strategy for remediating abandoned mining sites.