Introduction

The determination of quality reference values (QRVs) is the basis for monitoring areas suspected of contamination. They are indispensable for establishing legislation consistent with the local reality, avoiding inappropriate interventions (Baize and Sterckeman 2001; Preston et al. 2014). In Brazil, the state of São Paulo environmental agency pioneered the creation of quality guidelines for soil and groundwater (CETESB 2005, 2014). The agency defined three guiding values for soil monitoring in three scenarios (agricultural, residential, and industrial areas) that were later adopted in the national legislation. These guiding values are (i) quality reference value (QRV), representing the natural concentration of the element in soils without anthropic influence; (ii) prevention value (PV), used only for the agricultural scenario; and (iii) intervention value (IV), a trigger indicating that remediation or contamination containment is mandatory (CONAMA 2009).

The prevention and intervention values are based on human health risk assessment and valid for the whole country. The QRV, on the other hand, must be defined by each state in Brazil as natural values of potentially toxic elements (PTE) depend on the local composition of the parent material, the pedogenesis processes, and the degree of soil development (CONAMA 2009; Paye et al. 2010). Following the national directive, several Brazilian states (e.g., São Paulo, Minas Gerais, Rio Grande do Norte, Paraíba, Pernambuco, and Rio Grande do Sul) have established QRV for PTE in soils (Nogueira et al. 2018; Souza et al. 2015; Preston et al. 2014; Almeida et al. 2016; CPRH 2014; Althaus et al. 2018). The knowledge of the pedogeochemical baseline, i.e., the natural variation of PTE concentration in surface soils, is the first step to determining regional QRV for PTE in soils (Biondi et al. 2011). The QRVs are needed for future monitoring of PTE and political decision-making in the assessment of contaminated soils. The environmental legislation is particularly important to states experiencing substantial economic, industrial, and agricultural growth that can affect soil quality.

The state of Piauí, located in the northeast region of Brazil, is divided geographically into four mesoregions: North, Center-North, Southeast, and Southwest (IBGE 2011). The low regional integration of the state has contributed to the emergence of two distinct territorial portions. In the north, urbanization with a relatively high concentration of people and industrial activities prevail. In contrast, the last agricultural frontier of Brazil lies in the south; this portion of the state underwent the implantation of a modern, highly technical agriculture for the production of grains, fibers, and energy that had the consequence of gradual removal of natural vegetation (Barbosa et al. 2016; França et al. 2016).

Intensive human activities often neglect environmental issues and can pose a risk to ecosystems and human health. In this scenario, the establishment of QRV for PTE in the soils of Piauí is essential to the more effective and sustainable development of the state. Therefore, we determined the natural concentrations of Ba, Cd, Co, Cr, Cu, Fe, Mo, Ni, Pb, Sb, V, and Zn and used these background concentrations to calculate the QRV for the state. Furthermore, we assessed the geological influences on PTE variation in soils and the relationship between soil characteristics and PTE concentrations.

Material and methods

Study area

The study area comprises the state of Piauí, located in an ecotonal area between the Cerrado and the Caatinga biomes with an area of 251,529.186 km2 (Fig. 1). The Piauí state has approximately 85% of its territory inserted in the domain of the Phanerozoic sedimentary rocks; the rest is occupied by metamorphic and igneous rocks. These rocks constitute the crystalline basement of the Sedimentary Basin of Parnaíba (Pfaltzgraff et al. 2010). The main soil classes found are Oxisols (37.44%), Ultisols (37%), and Entisols (11.21%) (IBGE 2015; Soil Survey Staff 2014) that cover over approximately 86% of the studied area. The survey area comprises a significant diversity in geology, climate, and topography. Most areas show relief with elevations below 500 m, but in the extreme southwest, mountainous areas reach up to 900 m.

Fig. 1
figure 1

Map of the state of Piauí with mesoregions (North, North-Central, Southeast, and Southwest), geological context, and sampling locations

Owing to the very extensive area, Piauí has three climatic classifications according to the classification system of Köppen and Geiger (1936). In areas with a predominance of the Cerrado biome’s typical vegetation, the climate is classified as tropical with a dry season of winter (Aw) with temperature and precipitation (mean T 27.7 °C; mean P 1060 mm). In the Caatinga biome domain, the climatic classification is semi-arid with a scarcity of rains and a significant irregularity in rain distribution (Bsh) (26.6 °C; 697–728 mm). In the extreme north, a narrow strip of coastal zone forms a complex mosaic of several habitats such as coastal dunes, rocky shores, mangrove forests, estuaries, and sandy beaches with a hot and humid tropical climate, with the dry season in winter (As) (27.4 °C; 1286–1634 mm).

Soil sampling and analysis

We collected a total of 262 geo-referenced soil samples at 0.0–0.2-m layer in areas that were relatively undisturbed by human activity (Fig. 1). An irregular grid was obtained from the survey as sampling points were chosen based on the state’s pedogeochemical variability. Samples were collected through a stainless-steel Dutch. Each sample was composed of five subsamples, collected in the form of a compass (North, South, East, and West, all of them spaced apart by 4 m in the middle of the compass).

Soil physical and chemical analyses were performed on air-dried < 2.0-mm soil according to the standard procedures (Raij et al. 2001). Soil pH was measured in a soil/water 1:2.5 ratio. Available contents of P, Na+, and K+ were extracted with Mehlich-1 and determined by photocolorimetry and flame photometry, respectively. Ca2+, Mg2+, and Al3+ were extracted by KCl 1 mol L−1 and complexometric determined. Total acidity (H + Al) was obtained by calcium acetate extraction and titration. The sol texture was determined using the Bouyoucos densimeter method. Total organic carbon (TOC) was determined by the modified Walkley-Black method.

Extraction of PTE and quality control of analysis

Extraction of Ba, Cd, Co, Cr, Cu, Fe, Mo, Ni, Pb, Sb, V, and Zn was performed using the EPA method-3051A (USEPA 2007) as suggested by National Environment Council (CONAMA 2009). Subsamples of the soils were sprayed in agate mortar and subsequently sieved through a 0.15-mm mesh stainless-steel screen. Then, 1.000 g (dry weight) of the sample was transferred to Teflon tubes, and 9 mL of HNO3 + 3 mL of HCl (Merck PA) were added. The sample was then digested in a microwave oven at 175 °C for 4 min and 30 s. The extracts were transferred to 25-mL certified flasks and filtered with a blue-band filter paper (Macherey Nagel®). The volume was made up using ultrapure water. All analyses were performed in duplicate. The quality of the analysis was checked using a certified soil sample (SRM 2709 - San Joaquin Soil). Calibration curves were prepared from the 1000 mg L−1 standard (Titrisol®, Merck). The concentrations of the elements were determined by inductively coupled plasma optical emission spectrometry (ICP-OES Perkin Elmer 7000 DV).

Data processing and statistical analysis

Initially, data variability was verified using descriptive statistics (mean, median, minimum, maximum, Skewness, and Kurtosis). Simultaneously with the statistical analyses, the underlying assumptions of ANOVA, normality of errors, and homogeneity of variances were tested for all the analyzed variables using the SigmaPlot 12.0 program. The normality of the data was checked using the Kolmogorov-Smirnov test, and when observing the non-normality of variables, Spearman’s rank correlation was chosen (non-parametric data). A correlation matrix was elaborated to understand the relationship between soil characteristics and PTE concentrations by Spearman’s product-moment correlation coefficient.

The quality reference values (QRVs) were established based on the 75th and 90th percentiles of the sample set, and the anomalous values (outliers) were removed through a box-plot analysis as recommended by CONAMA (2009). The Box-Cox transformed data were submitted to the multivariate exploratory analyses of hierarchical clustering. The hierarchical clustering analysis is an exploratory multivariate technique whose purpose is to assemble the sample units into groups, allowing homogeneity within the group and heterogeneity between them. The structure of groups of the data can be observed in a dendrogram, constructed with the similarity matrix between samples (Sneath and Sokal 1973). The similarity matrix was constructed with the Euclidean distance, and groups were bonded using the Ward method. The clustering analysis was processed in the software Statistica 7.0 (StatSoft Inc., Tulsa, OK, USA).

Results and discussion

Quality assurance of analysis

The EPA 3051a method is not intended to accomplish the total decomposition of the sample; it accesses the environmentally available PTE concentrations (Silva et al. 2014). Therefore, the National Institute of Standards and Technology (NIST) recommends that the results obtained with EPA 3051a should be compared with element recoveries based on leached values rather than total values (Biondi et al. 2011; Preston et al. 2014). Our recovery rates (Table 1) were satisfactory for all PTE and ranged from 75 to 112%, indicating the quality and reliability of the analyses.

Table 1 Recovery rates of potentially toxic elements (PTE) in the soil certificated sample SRM 2709 - San Joaquim

Soil chemical and physical characteristics

The soil samples presented a high variability in chemical and physical characteristics. The pH ranged from 3.1 to 8.6 (mean of 4.7 ± 1.1) with the predominance of the acidic condition; as a result, Al3+, H+ + Al3+, and Al saturation had median values around 0.8, 2.8, and 44%, respectively. The mean value of P in soils is 17 mg dm−3, but the median values confirm the predominance of low P contents. The concentrations of Ca2+, Mg2+, and K+ are also low, indicating the general low natural fertility of these soils (Table 2).

Table 2 Physical and chemical attributes of soil samples from Piauí state, Brazil (n = 262)

Data on particle size distribution shows that soils are mostly loamy sand soils (FAO 2006). The clay content averaged 17.1%; silt and sand mean contents were 9.9 and 73%, respectively. Soil organic carbon contents were medium to low, varying from 0.01 to 2.5 dag kg−1 (mean value of 0.9 dag kg−1) as showed in Table 2. The samples’ sandy nature is due to the sedimentary material that originates most of the studied soils. The low cation-exchange capacity (CEC) and base saturation along with high Al3+ saturation are typical of well-developed, highly weathered tropical soils (Nogueira et al. 2018; Nascimento et al. 2018; Souza et al. 2019). Such characteristics may enhance the mobility and losses of PTE in the soil by leaching and surface runoff in the event of soil contamination (Nascimento et al. 2018).

Background concentrations of PTE in soils

Our study covered a large area (over 251,000 km2) equivalent to the UK. Consequently, we observed a considerable variation in PTE concentration. The concentrations of PTE in the whole Piauí state as well as in the North, North-Central, Southeast, and Southwest regions (Fig. 1) were assessed by cluster analysis; four groups were formed: 1 - Southwest (117 samples); 2 - Southeast (56 samples); 3 - North (50 samples) and Central-North (59 samples); 4 - Piauí state (Fig. 2). Defining QRV for mesoregions can aid in diminishing the data variability and better interpret the PTE spatial distribution. Areas with high natural pedogeochemical variation are a challenge to setting pedogeochemical backgrounds for potentially toxic elements. Therefore, it is essential to understand the distribution and extent of the pedogeochemical variability when assessing the risks associated with PTE contamination to the ecosystem (Araújo et al. 2018).

Fig. 2
figure 2

Cluster analysis of mesoregions of the state of Piauí in Brazil according to soil PTE mean contents. SE, Southeast; SW, Southwest; NC, North-Central; and N, North

The mean background concentrations for PTE varied considerably among the mesoregions and between these and the whole state area (Table 3). The concentrations of PTE in the state are generally lower than reported in the international literature, except for Cr, Fe, Ni, Sb, and V; in contrast, they are higher than the mean concentrations in the Brazilian states except for Cd and Co (Table 3). The variation observed in the QRV established in the soils of Piauí concerning those determined in the different regions of Brazil reflects the diversity of the parent materials and pedogenesis processes, as well as the intrinsic characteristics of each class of soil and metal analyzed (Tume et al. 2011; Bini et al. 2011). These results corroborate the wide variation of QRV between the states of Brazil. It also highlights the importance of establishing regional QRV due to the enormous territorial extension and pedo-geological diversity of the country.

Table 3 Mean concentrations of PTE (mg kg−1, dry soil) in soils from Piauí state (n = 262) as compared with data compiled from Brazilian states and international literature

Geological influences on PTE variation in soils

Geological features are known to influence the concentration of PTE in soils (Abbaslou et al. 2013; Martin et al. 2016). The different geological settings of the Piauí state (Fig. 1) reflected in the relative highs and lows in the mean values of PTE of the different mesoregions (Table 3). For instance, the predominance of sedimentary soils in the southwest is in line with the lowest PTE concentrations observed in this mesoregion. Soils developed from sedimentary rocks generally contain lower levels of PTE than those derived from basic rocks (Paye et al. 2010; Preston et al. 2014; Althaus et al. 2018). The soils in the southwest originated from detrital sedimentary rocks such as sandstones, siltstones, shales, and limestones of the Pedra de Fogo and Piauí Formation, belonging to the Balsas Group (Pfaltzgraff et al. 2010). Soils are mostly Oxisols and Ultisols presenting sand or sandy loam texture and having a natural low adsorption capacity (Abdelwaheb et al. 2019). The southwest is under the Cerrado biome with an altitude of 600 m. This mesoregion is intensively cultivated with soybean and other grains. Hence, the monitoring of PTE in these low buffer capacity agricultural soils is important to guarantee food safety and protection of water resources from metal leaching. However, the southwest is geologically diverse in its lower part, soils derived from sandstones, siltstones, and shales of the Gurguéia river basin (The Formations Longá, Cabeça, Pimenteiras, and Poti of the Group Canindé). A strip towards the southeast mesoregion has a colluvial-eluvial sandy Cenozoic deposit (Parnaiba Province) associated with distinct geological contexts (Fig. 1). Soil classes in this part are Oxisol, Ultisol, Entisol, and Vertisol (Pfaltzgraff et al. 2010) under Cerrado and Caatinga biomes, exploited predominantly with livestock, extractive, and subsistence agriculture.

The southeast mesoregion has a higher diversity of soil parent materials than other mesoregions (Fig. 1). The southeast’s lithology seemed to govern the concentrations of PTE in soils, which were the highest in the state, except for Ba and Mo (Table 3). Two lithologies underlie the region: (i) the igneous-metamorphic basement, characterized by two main groups (migmatites, orthogneisses, and metagranites metamorphic rocks of the Archean and Paleoproterozoic ages and shales and quartzites of the Ipueirinha Group); and (ii) metasedimentary rocks and plutons such as phyllites, marbles, schists and quartzites of the Barra Bonita Formation and diorites, granodiorites, and monzonites of the Itaporanga Intrusive Suite (Pfaltzgraff et al. 2010). This mesoregion is covered by the Caatinga biome, with a semi-arid tropical climate (Bsh) and extensive livestock.

The North and North-Central mesoregions were clustered together regarding PTE concentrations (Fig. 2). This association is likely due to the similar lithology and consequent influence on PTE background concentrations and spatial distribution. The mean values of Cr, Cu, Fe, Sb, and V were higher compared with those of the other mesoregions probably owing to the basalt and diabase rocks of the Sardinha formation; mafic rocks are enriched in PTE and leave their PTE signature in soils (Paye et al. 2010; Nogueira et al. 2018; Althaus et al. 2018). However, the concentrations of Cr, Cu, Fe, Sb, and V are lower than those observed in soils developed over basalt in southern Brazil, Rio Grande do Sul state in Table 3. The North and North-Central pose the highest diversity of soil classes in the Piauí state (Alfisols, Entisols, Histosols, Inceptsols, Mollisols, and Ultisols). Such diversity is related to the various parent materials: basalt, diabase, sandstone, and conglomerates intercalated with siltite and argillite, siltstones, shales, and limestones (CPMR 2009; Pfaltzgraff et al. 2010).

Correlation between soil characteristics and PTE concentrations

The values of Spearman’s correlation coefficients were used to identify the soil characteristics that most influence the concentration and distribution of PTE in the soils. PTE were positively correlated with each other, except for Cd (Table 4), which suggests the common, lithogenic source of PTE in soils. This result was expected as we mostly sampled locations with minimum human interferences. Most PTE were inversely correlated with the sand contents and positively correlated with clay and CEC (Table 4). This result is due to the predominance of sand and loamy sand textures in the studied soils. The adsorption of PTE in these soils occurs in variable charge minerals, mainly kaolinites (Souza et al. 2019). Correlations with TOC are low or inexistent, probably due to the low contents of organic matter in the soils (Table 1).

Table 4 Spearman correlation matrix between soil attributes and concentrations of HM in the natural soils of Piauí state, Brazil (n = 262)

Soil quality reference values

The QRV were calculated based on the natural background concentrations following the recommendations of the Brazilian legislation (CONAMA 2009). We used a box-plot analysis to remove anomalous values (outliers) and then determined the QRV based on the 75th and 90th percentiles of the remaining set of samples. We proposed QRV for the three sets of samples grouped in Fig. 2: North and North-Central; Southeast; and Southwest. The natural pedogeochemical spatial variation in the vast area studied here confirms the necessity of careful evaluation of areas suspected of soil contamination. When a single QRV is defined for throughout the state, there will be an underestimation of some potentially toxic elements concentration in the southwest and overestimation in the Southeast or even North and Central-North mesoregions (Tables 3 and 5).

Table 5 Soil quality reference values (QRV) for HM (mg kg−1) in Piauí state and proposed groups (G), Brazil

The QRV established as the 75th and 90th percentiles are presented in Table 5. Considering the 75th percentile, in the southwest mesoregion, the QRV for all metals were lower than those calculated for the southeast and North/Central-North. They were also lower than those observed in other Brazilian states (Nogueira et al. 2018; Souza et al. 2015; Preston et al. 2014; Almeida et al. 2016; CPRH 2014; Nascimento et al. 2018), except for Cr in Paraíba state (11.2 mg kg−1) and Amazonas state (6.9 mg kg−1), and Pb also in Amazonas (4.4 mg kg−1). The other mesoregions had higher concentrations of Ba, Cr, Cu, Mo, Ni, Sb, V, and Zn related to parent material as previously discussed.

The natural concentrations and QRV of PTE established for the whole state of Piauí and its mesoregions seem to be governed by the diversity of pedogeochemical conditions that characterize the study area. We observed three subpopulations within the samples: soils developed from sedimentary rocks in the southwest with the lowest concentrations of PTE; soils originated from the igneous-metamorphic basement and metasedimentary rocks in the southeast mesoregion with the highest PTE concentrations; and the North and North-Central mesoregions with soils originating from mafic rocks and enriched in Cr, Cu, Fe, Sb, and V, but presenting intermediate values of the other PTE.

Conclusions

We performed the first large-scale systematic survey of natural concentrations of PTE in soils of the Piauí state, Brazil. This background concentration survey contains a wealth of information that provides the basis for the soil guideline values in the state and supports future research on the impact of anthropogenic activities in soil contamination. The mean background concentrations of PTE in the soils were generally lower than those reported in other countries and/or other Brazilian states, and followed the order: Fe > Ba > V > Cr > Cu > Pb > Zn > Ni > Pb > Co > Mo > Sb > Cd. The main factors governing the concentrations of PTE in soils are the parent material and the soil texture. The different geological settings of the Piauí state influenced the spatial distribution of PTE and divided the state into three groups presenting low, high, and intermediate PTE values. Given this pedological complexity, we proposed establishing three sets of QRV based on the cluster analysis rather than a single QRV for the whole state. This proposal is unprecedented in the country, as other states’ regulations use single QRV, and aims to avoid misinterpretation regarding the investigation of areas suspected of contamination.