Introduction

The decline of the steel industry in the northeast of France at the end of the twentieth century left behind more than 6000 ha of polluted wastelands [1]. On these sites, the soils are multi-contaminated with hydrocarbons and metallic trace elements (MTEs) [2, 3]. On sites of former coking plants or slag heaps, polycyclic aromatic hydrocarbon (PAH) pollution results from the use of coal tar during coke manufacturing. PAHs are ubiquitous contaminants produced from the incomplete combustion of organic matter [4], especially during industrial activities. In settling ponds resulting from the sewage sludge storage of blast furnace gases [5], high concentrations of MTEs (Cd, Cu, Zn, Pb, Ni) are also often encountered.

In soils, the fate of PAHs and MTEs depends on various abiotic and biotic processes. Over time, PAHs adsorb onto organic matter and spread into the micropores of soil components. Thus, it is referred to as pollutant ageing, and leads to a decrease in pollutant availability [6]. MTE availability depends mostly on soil properties (pH, organic matter, redox potential, etc.) [7, 8]. PAH compounds can be degraded over time mainly through microbial processes [9] that can be limited by low PAH availability [10, 11]. MTEs cannot be biodegraded, but microorganisms can modify their speciation through direct use or modifications of soil properties [12].

Numerous studies have focussed on the short-term influence of PAHs or MTEs on microbial communities. The selective pressure exerted by PAH toxicity [13, 14] could modify microbial community composition [14, 15], reduce microbial taxonomic diversity [16], or inhibit microbial activity and the global activity of enzymes such as arylsulphatase, phosphatase, urease, and dehydrogenase [13]. Similarly, various studies have highlighted the negative effects of MTE pollution. For example, the effect of zinc addition has been studied on enzymatic activity and microbial community composition [17], as well as on the taxonomic diversity of bacterial communities [18].

In aged polluted soils, various adaptation processes enable the microbial community to cope with and even benefit from pollution. The selection of PAH degraders [15, 19] and MTE-resistant microbes [17, 20] in aged-polluted soils has been shown using acquired tolerance tests [21]. However, the long-term impact of chronic exposure to PAHs and MTEs on microbial community composition has been less addressed [22, 23]. The impact of pollutants has been mainly studied separately, although multi-contamination can have a synergistic effect [24, 25]. Studies considering aged pollution and the impact of multi-contamination on microbial taxonomic diversity (richness and evenness) are scarce [26] and therefore needed.

The carbon cycle is pivotal in soil functioning because it contributes to all biological processes, such as biomass production or respiration by organisms. It also affects other nutrient cycles through organic matter recycling and mineralisation. In a context of soil pollution, one can thus wonder how the different functions of the carbon cycle are affected, and study the microbial functions involved in key C-cycle processes. Metabolic functional diversity can be defined as the number (richness) and the evenness of the metabolic functions of the microbial community. One approach to estimating functional diversity is by comparing the degradation capacities of carbon substrates using Biolog® [27] and MicroResp™ [28] microtiter plate methods in the presence or absence of pollutants, as done with metals that decrease bacterial functional diversity [29].

Apart from this impact of pollutants, it is well known that soil parameters, such as the pH, texture, the C content, or the C/N ratio, highly contribute to shaping microbial community composition [30, 31] and functional diversity [32] in soils of various origins (forest, grassland, agricultural, etc.). In polluted soils, such as urban, industrial, traffic, mining, and military areas [3], physico-chemical parameters have also been found to largely influence the composition of the bacterial and fungal communities [22].

We therefore hypothesised that microbial diversity in multi-contaminated soils may be influenced by both soil properties and pollutant concentrations and availability, but differently at the taxonomic and functional levels. In this context, the present study aimed to (i) assess taxonomic diversity and metabolic functional diversity of microbial communities in a context of aged multi-pollution and (ii) to determine which soil factors (physico-chemical parameters, pollutants, etc.) affected these diversities. We studied microbial communities from a collection of 10 soils presenting gradients of PAH and metal pollution, and under different land uses, ranging from weakly to highly anthropised, i.e. from forest soil to industrial soil. We assessed bacterial taxonomic diversity through 16S rRNA gene tag-amplicon sequencing (Illumina MiSeq). We estimated metabolic functional diversity using the degradation functions of ecologically relevant carbon substrates through Biolog® and MicroResp™ microtiter plate methods.

Materials and Methods

Study Sites and Soil Sampling

Ten soils originating from industrial wastelands and waste sites, natural forests, or ancient gravel pit, located in the “Grand Est” region (north-east of France), were sampled in November 2015. This soil collection was chosen to cover a wide range of anthropisation situations, i.e. from no or low contamination to high contamination, and represented an anthropisation gradient. All sampling sites are within 100 km. Three soils were considered as the control soils (ctrl), i.e. two low-anthropised forest soils collected at Hémilly (He; 49°2′1″N/6°30′51″E; Moselle; Stagnic Luvisol) and Montiers-sur-Saulx (Mo; 48°31′55″N/5°16′8″E; Meuse; Calcaric Cambisol), and one anthropised but unpolluted former gravel pit soil collected at Dieulouard (Di; 48°49′44″N/6°5′2″E, Meurthe-et-Moselle; Fluvisol). Seven anthropised soils (Technosol) known to be polluted by MTEs and/or PAHs, where industrial activities stopped during 1980s, were collected from (i) former slag heaps (sh) at Homécourt (Ho; 49°12′79″N/5°59′72″E, Meurthe-et-Moselle), Terville (Te; 49°20′25″N/6°08′33″E; Moselle), Uckange (Uc; 49°18′58″N/6°9′55″E; Moselle), and Neuves-Maisons (NM; 49°12′52″N/5°59′45″E; Meurthe-et-Moselle), and (ii) former settling ponds (sp), dried since at least 30 years, at Pompey (Po; 48°46′8″N/6°8′8″E; Meurthe-et-Moselle), Mont-St-Martin (MsM; 49°32′9″N/5°46′46″E; Meurthe-et-Moselle), and Russange-Micheville (RM; 49°28′58″N/5°55′51″E; Moselle).

All sites were colonised by trees (mainly birch and/or beech) and herbaceous plants, with a very heterogeneous colonisation density from one site to another, except for the NM soil which was colonised only by some herbs. None of the soil samples were rhizosphere soils or directly under plant influence (sampling at least 2 m from a tree). As plant colonisation was sparse by location, we collected samples on bare soils. At each of the 10 sites, samples were collected from three independent sub-sites 1 m apart. After removing the litter layer, if present, one soil block per sub-site (block of 20 cm on the side and 30 cm deep) was removed using a shovel. The three sub-site soil blocks were mixed to get one composite sample per site. Back to the laboratory, the soil samples were air-dried at room temperature for 1 week, and then sieved at 2 mm. The dried and sieved soils were then stored at room temperature in the dark before chemical and microbiological analyses, and aliquots were stored at − 80 °C before freeze-drying and PAH analyses.

Soil Physico-Chemical Characteristics

The soil physico-chemical characteristics were determined from the dried sieved soils. The pH was measured (PHM210 Radiometer Analytical, equipped with a pH probe, Bioblock Scientific) in a soil suspension prepared in distilled water (1:5 w/v). The CaCO3 content was estimated from measurements of CO2 concentrations released after acid (4 M HCl) decarbonation of 1 g of dw soil, using an infrared absorbance (Binos 1004 analyser), with ʎ 2325.6 cm−1. The water retention capacity was estimated from the difference in weight between dry and water-saturated soil, which corresponded to 100%. Measurements of the soil texture (clay fraction, 0–2 μm; silt fraction, 2–63 μm; sand fraction 63–2000 μm; ISO 11277); the cation exchange capacity (CEC; determined by the Metson method, NF X 31–130); organic carbon, organic matter, and nitrogen concentrations (C, OM, and N, respectively; Dumas method, ISO 10694); the available phosphorus concentration (P; Olsen method, NF ISO 11263); total (fluoridric acid extraction) and available (calcium chloride extraction) concentrations of metals (Cr, Cd, Cu, Ni, Pb, and Zn for total and available concentrations, plus Tl, Mo, Al, Ca, Fe, K, Mg, Mn, and Na for total concentrations) were performed at the Laboratoire d’Analyse des Sols (INRA, Arras, France).

We calculated a metal pollution index (Mi) in order to compare metallic pollution of soils using only one variable. Mi was the sum of the relative proportions (RPmetal) of nine metal concentrations (Zn, Pb, Cd, Fe, Cr, Cu, Ni, Co, Tl). RPmetal in soils were calculated using the following formula:

$$ {\mathrm{RP}}_{\mathrm{metal}}=\frac{{\mathrm{MC}}_{\mathrm{soil}}\times 100}{{\mathrm{MC}}_{\mathrm{max}}} $$

where MCsoil is the total metal concentration in the soil, and MCmax is the maximum total metal concentration found among all the soils of the collection. Total instead of available metal concentrations were used because some of the available metal values were lower than the detection limit. The nine metals were chosen because they commonly resulted from the steel industry, and when for at least one soil, the metal concentration was higher than the mud compost norm (NF 44095) or the natural geochemical background of French soils [33].

PAH Extraction and Analysis

Soil samples stored at − 80 °C were freeze-dried and ground to 500 μm (Mixer Mill MM 400, Retsch). Total PAHs were extracted from 1 g dw soil in triplicate with dichloromethane, at 130 °C and 100 bars, using accelerated solvent extraction (Dionex® 200 ASE), as described in Cennerazzo et al. [34]. Available PAHs were extracted from 1 g dw soil in triplicate with 20 ml of hydroxypropyl-β-cyclodextrin (50 mM) in Teflon™ FEP Oak Ridge centrifuge tubes (Nalgene, USA) [35]. After mixing (16 h) at 24 °C and centrifugation, PAHs were extracted from the aqueous solution through liquid/liquid extraction using dichloromethane. Both solvent extracts of total and available PAHs were evaporated (nitrogen flow) and dissolved in acetonitrile for PAH analysis using a reverse-phase chromatography (UHPLC Dionex® UltiMate 3000 system) equipped with a Diode Array Detector (UV detection, 254 nm) and a ZORBAX Eclipse PAH column (2.1 × 100 mm, 1.8 μm, Agilent).

Carbohydrate, Organic Acid, and Dissolved Organic Carbon Measurements

Carbohydrates, organic acids, and dissolved organic carbon (DOC) were measured on triplicate soil aqueous extracts (1:5 w/v) using 6 g dw soil mixed with 30 ml of distilled water in Teflon™ FEP Oak Ridge centrifuge tubes (Nalgene, USA) for 2 h at 20 °C [36, 37]. After centrifugation and filtration, aqueous extracts were recovered. Seven carbohydrates (inositol, trehalose, sucrose, glucose, xylose, mannose, and fructose) were quantified by an ICS 3000 ion-exchange chromatograph equipped with a Dionex CarboPac SA10 column (Thermo Scientific). Eleven organic acids (gluconic, lactic, acetic, propionic, formic, pyruvic, succinic, maleic, oxalic, fumaric, and citric acids) were quantified using an ICS 2100 ion chromatographer equipped with a Dionex IonPac AS11-HC column (Thermo Scientific). DOC was measured on a TOC-V analyser (Shimadzu).

Quantification of Culturable Bacteria

The most probable number (MPN) was measured on 1 g dw soil (in triplicate) moistened (to 60% of their water retention capacity) and pre-incubated for 2 days (24 °C) for reactivation of microbial community. After soil aliquots were resuspended in 10 ml of NaCl (0.9%) and mixed for 1 h, 10-fold dilution series were prepared in NaCl (0.9%) and used to inoculate 96-well microtiter plates (16 wells per dilution) filled with nutrient Broth No 1 (Fluka Analytical). After incubation for 3 days (24 °C), absorbance at 600 nm was measured using spectrophotometer (Safas, Monaco) and MPN counts were calculated using Mac Grady’s tables, and expressed as MPN per gram of soil.

DNA Extraction, Real-Time Quantitative PCR, and Sequencing

Genomic DNA (gDNA) was extracted in triplicate from ca. 0.5 g of moistened soil (as described above) stored at − 20 °C, using FastDNA SPIN Kit for Soil (MP Biomedicals, France), following the manufacturer’s instructions. Concentration and purity (A260/A280 ratio) were measured using a spectrophotometer (UV-1800, Shimadzu) equipped with a TrayCell™ adapter (Hellma®). gDNA was diluted to 5 ng μl−1 for real-time quantitative PCR (qPCR) and sequencing library preparation. The abundance of fungi and bacteria was estimated by qPCR using the primer sets Fung5F/FF390R [38] and 968F/1401R [39], targeting 18S and 16S rRNA genes, respectively. The qPCR assays were performed as previously described [40, 41]. Briefly, the reaction mixture (20 μl) was composed of 10 μl iQ SYBR Green SuperMix (Bio-Rad), 0.8 μl primers (10 μM), 0.4 μl bovine serum albumin solution (3%), 0.2 μl dimethyl sulfoxide, 0.08 μl T4gp32 (MP Biomedicals, France), and 1 μl DNA (diluted gDNA or tenfold dilution series from 108 to 101 copies μl−1 of the standard plasmids). Quantification was performed using a CFX96 Real-Time PCR detection system (Bio-Rad), using 56 °C and 50 °C as annealing temperature for 16S and 18S rDNA quantification, respectively.

The V3/V4 region of bacterial 16S rRNA genes (ca. 550 bp) was amplified using primers S-D-Bact-0341-a-S-17 and S-D-Bact-0787-b-A-20 [42, 43] and following a previously described dual-index strategy [44] using PCR primers with Illumina adaptor, pad, and index sequences [37]. PCR reactions were performed on 2 μl of diluted gDNA using Phusion high-fidelity polymerase (Thermo Scientific). PCR reactions consisted of 31 cycles with touchdown annealing temperature for 18 cycles (63 to 54 °C with a decrease of 0.5 °C/cycle) and 13 cycles at 54 °C. Amplification products were checked on 1% agarose gel electrophoresis, and purified using the UltraClean-htp 96 Well PCR Clean-Up kit (Qiagen) following the manufacturer’s instructions. After Quant-iT PicoGreen ds-DNA Assay Kit (Invitrogen) quantification, an amplicon library was prepared (equimolar pool at 10 nM), purified on a QIAquick PCR purification Kit Column (Qiagen), and sent for sequencing to Genewiz platform (South Plainfield, NJ, USA) using an Illumina MiSeq V2 Kit for 2 × 250 bp paired-end sequencing. Illumina MiSeq paired-end reads have been deposited in the SRA database under BioProject accession number PRJNA450766. Sequence data were analysed following the MiSeq SOP procedure available in March 2017 and described in Kozich et al. [44], using Mothur v.1.38.0 [45]. Paired-end reads were trimmed using the following criteria: QS > 20, 404 bp < length < 454 bp, and no ambiguous bases. Chimeras detected using Uchime [46], and singletons (sequences appearing only once among all samples) were removed. Alignment of unique sequences and taxonomy was assigned using the Silva bacteria database (cutoff = 80). Sequences affiliated to archaea, eukaryota, unknown, mitochondria, and chloroplasts were removed. Sequences were clustered in operational taxonomic units (OTUs) at 97% similarity. Finally, datasets were rarefied to the lowest number of sequences per sample (34,191 reads/sample). Alpha diversity was expressed by calculating Chao1, Pielou’s evenness J’, and Shannon H′ indices [47], and beta-diversity was calculated with the Bray-Curtis index, using Mothur.

Carbon Substrate Utilisation Using Biolog® Plates and Metal Tolerance Test

Two types of Biolog® microtiter plates were used to assess the metabolic functional diversity of bacterial communities through utilisation patterns of 62 carbon substrates (Table S1): (i) EcoPlates™ containing 31 substrates of ecological relevance (five guilds, nine carboxylic acids, six amino acids, eight carbohydrates, four polymers, and four miscellaneous), and (ii) MT2 Microplates™ allowing to test 31 chosen substrates (four PAHs, six aromatic acids, eight carboxylic acids, three amino acids, six carbohydrates, three polymers, and one miscellaneous). MT2 Microplates™ were prepared following the manufacturer’s instructions (0.3 mg of carbon in each well) except for organic acids, cellulose, and lignin which were tenfold diluted because the recommended concentrations were toxic or the presence of solid particles flawed absorbance measurements (data not shown). PAHs and phenolic acids were dissolved in n-hexane and added to empty plates, and the solvent was eliminated by drying for 12 h in a sterile hood [37]. The water-soluble substrates (20 μl) were added just before inoculation, and 20 μl of sterile water was added in the wells containing PAHs and phenolic acids and in the EcoPlate™ wells.

Microbial inocula were prepared by diluting soil aliquots (1 g of moistened soil prepared as described above) in 10 ml of NaCl (0.9%) and stirring with glass beads (1.5-mm diameter) for 1 h. The supernatant was decanted for 15 min and diluted in NaCl (0.9%) differently for each soil, based on MPN data to obtain a similar microbial abundance (from 102 to 103 cultivable bacteria ml−1) for all soil. Biolog® plates were inoculated with 100 μl of the appropriate dilution, and incubated at 24 °C in a plastic bag containing wet cotton to avoid desiccation.

Zinc was used to test bacterial community tolerance to metals. Biolog® substrate utilisation was measured in the presence of zinc, added as ZnCl2 (10 mg l−1 of Zn) directly in the inoculum. This zinc concentration was chosen to obtain a zinc effect without total inhibition of bacteria (preliminary experiments not shown).

To summarise, our experimental design tested 62 carbon substrates on 10 soils in triplicate, with or without zinc addition. The absorbance at 595 nm was monitored using a spectrophotometer (SAFAS, Monaco) over 7 days at t = 0, 24, 48, 72, and 96 h. Absorbance values were corrected (Abscorr) after subtraction of the absorbance of mean blank wells at the corresponding times. Substrate utilisation was considered positive when the corrected absorbance was > 0.2. For each soil, Abscorr at 96 h was used to calculate functional alpha diversity estimators, i.e. metabolic richness (number of carbon substrates significantly utilised), Pielou’s evenness (J’; based on the relative proportions of utilisation intensity for each substrate), and Shannon (H′; based on the two previous indices) diversity indices using the vegan package in R [48]. For each soil, the average well colour development (AWCD) was calculated over time using the formula:

$$ \mathrm{AWCD}=\frac{\sum {\mathrm{bs}}_{\mathrm{corr}}}{62} $$

where Abscorr was the corrected absorbance of the substrate. Zinc tolerance was assessed for each soil by calculating a percentage of inhibition based on a comparison of areas below the AWCD curves between conditions with or without zinc. Percentages of inhibition were compared to soil total and available zinc concentrations (transformed to 1 when below the detection limit, i.e. 10 μg kg−1) through linear regressions.

Soil Metabolic Profiling Using the MicroResp™ Method

The MicroResp technique was used to measure basal respiration and substrate-induced respiration (SIR) [28]. Soils were loaded (four replicates) into the 96-deep-well plates, and the mean mass of each soil was measured.

Soils were moistened to 45% water retention capacity using sterile distilled water, and plates were pre-incubated 3 days at 24 °C in a plastic bag containing wet cotton to avoid desiccation. After this step, six carbon sources (pyruvate, succinate, citrate, L-asparagine, D-ribose, and D-mannitol), selected according to the contrasting levels of substrate utilisation in Biolog® plates among soils, were added to reach 80% of soil/water retention capacities. Substrates were added at 20 mg of C per well except for L-asparagine (3 mg) and citrate (5 mg) having lower solubility. Water was added to plates for basal respiration. Finally, deep-well plates were sealed to the CO2-trap microtiter plates and incubated in the dark at 24 °C for 4 to 9 h depending on the carbon source (Table S1) and for 8 h for basal respiration. Absorbance was measured at 570 nm using a spectrophotometer (SAFAS, Monaco) just before sealing to measure blank values to subtract to the final absorbance. Mean basal respiration (n = 8) was measured on two independent series, and six SIR were measured on four replicates for the 10 soils. Absorbance values for SIR were corrected by subtracting the absorbance mean (n = 4) of the corresponding soil basal respiration series, and transformed into a CO2 concentration using the following formula:

$$ \mathrm{ppm}\ \mathrm{of}\ \mathrm{C}02=138.72\times {\exp}^{\left(6.7974\times \mathrm{ODc}\right)} $$

where ODc is the corrected absorbance. Then, the CO2 concentration was standardised by dividing it by the soil quantity and the incubation time. Based on the MicroResp instruction manual, ppm values of CO2 were then expressed as quantity of carbon released as CO2 per gram of soil per hour. For each soil, these corrected and standardised absorbance values were used to calculate the same functional diversity estimators (richness, evenness, and Shannon diversity indices) as the ones described above for Biolog data.

Statistical Analyses

All statistical analyses were performed using RStudio v1.0.136. Significant differences among soils or among soil groups were assessed using the Kruskal-Wallis rank sum test followed by a multiple-comparison test included in the Vegan R package [48]. For principal component analysis (PCA), nine soil variables among soil physico-chemical, pollution, and texture characteristics were used, and soil groups were made with the Monte Carlo test using 1000 iterations. For redundancy analysis (RDA), explanatory variables were chosen by using the ordistep R function. Permutation tests on the RDA axes and variables were performed. Multivariate analyses were carried out on standardised data using the ade4 [49] and vegan R packages. Linear regression was assessed by fitting to a linear model using the “lm” function. The partial least square regression (PLSr) algorithm [50, 51] in regression mode was used to identify possible relationships between the soil physico-chemical characteristics and 96 bacterial OTUs (each representing at least 1% of the total). The two matrices were z-normalised. The PLSr was implemented using the mixOmics package [52, 53]. Once calibrated, the PLSr model was validated using the leave-one-out cross-validation method. R2 and the mean squared error of prediction were then used (data not shown) to select the appropriate number of principal components used to implement the model. The correlation between bacterial OTUs and soil physico-chemical parameters was displayed in a heatmap (“cim” function) resulting from the similarity matrix obtained from the PLSr model.

Results

Grouping of Soils According to Their Physico-Chemical Characteristics

The physico-chemical and microbial characteristics of the 10 soils are summarised in Tables 1 and 2. Many characteristics differed among soils and soil groups (settling pond, slag heap, control). Although one of the control forest soils (ctrl-He) had a pH of 5.4 while all other soils had a slightly basic pH (from 7.0 to 8.0), no statistical pH difference was observed among the soil groups. The slag heap soils differed by a high available PAH concentration and intermediate values of the metal pollution index, whereas the settling pond soils had a high metal pollution index with especially high Zn, Pb, Cd, and Tl concentrations and intermediate values of available PAH concentrations. These two groups also presented high total PAH concentrations and high Fe concentrations as compared to the control soils. The control soils were mainly characterised by the highest dissolved organic carbon concentration and the lowest total PAH concentration and metal pollution index. The control soils also contained higher total potassium and aluminium concentrations than the other soils, probably in part due to their proportion of clay. No difference among soil groups was observed for the C/N ratio or for concentrations of total organic carbon, organic matter, nitrogen, organic acids, carbohydrates, and available phosphorus. Our collection of soils also presented highly variable iron concentrations, ranging from 18 g kg−1 in ctrl-Di to a particularly high concentration of 452 g kg−1 in sh-Uc. Concerning microbial parameters such as MPN counts and 16S rRNA gene abundance, soil groups were not statistically different. Interestingly, the ratios of fungal to bacterial abundance (18S /16S rDNA ratios) were similar between the control and slag heap soils, and the lowest for settling pond soils, probably because the three soil groups had similar 18S rRNA gene copy numbers while the settling pond soils tended to have higher 16S rRNA gene copy numbers (p = 0.053).

Table 1 Physico-chemical characteristics of the 10 soils. Parameters were measured one time on one mean sample. For soil groups, values are means (n = 3 for ctrl and sp soils, and n = 4 for sh soil) ± standard error (SE) of the mean. Significant results of Kruskal-Wallis tests (p < 0.05), testing differences among the three soil groups, are indicated by different letters, and no letter appears when non-significant differences were found. For statistical analyses, values lower than the detection limits of the method were replaced by 1, except for available phosphorus for which the value was replaced by 0.05. The metals used for calculating the metal pollution index (Mi) are indicated by a star. Abbreviations of soil names and groups: Di: Dieulouard, Ho: Homécourt, He: Hémilly, MsM: Mont St Martin, Mo: Montiers, RM: Russange-Micheville, NM: Neuves-Maisons, Po: Pompey, Te: Terville, Uc: Uckange; ctrl: control, sh: slag heap, sp: settling pond. Abbreviations of soil characteristics: PAH: polycyclic aromatic hydrocarbons, CEC: cation exchange capacity
Table 2 Microbial, taxonomic, and functional characteristics of the 10 soils. Values are means (n = 3 or 4) ± standard error of the mean. For soil groups, values are means (n = 3 for ctrl and sp soils, and n = 4 for sh soil) ± standard error (SE) of the mean. Significant results of Kruskal-Wallis tests (p < 0.05), testing differences among the three soil groups, are indicated by different letters, and no letter appears when non-significant differences were found

We performed a PCA on the basis of soil physico-chemical, pollution, and texture characteristics (Fig. 1), which confirmed the distribution of the 10 soils in three groups based on their use. The first three components of the PCA explained 76.75% of total variance, with the first (PC1), the second (PC2), and the third (PC3) components of the PCA accounting for 35.3%, 28.5%, and 12.8%, respectively. The main contributing variables were the metal pollution index Mi (24.8% of total variance on the considered axis) and the total potassium concentration (18.9%) on the first axis, soil texture (sand 33.1%; clay 24.9%) and the pH (13.8%) on the second axis, and PAH (46.9%) and organic carbon (30.9%) concentrations on the third axis (not shown). On PC1, control soils were mainly separated from slag heap and settling pond soils by the potassium and clay fraction parameters. Interestingly, settling pond soils were separated from the control and slag heap soils by the metal pollution, organic carbon, and nitrogen parameters. On PC2, slag heap soils were separated from the control and settling pond soils by the sand fraction and pH parameters. PAH pollution was not a discriminative parameter (10.6% of the weight on PC2) and did not separate the soil groups according to their use on the PC3 axis (46.86% of the weight on PC3). To test if the soil groups were robust, we performed a Monte-Carlo test, and confirmed that our grouping was statistically significant (1000 iterations, p = 1.998 × 10−3). We also performed a second PCA without the pollution characteristics (data not shown) and confirmed the separation of the soils in three groups corresponding to their use (Monte Carlo test, 1000 iterations, p = 0.016).

Fig. 1
figure 1

Principal component analysis (PCA) and correlation circle based on the physico-chemical, pollution, and texture characteristics of the 10 soils. On the PCA: soils (Di: Dieulouard, Ho: Homécourt, He: Hémilly, MsM: Mont St Martin, Mo: Montiers, RM: Russange-Micheville, NM: Neuves-Maisons, Po: Pompey, Te: Terville, Uc: Uckange) were grouped in three types according to their industrial history (sp: settling pond; sh: slag heap; ctrl: control). On the correlation circle: N: total nitrogen; C_org: total organic carbon; P_Olsen: available phosphorus; Mi: metal pollution index; PAH: sum of 16 regulatory PAHs; K_t: total potassium; sand and clay: proportions of the sand and clay fractions, respectively

Bacterial Taxonomic Diversity

We determined bacterial taxonomic diversity by sequencing 16S rRNA genes. After read treatment and rarefaction to 34,191 reads per sample, we classified OTUs with 97% similarity taxonomically. The taxonomic alpha diversity estimators based on OTU data are shown in Table 2. Interestingly, slag heap soils harboured more diversified bacterial communities with higher Shannon’s diversity (mean value of 6.203) and Pielou’s evenness (mean value of 0.791) indices than settling pond soils. Compared to the other two soil groups, control soils had similar bacterial taxonomic diversity indices due to the low and high bacterial taxonomic diversity indices of ctrl-Mo and ctrl-Di, respectively. We investigated bacterial taxonomic diversity at the phylum level (alpha, beta, gamma, and delta classes were found for Proteobacteria). Their relative proportions in each soil are presented in Fig. 2. Based on the dissimilarity distance matrix, the soils were all highly different: the minimum Bray-Curtis dissimilarity index between two different soils was 0.814, while it ranged from only 0.254 to 0.519 for replicates of a same soil (data not shown). Phylum proportions varied among soils, with a majority of Acidobacteria, Actinobacteria, and Proteobacteria. Acidobacteria were dominant (10.8 to 31.4%), except in ctrl-Mo (7.1%) and sh-NM (4.3%) communities. Within the soil collection, Actinobacteria represented 10.1 to 25.9% of bacterial communities. Proteobacteria represented 16.2 to 35.1% of soil bacterial communities and were mainly represented by Alpha-Proteobacteria (up to 18.5% in sh-Te), except in ctrl-Mo where Beta-Proteobacteria were the dominant proteobacterial class and represented 13.2%. We can note that ctrl-He, sp-MsM, and sp-RM bacterial soil communities harboured the lowest proportions of Proteobacteria but high proportions of Verrucomicrobia (22.1% in ctrl-He) and of unclassified OTUs (16.5% and 17.8% in sp-MsM and sp-RM, respectively). Bacterial communities from soils sharing the same use had very different compositions. For example, Firmicutes represented 13.8% of the sp-MsM soil community, while they represented a maximum of 0.4% of the other settling pond soil bacterial communities. By comparing soil groups, we noted that Verrucomicrobia were present in a significantly higher proportion in control soils than in polluted soils (p = 9.415 × 10−5). On the contrary, polluted soils harboured a significantly higher proportion of Candidate division TM7 phyla than control soils (p = 0.001). We investigated the relationship between physico-chemical soil parameters and bacterial phylum proportions by searching significant linear correlations. The proportion of Actinobacteria was positively correlated to the total nitrogen concentration (Pearson, p = 6.74 × 10−4, r = 0.88) and the soil cation-exchange capacity (Pearson, p = 2.04 × 10−3, r = 0.85). The proportion of Proteobacteria was positively correlated to total and available PAH concentrations (Pearson, p = 1.59 × 10−3, r = 0.86, and p = 9.99 × 10−4, r = 0.87, respectively), mostly due to a correlation with gamma-Proteobacteria (Pearson, p = 0.05, r = 0.63, and p = 0.02, r = 0.71, respectively). Positive correlations between the proportion of Chloroflexi and total zinc and lead concentrations (Pearson, p = 2.43 × 10−3, r = 0.84, and p = 0.03, r = 0.70, respectively) were also found. The proportion of unclassified OTUs was correlated with the total zinc concentration (Pearson, p = 5.42 × 10−3, r = 0.80) and the metal pollution index (Pearson, p = 1.93 × 10−2, r = 0.72). Similarly, the proportion of Gemmatimonadetes was positively correlated with the metal pollution index (Pearson, p = 1.44 × 10−2, r = 0.74) and with various metals such as total nickel and cobalt concentrations (Pearson, p = 1.66 × 10−2, r = 0.73, and p = 1.52 × 10−2, r = 0.74, respectively).

Fig. 2
figure 2

Taxonomic composition of the bacterial communities of the 10 soil samples. Values are means (n = 3), and error-bars represent standard errors of the mean for each soil. The group named Others included OTUs affiliated to the following phyla: BD1–5, Chlamydiae, Chlorobi, Cyanobacteria, Deinococcus-Thermus, Fusobacteria, Spirochaetes, WCHB1–60, Synergistetes, and Candidate divisions BRC1, OD1, OP10, SR1, TG-1, TM6, WS3, and WS6. Statistical differences among phylum proportions in the soil groups are indicated by different letters (Kruskal-Wallis test, p < 0.05)

The analysis at the OTU level generated 32,920 OTUs, which ranged from 2159 (sp-RM) to 4434 (ctrl-Di) mean OTUs per sample. The PLSr was computed to gain insights into the soil physico-chemical parameters that drove bacterial taxonomic diversity. The heatmap in Fig. S1 shows that the distribution of correlations between bacterial diversity and soil characteristics was rather scattered, reflecting a great heterogeneity of the relationships linking bacterial diversity to soil properties. Each soil factor selectively correlated only to a small number of OTUs (10–17). In particular, as identified by the high correlation coefficients (> 0.9), the strongest correlations between soil properties and OTUs concerned the pH, the clay content, and the PAH content. Whereas the pH and clay seemed to correlate mostly to OTUs affiliated to the Verrucomicrobia (negatively) and Firmicutes (positively), respectively, the PAH content appeared to affect a more diverse range of OTUs. The soil PAH content was indeed positively and strongly correlated to the abundance of five OTUs affiliated to Bacteroidetes (Flexibacter and Flavobacterium members) but also to that of three OTUs affiliated to Chloroflexi, and four affiliated to Alpha-Proteobacteria (2 OTUs belonging to Sphingomonadaceae), Nitrospirae, and Actinobacteria (one OTU affiliated to Arthrobacter). Interestingly, these groups were mostly unaffected by all other soil parameters and were the only taxa affected by PAHs. Moreover, no OTU was negatively affected by PAHs. With respect to the other soil characteristics such as total N, available P, and the metal concentration index, we observed weaker correlations with bacterial diversity. These factors were positively correlated to one group of OTUs dominated by members of Actinobacteria. The total organic carbon and sand contents were slightly correlated to two other groups of OTUs, but with opposite effects. Organic C was positively correlated to a highly diversified taxonomic group, while the sand content was positively correlated with a group of eight Acidobacteria.

Metabolic Functional Diversity

The soil basal respiration and the substrate-induced respiration (SIR) measured for six different carbon sources (L-asparagine, citrate, succinate, pyruvate, D-mannitol, D-ribose) using MicroResp microtiter plates are shown in Fig. 3. The functional alpha diversity estimators based on MicroResp data are shown in Table 2. No statistical difference among soil groups emerged for substrate richness, probably due to the low number of carbon substrates we tested. The control and slag heap soils had similar functional Shannon indices and higher ones than the settling pond soils. Pielou’s functional evenness index was statistically different among the three soil groups, with the highest for slag heap soils and the lowest for settling pond soils. The sum of SIRs for the six different carbon sources was used as a proxy of the soil metabolic activity (Fig. 3). Most of the carbon sources were utilised by every soil, i.e. the SIR level was higher than basal respiration, except with D-mannitol that was not utilised by sh-Te, and citrate that was not utilised by ctrl-Mo, ctrl-Di, sh-Ho, and sp-Mi. The highest measured SIR was with pyruvate and the lowest with citrate. sp-Po showed the highest metabolic activity, while sh-Ho showed the lowest, mostly due to the great differences in SIR levels with the pyruvate substrate (data not shown). Kruskal-Wallis tests showed that all SIR values except with D-ribose significantly differed among the soils. Based on soil groups, Kruskal-Wallis tests revealed that the settling pond soils had a significantly higher soil metabolic activity than the slag heap and control soils (p = 1.93 × 10−3), mostly due to the high metabolic activity of sp-Po. Basal respiration and SIR with pyruvate were also the highest in the settling pond soils and the lowest in the slag heap soils (p = 1.73 × 10−7 and p = 9.54 × 10−5, respectively). Finally, a positive linear correlation (Pearson, p = 6.99 × 10−5, r = 0.94) was found between soil metabolic activity and soil basal respiration. Additionally, these last two parameters were positively correlated with the total nitrogen concentration (Pearson, p = 8.42 × 10−3; r = 0.77, and p = 1.53 × 10−2; r = 0.74, respectively).

Fig. 3
figure 3

Cumulated substrate-induced respiration (SIR) in the presence of six carbon sources, and basal soil respiration measured using the MicroResp method. Values are means (n = 4 for SIR and n = 8 for basal respiration), and error bars represent standard errors of the mean for each soil. Statistical differences among the soil groups are indicated by different letters (Kruskal-Wallis test, p < 0.05)

We assessed carbon substrate utilisation using Biolog microtiter plates to calculate soil metabolic richness and average well colour development (AWCD) (Fig. 4). The functional alpha diversity estimators based on Biolog data are shown in Table 2. The Kruskal-Wallis test showed that bacterial communities from the settling pond soils had a significantly lower metabolic richness and Shannon’s diversity indices than bacterial communities from the slag heap and control soils, while no difference among the three soil groups was observed for Pielou’s evenness index. The metabolic richness of the 10 soils ranged from 5 (sp-RM) to 31 (ctrl-Mo) utilised carbon sources, out of a total of 62 carbon sources. None of the 10 soils significantly catabolised PAHs in the conditions of the experiment, probably due to a too short incubation time. Thus, except PAHs for all soils, and polymers and miscellaneous substrates for sp-RM, at least one substrate from each substrate guild was utilised by each soil. Among the 62 carbon substrates, protocatechuic acid, L-asparagine, Tween 40, D-mannitol, succinic acid, and putrescine were the best utilised carbon substrates in the aromatic acid, amino acid, polymer, carbohydrate, carboxylic acid, and miscellaneous guilds, respectively. On the contrary, for these same carbon guilds, the less utilised substrates were syringic acid, glycine, alpha-cyclodextrin, D-sucrose, propionic acid, and pyrocatechol, respectively. Kruskal-Wallis tests showed significant differences in metabolic richness among soils, especially between ctrl-Mo and sp-RM. ctrl-Mo indeed exhibited higher total metabolic richness and a wider range of utilised carboxylic acids, carbohydrates, and polymers than sp-RM. No significant difference was observed among soils for miscellaneous, amino acid, and aromatic acid metabolic richness. Besides, as regards the soil groups, the settling pond soils showed significantly lower total metabolic richness (p = 0.007) and carboxylic acid (p = 0.017) and carbohydrate (p = 0.004) utilisation richness than the slag heap and control soils. Regarding utilisation of substrates from the polymer guild, the settling pond soils showed the lowest metabolic richness, and the control soils the highest (p = 0.03), respectively. Finally, a positive linear correlation (Pearson correlation, p = 1.182 × 10−5, r = 0.959) was found between total metabolic richness and AWCD.

Fig. 4
figure 4

Catabolic richness and average well colour development (AWCD) measured after 96 h of incubation using Biolog microtiter plates. Values are means (n = 3), and error bars represent standard errors of the mean for each soil. Sixty-two carbon sources were tested: 4 PAHs, 6 aromatic acids, 9 amino acids, 7 polymers, 14 carbohydrates, 17 carboxylic acids, and 5 miscellaneous (for details, see Supplemental Table S1 ). Statistical differences among the soil groups are indicated by different letters (Kruskal-Wallis test, p > 0.05). As none of the PAH compounds was significantly degraded, they were excluded from the statistical analysis

Zinc Tolerance

Soil microbial zinc tolerance was assessed using Biolog microtiter plates by adding ZnCl2 solution, and then percentages of inhibition due to zinc addition were calculated. Figure 5 shows the linear regression between the soil available zinc concentration and the percentage of inhibition of carbon substrate utilisation. A similar linear regression was found with the total zinc concentration (p = 3.975 × 10−5, R2 = 0.45). The settling pond soils had the highest available zinc concentration, and their metabolic activity was the least inhibited by zinc addition. The ctrl-He soil also had a high available zinc concentration (38.1 mg kg−1), due to its low pH (5.4; Table 1) leading to a higher MTE availability and a lower zinc inhibition of metabolic activity (46.2%). In comparison, ctrl-Mo and ctrl-Di had neutral pH values (7.0 and 7.3, respectively), a low available zinc concentration (< 10 and 11 μg kg−1, respectively) and a higher zinc inhibition of metabolic activity (49.3 and 54.7%, respectively).

Fig. 5
figure 5

Linear regression between the inhibition rate of carbon substrate utilisation measured using Biolog microtiter plates (n = 3) with and without zinc addition, and available zinc concentrations in the 10 soils. The blue line represents the linear model of regression (y = − 0.0006x + 60.98, where y is the inhibition rate and x is the available zinc content of the soil). The dark grey area represents the confidence interval, and the light grey area represents the prediction interval (95%). Settling pond, slag heap, and control soils are represented by red, green, and blue dots, respectively

Effects of the Soil Characteristics on Taxonomic and Metabolic Functional Diversity

We investigated the effects of soil environmental characteristics on bacterial taxonomic diversity (Fig. 6a), and MicroResp (Fig. 6b) and Biolog (Fig. 6c) metabolic functional diversity through redundancy analyses. We performed three RDAs using soil environmental characteristics as explanatory variables, and the taxonomic and functional diversity indices of the soil microbial communities (Tables 1 and 2) as response variables. The explanatory variables that best explained the variation of diversity indices were selected by backward selection to maximise the percentage of explained variability leading to at least 99.3% of total variation. The supplemental Table S2 provides p values and coefficients for axes and soil properties significance. Only the first axis (p < 0.005) in Fig. 6a, b was significant, and so were the first two axes (p < 0.005 and p < 0.02) in Fig. 6c. Among the explanatory variables (metal pollution index, PAH concentration, pH, and sand fraction), only the sand fraction had a significant effect (first axis; Fig. 6a) on bacterial taxonomic diversity (p < 0.01, F = 16.30). This relationship was confirmed by significant positive linear correlations between the proportions of soil sand fractions and Chao1 (p = 5.97 × 10−3, r = 0.80), Shannon diversity (p = 3.50 × 10−3, r = 0.82), and Pielou’s evenness (p = 0.01, r = 0.76) indices. Total organic carbon and nitrogen concentrations were the two significantly explanatory variables (p < 0.005, F = 8.71, and p < 0.01, F = 4.46, respectively) accounting for the variation of MicroResp soil metabolic diversity (Fig. 6b). The total nitrogen concentration showed negative relationships with Shannon diversity (H′) and Pielou’s evenness (J) indices, confirmed by significant Pearson linear correlations (p = 1.50 × 10−4, r = −0.92, and p = 3.08 × 10−3, r = 0.83, respectively). The metal pollution index and the total nitrogen concentration were the strongest determinants of both axes (Fig. 6c) and had a significant effect on Biolog metabolic diversity indices (p < 0.01, F = 7.77, and p < 0.05, F = 4.71, respectively). The metal pollution index showed a negative relationship with metabolic richness and Shannon diversity index (H′) confirmed by slightly significant negative linear correlations (p = 0.06, r = − 0.61 for both of them). Total nitrogen exhibited a negative relationship with Pielou’s evenness index (J), confirmed by a significant linear correlation (p = 0.02, r = − 0.73).

Fig. 6
figure 6

Redundancy analysis (RDA) of soil bacterial community diversity (a), MicroResp soil catabolic diversity (b), and Biolog catabolic diversity (c), each of them constrained by soil environmental characteristics. For each RDA, the explanatory variables that best explained the variation of diversity indices (response variables) were selected by backward selection. Explanatory and response variables are in red and black, respectively. Abbreviations of explanatory variables: N: total nitrogen; C_org: total organic carbon; Mi: metal pollution index; PAH: sum of 16 regulatory PAHs; K_t: total potassium; sand: proportion of the sandy fraction. Abbreviations of response variables: S: richness; Chao1: extrapolated richness; H: Shannon’s diversity index; J: Pielou’s evenness index; 16S: index for bacterial taxonomic diversity. MR: index for MicroResp soil catabolic diversity. BLG: index for Biolog catabolic diversity. Abbreviations for soils: Di: Dieulouard, Ho: Homécourt, He: Hémilly, MsM: Mont St Martin, Mo: Montiers, RM: Russange-Micheville, NM: Neuves-Maisons, Po: Pompey, Te: Terville, Uc: Uckange). Soils were grouped in three types according to their history (sp: settling pond; sh: slag heap; ctrl: control). The p values and F coefficients, for axes and soil properties, are provided in Supplemental Table S2

Discussion

We studied 10 different soils presenting metal and/or PAH pollution gradients and exhibiting a wide range of anthropisation levels. Although the 10 soils came from various sites with different types and levels of plant colonisation, they mostly clustered in three groups based on their physico-chemical properties and pollution levels. These three groups reflected their use, whether industrial or low anthropised. Settling pond soils were highly polluted with MTEs, with higher contents of total organic carbon and nitrogen and a siltier texture than the soils from the other groups. Slag heap soils were the sandiest and the most PAH-polluted soils, and also exhibited middle-high metal pollution—mostly MTEs for sh-Te, sh-NM, and sh-Ho—while sh-Uc contained a very high iron concentration. The control soils were the least metal- and PAH-polluted ones; they were also the most clayey soils, and had the highest dissolved organic carbon and potassium contents. Apart from the identification of well-known edaphic factors impacting the microbial diversity, we also aimed at identifying the impact of pollutants on the microbial community composition. The taxonomic and metabolic functional diversity of the microbial communities inhabiting the 10 soils was characterised. We studied bacterial taxonomic diversity based on 16S rRNA gene sequencing, and functional diversity based on measurements of the utilisation of various carbon substrates involved in different functions to estimate the impact of land use on microbial community diversity and functions. We focussed on functions related to the carbon cycle, one of the most important biogeochemical cycles for soil functioning [54, 55]. Although the taxonomic and functional diversity of soil microorganisms is extensively studied to better understand ecosystem functioning, the real link between taxon occurrence and functional richness remains poorly understood in most ecosystems [56], particularly for microbial communities. Various authors suggest that the functions related to carbon mineralisation are redundant [57, 58]. Besides, the loss of species or the modification of microbial community composition does not necessarily induce a loss of functions. In order to better understand the effect of pollution on microbial communities, it is thus important to study both taxonomic and functional diversity, using a combination of tools.

One of the objectives of this study was to identify soil parameters mostly shaping microbial community diversity in the 10 soils. Although some of the measured diversity indicators seemed constant in all soils, our data revealed that both pollutant and physico-chemical parameters strongly affected part of the taxonomic and functional structures.

Non-Impacted Microbial Parameters

We noted few variations in bacterial community composition among our soils, with all the major phyla present in each soil, and no phylum significantly dominant or under-represented in highly anthropised soils. The taxonomic diversity indices of the polluted soils were not significantly different from those of the control soils. Previous studies reported similar trends with no difference among the bacterial taxonomic diversity indices of historically mining-impacted vs. unpolluted sediments [59]. Indeed, long-term adaptation and resilience [60] of the bacterial communities, being as diverse as in control soil [22] or over a metal polluted gradient [61], were previously observed. Additionally, except for sp-RM that exhibited a very strong decrease in metabolic richness, at least one substrate from each carbon substrate guild from Biolog plates was utilised by each soil. In some cases, metabolic potentials were not affected by metal pollution as shown in three forest soils gradually polluted by Zn and Cd [62]. The differences of substrate-induced respiration observed among soils were less pronounced using the MicroResp than the Biolog method, and were essentially linked to the soil basal respiration rates as previously observed [63]. Contrarily to many studies [63,64,65], basal respiration rates and substrate-induced respiration were mostly not affected by pollution (metals and PAH) level. Additionally to the bacterial activity, MicroResp can also integrate fungal respiration. Although fungi are major actors of the carbon cycle [66], their relative abundances (ratio between 18S and 16S rDNA copy numbers) were not correlated with the respiration rates because the three soil groups had similar fungal density (18S rDNA copies g−1 dw soil). Moreover, even if fungi harbour high metal tolerance [67], no link between fungal abundances, respiration rates, and soil pollution was highlighted.

Impact of Metals on Microbial Diversity

The relative toxicity of a pollutant depends more on its available content than on its total content. The bioavailability of MTEs in these soils was relatively low since most of them had a pH value above 7. Only soil He with a pH of 5.4 had a relatively higher MTE availability than the other soils of the same group, although it was not contaminated. Except for this soil, using the relative metal pollution index or the available fraction of MTEs such as Zn showed the same tendency, that is why we mainly used the former to compare the soils.

The metal pollution index and/or the total zinc content were positively correlated with the relative proportion of unclassified OTUs and Gemmatimonadetes and Chloroflexi phyla (correlation with available zinc content have p values of 0.077, 0.088, and 0.120, respectively). Unclassified OTUs may represent extensive unexplored bacterial diversity that may have original properties allowing them to survive and grow in strongly metal-contaminated environments. Gemmatimonadetes are commonly found in heavy metal–contaminated soils [68]. A positive relationship between the relative proportion of Gemmatimonadetes or Chloroflexi and metal pollution was shown in lake sediment historically polluted with zinc and lead [69] and in the soil of an abandoned Pb-Zn mine [70]. The Chloroflexi group is known to have a high stress tolerance ability [71], which could explain its development in our metal-polluted soils. At the OTU level, a few OTUs were positively correlated to the metal pollution index, and these OTUs were also positively correlated to the P and N contents. These OTUs were affiliated to Actinobacteria members, closely related to the AKIW543 clone, the Acidimicrobineae and Aeromicrobium genera, and to Firmicutes affiliated to the Bacillus genus. Very little information is available about the two former, but few strains belonging to Aeromicrobium and Bacillus genera have already been isolated from former mining sites [72] and from heavy metal–contaminated soil [73].

The metal pollution index and/or the total and available zinc contents were negatively correlated to the richness and diversity of the substrates utilised in Biolog plates, suggesting that functional diversity (number of metabolic functions) was reduced in metal-polluted soils [74]. Similarly, when comparing soils sampled at different distances from a site historically metal-polluted with As, Cu, and Pb, Boshoff et al. [29] found lower functional diversity in the soils closest to the contamination source. It may result from the replacement of sensitive species by more resistant ones having different and less versatile metabolic properties [75] maybe due to an energy cost balance between metal stress survival mechanisms and substrate utilisation efficiency [76]. Functional diversity may indicate the capacity of the community to adapt its metabolism, relative composition, and size to various changes in environmental conditions [77]. In our study, the low metabolic richness of the bacterial community in the metal-polluted soils may have disturbed carbon recycling and soil functioning.

Additionally, using Biolog for tolerance tests, lower zinc sensitivity (or higher tolerance) of the bacterial community inhabiting high metal-contaminated soils was shown as compared to low-polluted or unpolluted soils, suggesting an adaptation of communities to metal stress, as previously observed [78]. It is well accepted that microorganisms coming from highly polluted environments are more tolerant to pollution than those coming from less polluted environments [21].

Among our soil collection, sh-Uc had the highest iron concentration and relatively low concentrations of other metals, resulting in a middle-high metal pollution index. This soil had similar taxonomic but slightly higher functional diversity indices as compared to other soils presenting a middle-high metal pollution index. Unlike other metals such as Cd or Pb, Fe is an essential element for almost all living organisms and is involved in numerous metabolic processes. Moreover, due to its extremely high total concentration in sh-Uc, iron was probably mostly present as insoluble oxides and unavailable forms. This finding suggests that although iron was present in a very high total concentration in sh-Uc, it had a lower effect on microbial diversity than other metals such as Zn, Cd, or Pb.

Impact of PAH on Microbial Diversity

We highlighted various correlations between PAH pollution and the proportions of some phyla or OTU abundance, but found no relationship with functional metabolic diversity, suggesting a limited impact of PAH on microbial community carbon cycle–related functions in our soils. Total and available PAH contents were both positively correlated to the proportions of Proteobacteria (mostly gamma-Proteobacteria) and Nitrospirae phyla, even if in the literature the Nitrospirae phylum seemed to be negatively impacted by PAH content and bioavailability [79]. Proteobacteria and gamma-Proteobacteria were previously found in greater proportions in pyrene-amended soil [80] and can be involved in PAH degradation [81]. The abundance of some OTUs was also positively correlated to the total PAH pollution level, but was not affected by all other soil parameters, suggesting an advantage conferred in the presence of PAHs. Most of them were found predominant in PAH-polluted environments, i.e. bacteria belonging to Flavobacterium [82], Flexibacter [83], Sphingomonadaceae [84], and Chloroflexi [85] were detected or isolated from PAH-contaminated soils. Firstly, PAH pollution could have contributed to shape bacterial communities by favouring species able to metabolise PAHs. Secondly, in our soils, due to ageing, available PAH concentrations were low as compared to total concentrations and did not negatively affect taxonomic or functional diversity at the community level, indicating no major toxicity impact of PAHs on bacterial diversity.

Impact of Physico-Chemical Parameters on Microbial Diversity

Apart from the soil pollutants, microbial communities can be affected by physico-chemical parameters. It is well known that the pH is one of the major parameters that shapes microbial community composition [30, 86]. Except one control soil (ctrl-He), our soils were chosen with a similar slightly alkaline pH to avoid a too high influence of this parameter that could have hidden the structuring impact of other parameters. Thus, the pH has little impact on taxonomic and functional diversity, although the proportions of few OTUs mostly affiliated to Verrucomicrobial DA101 members and commonly found as dominant bacteria in soils [87] increased when the pH decreased.

The soil texture, and especially the sand fraction proportion, was the strongest determinant of bacterial taxonomic diversity in our soils, but no link with metabolic functional diversity was highlighted. The relative proportion of the soil sandy fraction was indeed positively correlated with taxonomic richness, evenness, and Shannon diversity indices, and the relative proportion of alpha-Proteobacteria [88], whereas the relative proportions of silt and clay negatively influenced the richness and evenness indices. Fine-textured soil can protect microbes from predation by protozoans, reduce variations in water availability, and maintain higher microbial abundance [89]. Conversely, coarse-textured soils allow for a better aeration and water circulation providing various microhabitats and supporting growth of diverse microorganisms adapted to limited nutrient conditions or able to use a wider range of substrates [88]. Similarly to our results, Chau et al. [90] found a positive linear relationship between sand content and taxonomic richness, while Kandeler et al. [91] found that the small-size fraction promoted bacterial diversity. These contradictory results could be related to soil fractionation techniques [90], as we based textures on particle granulometry, not on mineralogy (i.e. the sandy fraction could contain sand but also other particles). Finally, the texture may affect pollutant availability, which could indirectly affect microbial diversity.

While the soil total organic carbon content was not the main parameter explaining taxonomic diversity, it was negatively correlated to the functional diversity indices, calculated from MicroResp and Biolog data, to the utilisation level of six substrates (phthalic, ferulic, succinic, and aconitic acids, Tween 40, and phenylethylamine), and to the number of polymer, carboxylic acid, and miscellaneous substrates utilised in Biolog microtiter plates. We noticed that in the redundancy analysis, the total organic carbon content co-varied with the metal pollution index, as previously shown by Valsecchi et al. [92] on a collection of 16 soils. As explained above, metal pollution may have reduced the functional capacities of the microbial community. The resulting lower microbial functional diversity and activity may have reduced organic carbon recycling in the soils, resulting in an accumulation of total organic carbon and organic matter. Similarly, rates of organic matter (i.e. litter or plant polymer such as cellulose) degradation were lower in metal-contaminated than unpolluted soils [93, 94]. This finding could be explained by a decrease in enzymatic activities involved in the C cycle in metal-polluted soils [91]. Moreover, Lucisine et al. [95] also explained that accumulation of organic matter at the surface of brownfield soil could be partly due to the lack of endogenous metal-sensitive earthworm species limiting organic matter incorporation in the deeper soil layers.

We observed that the total nitrogen content was positively correlated with soil basal respiration, with ribose-, pyruvate-, and asparagine-induced respiration, and with 16S rRNA gene copy numbers. These relationships suggest that in some of our soils, nitrogen present in very low concentrations was one of the limiting factors of microbial growth and activity. N limitation can indeed occur in some cases depending on the quality of the available carbon substrate [96, 97]. Additionally, the relative proportions of the Actinobacteria phylum and of a few OTUs affiliated to Actinobacteria and Firmicutes were positively correlated to the total nitrogen content. As discussed above, these OTUs were also positively correlated to the metal pollution index indicating a simultaneous influence of various soil parameters on microbial community composition. As explained for total organic carbon, metal pollution partly inhibits soil enzyme activities and organic matter decomposition [98], so that the total nitrogen content potentially increases in the most polluted soils. Moreover, several strains of Bacilli isolated from soils are known as nitrogen-fixing bacteria [99] and could contribute to an increase in the nitrogen content. On the contrary, Bacteroidetes phyla and a few OTUs affiliated to Acidobacteria were negatively correlated with the total nitrogen content. Acidobacteria are often classified as oligotrophs [100], which are favoured in nutrient-poor environments. Moreover, a negative relationship between the total nitrogen content and the functional evenness index was mostly explained by a higher mineralising activity in a few substrates. Nitrogen may have impacted bacterial community composition, favouring the development of certain species with higher metabolic activity for certain specific substrates.

Conclusions

We studied a collection of 10 soils presenting different anthropisation levels. Based on their physico-chemical characteristics and on metal and PAH pollution levels, these soils were gathered in three groups corresponding to their use, i.e. either industrial (slag heap and settling pond) or low anthropised (control soil). Although taxonomic diversity largely varied among soils, the occurrence of a few bacterial phyla and OTUs was influenced by metal and PAH pollution as well as by the pH, texture, and the total N content. Furthermore, metal pollution, particularly zinc concentrations, seems to have selected more metal-tolerant communities but with reduced metabolic functional diversity and activity. Apart from metal pollution, the N content appeared to be one of the main factors limiting microbial activity in our soils. These reduced activity and metabolic functional diversity may have contributed to an accumulation of organic matter in the most metal-polluted soils. Considering all abiotic and biotic soil characteristics, especially metabolic functional diversity, we are now able to rank our soils based on their impact on microbial communities (i.e. control soils have the lowest effect, while settling pond soils have the highest). Although it would be very interesting to study a wider range of soil to draw more generalizable conclusions, the study of 10 soils already allowed us to identify the drivers of microbial diversity at both the taxonomic and functional level.