Introduction

The tiny microbes that dominate the planet are ubiquitously distributed all over the biosphere, even tolerating a wide range of physicochemical stresses in extreme habitats. Their abundance and widespread presence in almost all ecosystems make them the most successful organisms on the planet. It is well known that microbes are able to control and sustain all possible life forms on earth by essentially maintaining all the biogeochemical cycles. The potential of microbes to command over metabolic and catabolic processes as well as the ability to adapt to changing environmental conditions or survive in hostile environments are universally accepted (Kyrpides et al. 2014). Studies have shown that mining industries are among the major sources of pollution as they produce large amounts of contaminants, which not only disperse into the surrounding environment through aerial deposition or water erosion but also cause pollution of the areas away from their original site (Maynaud et al. 2013).

Mining activities have played a major role in both economic development and environmental pollution in South Africa (Adler et al. 2007). Despite regulations related to mine-water management in South Africa, the continuing discharge of untreated mine water with varying levels of hazardousness causes major environmental problems (Oelofse 2009). Contamination of South African water sources with heavy metals is a widespread problem, especially in the vicinity of mines. Heavy metal contamination has serious consequences mainly for human and animal health. Heavy metals are dangerous as they tend to bioaccumulate in the tissues of receiving organisms through food chain, resulting in toxicity at higher concentration. South Africa is a semi-arid country, and the management of its precious water resources is essential to counteract pollution caused by mining industries. It is important to note that these unique environments harbour metabolically active microorganisms that are well adapted to the multiple environmental stresses they encounter on a daily basis. Analysis of microbial diversity in the mine waters may allow the quantification of the possible effects of heavy metals on different taxonomic groups and potential effects on the biological properties of the receiving ecosystem. It is now generally accepted that more than 95 % of microbes living in the environment are yet to be discovered and explored in physiological terms (Hennecke et al. 2013). Mine water is an example of an extreme environment that contains a large number of very diverse and specific bacteria (Kamika and Momba 2013). Complex interactions between diverse bacterial populations are known to exist; diverse bacteria coexist to cope with the extreme conditions prevailing in the mine waters. Characterizing bacterial communities in the mine waters containing heavy metals would provide valuable data for assessing the various metal-tolerant microbial communities existing in these extreme environments.

The standard microbiological culturing approaches are effective techniques for assessing the microbial diversity, but the majority of the bacteria cannot be cultured in the laboratory (Hugenholtz et al. 1998; Whitman et al. 1998). The accurate assessment of the microbial communities in any environmental medium could be achieved using molecular biological techniques, and findings significantly promote our understanding of the microbial communities (Liu et al. 1997; Muyzer et al. 1993; Quince et al. 2009). 16S ribosomal RNA (rRNA) gene is considered as molecular signature universally present among all bacteria and is being widely used in next-generation sequencing approach to decipher the bacterial diversity of several habitats, e.g. bacteria associated with Lucilia spp. (Singh et al. 2014), tobacco waste extract (Liu et al. 2014a, b14), styrene-degrading biofilters (Portune et al. 2014), activated sludge (Zhao et al. 2014), high-temperature biogas reactors (Röske et al. 2014), phyllosphere of rice (Ren et al. 2014), river receiving effluents from swine farm and farmhouse restaurant (Lu and Lu 2014), endophytes of sugar beet (Shi et al. 2014), petroleum reservoir (Wang et al. 2014), intestinal environment of rats (An et al. 2014), mine tailings (Liu et al. 2014a, b), heavy metals polluted soil (Gołębiewski et al. 2014), soil horizons and drain sediments of coastal acid sulfate landscape (Stroud et al. 2014). The first 16S rRNA gene-based microbial analysis of mine-water systems was done in the mid-1990s (Goebel and Stackebrandt 1994). Subsequently, some of the mine-water systems from different geographical locations were assayed using molecular techniques, e.g. Iron Mountain in California, USA (Bond et al. 2000; Druschel et al. 2004) and the Rio Tinto basin in south-western Spain (Gonzalez-Toril et al. 2003; Garcia-Moyano et al. 2007). These studies, however, were based on a single mining environment, and only limited sequencing depths were achieved using a standard clone library approach. The high-throughput sequencing can be easily implemented at low cost by using the next-generation sequencing platforms (Ye and Zhang 2013). Recently, the bacterial communities of mine waters have been assessed using the high-throughput approach (Kamika and Momba 2013; Kuang et al. 2013). The Illumina MiSeq personal sequencer is one of the popular high-throughput sequencing systems which can generate up to 4.6 GB of data per run. This approach has been applied to decipher the microbial community in distinct samples, such as biofilm in urban drinking water distribution system (Wu et al. 2014), upflow anaerobic sludge blanket reactor (Zhang et al. 2014), polychlorinated biphenyl contaminated environments (Su et al. 2014), river water (Staley et al. 2013), clinical specimen (Reuter et al. 2013; Loman et al. 2013), beer and wine (Bokulich and Mills 2013), the gut microbiome (Markle et al. 2013), ancient DNA (Gansauge and Meyer 2013), soil, host-associated environments (Caporaso et al. 2012) and the human microbiome (Kuczynski et al. 2012). One of the exciting possibilities provided by this technology is the accurate estimation of the diversity and structure of microbial communities, which may lead to a better understanding of their ecology in the mine water.

Despite an increase in mining-related studies over the past decades, our current knowledge regarding the prevalence and complexity of bacterial communities is still incomplete, partly due to the lack of reproducible screening methods. The present study compares two geographically isolated mine waters using high-throughput sequencing of the 16S rRNA gene using a 500-bp amplicon generated from the universal bacterial primers 27 F and 518R (Lane 1991). Assessment of the microbial communities in mine waters will not only provide information about their ecophysiology and the ecophysiological adaptations required to cope with extreme environments; knowledge of these complex communities will also assist in developing and providing sustainable and reliable remediation solutions (Mohapatra et al. 2011). The objectives of this research were to classify the bacteria present in the mine waters from South Africa and to compare the bacterial communities in the samples across geographic locations. To date, there has been no report of using high-throughput/next-generation sequencing to compare the complexity of the bacterial communities present in the different mine waters of South Africa. To our knowledge, this study describes the first attempt to explore and compare the bacterial diversity and community structure in selected South African vanadium and gold mine waters of South Africa using next next-generation sequencing and to provide useful information on the bacteria inhabiting the mine-water ecosystems. It was hypothesized that bacterial populations would be distinct based on geographic location and physicochemical parameters.

Materials and methods

Chemicals sourced

Bacterial DNA isolation kits were supplied by Epigenetics, USA, while the DreamTaq™ Green PCR Master Mix for PCR and DNA markers were purchased from Fermentas (Thermo Fisher Scientific, Pittsburgh PA, USA). TAE buffer and molecular grade agarose were sourced from Sigma-Aldrich (Steinheim, Germany).

Mine-water sample collection

Mine-water samples from vanadium and gold mines were collected in 1-L sterile plastic sampling bottles in triplicate from the evaporation dam of Vanchem Vanadium Products Ltd., Ferrobank, Mpumalanga and Mogale Gold Mine, Randfontein, Gauteng, South Africa, respectively. No specific permit was needed for the collection of the wastewater samples in the described sample area, and this study did not involve endangered or protected species. However, the responsible officers, namely the Process Development Manager at the chemical plant of Vanchem Vanadium Products Ltd. and the Water Plant Foreman at Mogale Gold Mine, were informed to assist the research team with the collection of mine-water samples. Samples were kept in a cooler box (4 °C) while being transported to the laboratory for further analysis.

Physicochemical analysis

For physicochemical analyses, samples collected were thoroughly mixed prior to filtration and No. 1 filter papers (Whatman) were used. The profile of the filtered samples was determined in terms of the chemical oxygen demand (COD), dissolved oxygen (DO), pH, electrical conductivity (EC) and heavy metals. The COD concentration was measured using closed reflux methods as described in Standard Methods (APHA 2001), while the pH and the DO were analyzed using a pH probe (Model: PHC101, HACH) and DO probe (Model: LDO, HACH), respectively. Heavy metals were determined using the inductively couple plasma optical emission spectrometer (ICP-OES) (Spectro Arcos, Kleve Germany). The limits of detection (LOD) varied between 10 and 60 mg/L depending on the elements.

Metagenomic DNA extraction

For each replicate, a 500-mL sample was filtered using filter paper with a pore size of 0.22 μm (Whatman filter No. 1) and the residues on the filter papers containing the microbes were washed with 1X PBS. The metagenomic DNA was extracted independently for all the samples with the ZR Fungal/Bacterial DNA MiniPrep™ Kit (Zymo Research Corporation), as per manufacturer’s instruction. Finally, the DNA was eluted in 50 μL of MilliQ water. The integration of the metagenomic DNA was assessed on the 0.8 % (w/v) agarose gel, and the DNA was quantified using a NanoDrop spectrophotometer (Nanodrop 2000, Thermo Scientific, Japan).

PCR amplification and sequencing

The PCR reaction was performed in triplicate on all the extracted DNA samples using the universal primer 27F and 518R. These primers have been reported to amplify approximately 500 bp of the 16S rRNA gene sequence targeting variable regions V1 and V3 of the 16S rRNA gene (Lane 1991). Each PCR reaction contained 25 μL of 2X DreamTaq™ Green Master Mix (DNA polymerase, dNTPs and 4 mM MgCl2), 22 μL of nuclease-free water, 1 μL of forward primer (0.2 μM) and 1 μL of reverse primer (0.2 μM), template DNA 50–100 ng up to the final reaction volume of 50 μL. The following cycling parameters were used: initial denaturation step at 94 °C for 5 min, followed by 30 cycles of 94 °C for 1 min, 55 °C for 30 s and 72 °C for 1 min 30 s, with a final 10-min extension at 72 °C, followed by final incubation at 4 °C. The PCR products were loaded onto 1.5 % (w/v) agarose gel and then visualized under a UV transilluminator (InGenius Bio Imaging System, Syngene, Cambridge, UK). The PCR amplicons were pooled together for the respective samples and purified using the QIAquick PCR purification kit (Qiagen, Germany). The DNA concentrations were quantified by using a NanoDrop spectrophotometer. The purified amplicons were pooled at equal concentrations for both different samples. The pooled samples were sequenced at Inqaba Biotechnology Industries, South Africa on the Illumina MiSeq platform.

Sequence analysis

The raw sequences were processed in MG-RAST pipeline (http://metagenomics.anl.gov/) for quality control (QC) assessment which removes artificial replicate sequences produced by sequencing artefacts and low quality sequences (Meyer et al. 2008). Although 16S rRNA-specific primers were used, a few non-ribosomal sequences were obtained. All QC passed sequences were pre-screened using qiime-uclust for at least 70 % identity to ribosomal sequences from the following RNA databases—Greengenes, LSU, SSU and RDP—and the non-ribosomal sequences were removed from the each data set. The ribosomal sequences were screened for chimeric sequences using UCHIME (Edgar et al. 2011). All the chimeric sequences were removed from further analysis. The entire non-chimeric rRNA sequences were analyzed using Ribosomal Database Project (RDP) Classifier tool of RDP pyrosequencing pipeline (Wang et al. 2007) with a confidence threshold of 80 %. These sequences were uploaded on the RDP Align tool and aligned in the same pipeline, and the cluster files were generated for each sample with the RDP Complete Linkage Clustering tool. These cluster files were then used as input to generate rarefaction curves by using the RDP Rarefaction tool. Genetic distance was calculated, and sequences were clustered into operational taxonomic units (OTUs) using 0.03 and 0.20 dissimilarity cut-off levels. The diversity index Shannon and richness estimator Chao1 were calculated for each sample at both distances. The relative abundance (%) of individual taxa within each community was calculated by comparing the number of sequences assigned to a specific taxon against the number of total sequences obtained for that sample. Data was made publicly available at the NCBI Sequence Read Archive under the accession number SRR1584394.

Results

Physicochemical profile of mine waters

The gold mine waters were more acidic and saline than the vanadium mine waters, but the DO and COD values were found to be comparatively lower for gold mine-water samples (Table 1). The following heavy metals were detected in the respective gold mine-water samples: V, Ni, Cd, As, Cr, Hg and Co. However, in vanadium mine-water samples, Cr and Hg could not be detected among these heavy metals.

Table 1 Physicochemical profile of mine-water samples

Sequencing

The DNA preparations were of high quality, as their A260/280 ratios were approximately 1.8 for all samples, indicating high purity, and they performed well as PCR templates. In total, 15,802 reads were obtained (Table 2); out of which, 14,557 (92.13 %) met the quality criteria. These sequences were searched for identity with rRNA sequences, and 14,492 sequences showed ribosomal origin. The 2618 chimeric sequences were removed, and 11,874 (75.14 % of the total number) high-quality non-chimeric rRNA reads were included in the downstream analysis. The mean read length ranged between 290 and 268 bp for vanadium and gold mine-water samples, respectively. All the filtered reads were successfully classified as belonging to the domain bacteria.

Table 2 Sequencing statistics

Community species richness and diversity indices

The communities were moderately sampled at the genetic distance level of 0.03, indicating that many more reads would be required to capture all the diversity, but at the distance of 0.2, it can be observed that major phyla were recovered (Fig. 1). More OTUs were observed for the metagenome of vanadium mine-water samples than in the gold mine-water samples (Table 3). Both the observed and estimated richness (Chao1) were higher for the vanadium mine-water samples. The parametric diversity index Shannon also indicated higher bacterial diversity in vanadium mine-water samples.

Fig. 1
figure 1

Rarefaction curves using 0.03 and 0.20 dissimilarity cut-off levels: a vanadium mine water and b gold mine water

Table 3 Diversity indices

Bacterial community structure

In total, 8 bacterial phyla were observed apart from unclassified bacterial sequences. Proteobacteria, Firmicutes, Bacteroidetes, Actinobacteria, Candidate phyla OD1 and Cyanobacterial sequences were observed in both types of mine-water samples. Representatives of Verrucomicrobia were only found in vanadium water while Deinococcus-Thermus only in gold mine water (Fig. 2). Proteobacteria was the most abundant phylum in gold mine water (75.59 %), whereas the majority of sequences from vanadium mine-water samples could not be classified even up to the phylum level and unclassified bacterial sequences dominated the vanadium mine-water sample (58.37 %) (Fig. 2, Table S1). Notable differences in the proportions of these phyla (i.e. composition) were observed between both mine-water samples assayed (Fig. 2). For instance, in gold mine water, the unclassified bacteria were the second most dominant category (17.64 %) followed by Actinobacteria (3.37 %), Bacteroidetes (1.84 %), Firmicutes (1.43 %), OD1 (0.04 %) and Deinococcus-Thermus (0.02 %). However, unclassified bacteria were found to be dominant in vanadium mine water, while the Proteobacteria (23.28 %) were the second most dominant phyla followed by Bacteroidetes (10.38 %), OD1 (5.58 %), Cyanobacteria (1.84 %), Firmicutes (0.15 %) and Verrucomicrobia (0.01 %). The most abundant class was Alphaproteobacteria accounting for 35.51 % in gold mine water and 11.75 % in vanadium mine water. Gammaproteobacteria and Betaproteobacteria were found in both types of mine-water samples, whereas Deltaproteobacteria was found exclusively in gold mine water. Classification of the reads into lower taxonomic levels revealed extremely diverse bacterial communities in both types of mine-water samples, with up to 20 orders, 39 families and 48 genera detected in the vanadium mine-water samples, and 23 orders, 45 families and 57 genera detected in the gold mine-water samples (Table S1).

Fig. 2
figure 2

Taxonomic distribution of different bacterial phylogenetic groups in vanadium and gold mine water. Analysis of 16S rRNA gene sequences was done in comparison with the RDP II database. The percentages of the phylogenetically classified sequences are plotted on the y-axis. VMW vanadium mine water, GMW gold mine water

The rank abundance curve represented taxonomic richness and evenness showing the abundance of previously unclassified sequences (Fig. 3, Table S2). In the vanadium mine water, the unclassified members of bacteria were most abundant followed by unclassified Flavobacteriaceae, genus OD1_genera_incertae_sedis, Marinobacter sp. and Thalassospira sp. and these groups accounted for 77.71 % of total sequences. The five most abundant categories in gold mine waters were Brevundimonas sp., Pseudomonas sp., unclassified bacteria, unclassified Proteobacteria and unclassified Gammaproteobacteria, which covered 79.48 % of total sequences. In total, 82 % of sequences in vanadium mine water could not be classified up to genus level. However, in case of gold mine water, 42 % of sequences could not be assigned to any genus (Table S2). The most dominant families among classified sequences in vanadium mine water were Alteromonadaceae (Gammaproteobacteria) and Rhodospirillaceae (Alphaproteobacteria) whereas Caulobacteraceae (Alphaproteobacteria) and Pseudomonadaceae (Gammaproteobacteria) were most abundant in gold mine-water samples. The analysis based on the occurrence and relative abundance of different families suggested a strong dissimilarity among the bacterial communities in both types of mine-water samples (Fig. 3, Table S2). It was observed that many different bacterial communities resided in both mine waters with least community overlap between these two samples. Only 25 sequences were found to be common to both mine waters. These common sequences belonged to the categories unclassified bacteria (11), Pseudomonas sp. (4), Brevundimonas sp. (3), unclassified Gammaproteobacteria (2), OD1_genera_incertae_sedis (2), unclassified Flavobacteriaceae (1), unclassified Rhodobacteraceae (1) and unclassified Proteobacteria (1).

Fig. 3
figure 3

Rank abundance plots: a vanadium mine water and b gold mine water. The plots show the taxonomic abundances ordered from the most abundant to least abundant. The y-axis plots the abundances of annotations in each taxonomic group on a log scale

Discussion

In this study, the bacterial community structure and species richness in the metagenome of two mine waters located in the Gauteng and Mpumalanga Provinces of the Republic of South Africa were analyzed. The physicochemical analysis revealed that both types of mine-water samples were more acidic than the permissible minimum limit of pH 5.5 set by the South African National Water Act, No. 36 of 1998. The electrical conductivities of the mine waters were found to be higher than the maximum permissible limit of 250 mS/cm for effluent discharged into the receiving water bodies. If this water is discharged directly into water bodies, it might pose a major risk factor to existing community structure and diversity of microbial populations (Kuang et al. 2013). It has been reported that the low pH in mine waters increases the solubility of many metals, and coupled with low availability of organic carbon, it presents extremely challenging and hostile environmental growth conditions (Dopson et al. 2003; Slonczewski et al. 2009). The heavy metals such as lead, cadmium, mercury, silver and chromium have been reported to be toxic even at low concentrations and have no known beneficial effects on bacterial cells (Nies 2004). Microbial community composition is highly correlated to physicochemical parameters such as pH and metal ion concentrations, as described by previous investigators (Johnson and Hallberg 2003; Imarla et al. 2006).

The rarefaction analysis suggests that microbial diversity could be even higher than observed and similar results were reported in other extreme ecosystems, e.g. hydrothermal chimneys (Brazelton et al. 2010), acidic hot spring (Bohorquez et al. 2012) and acid mine drainage (Kuang et al. 2013). The diversity indices (Table 3) indicated higher bacterial diversity in vanadium mine-water samples, and this could be due to the fact that vanadium mine waters were less acidic and saline than the gold mine waters.

Microbial survival in mine waters depends on intrinsic physiological and genetic adaptation. Various types of resistance mechanisms are used by microbes in order to cope with heavy metals (Nies 2004). The present study is the first report to present a comprehensive comparative sequence analysis of the bacterial diversity from different mine waters of South Africa by high-throughput sequencing method. The member of the phylum Proteobacteria are frequently observed in mines as the gold mine water is dominated by Proteobacteria, and in vanadium mine water, they are the second most dominant after unclassified bacterial sequences. The abundance of Proteobacteria was also found in a Chinese copper mine (He et al. 2007) as well as in deep mines in South Africa (Raji et al. 2008). The dominance of Proteobacteria was also reported in a previous study on the microbial diversity of the effluents of the same vanadium mines (Kamika and Momba 2013) as well as on acid mine drainage sites across Southeast China (Kuang et al. 2013). Nevertheless, in this study, we found that unclassified bacterial sequences were more abundant than the Proteobacteria in this vanadium mine water which supports the finding that there is a high abundance of unculturable microbes in any environment (Aslam et al. 2010). Compared with highly diverse Proteobacteria communities, the other communities in both the mine-water samples were less complex and only Bacteroidetes accounted for more than 10 % of the reads in vanadium mine water (Table S1). It has been stated that most of the bacteria inhabiting the metal-contaminated water systems are able to develop various types of resistance mechanisms to adapt to and resist metal toxicity; as a result, microbial diversity remains high (Gough and Stahl 2011; Berg et al. 2012). In the vanadium mine water, the occurrence of Marinobacter sp. supported this fact as the Marinobacter sp. isolated from vanadium mine effluents showed high tolerance to vanadium ions (Kamika and Momba 2013). It should be mentioned that the use of culture-independent methodologies to assess the microbial diversity of any gold mine water in South Africa has never been reported previously.

The difference in microbial communities between these mine waters (Fig. 3, Table S2) can probably be attributed to the effects of the particular physicochemical parameters of the mine waters. These data are indicative of substantial heterogeneity in microbial community composition and provide valuable information about the impacts of various stresses on the biological integrity of ecosystems. The mine waters harbour indigenous microorganisms that are physiologically adapted to tolerate the prevailing extreme conditions and are able to maintain their viability.

The present study reports the comparative analysis of geographically isolated mine water microbes using a high-throughput sequencing approach to elucidate the diversity and community structure of the bacteria that are physiologically adapted to tolerate high concentrations of heavy metals. In summary, mine-water samples were found to harbour unique bacterial communities that are well adapted to this extreme environment. This study provides an inventory of the metal-tolerant bacteria which will be helpful in the determination of culturing conditions if the intention is to isolate and identify them by culture techniques in laboratory settings to determine specific metabolic capabilities. Future investigations related to the functional analysis of the microbial community in mine waters based on metatranscriptomics will likely reveal a more pronounced understanding of their ecophysiological role.