Introduction

The Next Generation Sequencing (NGS) techniques developed in this century are being increasingly used in various environments for microbial ecology studies. They provide fast, efficient and low cost DNA sequencing compared with standard and traditional methods. The software tools available for assembly, elimination of chimaeras and quantitative analysis are constantly improving, allowing rapid (though memory-consuming) processing of sequences. The collection of software applications used to process a DNA data set is called the pipeline and is often developed “in house” for analysis of specific environments (see, for example, [10] and references therein). The use of this technology for the study of microbial communities directly in their natural habitats (metagenomics) has only recently spread to the field of biodeterioration [e.g., 12, 21]. Previous investigations have used traditional microbiological methods to identify organisms involved in discoloration [e.g., 8, 16] and degradation [e.g., 5] of historic and culturally important buildings and Gaylarde and Gaylarde [6] have discussed the microorganisms involved in biofilm formation on siliceous stone buildings in Latin America. For a review written prior to the introduction of NGS techniques into this area, see [19].

We have applied NGS methods, along with more traditional techniques, to study the microbial communities in biofilms on degrading stone buildings of historic and cultural importance, using siliceous stone façades of churches in the city of Rio de Janeiro, Brazil, as study sites [7, 9] and found some problems resulting from the analysis of the NGS results. This led us to carry out a more detailed comparison of the methods employed in such studies.

Materials and Methods

Sampling

Samples were obtained from the front façades of three churches in the centre of the city of Rio de Janeiro, Brazil: Nossa Senhora do Monte do Carmo (CA), situated on Praca XV de Novembro, São Francisco de Paula (SF), in the Largo de São Francisco de Paula, and Nossa Senhora da Candelária (C), located in Pio X Square. The façade blocks are composed, respectively, of granite, granite and augen gneiss. Churches C and CA are subject to higher levels of air pollution from passing traffic than SF, which is located in the middle of a leafy square. Specimens were taken randomly from obviously colonized areas at a height of 1–2 m, using the adhesive tape technique [20] and by collecting samples of contour scales composed of dark surface crust and underlying stone. Approximately 15 samples were taken from each church and were used in a variety of chemical and microbiological analyses.

Chemical and Microbiological Analyses

These were carried out as previously reported [9]. Water-soluble salts were quantified by standard chemical methods and SEM was used to detect both salt crystals and microbial cells in the samples. DNA was extracted from duplicate ground stone samples and subjected to amplification with bacterial and fungal primers, followed by NGS using the Illumina® technique. Adhesive tape and ground stone samples were also examined using the light microscope to detect microbial cells. Adhesive tape samples were not used for either SEM or DNA analyses, but were easily examined by light microscopy.

Data Analysis

DNA sequences were analysed by 3 different pipelines, involving software packages and the Greengenes (pipeline 1) and Silva (pipeline 2) databases for bacteria and the UNITE database for fungi; details can be found in [9] and [3]. Raw data files were deposited in the NCBI Sequence Read Archive (SRA) with accession number SRP076760.

Statistical Analysis

Classes and genera of bacteria and fungi, determined as Operational Taxonomic Units (OTUs), were subjected to Principal Components Analysis (PCA), with two or three components. Similarity levels between the three communities were also determined using the Jaccard Similarity Index (Sj), where:

Sj = Total number of shared taxa ÷ (number of shared taxa + number of unique taxa in Sample a + number of unique taxa in Sample b….etc.). A similarity index of unity means that the populations are the same.

Results and Discussion

Light microscopy of adhesive tape samples indicated the heterogeneous nature of the biofilms, a fact that is well known and has been pointed out as a relevant factor in the design of sampling techniques for traditional analyses of building stone [4].

Pipelines and Results for Bacterial Analyses

Both pipelines used for bacterial DNA analysis indicated that Actinobacteria were major components of the C and CA biofilms, but that Gammaproteobacteria were the principal members of the SF biofilm (Figs. 1, 2). There were differences, however, between the taxa detected by the two pipelines. Pipeline 2 detected a higher proportion of Epsilonproteobacteria; pipeline 1 of the Firmicutes class Clostridia, and of Deinococci, Cyanobacteria, Sphingobacteriia and Armatimonadetes. Different classes of Bacteroides were detected by the 2 pipelines, but the two predominant classes, Flavobacteria and Cytophagia, were detected by both. Both pipelines allowed the detection of many halophilic bacterial taxa, Salinicoccus, Oceanospirillales, Halobacteriaceae (Archaea), Salinimicrobium, Kocuria, Cellulomonas and Halococcus, while pipeline 1 only showed OTUs belonging to the halophilic genera Salinicola, Marinobacter and Halobacillus; in correlation, the chemical analyses indicated high levels of sodium chloride and calcium sulphate in the samples.

Fig. 1
figure 1

NGS analysis using Pipeline 1, with the Greengenes database, showing main bacterial classes detected in the biofilm community on the three church façades. Groups below 0.2% are not included. SF church of São Francisco de Paula; CA church of Nossa Senhora do Monte do Carmo; C church of Nossa Senhora da Candelária

Fig. 2
figure 2

NGS analysis using Pipeline 2, with the Silva database, showing main bacterial classes detected in the biofilm community on the three church façades. Groups below 0.2% are not included. SF church of São Francisco de Paula; CA church of Nossa Senhora do Monte do Carmo; C church of Nossa Senhora da Candelária

Coccoid cyanobacteria, tentatively identified as Chroococcidiopsis, were seen in light microscopy examinations of the tape strips and crushed stone samples and filamentous cyanobacteria were detected by SEM in several samples; some were seen penetrating into the stone, becoming endolithic in nature. Neogypsum-encased filamentous cyanobacteria were also seen, confirmed by EDX, indicating the mobilization and reprecipitation of salts from within the stone onto the cell surfaces, leading to degradation [9]. The proportion of cyanobacterial OTUs in the three duplicate samples was, however, low. Using Pipeline 1 and 2 respectively, the proportions were 0.015 and 0.01% for C, 0.02 and 0.025% for CA and 0.065 and 0.04% for SF, the major genus identified being the coccoid Chroococcidiopsis. It is possible that these figures might have been increased by the use of specific cyanobacterial primers in the PCR, but Brandes et al. [2] stated that many cyanobacterial morphotypes have never been subjected to molecular analysis and their sequences are thus not present in the databases. The failure of databases to detect many members of a microbial community has also been emphasized by Abad et al. [1]. The present results emphasize that NGS techniques should not be used in isolation from other microbiological analyses.

A number of as yet uncultured bacteria were detected by the NGS technology. These included members of the Acidobacteria, Actinobacteria, Bacteroidetes, Chloroflexi, Cyanobacteria, Firmicutes, Gemmatimonadetes, alpha- and gamma-proteobacteria. The Chloroflexi phylum was especially notable for its high proportion in C (12.1%, compared with 3.18% for CA and 3.76% for SF). All of these were identified by pipeline 2 as belonging to the class Thermomicrobia, which contains thermotolerant or thermophilic bacteria, some of which have been isolated in culture [14 and references therein]. Pipeline 1 also identified high levels of this class (12.03% in C, 3.28% in CA and 3.76% in SF), but in addition allowed the detection of 0.04% OTUs of the uncultured organism TK10 (class Chloroflexi) in sample C. Problems of detection may sometimes be resolved by using alternative primers [17], but the importance of the pipeline used in DNA sequencing data analysis is emphasized by the present results.

Fungal Analysis

The fungal DNA sequences, analysed only by one pipeline and the UNITE database, indicated the presence of ascomycetes and basidiomycetes, with few other groups (Fig. 3). Few fungal filaments were seen by either light microscopy or SEM and so the detection of these 2 groups by NGS suggests that the organisms were present mainly in their yeast-like form. Fungi known to be involved in stone degradation [13], the microcolonial rock-inhabiting black fungi (members of the dothidiomycetes), were detected at almost 1% OTUs. This group is difficult to culture, requiring long incubation periods on laboratory media [11], suggesting that molecular methods would have a particular advantage in their detection. However, this is also a group whose molecular biology has been relatively little studied. Insufficient DNA sequences are available in the databases [18] and, indeed, these organisms may account for a considerable proportion of the OTUs (22.8% on average) that were identified in the current NGS analyses merely as “fungi”, with further identification impossible.

Fig. 3
figure 3

NGS analysis of fungal classes detected in the biofilm community on the three church façades, using the UNITE database. Groups below 0.2% are not included. SF church of São Francisco de Paula; CA church of Nossa Senhora do Monte do Carmo; C church of Nossa Senhora da Candelária

Similarity Analyses

PCA plots, using both 2 and 3 components, placed the 6 samples into their appropriate 3 groups, although pipeline 2 produced more compact groups (Fig. 4 shows the 3-component analyses only). It appeared that the SF bacterial community was more different from C and CA than those two were from one another and this was confirmed by the Jaccard similarity indices. For C vs CA, Sj was 0.46, for C + CA vs SF, 0.21. This pattern was repeated for the fungal communities, where the Sj value for C + CA vs SF was 0.14, indicating an even greater difference.

Fig. 4
figure 4

PCA plots of data produced using a pipeline 1 and b pipeline 2 for the bacterial DNA sequence analysis

A closer examination of the bacterial and fungal communities for the various sites suggested why that on the façade of SF might differ from the others. This biofilm contained fungal genera that are associated with trees and insects, which might be expected to occur in the leafy environs of the Largo de São Francisco de Paula. These included Acremonium, Hirsutella, Rhodotorula, Colletotrichum, Sporobolomyces and Khuskia (the anamorph of Nigrospora), none of which were detected in the C and CA samples. In addition, the SF biofilm contained higher levels of cyanobacteria and fewer thermophilic bacteria such as the Actinobacteria order Thermoleophilia (0.425 and 0.96% average OTUs for C and CA, 0.06 and 0.06% for SF, using pipeline 1 and 2, respectively). Once again, this can be linked to the local environment, with vegetation providing a source of Cyanobacteria and protection from the intense sunlight and associated UV and high temperature in the sub-tropical climate of Rio de Janeiro.

Conclusions

NGS techniques are easy, rapid and relatively cheap. They can detect uncultivated, or hard-to-cultivate, microorganisms without the need for more exigent techniques such as FISH, and will doubtless lead to a vast increase in our knowledge of the microbial communities present in cases of biodeterioration. Newly developing technologies, such as micro-metabolomics, will enable us to correlate presence of microorganisms with their activities at a cellular level, the “holy grail” of biodeterioration. In these early days, however, it is important to realize the problems associated with NGS methods. Here, we have highlighted the variations in results from using different pipeline analyses and the problems involved in identifying organisms with insufficient sequences deposited in databases, as well as pointing out the necessity for the use of a variety of microbiological techniques in addition to DNA analysis. Primer size and bias is also a factor that may result in a high ratio of uncultured microorganisms. Another potential trap is the existence of extracellular DNA (e-DNA), which may persist after release from either living or dead microbial cells that may no longer be present in the biofilm [15]. Such DNA is no longer subject to intracellular homeostasis and DNA repair enzymes and maybe altered to produce a sequence that is recognized as a different organism, or as an uncultured, or unknown, organism. It is essential that other techniques be used together with NGS analysis, when the latter is employed to determine the members of a microbial community.