Introduction

The biological process of composting involves the complete or partial degradation of organic materials by a consortium of microorganisms, the composition of which changes as the composting process progresses. Relatively little is known about the microorganisms involved and their exact activities in the different phases of the composting process. A typical composting procedure comprises a series of different temperature stages, which are known to be associated with specific populations of bacteria and fungi [7, 14]. Continual change in the environmental conditions in composts (temperature, pH, aeration, moisture, and availability of substrates) results in different stages of exponential growth and stationary phases for different organisms. Knowledge on the diversity and dynamics of the microbial communities of compost is increasing rapidly through the use of molecular biology tools such as community fingerprinting by amplified rDNA restriction analysis, denaturing gradient gel electrophoresis (DGGE), and DNA hybridization techniques [1, 22, 40, 47].

Nucleic acid microarrays offer the possibility to analyze a specifically selected array of microorganisms, concerning their presence or absence in a particular environmental sample, in a single experiment [9]. Microarray technology has been applied successfully to study microbial communities in several habitats such as municipal wastewater [29], landfill methanotroph communities [43], and trinitrotoluene-contaminated soils [17]. Clone library studies and metagenome approaches like pyrosequencing yield information beyond the cultivated part of the community; however, these techniques involve much work and are expensive. In an accompanying study with composts [13], we were able to demonstrate that the COMPOCHIP microarray supported the results of cloning.

A microarray specific for compost microorganisms would allow for the quick detection of many different microorganisms in a single test and thus for the fine-tuning of production processes. Specific qualities of the end-product could thus be obtained, such as the absence of pathogens, the presence of beneficial microorganisms, or plant disease suppressiveness. An improperly managed composting process can allow the survival, proliferation, and spread of animal, human, and plant pathogens [6, 16]. The detection of pathogens using the microarray could alert compost producers to the presence of these organisms, although confirmation by traditional methods would also be required. Plant disease suppressiveness is among the most important features of high quality composts, and composts can help in the control of several important soil-borne pathogens of crops [20, 26]. In most cases, these effects are attributed to the presence of beneficial microbes in the compost.

A microarray would also allow the differentiation between compost samples according to their bacterial communities. In previous publications [18, 19], we described the development and optimization of a compost microarray that included 65 probes targeting plant, animal, and human pathogens, as well as microorganisms that had been previously reported in the composting process. In this study, we describe the further design and validation of the 16S rRNA-gene based COMPOCHIP microarray and its first application to different types of composts and compare the results with those obtained by polymerase chain reaction (PCR)-DGGE.

Materials and Methods

Source of Bacterial Strains

Reference strains for specificity validation of the microarray oligonucleotide probes were obtained from the German Collection of Microorganisms and Cell Cultures (DSMZ), from the culture collection at the Institute of Microbiology, University of Innsbruck, and from the Austrian Agency for Health and Food Safety. Cultivation conditions for organisms were as suggested by the DSMZ or according to their references.

Collection of Compost Samples

Compost samples used in this study were collected from the composting plants in Oensingen and Fehraltorf, Switzerland. Samples of all three composts were collected at 2, 8, and 16 weeks. The green compost (CH-Oensingen) comprised 35% tree cuttings, 20% lawn clippings, 38% green clippings, 3–4% soil, and 3–4% cereal processing residues. The manure mix compost (CH-Fehraltorf) comprised 25% horse manure, 10% mature compost, 50% green clippings, 10% soil, and 5% cereal processing residues. The anaerobic digestate compost (CH-Oetwil am See) was formed from green waste, kitchen waste, and food residues that were digested for 12 days in an anaerobic digester, dewatered and then composted in Fehraltorf. The green compost had pH values of 8.1, 9.1, and 8.5 at weeks 2, 8, and 16, and the composting process reached a temperature maximum of 71°C (for 8 days, the temperature was >60°C).The manure mix compost had pH values of 7.5, 8.5, and 8.2 at weeks 2, 8, and 16, and the composting process reached a temperature maximum of 65°C (for 31 days, the temperature was >60°C). The anaerobic digestate compost had pH values of 8.1, 8.4, and 8.4 at weeks 2, 8, and 16, and the composting process reached a temperature maximum of 68°C (for 14 days, the temperature was >60°C).

Windrows (3.4-m width, 1.8-m height) were turned every 2–3 days with a self-propelled Sandberger (Neuson Hydraulic, Linz, Austria) turner, and immediately prior to sampling, windrows were turned at least twice. Humidity was manually controlled before each turning operation, and, if needed, water was added during the turning operation. A profile was dug into each of the compost windrows, and approximately every 3 m, a 10-cm “slice” of the windrow was sampled and placed into an aluminum container. After thorough mixing of the ten 60 l subsamples, 500-g samples were collected and sent directly to the laboratory for molecular analysis. One composite sample was available for each compost type and sampling time.

DNA Extraction

DNA was extracted from bulk compost samples using a procedure described previously [19]. A Promega Wizard DNA clean up kit (Madison, WI, USA) was used to purify DNA, according to the instructions of the manufacturer. Extracted DNA was subjected to electrophoresis in a 1% agarose gel in 1× Tris-acetate-EDTA (TAE) buffer, and DNA concentration was determined by fluorescence using a PicoGreen® dsDNA quantitation kit (Molecular Probes, Oregon, USA) and a fmax Fluorescence Microplate Reader (Molecular Devices, CA, USA), as described by the manufacturer.

Design and Evaluation of Oligonucleotide Probes

Oligonucleotide probes were designed with the ARB software package [34] using the PROBE_FUNCTIONS tool and the PROBE_MATCH tool for specificity testing. All possible probe mismatches to non-target organisms were centralized to minimize the risk of unspecific binding [49]. The GeneRunner program (http://www.generunner.com) was used to calculate the GC%-content, predicted melting temperature, and formation of secondary structures. Sequences were subjected to a basic local alignment search tool analysis [3], and further evaluation of the specified probe sequences was done by searching for matches between the probe and its potential targets within the rRNA sequences in the RDP II database. The Probebase on line resource for rRNA-targeted oligonucleotide probes was used as a tool to search for already published probes to be included in the study [32]. A table provided in the Electronic supplementary material lists the sequence, specificity, and predicted melting temperature of all probes. Probe names were based on the nomenclature of Alm et al. [2].

Microarray Manufacture and Processing

Oligonucleotide probes for microarray printing were synthesized by ThermoHybaid (Ulm, Germany), and the 5′ end of probes was tailed with 12 2′-deoxythymidine 5′-triphosphate molecules (T-spacer). Microarrays were printed by Lambda GmbH (Freistadt, Austria). To allow covalent coupling of oligonucleotide probes to the aldehyde-coated CSS-100 glass microscope slides (Genetix GmbH, München, Germany), the 5′-terminal nucleotide of each oligonucleotide was amino modified with C 6 MMT.

Lyophilized oligonucleotides were resuspended in water and diluted to a printing concentration of 23 μM with 3× saline-sodium citrate (SSC) buffer prior to spotting onto the activated slide surface using a Piezo-printer (GeSim Nano-Plotter NP 1.2) with a “NanoTip” micropipetting tip. A volume of 0.3–0.4 nl of each probe was spotted in duplicate on slides. Quality controls were printed as single spots on the left and right sides of each row of the microarray, and hybridization controls were printed in the middle of each row.

Preparation of Fluorescently Labeled Target DNAs by PCR

Labeled target DNA was prepared by PCR amplification of 16S rRNA genes using a Cy5-labeled universal bacterial 8F forward primer. Briefly, target DNA from reference strains and compost DNA samples was subjected to PCR amplification of the nearly complete 16S rRNA genes with the universal bacterial primers 8F (AGAGTTTGATCMTGG), to which a Cy5-label was attached at the 5′ end, and 1492R (TACCTTGTTACGACTT), to which a PO4-group was conjugated at the 5′ end [11]. PCR amplifications were performed in a ThermoHybaid PCR Express thermalcycler in 50-μl volumes, with each standard reaction mix containing a final concentration of 1× reaction buffer [16 mM (NH4)2SO4, 67 mM Tris–HCl pH 8.8, 1.5 mM MgCl2, 0.01% Tween 20] (GeneCraft, Münster, Germany), 200 μM each deoxyribonucleotide triphosphate (dNTP), 0.8 μM of the forward primer, 0.2 μM of the reverse primer, 1× enhancer (Peqlab, Germany), 1.25 U BioTherm™ DNA polymerase (GeneCraft, Münster, Germany), and sterile water. In addition, 10 mM tetramethylammonium chloride was included in reactions to enhance the specificity [33]. Five microliters of extracted compost DNA (diluted 1/60) was applied directly to the PCR reaction mix.

Thermal cycling was performed according to [37]. After an initial denaturation at 95°C for 5 min, amplification reactions were subjected to 10 min at 80°C, 1 min at 50°C, and 3 min at 72°C. Thermal cycling then proceeded with 33 cycles of 95°C for 1 min, 50°C for 1 min, and 72°C for 3 min. Temperature cycling was followed by a final extension at 72°C for 10 min. PCR products (100 μl) were purified with the GenElute™ PCR Clean-Up Kit (Sigma, Missouri, USA). For the subsequent preparation of fluorescently labeled single-stranded DNA targets, the phosphorylated DNA strand was removed. One thousand nanograms of a purified PCR product was digested using 30 U Lambda exonuclease (Epicentre, Madison, WI, USA) in 1× Lambda exonuclease buffer, at 37°C for 3 h.

Hybridization

Single-stranded Cy5-labeled PCR product (500 ng) was vacuum-dried and resuspended in 20 μl of a hybridization buffer consisting of 5× SSC, 1% blocking reagent (Roche, Mannheim, Germany), 0.02% sodium dodecyl sulfate (SDS), 0.1% n-laurylsarcosine, and 5% formamide [33]. One microliter of a 50-nM Cy5-labeled control oligonucleotide (Electronic supplementary material, Table 1) was added to each tube, and the mixture was denatured for 10 min at 95°C, before being placed on ice. Nineteen microliters was transferred onto a prechilled microarray (on ice) and covered with a glass coverslip to guarantee a uniform moistening of the array surface, and arrays were placed into small plastic containers with hybridization buffer saturated blotting paper for equilibration. Hybridization was conducted at 56°C for 4 h. After hybridization, slides were washed immediately at room temperature, each for 3 min in buffer 1 (1× SSC, 0.2% SDS), followed by buffer 2 (0.01× SSC, 0.2% SDS) and buffer 3 (0.05× SSC). Finally, arrays were briefly submerged into distilled water and air-dried. Array experiments were conducted in triplicate.

Scanning of Arrays and Image Analysis

Microarray slides were scanned with a ScanArray Gx microarray scanner (Perkin Elmer, Massachusetts, USA). Scan power was set to 90% and PMT gain to 500 (635 nm) and 450 (532 nm) for scanning. Fluorescent images were analyzed using the ScanArray Gx software (Perkin Elmer) by superimposing a grid of circles onto the image to designate each fluorescent spot to be quantified. The median foreground and background signals for all spots were determined. The signal-to-noise ratio (SNR) for all spots was calculated according to the following calculation:

$${\text{SNR = }}{{\left[ {{\text{median}}\;{\text{foreground}}\;{\text{of}}\;{\text{probe}} - {\left( {{\text{median}}\;{\text{foreground}}\;{\text{of}}\;{\text{nonsense}}\;{\text{probe}}\;{\text{KO}}\;{\text{19}} - {\text{median}}\;{\text{background}}\;{\text{of}}\;{\text{nonsense}}\;{\text{probe}}\;{\text{KO}}\;{\text{19}}} \right)}} \right]}} \mathord{\left/ {\vphantom {{{\left[ {{\text{median}}\;{\text{foreground}}\;{\text{of}}\;{\text{probe}} - {\left( {{\text{median}}\;{\text{foreground}}\;{\text{of}}\;{\text{nonsense}}\;{\text{probe}}\;{\text{KO}}\;{\text{19}} - {\text{median}}\;{\text{background}}\;{\text{of}}\;{\text{nonsense}}\;{\text{probe}}\;{\text{KO}}\;{\text{19}}} \right)}} \right]}} {{\text{median}}\;{\text{background}}\;{\text{of}}\;{\text{probe}}{\text{.}}}}} \right. \kern-\nulldelimiterspace} {{\text{median}}\;{\text{background}}\;{\text{of}}\;{\text{probe}}{\text{.}}}$$

Signals were assumed to be positive if a SNR value of ≥2 was obtained [33]. Statistical analyses of data were performed using the program Canoco 4.5 [46]. Probes were sorted by the maximum SNR for all samples, and of the 369 probes on the COMPOCHIP microarray, 102 probes had a maximum SNR above 2, and were thus included in the analyses. Because a strong correlation between standard deviation and means of replicate samples was found, a log-transformation of data was conducted to equalize variances. Covariance based redundancy analysis (RDA), a canonical form of principal component analysis, was used for data analysis, and the canonical variables were compost type and time. RDA used the following settings: inter-sample distance scaling, no post-transformation of scores, log data transformation (no offset), and center by species.

PCR and DGGE

DNA for DGGE analysis was amplified with the primer set 338fGC-907r [35, 41]. Each PCR mixture contained 0.5 ng of extracted DNA, 0.2 μM of each primer, 1.25 U Bio Therm™ DNA Polymerase (Gene Craft), 1× DNA polymerase buffer [16 mM (NH4)2SO4, 67 mM Tris–HCl pH 8.8, 1.5 mM MgCl2, 0.01% Tween 20], 0.1 μg μl−1 bovine serum albumin, 4% (v/v) dimethylsulfoxide, 200 μM each dNTP, and 2.5 mM MgCl2 in a final volume of 25 μl.

DGGE was performed using the Ingeny PhorU2 system (Ingeny International BV, The Netherlands). PCR products (60 ng) were loaded into the wells of a 7% (w/v) polyacrylamide gel with a denaturing gradient of 40% to 70% (100% denaturant consists of 7 M urea plus 40% formamide in 1× TAE buffer), and each gel was run for 16 h at 100 V at a constant temperature of 60°C in 1× TAE buffer (pH 7.4). After electrophoresis, gels were stained with silver nitrate using an automated gel stainer (Amersham Pharmacia Biotech, Germany), photographed and air-dried. DGGE banding patterns were analyzed using the GelCompar II version 4.0 software package (Applied Maths, Ghent, Belgium). A matrix of similarities was calculated based on the Dice correlation coefficient, and dendrograms were constructed using the algorithm of Ward [30].

Results and Discussion

Probe Design

In our previous microarray publication, we described the methodological development of a set of 65 probes specific for microorganisms involved in the composting process [19]. This initial probe set has been extended to include 369 probes, following a multiple-probe approach. Probes targeting organisms commonly found in soils and composts have been included, and the microarray can be applied to different environmental samples. Ideally, all probes in a set of probes that perfectly match a target organism should hybridize in order to reduce the risk of a false-positive identification of the target. For most target organisms, at least three probes were spotted on the slide, and only for a few organisms were there two probes (Electronic supplementary material, Table 1). In order for a target to be considered present in a sample, more than one probe specific for the target needed to yield a SNR above the detection limit. Probes were designed to target at either the phylum, family, genus, or species level. Because of the high level of sequence conservation of the 16S rRNA gene between phylogenetically related bacteria, it was not always possible to design a probe specific to only a particular species. Therefore, some probes were designed that targeted two or more members of a genus (Electronic supplementary material, Table 1). Due to the fact that all the probes on a microarray are subjected to the same conditions of hybridization, probe design can be a difficult process requiring much optimization, with the maximization of probe sensitivity and specificity very often being conflicting goals. All the probes included on the COMPOCHIP microarray were designed so to have similar melting temperatures, and probe sequences ranged in length from 17 to 25 nucleotides (Electronic supplementary material, Table 1). The T m for the different probes varied from 55.9–63.7°C (calculated according to the nearest neighbor model) [10], and it was not possible to further narrow this due to the limited number of differential sequence regions in the 16S rRNA gene. As a result, probes with suboptimal predicted melting temperatures were included if no alternatives were available.

Probe design was conducted so that mismatches were as centrally located within oligonucleotide sequences as possible. Single mismatches in the end positions were not considered as differences, as mismatch positions near, or at the terminus of a short duplex are reportedly less destabilizing than internal mismatches [42]. Prior to probe synthesis, secondary structure formations were also studied. Oligonucleotides with strong hairpin structures, dimers, internal loop, or bulge loop formations were considered suboptimal and were not included if other possible probes could be designed. We considered secondary structures with dG values ≤ approximately −3, as stable [50] (C. Hoebartner, personal communication).

COMPOCHIP Specificity Validation

The specificity of all probes was assessed in silico, using the ARB program. However, the results obtained in silico and obtained upon microarray hybridization can differ; thus, the specificity of a majority of the oligonucleotide probes included on the array were tested with pure cultures. Due to reasons of restricted distribution of high-risk group biological agents and difficulty in cultivation, not all pure culture microorganisms were available to check probe specificities. Of the 369 probes included on the array, 67 of the probes were not validated. Of the 302 probes that were validated, 213 probes (71%) were found to hybridize exclusively to their target organisms (Electronic supplementary material, Table 1). Of the 30,258 individual probe-target hybridization reactions performed (hybridization of 82 reference microorganisms), there were only 22 false-negative signals (0.07%) and 188 false-positive signals seen (0.62%). These hybridization results are statistically similar to those obtained by other authors [8, 33, 39]. Certain probes were found to bind non-specifically with several targets, these probes being the Alphaproteobacteria KO 240 (9), Low G + C KO 319 (17), Nitrospira KO 295 (7), Thermomonospora chromogena KO 330 (7), and Actinomyces KO 342 (6). Other probes also bound non-specifically to one or a few of the target DNAs. This happened despite the fact that there existed a varying number of mismatches between target and probe sequences. For example, the Burkholderia probe KO 234 was found to hybridize with Saccharomonospora DNA, despite six base-pair differences in the probe target sequence, and Clostridium bifermentans DNA hybridized with the Clostridium butyricum KO 377 probe, despite ten mismatches in the probe target sequence. Because microarray experiments use the one set of conditions for hybridization and washing, false positives are not an uncommon event [33]. Both cultured and uncultured/unknown microorganisms can cause the generation of false-positive signals. One approach that can be used to guarantee the reliability of microarray results is to include several probes with identical specificities for each target species or group. Using this approach, all probes in a set of probes matching the target microorganism must show positive hybridization signals to avoid the risk of false-positive identification [33]. This approach has been used in our study.

Probes that did not show a positive hybridization signal with the target reference microorganisms (6% of all designed probes) were excluded from the analysis and will be excluded from the microarray in future experiments. False-negative signals are a common phenomenon in microbial microarray analysis [28, 33] and can be explained by parameters such as secondary structures in the target molecule or steric hindrance, causing a reduction in the hybridization efficiency in a probe-binding site-specific way [37].

The signal intensities of the individual hybridizations were found to vary significantly for the different probes spotted on the microarray. This is an expected consequence of using the one set of hybridization conditions to evaluate many different probes on a microarray and has been reported by others [21, 31]. The positioning of the oligonucleotide probe sequence on the 16S rRNA gene is important and can determine the strength of hybridization signal obtained, firstly because every nucleic acid binds in a defined orientation to its complement, and secondly because certain regions of the gene are less accessible to probe binding due to the secondary structure of the gene [37]. Other possible explanations for the difference in signal intensities include different probe melting temperatures, varying dG values of secondary structure formations of probes, and different lengths of oligonucleotide probes.

The detection sensitivity of PCR-amplified organisms with the COMPOCHIP was determined in a previous study [18] and the detection limit of the probes tested was found to be approximately 103 cells, or 5% of the total microbial population. It must however be considered that the sensitivity limits for different probes vary due to the ability of certain probes to give higher signal intensities than others. It is also possible that a detection limit of 5% will allow potential pathogens to go undetected by the COMPOCHIP microarray.

COMPOCHIP Analyses of Compost Samples

The aim of designing the COMPOCHIP 16S rRNA gene microarray was to develop a tool for detecting the presence of different bacteria in a particular environmental sample in a single test. Composts contain a large and diverse community of microorganisms that are involved in the digestion of organic wastes. When the nucleic acids of pure bacterial cultures are examined with a microarray, unambiguous hybridization patterns are generally observed. However, when microarrays are used to evaluate environmental samples, the relative abundance of various target and closely related non-target microorganisms can be more difficult to determine due to the complexity of natural communities and potential cross-hybridization events [17]. For this reason, more significance to the presence of certain bacteria in different samples was attributed when more than one of the probes specific for a particular target was positive.

Due to a potential bias that may be introduced by PCR [44], the results obtained by hybridizing PCR-amplified templates may reflect a biased microbial community composition [51]. However, the use of PCR to amplify target DNA greatly increases the detection sensitivity of microorganisms in an environmental sample in comparison to non-PCR-based methods. This was especially important in this study, as we are interested in the presence of different bacteria, which may be present in relatively low numbers, yet still be of importance. Without PCR, it is possible that the levels of such microorganisms would be too low for detection.

Due to the known difficulties associated with working with compost and with obtaining sample homogeneity, bulk samples were carefully prepared for this study. According to Dionisi et al. [15], PCR amplification is responsible for a greater variability in results than DNA extraction. These authors found that DNA extraction contributed little to variability, as determined by real-time PCR. We thus decided not to conduct replicate DNA extractions. In order to reduce bias, three replicate PCR amplifications were conducted on bulk compost sample DNAs. The COMPOCHIP microarray was applied to the three compost types (green, manure mix, and anaerobic digestate composts), each sampled at different maturity stages (2, 8, and 16 weeks), so to determine if the array was able to discriminate between the various composts and to determine which bacteria were present in each of them. Because a linear correlation between signal intensity of a particular probe and target concentration has been reported by others [45, 48], the results obtained with the COMPOCHIP microarray were analyzed semi-quantitatively. However, because different probes have different affinities for their targets and because of the inherent bias involved with PCR, the information on the relative abundance of different microorganisms derived from the microarrays needs to be interpreted with some caution.

Figure 1 shows a heat map containing the hybridization results of probes for which the highest SNR for all the samples tested was ≥2. Probes for which all samples gave SNR values <2 were negative and not considered to be useful in the differentiation of compost samples. Interestingly, the 2-week composts were found to have the highest number and greatest intensity of signals (indicated by darker colors in the heat map) when compared with the 8- and 16-week compost samples (Fig. 1). In the green compost, there were 28 probes with normalized signals above 0.03065 in the 2-week compost, while only ten probes with signals above this value in the 16-week sample. In the anaerobic digestate and manure mix composts, there were 34 and 43 probes with normalized signals above 0.03065 in the 2-week compost, respectively, and only seven and five probes with signals above this value in the 16-week sample, respectively. In the 8- and 16-week manure mix compost samples, the only signals were from probes targeting groups (e.g., Alphaproteobacteria, Low G + C bacteria), and there were no signals from probes targeting individual species. It is possible that the probes on the array are biased toward bacteria that are more commonly found in earlier composting stages rather than in later stages. For this reason, the diversity in the more mature composts may be under-represented by the array. It is also possible that microorganisms targeted by probes on the array were present in the compost samples but at levels below the detection limit of the microarray.

Figure 1
figure 1

Visualization of microarray results for all composts. The probe names are listed in each row, and columns represent the different samples. The SNR values for all probes were normalized to the UNIV 1389 probe, and attributed a value between 0 and 1 (UNIV 1389 = 1). The relative intensity of the normalized SNR signal of probes is indicated by the colour bar. Only probes which had a maximum signal for all hybridizations above the threshold value of 2 were included in the table

Detection of Different Microorganisms

Bacteria belonging to the genus Actinomyces were found to be present in high numbers in all the different compost types (Fig. 1). In the green compost, Actinomyces levels were higher in the mature compost than in the fresher compost samples. These results are supported by previous findings, whereby Actinomyces have been reported to develop more slowly than most other microorganisms and are thus comparatively ineffective competitors under high-nutrient conditions [24, 38]. The presence of Actinomyces in composts has been reported to be desirable due to several positive effects exerted by members of the genus, among which is the ability to produce various antibiotics that can suppress pathogenic microorganisms.

The green composts were also found to have the highest levels of Chryseobacterium (Fig. 1). Three Chryseobacterium probes were included on the array, and although not all the probes worked equally well, all probes resulted in hybridization and SNR values above the threshold. For all the composts, Chryseobacterium levels were higher in the fresher composts than in the more mature composts. Chryseobacterium belongs to the phylum Bacteroidetes, members of which are known for their importance in the degradation of complex biopolymers in composting situations [1].

Higher signals for the probes targeting Microbacterium and Sphingobacterium were also detected in the 2-week composts and for the green compost in particular. The group probes targeting the genera Xylella, Xanthomonas, and Stenotrophomonas hybridized well with all 2-week composts and with the 16-week green compost. These bacterial genera include species that are involved in plant disease suppression and are also known as plant and human pathogens [23]. The presence of these bacteria could be of concern in a mature compost product. Probes targeting the grape pathogen Xylella fastidiosa did not hybridize with any target DNA from these composts. Of the five probes targeting Xanthomonas (a genus comprising many plant pathogens), only one probe produced a detectable hybridization signal. As all five probes were found to work well with a pure culture of Xanthomonas, it can be concluded that either very low levels of Xanthomonas were present in the composts or that the hybridization with the KO 561 probe was due to a non-specific hybridization with other DNA. The much lower SNRs obtained for this probe in the 16-week compost samples (all below the threshold value of 2) indicate a reduction or elimination of this potential pathogen during the composting process. On the other hand, probes targeting Stenotrophomonas maltophilia (KO 243 and KO 443), an organism well known for its involvement in plant disease suppression [25], gave detectable signals with all the compost types. DNA from S. maltophilia may have been responsible for the significant SNR values seen for the group probe.

The COMPOCHIP microarray included probes targeting known plant disease suppressive (PDS) bacteria (Bacillus subtilis, Bacillus pumilus, Burkholderia cepacia, Paenibacillus, Pseudomonas fluorescens, Pseudomonas putida, Pseudomonas stutzeri, S. maltophilia, and Streptomyces; Electronic supplementary material, Table 1). With the exception of P. stutzeri, P. putida, and Streptomyces, which were not found in any compost samples, probes targeting the other PDS bacteria were found to hybridize with DNA from various samples, indicating the presence of such microorganisms in the composts. Higher levels of the PDS bacteria were found in the fresher composts. A study was conducted by Nakasaki et al. [36] with the purpose of producing compost capable of consistently suppressing Rhizoctonia large patch on mascarene grass. The authors found that by inoculating the suppressive bacterium in the initial stage of composting, after heating the compost to 80°C, the suppressive bacteria could grow sufficiently well to form spores. Without this inoculation step, it was not possible to foster high concentrations of disease-suppressive bacteria in compost because of the concentration of other bacteria originally present in the raw material.

Low signals were obtained for the Archaea probe KO 31 upon hybridization of compost DNAs. These results must clearly be due to cross-hybridization events, as hybridizations were conducted with PCR products generated using bacterial primers, and thus, it should only be bacterial DNAs yielding signals upon hybridization. Cross hybridization with Archaea probes has also been reported by Eyers et al. [17].

No signals were obtained for probes for most of the pathogens included on the array. Probes targeting the plant pathogens Erwinia amylovora, Pantoea, Azospirillum brasilense, Acidovorax avenae, Ralstonia solanacearum, and Pseudomonas syzygii gave no signals upon hybridization. Similarly, probes targeting the species Vibrio cholerae, Helicobacter pylori [5], and Staphylococcus aureus, and genera including the human pathogens Nocardia, Streptococcus, and Clostridium (with the exception of the probes KO 260 and KO 264, likely to be false positive reactions) gave no signals upon hybridization. This would indicate that the different composts do not contain these organisms or, if they do, that the organisms are present at levels below the detection limit.

Multivariate Analysis of Microarrays

Probes providing little or no information, as indicated by a SNR <2, were not included in analyses. The universal bacterial probes KO 27 (EUB 338) and KO 32 (UNIV 1389) [4] were also excluded from the analyses, as these probes bound all bacteria and thus did not allow differentiation of microbial communities between samples. One hundred probes were included in the analyses. An average value for the two duplicate probe spots was calculated for each of the three replicate arrays. Covariance-based redundancy analysis was conducted. Figure 2 shows a canonical analysis loading plot; the two axes explain 70.4% of variance, the first axis representing 51.2% of the variance, the second axis representing 19.2%. Multivariate analysis showed that the bacterial community compositions of the three composts were different at the beginning of the composting process. The 2-week compost samples appeared to differ significantly from each other, being plotted in different areas of the ordination graph. However, over time, the bacterial communities became more similar. This can be seen by the 16-week compost samples clustering together. The replicates of the compost samples were found to group closely together. The 2-week green compost sample was found to be the most different of all the compost samples, based on the microarray results. This is not surprising, as the other two composts were more similar in nature, both having undergone anaerobic digestion processes prior to composting (partly or fully), which inhibit and enhance the growth and survival of different microorganisms (the manure mix compost comprised 25% horse manure).

Figure 2
figure 2

Loading plot obtained by redundancy analysis, depicting the organisms responsible for community differences amongst the composts. The figure shows the development over time of the three compost types. The numbers 2, 8 and 16 represent the number of weeks of composting, and the vectors show the covariance structure of the probe signals. The two axes represent 70.4% of the explained variance

The loading plot (Fig. 2) shows the probes that were responsible for community differences among the composts. The lengths of the arrows indicate the significance for compost differentiation. Arrows of probes point in the direction of samples with above-average signal, and probes with similar arrow directions have high covariance, meaning they tend to occur jointly on the microarrays. The Sphingobacterium genus probes KO 548, KO 549, and KO 550 were found to give high signals in the less-mature green composts (2 and 8 weeks) and manure mix compost. Sphingobacterium is an aerobic bacterium, known to frequently occur in compost [27]. The incidence of Sphingobacterium in the anaerobic digestate composts was lower. There was also a higher association between the presence of Microbacterium (probes KO 515, 516, and 517) and the less-mature green and manure mix composts. The Actinomyces KO 342, Low G + C, and Alphaproteobacteria probes correlated well with the mature (16 weeks) green and manure mix composts, as well as with the immature (2 weeks) anaerobic digestate compost. Overall, there did not appear to be above-average incidences of probes with the compost samples located more in the left-hand side of the canonical plot, mostly either the 8- or 16-week samples and also in the 2-week manure mix compost. However, the EUB 338II and EUB 338III probes specific for the Planctomycete and Verrucomicrobia groups respectively [12], did correlate well with the older composts and would appear to be indicator probes for more mature composts.

PCR-DGGE Analyses of Compost Samples

In order to compare the newly developed COMPOCHIP microarray with a method that has been frequently used to study microbial community composition of complex habitats, PCR-DGGE analysis was performed. For the DGGE analysis, DNA of all three compost types was used to obtain the banding patterns of the contained microbial communities. Cluster analysis of the 16S rRNA gene-based banding patterns revealed that the age of the composts had more influence on the contained microbial community structure than the compost type (Fig. 3). Furthermore, the three different compost types became more similar over time, suggesting that the mature composts had more bacterial species in common than did the starting material. This is in accordance to the microarray data, which showed the same overall results.

Figure 3
figure 3

Cluster analysis of DGGE fingerprints based on 16S rDNA extracted and amplified from green, manure mix and anaerobic digestate composts at 2, 8 and 16 weeks of composting. Values at the branches of the dendrogram indicate the percentage of similarity, based on the Dice correlation coefficient

Conclusions

This study has resulted in the development of the COMPOCHIP microarray, which contains probes targeting microorganisms that have been previously reported in the composting process, as well as plant, animal, and human pathogens and plant disease suppressive bacteria. The array has been tested with pure cultures of microorganisms and has been shown to work well with only a low percentage of non-specific hybridizations (false positive, 0.62%; false negative, 0.07%). Using the COMPOCHIP microarray, it was possible to discriminate between different compost types and composts of different ages. Using a multivariate redundancy analysis, it was found that the younger 2-week composts were distinct from each other in terms of the bacterial communities, while the more mature 16-week composts were similar. The array also indicated that particular probes (targeting Sphingobacterium, Actinomyces, Xylella/Xanthomonas/Stenotrophomonas, Microbacterium, Verrucomicrobia, Planctomycetes, Low G + C, and Alphaproteobacteria) were more important in discriminating between different composts. The results of DGGE correlated well with the results of the microarrays in showing differences in amplicon sequences between the different compost samples. It must however also been mentioned that the array only targeted known compost microorganisms, and as such, it is possible that microorganisms with no cultured members may have been performing important roles in the composting process, yet going undetected. Clone library studies may reveal the identity of as yet unknown and uncultured composting microorganisms, to which probes could be designed in the future.

This manuscript describes the design, development, and application of the COMPOCHIP microarray and has demonstrated the huge potential of microarrays to analyze the bacterial communities of different environmental samples in a single test. More comprehensive sequence-based assessment of the microbial diversity associated with the different phases of the composting process is needed and will allow the design of more probes and the expansion of the COMPOCHIP microarray. Application of the microarray should allow for better compost process management in the future and the tracing of compost amendments to soils.