Introduction

Natural environments and multicellular organisms are colonized by various microbes. In many cases, microbial cells form complex communities wherein they communicate (e.g., quorum sensing signals), cooperate (e.g., symbiosis of Clostridium acetobutylicum and Bacillus cereus; Wang et al. 2015), and fight (e.g., production of antimicrobials) each other, instead of existing as isolated cells. Recent studies propose that microbial communities are mostly present in a sessile state and not in a planktonic state (Watnick and Kolter 2000). Since microbes are usually cultured as free planktonic cells in laboratory experiments, the sessile state needs to be given more attention in order to understand microbial activities.

A biofilm is a population of cells growing on a surface and is a type of sessile cell community. Biofilm structures are ubiquitously found on both abiotic and biotic surfaces like kitchen sinks, river stones, and human teeth. Biofilms are known to be responsible for nosocomial infections, contamination of medical devices, chronic bacterial infections, and periodontal diseases (Donlan and Costerton 2002). Microbial cells within biofilm structures are packaged along with extracellular polysaccharides (EPS), nucleic acids, proteins, and other materials, and are tolerant to antimicrobials, heat treatment, and the host immune system (Carvalhais et al. 2015; Stoodley et al. 2002). Therefore, it is difficult to kill microbial cells inside a biofilm and to eliminate the entire biofilm structures. As an example, in the lungs of patients with cystic fibrosis (a genetic disorder), pathogenic Pseudomonas aeruginosa cells form strong biofilms and acquire high tolerance to medical treatments (Magalhães et al. 2016).

Biofilm research so far has mainly focused on biofilm samples consisting of one or a few microbial species, but only a limited number of studies have been performed that concern complex multi-species biofilms. It is therefore important to perform meta 16S ribosomal RNA (rRNA) gene and metatranscriptome analyses of biofilm samples from complex microbiomes or environments. These analyses would be helpful for a detailed understanding of microbial ecology and the development of anti-biofilm drugs.

In the present study, we cultured biofilm samples collected from Japanese rivers and succeeded in establishing a multi-species biofilm model in the laboratory. Culture of this biofilm model resulted in the formation of both planktonic and biofilm cell communities. Although meta 16S rRNA analysis showed little difference between the planktonic and biofilm cell communities, metatranscriptomic analysis revealed various differentially expressed genes.

Materials and methods

Sample collection and biofilm culture

Biofilm samples were cultured in one third strength of Luria broth (1/3 LB; LB broth was purchased from Difco, Sparks, MD, USA) at 25 °C, unless otherwise stated.

Ten river samples were collected from different points of the Tamagawa River, including a rock covered with biofilm from midstream (north latitude 35.610566 and east longitude 139.624680), a stone and its covering slime from midstream (35.610461, 139.625142), a stone and covering slime from downstream (35.541471, 139.752706), brown slime from downstream (35.541471, 139.752706), slime attached to the underside of a rock from downstream (35.541471, 139.752706), slime attached to the underside of a mat from downstream (35.541471, 139.752706), a stone and covering slime from upper stream (35.621639, 139.572920), slime of the top of driftwood from upper stream (35.621642, 139.572930), river water from upstream (35.621700, 139.572478), and river water from downstream (35.541471, 139.752706). Three samples were collected from the Yazawagawa River: river water (35.6037972, 139.6464361), waterfall (35.6037972, 139.6464361), and red clay (35.605759, 139.645380). Since this location is not privately owned or protected, no specific permits were required for the described field studies. A sterilized slide glass was placed in a Petri dish (90 mm diameter), and 10 ml of 1/3 LB along with a drop of the sample (mixture of biofilm and river water) was added. Three-day-old statically cultured cells (both planktonic and biofilm cells) were used to prepare glycerol stocks of these samples with 20 % glycerol and stored at −80 °C.

After a week, inoculation was initiated from glycerol stocks in a similar manner using 50 μl of the glycerol stock solution from each sample. After 3 days, planktonic and biofilm cells were collected separately. (i) The planktonic cells were collected by aspirating 1.5 ml culture broth, (ii) the remaining culture broth was discarded, (iii) the inner part of the dish and the surface of the slide glass were washed twice with 5 ml of 1× PBS [made using 10× PBS(−) purchased from Wako Pure Chemical Industries, Osaka, Japan], (iv) 800 μl of 1× PBS was added, and (v) the biofilm cells were scraped off from the inner part of the dish and the surface of the slide glass using an interdental brush (Yawaraka-Shikan-Burashi; Kobayashi Seiyaku, Tokyo, Japan). Glycerol stocks were prepared again as described above, and these culture and sample collection steps were repeated two more times (thrice in total).

DNA extraction and meta 16S rRNA sequence analysis

Metagenomic DNA from the original biofilm samples and the cultured planktonic and biofilm cells was prepared using a Power Soil DNA Isolation Kit (MO BIO Laboratories, Carlsbad, CA), according to the manufacturer’s instructions. The V3–V4 regions of the 16S rRNA genes (position 342 to 806 in Escherichia coli numbering) were PCR-amplified using a non-degenerate universal primer set, 342F and 806R (Kato et al. 2015), with EX Taq HS polymerase (Takara Bio, Ohtsu, Japan). All PCR samples were mixed to generate a 21-plex sample library, and the sequences were read by the MiSeq instrument (Illumina, San Diego, CA) in a paired-end-read mode for 300 cycles according to the manufacturer’s protocol.

High-quality sequences (of >150 and <450 bases in length) were selected for subsequent processing (i.e., alignment to a standard sense strand and removal of the primer sequences used for sequencing). Only the forward sequences were used for subsequent processing since the reverse sequences were mostly of low quality and merging of each forward and reverse sequence was not possible. Sequence clustering of these high-quality reads was performed using the UCLUST program (Edgar 2010) version 6.0.307 with an identity of ≥97 % and a query and reference coverage of ≥80 %. Chimeric clusters detected by the UCHIME program (Edgar et al. 2011) in the de novo mode and the reference mode [searching the reference Gold Database (DB) (http://drive5.com/uchime/gold.fa)] were removed. Taxonomic assignment of the resulting MiSeq reads was performed using the RDP Classifier program (Wang et al. 2007) version 2.6 with a bootstrap value ≥0.5 against the representative sequences of each 97 % sequence cluster chosen by the UCLUST program. Unless otherwise stated, the programs for read assemblage and analysis in this study were run using default parameters. The 16S rRNA sequence data have been deposited under the BioProject accession number PRJDB4599 in the DDBJ BioProject database (http://trace.ddbj.nig.ac.jp/bioproject/index_e.html).

mRNA extraction and construction of a cDNA library

The culture conditions and sample collection methods were the same as described above. The culture was started with the second glycerol stock of the sample, and 30 Petri dishes were handled in parallel to obtain sufficient amounts of RNA for RNA-seq. Cells were collected separately from the planktonic and the biofilm communities from 12, 24, and 48-h cultures. Total RNA was extracted from the cultured biofilm and planktonic cells using the acid hot phenol method (Aiba et al. 1981). The extracted RNA samples were purified using an RNeasy Mini Kit (Qiagen, Valencia, CA) according to the manufacturer’s protocol for enriching the messenger RNA (mRNA), since most RNAs of <200 nucleotides (including 5.8S rRNA, 5S rRNA, and transfer RNAs (tRNAs), together constituting 15–20 % of total RNA) are selectively excluded. rRNAs were removed from total RNA using a MICROBExpres Bacterial mRNA Enrichment Kit (Ambion, Austin, TX). Complementary DNA (cDNA) libraries were then constructed using a TruSeq Stranded mRNA and Total RNA sample preparation kit (Illumina) and sequenced using MiSeq in the paired-end-read mode for 300 cycles.

RNA-seq analysis

High-quality RNA-seq reads (of >150 and <450 bases in length) were selected for subsequent processing (i.e., alignment to a standard sense strand and removal of primer sequences used for sequencing). RNA-seq reads from each sample were separately assembled using the IDBA-UD program (Peng et al. 2012) version 1.1.0 with the parameters “–mink 20 –maxk 70 –step 5.” Protein coding sequences in the contigs were predicted using the MetaGeneMark program (Zhu et al. 2010) version 2.10 with the parameters “-g 11 -a -d -f G.” Functional annotation of each read was performed by BLASTP search using the parameters “-g T -F F -e 1e-8 -m 8 -b 10 -v 10” against the KEGG protein sequence DB, which was obtained in March 2014. Taxonomic (phylum, class, order, family, and genus levels) and functional [KEGG Orthology (KO)-level] assignment was performed for reads that exhibited the highest BLAST scoring hits with an identity ≥70 % and a BLAST bit-score ≥40. Reads from each sample were mapped to the predicted ORFs using Bowtie2 (Langmead and Salzberg 2012). KO abundance was calculated by dividing the number of reads belonging to the same KO by the median length of proteins annotated to the same KO in the KEGG protein DB. KO abundance was normalized to the number of reads included in each sample. KEGG pathway abundance was calculated as the sum of KO abundances belonging to a single pathway. To compare the abundances of individual KOs and their pathways among different samples, abundances in each RNA-seq sample were normalized to sample read numbers. This method for normalizing functional abundances among samples was applied to all BLAST-based functional assignments in this study. RNA-seq data have been deposited under the BioProject accession number PRJDB4600 in the DDBJ BioProject database (http://trace.ddbj.nig.ac.jp/bioproject/index_e.html).

Determination of biofilm amounts

Samples from glycerol stocks were cultured in a Petri dish (60-mm diameter) containing 2 ml medium. Only planktonic cell communities were collected from the grown samples, and their optical density at 595 nm was measured. To measure the amount of biofilm, the planktonic cells were completely removed, and the remaining biofilm samples were gently washed twice with 2 ml distilled water. The biofilm samples were kept in a fume hood with a blower until the samples dried out (approximately 1 h). Then, 2 ml of 0.1 % crystal violet solution was added to each sample and incubated for 15 min at room temperature. After discarding the crystal violet solution, the biofilm samples were washed twice with 2 ml of distilled water, gently to keep the biofilm intact. The stained biofilm samples were then dried out as previously described, in a fume hood (for approximately 1 h). To each sample, 2 ml of 70 % ethanol was added and the samples were kept for 30 min at room temperature. Finally, 100 μl of the stained ethanol was collected from each sample, and the amounts of the eluted dye were quantified by measuring absorbance at 595 nm.

Results

Establishment of a model biofilm system in a laboratory

Considering that most natural biofilms consist of multiple bacterial species, we believe that the lack of model biofilm systems has hampered intensive research in this area. To establish a biofilm model, 13 river samples were collected, including biofilm structures from both abiotic surfaces and the surrounding river water. These samples were cultured in a Petri dish containing a slide glass (see “Materials and Methods” and Fig. 1a). Three of the samples, SM (a stone and its covering slime from midstream), BD (brown slime from downstream), and WU (river water from upstream), were found to form a clear biofilm on the wall of the dish and on the slide glass after 3 days. Additionally, these three cultures could form a biofilm even when the first cultures were frozen as glycerol stocks and the second inoculation was restarted from these stocks. In the same manner, we confirmed that biofilm formation could be repeated at least two more times.

Fig. 1
figure 1

Biofilm model and 16S rRNA gene analysis. a An image of the cultured biofilm BD is shown. b 16S rRNA gene sequences were classified up to the generic level. “O,” “B,” and “P” characters indicate original culture, biofilm community, and planktonic community, respectively. The numerals indicate the number of inoculations repeated

The bacterial species existing in planktonic and biofilm communities of the above samples were analyzed by massive sequencing of 16S rRNA (Fig. 1b). The sequencing results indicate that the generic composition differed depending on the source of the biofilm and that the composition between the planktonic and biofilm communities from the same culture was not clearly different. The generic composition of the original samples (i.e., uncultured samples, Fig. 1b) was also analyzed in a similar manner. We discovered that the variation in the biofilm community was lost upon culture in laboratory conditions. Taken together, we successfully established model biofilm systems, which are reculturable and reproducible in the laboratory. Nevertheless, the fact that species composition might be variable among repeated cultures should be considered.

An overview of the RNA-seq analysis of planktonic and biofilm communities

As mentioned, phylogenetic analysis based on 16S rRNA sequences showed little difference between the planktonic and biofilm communities. In order to understand the dynamic but not the static differences among the communities, metatranscriptome analysis by RNA-seq was performed using 12-, 24-, and 48-h cultures (see “Materials and Methods”). Among the above-mentioned three samples, only the sample BD formed sufficient amounts of biofilm to be subjected to RNA-seq, and therefore, only this sample was used further. The growth and biofilm formation kinetics observed during culture are shown in Fig. 2a, b, and we defined 12-, 24-, and 48-h cultures as early, middle, and late stages of biofilm formation, respectively. Growth was found to drift into a period of stagnation around the middle stage of the culture, possibly due to the co-existence of fast and slow growers in this culture.

Fig. 2
figure 2

Growth kinetics and quantitation of biofilm formation. a Growth kinetics of the BD culture are shown. Optical density at 595 nm was measured at three time points. The data are presented as mean ± SD from triplicate experiments. b Quantitation of crystal violet staining of the BD culture is shown. The amounts of crystal violet indicating the amounts of biofilm formed were measured at three time points. The data are presented as mean ± SD from triplicate experiments

From the metatranscriptome data, we first examined the bacterial genera and species that were alive and vital at the specific biofilm formation stages in both communities. To this end, the average expression levels of gyrB, rpoB, and rplC genes were determined (Fig. 3a). These three genes were chosen as a representative of the replication, transcription, and translation machineries and as universal single copy genes (Kato et al. 2015). At the early stage of culture, Aeromonas and Pseudomonas bacteria were dominant, and with time, Pseudomonas bacteria became highly dominant (Fig. 3a). This result is consistent with that shown in Fig. 1b. However, bacteria that were not significantly detected in meta 16S rRNA analysis were also found to be present and vital; these included (i) Acinetobacter at the early and middle stages, (ii) Laribacter and Xanthomonas at the middle and late stages, (iii) Enterobacter at the early stage of the planktonic community, and (iv) Pseudogulbenkiania at the late stage of the biofilm community (Fig. 3a). Although determination of bacterial species was difficult using meta 16S rRNA analysis (Fig. 1b), it was possible with the metatranscriptome data (Fig. 3b). At the species level, at least 20 bacterial species were found to be present and vital, and our culture conditions were feasible for these bacteria. It should be noted that the metatranscriptome data do not reflect the simple presence ratio of bacterial species.

Fig. 3
figure 3

Dynamic differences between planktonic and biofilm communities. a The relative abundance for each genus was calculated from the average of the number of reads assigned to gyrB (KEGG orthology K02470), rpoB (K03043), or rplC (K02906). “B” and “P” characters indicate biofilm and planktonic communities, respectively. b The relative abundance for each species was calculated in the same manner as for panel a

In the next level of metatranscriptome data analysis, the upregulated genes in the biofilm community were identified (Table 1). Over 30 % of the genes were upregulated in the subcategories of “biosynthesis of other secondary metabolites,” “xenobiotics biodegradation and metabolism,” “membrane transport,” and “lipid metabolism” (Table 1, limited to subcategories in which at least 10 genes were detected). These results suggest that changes in cell envelope structure and in material import/export across the cell envelope are important for biofilm formation. In Tables 2, 3, 4, and 5, upregulated genes in the biofilm community at three or two time points are listed, and in Tables S1–S3, the genes upregulated at only one time point are listed. Notably, the genes for cysteine biosynthesis (Fig. 4, see below for details) and iron-sulfur metabolism (iscR, ycdN, iscA, hscB, ftnA, sitA, sitB) were detected to a high extent. Upregulation of a number of stress response genes (bolA, pspC, uspE, pspA, pspB, osmY, cpxP, osmE, treA/treF) is reasonable, because a number of previous studies have reported that environmental stresses induce biofilm formation (O’Toole and Stewart 2005). In relation to the stress response, a functional cell envelope and peptidoglycan biosynthesis is necessary for cell attachment and biofilm maturation (Dong et al. 2011; Loo et al. 2000); therefore, induction of pgm, amiA, yhcB, prc, rfbC, murC, and dacB expression is assumed to be plausible according to the literature (see Tables 2, 3, 4, and 5). The expression of multiple genes for rRNA methyltransferases (rlmB, rsmH), chromosome partitioning (KEGG K03497, mukB), or leucine biosynthesis (leuD, leuC, lrp) was found to be upregulated, but the reason for this upregulation is unclear.

Table 1 Summary of upregulated genes in the biofilm
Table 2 Upregulated genes in the biofilm community compared to the planktonic community at all the time points
Table 3 Upregulated genes in the biofilm community compared to the planktonic community at 12 and 24 h
Table 4 Upregulated genes in the biofilm community compared to the planktonic community at 12 and 48 h
Table 5 Upregulated genes in the biofilm community compared to the planktonic community at 24 and 48 h
Fig. 4
figure 4

Regulation of cysteine and methionine biosynthesis. The cysteine and methionine biosynthesis pathway is shown. Dark and light red colors indicate genes whose expression is upregulated at least at two time points and one time point, respectively. Dark and light blue colors indicate the downregulated genes at least at two time points and one time point, respectively

The downregulated genes in the biofilm community at all the time points are listed in Table 6, and the downregulated genes at one or two time point(s) are listed in Tables S4–S9. Downregulation of genes involved in the respiratory chain (cyoB, cyoC, nuoI, cyoA, nuoM, cydB/appB), nucleotide biosynthesis (pyrD, purB, purN, preA, uraA, purK, purT, pyrF), fatty acid metabolism (fabF, fadR, fabD, fabH, accA, accD), and DNA repair (mutS, recF, KEGG K03630, dnaE, sulE) was characteristic. Concerning the respiratory chain genes, this result may reflect that the interior of the biofilm is anaerobic. Anaerobic conditions in the biofilm are also suggested by the upregulation of arcB (Table 4), since ArcB represses the genes required for respiratory metabolism and the tricarboxylic acid cycle enzymes. Downregulation of the fatty acid metabolism genes listed above is consistent with upregulation of fadE expression (Table 3) because FadR is a positive and negative transcriptional regulator for fatty acid biosynthesis (fab) and fatty acid degradation (fad) genes, respectively in E. coli (My et al. 2013). We can conclude that fatty acid synthesis decreases during biofilm formation. Repression of DNA repair genes is expected to facilitate replication errors and increase the frequency of spontaneous mutations; since a biofilm acts as the evacuation spot from unfavorable environments, it may be involved in generating new genetic characters and diversity (see “Discussion” section for details). In addition, multiple genes for aminoacyl-tRNA synthase (glnS, glyS, proS, valS; note that serS is exceptionally upregulated) and thiamine biosynthesis (thiG, thiI) were downregulated, but the reason for this downregulation is unclear.

Table 6 Downregulated genes in the biofilm community compared to the planktonic community at all the time points

In concluding this section, many upregulated and downregulated genes were identified, and therefore, planktonic and biofilm communities were revealed to be in different dynamic states. The expression of many genes seemed to be consistent with their description in previous studies (references are indicated in Tables 26 and S1–S9), whereas some genes were newly identified as biofilm-inducible or biofilm-repressible in this study.

Cysteine and methionine biosynthesis

In Tables 2, 3, 4, 5, and 6 and S1–S9, a number of genes that are involved in cysteine and methionine biosynthesis are listed. The cysteine and methionine biosynthesis pathway is depicted in Fig. 4, and the regulation of individual genes upon biofilm formation is also shown. This figure shows that cysteine and methionine biosynthesis is upregulated and downregulated, respectively. Although the reason for this result is unclear, similar positive and negative effects of these amino acids on biofilm formation have been described earlier (Gnanadhas et al. 2015; Singh et al. 2015; Xu et al. 2014).

Regulation of flagellar genes

Previous studies have shown that the flagella play important roles in biofilm formation in both gram-negative and gram-positive bacteria (Belas 2014). The heat map presented in Fig. 5 shows the expression pattern of flagellar genes. In general, at the early stage of biofilm formation, many genes were upregulated, whereas in the middle stage, the appearance of downregulated genes was remarkable. At the late stage, the expression of genes that were downregulated at the middle stage seemed to be restored. This result is consistent with those of previous studies (Monds and O’Toole 2009; Okuda et al. 2012; Watnick and Kolter 2000). It is generally considered that at the early stage, flagella are necessary to sense and attach to solid surfaces and during biofilm growth, the flagella become obstructive as they destabilize biofilm structure. After biofilm maturation, it is assumed that flagella formation and flagella-associated motility again gains importance for escaping from the biofilm structures and to find new solid surfaces.

Fig. 5
figure 5

Heat map of expression pattern for flagella-related genes. The 29 genes indicated were detected out of 37 genes in the KEGG pathway KO02040 (Flagellar assembly) from the RNA-seq data. The heat map legend at the bottom indicates the color values corresponding to transcription levels, with upregulation shown in red and downregulation in green

Discussion

From the metatranscriptome data (Tables 5, 3, 4, 5, and 6 and S1–S9), we speculated that the iron-dependent signaling mechanism triggers cells to form a biofilm and, as a consequence, genes related to stress response, cell envelope, flagella, and many others are regulated. Iron is reported to be important for Pseudomonas bacteria, both as a positive signaling molecule for biofilm formation and as a structural stabilizer for the biofilm matrix (Balaban 2008). In E. coli, iron-sulfur homeostasis and Fe-S cluster assembly are signals for biofilm formation (Roche et al. 2013; Wu and Outten 2009). These facts raise the question of why iron and sulfur are the chosen signaling molecules for biofilms. In primordial Earth, iron and sulfide were abundant, and the Fe-S cluster is thought to be one of the oldest and most versatile inorganic co-factors (Ayala-Castro et al. 2008; Lill 2009). In addition, Stoody and co-workers proposed the possibility that biofilms were the main form in which microbial cells existed in primordial Earth and that planktonic cells appeared much later during the evolutionary process (Stoodley et al. 2002). Taken together, it is natural that primitive microbes incorporated iron and sulfur in many of their cellular functions including biofilm formation and still utilize them as signaling molecules for transition between the biofilm and planktonic states.

Antimicrobials are widely used globally, but they give bacteria the opportunity to build resistance, allowing the emergence of resistant bacteria. Moreover, cells within biofilms are frequently tolerant to ordinary antimicrobials (Carvalhais et al. 2015; Stoodley et al. 2002). Therefore, the development of anti-biofilm drugs that do not inhibit cell growth is desirable. For some bacteria, iron chelators such as deferoxamine, lactoferrin, and 2,2′-bipyridyl are known to function as anti-biofilm drugs (Banin et al. 2005; Moreau-Marquis et al. 2009; Weinberg 2004). However, iron has a varied effect on bacterial biofilm formation; low iron concentration has a negative effect in Pseudomonas, E. coli, and Vibrio cholerae (Balaban 2008; Wu and Outten 2009), whereas a positive effect of iron is observed in Staphylococcus aureus, Acinetobacter baumannii, and Streptococcus mutants (Balaban 2008). These observations suggest that, when removing undesired biofilms through manipulation of iron concentration, investigating the effect of iron on each biofilm is important. In this context, the metatranscriptome analysis approach described here would be effective for determining the effect of iron. The metatranscriptome data obtained in this study implied that iron had a positive effect on our biofilm model. Our preliminary experiments have shown that addition of 2,2′-bipyridyl to the starting culture indeed reduced the amount of biofilm without reducing bacterial growth (data not shown).

The targets of anti-biofilm drugs are not limited to iron and sulfur metabolism, and all genes regulated in the biofilms can be potential targets. For example, the expression of treA/treF was upregulated in our biofilm (Table 5), and the enzymatic activity of its gene product, α,α-trehalase, is known to be inhibited by casuarine or deoxynojirimycin (Cardona et al. 2009; Forcella et al. 2010). Similar candidates for anti-biofilm drugs are listed in Table S10, and we are now trying to develop novel tailor-made and target-oriented anti-biofilm drugs.

Downregulation of multiple DNA repair genes in the biofilm was observed in this study. Consistent with our result, it has been reported that P. aeruginosa cells within the biofilm community show up to 100-fold increased mutation frequency compared to that in corresponding planktonic cells (Driffield et al. 2008; Luján et al. 2011; Oliver et al. 2000). In addition, bacteria possess a mechanism of stress-induced mutagenesis in which error-prone DNA polymerases are upregulated and error-correcting enzymes are downregulated under stress conditions (Al Mamun et al. 2012; Foster 2007). These facts suggest that biofilm formation is a means for adaptive evolution and generating phenotypic diversity. This idea is further supported by the fact that many biofilm structures contain extracellular DNA, which is a source of genes in horizontal gene transfer (Dominiak et al. 2011; Madsen et al. 2012) and that Streptococcus mutans cells grown in biofilms have a 10- to 600-fold higher rate of natural transformation than planktonic cells (Li et al. 2001).

In addition to cysteine, methionine, and leucine biosynthesis genes (see the “Results” section), glutamate (gdhA, Table 4, upregulated), histidine (hisH, Table 6, downregulated), proline (proB, Table S5, downregulated), and arginine (argE, Table S6, downregulated) biosynthesis genes were identified as biofilm-regulated genes. The positive effects of cysteine and leucine and the negative effects of methionine and arginine on biofilm formation are in good agreement with observations from previous studies (Kolderman et al. 2015; McFarland et al. 2008; Noothalapati Venkata et al. 2011; Singh et al. 2015). The mechanism involved in the effect of amino acids on biofilm formation is unclear, but they are attractive anti-biofilm drugs because of their safety for human health. Intriguingly, high levels of leucine were found to accumulate within microcolonies in E. coli biofilms (Noothalapati Venkata et al. 2011), implying that leucine is a functional component of biofilm structures or a signaling molecule for biofilm formation. Upregulation of lrp (encoding leucine-responsive regulatory protein, Table 4) supports the signaling molecule hypothesis, because Lrp activates transcription from many fimbrial genes that depend on leucine (Noothalapati Venkata et al. 2011).

EPS is a key component for maintaining biofilm architecture, possibly acting as an intercellular cement. Quorum sensing positively regulates EPS production and facilitates the development of mixed species populations (Madsen et al. 2012). LsrB (Table 5, upregulated) is a subunit of the transporter complex for the autoinducer-2 molecule, which is produced by many bacterial species and is one of the quorum-sensing signaling molecules (Shao et al. 2007). In addition, PssA (Table 4, upregulated) catalyzes the first step of EPS synthesis (Chen et al. 2014), and a pssA mutation in Rhizobium leguminosarum results in complete inability to form biofilms (Russo et al. 2006). Collectively, we conclude that EPS plays an important role in our multi-species biofilm community. However, the EPS produced by some bacteria is also known to interfere with biofilm formation by other bacteria, and some bacteria produce enzymes to degrade quorum-sensing molecules possibly to interfere with the inclusion of unfavorable bacteria in the biofilm (Madsen et al. 2012). Since the generic compositions of planktonic and biofilm communities were almost indistinguishable in this study (Fig. 1b), it seemed that there were no such interference mechanisms in this case.

Phosphotransacetylase (Pta, Table 6, downregulated at all the time points) and acetate kinase (AckA, Table S3, upregulated only at 48 h) are enzymes that convert acetyl-CoA to acetyl-phosphate and acetyl-phosphate to acetate, respectively. The role of these proteins on biofilm formation has been discussed in many bacteria, but both positive and negative effects have been reported. A consensus view is that acetyl-phosphate functions as a signaling molecule through two-component response regulators (possibly, as a phosphate donor) and its intracellular level influences biofilm formation (Kim et al. 2015; Wolfe et al. 2003). In E. coli, the acetyl phosphate level affects the expression of about 100 genes including those related to flagella synthesis, type 1 pili, capsule, and stress effectors (Wolfe et al. 2003). In our biofilm community, the intracellular acetyl-phosphate level is expected to be lower than that in the planktonic community. Many genes for two-component response regulators are listed in Tables 2, 3, 4, and 5 and in S1–S9, and thus, the expression of these genes may be regulated by acetyl-phosphate. As discussed above, iron, leucine, and autoinducer-2 may also act as signaling molecules for biofilm formation. Studies on the interplay between these signaling molecules are the next important issue.

For upregulated and downregulated genes, it is necessary to determine expression dynamics accurately using quantitative real-time RT-PCR. However, quantification of these genes according to KO is difficult, because multiple orthologous genes are assigned to one KO and a universal primer set to amplify all the orthologous genes is not designed. One solution is quantifying each orthologous gene separately and combining the results. In addition, quantifying the amount of proteins expressed from the upregulated and downregulated genes is important in the future.

The multi-species biofilm has been poorly studied so far, because it is too complex to understand overall activities and functions. However, recent advances in next-generation sequencer have enabled us to study the multi-species biofilm as a system without investigating individual microbial cells. There is, however, still one hurdle; examples of multi-species biofilm in nature, marine sediments, chronic wounds, and dental plaques are well known (Si et al. 2015; Yang et al. 2011), but none of these biofilms can be maintained easily, safely, and over a long duration. Therefore, we believe that a study on reproducing multi-species biofilms in the laboratory is important to quantitatively and repeatedly compare planktonic and biofilm samples. The biofilm model used in this study may differ from the original biofilm because of its culture in rich media (Fig. 1b). It is important to establish model biofilms wherein microbial populations are maintained in a nature-identical manner for deeper understanding of biofilms and for designing tailor-made anti-biofilm drugs.