Introduction

Biofilms are surface-attached aggregates of bacterial cells that are embedded in a self-produced matrix known as extracellular polymeric substances, which are mainly composed of exopolysaccharides (EPS), extracellular proteins, lipids and extracellular DNA [1, 2]. Biofilm matrix protects the embedded microbial cells from stress conditions such as the usage of disinfectants and antibiotics [2]. Biofilm is generally formed through four basic stages: initial attachment, biofilm development, maturation stage and dispersal stage [1]. Each stage is strictly regulated by complex regulatory processes such as quorum sensing (QS), cyclic di-GMP (c-di-GMP) and small non-coding RNA (sRNA) [1].

Vibrio parahaemolyticus, the leading cause of bacterial seafood-borne gastroenteritis, is widely found in marine ecosystems. Human infections are usually caused by consumption of raw or undercooked seafood [3]. V. parahaemolyticus not only produces various virulence factors, including thermostable direct hemolysin (TDH), TDH-related hemolysin (TRH), type III secretion system 1 (T3SS1), T3SS2, type VI secretion system 1 (T6SS1), and T6SS2, but also has a strong ability to form biofilms on the surface [4, 5]. Biofilm formation requires specific structures, including flagella, type IV pili, and EPS [5,6,7,8]. Flagella are required for the initial stages of biofilm formation, and the polar flagellum deficient strains failed to form mature biofilms [5, 8]. V. parahaemolyticus expresses two kinds of type IV pili, mannose-sensitive haemagglutinin type IV pili (MSHA) and chitin-regulated pili (ChiRP), that contribute to surface attachment and agglutination [7]. EPS is the most prevalent component of biofilms that holds bacterial cells together on the surface [5]. The cpsA-K and scvA-O gene clusters are responsible for EPS synthesis in V. parahaemolyticus that are required for the mature biofilm formation [5, 9, 10]. In addition, other substances, including extracellular proteins, extracellular DNA, and membrane fusion proteins (Mfp), also play roles in biofilm formation [6, 8].

Biofilm formation by V. parahaemolyticus is regulated by a number of factors, including positive regulators such as OxyR [11], ToxR [12], RpoN [13], CpsQ [14], and AphA [15], negative regulators such as OpaR [16], QsvR [8], RobA [17], and VP0610 [18], environmental cues such as temperature [19] and salinity [20], cyclic nucleotide messengers such as c-di-GMP [5, 21], and sRNAs such as Qrr1-5 [22]. Moreover, comparative transcriptome analysis also revealed that there were significant differences in the expression levels of 956 genes in biofilm cells compared with planktonic cells in V. parahaemolyticus, including many putative regulatory genes and biofilm-associated genes [23]. In addition, biofilm cells and planktonic cells are interchangeable in the processes of biofilm formation [2]. Therefore, clarifying the gene expression profiles in both biofilm cells and planktonic cells during biofilm formation is helpful to understand the mechanisms of biofilm formation.

In this work, total RNA of V. parahaemolyticus RIMD2210633 were extracted simultaneously from biofilms and planktonic fractions for the RNA sequencing (RNA-seq) analysis to compare the gene expression profiles during biofilm formation. The data showed that the significantly differentially expressed genes (DEGs) were involved in multiple cellular pathways, including motility, virulence, biofilm formation, metabolism, and regulation.

Materials and Methods

Bacterial Growth Conditions

V. parahaemolyticus RIMD2210633, a pandemic O3: K6 strain isolated from a diarrhoea patient in 1996 [24, 25], was used in this study. A total of 20 μl glycerol stock of bacterial cells was inoculated into 5 ml of 2.5% (w/v) Bacto heart infusion (HI; BD Bioscience, USA) broth and incubated at 37 °C with shaking at 200 rpm for 12 h. The cell culture was diluted 50-fold into 2 ml of fresh Difco marine (M) broth 2216 (BD Biosciences, USA) in a 24-well cell culture plate and then grown at 30 °C with shaking at 150 rpm.

Crystal Violet (CV) Staining

CV staining assay was performed similarly as previously described [26]. Briefly, V. parahaemolyticus RIMD2210633 was culture in M broth at 30 °C with shaking at 150 rpm for 6, 12, 24, 48, 60 and 72 h, respectively. The media containing the planktonic cells were collected for absorbance measurements at OD600. The adherent biofilms were gently washed three times with 2 ml of deionized water, followed by incubated at 80 °C for 15 min to fix the attached biofilm cells. The attached cells were stained with 2 ml of 0.1% CV for 15 min, and then washed three times with 2 ml of deionized water. The bound CV was dissolved with 20% acetic acid, and the OD570 values were measured as an index of CV staining. Relative biofilm formation was calculated as OD570/OD600.

RNA Sequencing (RNA-seq)

V. parahaemolyticus RIMD2210633 was incubated at 30 °C with shaking at 150 rpm for 6, 12, 24 and 48 h (here termed as B6, B12, B24 and B48), respectively, with three biological replicates at each time point. Bacterial cells were harvested simultaneously from biofilms and planktonic fractions for the preparation of total RNA, which were extracted using TRIzol Reagent (Invitrogen, USA). One RNA sample was prepared from each biological replicate at each time point, with a total of 12 samples. RNA concentration was measured using a Nanodrop 2000, and RNA integrity was evaluated using agarose gel electrophoresis. RNA amount in each sample was required to be greater than 2 μg with OD260/OD280 value between 1.8 and 2.2. The rRNA removal and mRNA enrichment were performed using an Illumina/Ribo-Zero™ rRNA Removal Kit (bacteria) (Illumina, USA) according to the manufacturer’s instructions. All RNA-related manipulations were performed in GENEWIZ Biotechnology Co. Ltd. (Suzhou, China). cDNA sequencing was performed on an Illumina Hiseq platform [23].

Raw RNA-seq data were filtered using Cutadapt (v1.9.1) [27] to remove adapters, contamination and low quality reads, and alignment of filtered reads was performed using Bowtie2 (v2.2.6) [28] with V. parahaemolyticus RIMD2210633 as the reference genome. Gene expression in bacterial cells harvested at 12, 24 and 48 h (test groups) were compared with that in bacterial cells harvested at 6 h (reference group). HTSeq (v0.6.1) [29] and FPKM (Fragments Per Kilo bases per Million reads) [30] methods were applied to calculate gene expression. DESeq (v1.6.3) [31] of the Bioconductor package was used to analyze the difference in gene expression between the test and reference groups with selection criteria of qvalue (fdr, padj) ≤ 0.05 and absolute log2 fold change (|log2 FC|) ≥ 1. Gene Ontology (GO) functional annotation was performed to analyze DEGs involved in molecular functions, cellular components, and biological processes [32]. The Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analysis was applied to analyze DEGs involved in the main metabolic pathways [33]. The functions of proteins encoded by DEGs were predicted by the Cluster of Orthologous Groups of proteins (COG) database [34].

Quantification of c-di-GMP

Quantification of c-di-GMP was performed as previously described [35]. Briefly, the planktonic and biofilm cells of V. parahaemolyticus RIMD2210633 were harvested at different time points (6, 12, 24 and 48 h), resuspended in 2 ml ice-cold PBS, incubated at 100 °C for 5 min, and then sonicated for 15 min (power 100%, frequency 37 kHz) in an ice-water bath. The supernatant containing c-di-GMP was collected, and the pellet was resuspended in 2 ml ice-cold PBS and re-extracted another two times. Intracellular c-di-GMP levels were determined using a c-di-GMP Enzyme-linked Immunosorbent Assay (ELISA) Kit (Mskbio, China). Total proteins in the supernatant were determined using a Pierce BCA Protein Assay kit (ThermoFisher Scientific, USA) according to the manufacturer’s instructions. Concentration of c-di-GMP was expressed as pmol/g of protein. The experiment was performed three independent times with three biological replicates in each. The results were expressed as the means ± standard deviation (SD). A two-way ANOVA with Tukey’s post hoc corrections for multiple comparisons was used to calculate statistical significance, with a P value less than 0.05 considered significant.

Validation of RNA-seq Data Using Reverse Transcriptase Quantitative PCR (RT-qPCR)

RT-qPCR was performed as previously described [36]. Briefly, total RNA was extracted from the planktonic and biofilm cells of V. parahaemolyticus RIMD2210633, which were harvested from different time points (6, 12, 24 and 48 h), using TRIzol Reagent. cDNA was generated from 1 μg of total RNA using a FastKing First Strand cDNA Synthesis Kit (Tiangen Biotech, China) according to the manufacturer’s instructions. The relative mRNA levels of each target gene were determined using the classic 2−ΔΔCt method. The 16S rRNA gene was used as the internal control. Primers used in this work are listed in Supplementary Table 1. The experiment was performed three independent times with three biological replicates in each. The results were expressed as the means ± SD. A two-way ANOVA with Tukey’s post hoc corrections for multiple comparisons was used to calculate statistical significance, with a P value less than 0.05 considered significant.

Results and Discussion

Biofilm Formation by V. parahaemolyticus

Bacterial biofilm is generally formed through several steps [2]. Previously, the processes of biofilm formation by V. parahaemolyticus were demonstrated to be divided into three main stages: bacterial attachment and growth stage, biofilm maturation stage, and biofilm diffusion stage [37]. Here, the data showed that biofilm formation by V. parahaemolyticus, measured by CV staining, increased gradually from 6 to 48 h, but decreased from 48 to 72 h (Fig. 1). However, the biofilm formation speed from 6 to 24 h was much faster than that from 24 to 48 h, and the maximum amounts of biofilm occurred at 48 h (Fig. 1). These results are consistent with previous observations [37]. Therefore, although we did not use other methods such as confocal laser scanning microscopy to further observe the development of biofilm formation, we still concluded that the biofilm growth stage might occur between 6 and 36 h, the maturation stage was around 48 h, and the diffusion stage probably appeared after 48 h.

Fig. 1
figure 1

Crystal violet staining assays of time course of biofilm formation by V. parahaemolyticus. Experiment was performed three independent times with three biological replicates in each. The results are expressed as the mean ± standard deviation (SD). The asterisks indicate the statistical significance between biofilms at 12, 24, 48, 60 or 72 h and that at 6 h

Overview of Gene Expression During Biofilm Formation

A total of 12 Illumina libraries were sequenced with more than 14.1 million raw reads for each library, which were deposited in the Sequence Read Archive (accession number: PRJNA874225). The correlations between the transcriptomes of all samples were notably high, with the Pearson correlation coefficient values ranging from 0.919 to 1.000 (Supplementary Figure S1). A total of 183, 503 and 729 DEGs were identified in the samples from 12, 24 and 48 h, respectively, compared with that from 6 h (defined here as regulon 12, 24, and 48 h, respectively), including 82, 208 and 321 downregulated genes and 101, 295 and 408 upregulated genes (Fig. 2a–c). Comparisons between the three regulons showed that 133, 99, 334 and 92 genes with various predicted functions coexist in the regulons 12 and 24 h, 12 and 48 h, 24 and 48 h, and 12, 24 and 48 h, respectively (Fig. 3 and Table 1). The KEGG pathway enrichment analysis demonstrated that the regulon 12 h contained 10 organismal system genes, 118 metabolism genes, 12 human disease genes, 12 environmental information processing genes and 17 cellular process genes (Fig. 2d); the regulon 24 h contained 12 organismal system genes, 273 metabolism genes and 11 human disease genes (Fig. 2e); whereas the regulon 48 h contained 4 organismal system genes, 330 metabolism genes, 17 human disease genes and 54 environmental information processing genes (Fig. 2f). The results of COG enrichment showed that DEGs were divided into approximately 20 functional categories, including several metabolism-associated pathways such as amino acid transport and metabolism, inorganic ion transport and metabolism, lipid transport and metabolism, and coenzyme transport and metabolism (Supplementary Figure S2). The detailed descriptions of DEGs are listed in Supplementary Tables S2, S3 and S4.

Fig. 2
figure 2

Gene expression in different stages of biofilm formation. ac Volcano plot shows genes that are upregulated (red), downregulated (blue) and non-changed (grey), respectively. df Pathways of differentially expressed genes analyzed by Kyoto Encyclopedia of Genes and Genomes (KEGG). The number on the right of each bar indicates the number of differentially expressed genes (Color figure online)

Fig. 3
figure 3

Venn diagram of V. parahaemolyticus gene expression profiles during biofilm formation

Table 1 Simultaneously differentially expressed genes in the three regulons

Validation of RNA-seq Data by RT-qPCR

The RT-qPCR assays were employed to validate the results of RNA-seq. Seventeen genes were randomly selected as the target genes. As shown in Table 2, RT-qPCR revealed a consistent trend with the RNA-seq data, confirming the reliability of the transcriptome data.

Table 2 Validation of RNA-seq data by qPCR

DEGs Involved in Motility and Chemotaxis

V. parahaemolyticus expresses dual flagellar systems, a sheathed polar flagellum for swimming in liquids and unsheathed lateral flagella for swarming over surfaces [38]. Polar flagellar genes are organized on chromosome 1, whereas lateral flagellar genes are found on chromosome 2 [39, 40]. Polar flagellar genes are expressed continuously, whereas lateral flagellar genes are induced by certain conditions [40, 41]. In addition, flagella-propelled motility is also controlled by various regulators. AphA and OpaR are the master QS regulators operating at low cell density (LCD) and high cell density (HCD), respectively [42, 43]. AphA activates the transcription of polar and lateral flagellar genes at LCD, whereas OpaR represses their transcription at HCD [44,45,46]. ToxR activates both swimming and swarming motility of V. parahaemolyticus [12]. Swimming motility is activated by VapH [47] and OxyR [11], whereas swarming motility is induced by LafK [48], SwrT [49] and ScrABC [50]. Moreover, swarming motility is also repressed by SwrZ [49], H-NS [51] and CalR [52].

In this study, the RNA-seq data showed that expression of 16, 10 and 5 polar flagellar genes including flagellin genes were downregulated in the bacterial cells incubated for 12, 24 and 48 h, respectively, compared to those incubated for 6 h (Supplementary Tables S5, S6 and S7). Flagellar filaments, which consist of self-assembling flagellin, act as propellers to drive motility in liquids [38]. Six polar flagellin genes, flaA, flaB, flaC, flaD and flaE, were found in the polar flagellar gene loci [39]. flaA, flaB and flaD contribute more to the polar flagella filament [53]. However, deletion of any of the six flagellin genes has little or no effect on swimming motility of V. parahaemolyticus [53]. Three polar flagellin genes, flaB, flaD and flaE, were continuously downregulated from the initial attachment stage to biofilm maturation stage (Table 1). These results were consistent with the fact that flagella-mediated motility is required for the initial stages of biofilm formation [5].

In contrast, the RNA-seq data demonstrated similar expression levels of most lateral flagellar genes between the different stages of biofilm formation (data not shown). Only one gene (flgF) encoding the lateral flagellar basal body rod protein was downregulated in the regulon 24 h (Supplementary Table S6). It is generally understood that transcription of flagellar gene is inhibited during the motility-to-biofilm transition [54]. The method of collecting bacterial cells may affect the results of RNA-seq, but the mechanisms behind the observation that the lateral flagellar genes were stably transcribed during biofilm formation need to be further investigated.

A total of 3 (VP0421, VP0183 and VP2827), 5 (VP2159, VP2225, VP2229, VP2827 and VPA1449), and 8 (VP0963, VP1486, VP2225, VP2629, VP2827, VPA0024, VPA0941 and VPA0596) genes involved in chemotaxis were detected in the regulons 12, 24 and 48 h, respectively (Supplementary Tables S5, S6 and S7). All of the putative chemotaxis genes were significantly downregulated during biofilm formation except for VPA0491 and VPA0596. In addition, VP2827 encoding a potential methyl-accepting chemotaxis protein was continuously downregulated during biofilm formation (Table 1). However, the roles of these chemotaxis genes are still unknown in V. parahaemolyticus.

DEGs Involved in Biofilm Formation

The extracellular matrix is a basic ingredient of bacterial biofilms that plays multiple roles in bacterial biofilms [55]. The biofilm matrix predominantly consists of EPS, but also contains extracellular proteins, extracellular DNA and lipids [55]. However, a previous study showed that biofilm formation by V. parahaemolyticus was strongly positively correlated with the contents of extracellular DNA and proteins, but not with that of EPS [6]. The presented data also demonstrated similar expression levels of EPS biosynthesis-associated genes during biofilm formation (data not shown).

Four MSHA-associated genes, VP700, VP2701, VP2702 and VP2703, were downregulated in the maturation stage (48 h) compared with the initial attachment and biofilm development stages (Supplementary Table S7). However, the ChiRP genes were similarly expressed between the different stages of biofilm formation (data not shown). MSHA and ChiRP are positively correlated with V. parahaemolyticus biofilm formation [7]. Defective biofilms formed by MSHA mutants can be compensated for over time [8], which may aid to explain why the expression levels of MSHA genes were decreased at the maturation stage. MSHA also functions as an adherence factor of V. parahaemolyticus to Caco-2 cells, and thus may act as a potential virulence factor [56]. As one of the important factors affecting biofilm formation by V. parahaemolyticus, expression of type IV pili was regulated by several factors, including chitin and QS [57, 58], but its regulatory mechanisms are still far from being elucidated.

The VP0215-0237 locus is related to capsule polysaccharide (CPS) production [9]. V. parahaemolyticus undergoes opaque (OP) and translucent (TR) variation on agar plate based on whether CPS is produced [9, 59]. CPS acts as a negative effector of biofilm formation, but both OP and TR strains can form stable biofilms with different architectures [8, 60]. In this work, two CPS genes, rfbC and wcvB, were downregulated in the maturation stage (48 h) compared with the initial attachment and biofilm development stages (Supplementary Table S7), suggesting that CPS synthesis was repressed in the biofilm maturation stage. OpaR represses biofilm formation by V. parahaemolyticus but activates CPS gene expression [16, 61]. Thus, regulation of CPS gene expression may be one of the most important mechanisms adopted by V. parahaemolyticus to regulate biofilm formation.

The changes of c-di-GMP levels in bacteria influence multiple cellular pathways, including biofilm formation and virulence factor production [62]. c-di-GMP is catalyzed from guanosine triphosphate by diguanylate cyclase (DGC) containing a GGDEF domain, and is degraded by phosphodiesterase (PDE) carrying HD-GYP or EAL domain [62]. V. parahaemolyticus expresses more than 50 proteins that may be involved in c-di-GMP metabolism [24], and several of them, including the GGDEF-EAL-type proteins, ScrG [63] and ScrC [64], GGDEF-type proteins, ScrO [65], GefA [66], ScrJ [67] and ScrL [67], and EAL-type protein, LafV [67], have been confirmed to affect c-di-GMP metabolism, motility and biofilm formation. In this study, a total of 5 genes encoding the GGDEF and/or EAL-type proteins were significantly differentially expressed during biofilm formation (Supplementary Tables S6 and S7). Of these, VP2366 (GGDEF-type) and VPA1735 (GGDEF-EAL-type) were downregulated in regulon 24 h; VP1979 (EAL-type) and VPA0609 (GGDEF-EAL-type) were upregulated in regulon 48 h; VPA0360 (GGDEF-type) was simultaneously upregulated in regulons 24 and 48 h. Furthermore, there was a significant drop in intracellular c-di-GMP levels at the beginning of biofilm formation (Fig. 4), and further work is needed to understand the complexities of how c-di-GMP levels are governed during biofilm formation.

Fig. 4
figure 4

Intracellular c-di-GMP levels of V. parahaemolyticus. Bacterial cells were harvested after 6, 12, 24 and 48 h of incubation at 30 °C with shaking at 150 rpm in a 24-well cell culture plate. The results are expressed as the mean ± SD of three independent experiments. Asterisks represent statistically significant differences (P < 0.05)

The RNA-seq data also showed that mfpB and mfpC were significantly upregulated in regulon 24 h (Supplementary Table S6). mfpB and mfpC, transcribed from the mfpABC operon, are required for the mature biofilm formation of V. parahaemolyticus [8, 68]. Transcription of mfpABC was activated by OpaR [68], CalR [69], QsvR [70] and H-NS [71], but repressed by AphA [68]. Most importantly, OpaR, QsvR, H-NS and AphA are all involved in the regulation of biofilm formation [8, 46, 71, 72]. Therefore, regulation of mfpABC expression should be one of the mechanisms adopted by transcriptional regulators to regulate biofilm formation by V. parahaemolyticus.

DEGs Involved in Virulence

V. parahaemolyticus RIMD2210633 expressed a number of virulence determinants, including TDH, T3SS1, T3SS2, T6SS1, and T6SS2 [24]. TDH contributes to the hemolytic activity, mice lethal toxicity, enterotoxicity and cytotoxicity of V. parahaemolyticus, and can cause β-hemolysis on Wagatsuma agar [73]. T3SS1 is involved in the cytotoxicity, whereas T3SS2 is a major contributor to enterotoxicity and has cytotoxic effects on a limited number of cell lines [74, 75]. The T3SS2 gene cluster and the two copies of tdh genes are located on an 80 kb pathogenicity island on chromosome II, known as Vp-PAI (VPA1312-1398) [24]. T3SS2 delivers TDH from V. parahaemolyticus as an effector into host cells, resulting in the intestinal fluid accumulation in a rabbit diarrhoeal model [76]. T6SS1 is a contributor to the antibacterial activity of V. parahaemolyticus, whereas T6SS2 is an adherence factor of the bacterium [77, 78]. In addition, other factors such as lipopolysaccharide, proteases and outer membrane proteins (OMPs) are also involved in the pathogenesis of V. parahaemolyticus [4, 79].

The RNA-seq data showed that T3SS1 was consistently downregulated during biofilm formation (Supplementary Tables S5, S6 and S7). One, thirteen, and twenty-five T3SS1 genes were included in regulons 12, 24, and 48 h, respectively. Of these, VP1680 encoding the T3SS1 effector VopQ [80] was downregulated continuously (Table 1). In particular, the regulon 48 h contains exsC, exsD and exsE (Supplementary Table S7) encoding proteins that belong to the ExsACDE regulatory cascade. ExsA, an AraC-type transcriptional regulator, activates T3SS1 gene expression [81, 82]. ExsD binds ExsA to prevent ExsA-dependent activation of T3SS1, whereas ExsC binds ExsD to block the binding of ExsD to ExsA [83]. ExsE is a secreted substrate of T3SS1 and interacts with ExsC to prevent the binding of ExsC to ExsD, and thereby negatively regulating T3SS1 [84, 85]. Some regulators including H-NS, OpaR, CalR, QsvR, AphA, and HlyU control T3SS1 expression by directly regulating exsACDE [86,87,88,89]. Moreover, ExsA represses lateral flagellar gene expression and swarming motility [52], suggesting that the ExsACDE cascade may be also involved in the regulation of biofilm formation.

The tdh gene encoding TDH and the vscS2 gene encoding a putative type III secretion apparatus protein were significantly downregulated and upregulated, respectively, in the bacterial cells incubated for 24 h compared to those incubated for 6 h (Supplementary Table S6). In addition, seven genes located in Vp-PAI were significantly differentially expressed in the bacterial cells incubated for 48 h compared to those incubated for 6 h. Of these, VPA1342 encoding the EscR/YscR/HrcR family type III secretion system export apparatus protein was downregulated, and the other six genes (VPA1319, VPA1376, VPA1387, VPA1388, VPA1396 and VPA1379) were upregulated (Supplementary Table S7). These results suggested that the Vp-PAI genes were differentially expressed during biofilm formation, with downregulation of the virulence-related T3SS2 and TDH genes, and upregulation of the other genes.

Each regulon contains several of T6SS1 and T6SS2 genes, which were all downregulated (Supplementary Tables S5, S6 and S7). In particular, a total of 17 T6SS2 genes were downregulated in the regulon 48 h (Supplementary Table S7). T6SS plays multiply physiological roles including bacterial competition, ion transport, host infection, and stress response [90]. T6SS also participates in bacterial biofilm formation. For example, expression of T6SS were remarkably enhanced in biofilm cells of Pseudomonas fluorescens, and the lack of T6SS impeded biofilm formation [91, 92]. The T6SS gene mutants of Acidovorax citrulli produced less biofilms than the wild-type strain [93]. The phosphatase PppA, which is encoded in the T6SS gene cluster of V. alginolyticus, negatively regulates c-di-GMP production and biofilm formation [94]. V. cholerae T6SS is still active within biofilms that kill bacteria lacking T6SS within the interior of biofilm populations [95]. However, whether V. parahaemolyticus T6SSs play roles in biofilm formation remain unclear.

Extracellular proteases can digest host proteins into amino acids, aiding the spread of V. parahaemolyticus into host tissues when wound infection occurs [79]. As shown in Supplementary Table S5, vppC encoding a zinc metalloprotease and prtA encoding a serine protease were up- and down-regulated in the regulon 12 h, respectively. In addition, VPA1071 encoding another serine protease was downregulated in the regulon 24 h (Supplementary Table S6). VppC, which is secreted during the early stationary phase at 26 °C, digests native collagen [96, 97]. PrtA, which is secreted during the late-log phase, contributes to hemolytic activity and cytotoxicity of V. parahaemolyticus [98]. However, roles of VPA1071 are completely unknown.

OMPs can strongly elicit the host immune response and thus can be good targets for vaccine against bacterial infections [99]. However, some OMPs such as OmpF and YfiB also affect biofilm formation and virulence of bacteria [100, 101]. In this work, the data showed that one, six and twelve OMP genes were significantly differentially expressed in regulons 12, 24 and 48 h, respectively (Supplementary Tables S2, S3 and S4), suggesting that OMPs were remodeled during biofilm formation. Of these, VP1008, VP1634, VP1901, VP2212, VP2467, ompW, VPA0860 and VPA1644 were downregulated, while VP0802, VP1218, VP1631, aqpZ, VP2362 and VP2385 were upregulated. However, whether these genes play roles in biofilm formation needs to be further investigated.

DEGs Involved in Regulatory Functions

Five, nineteen and twenty-nine putative regulatory genes were significantly differentially expressed in regulons 12, 24 and 48 h, respectively (Supplementary Tables S5, S6 and S7). Many of them encode global regulators, including the LysR family transcriptional regulators (VPA0602, VPA0387, VPA0388, VPA0717 and VPA0961), MarR family transcriptional regulators (VPA1219 and OhrR), Fis family transcriptional regulator (VP1704), TetR/AcrR family transcriptional regulator (VP1229), PadR family transcriptional regulator (AphA) and two component regulator systems (PhoR/PhoB and BtsS/BtsR). AphA regulates virulence, biofilm formation and motility of V. parahaemolyticus [15, 44, 46, 87, 102]. OhrR, a sensor of organic peroxides, is involved in the resistance of exogenous cumene hydroperoxide stress [103]. PhoR/PhoB represses biofilm formation and c-di-GMP production but activates motility of V. cholerae [104]. BtsS/BtsR represses curli production and biofilm formation of E. coli [105]. VPA0961 is induced in the low-salt growth condition [106], but its roles are still unknown. In addition, scvE encoding a sigma-54 dependent transcriptional regulator was upregulated in regulon 48 h (Supplementary Table S7). ScvE promotes V. parahaemolyticus biofilm formation via activation of scvA-O transcription [10]. It is noteworthy that VPA1682 was continuously downregulated during biofilm formation (Table 1), suggesting that it may play important roles in biofilm formation. Moreover, many other putative regulators with unknown functions, such as VP0252, VPA0717 and VPA0804, were also significantly differentially expressed during biofilm formation.

A total of 1065 putative small non-coding RNAs (sRNAs) were detected during biofilm formation. Of these, 1049 were cis-encoded and 15 were trans-encoded (Supplementary Table S8). However, whether they truly exist remains to be experimentally confirmed. sRNAs regulate gene expression at the posttranscriptional levels [107]. In V. parahaemolyticus, Qrr1-5 sRNAs control OpaR expression, motility, CPS production and metabolism, but Qrr2 alone is also sufficient for OpaR regulation [22]. The sRNA Spot42 interacts with a sequence encompassing the ribosomal binding site and start codon of VP1682 mRNA to inhibit the expression of VP1682 [108]. V. parahaemolyticus RyhB binds to the 5′-UTR of pvsABCDE mRNA to enhance the expression of vibrioferrin [109]. The two sRNAs, svpa1401.1 and svpa1453.1, which are exclusively present in V. parahaemolyticus pandemic strains, are remarkably decreased at 3 h post-infection of Caco-2 cells, and involved in regulating the expression of VPA1379 [110]. A novel sRNA srvg23535 conserved among V. harveyi, V. parahaemolyticus and V. splendidus can affect the utilization of carbon/nitrogen sources and the response to osmotic and pH stress [111]. More studies should be performed in the future to gain more knowledge on sRNAs in V. parahaemolyticus.

Conclusions and Expectations

In this work, we showed that the growth stage of biofilm formation by V. parahaemolyticus occurred from 6 to 36 h, the maturation stage was around 48 h, and the diffusion stage appeared after 48 h of growth. A total of 183, 503 and 729 genes were significantly differentially expressed in the bacterial cells at 12, 24 and 48 h compared with those at 6 h, respectively. Of these, 92 genes were consistently activated or repressed from 6 to 48 h. The genes involved in polar flagellum, chemotaxis, MSHA, CPS, T3SS1, T3SS2, TDH, T6SS1, and T6SS2 were significantly downregulated during biofilm formation from 6 to 48 h, whereas those involved in Vp-PAI (except for the T3SS2 and TDH genes) and mfp genes were upregulated. Three extracellular protease genes (vppC, prtA and VPA1071) and some OMP genes were also significantly differentially expressed during biofilm formation. Five c-di-GMP metabolism-associated genes were also significantly differentially expressed that may account for the drop in c-di-GMP levels after the beginning of biofilm formation. In addition, many putative regulatory genes were significantly differentially expressed, and more than 1000 putative sRNAs were detected during biofilm formation, suggesting that biofilm formation is tightly regulated by complex regulatory networks. In brief, the RNA-seq data provided a global view of gene expression during biofilm formation, showing that the significantly differentially expressed genes were involved in multiple cellular pathways, including virulence, biofilm formation, metabolism and regulation.