Introduction

Plant cell walls are dynamic structures that determine and maintain the size and the shape of the cells and serve as a protective barrier. The cell walls are highly complex structures composed mainly of polysaccharides that vary in structure, function, and abundance (Santiago et al. 2018). Expansin gene family is important regulators of extension and relaxation of cells in growing tissues through a non-enzymatic activity (Cosgrove 2015). They bind to glucan-coated cellulose in the cell wall, causing a reversible disruption of hydrogen bonds between cellulose microfibrils and the glucan matrix to loosen the cell wall (Sampedro and Cosgrove 2005). Typical expansins in plants have 250–275 amino acids and comprise of two domains preceded by a signal peptide. Domain 1 is homologous to the catalytic domain of proteins in the glycoside hydrolase family 45 (GH45); domain 2 is homologous to group-2 grass pollen allergens, which bind to cell-wall polysaccharides. Expansins in plants is divided into four families, namely α-expansin (EXPA), β-expansin (EXPB), expansin-like A (EXLA), and expansin-like B (EXLB) (Kende et al. 2004). EXPA and EXPB families are the two major subfamilies. EXPAs were identified as mediators of acid-induced wall loosening, while EXPBs include a subset of well-studied proteins known as group-1 grass pollen allergens as well as a larger set of proteins about which we know very little (Sampedro et al. 2015). EXLA and EXLB families are identified only by their conserved amino acid sequences, with far less known about their precise functions. They play important roles in many plant growth and developmental processes by modifying and elongating the cell-wall structure, such as seed development and germination (Chen and Bradford 2000; Chen et al. 2001), root elongation and growth (Lee et al. 2003; Che et al. 2016), stem growth and internode elongation (Takada et al. 1997; Wang et al. 2011), leaf formation and development (Reinhardt et al. 1998; Kuluev et al. 2017), flower development, opening and fertilization (Harada et al. 2011; Castillo et al. 2018), fruit development, ripening, firmness, and weight (Dotto et al. 2006; Xie et al. 2009). These genes could also been associated with nutrient uptake and efficiency (Zhou et al. 2014), abiotic stress (Noh et al. 2009; Zhou et al. 2011), and biotic stress tolerance (Song et al. 2017). Expansins in plant breeding programs present an opportunity to improve crops in various aspects. These include but are not limited to improving germination, leaf size, fruit growth, and ripening and tolerance to abiotic and biotic stresses (Marowa et al. 2016).

Some studies characterized expansins in plant genomes of angiosperms (Arabidopsis, poplar, grape, soybean, apple, Chinese cabbage, rice, and maize) and nonflowering plants (Selaginella moellendorffii and Physcomitrella patens) (Cosgrove 2015). However, systematic analysis of expansins in watermelon has not been reported. Watermelon is an important cucurbit crop grown throughout the world, and the annual world production of watermelon is about 118 million tons (http://faostat.fao.org/). In the Cucurbitaceae family, the genome of watermelon has been sequenced after cucumber and melon (Wechter et al. 2008). The availability of these sequences and large-scale transcriptome data provide an excellent opportunity to investigate watermelon in the field of molecular biology. In this study, we identified the expansin genes in the watermelon genome, analyzed gene structures and phylogenetic relationships, and detected the cis-acting elements in ClEXP gene promoter regions. Publically available and our own RNA-seq dataset were employed to study the expression of the ClEXP genes response to fruit and seed development, and abiotic and biotic stresses. QRT-PCR was used to study the ClEXPs’ tissue expression patterns. Our findings should pave a way for further functional researches on expansin gene family in watermelon.

Materials and methods

Genome-wide identification of expansin genes in watermelon (ClEXPs)

First, expansin protein sequences in Arabidopsis (AtEXPs) (http://www.arabidopsis.org/) were used as tblastn (E values ≤ 1 × 10−5) against the watermelon reference genome sequence (ftp://cucurbitgenomics.org/pub/cucurbit/genome/watermelon/97103/v1/watermelon_v1.genome.gz). The highly matched sequences were reorganized and merged. Then, all protein sequences of putative expansin genes were scanned for conserved domain by InterProScan (http://www.ebi.ac.uk/interpro/). The sequences lacking expansin domains were rejected. Finally, we further verified these sequences using EST database in NCBI (http://www.ncbi.nlm.nih.gov). Identification method of expansin genes in cucumber (CsEXPs) and melon (CmEXPs) was similar to that in watermelon.

Protein properties, chromosomal location, and gene structure analysis

ExPaSy (http://expasy.org/) was used to compute the molecular weights (MWs) and isoelectric point (pI). All expansin genes’ chromosomal locations were obtained from Cucurit Genomics Database (CuGenDB, http://cucurbitgenomics.org/) and then were mapped to the chromosomes by MapInspect Version 1.0 (http://www.softsea.com/review/MapInspect.html) software. The exon/intron structures of ClEXPs were generated using GSDS (http://gsds.cbi.pku.edu.cn) through aligning the coding domain sequences (CDS) and DNA sequences of expansin genes. The conserved motifs were identified using the MEME program (http://meme.sdsc.edu/meme/). The maximum motif search value was set at 15 and an optimum motif width of 10–100 amino acid residues. Other parameters are default.

Sequence alignment, and phylogenetic and homology analysis

The full-length amino acid sequences were aligned with ClustalW (http://www.ebi.ac.uk/Tools/msa/clustalw2/) and a phylogenetic tree was constructed using the neighbour-joining method with 1000 bootstrap replicates by MEGA 7.0 software (Kumar et al. 2016). The expansin genes in watermelon and Arabidopsis were clustered using the entire protein sequences by OrthoMCL software (http://orthomcl.org/orthomcl/; E values < 1 × 10−5) and the homologous relationships were drawn using Circos software (http://circos.ca/).

Analysis of promoter regions cis-acting elements

The promoter regions of the watermelon expansin genes were identified by searching the watermelon genome database, and were analyzed using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) to predict cis-acting elements.

Expression patterns analysis of watermelon expansin gene response to low light

We study watermelon expansin gene expression pattern response to low light stress through RNA-seq technology. The study was conducted in plastic greenhouse at Luhe production base at Jiangsu Academy of Agricultural Sciences. Watermelon variety ‘Sumi No.8’ (Institute of Vegetable Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China) was as the plant materials. Low light (yin) treatment was carried out 7 days before flowering. The light intensity under the sunshade net was 50% less than that under natural conditions (CK). Flesh of watermelon fruit center under yin and CK at 0, 3, 9, and 15 days after pollination (DAP) was harvested for RNA-seq. The 24 cDNA library was sequenced on the Illumina sequencing platform (Illumina HiSeqTM2500) using the paired-end technology. The raw sequence data (in FastQ format) have been submitted to the National Center for Biotechnology Information (NCBI) Sequence Read Archive databases with the study accession (SRP243255). The levels of gene expression were estimated by FPKM values. The raw FPKM values were first normalized by logarithmic method and a heat map was constructed to show the different expression profiles by the Heml software (version 1.0, http://hemi.biocuckoo.org/).

Expression patterns analysis of watermelon expansin gene response to watermelon fruit and seed development and other stress

Cucurbit Genomics Database (CuGenDB, http://cucurbitgenomics.org/) have collected all available RNA-Seq data from NCBI Sequence Read Archive (SRA, https://www.ncbi.nlm.nih.gov/sra) for the cucurbit species that have available reference genome sequences. In addition, a unified pipeline for RNA-Seq data processing and analysis has been applied to these RNA-Seq datasets. Raw RNA-Seq reads are processed to remove adaptor and low-quality sequences, etc. Raw counts are derived for each predicted gene model and normalized to FPKM. Gene expression profiles for a specific gene can be accessed under the ‘Gene Expression’ section of the gene feature page, where after selecting an RNA-Seq project. The average expression values and the standard deviation from biological replicates are calculated for each gene and loaded into CuGenDB using the ‘RNA-Seq’ extension module (Zheng et al. 2018). Therefore, transcriptome sequencing data for watermelon fruit development (SRP012849), seed development (PRJNA319011), root or leaf response to osmotic stress (PRJNA326331), cold stress (PRJNA328189) and drought stress (PRJNA454040), and fruit flesh response to cucumber green mottle mosaic virus (CGMMV) (PRJNA389184) were obtained from CuGenDB using the identified ClEXPs ID.

The levels of gene expression were estimated by FPKM values. The FPKM values were first normalized by logarithmic method and a heat map was constructed to show the different expression profiles by the Heml software (version 1.0, http://hemi.biocuckoo.org/).

Plant materials, RNA isolation, and qRT-PCR analysis

The watermelon variety ‘Sumi No.8’ (Institute of Vegetable Crops, Jiangsu Academy of Agricultural Sciences, Nanjing, China) was used as the plant materials to investigate the expression profiles of expansin genes in different tissues. Root, stem, and leaf samples were collected 1 month after planting and stored at − 80 °C.

RNA was extracted from triplicated biological replicates of the aforementioned samples using RNAprep Pure Plant Kit (TIAGEN, China), and treated with DNaseI (Transgen). The quality of RNAs was examined by agarose gel electrophoresis. The RNA samples were reverse transcribed to cDNAs with TranScript® One-step gDNA Removal and cDNA Synthesis SuperMix (Transgen). The cDNA were then diluted ten times and stored at − 20 °C for the subsequent qRT-PCR.

The watermelon actin gene (Cla007792) was used as an internal control. The primers for 15 randomly selected expansin genes in watermelon were designed by Primer Premier 5.0 software. Primer sequences were listed in Table S1. qRT-PCR assays were performed with three replicates. The SYBR Premix Ex Taq II reagent (Takara, Japan) with SYBR Green I as the fluorescent dye was used for qPCR on an ABI 7300 real-time PCR system (Applied Biosystems, Foster City, CA, USA). Each reaction contained 10 μL 2 × SYBR Premix Ex TaqII Reagent, 2.0 μL cDNA sample, and 500 nM gene-specific primers in a final volume of 20 μL. QPCR conditions were set at 95 °C pre-denaturation for 3 min and followed by 40 cycles of 95 °C denaturation for 15 s, 60 °C annealing for 30 s, and 72 °C extension for 15 s. All the data were analyzed using the \(2^{{ - \Delta \Delta C_{\text{t}} }}\) method, where Ct is the cycle threshold measured according to a previous method (Livak and Schmittgen 2001).

Results

Identification of expansin gene family in watermelon

After filtering and removing the sequences lacking expansin conserved domain, 30 expansin genes with two conserved domains in their protein sequences were identified in the watermelon genome. In accordance with the nomenclature methods for members of expansin superfamily proposed by Kende et al. (2004), expansins of watermelon, melon, and cucumber were sequentially named on the basis of family identity and chromosomal position (Table S2). As shown in Table S3, the identified ClEXP genes encoded proteins ranging from 237 (ClEXLA1) to 434 (ClEXPA11) amino acids (aa) and had predicted MWs of 26.39–45.54 kDa. The majority members of ClEXPA (except ClEXPA18), ClEXPB (except ClEXPB2), and ClEXLA (except ClEXLA7) subfamilies had pI values above 7.0, whereas pI values of all members of ClEXLB were below 7.0.

The chromosomal distribution map of ClEXPs is presented in Fig. 1. All ClEXPs were successfully mapped to the 10 watermelon chromosomes. Chr2 contained the maximum number (10) of ClEXPs genes and all ClEXLA genes members are distributed on this chromosome. The two ClEXLB genes are both located on the Chr9, and two ClEXPBs were distributed on Chr2 and Ch5, respectively. ClEXPAs were unevenly distributed on ten watermelon chromosomes.

Fig. 1
figure 1

Chromosomal distribution of ClEXPs genes. Physical locations of watermelon expansin genes were present on ten watermelon chromosomes. Values on the left of each chromosome represent megabases (Mb). The color column represents the chromosome

Sequence alignment and phylogenetic analysis

To obtain detailed information regarding each ClEXPs gene and superfamily, we completed multiple alignments of the deduced protein sequences of 30 ClEXPs genes (Fig. S1). Most of ClEXPs protein consisted of three parts, namely, a 20–30 amino acid signal peptide, domains 1 and domain 2. Only ClEXPA7 lack the signal peptide region. The amino acid sequence of domain 1 was more conserved than that of domain 2. As reported by Li et al. (2002), the EXPA and EXPB subfamilies could be easily distinguished based on two types of insertions. The α-insertions existed only in the EXPA subfamily and consisted of about 14 residues, with most insertions containing four highly conserved residues at the 3′ end, WCNP. The β-insertions were present among EXPB, EXLA, and EXLB members. All EXPBs and the majority of EXPA members (excluding ClEXPA3, ClEXPA4) contained a conserved HFD motif in domain 1. Among the nine expansin-like protein sequences with an incompletely conserved HFD motif, seven were classified into the EXLA subfamily based on the presence of a characteristic EXLA extension at the C-terminus; two were classified into EXLB subfamily.

To evaluate evolutionary relationships of expansin proteins, complete protein sequences of expansins from Arabidopsis, cucumber, melon, and watermelon, were subjected to phylogenetic analysis. As shown in Fig. 2, all expansin genes were divided into four major subfamilies: EXPA, EXPB, EXLA, and EXLB. EXPA was the largest subfamily, and EXLB was the smallest. The four expansin gene subfamilies were further divided into 17 subgroups according to the grouping rules used for Arabidopsis expansin gene families (Sampedro and Cosgrove 2005). EXPA-V and EXPA-VI subgroups did not contain any watermelon expansin gene. We found that EXLA-I subgroup, containing 23 members, constituted the largest subgroup, while the smallest subgroup, EXPA-VI, comprised only one gene.

Fig. 2
figure 2

Phylogenetic analysis of AtEXPs, CsEXPs, CmEXPs, and ClEXP. The full-length amino acid sequences were aligned with ClustalW and a phylogenetic tree was constructed using the neighbour-joining method with 1000 bootstrap replicates by MEGA 7.0 software. The subgroups are marked by color bars. Red, green, purple, and mazarine represent melon, watermelon, cucumber, and Arabidopsis expansins, respectively. The phylogenetic tree was generated using the maximum likelihood method in MEGA 7.0. Arabidopsis expansins sequences were retrieved from the databases of Arabidopsis Information Resource (http://www.arabidopsis.org/) and the Expansin Central website (http://www.personal.psu.edu/fsl/ExpCentral/index.html)

Gene structure and putative motif identification

A structural analysis can provide valuable information concerning duplication events, so we analyzed the exon–intron structures (Fig. 3). The number of exons in the ClEXPs family varied from two to five, with members within each subgroup or subfamily having similar exon–intron structures. In the EXPA subfamily, 13 members (72.22%) had three exons; three members (16.67%) had two exons, and one member had four (ClEXPA4) or five (ClEXPA11). The EXPB subfamily had four exons except of ClEXPB2. Members of the ClEXLA and ClEXLB subfamily possessed five exons with the exception of ClEXLA2, ClEXLA4, and ClEXLB2, which contained four.

Fig. 3
figure 3

Phylogenetic relationships and structural analysis of ClEXPs. Introns and exons are represented by gray and green boxes, respectively. Lengths of introns and exons of each expansin gene are displayed proportionally. Gene structures were generated with GSDS 2.0 (http://gsds.cbi.pku.edu.cn/)

We detected conserved motifs in ClEXPs family and found 15 distinct motifs (Fig. 4). Schematic diagrams of these ClEXPs motifs are given in Table 1. Motifs 2 and 10 specified the well-conserved N-terminal domain 1, and motifs 3, 6, 9, and 11 consisted of the well-conserved C-terminal domain 2. At least two of these six motifs were shared among the ClEXPs (Table 1). Genes from the same subfamily, especially within subgroups, were generally characterized by a similar motif type and distribution. Motif 2, 5, and 6 were detected in all ClEXPs, and Motifs 1 and 3 were only detected in the ClEXPA subfamily, but Motif 9, 11 and 15 were found in all ClEXPs, except ClEXPA. These results indicate that the ClEXPA subfamily differs significantly from the other three subfamilies in terms of gene structure and motif composition. Others motifs were unevenly distributed in different clades.

Fig. 4
figure 4

Putative ten different motifs of ClEXPs by MEME

Table 1 Information of conserved motifs in ClEXPs

Homology analysis and detection of cis-acting elements in ClEXPs gene promoter regions

Because Arabidopsis is the most important model plant and many of its expansin genes function have been well characterized, we performed a homology analysis of watermelon and Arabidopsis expansin genes to predict the function of ClEXPs (Fig. 5). The analysis revealed the presence of 34 homologous gene pairs between watermelon and Arabidopsis as well as 11 gene pairs in watermelon and 15 in Arabidopsis. All ClEXP genes showed duplication events, and 13 tandem duplications between expansin genes were determined in watermelon with the percentage of 43.33%. These tandem duplications located on chromosome 2 (ClEXLA1, ClEXLA2, ClEXLA3, ClEXLA4, ClEXLA5, ClEXLA6, and ClEXLA7), chromosome 4 (ClEXPA6 and ClEXPA7), chromosome 7 (ClEXPA12 and ClEXPA13), and chromosome 9 (ClEXLB1 and ClEXLB2) (Table S4). However, there have no segmental duplications which were determined.

Fig. 5
figure 5

Homology analysis of expansin genes between watermelon and Arabidopsis. Black, red, and blue lines indicate homologous gene pairs between watermelon and Arabidopsis, and Arabidopsis and watermelon

As detailed in Table 2 and Supplementary Tables S5, cis-acting elements in the promoter regions of ClEXPs are extremely diverse. We detected 91 types of cis-acting elements, 34 with unknown functions. The most abundant elements were light-responsive elements, which contained 24 types (Table S5). Each promoter possessed 3–19 types, which suggests that ClEXPs are differentially regulated by light (Table 2). Twelve types of hormone-responsive elements were detected. The CGTCA-motif and TGACG-motif, which both involved in methyl jasmonate (MeJA) response, were also the most abundant hormone-responsive elements in the 19 ClEXPs gene promoters. One AuxRR-core, one AuxRE, and one TGA box were found in the promoter regions of ClEXPA, ClEXLA, and ClEXLB genes, respectively, which indicate that auxin may regulate these genes. Moreover, six types of environmental stress-related elements were identified in ClEXPs promoters. ARE and WUN-motif elements were present in over half of the identified 30 watermelon expansin genes. The fourth important type of cis-acting elements in the promoter regions of ClEXPs was development-related elements. The O2 site was the most abundant regulatory element of the 7 types. ClEXLA1, ClEXPA9, and ClEXPB1 contained MSA-like involved in cell cycle regulation, which suggests that these genes have important function in cell division and cell enlargement. (Table S5).

Table 2 Number of cis-acting elements in the promoter region of each watermelon expansin gene

Expression profiles of ClEXPs in fruit and seed developmental stages

We examine the expression profiles of ClEXPs genes in fruit and seed at multiple developmental stages (Fig. 6). 22 ClEXPs were detected in fruit and seed developmental stages, which were divided into three subgroups. Group I showed relatively higher expression levels in fruit flesh and rind at the early developmental stage. Moreover, ClEXPA4 had higher expression levels in all seed developmental stage, ClEXPA10 and ClEXPB1 had higher expression in 31 DAP. Group II have highest expression in seed early developmental stage. Group III have lower expression level in all seed development stages, but higher levels in early stage of fruit flesh and rind development.

Fig. 6
figure 6

Expression profiles of ClEXPs genes in different watermelon fruit (cultivars 97103; SBA number SRP012849) and seed developmental stages (SBA number PRJNA319011). Subdivided into groups (labelled I–III) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. DAP days after pollination, FF fruit flesh, FR fruit rind, WmSeed watermelon seed. Fruit flesh and rind was used as RNA-seq material at 10, 18, 26, and 34 days after pollination (DAP) during watermelon fruit development. Watermelon seed tissue was taken out at 25, 31, 37, and 49 days after pollination for RNA-seq

Expression profiles of ClEXPs under abiotic and biotic stresses

According to our RNA-seq results of watermelon fruit response to low light stress (Table S6), the 27 ClEXPs could be divided into five major clusters (Fig. 7). It is noteworthy that the expression of ClEXPA4, ClEXPB1, ClEXPA16, and ClEXPA12 in group I was significantly induced by low light stress at 0 DAP and then rapidly declined. Gene expression level of group II was significantly induced at 3 DAP, while group III had highest expression level at 9 DAP except for ClEXPA3, other genes in group III were significantly downregulated by low light at 9 DAP. Gene expression levels in group IV and V were repressed by low light stress.

Fig. 7
figure 7

Expression profiles of ClEXPs genes response to low light stress in watermelon (cultivars: Sumi No.8; tissue: fruit flesh). Subdivided into groups (labelled I–V) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. DAP days after pollination. Low light (yin) treatment was carried out 7 days before flowering. The light intensity under the sunshade net was 50% less than that under natural conditions (CK). Flesh of watermelon fruit center under yin and CK at 0, 3, 9, and 15 days after pollination (DAP) were harvested for RNA-seq

After cold stress, 23 ClEXPs divided into seven major clusters (Fig. 8). It is worth noting that the expression level of ClEXPs in group VII (ClEXLB1, ClEXPA16, ClEXPA5, ClEXLA1, ClEXLB2) was significantly induced by cold stress. In group III (ClEXLA7), melatonin treatment can significantly increase the gene expression level under cold stress and normal temperature (CK). Except this two groups, expression of ClEXPs in other groups showed significant down-regulation or no significant change in response to cold stress.

Fig. 8
figure 8

Expression profiles of ClEXPs genes response to cold stress in watermelon (cultivars: Y134; tissue: Leaf; SBA number: PRJNA328189). Subdivided into groups (labelled I–VII) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. MT melatonin treatment, Cold cold stress, Wmleaf watermelon leaf. Y134. Normal temperature was 25/18 °C (day/night); cold stress was 4 °C. Seedlings at the four-leaf stage were sprayed with 150 mM melatonin solution for 3 days, with distilled water used as the control

Under drought stress, the 20 ClEXPs could be divided into nine major clusters (Fig. 9). Expression of group V (ClEXPA4, ClEXPA12) was up-regulated by drought stress in drought-resistant variety M20 and drought-sensitive varieties Y34, while expression pattern in group III was contrary. In group VI, gene expression in M20 was lower under stress treatment, while that in Y34 drought stress can significantly induced the gene expression. ClEXPs in group I were induced after 4 days and declined rapidly after 8 days under drought stress in M20, while gene expression patterns in Y34 were opposite. Gene expression levels in group II and IV were lower and not affected by drought stress in M20, while expression level in Y34 was highest at 4 days and drought stress can significantly decrease expression level.

Fig. 9
figure 9

Expression profiles of ClEXPs’ genes response to drought stress in watermelon (cultivars: Y34 and M20; tissue: Leaf; SBA number: PRJNA454040). Subdivided into groups (labelled I–VI) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. Y34 watermelon Y34, M20 watermelon M20, d days after drought stress or control treatment. Y34 was M20 was drought tolerance material and Y34 was drought-sensitive material

The 29 ClEXPs genes in response to osmotic stress were divided into two major clusters (Fig. 10). Group I have 10 ClEXPs and expression of these genes showed increasing pattern upon osmotic stress. The gene expression patterns of group II were opposite, which decreased significantly after osmotic stress.

Fig. 10
figure 10

Expression profiles of ClEXPs genes response to osmotic stress in watermelon (cultivars: M08; tissue: root; SBA number: PRJNA326331). Subdivided into groups (labelled I–II) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. OS osmotic. M08 was relatively high tolerance to water deficits. The root samples were harvested at 6 h after 20% polyethylene glycol (PEG) 6000 treatment and untreated samples were used as controls

Through comparative analyses of ClEXPs expression profiles of watermelon fruits inoculated with cucumber green mottle mosaic virus (CGMMV) or non-inoculated, we found 19 ClEXPs which can be divided into two groups (Fig. 11). Group I has 5 ClEXPs and showed rapid and negative response, while group II showed rapid and positive response.

Fig. 11
figure 11

Expression profiles of ClEXPs genes response to CGMMV stress in watermelon (cultivars: Jingxin No.3; tissue: Fruit flesh; SBA number: PRJNA389184). Subdivided into groups (labelled I–II) based on the transcriptome data. The legend represents the logarithmic normalized FPKM. CGMMV cucumber green mottle mosaic virus

Expression profiles of expansin genes in different watermelon tissues analyzed by qRT-PCR

To comprehensively understand the physiological functions of watermelon expansin genes, we analyzed the expressions of 15 randomly selected expansin genes in watermelon under normal growth conditions in three different tissues by qRT-PCR. As shown in Fig. 12, expression levels of the 15 ClEXP genes differed in three tissues, indicating their involvement in the development of the three tissues. ClEXLA7 was dominantly expressed in root, and ClEXLA1 and ClEXLA5 have higher expression level in root than in leaf and stem, but ClEXLA2 and ClEXLA6 showed high expression in leaf. ClEXLB2 was notably high in root. ClEXPA5, ClEXPA6, and ClEXPA11 were notably high in root, but ClEXPA4, ClEXPA9, ClEXPA12, ClEXPA15, ClEXPA16, and ClEXPA18 showed high expressions in leaf. These results further highlighted that expansin genes were involved in watermelon plant growth.

Fig. 12
figure 12

qRT-PCR analysis for 15 watermelon expansin genes in different tissues. Root, stem, and leaf samples were collected 1 month after planting. The mean expression value was calculated from three independent replicates. The vertical bars show the standard deviation

Discussion

In this study, a total of 30 watermelon expansin genes containing full-length ORFs and two conserved domains, DPBB_1 and Pollen_allerg_1, were first identified in watermelon genome through genome-wide analysis. Similar to other plants, the 30 watermelon expansins were divided into four subfamilies, ClEXPA, ClEXPB, ClEXLA, and ClEXLB. They clustered together with genes of the same subfamily from other plants rather than genes of the same species from different subfamilies, suggesting that their ancestors differentiated before the divergence of different plant species. Seader et al. (2016) estimated that there were 12–13 EXPA genes, 2 EXPB genes, 1 EXLA gene, and 2 EXLB genes in the last common ancestor of all angiosperms. In our study, the four watermelon expansin subfamilies ClEXPA, ClEXPB, ClEXLA, and ClEXLB were accordingly divided into 9, 2, 1, and 2 subgroups with each group of genes deriving from a common ancestor by frequent gene duplication. The missing genes belonged to the other three subgroups in the ClEXPA subfamily might span different genome sequences that have not been scaffolded and, therefore, cannot be identified or be removed in our sequence analysis, and the results was similar to other studies (Ding et al. 2016). The disappearance of descendents from these ancestors could also attribute to gene deaths which were observed in Arabidopsis and rice as well (Sampedro and Cosgrove 2005). Through analysis and comparison of the sizes of expansin subfamilies in Arabidopsis, melon, cucumber, and watermelon, we found an uneven distribution of each gene subfamily among species. For example, Arabidopsis has 26 EXPA genes, 6 EXPB genes, 3 EXLA genes, and 1 EXLB genes, which contain more EXPA and EXPB members than Cucurbitaceae family (i.e., watermelon, melon, and cucumber). In addition, these four eudicots species have less EXPB members than a monocot rice, which correlate to the previous studies (Li et al. 2016; Hou et al. 2018) and possibly due to the differences in their cell-wall composition and their evolutionary differences (Cosgrove et al. 2002; Dal Santo et al. 2013). Watermelon, melon, and cucumber have more than five members in EXLA subfamily, but Arabidopsis have only two members in EXLA subfamily. Plants retaining certain kind of genes with larger subfamily size during the long evolutionary time were to increase its adaption to certain functions and environments (Ding et al. 2016). Therefore, the additional ClEXLA genes might have special functions in watermelon, which need to be validated. We also found that the size of EXPA was the biggest and the size of EXPB and EXLB were equal in Cucurbitaceae, which was in accordance with the observation in Solanaceae (Lu et al. 2016). Gene duplication events are considered to be one of the evolutionary forces in genome evolution, providing materials for generation of new genes and development of new functions (Kong et al. 2007). Segmental and tandem duplications are considered to be the two main causes of gene family expansion in plants (Cannon et al. 2004), the impacts of which on the expansion of watermelon expansin gene family were focused in our study. Our study showed that 43.33% ClEXPs have tandem duplication events. Therefore, our results indicate that tandem duplication events may be one of the main reasons for ClEXP gene family expansion. Phylogenetic analysis revealed that the majority of watermelon expansin genes show more closely related to melon and cucumber expansin genes than those of Arabidopsis. The reason was that there was a set of ancestral expansin genes corresponding to each subfamily predating the divergence of monocot and eudicot (Valdivia et al. 2007). Within the same phylogenetic subfamily and even subgroup, members mostly have a conserved gene structure, thus confirming their close evolutionary relationship and classification (Zhang et al. 2014).

Gene expression pattern can provide important clues for gene function, which are believed to be associated with the divergence of the promoter regions (Ding et al. 2016; Hou et al. 2018), so we used the PlantCARE program to analyze cis-acting elements in all ClEXP promoter regions. The four classes of cis-acting elements, namely, development-related elements, environmental stress-related elements, hormone-responsive elements, and light-responsive elements, are significantly abundant in ClEXPs. In plants, both environmental and internal factors can affect the expression patterns of expansin genes, which enable them to participate in various developmental processes conjectured to be regulated by the corresponding cis-acting elements contained within them (Hou et al. 2018).

Plant hormones and environmental stress can regulate expansin gene expression levels, which have been reported in many plants, such as TaEXPB23 in wheat, was induced by exogenous MeJA and salt stress, but suppressed by exogenous gibberellins, ethylene, indole-3-acetic acid, and α-naphthlcetic acid (Han et al. 2012). Variations in number and type of cis-elements associated with each ClEXPs indicate that the functions of these genes are differentially regulated by these signals and have varied functional roles in many developmental process related to cell-wall modification.

According to the results of RNA-seq data and qRT-PCR analyses, ClEXPs are expressed in various tissues and at many developmental processes, which indicated their possible involvement in certain tissues and developmental stages. The expression levels of ClEXLB2, ClEXLB1, and ClEXPA4 were relatively higher in fruit flesh, fruit rind, and seed at all stages, respectively, which suggested that these genes might be important for fruit flesh, fruit rind, and seed development. Watermelon fruits develop and mature rapidly, and early fruit development involves rapid cell division, followed by a long phase of cell expansion to form large vacuolated cells that make up the flesh of watermelon fruits (Wechter et al. 2008). Most ClEXPs have higher expression levels in early stages of fruit flesh, fruit rind, and seed shows that ClEXPs play important role in fruit and seed early developmental stage. And different ClEXPs show that different tissue expression patterns, such as ClEXLA7, were dominantly expressed in root, while ClEXLA2 and ClEXLA6 showed high expression in leaf. The results show that different ClEXPs play important roles in the growth and development of certain tissues. Expansins are likely to be involved in physiological adaptation in response to abiotic and biotic stresses (Kuluev et al. 2016; Song et al. 2017). Our results of the expression patterns of ClEXPs response to abiotic and biotic stresses are also similar to the previous studies. Notably, seven ClEXPs (ClEXLA1, ClEXLA6, ClEXLB1, ClEXLB2, ClEXPA5, ClEXPA10, and ClEXPA16) exhibited positive response to at least three kinds of stresses, implying their involvement in the crosstalk of stress signal pathways.

Comprehensive analysis of phylogenetic, homologous relationship and gene expression profiles can provide important clues for gene function. Based on our phylogenetic tree, ClEXPA9 and AtEXPA10 belong to the EXPA-I subgroup, and our homology analysis revealed that ClEXPA9 is homologous to AtEXPA10. AtEXPA10 influences the sizes of vegetative organs (Kuluev et al. 2012), our transcriptome and qRT-PCR analysis also showed ClEXPA9 have higher expression level in the early developmental stage of fruit flesh, fruit rind, seed, and in leaf.

Conclusions

In the current study, we presented a genome-wide analysis of expansin gene family in the watermelon genome and identified 18 ClEXPAs, 3 ClEXPBs, 7 ClEXLAs, and 2 ClEXLBs, respectively. The four subfamilies were further grouped into 9, 2, 1, and 2 subgroups which were derived from 14 common ancestors by gene duplications. The ClEXPs exhibited specific characteristics in terms of exon–intron structural, amino acid sequences, or protein motif composition within subfamilies or subgroups. Tandem duplication events might have contributed to the expansion of ClEXPs. Each expansin gene contained a number of cis-acting elements in its 1.5 kb upstream region, suggesting that its expression was regulated by various internal or environmental factors, such as plant hormones, and light and environmental stresses, thus participating in watermelon development and resistance to stresses. Here, we showed a global expression landscape of ClEXPs in response to development and various stress. The expression patterns of the ClEXPs were also studied using our and public RNA-seq and qRT-PCR analysis, which revealed that most ClEXPs are expressed in various tissues and at many developmental processes, which indicated their possible involvement in certain tissues and developmental stages. In addition, here are seven ClEXPs (ClEXLA1, ClEXLA6, ClEXLB1, ClEXLB2, ClEXPA5, ClEXPA10, and ClEXPA16), exhibited positive response to at least three kinds of stresses, implying their involvement in the crosstalk of stress signal pathways. Our results provide valuable information for further elucidation of the evolution and divergence of plant expansin genes. Results of this study may also aid understanding of the biological function of ClEXPs and molecular basis of many watermelon important agriculturally traits.