Introduction

The fig (Ficus carica L.) is classified in the Cronquist system as a genus of the family Moraceae in the order Utricales. More than half of the species in the Moraceae belong to the Ficus genus (Datwyler and Weiblen 2004), and fig is the most common species among them.

Fig is a gynodioecious plant with two major sex types. The caprifig (hermaphroditic) type, the presumptive ancestral species, has male flowers and long-style female flowers, whereas the fig type (female) has only short-style female flowers (Beck and Load 1988; Dellaporta and Calderon-Urrea 1993; Stover et al. 2007). The female fig type is further classified into three types according to its cultivation type: the Smyrna type is non-parthenocarpic, the San Pedro type is parthenocarpic in the first crop but not in the second crop, and the common type is parthenocarpic in both first and second crops (Storey 1975) (Fig. 1a). Every fig species thus falls into one of the four ecotypes: caprifig, Smyrna, San Pedro, or common.

Fig. 1
figure 1

Diagrammatic representation of gynodioecious fig (Ficus carica L.) fruit-type differentiation and analyzed fruits. a Fig taxonomy matrix based on parthenocarpy and sex traits. The thick double-headed arrow indicates the comparison undertaken in this study. The dotted double-headed arrow indicates the parthenocarpy range of caprifig type. b Left: Caprifig 6085 first crop (caprifig type), right: Houraishi second crop (common type). The maturity stage of the displayed fruits was between periods II and III. Bar = 2 cm

Phenotypic differentiation in traits associated with sexuality and parthenocarpy plays an important role in plant–insect ecosystems as well as in our agricultural history. The differentiation between the hermaphroditic and female strains forms a basis to maintain the close symbiotic relationship between Ficus plants and the Blastphaga wasp (Galil 1977; Wiebes 1979). The appearance of a trait for parthenocarpy suggests the possible first cultivation by humans; the fig may be the earliest cultivated plant (Kislev et al. 2006).

Economically, the fig is an important fruit tree grown mainly in Mediterranean countries, such as Turkey, Egypt, and Iran, but also elsewhere (FAO 2006). The fig plant is of value mainly for its edible fruit, particularly that of the sexual species (female fig type). This fruit has a unique morphology (hypanthodium) with countless small flowers contained within the fruit receptacle or syconium. The edible parts are the torus and the small flowers; these parts are consumed mainly in the dried form (as dried figs) or as processed or fresh fruit. Dried figs contain a high mineral and fiber content and are regarded as among the most convenient and nutritious preserved foods (Vinson et al. 2005). Recent studies have reported that anthocyanins such as cyanidin-3-rhamnoglucoside contained in fig fruits have antioxidant potential, possibly preventing fibroblast oxidation (Solomon et al. 2006; Duenas et al. 2008). The ripening process of fig fruits is climacteric (Watkins 2002), and, as in other climacteric fruits, ethylene hastens the ripening process (Owino et al. 2006). Because fig fruits dramatically increase their ripening speed in the brief period at the end of the growth stage, fruit quality control is a major issue in pre- and post-harvest management.

Fig fruits have many traits of physiological and economic importance. However, owing to the limited molecular details (in January 2012, the keyword “Ficus carica” retrieved 509 records in a National Center for Biotechnology Information (NCBI) “Nucleotide” search), the genetic structure underlying the traits of fig fruits is not well understood. In addition, there is no ongoing large-scale molecular research. Among the four fig ecotypes, the non-parthenocarpic caprifig type differs substantially from the common type, while the other types fall morphologically between these two types. For this reason, a large-scale comparative study of gene expression in the non-parthenocarpic caprifig and common strains should provide comprehensive data on gene expression in terms of fruit physiology and also elucidate the genetic factors underlying the traits involved in type differentiation.

The primary objective of this study was to obtain comprehensive and large-scale expressed sequence tag (EST) data to serve as a basis for the genetic understanding of fig fruit physiology. This was accomplished by 454 pyrosequencing, for the rapid generation of large genetic data sets, on non-parthenocarpic caprifig-type and parthenocarpic common-type fruits. As sequences generated from 454 pyrosequencing are highly accurate and longer than those from other platforms, 454 pyrosequencing has been utilized for the transcriptome analyses of non-model plant species, such as grapevine (Vitis vinifera L.) (Bellin et al. 2009), olive (Olea europaea L.) (Alagna et al. 2009), chestnut (Castanea spp.) (Barakat et al. 2009), and chickpea (Cicerarietinum L.) (Garg et al. 2011). Our second objective was to compare the transcriptomes of the two types to obtain molecular information on the genetics underlying the polymorphic traits, in particular the genes governing sexuality and parthenocarpy.

Material and methods

Plant materials and RNA extraction

We used a 15-year-old Caprifig 6085 (caprifig type; accession: JP number 113491) tree and a 24-year-old Houraishi (common type) tree for EST analyses.

Caprifig 6085 is a hermaphroditic strain introduced to Japan in the mid-20th century. It shows little parthenocarpy at the first crop (approximately 10 % bearing), and at the second crop (approximately 0 % bearing) (Awamura et al. 1996). Houraishi is a highly productive cultivar and the oldest representative variety in Japan (Ikegami et al. 2009a). Both plants were provided by the former Fruit Tree Experiment Station (Tsukuba, Japan) and planted in the Fukuoka Agricultural Research Center, Buzen Branch (Fukuoka, Japan; Fig. 1b).

Sample fruits were harvested in 2009 and 2010. The first crop was harvested from the caprifig (non-parthenocarpy) type and the second from the common (parthenocarpy) type. To obtain gene expression data during the maturation period, fruits were harvested at the end of period II, when second rapid fruit growth and ethylene production start. In general, fig fruit development is divided into three periods based on changes in fruit size. In the first stage, intense cell division, differentiation and rapid growth occur (period I). A large period of stasis follows (period II) and a second phase of rapid growth occurs (period III), in which cell expansion and a change of color and texture are observed (Chessa 1997; Owino et al. 2006). Two harvested fruits for each type were sliced vertically and preserved at −80 °C following snap-freezing with liquid nitrogen.

Library preparation and 454 sequencing

The preserved fruits were ground in liquid nitrogen, and subjected to total RNA extraction, combining Fruit-mate (TakaraBio, Inc., Shiga, Japan) and an RNeasy Maxi kit (Qiagen, Hilden, Germany) (Ikegami et al. 2009b). Purified poly(A)-RNA was then obtained from the total RNA using a MicroPoly(A)Puristkit (Ambion, Carlsbad, CA). We used a cDNA Synthesis System kit (Roche, Penzberg, Germany), Roche “random primers” (Roche), and aGS FLX Titanium Rapid Library Preparation kit (Roche) to synthesize cDNA and prepare the library, which we then sequenced with a GS FLX Titanium Sequencer (Roche) following the manufacturer’s instructions.

Sequence quality controls and de novo assembly

Pre-processing and assembly were carried out as described by Habu et al. (2012). The raw reads obtained from 454 pyrosequencing were processed by Seqclean software (http://compbio.dfci.harvard.edu/tgi/software/) to trim low complexity [poly(A)] sequences. Then, the reads were further processed by RepeatMasker (Smit et al. 1996–2000) (http://www.repeatmasker.org) with RepBase (Jurka et al. 2005) to mask the repeat sequences to avoid mis-assembly. Finally, masked reads were processed by a Perl script as follows: (1) low-quality regions were masked, (2) masked regions of both ends were trimmed, (3) reads that were shorter than ten bases were removed, and (4) reads that contained more than 30 % of the masked regions were removed. We then ran MIRA v3.0.2 for sequence assembly, specifying the stringent parameter settings “de novo, accurate, EST, 454” with a minimum read length of 40 bases, minimum sequence overlap of 40 bases, and minimum percentage overlap identity of 95 %.

Sequence annotation and estimation

The assembled contigs and singletons were annotated with information from the NCBI non-redundant protein database and from the Arabidopsis Information Resource protein database (TAIR10) using the BLASTx program v2.2.24+ by cut-off E values of 1e−5, 1e−10 or 1e−100 with other default parameters (Altschul et al. 1990). The assembled unigene sequences were also annotated with functional Gene Ontology (GO) (The Gene Ontology Consortium 2000) using Blast2GO tool (http://www.blast2go.com/b2ghome). Then, the assigned GO terms were classified based on the GO slims (http://www.geneontology.org/GO.slims.shtml) by CateGOrizer (Hu et al. 2008). A statistical comparison of GO distributions in the two fruit types was performed with the R program using Fisher’s Exact Test. EC numbers were obtained from KEGG ENZYME and UniProt ENZYME.

To check whether each unigene was full length or not, ORFs of unigenes were predicted by ESTScan (Iseli et al. 1999) and the predicted ORF sequences that contained both start and stop codons were extracted as candidates for full-length sequences.

Identification of maturation-related gene pathways

We searched for metabolic pathway genes involved in maturation, including the ethylene, sugar, and anthocyanin synthetic pathways, using the local BLAST command of GENETYX (Genetyx Corporation, Tokyo, Japan) using Arabidopsis genes as queries and the unigene set as the target database.

Gene expression analysis by reverse transcription polymerase chain reaction

We extracted total RNA from fig fruits at stages I and II using Fruit-mate (Takara Bio, Inc.) and an RNeasy Plant Mini kit (Qiagen) with DNase I treatment (Takara Bio, Inc.). For reverse transcription polymerase chain reaction (RT-PCR) template synthesis, we used a SuperScriptIII First-Strand Synthesis System (Life Technologies, Inc., Carlsbad, CA) to generate cDNA from 80 ng of total RNA. The PCR reaction mixtures were prepared in a 12.5 μl volume containing 0.25 U AmpliTaq Gold DNA polymerase (Applied Biosystems, Foster City, CA), 0.5 μM primers, 2.0 mM dNTPs, 1× PCR buffer and 1 μl of cDNA as template. The PCR amplification reaction was carried out as follows: 94 °C for 2 min followed by 38 to 40 cycles of 94 °C for 1 min, 68 °C for 2 min, and 72 °C for 2 min. The final extension was performed at 72 °C for 7 min. Gene-specific primers were designed using GENETYX based on the acquired EST sequences (Electronic supplementary material (ESM) Supplemental Tables 1 and 2). A primer pair specific for the F. carica β-actin gene (accession:AY487315.1) was used as the endogenous control.

Results and discussion

Pyrosequencing and de novo assembly

Using pyrosequencing, we generated 290,594 reads, with 165,442 from the caprifig and 125,152 from the common type (Table 1). Read quality was high, with an average read length of more than 300 bp and a QV40+ value of over 94 for each fruit type. After pre-processing, we obtained a total of 270,268 reads, with 155,391 from the caprifig and 114,877 from the common type. We first assembled these pre-processed reads for each type and obtained 62,420 unigenes (17,454 contigs and 44,966 singletons) for caprifig and 49,491 unigenes (12,771 contigs and 36,720 singletons) for the common type. We then assembled the combined set and obtained 19,166 contigs and 52,289 singletons for a total unigene number of 71,455 (Table 1, Fig. 2). The subsequent analysis used this combined unigene set.

Table 1 Summary of de novo assembly results of 454-pyrosequencing data from gynodioecious fig (Ficus carica L.) fruits: caprifig and common fig
Fig. 2
figure 2

Strategy for the assembly and identification of type-specific transcripts in gynodioecious fig (Ficus carica L.) fruits. We first generated 290,594 total reads, with 165,442 from the caprifig and 125,152 from the common type using pyrosequencing. After pre-processing and assembly, we obtained a total of 270,268 reads and 71,455 unigenes with 19,166 contigs, and 52,289 singletons. We also assembled these pre-processed reads for each type and obtained 62,420 unigenes for caprifig and 49,491 unigenes for the common type. A statistical comparison of GO term distributions was conducted between each type’s unigene set. Functional annotations and extractions of type-specific expressed genes were performed using the total unigene set

Function and feature annotation

We functionally annotated the unigenes using three parameters: BLASTx match, GO term, and EC number. The hit rates of the BLAST search against the NCBI non-redundant and TAIR10 databases (at a threshold E value < 1e−5) were 60.6 and 57.8 %, respectively, while that for the EC number was 12.7 %. The GO slim terms for biological process, cellular component, and molecular function could be assigned to 34.9, 30.4, and 38.3 % of the unigene set, respectively. Of the annotated processes, for example, “metabolism” accounted for 15.09 %, and “biosynthesis” for 5.96 %. GO processes accounting for less than 5 % of the unigenes composed nearly half of the annotated set, while “biological process unclassified” composed 32.82 % (Fig. 3). These results suggest the expression of a wide range of genes with various processes and many unclassified genes in fig fruits. In total, 46.5 % of the unigene set were assigned at least one GO slim term. The number of unigenes annotated by BLAST, GO term or EC number was 44,070 and the total proportion annotated was 61.7 % (Table 2, Fig. 2).

Fig. 3
figure 3

Distribution of F. carica fruit unigenes from both caprifig and common fig types according to their associated biological process, cellular component, and molecular function GO terms. Fig EST sequences annotated by TAIR Gene Ontology were grouped by GO slim category

Table 2 Annotation information of the unigene set from gynodioecious fig (Ficus carica L.) fruits: caprifig and common fig

By ESTScan, a total of 38,308 (53.6 %) unigenes could be predicted ORFs but only 1,303 (1.8 %) unigenes were candidates for full-length sequences. Among the candidates, 411 (0.6 %) unigenes were covered more than 80 % of the length of the most homologous Arabidopsis protein sequences (Table 2). Although our data contains few full-length sequences, many partial gene sequences obtained in this study will be useful for cloning full-length sequences and for expression analyses of unidentified genes in fig.

Major genes expressed in fig fruits

To identify the major genes expressed in fig fruits at the late period II, we extracted the 20 contigs that contained the largest number of reads in the unigenes (Table 3). The extracted list contained 1-aminocyclopropane-1-carboxylate oxidase (ethylene-forming enzyme), pectin lyase, beta-galactosidase, and expansin, and it confirmed the active expression of known maturation-related gene complexes in fruits. Highly expressed genes not related to maturation included the nucleotide-binding site leucine-rich repeat (NBS-LRR) family genes that function in DNA repair and disease resistance and the plasma membrane intrinsic protein 1C (PIP1C) genes that promote symplasticwater transport. These genes are also presumed to be required for fruit development.

Table 3 The 20 most common transcripts detected in gynodioecious fig (Ficus carica L.) fruits: caprifig and common fig

Fruit ripening involves many biochemical events such as changes in, sugar content, acidity, color, texture, and aroma volatiles. These changes are controlled by the coordinated expression of maturation-related gene complexes (Bouzayen et al. 2010). Elucidation of the control mechanisms involved in individual events will be of great significance to the understanding of fig fruit physiology. We have summarized below our results obtained by extracting gene complexes for specific pathways related to fruit maturation utilizing gene conservation across fruit species.

Ethylene synthesis and signal transduction

The first stage of the fruit maturation process is ethylene synthesis. As recognition of synthesized ethylene by receptors is followed by many downstream maturation processes via the ethylene signaling pathway (Ohme-Takagi and Shinshi 1995; Solano et al. 1998; Riechmann et al. 2000; Klee 2004; Alba et al. 2005; Gupta et al. 2006; Kesari et al. 2007), ethylene synthesis and signaling are fundamental to the fruit maturation process. Accordingly, we first tried to detect the sequences of genes involved in the ethylene synthetic and signaling pathways in our unigene set. We were able to identify sequence fragments of all of the major genes ranging from SAMS at the beginning of the synthetic pathway down to ERF1 at the end of the signaling pathway (Fig. 4). Many reads were detected in this pathway as well as in the downstream glucose and anthocyanin synthesis pathways, suggesting that both early and late fruit maturation processes were simultaneously active in the sampled fruits.

Fig. 4
figure 4

Ethylene synthesis and signal transduction in F.carica fruit as inferred from Giovannori (2004) and Adams-Phillips (2004). The boxes show genes encoding enzymes that were isolated from fig fruit ESTs in this study (BLASTx, e–05 cut-off). The first and second numbers in the parentheses refer to the number of contigs and singletons, respectively. SAMS, S-adenosylmethionine synthase; ACS, 1-aminocyclopropane-1-carboxylate synthase; ACO, neutral invertase; SS, sucrose synthase; ETR1, ethylene response 1; ETR2, ethylene response 2; ERS1, ethylene response sensor 1; ERS2, ethylene response sensor 2; EIN4, ethylene insensitive 4; CTR1, constitutive triple response1; EIN2, ethylene insensitive 2; EIN3, ethylene insensitive 3; ERF1, ethylene response factor 1

Sugar synthesis pathway

Among downstream pathways, the understanding of the sugar, anthocyanin synthesis and cell-wall degradation pathways are of great importance for fruit quality management, because these pathways are directly linked to factors such as taste, appearance, and softening. Among the cell-wall degradation pathways, Owino et al. (2006) had already focused on cell wall modifying enzymes during fruit ripening. For these reasons, we extracted genes related to the sugar and anthocyanin synthesis pathways.

The major glucose content in fig fruits has been reported to comprise sucrose, glucose, and fructose (Yahata and Nogata. 1999). Our unigene set contained the synthase genes for all these saccharides (Fig. 5) plus the galactose synthase genes reported by Ersoy et al. (2007). Besides these saccharides synthases, we sought to extract genes encoding the sugar transporters known to be important in unloading from leaves to fruits and glucose storage in vacuoles (Yamaki 2010). We identified at least four homologous genes including SUT (a sucrose transporter), SORT (a sorbitol transporter), MANT (a mannitol transporter), and HEXT (a hexose transporter) (Fig. 5). In other fruit trees, translocated saccharides include sucrose, sorbitol, raffinose, stachyose, and mannitol (Ziegler 1975; Yamaki 2010). In the fig tree, the saccharide types translocated may correspond to the genes encoding the sugar transporters detected in this study.

Fig. 5
figure 5

The sugar metabolism pathway in F. carica fruit. The boxes show genes encoding enzymes that were isolated from fig fruit ESTs in this study (BLASTx, e–05 cut-off). The first and second numbers in the parentheses refer to the number of contigs and singletons, respectively. NADP-SDH, NADP-dependent sorbitol dehydrogenase; NAD-SDH, NAD-dependent sorbitol dehydrogenase; NIN, neutral invertase; SS, sucrose synthase; VIN, vacuolar invertase; SUCT, sucrose transporter; SORT, sorbitol transporter; MANT, mannitol transporter; HEXT, hexose transporter

Anthocyanin synthesis pathway

Anthocyanins synthesized in fig fruits include cyanidin and pelargonidin, with a higher proportion of cyanidin in both the fruit skin and florets (Duenas et al. 2008). The cyanidin synthesis pathway genes identified in our study are shown in Fig. 6. We identified all the enzymatic genes, including phenylalaninecyanidin-3-rhamnoglucoside and cyanidin-3-glucoside. However, we did not identify flavonoid 3′ 5′ hydroxylase, which catalyzes the conversion of dihydroquercetin to dihydromyricetin in the delphinidin synthesis pathway. This supports the inactivity of the flavonoid 3′ 5′ hydroxylase pathway and is consistent with previous studies of fig fruit biochemical mechanisms (Solomon et al. 2006; Del Caro and Piga 2008; Duenas et al. 2008). Fig fruits have a wide variety of skin colors ranging from dark purple, purple, red, pink, and green to yellow. Further structural analyses and gene expression studies of the anthocyanin synthesis pathway genes such as those conducted in other studies (Kobayashi et al. 2004; Xie et al. 2011) may elucidate the formation mechanisms of these various skin colors.

Fig. 6
figure 6

Anthocyanin and anthocyanidin biosynthesis in F.carica fruit. The boxes show genes encoding enzymes that were isolated from fig fruit ESTs in this study (BLASTx, e–05 cut-off). The first and second numbers in the parentheses refer to the number of contigs and singletons, respectively. MYB, Myb transcription factors; PAL, phenylalanine ammonia lyase; 4CL, 4-coumarate-CoA ligase; C4H, cinnamate 4-monooxygenase (trans-cinnamate 4-monooxygenase); CHS, chalcone synthase; CHI, chalcone isomerase; F3H, flavonone 3-hydroxylase; F3′H, flavonoid 3′-hydroxylase; DFR, dihydroflavonol-4-reductase; LDOX, leucoanthocyanidin dioxygenase; 3GT, flavonoid 3-glycosyl-transferase; 3RT, anthocyanidin-3-glucoside rhamnosyltransferase

Comparison of GO term distribution

Classification of ecotypes such as the caprifig and common types is based mainly on sexuality and parthenocarpy. According to the segregation data obtained from inter-type hybridizations, the inheritance of sexuality and parthenocarpy has been explained by one or two genes (sexuality: GA/ga, with two closely linked pairs of alleles; parthenocarpy: P+, with a single pair of alleles) (Storey 1975; Saleeb 1965; Awamura 1996). Type differentiations can thus be considered to be controlled by this limited number of genes. However, specific differences between the caprifig and common type at the whole transcriptome level remain unknown. We thus compared GO terms representative of each type.

As a result, there was no significant difference observed in the distributions of GO terms (ESM Supplemental Fig. 1). This suggests that the period II fruits of the two types do not differ in their macro-level transcriptome expression patterns, but rather only in the expression of a relatively small set of genes. Therefore, we then studied the expressions of the sexuality and parthenocarpy related-genes by semi-quantitative RT-PCR.

Type-specific expression analysis of MADS-box genes

As stated above, two linked alleles (GA/ga) are involved in the sexual traits of fig fruits, where allele G is responsible for pistil length and allele A is responsible for the presence/absence of stamens (Storey 1975). However, the physiological identities of these genes are unknown. The ABCDE model describes a relationship between floral organ formation and gene expression (Ferrario et al. 2003; Theissen2001; Theissen and Saedler 2001), and its relevance to sexual traits has been examined in various plants (Kater et al. 2001; Sather et al. 2010; Park et al. 2003; Elo et al. 2001; Yu et al. 1999; Sheppard et al. 2000; Ainsworth et al. 1995; Hardenack et al. 1994; Heuer et al. 2001). We thus examined how well this model is conserved in the caprifig and common type fruits.

This was accomplished by extracting nine MADS genes corresponding to each class of the ABCDE model (ESM Supplemental Table 2) and analyzing their expression level by RT-PCR in stages I and II for each fruit type. While no polymorphic expression was observed for either stage or type for the genes in classes A, D, and E, there were clear differences in the expression levels of B- and C-class genes. The expression levels of the PISTILLATA homologs PI1 and PI2 (B class) in the caprifig type at stage II were 1.7 to 5.0 times higher than those in the common type. A polymorphism in the amplicon size was observed for the AGAMOUS homolog AG (C class), in which the common fig amplicon size was slightly smaller than caprifig amplicon size (Fig. 7).

Fig. 7
figure 7

RT-PCR analysis of ABCDE model MADS family genes in F.carica fruit (stages I and II). The abbreviations refer to fig homologs of each Arabidopsis MADS gene: AP1, APETALA1; AP2, APETALA2; AP3, APETALA3; PI1, PISTILLATA1; PI2, PISTILLATA2; AG, AGAMOUS; SHP, SHATTERPROOF; STK, SEEDSTICK; SEP1, SEPALLATA1; SEP3, SEPALLATA3; AGL6, AGAMOUS-like 6. Additional abbreviations: M, 100 bp molecular weight marker; Actin, β-actin. cp, caprifig type; cm, common type. The number under each lane indicates relative expression measured by AlphaEaseFC software v4.0.1 (Alpha Innotech, corp., SA)

Two alternative mechanisms account for the development of unisexuality: the degeneration of sexual organs or exclusive differentiation of only one type of sex organs (Heslop-Harrison 1964). Given that hermaphroditic fruits transition from the female period to the male period (Ramirez 1974), we suspected that figs fall into the former type and that different activities of the MADS box gene complexes are not the direct causes of sexual differentiation. These differences in activities are considered side effects of the sex determination process (Golenberg and Freeman 2006). This conjecture was supported by our finding that the expression level of the PI homologs between the two types was the same in period I, but different in period II.

Nevertheless, it is likely that the PI homologs are important genes in caprifig stamen formation because they showed distinct polymorphic expression. The relevance to the sexuality of the expression–product polymorphism of the AG homolog is unknown, but the variety of functional roles played by the C-class genes in floral-organ formation (Drews et al. 1991; Mizukami and Ma 1992; Busch et al. 1999; Lohmann et al. 2001) suggests many possibilities. To understand the physiological basis of the G and A genes, we need to examine the involvement of the PI and AG homologs further.

Type-specific expression analysis of the gibberellin and chalcone synthase genes

For the parthenocarpic traits of figs, the relevance of fruit auxin and gibberellin concentrations has been investigated previously (Crane et al. 1959; Lodhi et al. 1969). We accordingly focused on eight homologs (one GA20ox, two GID1, one GAMYB1, and four DELLAs) of the gibberellin synthesis genes, whose roles in the genetic pathway are well-known, and conducted inter-type comparative analyses of their respective gene-expression patterns in period II fruits. However, we did not observe any clear polymorphisms among these homologs (data not shown).

Previous studies have reported that RNA interference-mediated suppression of the chalcone synthase (CHS) genes induced parthenocarpy in tomatoes, and that fruits with RNA interference-mediated downregulation of CHS displayed impaired pollen tube growth (Schijlen et al. 2007). Moreover, overexpression of the grape-derived stilbene synthase (STS) gene induced male sterility and parthenocarpy in tomatoes due to the depletion of coumaric and ferulic acids, which are necessary for lignin and sporopollenin biosynthesis (Ingrosso et al. 2011). Interestingly, a CHS homologous gene is known to lie upstream of the genes that expressed specifically in the common type (ESM Supplemental Tables 3 and 4.). Our analysis of CHS homologs by RT-PCR detected no polymorphisms in period I fruits. However, in period II fruits, we detected a new smaller sized transcriptional product in the common type, and its transcripts had lower total expression levels (Fig. 8).

Fig. 8
figure 8

RT-PCR analysis of Arabidopsis chalcone synthase (CHS)-homologous genes in F.carica fruit (stages I and II). M, 100 bp molecular weight marker; Actin, β-actin; cp, caprifig type; cm, common type. The number under each lane indicates relative expression measured by AlphaEaseFC software v4.0.1 (Alpha Innotech, corp., SA)

CHS and STS are similar in their mechanistic and structural aspects (Yamaguchi et al. 1999) and they are believed to be involved in pollen development and plant reproduction through flavonoid-synthesis metabolism (Mo et al. 1992; Ylstra et al. 1994; Hanhineva et al. 2009). It is thus possible that the polymorphic expression that we observed triggers changes in flavonoid-synthesis metabolism and governs parthenocarpy. We are currently investing the causes of the polymorphism by screening for CHS gene clones in a genomic library derived from the common type “Houraishi”.

Screening for new type-specific transcripts

It is likely that genes not mentioned above are also important factors for sexual and parthenocarpic traits and that genes showing polymorphic expression between the types are not limited to those controlling sexuality and parthenocarpy. Such genes may be studied using known genetic information as well as by searching for genes showing type-specific expression. We extracted the contigs consisting of the reads derived from either caprifig or common type based on the count data, so that 4,844 contigs were specific to the caprifig type and 2,260 were specific to the common type (ESM Supplemental Tables 3 and 4).

To evaluate the validity of the extracted gene lists, we randomly selected 18 ESTs (ten caprifig-specific and seven common-specific) and investigated them by RT-PCR. We observed significant inter-type differences in five genes in either expression level or in transcription-product size (Fig. 9, ESM Supplemental Table 1). The size and amount of transcription products varied for homologs of the pectin lyase-like superfamily protein, heavy metal transport protein, plant cadmium resistance protein and gibberellin-regulated protein. Expression in only one type was observed in nucleotide-binding site-leucine-rich repeats. The fact that five of the 18 tested genes were confirmed to be type specific shows that the extracted gene lists are useful for finding genes differentially expressed between types.

Fig. 9
figure 9

RT-PCR analysis of 18 subtracted genes in caprifig and common fig fruits (stages I and II). The analyzed genes were preferentially selected from the subtraction lists on the basis of their read number ranking. The abbreviations refer to fig homologs of Arabidopsis genes: PL, pectin lyase-like superfamily protein; MFS-1, major facilitator superfamily-1; AT, HXXXD-type acyl-transferase family protein; PE, pectin methylesterase inhibitor superfamily; bHLH, basic helix-loop-helix DNA-binding superfamily protein; SKU5, SKU5 similar 13; HMT, heavy metal transport; PCR11, plant cadmium resistance 11; TPC, terpenoid cyclases; GR, gibberellin-regulated family protein; MFS2, major facilitator superfamily protein 2; PP2C, highly ABA-induced PP2C gene 1; PAR1, PAR1 protein; COP1, COP1-interactive protein 1; NBS-LRR, disease resistance protein (CC-NBS-LRR class) family; XYL1, beta-xylosidase 1; PR; pathogenesis-related family protein. Additional abbreviations: M, 100 bp molecular weight marker; cp, caprifig type; cm, common type. The number under each lane indicates relative expression measured by AlphaEaseFC software v4.0.1 (Alpha Innotech, corp., SA)

It is known that pectin lyase softens fruits by breaking down pectin, while the heavy metal transport protein plays important roles in homeostatic maintenance of essential trace elements and heavy metal detoxification (Nelson 1999; Thomine et al. 2000). With regard to the gibberellin-regulated protein, its target gene product gibberellin is known to widely control traits including intercalary elongation, vegetative growth, and reproductive growth. There could be variations in the homolog functions or the splicing processes of these three genes that lead to functional differences in fig fruits. The plant cadmium resistance protein, like the heavy metal transport protein, is related to heavy metal transportation. The polymorphic expression of these two genes with similar functions may suggest characteristics of metal transportation specific to each type. The nucleotide-binding site-leucine-rich repeat gene is believed to have disease-resistance functions such as pathogen recognition or host defense (DeYoung and Inne 2006). This polymorphic expression also suggests differences in the disease resistance of each type.

A comprehensive search for type-specific genes would require, in addition to RT-PCR, a large-scale screening method such as microarray analysis or deeper RNA-sequencing. In the present study, we analyzed only one strain for each type. Therefore, a confirmatory studies using other varieties would also be required to corroborate inter-type polymorphisms.

Conclusion

Using high-throughput sequencing, we extracted and identified gene complexes including genes regulating maturation, expressed in fig fruits. Our GO term analysis did not detect a significant difference between the fruit types, suggesting that genetic differences between the types are not expressed over the entire transcriptome in the tested fruits. Polymorphic expression was detected for several genes including CHS, PI, AG homolog genes, and gibberellin-regulated protein. The CHS gene is of special interest in understanding the induction of parthenocarpy owing to its putative role in the origin of plant domestication.

Because there would be other genes that were not analyzed in the present study but that contribute to trait differentiation among the types and varieties, further studies using large-scale screening are recommended. However, the EST data that we generated in this study, our arrangement of maturation-related genes and our findings of inter-type polymorphism will contribute to the elucidation of the physiological traits of fig fruits and thus to the study of fig genetics.