Introduction

The genetic determinism of the morphology of some plant organs has not received enough attention. In the case of fruit shape, there are plenty of papers describing their genetic and hormonal control in species such as tomato, Cucumis and capsicum (Peterson 1959; Grandillo et al. 1999; Chaim et al. 2001; Périn et al. 2002; Tanksley 2004). However, in species such as grapevine the fruits or berries are arranged in clusters, which in turn correspond to inflorescences. The architecture of the inflorescence determines to a large extent the fruit quality. Several authors have presented evidence showing that both the architecture and size of the inflorescences are genetically determined (Shavrukov et al. 2004; Weiss et al. 2005; Busov et al. 2008). This phenomenon has been demonstrated by studies in pea, rice, Arabidopsis and recently in rose (Nomura et al. 1997; García-Maroto et al. 2000; Nakagawa et al. 2002; Okada and Shimura, 1994; Goosey and Sharrock 2001; Kawamura et al. 2011). In general, the inflorescence architecture depends on branching pattern, number of flowers per node and the extent of terminal meristem growth in the reproductive phase where in turn developmental decisions occur (Singer 2007; Liu et al. 2013). Several QTLs associated with inflorescence architecture have been found in a number of species: Poaceae (Zhu et al. 1999; Brown et al. 2006; Upadyayula et al. 2006; Larson et al. 2013), Arabidopsis (Ungerer et al. 2002, 2003) and rose (Kawamura et al. 2011). In Arabidopsis, shoot identity gene TERMINAL FLOWER1 (TFL1) and floral meristem identity genes, such as APETALA1 (AP1) and LEAFY (LFY), regulate the pattern of inflorescence branching (Liu et al. 2013). Liu et al. (2013) have reported a conserved genetic pathway determining inflorescence architecture in Arabidopsis and rice and demonstrated that MADS-box genes in rice determine panicle branching by regulating TFL1-like genes.

In modern viticulture, the architecture of the cluster plays an important role in vineyard management and modification. According to several authors, the architecture of this organ has an important impact on the cost of grape production (Weaver and McCune 1959; Weaver et al. 1962; Vail and Marois 1991; Vail et al. 1998; Shavrukov et al. 2004). Many methods are used to modify the cluster architecture. For example, the pre-bloom application of gibberellins is used to stimulate the elongation of the main axis of the rachis, which results in less compact clusters (Weaver and McCune 1959; Weaver et al. 1962; Hed et al. 2011). Chemical thinning of the berries and removal of parts of the cluster are other methods used to reduce the number of berries per cluster, to improve the shape and to decrease its compaction. These modifications help to increase the size of the remaining berries, which is an important quality factor in table grape production. These measures also help prevent the attack of fungi such as Botrytis cinerea (Vail et al. 1998; Shavrukov et al. 2004; Hed et al. 2011).

The rachis is the dorsal spine of the cluster, which comprises a main axis and several lateral branches (shoulders). The shoulders are further divided into secondary and tertiary branches. The last branch sub-division corresponds to the pedicel, the structure that harbours each individual berry. At flowering time, this structure forms an inflorescence called the panicle (Pratt 1971). The last branch sub-division corresponds to the pedicel, the structure that harbours each individual berry. It has been proposed that genotype is an important factor in determining the size and the architecture of the grapevine cluster (Shavrukov et al. 2004). However, research on the cluster architecture and its genetic variation is currently scarce (Shavrukov et al. 2004; Fernandez et al. 2010). In fact, there have been no systematic studies undertaken to identify the defining characters and the genetic determinants of the grapevine cluster architecture.

The study of genetic variation (i.e. genetic variance or genetic diversity) and the heritability of traits of interest are important to define whether an improvement is possible and what materials to use to achieve a better outcome (Poehlman 1986; Falconer and Mackay 1996). An interesting approach to study the genetic structure of any trait is the use of quantitative trait loci (QTLs), which are segments of the genome where there exists a higher probability of finding genes that are responsible for traits of interest (Collard et al. 2005). QTL study and the observation of associated allelic diversity (molecular markers) have accelerated the identification of those genotypes with higher probabilities of expressing the desired traits. This practice is particularly useful when the trait of interest cannot be measured because it has not yet been expressed (for example, reproductive traits in the seedling stage) or it is too difficult to evaluate in a consistent way (as is the case with disease resistance) (Collard et al. 2005; Costantini et al. 2007). One of the main objectives of the use of molecular markers in applied research in agriculture has been the construction of genetic maps for several species of economic interest. Genetic maps have been used to identify QTLs related to traits of interest (Collard et al. 2005). QTL mapping is based on the principle that genes and markers segregate by recombination (“crossing over”) during meiosis, affecting the variation or the variance of quantitative traits in the resulting progeny or segregating population (where there is a mixture of parental and recombinant genotypes), allowing analysis by statistical approaches (Mackay 2001; Collard et al. 2005). Many QTLs for several traits have been identified in different grapevine progenies. The main interest in these studies has been to identify the genetic determinants related to traits associated with disease and pest resistance (Marino et al. 2003; Zyprian et al. 2003; Doucleff et al. 2004; Fischer et al. 2004; Krivanek et al. 2006; Xu et al. 2008). Some effort has been made to identify genetic determinants associated with the quality of the berry (berry size, berry seedlessness) (Doligez et al. 2002; Zyprian et al. 2003; Fischer et al. 2004; Fanizza et al. 2005; Mejía et al. 2007; Costantini et al. 2008; Cabezas et al. 2006). There have also been studies undertaken on other berry quality attributes, such as aroma profile and anthocyanin content (Eibach et al. 2003; Fournier-Level et al. 2009), fertility-associated traits (Doligez et al. 2010) and phenological stages (Costantini et al. 2008; Duchêne et al. 2012), among others. As very little is known about the genetic determinism of cluster architecture in the grapevine, the aim of this work was to analyse the genetic variation of traits associated with this organ architecture. The main traits analysed were rachis and shoulder length, rachis fresh weight and main axis internode number. These studies were performed using progeny derived from table grape varieties (n = 137) evaluated during three seasons. We compared clusters grown on plants receiving gibberellic acid at flowering time to those not receiving this hormone, which is used commercially for cluster enlargement in table grape production. The traits selected showed a clear genetic basis and can be considered good candidates for determinants of cluster architecture due to their importance in determining the genotypic variance.

Materials and methods

Plant material and treatments

This study was performed during three growing seasons (2009–2010 to 2011–2012) at the Chilean Institute of Agriculture Research, La Platina Center (33°34′20″S; 70°37′32″W; 630 m.o.s.l.), on F1 progeny (n = 137 segregants) of a controlled cross of ‘Ruby Seedless’ × ‘Sultanina’ (R × S) and obtained through embryo rescue. Note that each season begins approximately in September of year 1 and ends in May of year 2. These progenies were cultivated in an overhead trellis system (pergola type), with three plants per segregant grafted in 2006 onto adult plants of the ‘Sultanina’ variety that were used as rootstocks. Watering, fertilisation and pest/disease control followed standard conditions used in a commercial orchard. For every genotype, three inflorescences were treated by immersion before flowering (panicle length: 12–15 cm) with a solution of 10 mg L−1 of gibberellic acid (GA3) (GibGro® 20 %) plus 0.1 mL L−1 of siliconate coadjuvant (Break SL®). An equivalent number of control clusters were treated with water plus coadjuvant.

Phenotypic evaluation

To facilitate the different measurements, phenotypic evaluations were performed 2 weeks before veraison—the point at which the rachis stops its elongation (Coombe 1995). We based the phenotyping on Costes (2003) and Segura et al. (2006, 2007) works dealing with apple tree architecture and on Shavrukov et al. (2004) related to cluster architecture in grapevine. Immediately after the harvest of samples, all the traits were measured manually with the exception of rachis fresh weight and berry fresh weight, which were measured with an electronic balance. The latter was used to estimate the total number of berries per cluster and per shoulder by using the fresh weight of 50 berries. A semicircular protractor was used to measure the angles in degrees (°) between the main axis and the first and second shoulder. Peduncle diameter was measured by a vernier calliper. In summary, 23 traits and subtraits were measured in treated and control clusters during the first season (Online Resource 1 and Fig. 1). A multivariate factor analysis based on the 23 traits evaluated was performed, using principal components as the method of factor extraction, percentage of variance explained and latent root as criteria of factor extraction, and VARIMAX orthogonal rotation to interpret the factorial results. With this analysis, subsets of co-varying variables were extracted in a few conceptual and orthogonal variables called factors (Acquaah et al. 1992). To supplement this exploration, the relationships among these traits and among the genotypes were analysed by a correlogram according to Pearson’s correlation coefficient and cluster analysis, respectively. The average linkage method and the Euclidean distance measure were used according to the results of the cophenetic correlation coefficient to complete the cluster analysis. Genotypic effects and genotype × GA3 interaction were calculated and used for this purpose; analysis of variance with mixed linear models was employed, as shown below.

Fig. 1
figure 1

Schematic diagram representing the architecture of cluster at fruit set of Vitis vinifera L. 1ir, 2ir and 3ir: first, second and third internode of the main axis or rachis, respectively; 1is, 2is and 3is: first, second and third internode of the second shoulder, respectively; a: angle between the main axis and the second shoulder

During the second (2010–2011) and third (2011–2012) seasons, the multivariate first-factor and GA3 responsive traits were measured according to multivariate factor analysis as indicated for the first season. To test the effect of each genotype on each trait given by genotypic variance (random genotypic effect) and their possible interaction with the GA3 treatment (fixed effect) at each season, mixed linear models were proposed. The genotypic effect was defined according to a linear additive model:

$$y_{ijk} = \, \mu \, + g_{i} + t_{j} + \, \left( {g \times t} \right)_{ij} + \, \varepsilon_{ijk} ,$$
(1)

where y ijk is the phenotypic value measured for the trait y on the cluster k of the genotype i with treatment j; μ corresponds to the overall mean; g i is the random effect of genotype i representing the effect of each genotype or genotypic effect on trait y; t j is the fixed effect of GA3 treatment with two levels (0 and 10 mg L−1); (g × t) ij is the random interaction between genotype i and treatment j; and ε ijk is the random residual error per cluster k of the genotype i with treatment j. The significance of the effects was assessed using the Akaike and Bayesian criteria and tested by likelihood ratio (Sillampää and Corander 2002; Segura et al. 2007). Broad-sense heritability (h 2) was estimated according to REML variance components according to Holland et al. (2003), Kawamura et al. (2011) and Duchêne et al. (2012).

The seasonal effect on phenotypic expression of the selected traits treatment was analysed by repeated measurement analysis of variance using the plant as the experimental unit (subject factor) and considering the genotype × season interaction.

The complete set of analyses was performed using the statistical software R (R Development Core Team 2012) and its corrplot (Wei 2011) and lme4 (Bates and Maechler 2009) packages for correlogram and linear mixed models, respectively.

Genetic evaluations

The Chilean Institute of Agriculture Research (INIA) developed a genetic map using a segregating population of R × S consistent in 137 siblings, with the aim of studying the genetics of various quality traits. The consensus genetic map was based on 272 markers (SSRs or microsatellites, AFLPs, gene-based SNPs, phenotypic marker and SCARs), with SSRs previously mapped in other crossings being the most abundant (Online Resource 2). SSRs and AFLPs were analysed by silver-stained polyacrylamide denaturant gel electrophoresis, as described by Mejía et al. (2007). Gene-based SNPs and their alleles identification was done by silver-stained SSCP gels as described by Guerrero (2007). SCARs were designed after RAPD markers and evaluated by agarose gels as described (Mejía et al. 2007). Nomenclature and primer sequences were obtained from the National Center for Biotechnology Information (NCBI) and Sefc et al. (1999), Di Gaspero et al. (2000) and Merdinoglu et al. (2005). Other newly developed SSR markers were designed from the Vitis vinifera PN40024 reference genome sequence (Jaillon et al. 2007). The genetic/linkage map of R × S was built as described by Mejía et al. (2007).

QTLs of the phenotypic means of each selected trait under each GA3 treatment, as well as some of their relationships with each other (Table 1), were identified using the consensus map via nonparametric Kruskal–Wallis test with a stringent significance level (p value) of at least 0.005. Interval mapping was followed by the multiple-QTL model (MQM) procedure using the MAP QTL 4.0 software (van Ooijen et al. 2002). The LOD genome-wide error thresholds used to declare the existence of a QTL (significance level of 0.05) were determined with the same software using 1,000 permutations. Interval mapping analysis was performed with a step size of 1 centiMorgan (cM) to find regions with potential QTLs, i.e., where the LOD score was greater than LOD thresholds. For each putative QTL given by interval mapping, the closest markers to the peak of the LOD profile were tested using the automatic cofactor selection procedure to select cofactors significantly associated with each trait at a p value of <0.02. Markers accepted as co-factors were used to perform a MQM procedure with a step size of 1 cM. Each significant QTL was characterised by its maximum LOD score, its percentage of explained variation and its confidence intervals in cM corresponding to the maximum LOD score −1 and −2 units on either side of the LOD peak. The location of each QTL detected on each linkage group in the consensus map was plotted using the program MapChart (Voorrips 2002). Allelic effects of QTLs were calculated according to Segura et al. (2007).

Table 1 Ratio among traits used for QTL analysis

Search for candidate genes

In each QTL, the search for candidate genes was done in the genomic region corresponding to the confidence interval determined on the consensus map. The scrutinised sequence was limited by the most proximal SSR or SNP markers that were present both in the reference genome and in the consensus map. The genes were selected based on the information available for the annotated reference genome (Genoscope 12×) of the quasi-homozygous line 40024 derived from Pinot Noir (http://www.genoscope.cns.fr/externe/GenomeBrowser/Vitis/; Jaillon et al. 2007). The genes were classified according to their biological function registered in the database. The genes catalogued as with “unknown function” or equivalent were not considered for further analyses. In addition, a gene ontology (GO) enrichment analysis was performed considering the genes identified in the physical genomic regions associated with the confidence interval for each QTL, comparing the frequency found in each QTL vs. the complete reference genome and searching for a possible enrichment in gene functions that could be related to cluster architecture. All enrichment analyses were done with the agriGO tool (http://bioinfo.cau.edu.cn/agriGO), with the options singular enrichment analysis and complete GO. Significant GO terms (p value <0.05) were calculated using hypergeometric distribution and Yekutieli multi-test adjustment method (Du et al. 2010).

Results

During the 2009–2010, 2010–2011 and 2011–2012 seasons, a phenotypic and genetic characterisation of traits related to cluster architecture was carried out on a R × S (F1: n = 142) crossing conducted in overhead trellis systems. Only 137 of the 140 segregants of this progeny were analysed, according to the fertility of the plants. In the 2009–2010 season, a phenotypic characterisation of the 23 traits was performed with the purpose of exploring the cluster architecture, including analysis of genotypic and genotype × GA3 interaction effects, estimation of trait heritability and analysis of both the relationships among traits and relationships among the genotypes (correlation and cluster analysis respectively). During the following two seasons, only eight traits were evaluated, according to a selection based on their correlation pattern. Furthermore, to decipher their underlying genetic structure, a QTL mapping and analysis was carried out.

Phenotypic and genotypic variation

During the first season of evaluations, a trait selection was carried out according to the correlations among the 23 traits measured (Online Resource 1) to simplify the experimental procedures and measurements during the following seasons. According to mixed linear models, the phenotypes of all characters had a significant genotypic effect. Only a few of the measured traits showed response to GA3 (rw, rl, s2l and tb) (Online Resource 1). From these results, the genotypic effect of each genotype on each of the traits and the effect of genotype × GA3 were computed via the ranef() function of lme4 package of R after treatment by the best linear unbiased prediction (BLUP).

Relationships among traits

The Pearson correlation for the genotypic effects of the traits associated with the architecture of the cluster showed several correlations of high value (r > 0.6). The most important finding corresponded to the correlations among the lengths of the first and the second shoulders, the rachis fresh weight, the total number of berries per cluster and the number of nodes of the central axis (r > 0.75). In the case of genotype × GA3 effects, these variables were correlated except for the interaction given for the total number of berries. These results are summarised in a correlogram (Fig. 2) that shows a characteristic pattern of correlations between the different traits.

Fig. 2
figure 2

Correlogram for the different traits evaluated for cluster architecture of Vitis vinifera L. The correlogram was based on the Pearson’s correlation coefficient (r) of BLUP values of genotype effects of the 23 traits and the interaction genotype × GA3 of rachis fresh, rachis length, second shoulder length and total number of berries per cluster (rw GA, rl GA, s2l GA and tb GA, respectively). Each trait was ordered in terms of its eigenvalues obtained from principal component analysis. Each cell of the matrix represents the r value by colours. Right bar indicates the r values. Red negative value (−1 ≤ r < 0), blue positive value (0 < r ≤ 1), white values near to 0 (r ~ 0) (colour figure online)

To analyse these correlations, a multivariate factor analysis was performed. With respect to this analysis, 27 factors were obtained as an initial solution, of which the first five factors explained ~60 % of the total variance (up to ~90 % if considering the first 16 principal factors). According to the percentage of variance explained and latent root criteria, nine principal factors were extracted. After the VARIMAX rotation, the first nine principal factors explained 77.1 % of the total variance. The characteristic of each of these factors is shown in Online Resource 3.

The first major factor (Factor 1) explained 27.8 % of the total variance and was given (correlated) by the following variables (factor loadings > |0.5|): rachis length (rl), rachis fresh weight (rw), second shoulder length (s2l), first shoulder length (s1l), peduncle diameter (pd), number of nodes of the central axis (nr) and the number of berries on the second shoulder (bs2) (Online Resource 4). This factor accounted for approximately 80 % of the variance of these variables (communalities, Online Resource 4). Because it was more associated with the rw, nr, s2l and pd (factor loadings > |0.7|), this factor was labelled as “Traits related to the cluster size”. Using the same approach, the other factors were named as shown in Online Resource 4.

Relationships among genotypes

With the intention of representing the relationship among the traits, a cluster analysis based on genotypic and interaction effect was carried out. The combination of Euclidean distance with average linkage method had a higher cophenetic correlation coefficient (0.73). Four groups of genotypes (clusters) with at least 75 % of similarity were found. These clusters were as follows: cluster A or ‘Ruby’ type (n = 12 genotypes, including the maternal parent ‘Ruby’); cluster B or ‘Sultanina’ type (n = 91, including the paternal parent ‘Sultanina’); cluster C or ‘Intermediate’ (n = 22); and cluster D or ‘Outlier’ (n = 12). The common phenotypic characteristics of the individuals in each of these groups are summarised in Fig. 3.

Fig. 3
figure 3

Clustering of the ‘Ruby Seedless’ × ‘Sultanina’ progeny based on the different traits evaluated. Four cluster groups were identified and shown by several methods: a by a heat map with the standardised means of random effect (BLUPs) for each trait during the 2009–2010 season; and b by representative genotypes of each group. Clusters without gibberellic acid treatment (−GA3) and treated with 10 mg L−1 of gibberellic acid (+GA3) are shown. Vertical dashed lines are 40 cm long

During the 2010–2011 and 2011–2012 seasons, the traits included in the first factor and the GA3-responsive traits were measured as described for the first season according to multivariate factor analysis. These traits corresponded to rachis fresh weight and length, peduncle diameter, length of the first and second shoulders, number of nodes on the central axis and total number of berries per cluster.

Genotypic variation and heritability across the seasons

As in the first season, the subsequent seasons revealed the importance of genotypic effect and also that not all traits responded to GA3 (Online Resource 1 and Table 2). Additionally, in each season (Table 2), all traits had a significant genotypic variance (p ≤ 0.05), indicating that there is a real effect of different genotypes on the various measured traits (genotype effect). Regarding the relative contribution of genotypic effect as given by the percentages of REML variance, their effect on the different characters was on average medium to low (~39 %) (Online Resource 1 and Table 2). The heritability in turn had medium to high values (h 2 ~ 70 %). The interaction between genotype and GA3 treatment for the responsive traits contributed <20 % of the total variance. All traits showed a clear dependence on the season (Table 3), accounting for about ~35 % of phenotypic variation, regardless of whether they were treated with GA3 or not. This effect was mainly given by the interaction genotype × season, indicating that genotypes respond differentially to environmental conditions. Despite this result, the genotypic effect was significant and was always greater than the effect of the season and its interaction with genotypes. The phenotypic distribution of some traits across seasons and treatments is shown in Fig. 4.

Table 2 Descriptive statistics, heritability and percentages of REML variance in the total phenotypic variance for selected traits related to cluster architecture of progenies of the Vitis vinifera L. ‘Ruby Seedless’ × ‘Sultanina’ cross during 2009–2010, 2010–2011 and 2011–2012 seasons
Table 3 Relative importance of season effect on phenotype of traits related to cluster architecture of progenies of the Vitis vinifera L. ‘Ruby Seedless’ × ‘Sultanina’ cross
Fig. 4
figure 4

Phenotypic distribution for some traits of cluster architecture of Vitis vinifera L. across seasons. Histograms of phenotypic mean of each genotype are shown. The white boxes indicate the clusters without gibberellic acid treatment (−GA3); the black boxes indicate clusters treated with 10 mg L−1 (+GA3) gibberellic acid. Traits depicted are as follows: a rachis fresh weight (g); b rachis length (cm); c number of nodes of the central axis

Genetic structure

With the purpose of analysing the different genetic determinants associated with the cluster architecture, quantitative trait locus (QTL) analyses of the phenotypic means of each selected trait under each GA3 treatment, as well as some of their relationships measured during the 2009–2010, 2010–2011 and 2011–2012 seasons, were performed.

Genetic mapping

According to these results, a consensus linkage/recombination map was built with the aim of saturating the genome and therefore reducing both the mean intermarker distance and gaps over the different linkage groups (LGs). The main features of the framework map for the progeny R × S are summarised as follow: the consensus map consisted of 272 markers distributed on the 19 V. vinifera linkage groups, covering a total map length of 1,334.4 cM with an average intermarker distance of 4.9 cM. The longest LG corresponded to the LG18, harbouring 40 markers distributed over 136.8 cM (Fig. 5).

Fig. 5
figure 5

Genetic map and profile of LOD values for detected QTLs of the ‘Ruby Seedless’ × ‘Sultanina’ progeny. The black boxes represent intervals of QTLs for cluster traits without gibberellic acid treatment; the grey boxes represent cluster traits with 10 mg L−1 gibberellic acid (GA3)

QTL analysis

According to nonparametric analysis (Kruskal–Wallis rank sum test per marker) and the MQM procedure, 19 QTLs were detected on LG5, LG8, LG9, LG14, LG17 and LG18, with 6, 2, 1, 3, 1 and 6 QTLs, respectively. These QTLs were grouped in ten bounded genomic regions. Each of these QTLs was closely linked with 15 markers or cofactors (Table 4 and Online Resource 5). On average, these QTLs had LOD values of ~5.3, with the results ranging from 4.1 to 6.6. Their genomic threshold was approximately 4.4. The traits with more QTLs were number of nodes of central axis (nr) and total number of berries per cluster (tb), each one with four QTLs in total, both with and without GA3 treatment. In contrast, only LG5, LG14 and LG18 harboured QTLs showing stability after treatment with GA3 for Peduncle length (pd), nr and tb, respectively. The QTLs detected by this procedure are listed in Table 4, and their confidence intervals are represented in Fig. 5. According to the variance explained, the main QTL (corresponding to rachis length) was found on LG9, close to the microsatellite VMC2D9 locus. On average, these QTLs had a moderate effect, explaining approximately 17 % of the total variance (ranging from 12 to 24 %). The majority of these QTLs were determined mainly by the male additive effect, which almost doubled the female effect. Only two QTLs had an important maternal or female effect, both detected under GA3 treatment. Although, on average, there was no visible difference between the additive and dominance effects, there were some QTLs determined mainly by a dominance effect.

Table 4 QTL for traits related to cluster architecture detected via multiple-QTL mapping (MQM), estimated from genotypic means of 137 F1 progeny from the cross ‘Ruby Seedless’ × ‘Sultanina’

Candidate genes identification

At a first glance, 1,173 genes underlying the 19 detected QTLs were identified (Online Resource 6), a significant fraction of which (282 genes corresponding to 24 %) were catalogued as “unknown protein function” (Online Resource 7). Of these 1,173 genes, 613 potentially coding genes were associated to at least a single GO term (level 3). The genes identified in the physical genomic regions associated with the confidence interval for each of the 10 QTLs for rachis architecture were scrutinised and compared to the frequency found in the complete reference genome for each Gene Ontology term. With this approach, 48 GO categories were overrepresented (Online Resource 8), which suggested a significant increase in their frequency in comparison to the whole reference genome. Indeed, significant enrichments were found in five confidence intervals, encompassing diverse functional categories such as lignin metabolic process, laccase and lyase activities, phenylpropanoid metabolic process as well as coenzyme and cofactor binding. A complete list of significantly enriched GO terms is presented in Online Resource 9. A significantly higher percentage of over-represented GO terms were observed for the gene set included in the LG18-QTL3 confidence interval, raising 19 categories under Biological Process and Molecular Function classifications (Online Resource 10). Therefore, GO analysis provided complementary evidence to support the selection of candidate genes (CGs), which was done on the 891 annotated genes. Of these genes, we selected the 50 most promising ones according to their annotation (Table 5). The selection of these genes was mainly, but not exclusively based on their biological function already described in model plant species such as Arabidopsis, rice and poplar. These genes were unevenly distributed among the 19 QTLs, with 1 to 14 genes per QTL (Table 5). These CGs were classified into six major groups, corresponding to water transport (mainly aquaporins), cell division (cyclins and related genes), cell wall remodelling [including pectinmethylesterase (PME), polygalacturonases (PG), pectate lyase (PE), expansins (EXP), laccase (LAC), among others]; genes induced by or related to hormonal responses of the auxin and gibberellin pathways; transcription factors such as TCP13, AGL8, AGL11, BIM1 and the peptide BEARSKIN1; lipid synthesis, including a phospholipid transporter and two genes directly involved in its synthesis. Finally, a gene related to floral induction (GIGANTEA) located in one of the LG18 QTLs was included in this list of candidate genes because of its possible role in organ (rachis) development.

Table 5 Candidate genes selected for each QTL that could participate in the cluster architecture traits determination

Discussion

Genotypic effect

In grapevine, in a similar way as has been shown in other species including pea, rice, Arabidopsis and rose (Nomura et al. 1997; García-Maroto et al. 2000; Nakagawa et al. 2002; Okada and Shimura 1994; Goosey and Sharrock 2001; Kawamura et al. 2011), genotype is indicated as an important factor in determining the size and architecture of the inflorescence and later cluster (Boss and Thomas 2000; Shavrukov et al. 2004; Fernandez et al. 2010). The results of this study are consistent with that hypothesis: it was found that the genotypic effect given by genotypic variance was significant for all the traits selected to describe the architecture of the cluster, regardless of the season. Therefore, it would be possible to rely on these traits to improve the architecture of the grapevine cluster either through conventional breeding and/or by marker-assisted selection (MAS). However, the latter method requires the breeder to consider the heritability of the traits of interest. A trait with a low heritability will present an ineffective selection response (Dudley and Moll 1969; Poehlman 1986; Piepho and Möhring 2007). Typically, a heritability value >50 % is considered to be high and a value <20 % is considered to be low (Stanfield 1991). In the grapevine, broad-sense heritability (h 2) has been estimated to be approximately 96 % for the soluble solids content of berry, 88 % for the berry fresh weight and 75 % for the titratable acid of berry juice (Schneider and Staudt 1979). In addition, we have estimated an h 2 of ~92 % (confidence interval 90–96 %) for seed dry weight during the 2009–2010 and 2010–2011 seasons in a particular genetic background (unpublished data). In other perennial plant such as the rose, h 2 for the inflorescence architecture ranged between 82 and 93 % (Kawamura et al. 2011). These values exceed those for the traits associated with the architecture of the grape cluster evaluated here, which had on average an h 2 of ~65 % (Online Resource 1 and Table 2). The central axis length (rl), however, had a value of 75 %; this trait could therefore be useful for strategic breeding. In contrast, the medium to low heritability found for some traits would complicate their use in selection and improvement, especially for those with very low genetic contribution, such as first and second shoulder angle (percentage of variance explained by genotypic effect <15 %). These h 2 magnitudes and ranges were similar to those found by Segura et al. (2007) for architectural traits of the apple tree. The broad ranges found here most likely result from the low number of cluster replicates. Season effect made an important contribution to the phenotype of these traits, on average accounting for more than 30 % of the observed phenotype. This contribution was particularly significant for rachis fresh weight (rw) and number of nodes of central axis (rn), for which this effect was as important as genotypic effect (Table 3).

Gibberellic acid response

The results with regard to possible interaction of genotype with GA3 were determined by linear mixed models (Online Resource 1 and Table 2) along the different seasons. The results are consistent with what has been known traditionally for pre-bloom applications of gibberellin in grapevine, i.e. stimulating the growth of the clusters results in loose or less compact clusters—due mainly to the elongation of the rachis (Weaver and McCune 1959; Weaver et al. 1962). The significant effect of genotype × GA3 represented by its variance demonstrated the existence of genetic diversity within the tested population that segregates in its response to GA3, mainly shown by the length of the central axis, total number of berries per cluster and the rachis fresh weight. Genetic diversity in the response to GA3 in length of the grapevine rachis was previously well known, the concentration of gibberellin used for this purpose being dependent upon the grapevine variety (Winkler et al. 1974). It is also known that this response is more marked with applications performed in earlier phenological stages (Weaver and McCune 1959). These factors may be involved in the effect of genotype × GA3 interaction observed, which would be shown by the differences in response (some genotypes respond constitutively to GA3, while others do not), differences in the degree of this response and differences in the timing and concentration of GA3 used.

In this work, a profuse production of small seedless berries was observed in some of the GA3-treated clusters and especially in the genotypes of larger rachises and seeded berries (not quantified). This phenomenon was also observed by Pratt and Shaulis (1961), who found that gibberellin application to two seeded varieties (‘Fredonia’ and ‘Concord’) during flowering time increased the fruit setting by stimulating the production of parthenocarpic berries and delaying or completely inhibiting their abscission, regardless of whether pollination had occurred (emasculated plants). According to Weaver and McCune (1959), one of the side effects of pre-flowering applications of gibberellin in seeded varieties is the production of parthenocarpic berries. The earlier the treatment is performed, the more does the effect increase in intensity. Considering that seeded genotypes were evaluated within the segregating population (Mejía et al. 2007), this effect could explain in part the increase in fruit observed with the application of GA3 as measured by the total number of berries per cluster (tb). In this work, all types of berries were considered in the estimation of tb, including “shot berries” (small seedless berries with approximately 2 mm in equatorial diameter).

Mapping and analysis of QTLs

QTLs were located on linkage group 5 (LG5), LG8, LG9, LG14, LG17 and LG18 (Table 4). LG5 presented six QTLs for the traits rw, pd and some trait ratios (rl/nr, s1/rl, rw/nr), some of them with overlapping confidence intervals (co-located) (Fig. 5 and Online Resource 5). The latter could explain the correlation among these characters, which would be in part due to pleiotropic effects or close linkage. We emphasise that other QTLs have been found in LG5 for cluster weight and number of berries per cluster (Fanizza et al. 2005), and also number of clusters per shoot (Doligez et al. 2010).

With the grapevine genome sequence being available (Jaillon et al. 2007; Velasco et al. 2007), the next step would be a direct search for candidate genes to be identified within these confidence intervals. To further this aim, we focused our exercise on directed local saturation of the preliminary genetic map available (unpublished data). We focused specifically on LG5 due to the presence of co-localised QTLs for rachis length (rl), second shoulder angle (s2l) and rachis weight (rw) during the first growing season. On the previous genetic map, LG5 only had seven markers. On the current genetic map, this LG has 19 markers in total. After the identification of these markers, the search for specific sequence or single nucleotide polymorphisms (SNP markers) that can explain or be associated with the observed phenotypic variance could also be undertaken.

The presence of overlapping QTLs between GA3-treated and untreated condition found in non-responsive traits could indicate that those QTLs were stable or did not respond to GA3. This is the case for the number of nodes of central axis (nr), which did not respond to GA3 and showed the same marker and genetic region in both conditions. In contrast, those QTLs found in different genetic regions and under different treatment conditions in GA3-responding traits could correspond to QTLs interacting with GA3.

The occurrence of QTLs with an important paternal additive effect and the fact that the paternal parent ‘Sultanina’ traits were located in the cluster (“Male” group, with a 75 % similarity) with the largest number of genotypes indicate that the bulk of the segregants had a higher similarity to ‘Sultanina’ than to the maternal parent ‘Ruby Seedless’. This result could be explained by the pedigree of the parents, as ‘Ruby Seedless’, a “grandchild” of ‘Sultanina’, presents a relationship of 25 % to the pollen donor (Mejía et al. 2007), causing the additive genetic contribution given by ‘Sultanina’ to largely exceed the contribution by ‘Ruby Seedless’.

Heritability is one of the factors that affect the detection of a QTL (Bernardo 2004). As expected, most of the mapped QTLs corresponded to those characters with greater genetic determination or heritability. However, QTL detection is not always related to the heritability value (Segura et al. 2007). Another factor involved in the QTL detection is the proportion of phenotypic variance explained by the locus or genetic effect of QTL on the trait (Darvasi et al. 1993; Collard et al. 2005). Another factor to consider is the population size. Increasing the population size provides gains in statistical power for QTL detection and analysis (Darvasi et al. 1993), a condition that could explain why few QTLs were detected—the power of QTL detection is low when a population size such as our sample (<200) is used (Collard and Mackill 2008). This handicap is common to many mapping exercises in woody species; we expect to overcome it a few years from now because we are developing a mapping population of ca. 900 segregants using parents with contrasting phenotypes for cluster architecture.

One characteristic of a QTL is that the environment strongly influences both the detection and the effect (amount of variance of a character given by the segregation of a particular QTL) of the QTL (Korol et al. 1998; Collard et al. 2005). Moreover, the latter would be the rule in tree species (Asíns 2002). The comparison among conditions, according to heritability estimated and genotype × season interaction, showed an important effect of season (Tables 2 and 3). In this work, there was a tendency: if heritability was high and genotype × season interaction was low, the QTL effect will be high to moderate (percentage of variance explained), depending on the trait considered. The latter is evident in the case of rachis length vs. peduncle diameter; the first trait has in average a higher heritability (Table 2), a lower genotype × season interaction (Table 3) and a higher QTL effect (Table 4) than the second one. For grapes grown in temperate regions, the reproductive cycle and inflorescence development are completed for two consecutive growing seasons separated by a recess period between fall and spring (Galet and Morton 1988; Carmona et al. 2002; Calonje et al. 2004). Thus, there is a long period during which several environmental factors could affect the development of the inflorescences. Some of the most important climatic factors for inflorescence differentiation correspond to the light and temperature exposure of the fruiting bud during the season prior to flowering (Srinivasan and Mullins 1981; Sánchez and Dokoozlian 2005). For example, these factors have an impact on the size of primordia and dry weight of inflorescence (Sánchez and Dokoozlian 2005). The best approach to understand the environmental impact and review the stability of a particular QTL is to assess the traits of interest under different environments. In evaluating the genomes of woody plants such as vines, researchers are usually faced with a series of evaluations in different seasons and the need to increase the population size to reduce the false negative rate in QTL detection. In this study, the effect of the season made a particularly important contribution to the expression of selected traits, regardless of whether they were under the influence of GA3.

These factors together, in addition to others, would explain the evasive nature of the genetic nature of cluster architecture. According to this study, these considerations should be taken into account in future studies to improve the genetic characterisation of genomic features associated with the architecture of the cluster of grapes.

The ideal grapevine inflorescence architecture in the case of table grapes consider aspects such as the minimal requirements of hand labour and the convenience to have a “round-shaped” cluster which is more prone for long-distance transportation. Furthermore, it is desirable to have less compact clusters resulting in less contact area between the berries, which in turn increases the size of the berries and reduces fungal infections. A target architecture should consist of a long peduncle and long internodes, their shoulders and berries being low in number per cluster (<10 and <150, respectively) with open angles. We have identified QTLs that could be to some extent related to that ideal shape (number of nodes along the rachis; total number of berries and their relation to rachis length; distance between nodes) and a list of candidate genes that could be useful for breeding purposes. In any case, these genes could become selection tools only after identifying polymorphisms that could be associated with the phenotype of interest in a defined genetic background. This approach most probably will include a group of selection markers, instead of the more commonly described approach of using a single selection marker (Mejia et al. 2011).

Candidate genes for the cluster architecture

The GO term enrichment analysis in QTL confidence interval is a novel and useful approach to discover possible enrichment in gene functions that could be related to traits under study (reviewed by Drinkwater and Gould 2012). One limitation is based on the available Gene Ontology annotation for plant species such as grapevine that is quite limited in comparison to model plants such as Arabidopsis thaliana (Monclus et al. 2012). Indeed, in the case of Vitis reference genome which encompasses 26,346 genes, only 14,511 are currently associated with at least one GO term. In spite of that, the search for GO terms is very useful when the list of candidate genes is too large to be considered individually for further analyses, as happened in this case (1,173 identified genes).

We selected 50 out of the 891 genes annotated in the confidence intervals corresponding to the complete set of 19 QTLs distributed in six linkage groups (Online Resource 5). We decided this selection based on the annotated function for each gene. However, considering the poorness in the curation of the current version of the automatic annotation of the grapevine genome (A.-F. Adam-Blondon, personal communication), using this approach we could be missing some relevant genes that may have not been annotated in the targeted areas covering ca. 20.3 Mbases (Online Resources 6 and 7). This situation would not be improved until a better annotation of the grapevine genome is afforded. Most of the 50 candidate genes correspond to six different functional groups, including water transport, cell division, cell wall remodelling, hormonal response, transcription factors and lipid synthesis. Some of these genes are described in the following.

Aquaporines are key proteins for plant growth and development because of their role in water transport into the cells (Maurel and Chrispeels 2001). This could be the case during rachis elongation, a biological process that could determine the size and architecture of the cluster (Table 5). Aquaporins have been classified based on their sequences in four sub-families: plasma membrane intrinsic proteins (PIPs), tonoplast intrinsic proteins (TIPs), nodulin 26-like intrinsic proteins (NIPs) and basic plasma membrane intrinsic proteins (Chaumont et al. 2005). These proteins have been postulated as key factors for the development of plant organs in general, participating in cells elongation and expansion. Then, we could expect differential expression level or changes in functionality between alleles corresponding to each phenotype, a matter to be evaluated in further transcriptomic and gene-directed studies. We found three of these genes (two PIPs and one NIP) among the identified QTLs (Table 5).

The second group is related to cell wall structure and metabolism proteins. Any gene that could be related to cell wall would be a candidate gene for tissue elongation. The cell wall is composed of ca. 90 % polysaccharides made of three main structural components: cellulose, a matrix of intercrossed glycans (hemicellulose) and pectin-related polysaccharides (Goulao et al. 2012). The remaining components correspond to structural glycoproteins, phenolic esters and minerals. Xyloglucans are the predominant hemicelluloses in the primary cell wall of dicots. Pectins are embedded in the cellulose–hemicellulose lattice, forming hydrophilic gels that provide mechanical strength to the cell wall (Goulao et al. 2012). In Arabidopsis the gene BEARSKIN1 (BRN1) is related to transcription factors (TFs) of the VND/NST family, and these TFs have been related to the synthesis of the secondary cell wall in tissues or anatomical parts such as xylem, interfascicular fibres and anthers (Bennett et al. 2010). Pectate lyase is an enzyme involved in the degradation of the cell wall through the breakage of esterified pectins, one of the main components of the higher plants cell wall. A recent report shows that a pectate lyase present in the cell wall is associated with berry texture and cluster size (Vargas et al. 2013).

The identification of a number of TFs in various QTLs attracts our attention, considering their pivotal role in the control or coordination of many cellular events by modulating the expression of groups of genes. We identified a few TFs that we suspect could be relevant to the growth of rachis based on the available evidence mainly from Arabidopsis. This is the case of BIM1 that plays a key role in this model plant’s growth and development, participation in cellular expansion, proliferation and differentiation in young tissues (Chandler et al. 2009). Two other TFs identified were the MADS-Box AGAMOUS family AGL8 and AGL11, whose role could be related to the regulation of floral meristem development (Rounsley et al. 1995). Another member of this gene family, AGL11, has been recently demonstrated to be the key gene determining the seed content in grapevine (Mejia et al. 2011), but this gene also explains over 30 % of the phenotypic variance of berry size, a trait that we are now associating with cluster structure. In contrast, it was recently reported that the gene TERMINAL FLOWER1 (TFL1), well known as a key factor regulating the inflorescence development in Arabidopsis, would also regulate the cluster shape or architecture (Fernandez et al. 2010); however, this gene mapped in a region of LG17 where we did not find any QTL related to cluster architecture. This could be explained in different ways. For instance, the genomic organisation could be different between ‘Carignan’, the genotype harbouring the TFL1 mutation, and the table grape genetic background considered in this study. In this respect, we have recently found that the ‘Sultanina’ genome has a certain number of intra- and interchromosomal translocations in comparison to the PN-40024 grapevine reference genome (Di Genova et al. 2014). The most probable explanation, however, is that there is no trait variation associated with this gene in the studied R × S population. Alternatively, individual table grape genes can exhibit a different organisation, as we have observed in a number of cases when comparing transcripts of individuals of this mapping population vs. the reference genome (Muñoz-Espinoza et al. manuscript in preparation).

The identification of the most relevant genes related to rachis architecture would impact heavily on the future breeding of table grape cultivars, considering both the necessity to reduce the use of pesticides and the raising of the labour costs related to the hand-made cluster management.

Conclusions

The cluster architecture in a grapevine, represented by an R × S segregating population, is mainly determined by the length and the fresh weight of the rachis, the number of internodes, the peduncle diameter and the length of the lateral shoulders. There is genetic diversity in the response to GA3, mainly manifested in the length of the central axis, the number of berries and the rachis fresh weight. Furthermore, season has an important effect on the expression of these traits, regardless of whether the cluster is under the influence of GA3. The QTLs determining these traits and their response to GA3 are distributed on six LGs. The correlations found between these traits can be partially attributed to the pleiotropic effects and to a close linkage between the QTLs. A set of 50 genes underlying these QTLs are proposed as possibly related to the cluster architecture phenotype determination.