Introduction

One of the largest and most diverse of plant genera, Rubus contains between 600 and 800 species, including blackberry and red and black raspberry, is distributed world-wide (Thompson 1995) and has a long history of use for food and medicinal purposes (reviewed by Hummer 2010). Biologically active (bioactive) compounds found in red and black raspberry have been extracted for individual evaluation, in either animal models or in vitro assays, for their health-promoting characteristics including the reduction of cell proliferation in rat models (Chen et al. 2011), anti-inflammatory effect on ulcers (Montrose et al. 2011), and reduction of blood cholesterol levels (Ash et al. 2011). These reports, and others, have led to the marketing of bioactive compounds as ingredients in “functional foods”, that is products reported to have a specific health benefit (Espín et al. 2007; Finley et al. 2011).

Anthocyanins (ACY) are a class of water-soluble flavonoids that are derived from phenylalanine and stored in the plant vacuole. They provide plant tissues with a range of colors from orange/red to violet/blue, are widely distributed in flowering plants (Holton and Cornish 1995; Tanaka et al. 2008), and are among the bioactive compounds found in Rubus plants. Although the ACY profiles of cultivated red raspberry (Rubus idaeus L.), black raspberry (R. occidentalis L.), tetraploid blackberry and hybrid berry (Rubus sp.) fruit are the best characterized of the genus (Connor et al. 2005; Dossett et al. 2010; Jennings and Carmichael 1980; McGhie et al. 2002; Scalzo et al. 2008; Torre and Barritt 1977), the ACY biosynthetic pathway has yet to be examined in Rubus. However, the biosynthetic pathway of ACY production appears to be well conserved in Arabidopsis thaliana, maize, snapdragon, and petunia (reviewed by Holton and Cornish 1995), as well as in apple (Takos et al. 2006), and many of the genes encoding the enzymes involved are conserved in both sequence and function (reviewed by Grotewold 2006; Holton and Cornish 1995; Tanaka et al. 2008).

Several studies have been conducted to determine the predominant ACY compounds in Rubus species (Connor et al. 2005; Cooney et al. 2004; Dossett et al. 2010, 2011; Jennings and Carmichael 1980; Määttä-Riihinen et al. 2004; Ozgen et al. 2008; Seeram et al. 2001; Torre and Barritt 1977; Tulio et al. 2008). Black raspberry fruits are characterized by two xylose-containing pigments, cyanidin 3-O-sambubioside (C3Sb) and cyanidin 3-O-2G-xylosylrutinoside (C3XR), and overall high ACY concentrations (200 mg/100 g fruit) (Ozgen et al. 2008; Tulio et al. 2008); whereas red raspberry fruits are characterized by the presence of cyanidin 3-O-sophoroside (C3S), cyanidin 3-O-2G-glucosylrutinoside (C3GR), and overall lower ACY concentrations (60 mg/100 g fruit) (Jennings and Carmichael 1980; Torre and Barritt 1977). Other ACY compounds found in Rubus fruits are cyanidin 3-O-rutinoside (C3R), cyanidin 3-O-glucoside (C3G), and pelargonidin 3-O-rutinoside (P3R) (Dossett et al. 2011; Kassim et al. 2009; Torre and Barritt 1977). Each of the cyanidin-containing compounds is a variation on the molecule cyanidin 3-O-glucoside, resulting from the enzyme-catalyzed addition of different sugar moieties, either glucose, rhamnose or xylose (Grotewold 2006; Holton and Cornish 1995). Pelargonidin 3-O-rutinoside (P3R), and cyanidin 3-O-glucoside (C3G) have similar structures, with the basic pelargonidin and cyanidin three-ring molecule differing by the presence of a second hydroxyl on the cyanidin B-ring 3′-carbon, the addition of which is catalyzed by flavanone 3′ hydroxylase (F3′H) early in the biosynthetic pathway (Grotewold 2006; Holton and Cornish 1995).

Four previous studies have been conducted to detect QTL in a population of Rubus idaeus ‘Latham’ × ‘Glen Moy’, both red-fruited cultivars. An assessment of fruit development from bud break to ripeness identified QTL involved in stages of fruit ripening and cane height associated with regions on linkage groups (LG) 2, 3, 5 and 6 (Graham et al. 2009). Two QTL involved in the variation of the accumulation of eight specific ACY compounds were mapped to regions of LG 1 and 4 (Kassim et al. 2009). Two candidate genes were strongly associated with the QTL mapped by Kassim et al. (2009), a putative bHLH transcription factor on LG1 and a putative bZIP transcription factor on LG4. QTL involved in the variation in fruit color during ripening and the accumulation of total ACY were mapped to LG 2, 3, 4, and 6 (McCallum et al. 2010). Candidate genes associated with the QTL for total ACY accumulation mapped by McCallum et al. (2010) were a putative TIP small molecule transporter on LG2; and dihydroflavonol 4-reductase (DFR), along with flavonol synthase (FLS) both on LG4. Each of these studies identified unique QTL, although the QTL on LG3 show some overlap, and the QTL associated with marker RubFruitE4 on LG4 are in common between the studies of McCallum et al. (2010), and Kassim et al. (2009), although for different traits. The most recent study assesses QTL associated with total ACY content along with other fruit quality traits (Dobson et al. 2012). This study identified two QTL associated with total ACY on LG2 and LG6 with the QTL on LG2 associated with the candidate TIP transcription factor that is also underlying the QTL mapped by McCallum et al. (2010). The linkage group nomenclature used in these four studies follows Graham et al. (2004). These four studies provide insight into QTL of several fruit quality traits in a single population of red raspberry progeny over several years, as well as in different locations and growing conditions.

New genetic linkage maps constructed from a progeny of black raspberry (Rubus occidentalis) 96395S1 (deep purple fruit), and red raspberry (R. idaeus) ‘Latham’ (red fruit), have been developed recently (96395S1 × ‘Latham’) (Bushakra et al. 2012). These maps, deliberately enriched for markers derived from orthologous sequences and candidate genes coding for enzymes involved in the polyphenolic biosynthetic pathway, allow the association of candidate genes with QTL for variations in amounts of fruit ACY compounds in a new population and previously unstudied species of Rubus. Improving our understanding of the genetics controlling the production of and interactions between levels of ACY compounds will assist in the development of marker-assisted breeding (MAB) to improve the efficiency of breeding new Rubus cultivars with the most beneficial balance of compounds for human health. To this end, fruit ACY phenotype data were collected over three seasons and analyzed for marker associations using the ‘Latham’ (red raspberry) and 96395S1 (black raspberry) genetic linkage maps (Bushakra et al. 2012) to identify QTL.

Methods

Plant material

The diploid purple raspberry mapping population consists of single, non-replicated population of 500 full-sib F1 individuals derived from an inter-specific cross between parents of contrasting fruit color, Rubus occidentalis selection 96395S1 and R. idaeus ‘Latham’ (Bushakra et al. 2012). The pedigree of each parent is described by Bushakra et al. (2012). The population is located at the Plant & Food Research (PFR) site near Motueka (41°058′S, 172°584′E). A subset of 155 individuals (Bushakra et al. 2012) was selected for ACY extraction from fruit and QTL analysis.

Marker nomenclature and linkage groups

Markers were designed previously, and Tables 1 and 2 provide information on the sources of the orthologous sequence used for primer design (Bushakra et al. 2012). Rubus linkage groups (RLG) are named to correspond with the diploid Fragaria vesca LG nomenclature (Bushakra et al. 2012).

Table 1 Derivation of markers mapped on Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ that were designed from transcription factors and genes for enzymes involved in the polyphenolic biosynthetic pathway (Bushakra et al. 2012)
Table 2 Derivation of markers mapped on Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ that were designed from genes from pathways other than the polyphenolic pathway and other sequences (Bushakra et al. 2012)

Sample collection, extraction, preparation, and analysis

In each of the 2009, 2010 and 2011 harvest seasons, field-grown fruit was collected three times, once per week over 3 weeks to represent the typical harvest season for raspberry in New Zealand (mid-December through early-January). Individuals were arranged in rows and after planting were assigned a unique identification number (EA501–EA701). Plants were grown following standard New Zealand commercial raspberry practices.

Fruit samples were collected on ice and then stored at −20 °C until processing. The three samples collected per individual were pooled for processing, either by whole fruit solvent extraction (2009), or by juice extraction (2010, 2011) methods. In 2009, the first 200 F1 individuals that were producing fruit were selected for analysis. Of those 200 plants, 155 had sufficient fruit production accumulated over the three harvests for analysis. In 2010 and 2011, the fruit from all 500 F1 progeny were netted to protect the fruit from bird predation, and samples were collected from all individuals, with the subset of 155 F1 progeny being used for the QTL analysis.

Sample extraction, 2009: solvent extraction method

Frozen whole fruit samples were shipped overnight on dry ice from PFR, Motueka, New Zealand to PFR, Palmerston North for processing. The three replicates of fruit from each individual were pooled. Twenty grams, including seeds, of the pooled fruit samples, was homogenized with 100 mL ethanol/water/formic acid (80:20:1, v/v/v) using an Omni GLH homogenizer (Omni International, Marietta, GA) and stored at 1°C for 48 h before centrifugation at 1000 g for 10 min with a Jouan® Hema-C centrifuge (Jouan, Saint Herblain, France). An aliquot (180 μL) of the supernatant was transferred to a 96-well plate for analysis by reversed-phase ultra high-performance liquid chromatography (UHPLC).

UHPLC analysis of fruit anthocyanins, 2009

Analyses of ACY composition and concentration in individuals in the subset of 155 F1 progeny of 96395S1 × ‘Latham’, were performed on a Dionex Ultimate® 3000 Rapid Separation LC system equipped with a SRD-3400 solvent rack with four degasser channels, HPR-3400RS binary pump, WPS-3000RS thermostated autosampler, TCC-3000RS thermostated column compartment, and DAD-3000RS Diode Array Detector (Sunnyvale, CA, USA). A volume of 1.0 μL supernatant was injected into a Zorbax™ SB-C18 2.1 × 150 mm, 1.8 μm analytical column (Agilent Technologies, Santa Clara, CA, USA) and analyzed at a mobile flow rate of 0.5 mL/min with column oven temperature maintained at 45 °C. Solvents were (A) 0.5 % phosphoric acid in water, and (B) acetonitrile/water (80:20 v/v); a binary gradient elution was performed as follows: isocratic at 90 % A:10 % B (0–0.5 min); linear gradient to 85 % A:15 % B (0.5–3.5 min); linear gradient to 50 % A:50 % B (3.5–9.0 min); linear gradient to 100 % B (9.0–9.5 min), isocratic at 100 % B (9.5–10.5 min); linear gradient to 90 % A:10 % B (10.5–11.0 min). Total sample run time was 13.5 min and spectral data were collected for the entire run.

Compound quantification, 2009

Quantification of ACY extracted from the subset of 155 F1 progeny was carried out at 520 nm as cyanidin 3-O-glucoside equivalents (Mullen et al. 2003). Chromatographic data were collected and manipulated using the Chromeleon® Chromatography Management System version 6.8 (Dionex Corporation, Sunnyvale, CA, USA). External calibration curves were constructed for cyanidin 3-O-glucoside using standards from Extrasynthese (Genay, France). Components for which standards were not available were quantified using the standard curve of a related compound. The results are presented as μg/mL.

Sample extraction 2010 and 2011: juice extraction method

The three replicates of fruit from each individual from the F1 progeny and the parents (fruit from both parents collected in 2010, fruit from only ‘Latham’ collected in 2011) were pooled to give an average fruit weight per individual of 348 g. Juice was extracted using a manual juicer with fine-screen mesh to exclude pulp and seeds. The juicer was washed well and dried between samples. For each extraction, a 1.0-mL sample was collected from the total juice, transferred to a 1.5-mL Eppendorf® tube, and stored at −80°C until analysis. Juice samples were shipped overnight on dry ice from PFR, Motueka to PFR, Ruakura for analysis.

HPLC analysis of fruit anthocyanins: 2010 and 2011

Analyses of ACY composition and concentration for fruit from the subset of 155 F1 progeny of 96395S1 × ‘Latham’ were performed on a Shimadzu 20-series analytical HPLC with a column oven, auto-sampler, 4-channel vacuum solvent degas module and diode-array detector (Portland, OR, USA). A volume of 10 μL supernatant was injected into a 150 × 2 mm, Synergi Polar-RP, 4 μ particle size, 80 Å pore size analytical column, fitted with a Security-Guard 3 × 2 mm Polar-RP guard cartridge (Phenomenex, Auckland, New Zealand). Flow rate was 0.6 mL/min and column oven temperature 50 °C. Solvents were (A) methanol (neat) and (B) 2 % aqueous formic acid, and the initial mobile phase was 5 % (A):95 % (B). The time program for pump B concentration was set up as 92 % at 2.5 min, 82 % at 5 min, 72 % at 7 min, 58 % at 9.5 min, 45 % at 11 min, 30 % at 12 min, 20 % from 12.5–13.3 min, returning to 95 % at 13.8 min and holding at that concentration until the end of the run at 15 min.

Quantification of ACY extracted from the 500 F1 progeny was carried out at 520 nm, in comparison with standard solutions of cyanidin 3-O-glucoside (Mullen et al. 2003). UV/visible spectra were recorded from 245–600 nm in 1.2 nm steps. Results were expressed as mg cyanidin 3-O-glucoside equivalent/100 mL of juice.

Confirmation of compound identity, 2010 and 2011

To assist identification of compounds observed during HPLC runs, some samples were rerun on the same HPLC instrument, with the addition of a Shimadzu LCMS 2020, single quadrupole mass spectrometer, fitted with an electrospray ionization (ESI) source [Shimadzu Scientific Instruments (Oceania) Pty Limited, Auckland, NZ], following manufacturer’s default parameter settings. MS scans were carried out in both positive and negative modes in the same run, using a mass range of 140–2,000 Da. Confirmation of identity was achieved both through comparison of expected and observed molecular ion masses and “neutral losses” of sugars during ionization to leave aglycones or partially glycosylated ions. For example, cyanidin 3-O-glucoside exhibited a positive molecular ion at 449 Da and an ion at 287 Da, corresponding to loss of the glucose residue (162 Da), to leave the cyanidin ion.

Statistical analyses

The GenStat 14th edition software package (VSN International 2010) was used to analyze the phenotypic data. The mean concentration values for the 155 F1 progeny for all 3 years were calculated for each compound and within-year, pair-wise compound correlations were conducted using the data. A probability distribution plot was applied to verify the normality of the trait distributions, and the means, ranges, standard deviations (SD), skewness and kurtosis were calculated for these distributions. Transgressive segregants (those progeny whose values exceed or are less than the parental values by at least two SD) were identified for the compounds analyzed in 2010, as the parental data were available only for that year.

As the methods of analysis resulted in the presentation of data with different units for 2009 and 2010/2011, each compound was evaluated as a percentage of the total ACY for each year, to assess the proportion of each compound per year. Restricted maximum likelihood (REML) analysis was conducted on the mean percent data to access the trait variance contributed by year-to-year variation and by differences in genotype. The response variate was the trait’s percent of total ACY, the fixed model was constant, the random model was Year + Genotype + Year.Genotype, and there were 462 U (3 years × 155 individuals − 3 individuals with no data for 2011), with 3 U excluded for missing data.

QTL map construction

Markers for QTL mapping were selected from the parental linkage maps (Bushakra et al. 2012) with the goal of developing linkage groups with one marker every 2–5 centimorgans (cM) and including as many gene-based markers as possible (Tables 1, 2). Marker selection was based on map location, least missing data, best segregation ratio, and when possible, markers in common with other Rubus linkage maps. The linkage maps were generated using MapChart® 2.1 (Voorrips 2002).

QTL mapping

QTL analyses were conducted for the concentration of each ACY compound and for total ACY for each year using MapQTL® 5 (van Ooijen 2004). To limit the genotypic classes to two, QTL detection was performed on the parental maps (Bushakra et al. 2012). Logarithm of odds (LOD) thresholds for each map was estimated with a 1,000-permutation test for each trait and year. An interval mapping (IM) analysis was then performed to locate preliminary QTL. QTL with LOD scores greater than the genome-wide threshold of 95 % were considered significant. Markers with significant association with a trait were selected for use as cofactors in the Multiple QTL Method (MQM) to uncover the presence of minor QTL that might be hidden by large-effect QTL. A step size of 1 cM was used for both IM and MQM analyses. The square of the partial correlation coefficient (R 2) was used to calculate the phenotypic variance explained by a single QTL. The 95 and 99 % confidence intervals for QTL positions were obtained by taking the two positions left and right of the point estimate of the QTL for the LOD ± 1 interval, and the next two positions out, left and right, for the LOD ± 2 interval (van Ooijen 1992), respectively, for QTL detected with IM and with MQM.

Epistasis calculations

An evaluation of potential epistasis was conducted on the traits C3S, C3XR C3XR_C3R, and C3GR, as these traits had at least two QTL that were stable over 2009 and 2010. Simple fixed-effect models, whereby the particular ACY was modeled by the putative QTL (as fixed effects) and all possible interactions were fitted. Residual plots were checked for the presence of outliers and to ascertain the validity of the assumption of normality (Supplemental Fig. S1). All analyses and figures were undertaken in R 2.13.0 with the models using the asreml 3.0.1 (P. Alspach, PFR, pers. comm.).

Comparative gene analysis

The annotated genes from the syntenic region of Fragaria vesca LGII (1:4 Mb) were mined to identify candidates potentially involved in the expression of the large-effect QTL located at the top of ‘Latham’ RLG2. Gene models identified in Fragaria were compared with the Malus × domestica ‘Golden Delicious’ (Velasco et al. 2010) and Fragaria vesca ‘Hawaii 4’ (Shulaev et al. 2011) draft genome sequences. Over 400 gene models were represented. The best basic local alignment search tool (BLAST) matches for the gene models were then compared with The Arabidopsis Information Research (TAIR) database (http://www.arabidopsis.org) to validate the predicted gene function.

Results

Description of traits

Seven ACY compounds were detected in the F1 progeny in all 3 years and in both parents in 2010. Five of the compounds, co-eluting compounds cyanidin 3-O-glucoside and cyanidin 3-O-sambubioside (C3G_C3Sb), cyanidin 3-O-sophoroside (C3S), co-eluting compounds cyanidin 3-O-2G-xylosylrutinoside and cyanidin 3-O-rutinoside (C3XR_C3R), cyanidin 3-O-2G-glucosylrutinoside (C3GR), and pelargonidin 3-O-rutinoside (P3R) were detected in all 3 years, identified based on UHPLC or HPLC retention times confirmed with mass spectrometry data, and therefore were selected for QTL analysis (Table 3). In the samples from 2009 and 2011, compounds C3G and C3Sb co-eluted as a single peak (C3G_C3Sb), while in the samples from 2010, C3G only was detected. In the samples from 2009, compounds C3XR and C3R were detected as separate peaks, while in the samples from 2010 and 2011, the two compounds co-eluted as a single peak (C3XR_C3R). Therefore, the concentrations of C3G_C3Sb 2009 and 2011 and C3G 2010, as well as the sum of C3XR 2009 and C3R 2009 (C3XR + C3R) and co-eluting compounds C3XR_C3R 2010 and 2011 were used for within-year comparisons and the proportions of total ACY for each compound were used for between-year comparisons. As the co-eluting compounds are each considered as a single trait, five traits in total were considered for analysis. The two unknown ACY, present in trace amounts, were excluded from further analysis. Total ACY was calculated as the sum of all analyzed compounds for each year.

Table 3 Anthocyanin compounds extracted from purple raspberry fruit collected from 155 F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’, analyzed with ultra high-performance liquid chromatography (UHPLC) in 2009 and HPLC in 2010 and 2011, and used for quantitative trait locus (QTL) analysis

Relative proportions of total ACY content of each analyzed compound, calculated from the mean compound values of the subset of 155 F1 progeny, were similar for all 3 years (Table 4). C3GR had the highest overall proportion of total analyzed ACY in the progeny (33 %, 2009; 39 %, 2010; 37 %, 2011), followed by co-eluting compounds C3XR_C3R (sum of compounds 39 % 2009; 33 % 2010; 34 % 2011). The mean compound concentrations in 2010 and 2011 were not significantly different with the exception of C3GR, which was higher in 2010 (2010 81.7 mg/100 mL; 2011 71.1 mg/100 mL; p < 0.05). As the mean differences between 2010 and 2011 were largely not significant, all between-year comparisons are performed using data from 2009 and 2010 only.

Table 4 Mean proportions of anthocyanins detected in the F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ in 2009, 2010, and 2011

Weather data, compiled by MetWatch Online (http://www.metwatch.co.nz), for the typical New Zealand raspberry growing season (1 August through 8 January) for 2009, 2010 and 2011, are shown in Supplemental Table S1. Total rainfall (millimeters, mm), mean daily temperature (degrees Celsius, °C), and growing degree days (GDD, number of hours with day-time temperatures above the base temperature, in this case the base temperature is 7 °C) were all higher in 2010 than in 2009 and 2011.

Descriptive statistics

The probability distribution plot p-values performed on the concentration data indicate that the phenotypic distributions of co-eluting compounds C3G_C3Sb (2009) (Supplemental Fig. S2a), C3S (all 3 years) (Supplemental Fig. S2b), C3R (2009), C3XR (2009), co-eluting compounds C3XR_C3R (2010 and 2011) (Supplemental Fig. S2c), and P3R (all 3 years) (Supplemental Fig. S2e) deviated significantly (p ≤ 0.01) from normal (Table 5, Supplemental Fig. S2). The phenotypic distributions of ACY compound C3G (2010), co-eluting compounds C3G_C3Sb (2011) (Supplemental Fig. S2a), and C3GR (all three years) (Supplemental Fig. S2d) were approximately normal (p ≥ 0.05; Table 5).

Table 5 Mean phenotypic values and basic statistical parameters for the five traits analyzed in 155 F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ in 2009 and 2011, and the progeny and parents in 2010

In 2010, fruit collected from the black raspberry female parent, 96395S1, had higher concentrations of compounds P3R, C3G, and C3XR_C3R than the red raspberry male parent, ‘Latham’, which had higher concentrations of compounds C3S and C3GR (Table 5, columns 96395S1 and ‘Latham’). In 2010 and 2011, ‘Latham’ contained moderate amounts of C3S and C3GR, both red raspberry-specific compounds, as well as relatively low quantities of black raspberry-specific compounds C3XR_C3R and C3G, while in 2010, 96395S1 had trace amounts of red raspberry-specific C3S and C3GR along with concentrations of C3XR_C3R and C3G typical for black raspberry. Concentrations of C3GR were greatest of all individual compounds in all 3 years. In 2010 both parents had concentrations of C3GR less than all of the progeny, 68 of which had concentrations greater than two SD of ‘Latham’, which had the higher concentration of the two parents (Table 5). Transgressive segregants (TS) were identified for traits P3R 2010, with two TS (96395S1 9.5 mg/100 mL, ‘Latham’ 0.8 mg/100 mL; progeny range 0.3–18.3 mg/100 mL); C3S 2010, with 10 TS (96395S1 1.3 mg/100 mL, ‘Latham’ 34.6 mg/100 mL; progeny range 4.1–99.7 mg/100 mL); and C3GR 2010 with 68 TS (96395S1 3.0 mg/100 mL, ‘Latham’ 28.8 mg/100 mL; progeny range 32.1–178.8 mg/100 mL).

REML variance component analysis conducted on the mean percentage of each compound for each year suggests that the amount of variation contributed by differences between years has much less influence on compound proportion of total ACY accumulation than does genotype (Table 6). This observation is supported by the standard error (s.e.) of each of the components. The s.e. for component “year” is nearly the same as the variance of the component itself, whereas the s.e. for component “genotype” is 6–7 times less than the variation contributed by genotype.

Table 6 Variance components for five anthocyanin compounds analyzed for 3 years (2009, 2010, 2011) in 155 F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’

Pearson’s correlation coefficients calculated for each pair of compounds are shown in Table 7. A number of compounds correlated significantly with each other, with the strongest positive correlation (r > 0.70) between C3S and C3GR for all 3 years; the strongest negative correlation (r = −0.55) was between C3XR_C3R and C3S in 2010. In general, the correlation coefficients agreed between years. The results of the correlation analysis suggest that C3G (and C3G_C3Sb) is positively correlated with all other compounds, except a slightly negative correlation with C3GR 2009. Compound C3S has a strong positive correlation with C3GR and is negatively correlated with all other compounds, except a slightly positive correlation with P3R 2011. The correlations of compound C3GR generally agree with those of C3S with the exception of a positive correlation with both C3G_C3Sb and C3XR_C3R in 2011. All remaining compounds show a mix of positive and negative correlations relative to the other compounds. As some of the data distributions were non-normal, a Spearman’s ranking analysis was conducted and showed similar results as the Pearson’s correlations (Supplemental Table S2).

Table 7 Pearson’s correlation coefficient calculations and statistical significance (p value) for each pair of the six anthocyanin compounds analyzed for 2009, 2010 and 2011 in 155 F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’

Scatter plots for compound associations between C3S 2009 and C3G_C3Sb 2009, C3S 2009 and C3XR + C3R 2009, C3GR 2009 and C3XR + C3R 2009, C3S 2010 and C3G 2010, as well as C3S 2011 and C3G_C3Sb 2011 show two distinct groupings. The occurrence of subgroups was investigated and an association was detected between the genotype of the progeny at locus RubFruitE4 on ‘Latham’ RLG2, and the compound associations that segregate into two subgroups (Fig. 1; Supplemental Fig. S3). The two subgroups consist of 61 heterozygous (“lm”; Set A) and 94 homozygous (“ll”; Set B) individuals, where “lm” and “ll” represent allele combinations of marker RubFruitE4. Individual EA509 (“ll”) consistently groups with the phenotypes of Set A, while individuals EA611 and EA697 (both “lm”) consistently group with the phenotypes of Set B.

Fig. 1
figure 1

Scatter plot for compounds cyanidin 3-O-sophoroside (C3S) and cyanidin 3O-glucoside and cyanidin 3O-sambubioside (C3G_C3Sb) in 2009. The subset of F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ segregates into two groups according to genotype at locus RubFruitE4 on ‘Latham’ Rubus linkage group (RLG) 2. Set A (open diamond symbols) consists of 61 heterozygous (“lm”) individuals; Set B (solid cross symbols) consists of 94 homozygous (“ll”) individuals, where “ll” and “lm” represent allele combinations. The correlation coefficient for the entire data set is r = 0.08; the correlation coefficient for Set A is r = 0.57, and the correlation coefficient for set B is r = 0.85. Individual EA631 is an outlier of Set A. Arrows indicate progeny that group opposite of their genotype

When the Pearson’s correlation coefficient analysis was conducted on data Set A (Table 8) and Set B (Table 9), most of the correlations became positive, some strongly so, with r values as high as 0.85 for the correlation between C3G_C3Sb and C3S for 2009, in contrast to the finding of no significant correlation when the data were considered as a whole. The negative correlations between C3S 2009 and C3R 2009, and between C3S 2009 and C3XR 2009, when the data were considered as a whole, changed to non-significant correlations when the data were considered as separate sets.

Table 8 Pearson’s correlation coefficients calculated for each pair of compounds that segregated into two distinct groups in the F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’
Table 9 Pearson’s correlation coefficients calculated for each pair of compounds that segregated into two distinct groups in the F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’

A comparison of average, minimum, and maximum values of the four compounds that formed two distinct groups based on genotype at RubFruitE4, showed that though the compound concentrations overlapped, the mean concentration of Set B (“ll”) was higher for red raspberry compounds C3GR and C3S, and the mean concentration of Set A (“lm”) was higher for black raspberry compounds C3XR_C3R and C3G_C3Sb. A Student t test performed on the differences between the means of the two sets indicates that the differences are statistically significant (p < 0.001) for all compounds except C3S 2010 (Table 10).

Table 10 Comparison of Set A and Set B for the four compounds that segregated into two distinct groups in the F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ during correlation analysis

Linkage maps used for QTL analysis

The ‘Latham’ linkage map used for QTL analysis comprised nine LG with 69 markers spanning 533 cM. Markers per cM ranged from one marker every 2.4 cM (RLG3b), to one marker every 14.8 cM (RLG4), with an average of one marker every 7.8 cM. The largest gap was on RLG4 spanning 42.4 cM, followed by a gap of 31.4 cM on RLG6. RLG3 and RLG7 were both in two parts (designated a and b).

The 96395S1 linkage map used for QTL analysis comprised four LG with 20 markers spanning 161.5 cM. Markers per cM ranged from one marker every 5.2 cM (RLG5), to one marker every 11.1 cM (RLG3), with an average of one marker every 8.1 cM. The largest gap was 24.0 cM found on RLG6, followed by a gap of 19.7 cM on RLG3.

Identification of QTL

Individual compounds

QTL were identified using the mean trait concentration values for each individual progeny from each year separately on the parental genetic maps, the genotype data set for each individual compound, and IM followed by MQM. Markers RubFruitE4 and RiF2KP were selected for use as co-factors for MQM analysis conducted on the ‘Latham’ parental map; marker Ri60S was selected for use as the co-factor used for MQM analysis conducted on the 96395S1 parental map. Co-location of traits with markers was detected, at LOD scores greater than the genome-wide 95 % threshold, on all Rubus Linkage Group (RLG) except RLG4 (Supplemental Fig. S4). The 2009 phenotype data taken alone showed that all five traits had significant LOD scores associated with markers on the ‘Latham’ genetic map, while four traits (C3S, C3XR, C3G_C3Sb, and C3GR) had significant LOD scores associated with markers on the 96395S1 genetic map. Similarly, the 2010 and 2011 phenotype data, when each was taken alone, showed that all five traits had significant LOD scores associated with markers on the ‘Latham’ genetic map. On the 96395S1 map, traits C3S, C3G, C3GR, and P3R for 2010 had significant LOD scores associated with markers, while in 2011 traits C3XR_C3R, C3G_C3Sb, and C3GR had significant LOD scores associated with markers.

In total, 49 QTL were detected (Table 11; Supplemental Fig. S4), using IM followed by MQM, on all ‘Latham’ linkage groups except RLG4, and on 96395S1 RLG 3, 5, and 6. The 49 QTL can be summarized as 28 significant associations (21 on the ‘Latham’ map, and seven on the 96395S1 map), as QTL that mapped to the same interval, or with overlapping confidence intervals, in all 3 years are considered the same. Of the 21 QTL mapped to ‘Latham’, four (19 %) were stable over all 3 years, while of the seven QTL mapped to 96395S1, two (29 %) were stable in all 3 years; these six stable QTL are indicated in bold font on Table 11.

Table 11 Mapping and significance information for five anthocyanin compounds mapped in Rubus occidentalis 96395S1 and R. idaeus ‘Latham’, % variation explained per locus, and the nearest mapped marker

QTL were identified on seven of the nine RLG, varying from one each on ‘Latham’ RLG1, RLG7a and 96395S1 RLG6, to 12 on ‘Latham’ RLG2 (Supplemental Fig. S4). For the compounds analyzed in 2010, red raspberry-specific compounds C3S and C3GR, 50 % (5 of 10) of the loci displayed gene effects in the same direction as predicted by the phenotype of the parental lines, while 50 % of the loci showed opposite effects (Table 11). For traits C3XR_C3R and C3G_C3Sb, of which C3XR and C3Sb are black raspberry-specific compounds, predicted trait direction agreed with the parental phenotypes in 5 of 7 instances (71 %). Predicted trait direction for P3R was opposite of the parental phenotypes in both instances. One to six QTL have been identified for each trait, with the total phenotypic variation (R 2) explained by the QTL for each trait ranging from 0.07 (one QTL for P3R 2010 on 96395S1 RLG5) to 0.88 (four QTL for C3S 2010 on ‘Latham’ RLG 2, 3, 6 and 7). The largest cluster of QTL, representing all five traits over all 3 years, is located on ‘Latham’ RLG2 in association with marker RubFruitE4 and its LOD ± 2 confidence interval spans approximately 32 cM.

While the composition and overall proportions of the compounds were similar among years, the concentrations of the five ACY compounds analyzed varied widely among individuals in the progeny of interspecific cross 96395S1 × ‘Latham’ (Table 5). Table 12 summarizes the proportions of individual ACY in the subset of 155 analyzed progeny of 96395S1 × ‘Latham’ compared with published results for red, black, and purple raspberry. Cyanidin 3-O-rutinoside (C3R) and cyanidin 3-O-2G-xylosylrutinoside (C3XR) were the major ACY compounds identified in the black raspberry cultivars analyzed by others (Dossett et al. 2010; Tulio et al. 2008) comprising 39–90 % of the total ACY. The proportions of individual ACY in the subset of 96395S1 × ‘Latham’ analyzed progeny were more evenly distributed among the five identified compounds when compared with the other black raspberry cultivars (Dossett et al. 2010), but with a greater proportion of the co-eluting compounds cyanidin 3-O-glucoside and cyanidin 3-O-sambubioside (C3G_C3Sb) and smaller proportions of cyanidin 3-O-rutinoside (C3R), and co-eluting compounds cyanidin 3-O-2G-xylosylrutinoside and cyanidin 3-O-rutinoside (C3XR_C3R). The subset of 96395S1 × ‘Latham’ analyzed progeny has smaller proportions of cyanidin 3-O-sophoroside (C3S) and cyanidin 3-O-glucoside (C3G), with greater proportions of cyanidin 3-O-rutinoside (C3R) and cyanidin 3-O-2G-glucosylrutinoside (C3GR) when compared with ‘Latham’ × ‘Glen Moy’ (Kassim et al. 2009). The proportion of pelargonidin 3-O-rutinoside (P3R) was consistent among studies.

Table 12 Proportions of the total anthocyanin for each compound detected in the analyzed set of F1 progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ in 2009 and 2010. Proportions of anthocyanins from populations of black raspberry cultivars, red raspberry ‘Latham’ × ‘Glen Moy’ and purple raspberry from other studies are provided for comparison

Ranking of individuals

Figure 2 shows the relative accumulation of percent C3S per genotype with individuals ranked from lowest to highest percent for 2009 (data for other compounds are given in Supplemental Fig. S5). In general, rankings were consistent between 2009 and 2010 (those individuals with low accumulation in 2009 also had low accumulation in 2010). Percent accumulation in 2011 was the most variable and at the extremes appears to be opposite of the percent accumulation in 2009.

Fig. 2
figure 2

Ranking of individuals based on the percentage of cyanidin 3-O-sophoroside (C3S) of the total anthocyanin (ACY) accumulated in 2009. This graph suggests that, generally, percent C3S accumulation in 2009 (diamonds) agreed with percent C3S accumulation in 2010 (squares). Percent C3S accumulation in 2011 was the most variable (triangles), with the low and high extremes trending to the opposite of 2009

Total anthocyanin

QTL for total ACY were identified using the sum of the mean trait values for each individual progeny for each year separately on the ‘Latham’ parental genetic map, the genotype data set for total ACY, and IM. Two QTL were identified: one on ‘Latham’ RLG3 associated with marker RiF2KP in all 3 years (2009: LOD = 8.3, R 2 = 0.22; 2010: LOD = 5.1, R 2 = 0.14; 2011: LOD = 2.6, R 2 = 0.08) and one on ‘Latham’ RLG2 associated with marker RubFruitE4 (2010 only, LOD = 5.8, R 2 = 0.16) (Supplemental Fig. S6).

Candidate genes co-locating with QTL

On the ‘Latham’ parental map, two polyphenolic biosynthetic pathway gene-derived markers (RiF3′H and RiPAL) and four transcription factor (TF)-derived markers (RubFruitE4, RibHLH, RiMYB, RubFAC-003_MYB1) are associated with QTL on three different RLG (Table 11, Supplemental Fig. S4). No QTL on the 96395S1 parental map are associated with polyphenolic biosynthetic pathway gene- or TF-based markers.

Association of QTL with markers derived from polyphenolic biosynthetic pathway genes and transcription factors

Marker RiF3′H on RLG5 that was designed from flavanone 3′-hydroxylase (F3′H) sequence encoding the enzyme that catalyzes an early step in flavonoid metabolism, provides precursors for many classes of flavonoid compounds, and regulates the flavonoid pathway (Pelletier and Shirley 1996). On ‘Latham’ RLG5, QTL for P3R (2009, 2010) and C3GR 2010 co-located with the candidate gene-based marker RiF3′H. On ‘Latham’ RLG6, marker Ri4Co1, designed from the gene encoding 4-coumarate:coA ligase 1 which catalyzes the formation of 4-coumaroyl-CoA during the formation of flavonoids (Kuhn et al. 1984), is associated with, or is within the LOD ± 1 confidence interval of, the QTL for C3S 2009. On ‘Latham’ RLG7a, marker RiPAL2, designed from the candidate gene phenylalanine lyase (PAL) that encodes the enzyme for biosynthesis of a wide range of phenylpropanoid products from phenylalanine (Liang et al. 1989), is within 2 cM of marker Ri_5O21, as well as within the LOD ± 1 confidence interval of the QTL for co-eluting C3G_C3Sb 2009.

QTL for all traits analyzed on ‘Latham’ RLG2 co-segregated, with three recombinants (Fig. 1), with marker RubFruitE4, a putative bZIP TF (Kassim et al. 2009). On ‘Latham’ RLG7b, the marker RibHLH, designed from a TF similar to apple MdbHLH33, co-located with QTL for C3S (2009, 2010), co-eluting compounds C3XR_C3R 2010, C3G 2010, and C3GR 2011. Marker RiMYB, derived from a putative MYB TF, on ‘Latham’ RLG6 co-locates with, or is within the LOD ± 1 confidence interval of, the QTL for C3XR + C3R 2009. Marker RiMYB is just outside the confidence interval for QTL for total ACY and yield identified by McCallum et al. (2010).

Association of QTL with markers derived from additional gene sequences

On ‘Latham’ RLG3, QTL for traits C3GR (2009, 2010), C3G (2010) and C3G_C3Sb (2009, 2011), C3S (2009, 2010), and C3XR + C3R (2009) co-located with marker RiF2KP, designed from a regulatory enzyme-encoding gene (Draborg et al. 1999). On 96395S1 RLG3, marker Ri60S, which is similar to the 60S ribosomal protein L10 (Woodhead et al. 2010), co-located with three of the same four QTL (C3S, C3G, C3G_C3Sb, and C3GR) mapped on ‘Latham’ RLG3. Finally, on ‘Latham’ RLG6, marker RiRosCOS2381 co-located with QTL for C3G_C3Sb 2009, C3G 2010, and C3S 2010.

Comparative gene analysis of the top of RLG2

The genomic region surrounding the QTL associated with marker RubFruitE4 on RLG2 was examined for positional candidate genes, based on synteny with Fragaria vesca (alpine strawberry) (Bushakra et al. 2012) (Supplemental Fig. S7). The gene model prediction identified 409 gene models within the 4-Mb region of Fragaria LGII surrounding the homolog of Rubus marker RubFruitE4. Of these 409 gene models, 59 were considered as potential candidates for, or regulators of, genes involved in the phenylpropanoid biosynthetic pathway. Within this region, genes T51560 (probable flavonol 3-O-glucosyltransferase (EC 2.4.1.91), F2K13_200 [similarity]—Arabidopsis thaliana), and AAD17392.1 (putative glucosyltransferase [Arabidopsis thaliana]) are the most promising candidates, as their function is to catalyze the addition of sugar moieties to the cyanidin molecule.

Discussion

Individual anthocyanin concentrations in the F1 progeny of 96395S1 × ‘Latham’

The interspecific progeny of Rubus occidentalis 96395S1 × R. idaeus ‘Latham’ provides a genetic background in addition to the ‘Latham’ × ‘Glen Moy’ population (Graham et al. 2004) that can be used for analysis of the genetic and environmental controls of the ACY biosynthetic pathway and subsequent compound accumulation in the fruit of purple raspberry. The comparison of the 96395S1 × ‘Latham’ F1 progeny subset with other black raspberry, red raspberry, and purple raspberry populations suggests that the concentrations of individual ACY differ widely between individuals, although the proportions and composition of the compounds appear to be consistent within a population. This indicates that the proportions and compositions of ACY are under strong genetic control, while ACY concentration is more subject to environmental influences.

Pearson’s correlation coefficient analyses

The positive correlation between C3G and the other compounds suggests that the enzyme-mediated addition of sugar moieties to this base molecule to produce other compounds does not deplete C3G (Fig. 3). The strong positive correlation between C3S and C3GR is interesting as they are both end-products resulting from the enzyme-mediated addition of glucose. This positive correlation suggests that glucose is prevalent in the fruit (Kafkas et al. 2008).

Fig. 3
figure 3

Six cyanidin-based anthocyanin compounds found in red and black raspberry fruit. The addition of sugar moieties (above the arrows) modifies cyanidin 3-O-glucoside (C3G). C3S cyanidin 3-O-sophoroside, C3XR cyanidin 3-O-2G-xylosylrutinoside, C3R cyanidin 3-O-rutinoside, C3G cyanidin 3-O-glucoside, C3Sb cyanidin 3-O-sambubioside; C3GR cyanidin 3-O-2G-glucosylrutinoside

‘Latham’ Rubus linkage groups 2 and 7 and cyanidin modification

The comparisons conducted are based on markers in common between this and the other four Rubus QTL studies as presented in the introduction; however, we use the linkage group nomenclature which is based on the alignment of Rubus with Fragaria vesca (Bushakra et al. 2012). Dobson et al. (2012) conducted an analysis of the accumulation of total ACY in the ‘Latham’ × ‘Glen Moy’ progeny and identified two QTL on LG2 and LG6 (our RLG3 and RLG1, respectively). The analysis of individual compounds conducted in the same population by Kassim et al. (2009), identified two QTL on LG4 and LG1 (our RLG2 and RLG7, respectively). McCallum et al. (2010) present data based on Kassim et al.’s (2009) analysis of compounds that indicate that heterozygous loci at FruitE4 and at bHLH both independently influence the concentration of some of the compounds studied. Our study supports these QTL and validates that the QTL associated with markers RubFruitE4 and RubbHLH are involved in the sugar modification of cyanidin, rather than the production of total anthocyanins.

On RLG7, marker RubbHLH was designed from Rubus sequence similar to MdbHLH33, which encodes a component of the MYB/bHLH/WD protein complex involved in the regulation of the expression of ACY biosynthetic pathway genes and the subsequent accumulation of ACY in plant tissues (Allan et al. 2008; Baudry et al. 2004). On RLG2, apart from marker RubFruitE4 derived from a putative bZIP TF, which exhibited three recombinants with the phenotype (Fig. 1), no TF or biosynthetic gene-based marker co-segregated with the QTL. Comparative gene analysis of RLG2 with the homologous region of the Fragaria whole genome sequence (Shulaev et al. 2011) identified two genes that encode enzymes that may be responsible for the addition of sugars to the cyanidin molecule: a putative flavonol 3-O-glucosyltransferase (F3OGT) whose enzyme product is involved in the glycosylation of kaempferol (Tohge et al. 2005), and a putative flavonol 7-O-glucosyltransferase (F7OGT), the enzyme product of which catalyzes the transfer of a glucosyl group from UDP-glucose to the 7-hydroxy group of a quercetin molecule (Lim et al. 2004). These two genes, along with Rubus bHLH, would be candidates for gene expression analysis in tobacco (Espley et al. 2007), with in planta gene expression in Rubus progeny exhibiting the extremes of specific ACY compound composition.

Comparison of QTL identified in the progeny of 96395S1 × ‘Latham’ with those identified previously in the progeny of ‘Latham’ × ‘Glen Moy’

The QTL for total ACY accumulation in all 3 years that is located on ‘Latham’ RLG3 and associated with marker RiF2KP may be the same as the QTL identified by Dobson et al. (2012) (LG2 in their nomenclature). The QTL for total ACY accumulation in 2010 located on ‘Latham’ RLG2 and associated with marker RubFruitE4 were not identified by Dobson et al. (2012) and in the current study has a much lower LOD score and explains a much lower percentage of variation than the QTL for individual compounds. This finding could indicate that the large-effect QTL associated with marker RubFruitE4 on ‘Latham’ RLG2 may be associated with the modification of the base cyanidin molecule, rather than production of the cyanidin molecule, reinforcing F3OGT and F7OGT as candidates for enzyme-mediated sugar addition.

QTL for levels of eight ACY compounds [cyanidin 3-O-sophoroside (C3S), cyanidin 3-O-2G-glucosylrutinoside (C3GR), cyanidin 3-O-glucoside (C3G), cyanidin 3-O-rutinoside (C3R), pelargonidin 3-O-rutinoside (P3R), pelargonidin 3-O-sophoroside (P3S), pelargonidin 3-O-glucoside (P3G) and pelargonidin 3-O-glucosylrutinoside (P3GR)], were previously mapped in association with marker RubFruitE4 and marker bHLH on the linkage map of ‘Latham’ × ‘Glen Moy’ (Kassim et al. 2009). QTL for five (C3S, C3GR, C3G, C3R, and P3R) of these eight compounds have been identified in this study and are also associated with the locus RubFruitE4. Neither C3Sb nor C3XR were reported as being detected in the progeny of ‘Latham’ × ‘Glen Moy’ (Graham et al. 2004). In the current study, the greatest amount of variation in expression of all six traits (14.9–64.3 %) is explained by ocus RubFruitE4 on ‘Latham’ RLG2 (Table 10). This locus on the ‘Latham’ × ‘Glen Moy’ linkage map explains less (between 10 and 40 %) of the variation in expression of levels of the eight compounds (Kassim et al. 2009). The difference between these two studies in percent variation explained by this locus could result from the presence of black raspberry-specific xylose-containing compounds in the progeny of 96395S1 × ‘Latham’, as the ‘Latham’ × ‘Glen Moy’ progeny lack these compounds. The enzyme-mediated addition of another sugar moiety to the base cyanidin molecule is hence supported by this finding.

In the current study, markers RibHLH and RiFRK2 mapping within 3.2 cM of each other on ‘Latham’ RLG7b are associated with 6.8–11.2 % of the variation in expression of C3S 2009 and C3G 2010 in the set of 96395S1 × ‘Latham’ progeny analyzed, while locus bHLH on the ‘Latham’ × ‘Glen Moy’ linkage map explains 35–60 % of the variation in the expression of the eight compounds detected in the progeny of ‘Latham’ × ‘Glen Moy’ by Kassim et al. (2009). Compound C3XR was not reported as being detected in the progeny of ‘Latham’ × ‘Glen Moy’ by Kassim et al. (2009). The difference between these two studies in percent variation explained by this locus could be the result of the more consistent proportions of compounds observed in the progeny of 96395S1 × ‘Latham’ compared with ‘Latham’ × ‘Glen Moy’. For example, excluding P3R, the range of percent compounds in 96395S1 × ‘Latham’ is 10–39 %, whereas for ‘Latham’ × ‘Glen Moy’ progeny the range is 1.6–64 % (Table 12).

On ‘Latham’ RLG6, the QTL for C3XR + C3R 2009, C3XR_C3R 2010, and C3GR 2010 are all associated with, or within the LOD ± 2 confidence interval of, marker RiMYB, which was designed from Rubus sequence homologous to TF MdMYB10 (Lin-Wang et al. 2010) which, like bHLH, encodes a component of the MYB/bHLH/WD protein complex (Baudry et al. 2004). MdMYB10 was isolated from red-fleshed apple and is responsible for controlling ACY biosynthesis in apple flesh (Espley et al. 2007). Additionally, PavMYB10 from sweet cherry (Prunus avium) is located within the confidence interval for a QTL associated with fruit color on the ‘New York 54’ × ‘Emperor Francis’ consensus linkage map (Sooriyapathirana et al. 2010). RiMYB maps within 7 cM of candidate gene marker Ri4Co1 and within 5 cM of marker Rub2a1, which is shared between ‘Latham’ RLG6 and ‘Latham’ × ‘Glen Moy’ LG3. Marker Rub2a1 falls within the 95 % confidence interval of the QTL for total ACY identified by McCallum et al. (2010) and within 8 cM of Ri4CL1, which is derived from the same sequence as marker Ri4Co1 on ‘Latham’ RLG6.

QTL have been mapped to neither ‘Latham’ nor 96395S1 RLG4, nor its equivalent in the other Rubus studies. In summary, each linkage group, except RLG4, appears to carry one to several loci involved in the control of various aspects of fruit quality in Rubus spp. The markers that are associated with stable QTL (Table 10), for example RubFruitE4, RibHLH, RiRosCOS2381 and RiF3′H, would be good candidates for marker-assisted breeding (MAB) applications.

Conclusions

The accumulation of total ACY and the accumulation of individual compounds in the fruit of Rubus appear to be controlled by different regions of the genome, as QTL for total ACY are located on RLG2 and RLG3, while QTL that control accumulation of individual compounds are located on every linkage group except RLG4, with especially important QTL on RLG2 and RLG7. The cause of the differences in the production of cyanidin 3-O-2G-glucosylrutinoside (C3GR) by progeny, as demonstrated by the division of the analyzed progeny into two sets of individuals, indicates the need for closer examination of the genetic control of that trait. Additional comparative gene analysis of the region underlying the RLG2 QTL, as well as analysis of the forthcoming Rubus idaeus ‘Heritage’ genome sequence and gene annotation, will help to identify candidate genes that may encode the enzymes necessary to catalyze the reactions that produce individual ACY compounds.