Introduction

In flowering plants, genome size is highly variable and may dramatically change even between closely related species (Ohri et al. 2004; Bennett and Leitch 2005; Zonneveld et al. 2005) or within species (Kalendar et al. 2000; Greilhuber 2005). This raises questions on the mechanisms of genome size evolution and its role in adaptation and speciation. Several studies have been carried out to gain greater insight into the origin of such variation by assessing correlations with the geographical distribution of species (Bennett 1987) and many biophysical traits such as environmental physical parameters of habitats, e.g., temperature, latitude, and elevation (Knight and Ackerly 2002; Slovák et al. 2009). Ecological factors appear to be more important in shaping genome size at lower divergence levels (Eilam et al. 2007; Dušková et al. 2010) than at higher taxonomic levels (Jakob et al. 2004). In some cases, genome size variation was found to be related to phenotypic and/or life traits (Beaulieu et al. 2007; Knight and Beaulieu 2008). As a contribution to this debate, it was interesting to study a genus characterized by both high species richness and a wide geographical distribution.

The genus Coffea belongs to the Rubiaceae family and currently includes 124 species native to Africa, Comoros, Madagascar, Mascarenes, India, Papua New Guinea, and Australasia (Davis et al. 2011). Based on internal transcribed spacer or plastid or both sequences, closely similar geographic clades were obvious from molecular phylogenies (Lashermes et al. 1997; Cros et al. 1998; Maurin et al. 2007 and Anthony et al. 2010) approximately corresponding to west, west-central Africa, east Africa, and Madagascar plus Mascarenes. Coffea species from western Indian Ocean islands (currently accounting for almost half the genus) named Mascarocoffea were first identified as caffeine-free species (Chevalier 1947).

West plus west-central African and east African species have several distinctive features such as a strong reproductive barrier (Louarn 1993) and different cytogenetic characteristics (Noirot et al. 2003a; Hamon et al. 2009). Studies on Mascarocoffea morphology and phenology have led to their classification into eight (Chevalier 1947) then six (Charrier 1978) botanical series, namely Humblotianae/Mauritianae from Comoros and Mascarenes, and Garcinoides, Millotii complex, Multiflorae, Subterminales and Verae from Madagascar. All Coffea species (restricted in this study to the genus Coffea) are diploids with 2n = 2x = 22 (Bouharmont 1963; Louarn 1972), except for Coffea arabica, an amphidiploid with 2n = 4x = 44 (Bouharmont 1959; Lashermes et al. 1996, 1999). Previous studies performed on fresh leaves revealed 2 C DNA content variations (1.03–1.76 pg) among a set of 23 African Coffea species (Noirot et al. 2003a) and two Mascarocoffea species (Cros et al. 1995).

Our objectives were (1) to develop a methodology that permits reliable estimations of genome size and investigations on large panels; (2) to assess the genome size of a broad sample of Mascarocoffea and to investigate possible association with species geographic distribution, leaf parameters, and Coffea species relationships.

Material and methods

Plant material

The study includes a total of 416 accessions from 52 populations corresponding to 44 Mascarocoffea species (from Comoros, Mauritius, and Madagascar) and six African species. Leaves from wild Malagasy and Comoros species maintained at the Kianjavato FOFIFA station, Madagascar, were sampled between 2009 and 2010. The six African species used as references were sampled from IRD Montpellier greenhouses (France), while the Mauritius species came from the Coffea international collection maintained at the Station CIRAD - Armeflhor de Bassin Martin, Saint-Pierre (Réunion).

Fully developed leaves were sampled from one to 10 trees (depending on the availability) per species or population (when several populations were available). Half of each fresh leaf (FL) sample was lyophilized (LyL) and stored until use for flow cytometry, while the remaining halves were used for stomata measurements. A full description of the analyzed accessions is given in Table 1.

Table 1 Description of the Coffea populations studied and mean 2 C DNA content assessed on nuclei obtained from lyophilized leaves

DNA content estimation

DNA content was measured using flow cytometry at the Imagif Cell Biology platform (Gif-sur-Yvette, France) according to Marie and Brown (1993). Lycopersicon esculentum cv. Roma (2 C = 1.99 pg, 40.0 % GC; Coba de la Peña and Brown 2001) was used as internal standard. About 1 cm2 LyL of the studied species and one of the internal standards were chopped using a razor blade in a petri dish with 800 μL of cold Galbraith nuclear-isolation buffer (Galbraith et al. 1983), brought to 0.5 % (w/v) Triton X100 and supplemented with 10 mM sodium metabisulfite, 1 % polyvinylpyrrolidone 10,000 and 5 μg/mL RNAse (ref 10109 169001 Roche, Mannheim, Germany). The suspension was passed through a 48-μm mesh nylon filter. The recovered nuclei were stained with 50 μg/ml propidium iodide (PI) (Sigma-Aldrich, Saint-Quentin, France), an intercalating agent.

Lyophilized leaves were just imbibed in the buffer for a few minutes. Three extractions per tree and 10 trees per population were made. When less than 10 trees were available, independent extractions were made in order to have at least 10 measurements.

The DNA content of 5,000-10,000 stained nuclei was determined for each sample using a CyFlow SL3 flow cytometer (Partec, Sainte-Geneviève-des-Bois, France) with a 532-nm green solid-state laser (100 mW). Fluorescence emission of propidium iodide was collected through a 590-nm Longpass filter. Total 2 C DNA values were calculated on the basis of the linear relationship between fluorescent signals from stained nuclei of the unknown specimen and the known internal standard (DNA content in pg = [sample peak mode / standard peak mode] × 2C standard DNA content) (see Discussion).

Leaf parameters

Transparent nail polish was applied to the lower surface of mature fresh leaves (no stomata were observed on the upper surface) on the middle portion on either side of the midrib. This leaf region contains guard cell lengths (GCL) and stomata densities (SD) comparable to the mean of the entire leaf (Beaulieu et al. 2008). GCL was measured on two prints per leaf and five measurements per print. The SD (number per mm2) was recorded after counting the number of stomata per field of view in five random fields from two prints out of five fresh leaves per accession. Overall, the measurements concerned five accessions per population. Species values were then calculated from the arithmetic mean of the 50 (5 × 2 × 5) individual records. Measurements were recorded under an optical microscope (Leitz Dialux 22 EB) at × 200 magnification for GCL and SD. A micrometer was used for measurement calibration.

In addition to stomata records, the leaf length (LL) was measured and the arithmetic mean calculated from five adult leaves per accession picked on the third branching from the top. The arithmetic mean per population was then calculated and used thereafter. All measurements concerned Malagasy species from Kianjavato, Madagascar.

Statistical analyses

Linear regression was used to analyze: (a) correlations between 2 C DNA values from LyL and FL, and (b) relationships between the genome size and stomata parameters. A two-way nested ANOVA model was used to test variations in DNA content estimation due to series (fixed effects) and populations (random effects) nested in series. A three-way nested ANOVA was performed to test variations in physiological parameters (SD, GCL, and LL) due to population/taxa (fixed effect), accessions (random effects), and leaves nested in accessions (random effects). Comparisons between arithmetic means were further assessed by honestly significative difference (HSD) test.

To gain further insight into the Mascarocoffea structure, a principal component analysis (PCA) was performed using four variables (LL, GCL, SD, and 2 C DNA) and 44 populations. A hierarchical clustering (HC) was then performed on PCA coordinates using the WARD method. All statistical analyses were performed using R software 2.11.1, (Bretz et al. 2010, http://cran.r-project.org/).

Results

Lyophilized leaves are valid material for 2C DNA estimation

Mascarocoffea lyophilized leaves turned out to be suitable materials for flow cytometric analyses. Most measurements revealed coefficient of variation (CV) values of around 3 %, comparable with those generally obtained from fresh material. Overall, a very high linear correlation was obtained between measurements of FL vs. LyL of a set of eight Malagasy and six African species used as references (R 2 = 0.96; Fig. 1). Hence, for coffee trees, the relationship between 2 C values obtained from the two data sets (LyL vs FL) was as follows: 2 C value (in pg) LyL = 1.052 × 2 C value FL − 0.029. This was used to predict the 2 C DNA values that would be obtained using LyL from data obtained previously with FL of African species (Noirot et al. 2003a), (see Table S1 in additional files).

Fig. 1
figure 1

Linear regression between 2 C DNA content values for lyophilized and fresh leaves (eight from this study and six from Noirot et al. 2003a) of 14 species covering the range of 2 C DNA of African species. CAN C. canephora, CON C. congensis, EUG C. eugenioides, HET C. heterocalyx, HUM C. humilis, and PSE C. pseudozanguebariae

Mascarocoffea genome size variation

Mascarocoffea genome sizes (2C DNA values) were estimated from 416 accessions, corresponding to 52 populations representing 44 species. DNA content variation between populations within a species was not significant (Table 1). This allowed determination of 2 C DNA values per species by a simple arithmetic mean. Although 2 C DNA values ranged from 0.96 (Coffea mauritiana) to 1.41 pg (C. millotii ex Coffea dolichophylla), discrimination between all species, based only on genome size, was impossible.

The global 2 C DNA value distribution (this study and values calculated from Noirot et al. 2003a, Table S1) was bimodal and was obtained by overlapping two curves, one corresponding to Mascarocoffea and the other to African species (Fig. 2). Highly significant differences in genome sizes between Africa and Madagascar (1.43 and 1.20 pg, F1,75 = 59.5, p < 0.0001) were obtained.

Fig. 2
figure 2

2C DNA values distribution obtained for the two groups: Mascarocoffea (open) and African species (dark, from Noirot et al. 2003a normalized to lyophilized leaves according to the regression equation in Fig. 1)

Moreover, the Mascarocoffea 2 C DNA mean value was similar to that obtained for east Africa (1.19 and 1.18 pg, respectively) and lower than for west plus west-central Africa (1.50 pg, F1,74 = 147; p < 0.0001).

Regarding the habitat, species from dry environments had significantly bigger genomes in Africa (1.21 pg) than in Madagascar (1.13; F1,22 = 5.2; p = 0.03). However, comparable mean values were calculated for African species from dry habitat (1.21 pg) and all Malagasy species (1.20 pg, F1,53 = 0.09, p = 0.8) or species restricted to humid habitat (1.24 pg, F1,32 = 0.63, p = 0.43).

Among Mascarocoffea, significant 2 C DNA value differences were noted between the botanical series (F5,46 = 43.7; p < 0.0001). However, only the Mauritianae/Humblotianae plus Subterminales series (mean values: 1.08 ± 0.12 vs 1.10 ± 0.06 pg), the Multiflorae (1.20 ± 0.05 pg) and the Millotii complex (1.33 ± 0.06 pg) could be differentiated from one to another (HSD tests highly significant).

Geographical distribution of Mascarocoffea genome size

Regarding their biogeography, the Malagasy series were not evenly distributed (Fig. 3). Two (Multiflorae and Millotii complex) had a broad northeast to southeast range. Among Multiflorae species, Coffea perrieri, occurring in gallery forest, has the largest but most fragmented distribution resulting from past climatic changes and/or anthropic factors. Two series (Subterminales and Garcinoides) were mainly found in the north, while Verae was restricted to the southeast. Significantly different genomes sizes (F5,41 = 9.22, p = 0.0001) were obtained among Mascarocoffea species. Three clusters were evidenced according to their geographical distribution: northeast (1.12 ± 0.05 pg) plus northwest (1.14 ± 0.06 pg), east (1.27 ± 0.008) plus southeast (1.24 ± 0.09 pg) distinct from one another (HSD test highly significant, p < 0.001) and center plus south (1.17 ± 0.07 and 1.18 ± 0.04 pg) with intermediate values, statistically not different from the two clusters mentioned above.

Fig. 3
figure 3

Geographical distribution of the Mascarocoffea populations studied. The species codes used are as follows: ABA: C. abbayesii; AND: C. leroyi ex C. andrambovatensis; ANK: C. ankaranensis; AMB: C. millotii ex C. ambodirianensis; ARE: C. arenesiana; AUG: C. augagneuri; BET: C. betamponensis; BER: C. bernardiniana; BERT: C. bertrandii; BON: C. bonnieri; BOI: C. boiviniana; COS: C. leroyi ex C. costei; DAP: C. leroyi ex C. daphnoides; DOL:C. millotii ex C. dolichophylla; DUB: C. dubardii; FAR: C. farafanganensis; HEI: C. heimii; HOM: C. homollei; HUMB: C. humblotiana; JUM: C. jumellei; KIA: C. kianjavatensis; LAN: C. lancifolia; LER: C. leroyi; MAC: C. macrocarpa; MAN: C. mangoroensis; MOG: C. mogenetii; MAU: C. mauritiana; MON: C. montis-sacri; MIL: C. millotii; MYR: C. myrtifolia; PER: C. perrieri; PERV: C. pervilleana; RAT: C. ratsimamangae; RES: C. resinosa; RIC: C. richardii; SAHA: C. sahafaryensis; SAK: C. sakarahae; TET: C. tetragona; TSA: C. perrieri ex C. tsaratanensis; TSI: C. tsirananae; VIA: C. vianneyi; VAT: C. vatovavyensis; VOHE: C. mcphersonii ex C. vohemarensis

Finally, the results highlighted global trends (Fig. 4a): (a) the smallest genomes were native to Mauritius and Comoros; (b) in Madagascar, 2 C DNA values tended to increase from north to southeast; (c) considering the species habitat reported by Leroy (1972a, b), Charrier (1978), and Davis et al. (2006), (F Rakotonasolo, Royal Botanic Gardens, Antananarivo, Madagascar, ‘pers. Comm.’), genomes of the species that occur in dry forests (yellow boxes; Fig. 4a) covered the lower-middle part of the total size variation range, while genomes of species from humid forests (green boxes; Fig. 4a) mainly represented the middle-upper part of the genome size spectrum.

Fig. 4
figure 4

2C DNA content distribution. Box color indicates the habitat: in green, humid forest; yellow, dry forest; orange, dry environments (top of hills or shrub savanna); red, gallery forest. a For Mascarocoffea, species from Madagascar, Comoros and Mauritius, the species codes used are as given in Fig. 3 and Table 1. b For African species, data given here are those calculated for lyophilized leaves from Noirot et al. (2003a). The species codes used are as follows: ANT: C. anthonyi, BRE: C. brevipes, CAN: C. canephora, CHA: C. charrieriana, CON: C. congensis, COST: C. costatifructa, DEW: C. liberica var dewevrei, EUG: C. eugenioides, HET: C. heterocalyx, HUM: C. humilis, KAP: C. kapakata, LIB: C. liberica var liberica, POC: C. pocsii, PSE: C. pseudozanguebariae, SAL: C. salvatrix, SES: C. sessiliflora, STE: C. stenophylla, RAC: C. racemosa, SpCO: C. sp Congo, KOT: C. sp Koto, MAY: C. sp Mayombe, NGO2: C. sp Ngongo2, NGO3: C. sp Ngongo3, NKO: C. sp Nkoumbala. 2 C DNA values are in pg

Estimation of leaf parameters and its correlation with 2C DNA content

Differences among species for the three morpho-physiological parameters investigated (SD, GCL, and LL) were highly significant (F48,156 = 15.8, p < 0.0001; F44,146 = 51.6, p < 0.0001; F40,136 = 31, p < 0.0001 respectively). Correlations between morpho-physiological parameters and 2 C DNA content, and relationships among leaf traits were observed. A negative correlation was found between SD and GCL (r = -0.65, p < 2.2 10-16; Fig. 5a), but no correlation was obtained between SD and LL. A significant positive correlation was obtained between LL and 2 C DNA content (r = 0.48, p < 0.0001, Fig. 5b). There was a weak but significant correlation between GCL and 2 C DNA content (r = 0.18, p = 0.013, Fig. 5c). Despite some marked differences between species in stomata density, no correlation was found with 2 C DNA content.

Fig. 5
figure 5

Linear regression showing: a A negative correlation between stomata density and guard cell length; b Medium positive correlation between leaf length and 2 C DNA content; and c weak positive correlation between guard cell length and 2 C DNA content

Mascarocoffea organization

The Mascarocoffea organization was assessed from multivariate analyses based on both leaf parameters and 2 C DNA content. Two components explained 79.9 % of the generalized variance with 43.1 and 36.8 % due to the first and the second component respectively. The differentiation between species with long leaves and large genomes (mostly from southeast) and long guard cell length species (mostly from the north) was clear on the first PCA axis (Fig. 6 insert). While the second axis mainly differentiated species with high stomata density (from east).

Fig. 6
figure 6

Hierarchical clustering on factorial coordinates of a principal component analysis and WARD aggregation method. The four variables used are stomata density, guard cell length, leaf length, and genome size. The geographic origin (NE northeast, NW northwest, C center, S south, E east, and SE southeast as noted in Fig. 4a) and the habitat (D dry, H humid, and G gallery forest) of each population are indicated. Population codes are those used for Fig. 3 and Table 1

An HC (Fig. 6) performed on factorial coordinates revealed three clusters respectively defined by the following general trends: (1) high stomata density and medium-sized genomes (1.21 pg mean value), (2) large stomata guard cells associated with small leaves and genomes (1.13 pg mean value), and (3) long leaves associated with large genomes (1.33 pg mean value). When the geographical origin and habitat of species were reported on the HC, it appeared that species characterized by large stomata and generally small leaves and smaller genomes were mainly from the north and dry habitats, while species with long leaves and the larger genomes were mainly from humid habitats in the southeast. Species with high stomata density were mainly from humid habitats of east and southeast.

Discussion

Lyophilization, a reliable process to assess Coffea 2C DNA content

Generally, fresh plant material is requested for 2 C DNA content and/or ploidy estimation. However, as reported by Doležel and Bartoš (2005), this poses a strong limitation upon cytometry of material collected far from laboratories and with very large sampling. Hence, silica-dried material (routinely practiced in the field for sample preservation) has been used (Suda and Trávníček 2006). For plants with high content in phenolics, as coffees, such material is often oxidized and could therefore be inappropriate. For this reason we chose lyophilization.

Propidium iodide was used as an intercalary dye, as it is not sensitive to base composition. Taking 400 mg leaves (corresponding to 4 cm2 for Noirot et al. 2003a), Barre et al. (1996) concluded that PI saturation is obtained with 330 μg/mL, but that this tends to destabilize the DNA. We used only 1 cm2 leaf tissue and the typical concentration of 50 μg/mL PI, six-fold less than Barre et al. (1996). This gave reproducible relative peak positions for Coffea to standard (see the column CV in Table 2).

Noirot et al. (2005) demonstrated that temperature modifies chromatin-condensation and consequently PI accessibility, but the effects (increase/decrease PI accessibility) are not similar for Coffea and the standard. So, the practice for decades has been to work over ice with cold buffers, as done here. The level of Triton was raised from the typical 0.1 % (w/v) to 0.5 %. Cytosolic compounds (such as caffeine or phenolics, Noirot et al. 2003b) can also cause stoichiometric errors in the estimation of 2 C DNA content, possibly giving artefactual intraspecific variation (Noirot et al. 2000). Our buffer contained 1 % PVP to absorb polyphenols, and fresh 20 mM metabisulfite as antioxidant. The African species used to test the relationship between lyophilized and fresh material (Fig. 1) cover a large range of chlorogenic acids and caffeine content (Campa et al. 2005), while Malagasy are caffeine-free or display a low caffeine content (Rakotomalala 1992). They also have low levels of total chlorogenic acids (<2.5 % dry matter basis, Anthony et al. 1993). Yet the values of these biochemically diverse species apparently give the same relationship whether they are analyzed on lyophilized or fresh material, which is reassuring. Finally, as well demonstrated by Noirot et al. (2003 caffeine, 2005), there is no perfect solution to suppress all effects (negative and positive) having an impact on the genome size measurement. However, these authors clearly recommend that this “should not prompt scientists to drop genome size investigations using flow cytometry”. When estimating genome size for new species, as it is the case here, “it is still better to have a slightly biased estimation than no estimation at all”. We therefore used a linear relationship to convert the ratio of fluorescent signals to the genome size applying necessary precautions (Triton, PVP, metabisulfite, cold, an internal standard, and high replication).

Coffea genome size variation and leaf parameters

Regarding leaf parameters, for Coffea species, despite the absence of or poor correlation with genome size, it generally appeared that the smaller genome species from dry environments had smaller leaves with larger stomata, while larger genome species from humid habitats had longer leaves with a higher stomata density (Fig. 6). The positive correlations between genome and stomata sizes documented within eudicots, or between genome size and leaf shape and structure, reported across angiosperms, are seldom noted in trees (Hodgson et al. 2010). However, trees tend to have a small genome size and small and dense stomata, even though within tree samples there is no significant relationship between genome size and any of these physiological traits (study on 101 angiosperms, Beaulieu et al. 2008). One possible explanation could be that trees have a genome size range that is generally too small to detect such correlation. If genome size sets the minimum size of both guard cells and epidermal cells, the resulting change in stomatal density may predispose a species to a particular ecological and life history strategy. For Coffea, if the genome sizes were larger, there would probably be a more significant correlation between genome size and stomata size and density. This would probably also indicate that the species are well adapted to their growing environment and may not be or may be less able to colonize new niches. This last point would disagree with the assumption of recent speciation in Madagascar (Maurin et al. 2007).

Moreover, GCL is a complex trait that also depends on other factors such as hormonal regulation of plant growth and development, drought stress, and light quality. Moreover, Coffea genome size appeared also related to adaptive traits such as rainfall and water deficit (Cros et al. 1995; Noirot et al. 2003a; this study). Hence, for Coffea as for all other trees, the interaction between all these factors may result in an absence of clear relationship between each of these traits and the genome size.

Finally, for Coffea species, the global trend is that species with the smaller genomes and smaller and thicker leaves generally grow in dry areas of east Africa or northern Madagascar, while species from humid forests of west and central Africa or eastern and southeastern Madagascar have bigger genomes and larger and thinner leaves.

Geographical distribution of Coffea genome size

Genome size variation was observed at different levels throughout the genus Coffea, i.e., the main geographic regions (Mascarocoffea, west plus west-central Africa, and east Africa) and the series among Mascarocoffea. Although the Coffea phylogeny is not well resolved, three main clades corresponding to the main geographic regions were identified in all studies (Lashermes et al. 1997; Cros et al. 1998; Maurin et al. 2007). All African species and all Mascarocoffea were differentiated by their mean genome size (1.43 and 1.19 pg, respectively). Interestingly, within each landmass (Africa and Madagascar), a genome size gradient was identified: east to west for Africa and north to southeast for Madagascar. Furthermore, despite some exceptions, notably with respect to the two smallest genomes found in humid habitat of Mauritius and Comoros, the genome size distribution according to habitat in Madagascar as well as in Africa was in line with the general trend, i.e., the smaller genomes in dry forests or dry environments and the bigger genomes in humid forests (Figs. 4 and 6). Previous gradients were emphasized in Africa (Cros et al. 1995; Noirot et al. 2003a), the first with rainfall and the second with water deficit. Large-genome species appear to be less frequent in environments characterized by low precipitation and high temperatures (Knight and Ackerly 2002). Similarly, the genome size variation for diploid Lasiocephalus species (Asteraceae), was closely correlated with environmental and life traits (elevation, habitat, and growth form, Dušková et al. 2010).

At the geographical level, the genome size ranges for Mascarocoffea (0.96–1.41 pg) and for east African species (1.05–1.47 pg) were similar and lower than for west plus west-Central Africa (1.33–1.84 pg). However, the differences between minimum and maximum values are similar, indicating dynamic genome size variation in the three regions.

Given that the genome sizes are similar between east Africa and Madagascar (1.18 and 1.20 pg/2C respectively), it could be argued that the Malagasy species likely came from east Africa as previously postulated by Charrier (1978) and Leroy (1980). Under this scenario, it would be more likely that migrants from east Africa have been reached first the western coast or the north coast of Madagascar corresponding to rather dry environments. The founding species may have been adapted to dry environments in a first place, and only recently radiated into more humid environments of Madagascar. This could be congruent with the fact that dry forests in Madagascar would be older than humid ones (Wells 2003). Moreover, climate was rather stable in Madagascar until 2–3 My ago (Yoder and Nowak 2006), contrary to what was observed in tropical Africa during Miocene (Morley 2000). This may partly explain a more pronounced speciation process in west plus west-central Africa and finally the significant differences in genome size between humid forest species found in Africa compared to Madagascar.

Finally, dispersal event(s) could more likely explain the origin of Malagasy Coffea species, while environmental factors and recent dispersal radiation would be responsible for the total endemism of Mascarocoffea. However, given the range of genome size variation, two questions could be addressed: has there been an increase or a decrease in DNA content, or both? Can genome size variation be correlated with genetic relationships among species?

Coffea genome size variation and species relationships

Coffea genome size and species relationships either from the molecular phylogenic tree or from their genetic divergence were not associated. Indeed, plotting genome sizes on Maurin’s tree (the one including the wider Mascarocoffea sampling) did not reveal any clear relationship between genome size and phylogeny neither in Africa nor Madagascar. Indeed, some unexpected relationships appeared. For instance, Coffea stenophylla and Coffea humilis (two west African species) proved to be genetically close despite their very different 2 C DNA contents (1.33 vs. 1.84 pg), likewise for the two west-Central African species Coffea brevipes (1.57 pg) and Coffea heterocalyx (1.81 pg). In the same way, in Madagascar, C. millotii and Coffea heimii appeared to be genetically close despite their distinctive genome sizes (1.40 vs. 1.25 pg), as did Coffea augagneuri (1.18 pg) and Coffea tsirananae (1.06 pg) (red boxes; Fig. 7).

Fig. 7
figure 7

Genome sizes plotted on the Coffea phylogeny tree adapted from Maurin et al. (2007). Only branches for species studied here were kept, while the others are dotted lines. Red boxes underline examples of closely related species differing by their genome size (see text). 2 C DNA values are in pg for lyophilized material

Relationships among species can also be estimated through the cross-hybridization success rate or through the F1 hybrid pollen fertility rate. For instance in Madagascar, two pairs of species, i.e., Coffea resinosa and C. millotii, with a 31.4 % cross-hybridization success rate and Coffea vatovavyensis and Coffea bertrandii with a 73 % F1 hybrid pollen fertility rate (Charrier 1978), should be considered as relatively close while they differed significantly by their 2 C DNA content (1.27 vs. 1.40 pg for the former and 1.03 vs. 1.22 pg for the latter). Regarding African species, C. humilis × Coffea liberica var liberica inter-specific F1 hybrids have good fertility (54–67 % pollen viability, Louarn 1993), while the two species differ significantly by their genome size (1.84 and 1.44 pg, respectively) and belong to separate clades. Altogether, these observations lead to the same conclusion that there is no relationship between genome size and genetic divergence or between genome size and phylogeny. This conclusion supports previous reports on Hordeum (Jakob et al. 2004), Aegilops (Eilam et al. 2007), and Lasiocephalus (Dušková et al. 2010).

Finally, like the cotton tribe (Wendel and Cronn 2002) or the genus Agropyron (Caetano-Anollès 2005), our results suggest that during Coffea genome evolution, both DNA content increase and/or decrease occurred independently in Africa and in the Indian Ocean Islands.