Why the olive?

Olive (Olea europaea L.) is a typical and widespread tree of the Mediterranean region, where its cultivation started in the third millennium B.C. (Loukas and Krimbas 1983). Olive is the second most important oil fruit crop cultivated worldwide after oil palm. Its cultivation covers over eight million hectares of land, predominantly concentrated in the Mediterranean basin, where 70% of the olive oil produced is consumed (Baldoni and Belaj 2009). The olive tree is a glycophytic species that shows a high tolerance to drought and salt stresses, if compared with other fruit trees that are generally salt sensitive (Gucci and Tattini 1997).

Several studies have highlighted the beneficial effects of olive oil on human health (Keys 1995; Pérez-Jiménez et al. 2007). As consequence of the increased interest for this crop, the consumption of olive oil has expanded also in non-traditional producer countries such as the United States, Australia and Japan (Pinelli et al. 2003).

Some minor constituents, that differentiate olive oil from all the other vegetable oils used in the human diet, seem to be responsible for the major effects on health. These components, named secoiridoids, represent the most important class of the olive phenolics and they are exclusively present in the Oleaceae family that includes Olea europaea L. The protective activity of olive oil against chronic degenerative diseases and tumours is credited to this group of secondary metabolites, which are also responsible for the agreeable sensory properties of virgin olive oil (Servili et al. 2004).

The olive tree belongs to the Oleaceae family that comprises 30 genera and 600 species (Cronquist 1981). Within the genus Olea, which includes 30 species and has spread to Europe, Asia, Oceania and Africa, only Olea europaea is cultivated. The wild olive or oleaster (Olea europaea subsp. europaea var. sylvestris) and the cultivated olive (Olea europaea subsp. europaea var. europaea) are two co-existing forms of the subspecies europaea (Green 2002). Other five subspecies constitute the Olea europaea complex: (a) laperrinei, present in Saharan massifs; (b) cuspidata, present from South Africa to southern Egypt and from Arabia to northern India and south-west China; (c) guanchica present in the Canary Islands; (d) maroccana present in south-western Morocco; (e) cerasiformis present in Madeira (Green 2002).

The cultivated olive is an evergreen, out-crossing, vegetatively propagated tree with a very wide genetic patrimony that is the result of both plant longevity and the scarcity of genotype turnover through centuries of cultivation. The large number of cultivars, added to the many cases of synonymous and homonymous name, makes particularly difficult the description and classification of olive varieties (Fabbri et al. 2009). As a result the size of this germplasm is controversial: about 1,250 varieties, cultivated in 54 countries and conserved in over 100 collections, were included in the FAO olive germplasm database (Bartolini 2008), also if this number is certainly higher because the lack of information on many local cultivars and ecotypes (Cantini et al. 1999). The most part of these cultivars comes from southern European countries such as Italy (538 varieties), Spain (183), France (88) and Greece (52) (Baldoni and Belaj 2009). Due to this richness of the germplasm, olive is an unusual case among horticultural crops and its biodiversity can represent a rich source of variability for the genetic improvement of this plant (Baldoni and Belaj 2009).

Cytogenetic studies

Olive is a diploid species having 46 chromosomes (2n = 2x = 46) (Breviglieri and Battaglia 1954). However, karyological studies in this plant are very complex due to the small size, similar morphology and large number of chromosomes, which make difficult a satisfactory characterisation of the chromosome complement. In order to solve this problem differential staining of the chromatin and in situ hybridization of highly repeated DNA sequences and ribosomal cistrons were used in combination and were able to distinguish the most part of the olive chromosomes pairs (Minelli et al. 2000). The determination of nuclear DNA content was employed for characterising the olive genetic resources. Nuclear DNA content was determined by cytometric methods in several Italian cultivars. Rugini et al. (1996) found 2.26 and 2.20 pg of DNA per haploid nucleus, respectively, in Frantoio and Leccino, while Bitonti et al. (1999) found 3.90 pg/2C in Dolce Agogia and 4.66 pg/2C in Pendolino. The genome size of six Portuguese cultivars and a wild olive was estimated by Loureiro et al. (2007) using flow cytometry methods. The obtained data were slightly different from those of the Italian cultivars, with a nuclear DNA content ranging between 2.90 ± 0.020 and 3.07 ± 0.018 pg/2C for the Portuguese cultivars and 3.19 ± 0.047 pg/2C DNA for the wild olive. These studies proved an intraspecific variation of genome size in Olea europaea and confirmed the nuclear DNA content analysis as an useful tool for characterising genotypes within species.

The analysis of ploidy level was also made in order to study the Olea europaea complex. A different level of polyploidy was highlighted by Besnard et al. (2008) in the six subspecies, using flow cytometry and six highly variable nuclear microsatellites. Four subspecies appeared to be diploids, while subspp. cerasiformis was tetraploid and maroccana hexaploid. Concerning the subspp. europaea and cuspidata, Rallo et al. (2003) assessed a partial polyploidy by microsatellites analysis, although this hypothesis was not confirmed by Besnard et al. (2008). With regard to the olive tree relatives, recently Besnard and Baali-Cherif (2009) reported the first evidence of the coexistence of two ploidy types (diploid and triploid genotypes) in a relict laperrinei’s olive population from Algeria.

DNA-based molecular markers in olive studies

Molecular markers revealing polymorphisms at the DNA level are very useful tools in genetics studies and in the improvement of crop plants. Indeed they can be applied to a variety of purposes including DNA fingerprinting, genetic screening and chromosome mapping.

In olive DNA based markers were widely used both in theoretical and applied research fields, such as the revision of species Olea europaea, the characterisation of the huge olive germplasm, the breeding programs and the cultivars traceability of olive oil. A description of the main genetic markers used in olive studies and their use in this crop is presented below and summarised in Table 1.

Table 1 Applications of DNA-based molecular markers in Olea europaea studies

RAPDs (Random Amplified Polymorphic DNA)

This procedure, developed in 1990 by Williams et al., detects nucleotide sequence polymorphisms by using primers (usually 8–10 bp long) of arbitrary sequence. A single species of primer anneals to the genomic DNA at two different sites on complementary strands of DNA template and, if these priming sites are within an amplifiable range of each other, a discrete DNA product is formed through PCR (Polymerase Chain Reaction) amplification.

Due to their simplicity of application and low cost, these markers were the first used to evaluate the genetic variability in olive (Bogani et al. 1994) Subsequently RAPDs has been widely used for characterisation of varieties from the collections of several countries such as Italy (Fabbri et al. 1995; Cresti et al. 1997), Spain (Belaj et al. 2001; Sanz-Cortés et al. 2001; Belaj et al. 2004), Israel (Wiesman et al. 1998) and Australia (Mekuria et al. 1999; Guerin et al. 2002). This technique was also very useful in the identification of plant material from the nursery (Rubio and Arus 1997; Belaj et al. 1999). The high discriminating power of these markers allowed their use in the detection of intra-cultivar variability (Belaj et al. 2004; Gemas et al. 2000). Applications of these markers to phylogenetic studies in Olea europaea species was also reported (Hess et al. 2000). Moreover, their applicability to the traceability of cultivars in the olive oil was evaluated by Muzzalupo and Perri (2002). Finally, this kind of markers was used, together with other DNA-based markers, in the construction of the first linkage maps (De la Rosa et al. 2004; Wu et al. 2004).

AFLPs (Amplified Fragment Length Polymorphism)

This technique developed by Vos et al. (1995) is based on the detection of the variation among genomic restriction fragments by PCR amplification. The procedure consists of a double digestion of genomic DNA with two restriction enzymes. The fragments generated are ligated with adaptors (short double stranded oligonucleotide with a known sequence) and then reduced in number by a selective amplification with arbitrary primers containing a core sequence that is a part of the adapter. This technique, which does not require any prior knowledge of the sequence, is very useful in the detection of polymorphisms between closely related genotypes (Belaj et al. 2004). In olive, AFLPs have been widely used for DNA fingerprinting of cultivars (Angiolillo et al. 1999; Owen et al. 2005), to analyse the relationships between wild and cultivated olive (Baldoni et al. 2006), for the construction of linkage maps (de la Rosa et al. 2003) and for cultivar traceability of olive oil (Busconi et al. 2003; Pafundo et al. 2005).

SCARs (Sequence Characterised Amplified Region)

This technique, that was introduced by Paran and Michelmore (1993), involves the conversion of single RAPD or AFLP products in sequence-characterised amplified regions by the development of specific primers, drawn on the nucleotide sequence of the RAPD or AFLP fragment. SCARs are PCR-based markers representing genetically defined loci that have been widely and successfully used in crop plants for marker-assisted selection (MAS) (Zhang and Stommel 2001).

In olive, SCARs have been used for cultivar identification (Busconi et al. 2006) and in olive oil traceability (De la Torre et al. 2004; Pafundo et al. 2007). Putative associations of several SCAR markers with fruit characteristics (Mekuria et al. 2002) and resistance to pathogenic fungi (Hernández et al. 2001) were found, suggesting the applicability of this kind of marker for marker-assisted breeding programs.

SSRs (Simple Sequence Repeats)

Microsatellites or SSR markers are regions of DNA, consisting of tandemly repeating mono-, di-, tri-, tetra- or penta-nucleotide units, which are arranged throughout the genomes of most eukaryotic species (Powell et al. 1996). The number of repetitions of these nucleotide units generates a polymorphism among genotypes. As for SCARs, the development of these molecular markers requires prior knowledge of the DNA sequences of the SSRs flanking regions. Nowadays, microsatellites are likely the markers of choice for genetic studies in olive because of their high polymorphism and reproducibility. Many authors have reported on SSR development in olive and several of them are currently available for DNA analysis (Sefc et al. 2000; Cipriani et al. 2002; De la Rosa et al. 2002; Sabino Gil et al. 2006).

In Olea europaea, these markers have been used for different applications such as cultivar discrimination (Sarri et al. 2006; Fendri et al. 2010), study of relationships between wild and cultivated olive tree (Belaj et al. 2007), construction of association maps (De la Rosa et al. 2003), paternity analysis (Mookerjee et al. 2005) and identification of olive oil varietal composition (Alba et al. 2009; Ayed et al. 2009). Recently, some attempts to improve the application of SSRs have been made in order to compare results among different laboratories. A list of recommended SSR markers and protocols for olive genotyping has been provided with the aim to develop a robust method to track the origin of olive cultivars (Doveri et al. 2008; Baldoni et al. 2009).

ISSRs (Inter Simple Sequence Repeats)

ISSRs are DNA fragments of about 100–3,000 bp located between adjacent, oppositely oriented microsatellite regions. This technique, reported by Zietkiewicz et al. (1994), based on the amplification of inter-SSR DNA sequences by using microsatellite core sequences as primers for PCR reaction. About 10–60 fragments from multiple loci are generated simultaneously, separated by gel electrophoresis and scored as the presence or absence of fragments of particular size.

ISSRs were applied both in phylogenetic analysis within the Olea europaea species and in olive cultivar identification. In a study about the structure of Olea europaea complex in the oceanic islands of Macaronesia, molecular evidence, provided by ISSRs and RAPDs markers, clearly indicates that Madeiran and Canarian populations of ssp. Cerasiformis do not form a monophyletic group, supporting for the hypothesis of two independent dispersal events of Olea in Madeira and Canary Islands (Hess et al. 2000). In an other study, Vargas and Kadereit (2001) confirmed the wild status of some olive trees populations in the Iberian Peninsula by means ISSRs analysis. These markers were also used with success to distinguish 10 Italian varieties, by analysing genomic DNA extracted from the olive fruit (Pasqualone et al. 2001), and for the study of intra-cultivar variability of 201 accessions belonging to 11 Portuguese cultivars (Gemas et al. 2004).

SNPs (Single Nucleotide Polymorphism)

A Single Nucleotide Polymorphism is a small genetic variation that can occur within a DNA sequence. SNPs are the most abundant and ubiquitous type of polymorphisms in living organisms, since they occur in virtually unlimited numbers as differences in single nucleotides between individuals (Ganal et al. 2009). Due to their abundance along the genome, coupled with the development of next-generation high-throughput genomic sequencing technologies, they could be the marker system of choice in the future. The development of this kind of marker requires a high level of genome sequence information: it is therefore not surprising if only a few SNPs have been reported in olive, where only a small amount of sequence data was available before the year 2009.

To overcome this lack in sequence knowledge, in 2006 Reale et al. (2006) used both a sequence-based and an arbitrary approach to identify eight SNPs in olive. In the second approach, products from a generic fingerprinting technique were sequenced, amplified by specific primers in several olive cultivars and then compared to found polymorphisms. Muleo et al. (2009) found several SNPs in the Phytochrome A gene by means of high-resolution melting (HRM) analysis of DNA. Using this technique, the authors were able to easily detect the presence of mutations for substitution, either homozygous or heterozygous status of the gene. All the SNPs were confirmed by subsequent analysis. Finally, they concluded that HRM analysis has a very high reproducibility and sensitivity for detecting SNPs, allowing olive cultivar genotyping and resulting in an informative, easy, and low-cost method able to greatly reduce the operating time. Five SNPs were also identified by Santos Macedo et al. (2009) in the partial sequence of the gene for alternative oxidase OeAOX2. Recently, Hakim et al. (2010) discovered nine new SNPs by direct sequencing of the lupeol synthase (OEW) and cycloartenol synthase (OEX) genes in 16 Tunisian olive cultivars.

Sequence variation of ribosomal and cytoplasmic DNA

The polymorphism of ribosomal and cytoplasmic non-coding DNA, such as internal transcribed spacer (ITS) and intergenic spacer (IGS), is widely used for phylogenetic studies. The lack of strict mechanisms of conservation in function for these sequences followed indeed in a high nucleotide variability, promoting the usefulness of these markers for evolutionary purposes. Variations in these regions can be detected in several ways: by direct sequencing and by digestion of amplified sequences with restriction enzymes (RFLP—Restriction Fragment Length Polymorphism—Botstein et al. 1980; Neale and Williams 1991).

Ribosomal DNA polymorphism

The sequence variation in the internal transcribed spacers (ITS1 and ITS2) of the nuclear ribosomal genes 18S, 5.8S and 26S has been analysed for the description of Olea europaea complex. Hess et al. (2000) used different nuclear markers, including sequence variations inside the ITS1 region, to reconstruct the colonisation history of Olea europea L. in the Macaronesian islands. Besnard et al. (2001) studied the genetic differentiation of cultivated olive from its wild relatives recovered from different geographic areas using RAPD and RFLP of rRNA genes. Additionally, the structure of invasive populations of olive tree (belonging to subspp. europaea and cuspidata) from Australia and Hawaii was studied by Besnard et al. (2007a), using different markers, including ribosomal DNA polymorphism. The authors determined that East Australian and Hawaiian populations (subsp. cuspidata) have originated from southern Africa while South Australian populations (subsp. europaea) have mostly derived from western or central Mediterranean cultivars. Besnard et al. (2007b) used ribosomal DNA sequences from genes and pseudogenes (18S and 5.8S) and ITS1 in order to evaluate the relationships between taxa and populations inside the genus Olea europaea and to allow the reconstruction of evolutionary patterns involved in the differentiation of the olive complex. More recently, Besnard et al. (2009) revised the relationships within the Oleaceae family by using both ribosomal and cytoplasmic sequences.

Cytoplasmic DNA polymorphism

Polymorphisms in chloroplast (cpDNA) and mitochondrial DNA (mtDNA) have been used in olive for different purposes.

In 2000 Besnard et al. associated the male sterility displayed several olive cultivars with particular chloroplast and mitochondrial RFLP polymorphisms. Besnard et al. (2002a) analysed the genetic structure of Mediterranean olive trees by comparing the chlorotypes of oleasters and cultivated forms. A chlorotype-specific marker from the Eastern basin was found in several cultivated forms throughout the Mediterranean basin, suggesting a strong presence of human influence on the phylogeography of olive trees. García-Díaz et al. (2003) tested the effect of prolonged vegetative multiplication in the maintenance of mitochondrial homoplasmy and the generation of heteroplasmy. By using an intergenic spacer of the mitochondrial genome, the authors found that several sequence changes were detected in 88.5% of the investigated genomes after several rounds of vegetative reproduction. Analysis of the same sequence in clones from olive trees obtained by sexual reproduction showed only a few changes, confirming the role of sexual reproduction in the maintenance of mitochondrial homoplasmy. Finally, Intrieri et al. (2007) used chloroplastic markers for cultivar identification and oil traceability. The authors analysed 13 cultivars and 1 feral accession of Olea europaea for polymorphisms in the intergenic spacer of chloroplast DNA. Four out of the 13 cultivars analysed were discriminated, and an identification protocol for these cultivars, based on the amplification and subsequent sequencing of the chloroplast trnT-trnD intergenic spacer, was suggested.

Applications of DNA-based molecular markers

In this section, we reported the main results achieved in olive studies by using DNA molecular markers. An overview on the most relevant fields of application is given, with the aim to highlight the relevant contribute that these molecular techniques gave to improve the knowledge about some theoretical and practical aspects of olive tree.

Studies on genetic variability of Olea europaea complex

Genetic variability study is a key step in acquiring knowledge on the resources available for the genetic improvement of a crop. This is particularly important for olive where a high number of different genotypes are currently cultivated.

In order to supplement and refine the traditional morphological description of cultivars, which show some limitations because of environmental influences, molecular markers have been applied for characterisation of the olive germplasm (Dorado et al. 2005; Trujillo et al. 2005; Ganino et al. 2006). Information on the classification of Olea europaea L. has resulted in very interesting advances in our understanding of the share of Olea taxa intercompatibility with the Mediterranean olive tree. The phylogenetic reconstruction by AFLPs of this complex confirmed O. europaea L. as a monophyletic group having six subspecies recognised in Eurasia and Africa (Rubio de Casas et al. 2006).

RFLP analysis of ribosomal sequences (IGS-RFLP) confirmed that the Mediterranean basin is the area of olive domestication and that the taxa more related to the olive tree are the subspecies laperrinei, maroccana and cerasiformis (Besnard et al. 2001). Further analysis of the subspecies of Olea europaea L. by RFLP analysis of chloroplast sequences highlighted a strong differentiation between chlorotypes from the Eastern and Western parts of the Mediterranean area and confirmed the close relatedness of olive to the subspecies laperrinei (Besnard et al. 2002b).

Several analyses regarding the patrimony of Mediterranean wild olive have been performed with the aim to characterise this as a source of genetic traits potentially useful for olive improvement programs (Hannachi et al. 2009). Mitochondrial RFLP analysis confirmed a clear genetic distinction between wild olives from the Eastern and Western parts of the Mediterranean area (Bronzini de Caraffa et al. 2002; Besnard and Bervillé 2002), while the pattern of genetic variability and the genetic relationships among different populations of wild olive from the north-western Mediterranean basin area were clarified by SSR markers (Belaj et al. 2007). With regard to the relationships between wild and cultivated olive trees, RAPD (Bronzini de Caraffa et al. 2002), AFLP (Angiolillo et al. 1999; Baldoni et al. 2006) and SSR analysis (Erre et al. 2010) showed that cultivars had originated only in few cases by selection of local wild olive and that they have been prevalently introduced in their area of cultivation from the outside.

In olive the richness of the germplasm, coupled with the absence of references and mistakes made on cultivar denominations, remarkably complicates the classification of varieties. With the aim of overcoming these problems, numerous studies on cultivar identification have been made in the last 15 years by means of different molecular markers. RAPDs were the first molecular technique used with this purpose (Fabbri et al. 1995; Belaj et al. 2001). This approach was also used to evaluate intra- and inter-cultivar variability (Gemas et al. 2000). RAPD studies revealed the wide genetic variability existing at the regional (Sanz-Cortés et al. 2001), national (Belaj et al. 2003) and Mediterranean levels (Belaj et al. 2001). AFLP analyses have been used to study relationships among cultivars both from wide (Owen et al. 2005) and restricted areas of cultivation (Angiolillo et al. 2006). Among the available molecular markers, SSRs are becoming the preferred choice in olive cultivar identification because of their high discriminatory power and usually straightforward interpretation. They have been used for genotyping cultivars from different areas of the Mediterranean basin (Sarri et al. 2006), but also to characterise the local germplasm from small areas of cultivation (Poljuha et al. 2008; Bracci et al. 2009).

Olive oil traceability

Food traceability is important in order to prevent deliberate or accidental mislabelling during food production processes (Marmiroli et al. 2009). As for other products, the introduction of certifications of origin and quality for virgin olive oil as PDO (protected designation of origin) makes necessary the implementation of traceability procedures. At the moment, DNA analysis seems to be a promising approach to this problem, since it is less influenced by environmental and processing conditions in respect to other methods (i.e., metabolites). DNA recovery methods from olive oil have been developed by several authors (Busconi et al. 2003; Doveri et al. 2006; Pasqualone et al. 2007; Consolandi et al. 2008). In addition several commercial kits, providing adapted protocols, were used in different works (Martins-Lopes et al. 2008; Spaniolas et al. 2008; Ayed et al. 2009; Pafundo et al. 2010). All of these studies confirmed that the DNA of the cultivars is recoverable from extra virgin olive oils, but it is low in quantity and quality. The first works, carried out using genomic DNA extracted from drupes, showed the possibility to make amplification using RAPDs markers (Cresti et al. 1997). By means of SCAR and AFLP markers, Busconi et al. (2003) were able to show that DNA recovered from olive oil had both organellar and nuclear origin. Pafundo et al. (2005) traced the cultivar composition of monovarietal olive oils by AFLPs, suggesting that DNA extraction is the most critical step affecting the procedure. Pafundo et al. (2007), starting from AFLPs amplified in olive oil, developed some SCARs that amplified successfully on DNA extracted from olive oil. Using SSR analysis, Pasqualone et al. (2007) demonstrated that microsatellites are useful in checking the presence of a specific cultivar in a PDO oil, thus verifying the identity of the product. However, they obtained only the marker profile of the main cultivar in the oil: no signal was detected for the secondary varieties. Montemurro et al. (2007) analysed by AFLP markers 10 virgin monovarietal olive oils prepared in the laboratory. They were able to distinguish all the olive oils examined, even if only a partial correspondence with the AFLP profile obtained from the leaves was obtained. Martins-Lopes et al. (2008) evaluated the efficiency of RAPD, ISSR and SSR molecular markers for olive oil varietal identification and their possible use in certification purposes.

Consolandi et al. (2007, 2008) reported the development of a semi-automated SNP genotyping assay to verify the origin and the authenticity of extra-virgin olive oils. The authors developed a Ligation Detection Reaction (LDR)/Universal Array (UA) platform by using several olive SNPs. They found that 13 accurately chosen SNPs were sufficient to unequivocally discriminate a panel of 49 different cultivars.

Finally, in a recent study, Pafundo et al. (2010) investigated the effect of the storage time on the degradation of the DNA purified from the oil, a negative correlation between storage time and quality–quantity of recovered DNA has been observed. The authors showed that 1 month after the production of the oil the degradation increases making harder traceability efforts.

Considering all these investigations, it is possible to note that DNA-based olive oil traceability is a topic of great importance, but in addition to studies reporting good results using DNA analysis, Doveri et al. (2006) published a cautionary note on the use of DNA markers for provenance testing. Their observations were based on non-concordance between the genetic profiles of olive oil and fruit. The authors suggested that this could be due to the contribution of pollen donors in DNA extracted from the paste obtained by crushing whole fruits. They concluded that care needs to be taken in the interpretation of DNA profiles obtained from DNA extracted from oil for resolving provenance and authenticity issues.

Paternity analysis and molecular linkage maps

Similar to other woody species, olive is characterised by a long juvenile phase that ranges between 10 and 15 years. This represents a great obstacle to breeding programs and makes the genetic improvement of olive very difficult and expensive. Although seedling-forcing growth protocols have been developed to reduce the length of this phase, the evaluation of the agronomic performance of mature olive plants still requires at least 5 years of experimentation (Santos-Antunes et al. 2005). For this reason, the application of molecular markers both to confirm the parental origins of the progeny and to select early agronomical characteristic-associated markers (Martín et al. 2005) can be very useful to reduce the time and cost of the development of new genotypes.

With regard the paternity analysis, SSRs are the most suitable to trace the genetic contribution of alleles from the parents to the offspring, being codominant and highly polymorphic markers (Mookerjee et al. 2005). The effectiveness of SSRs in the identification of paternity contribution to progeny obtained from olive breeding programs has been demonstrated by several authors (de la Rosa et al. 2004; Diaz et al. 2007a, b). The results demonstrated that SSR analysis is a convenient technique to routinely assess the crosses made in breeding programs and to for check self-incompatibility in olive cultivars (Diaz et al. 2006). These studies have highlighted that no contamination by self-pollen was found, indicating that placing pollination bags well before anthesis is important and that emasculation to avoid selfing is unnecessary (de la Rosa et al. 2004). The analysis also revealed that the main factor affecting the success of crosses seems to be the inter-compatibility among the parental cultivars, since this significantly influences the rate of contamination from external pollen donors. These results indicate that knowledge of cross-compatibility among cultivars is necessary to plan efficient olive breeding crosses (Diaz et al. 2007a, b).

The possibility of associating genetic characteristics and DNA-based molecular markers is very important to select the progeny showing interesting agronomical traits at the first stages of development. However, this technique, called marker-assisted selection (MAS), requires some knowledge on the co-segregation of molecular markers and genetic characteristics in the progeny.

Several efforts to build an association map in olive have been ongoing in the last several years. The first attempt to construct a linkage olive map by means of RAPDs, AFLPs, RFLPs and SSRs was conducted using 95 seedlings from the cross Leccino × Dolce Agogia (de la Rosa et al. 2004). Two maps, one for each parent, were drawn and a partial coverage of the olive genome was obtained with the molecular markers used. The stearoyl-ACP desaturase gene, an important enzyme in the production of oleic acid from stearic acid (Baldoni et al. 1996), was linked to Leccino group four. Wu et al. (2004) constructed an integrated map, using linkage data from the two parents of the cross (Frantoio × Kalamata), based on RAPDs, SCARs and SSRs sequences in the progeny of 104 individuals. In this map, the gene for resistance to peacock disease, discovered by Mekuria et al. (2002), linked with the integrated linkage group 2 and with the linkage group 1 of the Frantoio cultivar.

Genomics studies

In olive, the knowledge of genome is back if compared with those of other crops (Table 2). As for other plant species, studies about olive nucleotide sequences identification started in nineties. The first DNA sequence of Olea europaea L. was released in NCBI database in 1994 but, in the following years, the olive genomic research has been slower than in other plants. Probably due to the renewed interest of the market for this crop and its products, recently many efforts have been made to fill this gap and from 2009 many several thousand of EST (expressed sequence tag) have been identified (Table 3). In this section, we described the most relevant results obtained in the last years in olive genome analysis.

Table 2 Comparison of development of genomic studies between Olea europaea and several major crop plants
Table 3 Olive genomics information present on NCBI database (http://www.ncbi.nlm.nih.gov) from 1994 (years of the first released sequence on database of Olea europaea) to October 2010

ESTs identification

Understanding the function of genes and other parts of the genome is known as functional genomics. In olive, efforts to improve the identification and annotation of genes are prevalently based on EST identification (Table 3), which are predominantly related to pollen allergens and characteristics of olive fruit.

Respiratory allergy caused by olive pollen is an important health problem in several geographic areas worldwide (the Mediterranean basin, North America, South America and Australia) affecting more than 30% of the Mediterranean population (Bousquet et al. 1984). The relevance of this problem on human health induced to identify olive pollen allergens: it is not surprising that the first nucleotide sequences isolated in 1994 in olive (Table 3) coded for allergenic proteins (Villalba et al. 1994). To date, 10 different allergens (named Ole e 1 to Ole e 10) have been found in olive pollen (Rodríguez et al. 2002; Hamman-Khalifa et al. 2008), and for almost all these genes, with the exception of Ole e 7 and Ole e 8, the nucleotide sequences are also available. Recently, Hamman-Khalifa et al. (2008) showed the relationship between cultivar origin and high heterogeneities present in the nucleotide sequence of Ole e 1 gene among different genotypes. The authors found that the origin of an olive cultivar is a major factor determining the diversity of Ole e 1 variants among different olive pollens. Sequence polymorphisms can influence the folding of the corresponding protein which leads to variability in the allergenicity of Ole e 1. This agrees with previous in vivo and in vitro observations that different olive cultivars differ in their capacity to bind IgE antibodies. Notably, sequence polymorphisms within the Ole e 1 gene are so high that closely related cultivars can be recognised as different, as has been reported in the case of the cultivars Picholine marocaine and Menara (considered a clonal selection of the former).

With regard the characteristics of fruits and olive oil, particular attention has been focussed on genes involved in fatty acid biosynthesis, including enoyl-ACP reductase, stearoyl-ACP desaturase, omega 6 plastidial desaturase, omega 3 plastidial desaturase, cytochrome b5, omega 6 cytoplasmatic desaturase, omega 3 cytoplasmic desaturase, acyl-CoA diacylglycerol acyltransferase and oleosin enzymes (Hatzopoulos et al. 2002). Several studies have dealt with the cloning, characterisation and spatial/temporal activation of genes involved in these pathways (Banilas et al. 2005; Poghosyan et al. 2005; Hernández et al. 2005; Giannoulia et al. 2007). In particular, very recently Banilas et al. (2010) deeply investigated the triacylglycerols (TAGs) biosynthesis. They showed that DGAT1 and DGAT2 (two families of diacylglycerol acyltransferase, the last and key enzyme of the triacylglycerols pathway) contribute differentially to the TAGs storage in the olive tissues, highlighting DGAT2 to be more involved than DGAT1 in oil accumulation in the mesocarp.

Other than fatty acid composition, the presence of minor components with antioxidant activity also has great value for human health, in terms of protecting DNA, proteins and lipids from oxidative damage. Among the minor components olive, phenolic compounds have been the most studied. As reviewed by Hatzopoulos et al. (2002), to achieve high quality olive oils with resistance to oxidation, it is crucial to enhance the quantity and efficiency of antioxidants in olive. Some studies concerning the clarification of the biosynthetic pathways for antioxidant biosynthesis have been carried out (Shibuya et al. 1999; Hatzopoulos et al. 2002; Saimaru et al. 2007).

A monosaccharide transporter (OeMST2), whose expression increases during fruit maturation has been also cloned (Conde et al. 2007). In 2009, significant progress in our understanding of the olive transcriptome were achieved by the identification of genes differentially expressed during fruit development, with particular attention to those involved in lipid and phenolic metabolism. By using the 454 sequencing platform, Alagna et al. (2009) sequenced four different cDNA libraries obtained at the beginning and at the end of fruit development from two cultivars, Coratina and Tendellone, characterised, respectively, by high and low phenolic content. A total of 261,485 reads were obtained, for an output of about 58 Mb. The EST sequences generated from this study are available at the Olea EST database web site (http://140.164.45.140/oleaestdb/).

Galla et al. (2009) also identified large sets of differentially expressed genes at three different stages (i.e., initial fruit set, completed pit hardening and veraison) of fruit development in the Leccino cultivar. Four subtractive hybridisation libraries were constructed and all sequenced clones (1,132 in total) were analysed by bioinformatic tools; 60% of these showed similarities to known proteins.

Other investigations concerning the fitness of olive in responding to different environmental conditions have been carried out. Secchi et al. (2007) investigated the effect of water stress on olive by analysing the change in the expression level of genes related to the aquaporin family in plants subjected to drought treatment. The authors found a strong downregulation in these genes following drought stress, probably resulting in reduced membrane water permeability and preventing the loss of water in periods of water stress. Additionally, Bruno et al. (2009) isolated the gene encoding a geranylgeranyl reductase (OeCHLP) and hypothesised its role in organ development and responses to abiotic and biotic stresses in relation to tocopherol action.

Chloroplast genome sequencing

A very important results, recently published, in Olea europaea L. genomic studies have been the DNA sequencing of the entire plastome of the Italian cultivar ‘Frantoio’ (Mariotti et al. 2010). This sequence has a length of 155,889 bp and showed an organisation and gene order that is conserved among numerous Angiosperms. The olive chloroplast contains 130 genes and 644 repetitive sequences (among which 633 mono-nucleotide SSRs, 6 di-, 3 tetra- 2 penta-nucleotide SSRs were identified).

Forty polymorphic plastid markers were identified by using eight cultivars for a comparative study. Among these, 10 markers were previously reported while 30 new cpDNA markers were identified. All these information about the chloroplast sequence will be used to better understand the evolutionary and ecological processes involved in olive domestication, the function of plastid genes on plant metabolism and they will be applied in olive cultivar identification with particular relevance to the application of DNA-based tracking of olive oil, in which one of the problems is the possibility to amplify in DNA extracted from oil and also the genomic DNA of pollinators’ varieties.

Conclusion

Although many efforts have been made in the last years, genome studies in Olea europaea L. are currently behind those of other crops. Several groups have started to work on the olive genome sequencing (i.e., OLEAGEN genomics project, Fundacion Genoma, Spain, www.chirimoyo.ac.uma.es/oleagen/) and, thanks to the rapid development of the new sequencing technologies, soon the complete sequence of olive genome will be available.

The new informations on genome sequence will be very useful to identify genes involved in agronomical traits that could be used to improve the productivity and the nutritional characteristics of this crop. A possible application could be, for example, the studies of molecular mechanisms of drought and salinity tolerance of olive, in order to improve the cultivation of this important fruit crop also in the most arid and semiarid areas of the world.

The knowledge of genome nucleotide sequences also could be useful to identify new sequence polymorphisms, which will be very useful in the development of many new cultivar-specific molecular markers (e.g., SNPs) and in the implementation of more efficient protocols for tracking and protect olive oil origin.