Introduction

Banana (Musa acuminata) represents a staple food for millions of people living in developing countries and an important export commodity for producing countries (Jain 2004). Banana fruit production is currently threatened by a wide spectrum of pathogens including viruses, bacteria, fungi and nematodes. They reduce yield, affect the appearance, shelf life, and marketability of harvested fruit; debilitate or kill the host plant (Ploetz 2005). Currently, the fungi Mycosphaerella fijiensis and Fusarium oxysporum f. sp. cubense (FOC), which attack the leaves and roots of susceptible bananas, respectively, are the most devastating pathogens of this crop. M. fijiensis is controlled by an intensive regime of fungicide applications which is costly to the growers and detrimental to the environment. The case of FOC is even worse since an effective chemical control does not exist (Ploetz 2005). Genetic resistance is the most suitable strategy to control these pathogens in the field and there are sources of resistance to both of them in Musa germplasm (Jones 2000). However, breeding for pathogen resistance in banana is limited by the long life cycle, triploidy and sterility in most commercial cultivars. Molecular biotechnology has the potential to overcome these constraints by transferring single or even multiple disease resistance (R) genes into the genome of susceptible banana cultivars using either biolistics (Sagi et al. 1995; Becker et al. 2000) or Agrobacterium-mediated transformation (May et al. 1995; Khanna et al. 2004). Although transformation technologies have been developed for banana in various laboratories around the world, no banana R gene has been isolated to date.

Disease resistance in plants occurs in a gene-for-gene manner by a direct or indirect interaction between the plant R gene encoded protein and the pathogen avirulence (Avr) gene encoded protein. Such interactions trigger a cascade of defense responses that halt pathogen spread (Chisholm et al. 2006). These defense responses can also be associated with the development of a localized or, in some cases systemic, programmed cell death known as hypersensitive response (HR) (Gilchrist 1998). R gene-mediated resistance has several attractive features to protect crop plants against pathogen attack. When induced in a timely manner, the concerted responses can efficiently halt pathogen growth with minimal collateral damage to the plant. No input is required from the farmer and there are no adverse environmental effects (McDowell and Woffenden 2003). Consequently, R gene-mediated resistance is one of the top priorities in plant breeding and genetic improvement programs. For effective utilization of these natural R genes and for engineering novel resistance, it is important to clone and characterize them. Over 40 plant R genes that confer resistance to various pathogens including viruses, bacteria, fungi or nematodes have been isolated from several plant species (Martin et al. 2003), using either map-based positional cloning or transposon-tagging. Their gene products have domains consistent with roles in both pathogen recognition and subsequent signal transduction (Dangl and Jones 2001; Hammond-Kosack and Parker 2003). The presence of conserved domains in the protein sequences of these R genes allows their classification into eight classes (Hammond-Kosack and Parker 2003). The tomato Pto gene encodes a cytoplasmic serine/threonine protein kinase (STK) of 321 amino acids and represents one of these classes of R genes. Pto was the first plant R gene cloned that participates in a gene-for-gene interaction with a pathogen (Martin et al. 1993), and it is one of the best-characterized and most intensively studied R genes (Pedley and Martin 2003). Pto confers hypersensitive response-mediated resistance against strains of Pseudomonas syringae pv. tomato that express the avirulence proteins AvrPto or AvrPtoB (Martin et al. 1993; Kim et al. 2002). Overexpression of Pto in tomato under the control of the strong cauliflower mosaic virus (CaMV) 35S promoter has been shown to activate defense responses in the absence of pathogen inoculation. Pto-overexpressing plants show resistance not only to P. syringae pv tomato but also to Xanthomonas campestris pv vesicatoria and to the fungal pathogen Cladosporium fulvum (Tang et al. 1999). These findings make Pto an interesting candidate for engineering broad-spectrum pathogen resistance in agriculture and encourage the search for functional Pto-like genes in other plant species.

Degenerate primers designed from conserved motifs of the nucleotide binding site-leucine rich repeat (NBS-LRR) class of R genes have been used to amplify R-like genes or resistance gene candidates (RGCs) from genomic DNA of numerous plants (Collins et al. 1998; Noir et al. 2001; López et al. 2003; Martínez-Zamora et al. 2004). Many of these RGCs are phylogenetically related with known R genes (Meyers et al. 1999) and some of them have facilitated the cloning of full-length functional R genes from different plant species (McDowell et al. 1998; Zhao et al. 2005). In addition to their potential use for genetic improvement, RGCs also provide opportunities and tools to answer some fundamental questions about disease resistance genes, such as structure, R gene organization, distribution and evolution (Michelmore and Meyers 1998; Meyers et al. 1999). The use of PCR with degenerate primers targeting the highly conserved subdomains of STK proteins has also proven to be an efficient method for isolating Pto resistance gene candidates (Pto-RGCs) in bean and grapevine (Vallad et al. 2001; Di Gaspero and Cipriani 2003), indicating that this approach could be used to retrieve this type of gene from other plant species. Since no molecular characterization of banana Pto-RGCs has been published, the objectives of this study were: (1) to obtain Pto-RGCs from banana using degenerate PCR and (2) to determine the structure and phylogenetic relationships of the banana Pto-RGCs.

Materials and methods

Plant material and DNA extraction

The banana cultivar ‘Tuu Gia’ was chosen for PCR amplification of Pto-RGC sequences because it shows resistance to a range of banana pathogens, including the most destructive such as M. fijiensis and F. oxysporum, and also because there is a binary bacterial artificial chromosome (BIBAC) genomic library from this cultivar available for banana transformation (Ortiz-Vázquez et al. 2005). The plant material was obtained from the international network for the improvement of banana and plantain (INIBAP) Musa Germplasm transit centre in Leuven, Belgium. The youngest fully developed leaf from a greenhouse-grown ex vitro plant of Musa acuminata cv. ‘Tuu gia’ (accession no. ITC 0610) was harvested for nucleic acid extraction. The genomic DNA was isolated using the Nucleon PhytoPure® kit (Amersham™) according to manufacturer’s instructions.

Degenerate PCR

A pair of degenerate primers designed by Vallad et al. (2001), forward 5′-TNGGNSANGGNGKNTTYGG-3′ and reverse 5′-ACNCCRAANGARTANACRTC-3′, was used to amplify the region between the subdomains I and IX of STKs. The degenerate PCR reaction was performed in a 50-μl reaction volume containing 300 μM of dNTPs, 4 μM of each degenerate primer forward and reverse, 1 U of Taq DNA polymerase (Invitrogen™), 1× PCR buffer, 1.5 mM MgCl2 and approximately 200 ng of genomic DNA. PCR conditions were 95°C for 3 min, followed by 35 cycles of 95°C for 30 s, 45°C for 30 s and 72°C for 1 min; and an additional 10 min extension at 72°C was included.

Cloning and sequencing

PCR products were visualized on a 1% agarose gel stained with ethidium bromide. A band of the expected size was excised from the gel and purified using the High Pure Purification kit (Roche) according to manufacturer’s instructions. Purified PCR products were cloned into the pGEM-T easy plasmid vector (Promega). Plasmids were transferred by electroporation into Escherichia coli DH10B competent cells. Bacteria were plated onto LB medium containing ampicillin, X-Gal and IPTG, and recombinant plasmids were chosen by blue/white selection (Sambrook and Russell 2001). Plasmid DNA was purified by the alkaline lysis method (Sambrook and Russell 2001) and sequenced using the BigDye terminator sequencing kit version 3.1 (Applied Biosystems) according to manufacturer’s instructions. The sequencing products were separated with an ABI 3730×1 automatic sequencer (Applied Biosystems) through the capillary separation service of the Australian Genome Research Facility (AGRF) (http://www.agrf.org.au). Selected clones were sequenced in both orientations.

Sequence edition, similarity searches and multiple sequence alignment

All sequences were assembled and edited using the programs SEQMAN and EDIT, respectively of the Lasergene software package version 4.03 (DNASTAR, Madison, WI, USA). The degenerate primer sequences were removed from each sequenced clone so only the region between the end of subdomain I and the start of subdomain IX of STKs was considered for further analysis. Predicted amino acid sequences were generated using the translate tool of the EDIT program (Lasergene software). Similarity searches were conducted with the BLASTX program (Altschul et al. 1997) through the National Center for Biotechnology Information (NCBI) GenBank database (http://www.ncbi.nlm.nih.gov) using the default settings. Percent amino acid identity between predicted protein sequences was determined with the MEGALIGN program of the Lasergene software using the default settings. Determination of conserved amino acids in banana Pto-RGC sequences was carried out with the programs ClustalX version 1.81 (Thompson et al. 1997) and WebLogo version 2.8.2 (Crooks et al. 2004)(http://weblogo.berkeley.edu/) using the default settings.

Phylogenetic analysis

Phylogenetic trees were constructed by the neighbor-joining (NJ) method (Saitou and Nei 1987) using the NJ algorithm implemented in the Molecular Evolutionary Genetics Analysis (MEGA) software version 2.1 (Kumar et al. 2001) with the Poisson correction. Bootstrapping (1,000 replicates) was used to evaluate the degree of support for a particular grouping pattern in the phylogenetic tree. Protein sequences belonging to 12 groups of characterized STKs from Arabidopsis thaliana (Hardie 1999), a phosphoenolpyruvate carboxylase kinase (PEPck) (GenBank accession no. AF162660) from A. thaliana (Hartwell et al. 1999), the tomato Pto protein (GenBank accession no. A49332) and Pto-RGCs from different plant species were retrieved from the GenBank for the phylogenetic tree constructions. The tomato Pto disease resistant protein was used as query in BLASTP (Altschul et al. 1997) searches to retrieve amino acid sequences of Pto-RGCs from the GenBank. Only the region between the end of subdomain I and the start of subdomain IX was considered for the phylogenetic tree constructions.

Results

Identification of Pto resistance gene candidates in banana

PCR amplification of banana genomic DNA with a pair of degenerate primers previously used by Vallad et al. (2001) generated an expected band of ∼550 bp. This band was cloned and a total of 70 clones were sequenced. The primer sequences were removed from each sequenced clone for further analysis. Of the 70 sequenced clones (Tg-1 to Tg-70), 56 presented uninterrupted open reading frames (ORFs), while the other 14 sequences presented multiple stop codons in all reading frames, and as a result they were not further investigated. Similarity searches of the 56 banana sequences using the BLASTX algorithm (Altschul et al. 1997) against the NCBI non-redundant database revealed significant similarity to known STKs (E value <3e-53), including the disease resistance protein Pto from tomato. A threshold value of 85% amino acid identity previously used by Vallad et al. (2001) to classify Pto-RGC clones from bean into classes or groups was used in the present study, therefore banana clones with greater than 85% amino acid identity were considered to be part of the same group. A total of 13 distinct groups of STK-like sequences were identified, most of which contained redundant or highly similar clones (>97% amino acid identity). Seven groups were designated as Pto resistance gene candidates (Pto-RGCs) based on their significant similarity with the tomato Pto disease resistance protein (E value <3e-53). The other six groups showed significant similarity to other types of STKs, which are described later in the text. Each group was designated by the name of a single clone representative of the group and used for further analysis. Percent amino acid identity between the predicted amino acid sequence of Pto-RGCs and the corresponding region of the Pto protein ranged from 58% (Tg-67) to 68% (Tg-13) (Table 1), whereas amino acid identity among the Pto-RGCs ranged from 84.1% (Tg-4 vs. Tg-13) to 65.3% (Tg-10 vs. Tg-12) (Table 1). BLASTX searches also revealed that two Pto-RGCs (Tg-9 and Tg-12) were highly similar (>92% amino acid identity) to Musa sequences present in the GenBank database as accession nos. AAM97913 and AAM97914 from Musa balbisiana and Musa acuminata, respectively. These sequences were considered to be part of the groups represented by the sequences Tg-12 and Tg-9, respectively.

Table 1 Percent amino acid identity between the banana Pto-RGCs and the tomato Pto disease resistance protein (Martin et al. 1993)

Isolation of other banana serine/threonine kinase-like sequences

The degenerate primers used in this study were designed from the conserved subdomains I and IX of the STKs Pto, Fen and Pti1 of tomato, and MHK and APK1 of Arabidopsis (Vallad et al. 2001). Therefore, these primers have the potential to isolate not only Pto-RGCs but also other types of plant STKs. In agreement with this observation six additional STK-like sequences from banana were identified in BLASTX searches (Table 2). Five of them (Tg-1, Tg-20, Tg-34, Tg-36 and Tg-55) showed significant similarity to the receptor-like kinase (RLK) subfamily (E value <2e-64), whereas the remaining sequence Tg-2 showed a significant similarity to a putative PEPck (E value = 8e-74) from Oryza sativa.

Table 2 Best BLASTX hits of other banana serine/threonine kinase-like sequences isolated in this study

Multiple sequence alignment and phylogenetic analysis

A multiple sequence alignment using the ClustalX program was performed with the predicted amino acid sequences of the seven banana Pto-RGCs and the corresponding region of the tomato Pto protein (Fig. 1). The alignment revealed that several features of the Pto protein are highly conserved in the banana Pto-RGCs such as the STK subdomains internal to the degenerate primer sequences, the presence of the activation domain between subdomains VII and VIII, and its internal P+1 loop site, which is responsible for the specific binding of AvrPto (Frederick et al. 1998), and also several invariant amino acids distributed throughout the sequences. In addition, three of the four autophosphorylation sites (serine or threonine) in the activation domain of Pto (Sessa et al. 2000) are conserved in the corresponding region of all banana Pto-RGCs (Fig. 1). The alignment also showed that all the banana Pto-RGCs presented a two amino acid deletion (subdomain V) and a three amino acid insertion (subdomain VIa) with regard to the Pto protein. We found that the two amino acid deletion was also present in Pto-RGCs from other monocot species such as Oryza sativa (GenBank accession no. XP476621) and Triticum aestivum (GenBank accession no. AAL51075). This deletion was also present in Pto-RGCs from other dicot species such as Arabidopsis thaliana (GenBank accession no. NP197789) and Cucumis sativus (GenBank accession no. AAP57674) but absent in Phaseolus vulgaris (GenBank accession no. AF363819). The extent and significance of this polymorphism in both monocot and dicot Pto-RGCs awaits further research. In the case of the three amino acid insertion, it is present in Pto-RGCs from other monocot and dicot species but absent in Pto-RGCs from the Solanaceae family (Vleeshouwers et al. 2001).

Fig. 1
figure 1

ClustalX alignment of the deduced amino acid sequences of seven banana Pto-RGC sequences together with the corresponding region of the Pto protein. The subdomains, activation domain and P + 1 loop of protein kinases (Hanks et al. 1988; Hanks and Quinn 1991; Vallad et al. 2001) are indicated. Conserved amino acids in plant serine/threonine kinases (Hanks and Quinn 1991) and Pto autophosphorylation sites (Sessa et al. 2000) are indicated with asterisks and grey circles, respectively. Identical amino acids are shaded in black and conservative substitutions are shaded in grey

In order to highlight the Pto autophosphorylation sites that are conserved in the banana Pto-RGCs and other critical residues for Pto function located in the activation domain, a sequence Logo was generated with the banana Pto-RGC products and it is shown in Fig 2. Of the three Pto autophosphorylation sites (Thr195, Ser198 and Thr199) conserved in the banana Pto-RGCs, Ser198 is required for the AvrPto-Pto-mediated hypersensitive response (Sessa et al. 2000) and it is present in the majority of banana Pto-RGCs with the exception of Tg-12, in which the serine has been replaced by threonine, thereby providing an alternative phosphorylation site. The Pto (Thr190) autophosphorylation site was replaced by a proline in the banana Pto-RGCs with the exception of Tg-6, which contains a threonine at this site. Two other Pto residues crucial for Pto function but not autophosphorylated in vitro are Thr204 and Tyr207 (Pedley and Martin 2003). Individual mutations in these residues (pto Thr204Ala or pto Tyr207Ala) abolished Pto interaction with AvrPto in yeast and transient expression of the mutants pto Thr204Asp and pto Tyr207Asp in leaf tissue of Nicotiana benthamiana produced a hypersensitive response-like cell death in the absence of AvrPto (the phenotype of pto Tyr207Aspwas the stronger of the two mutant genes) (Rathjen et al. 1999). The sequence Logo showed that Pto (Thr204) was replaced by a serine in the majority of banana Pto-RGCs and Pto (Tyr207) was present in all Pto-RGCs (Fig. 2).

Fig. 2
figure 2

Comparison of the tomato Pto activation domain (30 amino acids in length) with the sequence Logo of the putative activation domain of seven banana Pto-RGCs. The Logo sequence consists of stacks of letters, one stack for each position in the sequence. The overall height of each stack indicates the sequence conservation at that position (measured in bits), whereas the height of symbols within the stack reflects the relative frequency of the corresponding amino acid at that position (Crooks et al. 2004). The autophosphorylation sites in the activation domain of Pto (Sessa et al. 2000) are indicated with grey circles

The phylogenetic analysis of Fig. 3 shows that the seven banana STK-like sequences identified as Pto-RGCs formed a cluster with the tomato Pto protein, which is supported by a high bootstrap value (99%). This result supported the designation of the seven banana STK-like sequences as Pto-RGCs. The most closely related sequences to the Pto-like kinase cluster were sequences of the receptor-like kinase subfamily (Hardie 1999). Regarding the other banana STK-like sequences, five of them were related to receptor-like kinases as previously observed in the BLASTX results and the remaining Tg-2 sequence formed a highly supported cluster with a PEPck of Arabidopsis (Hartwell et al. 1999) (Fig. 3). This phylogenetic tree also showed that the protein kinase region used for its construction contains sufficient sequence information to represent clusters defined by analysis with full sequence data of the kinase catalytic domain (Hardie 1999). Of the 12 groups of plant STKs (Hardie 1999) used for the phylogenetic analysis, only the receptor-like kinases were not grouped in a single cluster, which is consistent with the phylogenetic tree performed by Vallad et al. (2001). Furthermore, phylogenetic analysis of the banana Pto-RGCs with Pto-RGCs from different plant species (Fig. 4) revealed that the banana Pto-RGCs were more closely related to Pto-RGCs from other plant species than each other. Another interesting finding was that the clades where the banana Pto-RGCs were grouped (I, II and III) also contain Pto-RGCs from dicot species, suggesting that the origin of this type of sequence may have preceded the divergence of monocot and dicot plants.

Fig. 3
figure 3

Neighbor-joining phylogenetic tree based on the ClustalX alignment of serine/threonine kinases from Arabidopsis thaliana (Hardie 1999; Hartwell et al. 1999), tomato Pto disease resistance protein, Pto-RGCs (black circles) and other STK-like sequences (grey circles) from banana. Names of STKs are given for each sequence followed by GenBank accession numbers in parentheses. Amino acids from the end of subdomain I to the start of subdomain IX of STKs were used for the analysis. The numbers above the branches indicate the percentage of 1,000 bootstrap replications supporting the particular nodes, and only those with >60% support are shown

Fig. 4
figure 4

Neighbor-joining phylogenetic tree based on the ClustalX alignment of tomato Pto disease resistance protein, Pto-RGCs from banana (black circles) and other Pto-RGCs from monocot and dicot species. The name of the Pto protein is in bold and GenBank accession numbers are given for each sequence followed by species name in italics. Amino acids from the end of subdomain I to the start of subdomain IX of STKs were used for the analysis. The numbers above the branches indicate the percentage of 1,000 bootstrap replications supporting the particular nodes, and only those with >60% support are shown

Discussion

There is evidence that Pto-RGCs are highly conserved in many plant species. Southern hybridization using the tomato Pto gene as probe revealed the presence of Pto-RGCs in many plant species such as Arabidopsis, bean, soybean, pea, rice, maize, barley, wheat and sugarcane (Martin et al. 1993). Recent studies report the cloning and characterization of Pto-RGCs from potato, bean and grapevine (Vleeshouwers et al. 2001; Vallad et al. 2001; Di Gaspero and Cipriani 2003). Furthermore, other Pto-RGC sequences from different plant families have been deposited in the GenBank (http://www.ncbi.nlm.nih.gov), however these sequences have not been characterized. In this study, a set of Pto-RGC sequences and other STK-like sequences were identified from the banana cultivar ‘Tuu Gia’. These sequences were isolated by PCR using a pair of degenerate primers previously designed and used by Vallad et al. (2001) to isolate Pto-RGCs from bean. In total, seven distinct Pto-RGC sequences and six other STK-like sequences were identified in the banana genome. Vallad et al. (2001) reported the identification of a lower number of Pto-RGCs (five distinct sequences sharing from 56.9 to 63.9% amino acid identity with Pto) and no further cloning of other STK-like sequences. The PCR annealing temperature used in this previous study was high (60°C) in comparison to our study that used a less stringent temperature of 45°C, which may explain the broader diversity of STK-like sequences isolated in banana. This low PCR annealing temperature could explain the isolation of the banana PEPck-like sequence, which was quite divergent from the rest of the banana STK-like sequences isolated (Pto-RGCs and RLKs) (Fig. 3). Although, the degenerate primers amplified preferentially Pto-RGCs and RLK sequences, we believe that they could still be exploited to isolate other types of STK-like sequences by lowering the annealing temperature of the PCR. Overall, our data demonstrated that the degenerate primers used are capable of amplifying Pto-RGCs and other types of STK-like sequences from a monocot species.

The complete genome sequence of Arabidopsis (genome size of 130 Mbp) revealed the presence of 15 Pto-RGCs (Arabidopsis Genome Initiative 2000), while a draft of the rice genome sequence (genome size of 420 Mbp) revealed a similar number of Pto-RGCs with 14 (Goff et al. 2002). These data indicate that the number of Pto-RGCs in these two plant genomes is conserved even though the rice genome is 290 Mbp larger than Arabidopsis, and also indicate that the number of Pto-RGCs in a plant genome is small in comparison to the NBS-LRR class of R genes, which has a large number of divergent genes in the Arabidopsis and rice genomes, with 149 and 480 genes, respectively (Meyers et al. 2003; Zhou et al. 2004). The genome size of Musa acuminata is estimated to be 600 Mbp (Dolezel et al. 1994), assuming that the number of Pto-RGCs in a plant genome do not increase significantly according to the genome size, then it is possible that in banana the number of Pto-RGCs could be similar to Arabidopsis or rice. Hence, it is tempting to speculate that the number of Pto-RGC sequences identified in this study represents a significant proportion of the total number of Pto-RGC sequences in the banana genome. Indeed, the complete sequence of the banana genome will provide comprehensive data on the number of Pto-RGC sequences in this Musaceae species (http://www.musagenomics.org).

All banana Pto-RGC products displayed conserved serine-threonine kinase subdomains (Hanks and Quinn 1991), suggesting that the uncovered genes are likely to encode active kinases. Moreover, most residues of the Pto activation domain involved in pathogen recognition and HR induction (Pedley and Martin 2003) are highly conserved in banana Pto-RGCs suggesting that these residues might play a similar role in banana. Indeed, the cloning of the full cDNA sequence and protein expression of these banana Pto-RGCs will allow the possibility to answer some fundamental questions regarding for example, whether the Pto-RGC encoded proteins are autophosphorylated in vitro and also whether substitution of tyrosine by aspartate in the corresponding site of Pto (Tyr207) will lead to a HR-like induction. Regarding the other STK-like sequences reported in this study, some of them are similar to receptor-like kinases that are known to be involved in the response to pathogens, for example banana Tg-1 was related to the cell wall-associated kinase (WAK) group. WAK1, a member of this kinase group in Arabidopsis, is required for plants to survive P. syringae infection. Moreover, WAK1 expression is induced by salicylic acid in an NPR1-dependent manner, demonstrating that it is a pathogenesis-related gene (He et al. 1998). Other interesting examples are Tg-20 and Tg-36, which were related to the Lectin receptor-like kinase (LecRLK) group. A LecRLK gene is strongly induced upon chitin elicitation in Arabidopsis (Zhang et al. 2002), and recently, a LecRLK gene from rice (Pi-d2) was cloned and identified as a novel class of R gene involved in conferring resistance to Magnaporthe grisea (Chen et al. 2006). The role of the banana RLK sequences in disease resistance remains to be determined.

Phylogenetic analyses of Pto and Pto-RGC sequences have suggested that these sequences form a unique group of kinases in plants (Vallad et al. 2001; Vleeshouwers et al. 2001). In agreement with this finding the banana Pto-RGCs formed a highly supported group with the Pto disease resistance protein (Fig. 3) suggesting that these sequences share a common evolutionary origin with the tomato Pto protein and possibly a similar function in disease resistance. Furthermore, phylogenetic analysis of Pto-RGCs from different Solanum species has revealed that Pto orthologue genes are more similar than paralogues suggesting that the origin of Pto could predate the radiation of Solanum species (Vleeshouwers et al. 2001). This ancient origin of Pto is further supported by the fact that both Pto and a Pto orthologue (LhirPto) are functional in Nicotiana benthamiana (Riely and Martin 2001). Additional evidence of this ancient origin is the presence of Pto-RGCs in other dicot species and also monocots that have been recently deposited in the GenBank (http://www.ncbi.nlm.nih.gov). The phylogenetic analysis of Fig. 4 supports and extends these previous observations since all banana Pto-RGCs were grouped in clades that contained Pto-RGCs from both monocot and dicot species suggesting that the origin of this type of sequence might have predated the divergence of monocot and dicot plants which took place about 200 ± 40 million years ago (Wolfe et al. 1989).

The tomato Pto protein is capable of recognizing at least two Avr proteins (AvrPto and AvrPtoB) from P. syringae (Kim et al. 2002). Surprisingly, these two Avr proteins share limited sequence similarity. This dual recognition specificity has also been reported in R proteins of the NBS-LRR class, for example the Rpm1 protein from Arabidopsis confers resistance to P. syringae and recognizes two different avirulence proteins, AvrB and AvrRpm1 (Bisgrove et al. 1994). Another interesting example is the Mi-1 gene from tomato, which confers resistance to a nematode and an aphid pest (Vos et al. 1998). This dual (and perhaps even multiple) pathogen recognition specificity for a single R protein may prove to be common in R genes (Martin et al. 2003) and raises the possibility that Pto may confer resistance to pathogens other than bacteria. Whether the banana Pto-RGCs are involved in conferring bacterial resistance as in tomato or are involved in conferring resistance to other types of pathogens will require functional analysis, which could be carried out with genetic complementation or loss-of-function experiments. In the case of genetic complementation, the banana Pto-RGCs could be used as probes to screen a banana BIBAC library (Ortiz-Vázquez et al. 2005) for the isolation of BIBAC clones containing Pto-RGCs. These Pto-RGC-BIBAC clones could be used to transform a banana disease susceptible cultivar using Agrobacterium tumefaciens (Khanna et al. 2004). These experiments would lead to a collection of Pto-RGC-BIBAC transgenic lines ready to be used for disease resistance tests. The BIBAC technology coupled with Agrobacterium-mediated transformation not only promises to unravel the function of banana RGCs but also the development of disease resistance in this crop. In the case of the loss-of-function strategy, the banana Pto-RGC sequences could be used in RNA interference (RNAi) constructs (Waterhouse and Helliwell 2003) in order to silence their corresponding targets in a resistant genotype. Those resistant plants that show disease symptoms after the infection with a particular type of pathogen would allow the identification of an R gene. The RNAi technology has been recently used to determine the function of genes involved in disease resistance in barley (Douchkov et al. 2005). The banana Pto-RGCs could also be used to produce molecular markers tightly linked to R genes for genomic mapping and positional cloning. In this respect, several RGCs of the NBS-LRR class have been shown to be quite useful as molecular markers to assist the isolation of functional R genes through map-based positional cloning (McDowell et al. 1998; Zhao et al. 2005).

The Pto gene is considered as a promising candidate for engineering broad-spectrum pathogen resistance in tomato since plants overexpressing this gene display resistance to both bacterial and fungal pathogens (Tang et al. 1999). Moreover, expression of Pto mutants such as pto Thr204Asp or pto Tyr207Asp can constitutively activate a HR-like response in the absence of P. syringae (Rathjen et al. 1999). Expression of these engineered Pto genes under the control of a defined inducible promoter has been considered as another promising strategy to protect crops against pathogens through the hypersensitive response (Rathjen et al. 1999). The cloning of the full cDNA sequences of the banana Pto-RGCs will permit assessing their potential to confer disease resistance using the strategies mentioned above.

In summary, this study has uncovered a set of banana Pto-RGC sequences and provided the first insights about their amino acid sequence structure and evolution. The presence of several conserved amino acids in the banana Pto-RGCs that are crucial for Pto function, and the fact that these sequences were phylogenetically closely related to Pto, make of them a valuable sequence resource for plant–pathogen interaction studies in banana. The banana Pto-RGCs could be used to generate not only a collection of BIBAC clones or RNAi constructs for functional analysis but also they might be useful as molecular markers for genetic mapping. The availability of these sequences will facilitate the cloning of their corresponding full gene sequences, which in turn will allow further genetic and biochemical characterization that may lead to the development of specific or even broad-spectrum pathogen resistance in banana. Moreover, the other banana STK-like sequences identified in this study may be used as a research platform for further studies in this crop.