Introduction

The PAPhy_a gene expresses a purple acid phosphatase phytase during grain filling and is the main source of mature grain phytase activity (MGPA) in cereals belonging to the Triticeae tribe (Brinch-Pedersen et al. 2013). Phytate represents the largest pool of phosphorus in cereal seeds and it chelates nutritionally important metals including calcium, iron and zinc. Monogastric animals excrete the majority of phytate bound phosphorus and metals unless the feed has sufficient phytase activity to degrade the phytate (Brinch-Pedersen et al. 2002). PAPhy enzyme activity therefore exerts a positive influence on the nutritional value of cereals as well as the environmental impact of farming. Accordingly, it is of major interest to find ways of increasing the activity in elite cultivars (Brinch-Pedersen et al. 2013). A transformation mediated gene duplication of PAPhy_a resulted in approximately doubled phytase activity in barley (Hordeum vulgare L.) (Holme et al. 2012). This demonstrated that there is no physiological barrier against a significantly higher expression of the PAPhy_a gene. Increased levels of activity will be within reach through conventional breeding, providing that more transcriptionally active PAPhy_a alleles are available. One potential strategy is to utilize the genetic diversity among wild and extant relatives of the major crop species wheat (Triticum aestivum L.) and barley. In cases where donor and recipient species are sexually compatible, favourable alleles can be introduced into elite cultivars by crossbreeding. Alleles from more distant relatives may be introduced by transformation or serve as templates for genome editing. However, there is very little background information available to guide the effort to increase MGPA. The PAPhy_a gene was only recently described and its sequence is only known for a handful of species (Madsen et al. 2013). Isolated PAPhy_a genes were mapped to chromosome 5H in H. vulgare and chromosomes 5A, 5B and 5D in T. aestivum using aneuploid lines. A single PAPhy_a gene was also identified in the diploid species einkorn (T. monococcum L. subsp. monococcum) and Aegilops tauschii Coss. whereas two copies were identified in rye (Secale cereale L.) (Madsen et al. 2013). This suggests that PAPhy_a existed as single gene on chromosome 5 in the diploid common ancestor of Hordeum L., Triticum L. and Aegilops L. and further that the PAPhy_a complement in modern wheat is directly inherited from its diploid ancestors. The search for more favourable PAPhy_a alleles and the study of wheat genome evolution may therefore be mutually informative.

The Triticeae cereals include diploids such as einkorn, barley and rye. However, the economically most important Triticeae cereals are the allopolyploid wheats. The allopolyploid wheats belong either to the ‘emmer’ group or the ‘timopheevii’ group. The ‘emmer group’ is derived from wild emmer (T. turgidum subsp. dicoccoides (Körn. ex Asch. et Graebn.) Thell.) and include among others, the domesticated, tetraploids emmer (T. turgidum subsp. dicoccon (Schrank) Thell.), durum (T. turgidum subsp. durum (Desf.) Husn.) and the hexaploid bread wheat (T. aestivum). Similarly, the ‘timopheevii’ group is derived from the wild T. timopheevii subsp. armeniacum (Jakubz.) van Slageren (syn. T. araraticum Jakubz.) and includes the domesticated forms sanduri (T. timopheevii Zhuk. (Zhuk.) subsp. timopheevii) and Zhukowskys wheat (T. zhukovskyi Menabde et Ericz.) (Salamini et al. 2002; Wagenaar 1966). The ‘emmer’ group became crops of global importance whereas the ‘timopheevii’ group remained restricted to the Transcaucasus region. Nevertheless, the ‘timopheevii’ group represent an important genepool for the improvement of wheat and has been the donor of e.g., disease resistance genes (Perugini et al. 2008). The genomes included in the ‘emmer’ group are denoted A, B and D and in the ‘timophevii’ group A and G (Kilian et al. 2007).

The evolution of the Triticeae cereals, in particular the identity of the donors of the T. aestivum genomes, has been studied for almost a century and the literature on the subject is immense [see reviews by Peng et al. (2011) and Haider (2013)]. Briefly, the D genome has been assigned to Aegilops tauschii (Kihara 1944; McFadden and Sears 1944, 1946). The A genome was initially assigned to wild einkorn (T. monococcum spp. aegilopoides (Link) Thell.), but the recognition of T. urartu Tumanian ex Gandilyan as a separate species prompted new investigations. Strong evidence now supports the later species as the A genome donor (Dvorak et al. 1993). The superscripts A u and A m are used to distinguish between the genomes from T. urartu and T. monococcum, respectively. The potential donor of the B genome remains controversial. Based on morphology, the search has focused on the Sitopsis section of Aegilops consisting of Ae. speltoides Tausch, Ae. sharonensis Eig, Ae. longissimi Schweinf. et Muschl., Ae. bicornis (Forssk.) Jaub. et Spach and Ae. searsii Feldman et Kislev ex Hammer (Sarkar and Stebbins 1956; van Slageren 1994). Each of these species has been suggested as donors of the B genome, but the most compelling evidence has been accumulated for Ae. speltoides (Haider 2013), though this is not universally accepted (Fernández-Calvín and Orellana 1994; Huang et al. 2002; Salse et al. 2008). The hybridization which created T. turgidum subsp. dicoccoides (AB) happened approximately 3–500,000 years before present (yBP), whereas the hybridization that created T. aestivum (ABD) took place 10,000 yBP (Haider 2013). Whole genome sequencing provided more ancient estimates of 580,000–820,000 yBP for AB and 230,000–430,000 yBP for ABD (Marcussen et al. 2014). It is however likely that these estimates partially reflect population divergence prior to the hybridization events as discussed by the authors. The more recent estimate for the ABD hybridization event is supported by the evidence which connect the hybridization event to the spread of agriculture (Salamini et al. 2002). The origin of bread wheat is so recent that its genomes have not diverged much form the donors and favourable alleles can relatively easily be introduced in T. aestivum from Ae. tauschii and T. turgidum via synthetic hexaploids (Dreisigacker et al. 2008). The phylogenetic relationships of diploid Triticeae has received comparatively less attention since molecular methods were introduced. Indeed, most studies have a priori limited themselves to species that are Triticum or Aegilops by name. This is unfortunate, since Petersen et al. (2006) placed a number of less studied species within the Triticum/Aegilops clade. Amblyopyrum (Boiss.) Eig and Thinopyrum (Saÿvul. et Rayss) Á. Löve was for example found to be much closer relatives of the A genome group than any of the Aegilops. Recently, the availability of whole genome sequence data has prompted renewed interest in the phylogenetic relationships of the diploid Triticum and Aegilops. Marcussen et al. (2014) proposed that Ae. tauschii and by extension the D genome resulted from an ancient hybridization between ancestors of the A and B genomes. A more complex hybridization history was subsequently suggested (Li et al. 2015a, b). El Baidouri et al. (2017) expanded on this, suggesting a model in which both the B and D genome ancestors had undergone ancient hybridizations. Further, the concept of subgenome genome dominance was used to explain different mutation rates following polyploidization. While genomics is certainly providing intriguing new insights it is unfortunately still limited to a few taxa. Approaches which can be applied to many taxa, such as single gene phylogeny, are therefore still indispensable.

Here we provide a survey of the diversity of PAPhy_a alleles in the Triticeae with emphasis on the Triticum/Aegilops complex. The survey includes seven samples of Ae. tauschii and T. turgidum since these species are of special interest to the improvement of bread wheat. Also rye has been used in the improvement of wheat and for the generation of synthetic polyploids, triticale (X Triticosecale Witt.) (Oettler 2005; Rabinovich 1998). We have therefore included four Secale L. accessions. A phylogenetic tree was constructed to visualize the evolution of the gene. The closest relatives of the wheat alleles were subjected to pairwise comparisons in order to assess the potential for finding useful genetic variation. To meet this end, we analysed approximately 2300 bp conserved gene sequence encompassing the core promoter, some 3′UTR and all introns and exons from the PAPhy_a gene. Fifty sequences from 34 samples of 21 species were included.

Materials and methods

The 34 samples used representing 21 species are listed in Table 1, which also include the authors of the taxa used. Note that two samples labelled as T. urartu by the IPK genebank (Tri 11494 and Tri 11495) have been re-identified as T. monococcum subsp. aegilopoides. This was done by morphology according to van Slageren (1994).

Table 1 Investigated samples and accession numbers for new and previously published sequences

DNA was extracted for the current study as described by Møller et al. (2003). Some DNA samples were reused from Petersen et al. (2006), see Table 1. PCR was performed with Herculase II polymerase (Agilent Technologies) in 50 µL reaction mixture with 6% DMSO according to the manufacturer’s instructions. The primers used are listed in Table 2. Primers 1, 2 and 3 were reused from (Madsen et al. 2013). Primer 4 was designed to overcome the poor performance of primer 3. It is shifted 9 bp in the upstream direction. I.e. amplicons of this primer are 9 bp shorter when excluding the primer sequences themselves. Most samples were amplified using primers 1 + 3 and/or 1 + 4. Combination 1 + 4 produce stronger bands than combination 1 + 3 and requires no pre-annealing (see below). However combination 1 + 4 is not as universal as combination 1 + 3. Samples which failed to produce amplicons with these primer sets (i.e., Secale) were amplified using primers 2 + 3. Reactions with primer 3 required a pre-annealing step with this primer alone: 98 °C, 5 min; 30 °C, 20 s; heating to 72 °C at 2.0 °C/s; 72 °C, 5 min; 4 °C, ∞. The forward primer was subsequently added, whereafter the PCR proceeded using the same conditions for all primer combinations: 96 °C, 2 min; 38× (96 °C, 1 min; 59 °C, 20 s; 72 °C, 2 min 30 s; 72 °C, 3 min); 72 °C, 3 min; 4 °C ∞.

Table 2 Primers used to amplify and sequence PAPhy_a

Amplicons were cloned in the pCR 4Blunt-TOPO vector (Invitrogen) and examined with EcoRI digests. Clones with 2000–2500 bp inserts were selected for sequencing. The restriction pattern from internal EcoRI sites supported the identification of homeologous clones in samples from the polyploid species. Sequencing was performed with two vector specific primers and two internal primers (Table 2 primers 5–8) by Eurofins Genomics, Ebersberg, Germany. The reads were assembled in CLC Main Workbench and trimmed to remove vector and incorporated primer sequences and to standardize the length of amplicons produced by different primer combinations. At least two clones from independent PCR reactions were sequenced per locus until a consensus was reached. The PAPhy_a sequence of Ae. sharonensis was downloaded from the published genome sequence after blasting with the TaPAPhy_a1 full length genomic sequence at http://wheat-urgi.versailles.inra.fr/Seq-Repo<3sitory/BLAST. All new sequences are deposited in ENA. See Table 1 for accession numbers.

Sequences were aligned using the CLC Main Workbench with default settings (gap open cost 10 and gap extension cost 1) and subsequently adjusted manually (supplementary file 1). CLC Main Workbench uses a proprietary algorithm based on progressive alignment (CLCbio 2007; Feng and Doolittle 1987). A phylogenetic analysis was performed using the RAxML maximum likelihood algorithm (Stamatakis 2006) implemented in Geneious version 8.1.2 (Kearse et al. 2012). The analysis was done with data in four partitions (exons, introns, upstream and downstream regions) and using the GTR + GAMMA + I nucleotide model and running 1000 bootstrap replicates. Pseudoroegneria (Nevski) Á. Löve was used to root the tree in accordance with Petersen et al. (2006). The sequences from the taxa in the individual genome groups A, D and B/G/ Ae. speltoides were realigned using above mentioned phylogenetic analysis (Fig. 3) as a guide tree. Again, the alignments were generated using CLC Main Workbench with the default settings, but without manual adjustment. The alignments were used for pairwise comparisons of the full length nucleotide sequences within the three genome groups. Subsequently, the alignments were shortened to include only the promotor and 5′UTR sequences (5′flanking region). Separate pairwise comparisons were generated from this alignment. Finally, the encoded protein sequences were aligned and compared [excluding the signal peptide as defined in Dionisio et al. (2011)].

Results

A total of 50 PAPhy_a sequences were obtained. The sequences varied in length from 2239 to 2598 bp and the alignment had a length of 3253 bp. The conservation along the alignment is compared to the structure of the gene in Fig. 1. The gene structure was determined using the TaPAPhy_a1 coding sequence as scaffold. The exons are more conserved than non-conding promoter, intron and UTR sequences. Lengthy insertions in a single sequence cause the three stretches of only 2% conservation at 178–397 bp (insertion in S. cereale cv. Picasso 2), 1882–1995 bp [intron gain in S. strictum NGB9846.1 (2)] and 2656–2933 bp (elongated intron in H. vulgare cv. Igri). A pairwise comparison of similarity was generated from the alignment (Fig. 2). The lowest similarity of 71% (851 nucleotide differences and indels) was found between H. vulgare cv. Igri and S. cereale cv. Picasso. Seventeen sequences were identical with at least one other sequence. With a single exception, all identical sequences were from the same species or from homologous genome groups (e.g., the A u genomes of tetraploid and hexaploid wheats). The maximum likelihood tree is shown in Fig. 3. Generally, there is no or only little bootstrap support (BS <50%) for the backbone of the tree whereas the various wheat genome groups and Secale are all strongly supported (BS = 100%). Accordingly, the data give only limited indication of relationships between the wheat genome groups. It does, however, allow us to infer the relationships within the groups including the relationships between polyploids and their diploid donor species. In the A genome group, sequences from T. monococcum and T. zhukovskyi are, with high support (BS = 100%), sister group to sequences from T. timopheevii, T. aestivum, T. turgidum and T. urartu. The highly supported D genome group (BS = 100%) consists only of sequences from Ae. tauschii and T. aestivum. The S′ genome group consists of sequences from three Aegilops species (Ae. bicornis, Ae. sharonensis and Ae. searsii) and is also highly supported (BS = 100%); within this group Ae. bicornis and Ae. sharonensis are most closely related (BS = 99%). Full length nucleotide sequences in the A genome group were between 94 and 100% similar (Fig. 4 top panel left). The sequences can be divided in three groups: sequences A–G represent the A u genome of the ‘emmer’ group and T. urartu. Sequences H–K represents the Au genome of the ‘timopheevii’ group and sequences L–O represent the A m genome. The full length nucleotide sequences within each group are >99% similar (Fig. 4, top panel left). The 5′flanking sequences were 67–100% similar (Fig. 4, top panel center). As for the full length sequences, the A genome 5′flanking region are highly similar within the subgroups. The pairwise comparison can therefore be simplified: A–G is 74–75% similar to H–K and 91–92% similar to L–O. The groups H–K and L–O are 67–68% similar. At the protein level, the T. aestivum cv. Bob White allele stands out because of a frame shift insertion. The remaining proteins in the A genome group has 0–10 amino acid differences but no indels.

Fig. 1
figure 1

Conservation along the PAPhy_a gene sequence. The top bar shows the gene structure. The 5′flanking region is blue, exons green, introns red and 3′UTR yellow. The hatch intron is only found in one sequence (S. cereale cv. Picasso 2). The conservation is shown as a percent similarity graph below the bar. (Color figure online)

Fig. 2
figure 2

Pairwise comparison based on the alignment of all isolated sequences. Below the diagonal, percent similarity; above the diagonal, number of differences

Fig. 3
figure 3

Maximum likelihood tree based on the four different partitions of the PAPhy_a gene. The numbers below or above the branches are bootstrap proportions

Fig. 4
figure 4

Pairwise comparisons of sequences from the taxa included in the A genome group (top), the B genome group (middle) and the D group (bottom). The comparisons include the full nucleotide sequences (left), the 5′flanking regions (middle) and the encoded proteins (right). The nucleotide sequences are compared according to % similarity below the diagonals and number of differences above the diagonals. The protein sequences were compared according to differences below and gaps above diagonals. Differences minus gaps equal amino acid substitutions

Three T. aestivum and four Ae. tauschii sequences belong to the D genome group (Fig. 4 bottom panel). The full length T. aestivum nucleotide sequences and one Ae. tauschii sequence (Ae231) are identical and all sequences are >99% similar (Fig. 4, bottom left panel). Also the 5′flanking region and proteins are highly conserved with maximum three nucleotide differences or amino acid substitutions respectively (Fig. 4, bottom panel centre and right).

The B/G/Ae. speltoides group include six sequences from the ‘emmer’ group (A–F), five sequences from the ‘timopheevii’ group (G–K) and three from Ae. speltoides (L–N) (Fig. 4 middle panel). The full length nucleotide sequences from the polyploid taxa are 89–100% similar, but none of the sequences from polyploid taxa are more than 94% similar to any of the Ae. speltoides sequences. One sequence of the ‘emmer’ group (T. aestivum cv. Bob White) is identical to sequences of the domesticated representatives of the ‘timopheevii’ group but only up to 90% similar to other members of the ‘emmer’ group. With this exception, sequences within the respective groups ‘emmer’ and ‘timopheevii’ are >98% similar. Excluding the Bob White allele, emmer group 5′flanking regions are >99% similar having just one nucleotide difference. The ‘timopheevii’ group display two distinct 5′flanking regions with 24 nucleotide differences. The proteins have 0-18 amino acid substitutions and no gaps (Fig. 4 middle panel right). The Bob White protein is identical to the proteins in the ‘timopheevii’ group and has 16 or 17 amino acid substitutions compared to other proteins in the ‘emmer’ group. Only one amino acid substitution is found among the remaining ‘emmer’ group proteins and there are no substitutions in the ‘timopheevii’ group.

Discussion

The present study includes samples of most, if not all, potential donors of all wheat genomes. For a more thorough representation of Triticeae diversity it also includes a number of wild, diploid species belonging to other genera. Further, it includes representatives of the frequently neglected ‘timopheevii’ group. Phylogenetic analysis and pairwise comparisons were based on a complete nuclear gene spanning more than 2.2 kb in all samples.

The three T. aestivum gene copies from cv. Chinese Spring have previously been mapped to chromosome 5A, 5B and 5D using aneuploid lines (Madsen et al. 2013). Their positions in the tree confirm that they have been inherited from the diploid genome donors and that these donors are T. urartu (A), Ae. speltoides (B) and Ae. tauschii (D) in line with the increasingly accepted view (Haider 2013; Peng et al. 2011). Including less or other taxa and accessions of both Triticum and Aegilops, largely the same patterns were found by Petersen et al. (2006). One interesting finding is that, Amblyopyrum which van Slageren (1994) excludes from the Aegilops based on morphology, here falls within Triticum and Aegilops though with no support. Petersen et al. (2006) found high support for a position within Triticum and Aegilops.

The A genome group

The sequences representing T. urartu and T. monococcum together with one sequence from each of the polyploid wheat samples form a monophyletic group with high support (BS = 100%). This A genome group has two distinct subgroups containing among others T. urartu and T. monococcum, respectively. These subgroups represent the A u and A m genome. The only polyploid taxon in the A m group is T. zhukovskyi. This is in agreement with Dvorak et al. (1993) who found evidence of A m as well as A u genomes in T. zhukovskyi and suggested it to originate from hybridization between T. timopheevii and T. monococcum. Accordingly an A u gene copy was also expected in T. zhukovskyi but we were unable to isolate it. It could be mutated in a way that prevents amplification with the current primes, but it is also possible that the A u gene copy has been replaced by the A m copy through heterogenous chromosome pairing. The apparent loss of A u DNA in T. zhukovskyi has also been observed by Baum and Bailey (2004) and Dvorak et al. (1993). Within the A u group, the sequences from the ‘timopheevii’ wheats form a well-supported clade (BS = 91%). A prominent characteristic of the ‘timopheevii’ group A u allele is a 69 bp deletion in the 5′flanking region, but also SNP’s contribute to differentiate this allele form other A u sequences. The deletion in the 5′flanking region includes cis acting regulatory elements previously identified (Madsen et al. 2013). It is therefore very likely that the gene regulation is affected. The A genome allele of the T. turgidum subsp. dicoccoides accession Tri18485 is identical to the allele of T. aestivum cv. Chinese Spring whereas those of Tri18508 and Tri18523 both have 8 differences resulting in 1 and 3 amino acid substitutions in the encoded proteins (Fig. 4). These results show that exotic breeding materials for the A genome may provide novel PAPhy_a alleles but they are not guaranteed to do so. It is therefore advisable to use molecular screening methods to support the breeding effort.

The D genome group

All sampled sequences from Ae. tauschii form a strongly supported group together with three identical sequences from T. aestivum, thus supporting that Ae. tauschii is the D genome donor of wheat. One sequence from Ae. tauschii (Ae 231) is identical to those of T. aestivum whereas the other Ae. tauschii sequences have 10 or 20 nucleotide differences, respectively (Fig. 4). One accession (AE229) has nucleotide differences in the 5′flanking region and the encoded protein (Fig. 4). Whereas the D genome PAPhy_a gene copy thus seems to be very conserved within T. aestivum, the variation among sequences in Ae. tauschii may potentially be useful for crop improvement. As for T. turgidum, it should be useful to apply molecular screening methods to ensure that Ae. tauschii specimens harbour novel alleles before crossing.

The B genome group

Aegilops speltoides has hitherto also been postulated to carry the S genome (Friebe et al. 1995). In the current tree it groups within a highly supported clade (BS = 100%) which also contains sequences from the B and G genome. This finding is in agreement with Kilian et al. (2007) and Golovnina et al. (2007) both of which found that the B and G genomes can be understood as two examples of Ae. speltoides haplotype diversity sequestered by independent hybridization events. Since the B and G genomes are both derived from Ae. speltoides, a more appropriate nomenclature for the group would be B b (emmer group), B g (timopheevii group) and B s (Ae. speltoides). Within the highly supported (BS = 100%) B genome group, sequences from the hexaploid wheats are placed in either of two strongly supported groups: one includes sequences from two T. aestivum cultivars and tetraploid T. turgidum, and the other includes sequences from one T. aestivum cultivar (Bob White), T. zhukovskyi and the four T. timopheevii accessions. A sequence from one of the accessions of Ae. speltoides is strongly supported (BS = 86%) as the sister group to the latter Bob White plus T. timopheevii and T. zhukovskyi clade and sequences from the remaining two accessions of Ae. speltoides are weakly supported (BS = 64%) as their sister group, thus rendering Ae. speltoides paraphyletic. It is reasonable to assume that the ‘timopheevii’ group sequences in this clade are representatives of the B g genome since the B b and B g genomes are known to be very similar (Kilian et al. 2007) and the other ‘timopheevii’ group sequences clearly belongs to the A genome group. The deepest bifurcation in the group splits a weakly supported (BS = 64%) clade with Ae. speltoides and the B g genome Triticum’s from the strongly supported (BS = 100%) B b genome Triticum’s. The position of the Bob White sequence appears to be an abnormality, which can be explained by the use of the T. timopheevii hybrid Timstein in breeding Bob White (GRIS database: http://wheatpedigree.net/sort/show/8192). Thus, it is an example of genetic variation of the PAPhy_a gene which has already been introduced in bread wheat from the ‘timopheevii’ group by chance. The five accessions belonging to the ‘timopheevii’ group produced only two different PAPhy_a sequences from the B g genome locus. The sequences distinguish domesticated and wild accessions and differ in 25 aligned positions due to one SNP and a 24 bp indel. Sequences assigned to the B b genome are more diverse than those of the B g genome. The least similar B b genome alleles are those of Tri18508 and Tri18523 which differ in 47 aligned positions because of 18 SNPs and 7 indels. On the other hand, identity was found between the sequences from Chinese Spring (a hexaploid) and Tri18485 (a tetraploid). The three T. turgidum sequences have identical 5′flanking sequences to Chinese Spring, whereas Villa Jurez has one SNP. Only the sequence Tri18508 encodes a different protein than the other B b alleles (one amino acid substitution).

Perspectives for the improvement of MGPA in bread wheat

It was demonstrated that direct progenitors of bread wheat, Ae. tauschii and T. turgidum, may carry PAPhy_a gene copies which are identical to those of bread wheat or copies with up to 44 nucleotide differences. Some of these differences were found in the 5′flanking region and others resulted in amino acid substitutions in the encoded proteins. Thus, it is very likely that alleles with different activity profiles are available. Further, it should be possible to evaluate them before introgression in bread wheat by direct comparison of the MGPA in candidate specimens with specimens of the same species carrying PAPhy_a gene(s) identical to those of bread wheat. The effect of amino acid substitutions on the enzyme properties can be evaluated after recombinant expression of the proteins as demonstrated by Dionisio et al. (2011).

Additional gene variation is found in T. timopheevii and T. zhukovskyi. As discussed above, the T. timopheevii B g genome allele has already been transferred to T. aestivum cv. Bob White (and presumably many other cultivars breed from Timstein). Bob White does not have a high MGPA (Madsen et al. 2013) so the B g genome allele does not appear to confer very expression levels or a much more active enzyme. It should be considered though, that Bob White also has an allele on the A genome which is most likely nonfunctional because of a frameshift insertion. It is therefore not possible to precisely evaluate the potential of the T. timopheevii allele based on Bob White MGPA. The T. timopheevii A u allele with its notable deletion of regulatory elements in the promotor is very likely to provide a different expression level but it cannot be predicted if it will be higher or lower. Also the A m allele of T. zhukovskyi and T. monococcum holds promise, since T. monococcum is known to have a high MGPA (Madsen et al. 2013).

However, the search for better expressed PAPhy_a alleles does not need to be restricted to very close relatives of wheat. Rye translocation lines have long been used in wheat breeding (Graybosch 2001) and it is clear from Fig. 3 that considerable genetic variation is available in the Secale. Rye has consistently shown higher MGPA compared to wheat and barley (Eeckhout and De Paepe 1994; Madsen et al. 2013; Steiner et al. 2007; Viveros et al. 2000), but it remains to be examined if the Secale genes will prove as active when transferred to wheat.