Introduction

The almond (Prunus dulcis Miller [D.A. Webb]) belongs to the Rosaceae family, genus Prunus. Over 30 species have been described by botanists. The genus almond is one of the oldest tree nut crops and occupies a very peculiar place among fruit trees (Sorkheh et al. 2009). Because of almond’s tolerance to cold, drought and salinity, it is considered an important tree crop and is cultivated in different climatic regions of world. Almond was reported to originate in Central Asia, but almond species grown under a subtropical Mediterranean climate of mild, wet winters and warm, dry summers (Kester and Gradiziel 1996). Since the climate in Türkiye changes from subtropical to terrestrial, most plant species such as almond grow naturally and economically throughout the country (Ercişli 2004). Almond tree is one of the popular nut trees worldwide and produces fruits with high economic value. Türkiye is the main producer of this crop (FAO 2022).

There has been a genetically rich almond population since it has been mostly grown with seed for many years in Türkiye (Aslantas and Guleryuz 1999). In this regard, Türkiye, which has very valuable almond types, has an advantage that is not found in many other countries in terms of the breeding material. This large genetic diversity provides great benefits in the selection of superior genotypes especially in breeding research (Sumbul and Bayazit 2019). It is very significant in terms of almond cultivation that the variation of genetic material is revealed. The morphological and phenological properties are usually used for cultivar identification in almond (De Giorgio and Polignano 2001). Morphological characterization of the almond accessions is the credentials for plant breeders and is also essential to study the genetic diversity within the collection (Van Hintum et al. 2000; Sepahvand et al. 2015). Revealing the diversity of genetic sources, traditional multivariate statistical analyses such as principal component analysis (PCA), hierarchical cluster analysis, and multidimensional scaling analysis are used (Odong et al. 2011).

Although almond is generally a self-incompatible species, some self-compatible cultivars have been identified (Halasz et al. 2006). Almond is a gametophytic species in terms of self-incompatibility. Self-incompatibility is controlled by a single S locus with multiple codominant alleles. It is expressed within the styles of flowers as S‑RNase glycoproteins, which are responsible for the inactivation of the self-pollen tube growth (Wirthensohn et al. 2011). It was reported that these glycoproteins displayed a similar effect in most species of genus Prunus, including almond, apricot, sweet cherry and plum (Yamane et al. 1999; Wunsch and Hormoze 2004; Alonso and Socias I Company 2006; Hajilou et al. 2006). In total, 23 S-alleles of incompatibility (from S1 to S23, and also Sf) have been identified in the cultivated almond species (Ortega et al. 2005). For determining the self-compatible cultivars, two methods have mainly been used to: controlled pollination in the field and S‑RNase analysis in the laboratory (Boskovic et al. 2003). However, new technologies based on DNA analysis have lately been developed (Mousavi et al. 2010, 2014).

Identification of a self-incompatible genotype by PCR consists of the amplification of target DNA by specific primers designed for DNA sequences coding for the S alleles. To determine the different S‑alleles, researchers have designed specific primers. In these studies, it was determined the S1 and S3 alleles by Ma and Oliveira (2001); S7, S8, and Sf alleles by Channuntapipat et al. (2001); S2, S5, S9, S23, and Sf alleles by Channuntapipat et al. (2003). Sanchez-Perez et al. (2004) reported that 10 different S‑incompatibility alleles and Sf-compatibility alleles could be determined by the PCR method, but they could not be differentiated because the S3 allele and Sf allele had the same band size. Sheikh-Alian et al. (2010) used AS1II and AmyC5R primers to determine the self-compatibility of almond cultivars and genotypes. In the study, S1, S2, S3, S5, S7, S9, S10, S11, S12 and S13 alleles were identified, and foreign cultivars showed the same band size S‑alleles. The Iranian genotypes showed alleles as follows: S1 and S7 alleles in ‘Yalda’; S3 or Sf allles in ‘Shahroodi’, ‘Azar’ and ‘Shokoofeh’; S7 and S12 alleles in ‘Talkh-13’; S1 and S13 alleles in ‘Genotype-5’; S13 allele in ‘Genotype-11’. Rasouli (2017) examined some morphological traits and self-incompatibility of 71 almond genotypes. In the study, simple and multiplex specific PCR analyses were performed to identify self- incompatibility alleles in genotypes. As a result of the PCR method performed with the combination of AS1II and AmyC5R primers, the amplifications of 10 self-incompatibility alleles (S1, S2, S3, S5, S6, S7, S8, S10, S12 and S unknown allele) and Sf alleles were determined.

The identification of self-compatible almond cultivars is important for orchard planning and for designing crosses and selection parents in breeding programs. The aim of this study was to evaluate the morphological traits and determine self-incompatibility alleles in 48 foreign and Turkish almond cultivars and genotypes.

Materials and Methods

Plant Material

The 48 almond genotypes (nine foreign cultivars and 39 Turkish cultivars and selected genotypes) (Table 1) were located the ex-situ germplasm collection on the Mediterranean coast of Mersin, Türkiye. Turkish genotypes were selected based on some important morphological and phenological traits from different regions of Türkiye.

Table 1 Genotypes/cultivars and their identification (ID), name and origin

Morphological Traits

The morphological trait values can be found in previous studies (Pinar et al. 2016). The seven phenological and five morphological characteristics were used to perform statistical analysis. The times of bud swell, bud burst, pink bud, start of blooming, full blooming, end of blooming and foliation of all almond genotypes was used as phenological characterization data. The almond cultivars and genotypes had considerable differences in blooming time. The minimum and maximum values in genotypes were as follows: 5 January–14 February for bud swell time, 10 January–18 February for bud burst time, 16 January–26 February for pink bud time, 20 January–1 March for start of blooming time, 25 January–13 March for full blooming time, 29 January–20 March for end of blooming time and 2 February–28 March for foliation time. Genotypes were then divided into three groups based on flowering date early, middle, and late within this period, indicating differences in the chilling requirements of the studied genotypes. Most of the genotypes were middle flowering according to full blooming time. ‘Genotypes 6/12’ (TA-22) and ‘2902’ (TA-24) were the earliest to bloom, flowering on 20 January. The latest genotypes to bloom were ‘42/2’ (TA-8) and ‘47/2’ (TA-26) that was very-late flowering (1 March). In addition, leaf length (mm), leaf width (mm), leaf colour, leaf edge serration and petiole length (mm) were used for morphological data analysis. The leaf length, leaf width and petiole length of almond genotypes ranged from 14.86 mm to 84.91 mm, from 2.11 mm to 30.79 mm, and from 1.80 mm to 35.90 mm, respectively. The genotypes were categorized as light green, green and dark green in leaf colour, and serrated and non-serrated in leaf edge serration.

Data Analysis

Morphological data analysis was performed by NTSYS-pc software (Version 2.2. Exeter Software. Setauket, NY, USA) (Rohlf 2004). Morphological data matrices were standardized using the STAN option. Then SIMINT option was used to compute population distance matrix. The distance matrix was used to set up a dendrogram based on the unweighted pair group method with arithmetic mean (UPGMA) method using the SHAN option. To determine the efficiency of clustering, the cophenetic correlation coefficient (CPCC) was calculated with the Mantel method (Mantel 1967).

PCA was used to measured morphometric distances between trees of all the populations. The PAST program was used to follow the distribution of genotypes and morphological traits according to the first two PCs in the scatter plot.

Identification of S-alleles

DNA Extraction

The leaf tissue was ground in a muller under liquid nitrogen, then DNA was extracted using the CTAB method according to the protocol of Doyle and Doyle (1990). DNA concentration was determined by spectrophotometric measurement. The DNA solution was brought to a concentration of 10 ng/μL and stored at −20 °C.

PCR Amplification

In this study, specific PCR primers [AS1II (forward; 5′-TATTTTCAATTTGTGCAACAATGG-3′) and AmyC5 (reverse;5′-CAAAATACCACTTCATTGTAACAAC-3′)] were used to determine of S‑genotypes (Tamura et al. 2000; Sanchez-Perez et al. 2004). PCR was carried out 15 μL of reaction volume containing 2 μl DNA (30 ng), 1.5 μl 10xPCR Buffer, 0.2 μl Taq DNA polymerase (5 u/μL), 1 μl dNTP (2.5 mM), 1.5 μl MgCl2 (25 mM), 2 μl 10 mM each primer and 6.8 μl H2O. Reactions were performed in accordance with the procedure: initial denaturation at 95 °C for 3 min, 30 cycles with denaturation at 95 °C for 30 s, annealing at 54 °C for 45 s (depending on the primer), extension at 72 °C for 1 min 15 s, and a final extension step at 72 °C for 10 min (GeneAmp_PCR System 9700, Perkin–Elmer Corp, California). PCR products were run in 1.5% agarose gel at 110 volts for 2 h. Resultant bands were visualized under UV light after staining with ethidium bromide. The band widths determined by using 100 bp DNA ladder (Vilanova et al. 2005).

Furthermore, the deduced amino acid sequences of S1–S29 almond alleles were compared with sequences of other Prunus S‑RNases, displaying 97% similarity or higher, as deduced using the FASTA program in the European Bioinformatics Institute toolkit (http://www.ebi.ac.uk).

Results and Discussion

Principal Components (PCs) and Cluster Analyses

PCA was applied based on morphological data of almond genotypes with the PAST software. The major components were described by variables and their relationship was mirrored on principal components scores (Table 2). Most of the total variation (71.94%) in the seven phenological and five morphological traits occurred from the first three PCs with Eigenvalues > 1. The PC‑I, PC-II and PC-III represented 45.21%, 16.63% and 10.13% of total variability, respectively. In each PC, a coefficient greater than 0.3 was determined as the border to define the cut-off limit for the coefficients of the correct vectors (Hanci 2019). PC‑I showed six variables with higher scores (over 0.369 absolute value) related to all phenological characters, except foliation. The highest value of PC-II corresponded to leaf length and width. The separation in PC-III was due to changes in leaf colour and leaf edge serration. The all-important characteristics provided a positive value, but leaf colour had a negative value for the first three PCs in the PCA. On the other hand, foliation time and petiole length showed low variation among the studied genotypes.

Table 2 Principal component (PC) coefficients of seven phenological and five morphological traits in almond genotypes

Our finding, for the first three PCs, was similar with the values obtained (68%) by Nikoumanesh et al. (2011) for 62 almond genotypes from Iran, but was higher than those found by Zeinalabedini et al. (2012) for 71 almond genotypes from Iran, by Colic et al. (2012) for 19 almond genotypes from Serbia, and by Khadivi-Khub and Etemadi-Khah (2015) for 90 almond genotypes from Iran. The highest first three PCs in our study show that the properties observed are excellent for the classification of almond genotypes. Sakar et al. (2019) reported the first two principal components explained 95% of total variance in five phenological traits of almond from 12 local genotypes. Similarly, in a study conducted in Iran, it was reported that 13 phenotypic traits successfully explained genetic diversity in 33 almond cultivars/genotypes (Imani et al. 2021). Iranian almond germplasm, a large proportion of the variation was explained by the traits related to leaf and tree (Nikoumanesh et al. 2011). In the study by Sepahvand et al. (2015), the variables contributing to nut and kernel size were more important than leaf traits in different almond collections.

The distribution of the 48 almond accessions on the two-dimensional graph is shown in Fig. 1, according to the first two major PCs. Almond genotypes showed generally distribution on the graph, but not grouped. According to this analysis, accessions TA-22 and TA-24, accessions TA-25 and TA-32, accession TA‑8 and accession TA-40 are located separately from others as in the cluster analysis. The distribution of the measured morphological traits is shown in Fig. 2. According to this graph, the leaf edge serration of the genotypes is located in a different region compared to the other traits. On the other hand, the foliation character had the lowest effect on the distinction of accessions.

Fig. 1
figure 1

Distribution of accessions based on the first and second principal components obtained from morphological data

Fig. 2
figure 2

Distribution of observed characters based on the principal component I (PC)-I and PC-II in the almond genotypes

The cluster analysis grouped the 48 almond accessions into two main clusters, and they were further divided into two sub-clusters (Fig. 3). Five genotypes were in the first cluster (I), and the 42/2 genotype (TA‑8 accession) was grouped separately from the others in the first sub-cluster (I-A). In all, 43 of the 48 genotypes were included in the second cluster (II). There were 10 genotypes in the 1st sub-cluster (II-A), and 33 genotypes in the 2nd sub-cluster (II-B). On the other hand, genotypes in the 2nd sub-cluster (II-B) were grouped into two sub-groups with a distance value more than 15.0. The UPGMA dendrogram and the scatter plot approved the high variation between almond genotypes. The relative harmony between the results of PCs and cluster analyzes showed that morphological traits can ensure credible information on the variability in genotypes. Similar results were observed in almond genotypes in Lebanon (Talhouk et al. 2000), Serbia (Colic et al. 2012) and Iran (Sorkheh et al. 2009; Zeinalabedini et al. 2012; Khadivi-Khub and Etemadi-Khah 2015).

Fig. 3
figure 3

Dendrogram of the 48 almond accessions based on morphological characters. I First group, II Second group, IA Subgroup of I, IB Subgroup of I, IIB Subgroup of II, IIA Subgroup of II

Identification of Self-Incompatibility Alleles

In this study, the primers developed by Tamura et al. (2000) were used for identification of S‑alleles in some almond genotypes. According to the results of PCR using the AS1II- and AmyC5R-specific primers in a single reaction, the amplification was successful. In the study by Sanchez-Perez et al. (2004) was used for identification S‑alleles the two primers (AS1II/AmyC5R). In the same study, it was proved that the lengths of amplified fragments in both single PCR and multiplex PCR were similar. Martínez-Gómez et al. (2003) reported that the eight S‑alleles in almond cultivars identified in single PCR reactions using these PCR primers. The primers (AS1II and AmyC5R) used in this study were quite efficient in the identification of several S‑alleles. The results showed amplification of nine different self-incompatibility alleles (S1, S2, S3, S5, S6, S10, S11, S12 and S13) and of the self-compatibility allele Sf. The PCR-amplified fragments were size from 600 to 1600 bp. The amplified fragments of Sf self-compatibility allele and S3 self-incompatibility allele had the same band size in 1200 bp. The findings of Sanchez-Perez et al. (2004) was similar with findings in this study. DNA fragment sizes corresponding to each S‑allele agreed in most status with the available data. Thus, S‑genotypes of the new tested almond selections were determined (Table 3).

Table 3 Sizes of amplification products and S‑alleles identified in 48 almond genotypes using single polymerase chain reaction

The number of self-compatible genotypes was 12, including ‘Marta’, ‘F. Barese’, ‘Tuono’ and ‘Super Nova’ cultivars. For 16 genotypes, two alleles were confirmed; and for other genotypes, only one allele was determined. S1, S2, S5 and S6 were the most common alleles, as each was found in almond genotypes assayed here. The least common alleles were S10, S11, S12 and S13 alleles, and especially S10 was determined only in ‘Dokuzoguz’ cultivar. S‑allele frequencies were probably related to geographical origin. The S‑allele frequencies results were a little different than those reported by Lopez et al. (2006), who found that the alleles S1, S5, S7, and S8 in 133 almond cultivars were observed more frequently than others. Valizadeh and Ershadi (2009) reported that the alleles S2, S7 and S13 were the most common alleles in 16 Iranian almond cultivars. On the other hand, the S‑genotypes of Iranian and foreign almond genotypes have been reviewed by Rasouli (2017). The study found that the alleles S1 and S2 were observed more frequently than others.

Conclusions

In this study, a wide phenotypic and morphologic diversity in all evaluated traits was detected. The presence of such diversity and the maintenance of genetic diversity are important for future breeding of almond. But it may be beneficial to use molecular data to determine genetic diversity and relatedness. On the other hand, the determination of eight local genotypes with Sf allele in the study is promising to consider the self-inefficiency in almond species. According to the results, the selection of the appropriate variety and pollinator in the new orchards will contribute to production and cultivation. However, in parallel with the research and goals in the world, especially in almond type, the process of obtaining a national variety of almond type that can grow naturally in Anatolia for thousands of years will be initiated by contributing to the selection of the right parents to determine a late and self-efficient variety.