Introduction

Banana farming is socially and economically important in the tropical and subtropical regions worldwide, especially for small farmers. Bananas are consumed by all age groups, has high energetic value and various vitamins and minerals, and is the base food of millions of people, in both fresh and processed forms (Castañeda et al. 2017). The banana export industry is a relevant economic factor to many Latin American and Asian countries, such as Costa Rica, Ecuador, and the Philippines. Worldwide banana production in 2017 was approximately 115 million tons cultivated on 5.4 million hectares (FAOSTAT 2018). In 2017, Brazil produced 7.0 million tons of bananas, on approximately 517 thousand hectares, placing it in third place in terms of production worldwide (FAOSTAT 2018).

Among the limiting biological factors in banana production are various phytopathogens (fungi, bacteria, viruses), nematodes, and insects that have negative impacts on production and compromise the final quality of the fruits. The most important fungi are the causative agents of yellow Sigatoka (Pseudocercospora musicola, Leach), black Sigatoka (Pseudocercospora fijiensis, Morelet), and Fusarium wilt (Fusarium oxysporum f. sp. cubense). The development of resistant cultivars through crosses between resistant diploids and susceptible commercial cultivars is the most viable way to minimize damages caused by diseases. This strategy is limited by the low fertility observed for commercial triploids, especially those of the Cavendish subgroups and some diploids (Alakonyaa et al. 2018). However, tools such as genetic engineering (cisgenics), induced mutation via gamma ray irradiation, induced in vitro somaclonal variation, somatic hybridization, and induced polyploidy, can be used to improve banana plants (Pestana et al. 2011; Amaral et al. 2015; Borges et al. 2016).

Polyploidy is the presence of more than two complete genomes in the cell nucleus and is an important phenomenon in plant evolution due to an increase or doubling of the basic number of chromosomes (Eng and Ho 2019). Polyploidization affects the genetic and phenotypic constitution of an organism, promoting diversification and environmental adaption of the species, and occurs in at least half of the plants cultivated (Hanzl et al. 2014). This technique has been used to genetically improve various species, including corn, cabbage, and wheat (Aslan et al. 2017; Würschum et al. 2012), as well as bananas (Hamill et al. 1992; Bakry et al. 2007, 2009; Costa et al. 2011; Amaral et al. 2015; Borges et al. 2016).

According to Zhou and Gui (2017), polyploidization increases phenotypic variability, plant vigor, tolerance to biotic and abiotic factors, and adaptability to new environments. Therefore, methods have been developed to double chromosomes, including the use of various antimitotic agents, such as colchicine, trifluralin, oryzalin, and amiprofos-methyl (Van Duren et al. 1996; Ganga et al. 2002; Bakry et al. 2007). Genetic variability induced by polyploidization can be quantified using molecular markers such as simple sequence repeats (SSR) and those associated with families of transposable and retrotransposable elements (IRAP). These markers are widely used in Musa spp. diversity studies (Pereira et al. 2012; Pachuau et al. 2014; Rotchanapreeda et al. 2014).

Thus, the objective of this work was to analyze the genetic variability available through doubling chromosomes of autotetraploid banana populations using colchicine, based on agronomic characteristics and SSR and IRAP molecular markers.

Materials and methods

Plant material

Fifty-six autotetraploid plants originated from the wild diploids Malbut, Niyarma Yik, Pisang Mas (subgroup Sucrier), Thong Dok Mak, and Pisang Lilin (subgroup P. Lilin) were evaluated. The plants were from experiments conducted previously to double chromosomes using colchicine (Costa et al. 2011; Pio et al. 2014).

Phenotyping the autotetraploid populations

The experiment was conducted in the Embrapa experimental field, located in Cruz das Almas, BA (12° 40′ 19′ S, 39 °06′ 22′ W′, 220 m above sea level), between July 2015 and September 2017, totaling two production cycles. The climate in the region is humid tropical, with an average temperature of 25 °C, average relative humidity of 72.6%, and average annual rainfall of 1932.3 mm (AGRITEMPO 2018). The soil is a dystrophic yellow latosol that is well drained, deep, and has a medium texture with medium levels of clay. Fertilizer was applied every 6 months based on a soil analysis.

The experimental area was irrigated, using a micro sprinkler system, and the soil was monitored with tensiometers. The experiment was conducted using the augmented block design (Federer 1956), with 56 regular treatments (autotetraploid plants), repeated only once in each of the 12 blocks, and three common treatments, improved diploids 091087-12 and 041054-04 and an autotetraploid genotype derived from Pisang Mas, used as the control, repeated in each of the blocks. Each parcel (treatments) contained nine plants spaced 2.5 m × 2.0 m. Augmented block consists of a special design where a large number of genotypes can be evaluated at the same time when you are initiating a breeding program and do not have availability of the genotypes being tested. Therefore, you have controls, such as commercial varieties which will become your replicates. This facilitates the installation of the experiment. The controls were exclusively used to estimate the experimental error, which allowed an analysis of variance to be conducted.

The evaluation was carried out for the first and second production cycles and 13 agronomic characteristics were considered: time interval between planting and harvesting (NPH), pseudostem diameter (PSD), plant height (PLH), number of live leaves during flowering and harvesting (LLF and LLH), number of suckers (NSU), bunch and hand weight (BUW and HAW), total number of fruits (NFR), average fruit weight (FRW), fruit length and diameter of the second hand (FLH and FRD), and number of hands (NHA).

For the meiotic analysis, immature male flowers, approximately 1.5 cm long, were collected and fixed in Carnoy 3:1 (ethanol: 24 acetic acid – v/v), for 24 h at room temperature, and then stored in 70% ethanol in a refrigerator in the Plant Tissue Culture Laboratory. To prepare the slides, the anthers were cross-sectioned, lightly crushed to free the microspores into a drop of 2% propionic carmine, covered with a slide, and slightly heated. The number of meiocytes per autotetraploid was then evaluated, considering meiotic abnormalities in the metaphase, anaphase and telophase I and II stages, and in the tetrads. The most representative samples were micro-photographed with an Olympus BX51 microscope. Only the best autotetraploids from each original diploid selected from the Mulamba and Mock sum of ranks index (1978), were analyzed.

Genotyping the autotetraploid populations

Fourteen pairs of SSR primers derived from the wild diploid Calcutta 4 and six combinations of IRAP primers (Creste et al. 2006; Teo et al. 2005) were used to molecularly characterize the autotetraploid populations derived from five wild diploids (Supplementary Table 1). The five original diploids were included in the analysis. The amplification reactions were conducted at a final volume of 25 μL, which included the following: KCl 50 mM, Tris-HCl 10 mM (pH 8.3), MgCl2 2.0 mM, 0.5 μM of each of the dNTPs (dATP, dTTP, dGTP, dCTP) at a concentration of 0.2 mM, 0.5 μM of each primer, 30 ng of genomic DNA, and 1 U of DNA polymerase (Invitrogen, EUA).

Amplifications were carried out with a Perkin Elmer 9700 Thermal Cycler using the following program: one denaturation cycle for 5 min at 94 °C, followed by 35 denaturation cycles of 1 min at 94 °C, 1 min of annealing with a specific temperature for each primer, 1 min of extension at 72 °C, and a final extension of 10 min at 72 °C and ∞ 14 °C. The fragments were separated on 3% ultrapure agarose 1000 gels (Invitrogen) under standard conditions, and the amplified products stained with ethidium bromide, viewed under ultraviolet light, and photographed with UVITEC equipment.

Statistical analysis

The intrablock analysis procedure was adopted (Cochran and Cox 1957; John 1971). This statistical analysis assumed the model was fixed, to consider the effects represented by constants, except for the experimental error. The analyses were conducted using the SAS statistical program (Institute SAS 2010) and the proc glm linear models (procedure for general linear models). The means of the treatments were adjusted by least squares using the “lsmean” model in SAS. Subsequently, the genotypes were classified based on the adjusted means. For each genotype, the sum of the numbers relative to its classification was calculated according the sum of ranks index proposed by Mulamba and Mock (1978):

$$ \left({I}_{\mathrm{MM}}=\sum \limits_{j=1}^m{n}_{\mathrm{ij}}\right), $$

whereas

I MM :

is the sum index of the classification,

and nij:

is the classification number of genotypes i in relation to character j.

A principle component analysis (PCA) was conducted to quantify the relevance of the characteristics measured to eliminate redundancies. The selection was based on the mean of each characteristic, from the correlation matrix, and conducted using the princomp procedure in the software SAS v.8.1 (Institute SAS 2010). The PCA analysis was conducted using the package FactoMineR of the program R (R Core Team 2016).

The agronomic characteristics and molecular data (SSR and IRAP) were analyzed together using the Ward-MLM procedure (Franco et al. 1998). The Ward-MLM clustering method was conducted using the combined matrix obtained with the Gower algorithm (Gower 1971) and the packages StatMatch, cluster, ade4, and NbClust of the program R (R Core Team 2016). The qualitative data (SSR and IRAP) was analyzed using the packages qlcMatrix, ade4, and NbClust of the program R (R Core Team 2016). The molecular variance between and within the autotetraploid populations was explored by AMOVA (analysis of molecular variance) (Excoffier et al. 1992) using the software GENES (Cruz 2006).

Results and discussion

A summary of the analysis of variance for the 13 agronomic characteristics measured for the 59 genotypes (56 autotetraploids and three controls) is presented in Table 1. Based on the F test of the analysis of variance there were significant differences for the source of variation in the treatments among the means of the genotypes for all agronomic characteristics evaluated, except for the variable number of live leaves during flowering (LLF).

Table 1 Average of the thirteen plant characteristics evaluated in 56 banana autotetraploids. Embrapa 2019

The eight autotetraploids originating from the wild diploid Pisang Lilin had variation for the number of days between planting and harvesting, with a mean of approximately 500 days; Pisang Lilin 7 was notable for having the lowest values for this parameter. Significant statistical differences were not observed for PSD and PLH among the autotetraploids; however, mean height of the genotypes was similar to the results of Amaral et al. (2015) who measured the original diploid of P. Lilin under the same cultivation conditions. For pseudostem diameter, the mean observed in the present study was greater than that reported by these authors. Plant height is an important characteristic in the improvement of bananas because it is directly related to planting density, management, and productivity. In addition, thicker pseudostems are preferred because plants are less prone to tipping over or breaking due to winds (Amorim et al. 2013). Amaral et al. (2015) evaluated 17 autotetraploids derived from the wild diploid of Pisang Lilin, using colchicine, for different agronomic characteristics. In general, the data corroborate the results of the present study, including the mean height of plants (230.0 cm), number of live leaves during flowering and harvesting (6.1 and 2.8), bunch weight (3.8 kg), and number of hands (4.5). According to the authors, the autotetraploid plants of Pisang Lilin were more fertile compared to the diploid of this genotype, which is considered sterile. This result was confirmed by crossing the autotetraploids of P. Lilin with improved diploids, which resulted in the production of viable seeds. Restoring fertility through induced polyploidy has been reported for other species (Nimura et al. 2006; Rhee et al. 2005; Dunn and Lindstrom 2007).

For the population of autotetraploids obtained through chromosome doubling of the diploid Malbut, statistical differences were observed between the number of live leaves during flowering and harvesting (LLF and LLH) and the number of suckers (NSU). No genotype presented at least eight leaves during flowering, which might be due to possible susceptibility to black and yellow Sigatoka (Table 1). Similar results were observed for the autotetraploids of Niyarma Yik, which is reflected in the low genetic variability for the characters studied.

There was variation for four of the six characteristics evaluated in the autotetraploid population from the Pisang Mas diploid. The mean NPH was approximately 450 days, with an amplitude of variation of approximately 100 days; genotypes 11 and 12 were notable for being the earliest. The autotetraploids 3, 8, 12, 13, and 14 presented more than eight leaves during flowering; however, the same behavior was not observed at harvest (Table 1). There was also variation in the number of suckers, ranging between 1.8 (Pisang Mas 6) and 5.2 (Pisang Mas 11).

Azhar et al. (2000) doubled chromosomes of the cultivar Pisang Mas, which is the most appreciated cultivar in Malaysia, using 1% antimitotic colchicine. The study produced 1420 plants, of which 37.5% were confirmed to be autotetraploids using flow cytometry. The autotetraploid seedlings were shorter; the circumference of the pseudostem was higher, and the leaf blades were thinner compared to the original diploid, data that corroborate the results obtained in the present study.

The autotetraploids obtained from the diploid Thong Dok Mak had a mean NPH of 443 days and double the amplitude of variation observed for the population of Pisang Mas. Two autotetraploids had eight or more live leaves during flowering (Thong Dok Mak 7 and 9). As with Pisang Mas, low LLH values were also observed.

The autotetraploid Pisang Lilin 8 was notable for the production parameters BUW, HAW, FRD, and NHA, which significantly differed from the other seven genotypes that originated from the same wild diploid. A similar behavior was observed for Malbut 10 that was superior for the parameters BUW, HAW, and FRW. With the lowest number of autotetraploids, Niyarma Yik exhibited little variation for the parameters measured; however, it is possible to infer the superiority of Niyarma Yik 3 (Table 1).

Pisang Lilin 8 is notable for LLF and LLH, which had ten and six live leaves during flowering and harvesting, respectively. This result is associated with greater resistance to yellow and black Sigatoka, which are diseases present in the experimental area. Black Sigatoka was officially identified in the experimental area in the second half of 2016, making it impossible to analyze the behavior of the autotetraploids against this disease. The complete characterization of the promising autotetraploids for resistance to black sigatoka should be conducted, especially for autotetraploids singled out using the Mulamba and Mock selection index. The autotetraploid Pisang Lilin 8 also had the greatest number of suckers among the genotypes evaluated (Table 1).

The autotetraploid population Pisang Mas, represented by 14 genotypes, exhibited wide variation for the production characters, except for NFR and FLH. For BUW, the mean was 3067 g, varying from 1403 g for Pisang Mas 5 to 4515 g for Pisang Mas 3. Similar behavior was observed for HAW and FRW, of which autotetraploid no. 3 was superior. For the character NHA, Pisang Mas 8 was notable for surpassing the general average by more than 30% (Table 1). Twenty genotypes of Thong Dok Mak, doubled by colchicine, showed wide variation for the production characters. Thong Dok Mak 2 exhibited the highest values for BUW and HAW. Two genotypes had more than 100 fruits per bunch and six or more hands (Thong Dok Mak 7 and 15).

Based on this result, it can be inferred that promising autotetraploids can be selected to use in crosses with the goal of developing secondary triploid hybrids, since there was genetic variation for the characters measured, both among and within populations. According to Amaral et al. (2015), doubling chromosomes changes various characteristics, including leaf architecture, plant height, and size of flowers and fruits.

The choice to double the chromosomes of Pisang Lilin was based on reports in the literature of its resistance of black Sigatoka and Fusarium races 1 and tropical race 4 (TR4), which are relevant diseases to the banana agribusiness (Zuo et al. 2018). The diploid Malbut was selected because it has six times more carotenoids than cultivars of the Cavendish type, 6.87 μg g−1 and 1.06 μg g−1, respectively, which could lead to opportunities to develop biofortified triploid cultivars (Amorim et al. 2009). The diploid Niyarma Yik was selected because of reports that it is the probable ancestor of cultivars from the Cavendish subgroup (Perrier et al. 2011; Hippolyte et al. 2012; Christelová et al. 2017). The diploid Pisang Mas was selected for chromosome doubling with the goals of recovering its fertility and using it in crosses, since this cultivar is highly appreciated by both Brazilian and Malaysian consumers (Azhar et al. 2000; Amorim et al. 2013). Thong Dok Mak was selected for polyploidization because of its short stature, high number of fruits per bunch and precocity, which are characteristics of interest when considering improving bananas (Brandão et al. 2013).

Due to the large volume of data, selecting the best autotetraploids based on values per se is limited. Thus, the Mulamba and Mock sum of ranks index (1978) was applied, which considered all the characteristics together. The results of this index are presented in Table 2 and reflect the classification by population. One of the advantages of this index is the fact that it does not require the establishment of economic weights or estimates of genotypic and phenotypic variances and covariances. In the classification, the autotetraploids Pisang Lilin 8, Malbut 10, Niyarma Yik 2, Pisang Mas 3, and Thong Dok Mak 2 were selected. These genotypes were superior for most of the agronomic characteristics evaluated in this study and had the lowest IMM values (Table 2). When considering the classification regardless of the population, it was found that of the ten ranked the best, eight were derived from the diploid Thong Dok Mak; the other two were from Pisang Lilin 8 and Pisang Mas 3.

Table 2 Mulamba and Mock (1978) IMM index used to select banana autotetraploids

The meiosis analysis of the autotetraploids selected from each population with the Mulamba and Mock (1978), sum of ranks index is presented in Supplementary Table 2. In general, high stability was observed, allowing us to infer that the genotypes have normal meiosis and are fertile, making them ideal for crossing with improved diploids to develop commercial triploid hybrids.

The principal components analysis (PCA) presented in Fig. 1, demonstrates the importance of the agronomic characteristics in the genetic variability observed among the autotetraploids. The total phenotypic variability represented by the components PC1 and PC2 was 62.24%, where PC1 contributed 45.75% and PC2 contributed 19.50%.

Fig. 1
figure 1

Principle component analysis (PCA) of the 13 agronomic characteristics measured in autotetraploid banana populations originating from five wild diploids, using the package FactoMineR of the R Program

According to the analysis, the main variables responsible for the phenotypic variation were the following: BUW with 14.59%, NFR with 13.81%, HAW with 13.81%, and NSU with 11.80%, with NFR and NHA contributing 10.72 and 10.36%, respectively. The variables that contributed the least to the phenotypic variation were FRW (0.33%) and FLH (0.16%) and, therefore, were excluded from the combined Gower analysis. The principal component analysis is presented in Fig. 2. There was a tendency for the autotetraploids derived from the diploid Thong Dok Mak to group, except for Thong Dok Mak 15, located in the quadrant opposite of most of the genotypes. A similar behavior served for the genotypes of Pisang Mas that, except for Pisang Mas 5 and 6, formed a single group. The autotetraploids derived from Niyarma Yik, Malbut, and Pisang Lil were the most dispersed among the quadrants due to their higher genetic variability.

Fig. 2
figure 2

Dispersion graph of autotetraploid banana genotypes, originating from five wild diploids, based on the principal component analysis (PCA) using the package FactoMineR of the program R (R Core Team 2016)

The autotetraploids selected using the Mulamba and Mock (1978) sum of ranks index were dispersed in the PCA, allowing us to infer the possibility of using them in crosses to develop commercial triploid cultivars. This result supports the criteria adopted for selecting wild diploids for chromosome doubling: genetic similarity with commercial cultivars, potential for biofortification, and resistance to diseases. Figure 3 a shows the dendrogram of the genetic similarity based on the combined analysis (13 agronomic characteristics and 25 SSR and 42 IRAP molecular markers) obtained using the Ward-MLM method. This analysis did not consider the FRW and FLH variables due to their low contribution to the phenotypic variation. The correlation coefficient values (ccc) was high (r = 0.85, p < 0.0001, 10.000 permutations) and adequate, since values of r ≥ 0.56 are considered ideal and reflect good fitness with the genetic similarity values (Vaz Patto et al. 2004).

Fig. 3
figure 3

Genetic variability among autotetraploid banana populations originating from five wild diploids. a The Ward-MLM clustering method was used with a combined matrix by the Gower algorithm using agronomic characteristics and molecular markers. b Genetic variability estimated using SSR and IRAP molecular markers

Except for genotypes Thong Dok Mak 4 (G1) and Thong Dok Mak 10 (G3), the autotetraploids of this population were mostly grouped in G2. The same behavior was observed in the population derived from the wild diploid Niyarma Yik, which was in G3. The autotetraploids of Pisang Lilin, Malbut, and Pisang Mas grouped hierarchically in G4, each with populations derived from close original diploids, except for Pisang Mas 5 and 6, Malbut 3, and Pisang Lilin 8 (Fig. 3a).

The wild diploids (marked in the dendrogram with a zero after their name (e.g., PLilin0), were included in the diversity analysis that only used the molecular markers (SSR and IRAP), so they could be genetically compared with the derived autotetraploid populations (Fig. 3b). Four groups were formed: G1 with only Thong Dok Mak 4; G2 with all of the autotetraploids of the populations derived from Pisang Mas, Pisang Lilin, and Malbut (except Malbut 2); G3 with the diploid Malbut (not doubled); and G4 with the autotetraploids derived from Thong Dok Mak and Niyarma Yik, as well as their original diploids. The Malbut diploid was notable for not grouping with its derived autotetraploids. These results reinforce the importance of inducing polyploidy level as a tool to increase the genetic base of Musa spp., considering the genetic variability detected in the autotetraploid populations compared to the original diploids.

The analysis of molecular variance showed that approximately 78% of the variability identified by the SSR and IRAP markers was within the autotetraploid populations derived from the wild diploids. The ΦST statistic, which is equivalent to the proportion of genetic diversity attributed to the differences among populations, corroborates the AMOVA results (Supplementary Table 3).

AMOVA analysis has been used to calculate diversity in autopolyploid populations (Mermans and Liu 2018) and tetraploids from cotton treated with colchicine (Rauf et al. 2006). In our work, AMOVA showed that the difference was more within than between populations, with 62.67 and 37.32, respectively. The ɸST = 0.37 demonstrates moderate differentiation among the populations studied. This result is optimistic as an exploratory analysis since it shows that the colchicine treatment was able to also create variability that can be explored by breeders, given the straight genetic base in Musa spp.

Hamill et al. (1992) produced autotetraploids using the diploid SH3362 (resistant to Foc RT4) treated with colchicine and reported that they were more susceptible to cold compared to the original diploid and also maintained their resistance to Fusarium oxysporum s. sp. cubense tropical race 4. Van Duren et al. (1996) doubled the chromosomes of the diploid SH3362 using the same antimitotic agent. According to these authors, the autotetraploids generated had great improvement potential based on the low heterozygosity observed for the original diploid and the resistance to Fusarium tropical race 4. Rodrigues et al. (2011) doubled the chromosomes of the improved diploids 013018-01 and 086094-04 that are resistant to Fusarium oxysporum f. sp. cubense race 1. According to these authors, the autotetraploids produced have potential for use in crosses to develop commercial triploids hybrids. The results also showed that the genotypes had regular meiosis, which suggests female fertility.

Bakry et al. (2007) doubled the chromosomes of 25 wild diploids with colchicine and produced autotetraploids that were agronomically evaluated for various production cycles under field conditions and observed reduced vigor and production of suckers, especially for the diploids for cooking. All of the plants flowered under tropical conditions and were used in crosses to develop triploids. Jenny et al. (2013) crossed an autotetraploid, obtained through doubling the chromosomes of the diploid Kunnan, with wild diploids of Malaccensis, IDN110 and Musa balbisiana (CMR and P. Klutuk) and obtained the triploid progenies AAB (38–Kunnan × Malaccensis; 9–Kunnan × IDN 110) and ABB (15–Kunnan × CMR; 8–Kunnan × P. Klutuk). Most of the triploid hybrids exhibited parthenocarpy, increased bunch weight, and resistance to black sigatoka.

Therefore, the autotetraploids selected will be used in crosses with elite diploids developed by Embrapa, with the goal of creating triploid progenies to potentially select genotypes with characteristics aligned with the demands of producers and consumers.

Conclusions

Five autotetraploids selected using the Mulamba and Mock index have potential for use in crosses to develop triploid cultivars.

The SSR and IRAP molecular markers are efficient to quantify the genetic variability resulting from polyploidization via colchicine of wild banana diploids.

The analysis of molecular variance of the autotetraploid populations derived from five wild banana diploids allowed most of the genetic variability to be inferred within the populations.