Introduction

The genus Anastrepha (Schiner 1868) of the Tephritidae family, is considered the most economically important genus in Latin America (Castañeda et al. 2010). Anastrepha striata (Schiner 1868), known as the American guava fruit fly, is a quarantine pest according to the USDA-APHIS-PPQ (United States Department of Agriculture, Animal and Plant Health Inspection Service, Plant Protection and Quarantine program) and other regulatory agencies (Norrbom 2003) and is one of five quarantine species in Colombia: A. fraterculus (Wiedemann), A. grandis (Macquart), A. obliqua (Macquart), A. serpentina (Wiedemann) and A. striata (Schinner) (Castañeda et al. 2010). A. striata is a multivoltine and oligophagous species (Insuasty et al. 2007) that infests approximately 37 plant species (Norrbom et al. 1999) including economically important fruits, such as mango (Mangifera indica) and coffee (Coffea arabica). However, they have a clear preference for the genus Psidium (Myrtaceae) and Psidium guajava is its main host (Aluja et al. 2000). A. striata is widely distributed in the Americas, from the southern United States (Texas) to Brazil (Hernandez-Ortiz and Aluja 1993) and has a wide altitudinal range (0-2600 masl) (Arévalo et al. 1997; Martínez and Serna 2005). Along with A. fraterculus and A. obliqua, it has been reported to be one of the most abundant species in Colombia (Nuñez 2010). According to the Corporación Colombiana de Investigación Agropecuaria, CORPOICA, A. striata is a phytosanitary problem in Colombia, causing up to 90% of fruit losses and reaching infestation levels of up to 210 larvae/kg fruit (Insuasty et al. 2007).

Despite its wide distribution and economic importance, studies of A. striata have been restricted to aspects of its biology, behavior, population dynamics and distribution (Aluja et al. 1993; Ramírez et al. 1996; Cruz-López et al. 2015). In Colombia, knowledge of the species is based on studies of its population dynamics in the department of Santander (Olarte 1980) and the diversity of its parasitoids, hosts and distribution in departments such as Tolima, Valle del Cauca and Cundinamarca (Carrejo and González 1994; Martínez and Serna 2005; Ruiz-Hurtado et al. 2013). However, there is no knowledge about its genetic diversity and population structure, which has been demonstrated to be a prerequisite for planning pest control strategies (Roderick and Navajas 2003) because of the information it provides about the insects population dynamics (Aketarawong et al. 2011). This information is important when specialized techniques for control, such as semiochemicals, including sexual or aggregation pheromones (Agelopoulos et al. 1999), or the production and dispersion of sterile insects (SIT) (Karsten et al. 2013) are required.

Insect population structures can be influenced by the presence of physical barriers or characteristics associated with habitat differences (Avise 1994). For example, the Andes have played an important role in the diversification of neotropical organisms by their isolation on either side of the mountains or by creating a mosaic of montane habitats and inter-Andean valleys where differentiation processes can occur (Brower 1994; Lynch 1997; Graham et al. 2001; Elias et al. 2009). Southwestern Colombia is a very important fruit-producing region, in which several soils and climates can be found (Orozco 2003). The Andes directly influence these soils and climates by allowing the formation of three well differentiated natural sub-regions: the mountain region, formed by the Western and Central Andes, is mainly characterized by the presence of Andean forests and high humidity. The Pacific Coast, delimited by the Pacific Ocean and the foothills of the West Andes, contains Andean and sub-Andean zones, rainy areas and warm and humid mangroves. The inter-Andean valley of the Cauca River, delimited by the West and Central Andes, is characterized by agricultural zones that have almost entirely replaced the dry and highly dry tropical forests (Salazar et al. 2002). In Colombia, several studies have demonstrated the importance of habitat differences and the physical barrier on the genetic flow of insect species (Díaz et al. 2013; Velasco-Cuervo et al. 2016). The population structure and genetic diversity of insects may also be strongly influenced by the invasion pathways that led to their initial establishment. For example, Karsten et al. (2015) found genetic structuring of Ceratitis capitata between the native region (Africa) and the introduced region (Australia, Greece, Guatemala and Madeira). They also found that the genetic diversity of populations was associated to the initial colonization pathways of Europe from Africa and a secondary colonization of Australia from Europe.

It is important to evaluate if the populations of A. striata in southwestern Colombia have a population structure that could be influenced by the Andes or the geographical area it is in. This information will enable us to develop more efficient pest control strategies. For example, to know if management is aimed at a wider or finer scale depending on the level of connectivity of populations.

Mitochondrial genetic markers have proven to be informative for population genetic studies due to their strictly maternal inheritance, lack of recombination, relatively high mutation rate (10-fold faster than single copy nuclear DNA) (Brown et al. 1979) and the availability of very efficient PCR primers (Shi et al. 2005; Xie et al. 2006; Hu et al. 2008). Among these markers, the gene Cytochrome Oxidase subunit I (COI) shows a high degree of polymorphism that makes it appropriate for intraspecific genetic analyses (Mun et al. 2003) and has been used in population genetics studies of different fruit fly species (Hu et al. 2008; Prabhakar et al. 2012, 2013; Meeyen et al. 2014). Although less frequently used, the NADH dehydrogenase subunit 6 (ND6) gene, has similar characteristics to COI (Simon et al. 1994) and a high degree of polymorphism. Therefore, both genes are useful for the evaluation of intraspecific diversity and the population structure of A. striata.

In this study, we used the COI and ND6 gene sequences to assess whether A. striata has a population structure associated with the Western and Central Andes in southwestern Colombia. Additionally, an analysis of the demographic history of A. striata was performed to improve the knowledge about demographic processes that could be shaping the population structure and genetic diversity patterns.

Materials and methods

Sample collection

Fruits from guava (Psidium guajava), mango (Maguifera indica), coffee (Coffea arabica), plum (Spondia purpurea) and carambola (Averrhoa carambolo) were collected in nine locations in southwestern Colombia. The locations were chosen by assuming the West Andes are a physical barrier and to represent each of the different natural regions–the Pacific Coast, the inter-Andean valley of the Cauca River and the mountain region (Table 1; Fig. 1). The specimens were obtained from either fallen fruits or from the tree in non-crop zones (roadside, home garden, small farms, etc.). In an insect-breeding room, the fruits were placed in emergence chambers built from plastic containers containing a sterile mixture of sand and sawdust, with the corresponding fruits on top. The chambers were kept at room temperature until the emergence of adults, which were then preserved in 96% ethanol and stored at −20 °C until further molecular analysis. The specimens were identified using Steyskal’s (1977) and Caraballo’s (2001) taxonomic keys. The following diagnostic features of A. striata were considered: presence of a dark brown-almost black U-shaped mark on the scutum, no connection between the V and S bands in the wing, a yellow to brown subscutellum with lateral dark bands and an aculeus with a wide, blunt and non-serrated apex. For each individual used in this study, a voucher was stored with the date, collector’s name and geographical coordinates in the Museum of Entomology at the Universidad del Valle.

Table 1 Geographical coordinates and altitude above sea level of the nine A. striata sampling locations in southwestern Colombia. n corresponds to the number of COI and ND6 concatenate sequences that were used in this study
Fig. 1
figure 1

Haplotype frequencies for each location sampled. The western mountain range separates the inter-Andean valley of the Cauca River and the Pacific Coast. Symbols H1 to H14 represent the 14 haplotypes found for A. striata

DNA extraction, amplification and sequencing

DNA extraction from the head and legs of adult flies was performed with a DNeasy® Blood and Tissue Kit (QIAGEN, CA, USA) according to the manufacturer’s protocol for insects. A ~ 710 bp region of the COI gene was amplified using the primers LCO1490 (5′-GGTCAACAAATCATAAAGATATTGG-3′) and HCO2198 (5′-TAAACTTCAG GGTGACCAAAAAATCA-3′) (Folmer et al. 1994). The amplification cocktail was modified from Ruiz et al. (2010), for a total volume of 25 μL per reaction and contained 1X PCR buffer, 0.05 mM dNTPs, 2 mM MgCl2, 1 U Taq polymerase, 0.25 μM of each primer and 30 ng of DNA. The following thermal profile was used ,modified from Folmer et al. (1994): initial denaturation at 92 °C for five minutes, 35 cycles of 94 °C for 30 s, annealing at 52 °C for one minute, extension at 72 °C for one minute and a final extension at 72 °C for five minutes. A ~ 725 bp fragment of the ND6 gene was amplified with the primers TT-J-9886 (5′-TAAAAACATTGGTCTTGTAA-3′) (Barr et al. 2006) and ND6r (5′-TTATGATCCAAAATTTCATCA-3′) (Ruiz-Arce et al. 2012) using the protocol proposed by Ruiz-Arce et al. (2012). Amplification of the ND6 and COI genes was confirmed using a 1.5% agarose gel with 0.5X TBE buffer. Finally, the PCR products were sequenced by a specialized service (MACROGEN, INC., MD, USA).

Data analysis

The sequences obtained from COI and ND6 were manually edited with Sequencher 4.1.4 (Gene Codes Corporation, Ann Arbor, MI, USA) to obtain final lengths of 658 and 660 bp, respectively. A manual concatenation of ND6 and COI was performed using Sequencher 4.1.4, obtaining a sequence of 1318 bp. Sequence alignments were performed with Clustal X 2.1 (Larkin et al. 2007). The mitochondrial haplotypes and the number of variable sites were identified in the group of sequences using MEGA6 (Tamura et al. 2013). The haplotype and nucleotide diversity were estimated using Arlequin v3.5.1.3 (Excoffier and Lischer 2010). The intraspecific genetic distance was determined using the Kimura 2-parameter (K2P) nucleotide substitution model in MEGA6 (Tamura et al. 2013) because it is the appropriate model for estimating the small intraspecific genetic distances expected for this genomic region (Hebert et al. 2003). To estimate the genealogical relationships between haplotypes, a haplotype network was constructed based on the median-joining network algorithm (MJ) using NETWORK v4.6.1.3 (http://www.fluxus-engineering.com).

The population structure was estimated based on pair-wise FST. The statistical significance of this test was obtained by 1023 permutations. Analysis of molecular variance (AMOVA) was used to test the genetic differentiation among groups of populations from different natural regions. Both population pair-wise FST and AMOVA analysis were done using Arlequin v3.5.1.3 (Excoffier and Lischer 2010). A Mantel test was performed (Mantel 1967) to determine the existing relationship between the genetic distance (Arlequin’s FST) and the geographical distance (km) and thereby evaluate the isolation by distance (IBD) model. This test was performed in IBD v 3.23 (Jensen et al. 2005) using 1000 randomizations.

A mismatch distribution was performed to evaluate the population expansion signals using DNAsp 5.10.1 (Rozas et al. 2010). A population that has recently experienced a population expansion presents a unimodal mismatch distribution (Rogen and Harpending 1992). A sum of squared deviation was used with the Harpending’s index (Harpending 1994) to evaluate the deviation of the observed data from the observations that would be expected under a demographic expansion model. The population expansion time was calculated using the equation τ = 2ut, where u = m T µ (m T is the length of the nucleotide sequence under study, and µ is the mutation rate per nucleotide), and t is the time in generations (Roger and Harpending 1992), assuming a divergence rate of 2.3% per million years for insect mtDNA (Brower 1994) and a mean of six generations per year (Insuasty et al. 2007). To evaluate the population equilibrium, a Fu’s Fs test (Fu 1997) and a Tajima’s D test (Tajima 1989) were performed with DNAsp 5.10.1 (Rozas et al. 2010). Statistically significant negative values are obtained by these tests when a population is expanding.

Results

Of all the fruits processed, P. guajava was the only one that showed A. striata infestation. Eighty-two specimens were processed from guava fruits from the nine locations sampled. In three of the locations sampled (EN, LE and BU), A. fraterculus and A. striata infestations were found together on the same guava tree. However, the A. fraterculus proportion was less than 5% of the A. striata population.

Variation and genealogy of the mitochondrial concatenated sequence

Eighty-two sequences of 1318 bp. each were obtained by concatenation of the COI and ND6 sequences. 14 variable sites were observed (1.06% of the total length), of which seven were unique and seven were parsimony informative. 16 mutational changes, 14 transitions (nine A/G and five T/C) and two transversions (one C/A and one G/T) were observed. 14 haplotypes with intraspecific genetic distances (K2P) between 0 and 0.4% were identified. The sequences for each of these genes were deposited in GenBank under accession numbers KU985248–55 for COI and KU985256–63 for ND6. The haplotype frequencies determined for each location are shown in Fig. 1 and Table 2, where haplotype H1 is the most frequent and is found in all locations, regardless of their differing natural regions. The rest of the haplotypes, except for H12, are private, with a very low frequency (Table 2). The average haplotype diversity was 0.057 and ranged from 0.71 in EN to 0.25 in PR (Table 3). The average nucleotide diversity was 0.0005 and ranged from 0.001 in EN to 0.000 in ZA. The haplotype network (MJ) was structured from a central, most abundant haplotype (H1) and from which the rest of the haplotypes H2 to H14 diverge. All of the haplotypes diverge with a low number of mutational changes; haplotype H11 is the most divergent with three changes (Fig. 2). The second most frequent haplotype was H12, found in six locations (BU, ZA, EN, SE, VE and PO), with no geographical association pattern. The haplotype network showed a star-like shape typical of populations undergoing expansion (Slatkin and Hudson 1991).

Table 2 Distribution and frequency of different haplotypes in sampling locations. n indicates the number of individuals genotyped
Table 3 Haplotype diversity (h) and nucleotide diversity (π) values of A. striata in the nine locations analyzed
Fig. 2
figure 2

Haplotype network of 82 sequences (concatenates of the COI and ND6 genes) from A. striata in three natural regions of southwestern Colombia. Circles represent haplotypes, and their size is relative to the number of individuals sharing the haplotype. The haplotypes are divided by location, which is identified by a texture. The numbers correspond to the position of the nucleotide change. Symbols H1 to H14 represent the different mitochondrial haplotypes

Population genetic structure

The pair-wise FST statistic of nine locations showed that there is no genetic structure of A. striata in the southwestern Colombia, because 97% of the comparisons presented non-significant values below 0.12. The only significant comparison was between CH and BU with an FST value of 0.17 (Table 4). AMOVA analysis by grouping populations according to the natural regions also revealed no significant genetic differentiation among groups (F CT = 0.00006, P = 0.512) (Table 5). Therefore, there is no genetic isolation caused by the West Andes. Mantel test showed no significant relationship between the genetic and geographical distances (r = 0.1397, p = 0.198).

Table 4 Pairwise F ST values among the nine A. striata sampling locations in the three natural regions
Table 5 Results of the AMOVA analyses of nine sampling locations of A. striata from southwestern Colombia, with grouping in accordance with three natural regions

Demographic history

The mismatch distribution of the observed data showed unimodal behavior (Fig. 3), similar to the data simulated from the population expansion model, with a sum of squared deviations of DSS = 0.022 (P = 0.495) and a Harpending’s index of 0.467 (P = 0.683). The negative Fu (−11.834, P = 0.00) and Tajima (−2.002, P = 0.004) values, as the star-like shape haplotype network (Fig. 2), also indicated that the population has undergone expansion. The population expansion time, based on a 2.3% divergence rate (Brower 1994) and assuming six generations per year (Insuaty et al. 2007), was 17,000 years.

Fig. 3
figure 3

Mismatch distribution of 82 sequences (concatenates of the COI and ND6 genes) from A. striata in three natural regions of southwestern Colombia, representing the observed and expected paired differences under the population expansion model. The curve of the observed values shows a unimodal behavior typical of populations undergoing expansion

Discussion

Anastrepha striata was found in all of the locations sampled in greater numbers than A. frateculus; therefore, A. frateculus can be considered a low incidence species in guava fruits, at least in the region studied (5% of individuals collected). The predominance of A. striata was also previously reported by Núñez et al. (2004), who found percentages of 91.72% for A. striata and 8.26% for A. fraterculus in the northeastern region of Colombia.

Hernández-Ortiz et al. (2012), report that in Colombia there is only the Andean morphotype of A. fraterculus, which could explain the low incidence of this species and the predominance of A. striata in guava fruits. Several authors have previously noted that the fraterculus Andean morphotype can infest guava fruits, though it has also frequently found breeding in coffee and Andean berry fruits (Hernández-Ortiz and Morales-Valles 2004, Nuñez et al. 2004; Castañeda et al. 2010). In contrast, the fraterculus Brazilian-1 morphotype is restricted to guava fruits (Selivon et al. 2005). Additionally, A. striata and A. fraterculus proportions can vary depending on the area of study and factors such as the presence of other tephritids and the quality and abundance of alternating hosts. For example, Malavasi and Morgante (1980) found that guava was not very susceptible to A. striata attacks in Brazil, and Swanson and Baranowski (1972) found that A. suspensa was the only species infecting guava in the south Miami (USA) area.

The haplotype and nucleotide diversity of A. striata based on the mitochondrial concatenates data (COI + ND6) was 0.510 and 0.0005, respectively. This report is the first on the genetic diversity of this species in Colombia.

Therefore, there are no reference values for comparisons with other locations in the world. Compared with studies performed on other species of Anastrepha, our results indicate that the diversity of A. striata is low. For example, in A. ludens, Ruiz-Arce et al. (2015) reports haplotype and nucleotide diversities of 0.580 and 0.005, respectively. A recent study in A. obliqua in south western Colombia, with a similar sample size to one used in our study, showed a haplotype and nucleotide diversity of 0.553 and 0.0156, respectively (Aguirre 2016, undergraduate thesis, Universidad del Valle). These differences in genetic diversity between species can be correlated with dietary habits, because A. striata is an oligophagous species and was only found infesting guava fruits. While A. obliqua is a polyphagous species and was found infesting four different fruits in the same region.

Another explanation for the low genetic diversity could be that the haplotypes with low frequency might be present in the location but not detected in the sample due to low sample size. However, in this study, nearly all locations have sample sizes close to 10 individuals (Table 1). According to Luo et al. (2015) in genetic diversity studies with barcoding, sample sizes between 2 and 5 individuals present estimates with a high percentage of error, but values close to 10 generally came close to the correct estimate of diversity.

Despite the existing ecological and geographical differences among the three natural regions, AMOVA revealed no genetic differentiation among A. striata populations. Moreover, populations have the same predominant haplotype in all locations and non-significant FST values in 97% of the comparisons. These results probably indicate that A. striata moves freely across the three natural regions and that there is constant genetic flow. We hypothesized that the West Andes could be a geographical barrier for genetic flow. However, the AMOVA showed no significant genetic differences between the Pacific Coast and in the inner-Andean valley of the Cauca River. This would mean that the West Andes is not acting as an effective genetic barrier. We also found no relationship between genetic and geographical distances, which was expected given that the distance between the sampled localities is much greater than the dispersion capacity that has been reported for species of the genus Anastrepha (240 m of maximum displacement from the release site towards any point, not considering direction; Hernández et al. 2007).

A possible cause for the lack of genetic structure in A. striata is the anthropogenic effect because the transport and commercialization of fruits by humans can facilitate the movement of several insects such as aphids, woodlice, scarabs and larval stages of a variety of insects (Hill 2008). This effect would eliminate the Andes as a geographical barrier and would reduce genetic isolation due to distance or different eco-geographical regions. These dispersion phenomena mediated by humans have been reported in several studies of other fruit fly species (Gasperi et al. 2002; Malacrida et al. 2007; Shi et al. 2014; Karsten et al. 2015; Liebhold et al. 2016).

The Mediterranean fruit fly, Ceratitis capitata, is an example of these insects capacity for dispersion due to human globalization. Karsten et al. (2015), in a study conducted in Africa, Australia, Greece, Guatemala and Madeira suggest that C. capitata had an intercontinental distribution through a migration route from Africa to Europe, Australia and America with a reintroduction to South Africa from Europe. A strong structuring was found between Africa and the invasive localities, which is explained by quarantine export measures. In southwestern Colombia, all of these flies dispersion processes are a direct result of the implementation and modernization of agriculture by humans, largely because this region has been characterized by high agricultural production and possesses a network of main roads, such as the Pan-American Highway and the Panoramic Highway, that allows transportation between the main production and consumption zones of the country. The region also has a main access highway to one of the most important ports in the country, the Buenaventura port (Orozco 2003). Therefore, the high level of commercialization can be proposed as a way to increase the dispersion of larvae of different fruit fly species. This anthropogenic effect on the dispersion of A. striata would directly affect its population structure and genetic diversity.

Another possible cause for the lack of genetic structure of A. striata populations, despite the existence of different geographical barriers, is that the species is undergoing population expansion (Malacrida et al. 2007). This hypothesis is based on the star-like shape of the haplotype network showing the presence of a single common haplotype with a number of rare haplotypes connected to it by a few mutations and the demographic analysis. This pattern suggests that in spite of ecological or geographical isolation of populations, time was too short for mutations to occur and accumulate (Avise 2000), which agrees with the observation that although most haplotypes are not shared among locations, they are still found at very low frequencies. According to Slatkin and Hudson (1991), this type of behavior occurs in cases of recent invasion by a limited number of founders, followed by population growth, which would lead us to think that A. striata is an invasive species, at least in southwestern Colombia.

The demographic history analysis indicates that these recent species expansion signals originated approximately 17,000 years ago, in the last glaciations final epoch. During the Pleistocene glaciations, there was a phase of tropical forest degradation, which led to a cold and dry climate typical of this period (Penny 2001). These conditions could have caused the decrease in A. striata plant hosts. Approximately 18,000 years ago, the expansion of forests began when the climatic conditions became favorable, with a warmer and more humid climate, which corresponds to the onset of the A. striata population expansion. Studies of other insect species in the tropics have also proposed that climate changes in the Pleistocene could have had this effect on those species (Morgan et al. 2011; Meeyen et al. 2014). Therefore, this study supports the idea that this phenomenon was important in the formation of the genetic structure of insect populations.

This study is the first report on the genetic diversity of A. striata in Colombia, from which we conclude that this species is genetically homogeneous in southwestern Colombia; there is no evidence of genetic structure associated with the different eco-geographical regions or the West Andes as a possible geographical barrier. This behavior could be caused by human intervention as a passive dispersal agent of fruit flies, by an invasion followed by population expansion or by a combination of both factors. Therefore, the Andes of southwestern Colombia have not contributed to mitochondrial differentiation between the natural populations of A. striata. This study contributes to the genetic knowledge about this species and can be used for the creation of pest control programs.