Introduction

Chiriguano is the name given to a Guaraní-speaking ethnic group inhabiting southern Bolivia, western Paraguay and northwestern Argentina. The first historical references of the tribe date back to early sixteenth century [1,2,3].

Amazonian peoples belonging to the Tupí-Guaraní linguistic branch advanced southward from the Amazon basin to the south and east of the South American subcontinent. This migratory movement was interpreted as a search for the “Land without Evil” (a philosophical concept of Guaraní mythology expressed as ivy imaraä in the Guaraní language). The Guaraní migrations towards the Andean foothills began in the pre-Hispanic period in the late fifteenth century because of pressure due to over-population, religious reasons or search for metals [3, 4]. They arrived at the Bolivian Andean foothills from the Paraguayan-Brazilian plains in the sixteenth century and inhabited hills and valleys between Grande and Pilcomayo rivers (Fig. 1).

Fig. 1
figure 1

a Map indicating the area occupied by the Chiriguano tribesmen (in red) until nineteen century, the main rivers (Río Grande, Pilcomayo and Bermejo) and location of San Ramon de la Nueva Oran city, Salta province (23°08′10″S 64°19′20″W); b relative position of the region within South America. (Color figure online)

According to the chronicles written by Díaz de Guzmán in 1612 [5], the Guaraní reached the region between the Grande (Guapay) and Pilcomayo rivers, currently part of Bolivia, having been previously recruited as a working force by the Portuguese Alejo García in Paraguay. García was a castaway from a caravel of Juan de Solís, who ran aground on the Brazilian coast in 1516, and arrived in Paraguay and there recruited 2000 Guaraní tribesmen with whom he crossed the plain towards the Andes. His objective was to push back the Incas, who controlled the region in order to acquire agricultural and mining resources after having conquered the Chané-Arawak inhabitants. After being defeated, Garcia and his Guaraní army returned to Paraguay, where he was killed by his own troops. The Guaraní then returned to the Andean foothills, where they settled and received the pejorative name of Chiriguano. The Chiriguano seized the lands conquered by the Incas, ruling over the Arawak.

Ava-Guaraní (Ava means “man” in the Guaraní language) is the name currently adopted for Guaraní-Arawak admixture descendants, who were previously called Chiriguano by other Native American tribes. This name (formerly Chiriguanaes: “those who have a wife chané”) is a pejorative nickname given by Quechua-speaking tribesmen to the members of Avá-Guaraní. The name was originally derived from two Quechua words: chiri (‘cold’) and wañuq (“those who die”). It was then adopted and modified by the Spaniard conquerors, resulting in the name Chiriguano.

During this period, the admixture and ethnogenesis of Chiriguanos occurred. This process involved a reduced number of Guaraní speaking aboriginals subduing another group of Indians belonging to the Chané tribe (Arawak speakers). The Chané people had settled earlier in the Chaco Boreal and North Central Chaco ecological region. According to the chronicles, the Guaraní were such fierce warriors that approximately 50 of them could beat 1000–3000 Chané [3, 6]. A particular feature exhibited by these warriors was the practice of cannibalism against their rivals [2]. Guaraní warriors enslaved Chané populations, forcibly taking their women and relegating the Chané to slavery.

As mestizos, resulting from the admixture among the Guaraní minority and Chané women of Arawak origin (named “Guana” in the northern Chaco and “Chané” in the Andean foothills), the Chiriguano built up their own identity distinct from whites (called “karai”) or indigenous ethnic groups (called “tapuy”) [7].

The admixed population acquired distinct cultural traits. The Chané had developed a sedentary way of life in large communal houses, working in basketry and pottery, onto which the Guaraní imposed their language, belief system, social organization and practices of hunting, fishing and war. Thus, their traditional way of livelihood was based on a combination of horticulture (corn, beans, and cassava) and hunting, gathering, and fishing imposed by Guaraní influence.

Upon arrival of the Spanish in the sixteenth century, the Chiriguano occupied the headwaters of Pilcomayo and Río Grande rivers and were mainly at war against the Aymara and Quechua ethnic groups. They were, militarily, almost unconquerable by the Spaniards, although the Catholic missions were quite successful with them, mainly in the valley of the Parapetí River.

The start of the Spanish-Chiriguano war took place in 1564. In 1574, the viceroy of Peru, Francisco de Toledo, personally led a punitive campaign against the Chiriguano, which was unsuccessful. King Felipe II of Spain ordered by royal decree on May 20, 1584, an unusual declaration of war, ordering the enslavement of any Chiriguanos that were captured. The Chiriguanos offered fierce resistance to the conquerors until the nineteenth century, in a war that lasted for 328 years. The last battle fought by the Chiriguano against the Bolivian army in 1892, which the Chiriguano lost, was called “the last race to death” by Thierry Saignes in his Historia del pueblo Chiriguano [7], when their leader Apiaguaiki Tumpa was assassinated.

During the twentieth century, the desperate economic situation and the Chaco War (1932–1935) forced the tribe members to move and find refuge in the Argentinean Chaco region that includes the Salta province, while some communities moved to western Paraguay.

Currently, the “Chiriguano” still maintain a position of prominence in small communities, with the Chané as the “minority shareholders” in Chiriguano-Chané populations.

The last census of indigenous people in Argentina (ECPI 2004–2005) [8] registered 2099 individuals who declared their ethnicity as Chiriguano, inhabiting the Salta province in northwestern Argentina. In San Ramón de la Nueva Orán, Salta province, there are 15 communities belonging to Guaraní, Tupí-Guaraní and Avá Guaraní ethnicities.

Taking into account the geographical region occupied by the Chiriguano until their displacement to northern Argentina and the west of Paraguay, we selected previously published results of several populations from Bolivia, Peru and Brazil who belonged to Tupí-Guaraní or Arawak linguistic groups for comparison with the Chiriguano results described herein.

Given the genesis and history of the Chiriguano, the aim of this study was to characterize the population by means of uni- and bi-parentally transmitted genetic markers in order to establish potential links between the historical and genetic information. In addition, autosomal ancestry informative markers (AIMs) including single nucleotide polymorphisms (SNPs), deletion-insertion polymorphisms (DIPs) and a set of short tandem repeats (STRs), gold standards for human identification applications, showed the possible admixture that could have occurred in the genesis of this tribal group, at present settled in northwestern Argentina.

Materials and methods

Individuals

Blood samples were collected during the 1980s as part of an immunological survey and have been stored since then at the Immunology Service at the Buenos Aires University Hospital “José de San Martín” in the city of Buenos Aires. Sampling occurred at San Ramón de la Nueva Orán, Orán Department (23°08′10″S–64°19′20″W), Salta province, Argentina. The sample set included 46 liquid blood samples (32 females and 14 males) anti-coagulated with 0.5% (v/v) ethilendiaminetetracetic acid disodium salt (EDTA-Na2). Donors declared their birthplace as either the Salta province in Argentina (89%) or Bolivia (11%). The Ethical Committee of the School of Pharmacy and Biochemistry (Res. 744085/2008) approved the project. These samples have a historical value, and although they were collected in the early 1980s before of the Helsinki Declaration [9], the individuals participated voluntarily, by oral consent.

DNA extraction and quantification

DNA extraction was carry out by conventional approaches [10] from blood samples. Quantification was performed by real time PCR [11], using a Rotor Gene 6000 (Corbet, Brisbane, Australia).

Genetic analysis

Autosomal STRs

Samples were amplified with a set of STRs routinely used for forensic identification purposes, including: D3S1358, D1S1656, D2S441, D10S1248, D13S317, PENTA E, D16S539, D18S51, D2S1338, CSF1PO, PENTA D, TH01, VWA, D21S11, D7S820, D5S818, TPOX, D8S1179, D12S391, D19S433, SE33, D22S1045 and FGA. This marker set constitutes the commercial kit PowerFusion 6C System (Promega Corp. Madison, USA) and was analysed according to the manufacturer’s protocol. Some samples were amplified with PowerPlex 16 kit (Promega Corp. Madison, USA).

Ancestry informative markers (AIM)

A panel of 24 autosomal SNPs (AIM-SNPs) and 46 AIM-DIPs were amplified as previously described [12, 13].

Y-chromosome STRs

A set of 23 Y-STRs included in the commercial kit Power Y23 (Promega Corp. Madison, USA) were typed according to the procedure manual. All haplotypes reported in this study have been deposited in the Y-Chromosome Haplotype Reference Database (YHRD, http//http://www.yhrd.org) under accession number YA004299, R56.

mtDNA control region sequencing

The complete D-Loop region was amplified using primers L15971 and H00639 [14]. Amplicons were purified with ExoSap (USB Corp, USA) and sequencing reactions were performed with the Big Dye Terminator System v3.1 (Life Technologies, Applied Biosystems, Foster City, USA) according to the supplier’s protocol. Sequencing products were purified from residual dye terminators by means of alcohol precipitation. All sequences were performed with both forward and reverse primers (with at least six primers for each sample); electropherograms were visualized and edited with Sequencher v5.3 software (Gene Codes Corporation, USA).

Capillary electrophoresis

Electrophoresis analysis was carried out in an ABI3500 (Applied Biosystems, Foster City, USA) sequencer either for fragment or sequence analysis.

Haplogroup specific SNPs detection

The Y-SNP M3-Q3 (Q1a3a1 Y haplogroup-“Hg”), as well as mtDNA-SNPs 8027 G/A and 12,007 G/A (HgA2); 3547 A/G (HgB2); 14,308 T/C (HgC) and 2092 C/T (HgD2) were analysed by real time PCR followed by high resolution melting analysis (HRM) as previously described [15].

Statistical analysis

Allele frequency, Hardy–Weinberg equilibrium (HW), gene diversity and heterozygosity were calculated with Arlequin v3.5 [16] along with haplotype frequencies, molecular diversity indices and genetic distances. A neighbour joining (NJ) genetic distance tree (autosomal STRs) and evolutionary analysis was constructed with Mega v7.0.21 [17]. Median-joining networks were obtained with Network v 5.0.0.3 program (http://www.fluxus-engineering.com).

For Y-chromosome analysis, the length of DYS389II, expressed in number of repeat units, was subtract from that of DYS389I [18]. A normalized Slatkin genetic distance matrix (Arlequin v3.5) was graphed as a multi-dimensional scaling plot (MDS) by using XLSTAT (Addinsoft Corp) software. The YHRD database was use to search worldwide haplotype frequency distribution.

Admixture analysis (autosomal data) was performed using Structure v 2.3.4 software [19]. Genotypes of parental population including Europe, Africa, and Native American were obtained from published data of CEPH-panel samples [20, 21]. AIM-SNPs, AIM-DIP and autosomal STRs were analysed for genetic structure by means of structure [19]. In Structure, jobs were set for five iterations rounds. Parental populations (k) were set from two to five. Monte Carlo-Markov chain simulation, including a burning step of 10,000 followed by 20,000 iterations for data gathering were performed for each round. Start at pop info was set for parental populations. An admixture model and independent allele frequencies were used. The most likely value for the number of parental populations (k = 3) was determined using the program STRUCTURE HARVESTER, which enables the Evanno method to be implemented [22]. Data analysis was further refined using CLUMPP [23], and a bar plot was created using DISRUPT [24].

Admixture analysis involving Y-haplotype data was carried out by means of ADMIX 2.0 [25, 26]. Two parental populations were selected: Guaraní from Brazil [27] and Arawak from Bolivia [28]. Only samples belonging to the Q1a3a1 haplogroup were considered for the analysis. The mutation rate was taken from Zhivotovsky et al. [29]. Time of admixture event was 500 years ago.

Results

Analysis of kinship relationship among sample donors

This investigation used 46 samples (32 females and 14 males) belonging to unrelated individuals according to the information as recorded at the time of sampling. All samples were typed by means of 15–23 autosomal STRs; mtDNA D-Loop sequencing; 23 Y-STRs; 46 autosomal AIM-DIPs; 24 autosomal AIM-SNPs and 4–5 specific haplogroup’s SNPs.

Although declared as unrelated, the combined analysis of the different genetic markers (mtDNA, Y-STRs and autosomal STRs) showed some degree of biological relationships among donors. This finding led to a re-definition of the sample set to be considered for statistical analysis. For example, samples #37, #39 and #27 (female and two males, respectively), shared the same mtDNA haplotype (Hg A2); autosomal STRs profiles showed compatibility of the maternity of #37 with that of #39 and #27, but #39 and #27 had a different Y-haplotype. In consequence, in this case, only one sample was taken into account for mitochondrial analysis, two samples for Y-haplotype analysis, and only sample #37 for autosomal STRs analysis. Another pair of samples (#59 and #49, female and male, respectively) showed similar results and in this case only #49 was included in the analysis. These findings reduced the overall sample size for each type of marker.

Autosomal STRs

All 23 STRs loci investigated met HW expectations. The average heterozygosity was 0.742 (s.d. 0.133) and average gene diversity over loci was 0.744573 ± 0.373340. The Garza Williamson test indicated that there is no evidence of a recent bottleneck (values > 0.75). The lowest heterozygosity was observed in locus TH01 (0.577), and the highest value (0.884) was in loci D13S317, PentaE, D8S1179 and SE33. Although observed heterozygosity was similar, PentaE and SE33 showed the largest size range (16 different alleles vs 6) justified by their high intra-locus diversity (Online Resource 1).

A microvariant FGA/24.2 was observed in one profile. This microvariant is rare in the Argentinean population database (24.2 = 0.0003, http://www.ffyb.uba.ar/SHDG database) and was not observed in other aboriginal samples from Argentina previously investigated by our research group (i.e., Mapuche, Tehuelche, Mataco-Guaycurú speakers or Mbyá-Guaraní) [30, 31].

To evaluate genetic distance relationship between Chiriguano and other Native American groups of Argentina, an NJ tree was constructed based on a set of 15 autosomal STRs (included in the Power16 commercial kit, Promega Corp.) (Fig. 2). The analysis included Mataco-Guaycurú speakers (Wichi, Pilagá, Toba, and Mocoví settled in central and northern Argentina); Guaraní speakers (Mbyá-Guaraní inhabiting Misiones province, North Eastern Argentina) and Mapudungun speakers (Mapuche and Tehuelche, inhabiting Patagonia, Southern Argentina). Genetic distances between Chiriguano were statistically significant regarding all the groups; the lowest was observed with Toba (Rst: 0.00781, p = 0.00 ± 0.00) and highest with Wichi (Rst: 0.03847, p = 0.00 ± 0.00) (both Mataco-Guaycurú speaker groups). Although the Mbyá population from Misiones belongs to the Tupí-Guaraní linguistic branch, no relationship was observed with the Chiriguano (Rst: 0.03482, p = 0.00 ± 0.00). This fact could be explained by different evolutionary scenarios: Mbyá-Guaraní began its expansion in South America from Amazonian lands following a south-eastern direction and arrived to the present-day province of Misiones (north-eastern Argentina) approximately 1000 years ago [32]. The Guaraní ancestors of the Chiriguano began their displacement later, in the thirteenth century; this displacement followed a south-western direction, with originally Bolivia settled, and finally ended in the territory occupied at present in Salta province (Northwestern Argentina) in the early XX century.

Fig. 2
figure 2

Neighbour joining tree based on Rst genetic distance of eight Argentinean Native American ethnic groups, measured with 15 autosomal STRs

Autosomal ancestry informative markers

To investigate the ancestry of Chiriguano tribe members, a set of 24 AIM-SNPs and 46 AIM-DIPs were characterized. Regarding AIM-DIP, all systems met HW expectations, although one locus (MID 1607, rs2307803) met HW equilibrium only after Bonferroni correction [33]. Two loci were monomorphic: MID 3072, rs34611875 (genotype 1–1, frequency = 1) and MID 593, rs1160852 (genotype: 2–2, frequency = 1). Both DIPs were reported with high frequency at the same alleles in Native American CEPH panel [13]. Observed heterozygosity ranged from 0.023 (MID 2929, rs33974167) to 0.558 (MID 406, rs25630) (mean 0.27998, s.d. 0.16609).

All AIM-SNPs met HW equilibrium. Six loci (rs136929, rs139168, rs144848, rs180808, rs72135 and rs384377) [20] were monomorphic in concordance with the Native American CEPH panel (at the same loci, same allele). Observed heterozygosity ranged from 0.069 (rs205276) to 0.536 (rs147878).

Ancestry estimation of the Chiriguano was carried out considering three parental populations: Native South Americans (NA-CEPH), Europeans (E-CEPH) and West Sub-Saharan Africans (A-CEPH). The estimation employed autosomal STRs, autosomal AIM-SNPs and autosomal AIM-DIPs.

Genetic structure was evaluated by means of Structure, whose results showed a high Native American (NA) component, NA = 94.83% (AIM-SNPs) and NA = 94.67% (AIM-DIPs); the European (E) component was 4.77 and 4.73%, respectively; meanwhile the African (A) component was 0.6 and 0.4%. The Structure analysis with 15 autosomal STRs, used for identification purposes, was in good agreement with the abovementioned results using AIMs, although lower sensitivity (NA = 90.31%; E = 8.45% and A = 1.23%) was observed. Figure 3 shows bar plots obtained with the three sets of markers.

Fig. 3
figure 3

Structure analysis for assessing ancestry estimation of Chiriguano (Chi) assuming three parental populations (k = 3). We used as reference samples those included in the panel of the Centre d’Etudes de Polymorphisme Humaine (CEPH). Parental samples included; African (A-CEPH in green), European (E-CEPH in purple) and Native American (and NA-CEPH in orange). a Based on 15 autosomal STRs; b based on AIM-SNP and c based on AIM-DIP. (Color figure online)

Mitochondrial DNA

Complete D-Loop sequence was obtained from 45 samples, with one sample failing to amplify, and 41 were used for statistical analyses (after removing maternally related samples). Of the 41 haplotypes, all of them belonged to Native American haplogroups; 31 different haplotypes were observed; 24 were unique and the rest were observed two or three times. Haplogroup distribution was as follows: 21.95% HgA2; 41.46% HgB; 26.83% HgC and 9.75% HgD. The mean number of pairwise differences was 15.322880 ± 6.984319, and nucleotide diversity (average over loci) was 0.014677 ± 0.007430 (insertions at 309 and 315 were not taken into account). Mutations were as follows: transition = 68, transvertions = 5 and indels = 6.

Sub-haplogroup prediction was estimated with the EMPOP Query tool (http://www.empop.org), and diagnosis of point mutations was checked in Phylotree mtDNA tree Build 17 (http://www.phylotree.org). Specific SNPs in coding region were analysed by PCR real time followed by HRM [15]. The results were in accordance with the predicted haplogroup (data not shown). Online Resource 2 summarizes haplotype results.

Table 1 summarizes the total number of polymorphic sites per haplogroup. In general, HVRI was more polymorphic than HVRII-III. HgC samples displayed the highest polymorphism at HVRI, with 18 mutated positions over 27 observed within the haplogroup (considering the entire control region). Heteroplasmy was observed in two haplotypes (position T16217Y–HVRI and T195Y–HVRII; B4b hg in both cases) and was confirmed by sequencing in both directions. Position 195 was described as a fast evolving site [34], and the heteroplasmy T16217Y was also observed in some other population studies [35]. Haplotypes were submitted to EMPOP (accession number EMP00693). These haplotypes were not observed in 26127 of the EMPOP entire database (http://www.empop.org, Release 11).

Table 1 Variability of sequences in Native American haplogroups in Chiriguano

Figure 4 shows the median joining network for the four Native American haplogroups. No reticulations or complex structures were observed. Networks of haplogroups B and C showed highest diversity compared to those observed in A and D hgs. As can be observed, Chiriguano haplotypes show a high proportion of private mutations (positions highlighted in red).

Fig. 4
figure 4

Median joining network of Native American mitochondrial DNA haplogroups. Grey node: nodal haplogroup A2, B2, C1 and D1 respectively. Green nodes: median vectors. Black nodes: sub-haplogroups, sizes are proportional to frequency. Private mutations are indicated in red, insertion − 16282.1C (HgD) in blue. Insertions at 309 and 315, as well as heteroplasmic positions (HgB), were not included. Transversions were weighted three times as high as transitions. (Color figure online)

Y-STR and Y-SNPs

Online Resource 3 shows haplotypes from male samples. The haplotypes were validated by YHRD validation system and submitted to be incorporated in the database (accession number YA004299). All haplotypes were unique (gene diversity: 1.0000 ± 0.0270) and 13/14 showed M3/Q3-T variant, characteristic of founder Native American haplogroup Q1a3a. Only one haplotype (CHY-1, see Online Resource 3) showed the ancestral condition of M3/Q3-C and belongs to haplogroup R1b. Two haplotypes (CHY-7 and CHY-14) differed in one STR, DYS570. Although this STR was described as a rapid mutation Y-STR (mutation rate 1.33e−02 [36]), autosomal profiling allows for the rejection of a direct biological relation between donors (i.e., father /son) and mitochondrial DNA rejected a maternal relationship (i.e. half-brothers). Using Familias software v3.2 [37] the likelihood ratio (LR) obtained for indirect relations (such as half-brother, grandfather–grandson or uncle–nephew; kinship coefficient k2 = 0; 2k1 = 0.5; k0 = 0.5) was LR = 5.15. Employing a simulation tool for this type of degree relation, the hypothesis should be discarded with a probability of 97% (Familias v3.2, simulation with 1000 replicates, seed 12,345).

Other pair of haplotypes (CHY-3–CHY-12; and CHY-9–CHY-11) differed in the DYS458 marker by one repeat (mutation rate 6.36e−03, http://www.yhrd.org). Autosomal profiling allowed for the rejection of a first-degree relationship and mitochondrial DNA haplotypes a common maternal lineage.

None of these haplotypes was found in the YHRD database (release 55, including 39,414 haplotypes) in whole or meta-population databases.

We compiled a large list of published Y-haplotypes (N = 760, 16 populations) in order to compare our data. Most of the populations selected belong to the same linguistic group, Equatorial-Tucanoan [38], including Arawak from Bolivia, Brazil, Peru and Venezuela, and Tupí-Guaraní and Tupí from Brazil. Tupí-Guaraní is a branch of the Tupí language, spoken by a high proportion of indigenous tribes inhabiting the Amazonian basin. In addition, geographically related groups from Peru and Bolivia were included in the analysis (Table 2) [27, 28, 39,40,41,42], Slatkin’s normalized genetic distance matrix was computed by means of Arlequin V 3.5.2. For this analysis, haplotypes including 16 Y-STRs (DYS19, DYS389I, DYS389II; DYS390; DYS391, DYS392, DYS393, DYS385, DYS437, DYS438, DYS439; DYS448; DYS456; DYS458; YGATAH4 and DYS635) were considered in order to include several populations previously investigated. All haplotypes belonged to the Q1a3a Native American haplogroup.

Table 2 Details of population Y-haplotypes used for comparison study

Figure 5 depicts a multidimensional-scaling plot of genetic distances between groups. Except for one group (Pará Brazil, PaBr), Brazilian populations (AmBr, TuBr, MaBr and MgBr, belonging to Tupí, Tupí Guaraní and Arawak linguistic groups, see Table 2) occupy the most distant places among themselves and among the rest of the groups investigated. Since these samples correspond to very isolate Native American groups inhabiting Brazil, the result obtained is not surprising. In this figure, Chiriguano shared a central position together with the rest of populations included in the analysis.

Fig. 5
figure 5

MDS plot analysis of Slatkin linearized genetic distance matrix, based on Y-STR haplotypes of 17 populations (see Table 2 for population codes). Analysis included 16 Y-STRs haplotypes from Native American patrilineages (Hg Q1a3a). MDS was obtained with XLSTAT software. Kruskal stress: 0.099

Detailed relationships between populations can be observed if the genetic distant Brazilian groups are excluded from the analysis. Figure 6 shows an MDS representation of Y-STRs haplotypes for the closest related populations (geographical or linguistic), including Arawak from Venezuela, Peru, Bolivia and Brazil; Tupí-Guaraní from Pará, Brazil; Andeans populations from Perú; and Aymara and Quechua groups from Perú and Bolivia.

Fig. 6
figure 6

MDS plot of genetic distances between 13 populations (excluding MaBR; AmBR, TuBr and MgBr, see Table 2 for population codes). Analysis included 16 Y-STRs haplotypes from Native American patrilineages (Q1a3a hg). Kruskal stress 0.132

Chiriguano’s smallest genetic distance was observed with PaBr (Tupí-Guaraní, Brazil) and a native group from Beni, Bolivia (MoBo, Arawak) and could be explained taking into account the origin and evolution of Chiriguano (see introduction). On the other hand, it is closely related to other groups from Bolivia (ScBo) and Perú (CaPe; HuPe, UrPe and YaPe) (genetic distance matrix and significant matrix are shown in Online Resource 4). The group coded as MoBo corresponds to samples analysed by Tirado et al. [28]. These authors analysed 87 samples from indigenous settlements, most of which belong to the Mojeño ethnic group, which are an Arawak speaking people. Some of these populations were referred to as “allies” of the Chiriguano [43]. Among the haplotypes included in the comparison, only one Chiriguano haplotype (conformed by 16 Y-STRs) matched with an individual classified as Trinitario, Mojeño dialect, included by Tirado et al. [28] in his database. In addition, the genetic distance between Chiriguano and Cajamarca from Peru [40] was also not significant. According to the authors, there is evidence of a strong migration from Cajamarca to Amazonas (but not vice versa) during the second half of the last century. With the aim of demonstrating the relationship between Chiriguano and Tupí-Guaraní and Arawak people, an admixture analysis was carried out employing the groups of PaBr and MoBo as parental populations. The results showed that the contribution of parental population 1 (Tupí-Guaraní from Brazil) was 72.4% while the contribution of parental population 2 (Arawak from Bolivia) was 27.5%.

Figure 7 is a network representation of Y-haplotypes considering the closest related populations. This figure shows a complex network of haplotypes relationship where Chiriguano are closely interrelated with all the groups, occupying a central position in the reticulation.

Fig. 7
figure 7

Network analysis of 15 Y-STRs haplotypes (DYS385 was excluded). Light blue: Chiriguano; green: natives from Beni, Bolivia; yellow: Tupí Guaraní from Para, Brazil; dark grey: Arawak from Peru; and pink: Arawak from Bolivia. (Color figure online)

Discussion

In this study, we describe the genetic attributes of a group of biologically unrelated individuals self-declared as Chiriguano of the Avá Guaraní linguistic group. Ava Guaraní represent 2.9% of the overall aboriginal population of Argentina [8], 2.1% of the aboriginal population in Bolivia [44], and 15.7% of the aboriginal population in Paraguay [45].

This small but representative sample allowed us to detect high genetic variation, in both uni- and bi-parentally transmitted genetic markers. Matrilineage and patrilineage analyses demonstrated that these people carried specific Native American inheritance. AIM-SNPs and AIM-DIP analyses showed a high native genetic component in their genomes. Structure analysis carried out with a panel of autosomal STRs employed in human identification revealed that these data was well correlated with the AIM markers.

Mitochondrial DNA haplotypes were not found in EMPOP worldwide database, with only infrequent haplotypes detected. Only some of these haplotypes could be associated with other South American aboriginal people. Some particular haplotypes observed in Chiriguano were:

A2ah: the haplotype T16097C, T16098G, C16111T, C16167T, C16223T, C16290T, G16319A, T16324C, T16362C, C64T, A73G, T146C, A153G, A234G, A235G; A263G, 309.1C, 315.1C, 523DEL, 524DEL (diagnosis mutations in bold) was found in two samples. Posterior analysis of autosomal STRs analysis allowed us to suggest some degree of relationship between both donors, and consequently, only one sample was used in the statistical analyses. This haplotype belongs to A2ah sub-Hg, displaying the diagnosis mutations T16097C and A16098G and three private mutations (C16167T, T16324C and A234G).

The A2ah sub-Hg was first described by Taboada-Echalar et al. [46] in two samples from Bolivia (Beni and Santa Cruz, respectively). The authors postulate that this motif could be restricted to Bolivia, more specifically to the “llanos” geographical area. Recently, it was described in an individual declared as “Chanka” on the pacific coast of Perú. Chanka were closely related to their neighbours who spoke Amazonian languages, with inter-population migrations between the Andes and Amazonia previously postulated [47]. Alves-Silva et al. [48] reported two samples (HVRI–HVRII sequences) in Minas Gerais in southeastern Brazil, showing the diagnosis of mutations. Finally, it was also found in a sample belonging to the Toba tribe in Chaco, Argentina [49].

It is important to mention that all these samples exhibited private mutations and in consequence showed different haplotypes.

A2q (T16209C) was found in two Chiriguano samples. Both haplotypes are not identical but present the same private mutations 16301T and 238G and a lack of mutation T16362C (16362T!). This haplogroup was described by Behar et al. [50] (GenBank: JQ705973.1 and JQ705557.1); Achilli [51] (GenBank: EU431081.1); and Kumar in 2011 [52] (GenBank: HQ012132.1) (Phylotree mtDNA Tree build 17, http://www.phylotree.org) and was not found in other Argentinean ethnic groups. Alves-Silva et al. [48] described this haplogroup (based on HVRI–HVRII sequence) in two samples from southern and southeastern Brazil and Taboada Echalar [46] in eight samples from the llanos area of Bolivia (based only on HVRI sequence). The same private mutation in HVRI (16301T) was found in two out of eight samples of Santa Cruz de la Sierra, Bolivia. However, since only HVRI was analysed in these samples, the haplotype identity cannot be established.

B2b2 (G16145A): This sub-Hg was observed in one sample and was previously described by Taboada-Echalar (GenBank: KC503926.1) in Beni, Bolivia [46].

B2y: This sub-Hg was observed in one sample with a C16261T diagnosis mutation and three private mutations (A197G, G207A and T292C) that distinguishes it from those already reported (GenBank: JQ705259.1 and EU597534.1) [50, 53]. B2y sub-Hg was found in two samples from Bolivia [46], one sample in Santa Cruz de la Sierra (HVRI sequence) and another in Chuquisaca (HVRI–HVRII, sequences), although HVRII sequence shows dissimilarity with Chiriguano sequence.

D4h3a: This sub-Hg was observed in one Chiriguano haplotype, exhibiting a diagnosis mutation at A16241G and three private mutations: an insertion at − 16282.1T and the substitution T16311C and T58C. This sub-Hg is in Phylotree with accession number FJ168754.1 [54] and EU095531.1 [55].

D4j4a: was observed in one sample (mutation motif T16263C) and described previously by Sukernik et al. [56], in Even, Siberia (HM044856.1 and FJ858887.1).

Other particular haplotypes were inferred as belonging to A2w; D1e; C1c or D1a sub-Hgs (based on D-Loop sequences). However, further studies of diagnostic mutations located in the coding region should be done to confirm the haplogroup assignment and detect novel findings that will allow a re-classification of the present ones. Described mitochondrial sub-Hgs were not found either in other Native American groups investigated by our team [30, 31, 49] nor in an Argentinean population database [57] with the exception of D4h3a previously found in Mapuche and Tehuelche ethnic groups, Patagonia, Argentina [31] and A2ah in Toba, north Argentina [49].

Comparative analysis of Y-chromosome haplotypes from several related populations allowed us to find a close relationship between Chiriguano and other Guaraní or Arawak groups (see “Results”). Admixture analysis demonstrate a strong Y-chromosome parental contribution from Tupí Guaraní (Brazil) and in less proportion of Arawak from Bolivia. These finding reinforce the historical records about the ethnogenesis of this group, where groups of migrant Guaraní from the Amazon conquered the Arawak that inhabited Bolivia, enslaving them and appropriating their women.

Conclusions

Mitochondrial haplotypes, as well as Y-STRs haplotypes, demonstrated a genetic relationship between the Chiriguano and groups of inhabitants from Brazil, Bolivia and, in lesser degree, Peru. These findings correlate well with historical information about their origin and interrelations with other people during their displacement from the Amazon. Historical records reported that the Chiriguano were fierce warriors who subjugated other peoples in their migration from the Amazon to northern Argentina. The present study relates this group to other Native American groups from South America of Arawak and Tupí–Guaraní speakers. The anthropological knowledge about them could be correlated with the genomic information of the extant population inhabiting Argentina. Their evolutionary history has been short in relation to other groups. However, their genetic imprint is evident and correlates them with other groups of speakers of the same linguistic branches.