Abstract
The retrieval of Neanderthal (Homo neanderthalsensis) mitochondrial DNA is thought to be among the most significant ancient DNA contributions to date, allowing conflicting hypotheses on modern human (Homo sapiens) evolution to be tested directly. Recently, however, both the authenticity of the Neanderthal sequences and their phylogenetic position outside contemporary human diversity have been questioned. Using Bayesian inference and the largest dataset to date, we find strong support for a monophyletic Neanderthal clade outside the diversity of contemporary humans, in agreement with the expectations of the Out-of-Africa replacement model of modern human origin. From average pairwise sequence differences, we obtain support for claims that the first published Neanderthal sequence may include errors due to postmortem damage in the template molecules for PCR. In contrast, we find that recent results implying that the Neanderthal sequences are products of PCR artifacts are not well supported, suffering from inadequate experimental design and a presumably high percentage (>68%) of chimeric sequences due to “jumping PCR” events.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
Ancient DNA (aDNA) studies have suffered much criticism since they began about 20 years ago. The field is still recovering from the effects of early spectacular and erroneous claims, such as that of DNA being preserved in plant fossils, dinosaur bones, and amber for many millions of years (for recent reviews see Hebsgaard et al. 2005; Willerslev and Cooper 2005). Unfortunately, unreplicated results of surprising age continue to be published, including those from old human remains (e.g., Adcock et al. 2001), microorganisms (e.g., Cano and Borucki 1995; Vreeland et al. 2000; Fish et al. 2002), and plant fossils (Kim et al. 2004). These studies have routinely underestimated the extent to which aDNA research is confounded by contamination with modern DNA, and are widely thought to result from such contamination (Willerslev et al. 2004a; Hebsgaard et al. 2005). In recent years, a greater understanding of postmortem damage and contamination has provided a more robust foundation for the field, although the authentication of studies of human remains and microbes remains highly problematic (e.g., Willerslev et al. 2004b; Gilbert et al. 2005a; Hebsgaard et al. 2005; Willerslev and Cooper 2005).
The first report of putative Neanderthal (Homo neanderthalsensis) mitochondrial DNA (mtDNA) from the type specimen (Feldhofer I [Krings et al. 1997]) was a rare example of a remarkable aDNA result obtained using very strict criteria for authenticity, including the independent replication of results and tests of biochemical preservation (Cooper and Poinar 2001; Hofreiter et al. 2001a; Pääbo et al. 2004; Willerslev and Cooper 2005). The result is convincing as the Neanderthal sequence differs from any known modern human (Homo sapiens) and chimpanzee (Pan troglodytes) sequences but is clearly human-like. Furthermore, subsequent independent retrieval of similar, but not identical, mtDNA from other Neanderthal specimens strongly supports the sequence’s authenticity (Ovchinnikov et al. 2000; Krings et al. 2000; Schmitz et al. 2002; Serre et al. 2004a; Lalueza-Fox et al. 2005).
The retrieval of Neanderthal sequences enables the possibility of addressing the long-running debate about modern human origin, something that had remained unsolved in paleontological and modern genetic studies (Wolpoff 1989; Templeton 1992). Neanderthals have been suggested either to be (i) direct ancestors of modern man or to have contributed to the gene pool of today’s humans (multiregional model [e.g. Wolpoff et al. 1984; Templeton 2002]) or (ii) to have been replaced by anatomically modern humans without leaving any genetic trace in contemporary populations (Out-of-Africa replacement model [Stringer and Andrews 1988; Harvati et al. 2003]). For a recent review of the current evidence pertaining to this debate, see Finlayson (2005). Most published phylogenetic analyses suggest that Neanderthal mtDNA is positioned outside the genetic diversity of contemporary humans. This points to the Out-of-Africa replacement model (Krings et al. 1997, 1999, 2000; Ovchinnikov et al. 2000; Schmitz et al. 2002; Knight 2003), though one cannot, with the limited Neanderthal sequences that are available, exclude other scenarios yet (e.g., Nordborg 1998). However, there are two main problems associated with the studies: (i) the use of limited contemporary human mtDNA sequences (as few as 10 [Schmitz et al. 2002]). Such restricted sequence sampling has been shown to affect the phylogenetic position of ancient human mtDNA sequences (Cooper et al. 2001); (ii) the use of analytical methods (i.e., neighbor-joining or maximum parsimony) that are, to some extent, unable to account for the extreme among-site variation in substitution rates and the levels of parallel mutations (homoplasies) that exist in the human control region (Krings et al. 1997, 1999, 2000; Ovchinnikov et al. 2000; Gutiérrez et al. 2002).
Other important issues affecting investigations of Neanderthal genetics are recent claims that the Neanderthal sequences are erroneous or simply sequence artifacts. Based on disagreements in genetic distances of the Neanderthal sequences to contemporary humans and the age of the fossils, Gutiérrez et al. (2002) argue that the first published Neanderthal sequence (Feldhofer I [Krings et al. 1997]) is erroneous. This interpretation is supported because some of the positions in the Feldhofer I sequence are unique (Caldararo and Gabow 2000; Schmitz et al. 2002) and might possibly result from postmortem damage (Hansen et al. 2001). Recent results from Pusch and Bachmann (2004) suggest that the original Neanderthal sequences might not represent “authentic” Neanderthal DNA but sequence artifacts.
Altogether these uncertainties and claims can have severe implications for the understanding of modern human evolution. This paper aims to address the claims and reevaluate Neanderthal genetics and phylogeny in an up-to-date framework. We evaluate the first published Neanderthal sequence (Feldhofer I [Krings et al. 1997]) with respect to damage-based errors, as discussed by Gutiérrez et al. (2002), and further investigate the study by Pusch and Bachmann (2004) where it is claimed that the Neanderthal sequences might not represent “authentic” Neanderthal DNA but sequence artifacts. The phylogenetic position of the Neanderthals mtDNA sequences relative to contemporary human mtDNA is analyzed using Bayesian inference and a newly compiled dataset together with the datasets used by Gutiérrez et al. (2002).
Materials and Methods
Assessing Damage and Sequence Artifacts
To investigate the problem of damage-based errors in the HVR1 Feldhofer I consensus sequence we randomly simulated 100,000 sequences with the same base composition as observed in the 11 positions that are variable in Neanderthal HVR1 sequences (Table 1). This approach assumes that the Neanderthal population is fairly homogeneous and not too genetically structured. Each of the 11 bases was drawn independently of each other and the empirical distribution, D, of the average pairwise difference (APD) to all humans was computed. The APDs between the four Neanderthal sequences and all human sequences were calculated (Table 2), and a test was applied to determine whether the average obtained for the Feldhofer I sequence was extreme in D. The true variance of the APD is likely to be underestimated in D (but not the mean value), because sites are drawn independently of each other. A variance correction was therefore performed assuming the APD is binomial with mean 11q as in D (the observed mean of D was 4.56, yielding q=0.414) and variance 11q(1 – q) = 2.67 with q=0.414 (Fig. 1). This approach is justified because the human sequences (from large HVR1) consist of one main type comprising 1584 of 1905 sequences, and the other types are very similar to the most frequent one. It was tested whether the APD between Feldhofer I and humans was extreme in a binomial distribution with q=0.414 (Fig. 1).
To test for chimeric sequences caused by jumping PCR events (Pääbo et al. 1989) in the datasets of Push and Bachmann (2004) and Krings et al. (1997), we used the approach of Gilbert et al. (2003b), examining the clone sequences for incompatible miscoding lesion-derived base substitutions.
The phylogenetic analyses of the interim consensus sequences (ICS) from Pusch and Bachmann (2004) and the Neanderthal sequences were analyzed using MrBayes (Huelsenbeck and Ronquist 2003). The Markov chain Monte Carlo analysis in MrBayes was run for 2 million generations with four chains, three times independently. Trees were sampled every 100 generations and a 50% majority-rule consensus tree was produced from the last 1000 trees. Stationarity was checked using the command “sump.”
Phylogenetic Inference
To reevaluate the Neanderthal phylogeny, we took two approaches. In one set of analyses we used the aligned mtDNA control region sequence datasets from Gutiérrez et al. (2002). The data contained a large HVR1 dataset of the hypervariable region 1 from the mtDNA control region, consisting of 422 aligned positions consisting from 1905 contemporary human and 3 Neanderthal sequences (AF011222, AF254446, AF282971). Furthermore, they also included a smaller HVR12 dataset (combined HVR1 and HVR2 mtDNA control region) of 843 aligned positions, consisting of 377 contemporary human and 2 Neanderthal sequences (AF011222, AF142095 and AF282971, AF282972). Additionally, in our reanalysis, we used an additional Neanderthal HVR1 sequence (AY149291) not used by Gutiérrez et al. (2002) and two HVR1 Cro-Magnon sequences (AY283027, AY283028) from Caramelli et al. (2003). The recently published HVR1 sequence from Vindija (Vi-80 [Serre et al. 2004b]) was not included in the analyses, as it is identical to the first Vindija Neanderthal sequence (Krings et al. 2000) and could be derived from the same individual.
In another set of analyses we created a dataset with 519 HVR1 and HVR2 sequences of 859 aligned positions from the HvrBase (Handt et al. 1998; http://www.hvrbase.org). The dataset consisted of 7 Chimpanzee sequences used as outgroup, the Vindija Neanderthal HVR1 and HVR2 sequences (AF282971, AF282972 [Krings et al. 2000]), and 511 contemporary human sequences (see Supplementary Material for the complete dataset). The Feldhofer 1 HVR sequences (AF011222, AF142095 [Krings et al. 1997, 1999]) were not used to avoid bias from possible sequence errors. The 511 human sequences were composed of 52 Africans, 162 Asians, 21 Oceanic/Australians, 145 Europeans, and 131 Americans. The dataset was created from the approximately 4000 taxa containing both HVR1 and HVR2 in the database by eliminating identical or nearly identical sequences until the dataset was reduced to 511 human sequences. We used this procedure because the Markov chain Monte Carlo method that is used in MrBayes (see below) to approximate the posterior probability distribution relies on convergence of the log likelihood and other model parameters. Convergence can be difficult to achieve when dealing with a large number of sequences and a relatively low number of unique site patterns (Huelsenbeck et al. 2002). Additionally, we aimed to include the maximum amount of sequence divergence in the dataset in order to be able to run the dataset within a reasonable timeframe.
To account for the large amount of parallel evolution and rate variation within the HVR regions (Tamura and Nei 1993), we used the general time reversible model of nucleotide substitution (GTR [Tavaré 1986; Rodriguez et al. 1990]) with gamma-distributed rates among sites with a correction for invariable sites (GTR+Γ+I). The number of gamma categories was as standard set to 4. To investigate the phylogenetic signals within the two HVR12 dataset from Gutiérrez et al. (2002), we partitioned it into the HVR1 and HVR2 sections.
The phylogenetic analyses of the datasets from Gutiérrez et al. (2002) were performed with the parallel version of MrBayes version 3 beta 4 (Huelsenbeck and Ronquist 2003) on the BioCluster at the Zoological Museum, University of Copenhagen (http://www.zmuc.dk/). The HVR12 dataset and individual HVR1 and HVR2 partitions from Gutiérrez et al. (2002) were each run for 15 million generations, and the large HVR1 dataset from Gutiérrez et al. (2002) was run for 30 million generations. Stationarity for these analyses was checked using the “sump” command in MrBayes. The HVR12 dataset created for this study was analyzed using MrBayes version 3.1.1 (Huelsenbeck and Ronquist 2003) and was run for 50 million generations. Trees were sampled every 1000 generations, with a 50% majority-rule consensus tree computed from samples after stationarity had been reached. Stationarity and effective sample size (the number of effectively independent draws from the posterior distribution that is sampled from) were checked with the program Tracer 1.2.1 (Rambaut and Drummond 2004).
To investigate the phylogenetic signal under the neighbor-joining method (as in Gutiérrez et al. 2002), we analyzed the individual HVR1 and HVR2 parts from the HVR12 dataset with neighbor joining using the TN93 (Tamura and Nei 1993) model with gamma-distributed rates among sites and correction for invariable sites with the program PAUP* version 4 beta 10 (Swofford 1998).
Results
Errors in the Feldhofer I Sequence
We find that the probability of a randomly generated sequence having a higher average pairwise difference than the Feldhofer I is only 1.4%. Applying the variance correction we found that the probability of obtaining an average pairwise difference of 8 or more (the observed pairwise difference between Feldhofer I and humans is 7.91; Table 2) is 3.7%. Thus, the Feldhofer I HVR1 sequence is extreme in base composition compared to the Neanderthal HVR1 sequences and it is likely that the sequence is erroneous.
Artificial Neanderthal DNA
A BLAST search (Altschul et al. 1997) demonstrates that seven of the different ICS (I–V, IX, and XI) obtained in the experiment of Pusch and Bachmann (2004) show 100% matches with human mtDNA GenBank sequences (accession numbers AF285377, AF285367, AY426291, AB059953, AF519867, AY314618, and AY314618), implying that a variety of contemporary human contaminant sequences was amplified. A higher frequency of transitions than transversions (164/72 = 2.3), combined with the higher frequency of type 2 (cytosine → thymine and guanine → adenine, i.e., CG→TA; total =95) than type 1 (adenine → guanine and thymine → cytosine, i.e., AT→GC; total = 68) mutations observed among the clone products of Pusch and Bachman (2004), is consistent with observations on postmortem damage-derived miscoding lesions (Hansen et al. 2001; Hofreiter et al. 2001b; Gilbert et al. 2003a, b; Binladen et al. 2006), suggesting that postmortem damage might be involved in their data. Furthermore, at least 24 of the 35 ICS (68.6%) can be identified as chimeras caused by PCR jumping events according to the method of Gilbert et al. (2003b)—in some cases, up to four jumping PCR events per sequence (Fig. 2). In comparison, only 4 of 167 clone sequences (2.4%) used to generate the first published Neanderthal HVR1 sequence were found to contain similar evidence of jumping PCR events (clones A2.10, B11.4, B11.8, and B14.9 [Krings et al. 1997]).
Figure 3 demonstrates that the diverse set of clones (35 ICS) obtained in the experiment of Pusch and Bachmann (2004), including the XXVI sequences, is phylogenetically more similar to the CRS and to each other than to any of the published Neanderthal sequences. The separate position of the Neanderthals and the ICS is highly supported with a posterior probability of 100%.
Neanderthal Phylogeny
The Bayesian analyses of the four datasets from Gutiérrez et al. (2002) show that Neanderthal sequences are separated from modern human sequences with a posterior probability of 100% (Fig. 4). A schematic representation of the resulting trees for the large HVR1 and the smaller HVR12, and HVR1 and HVR2 partitions is given in Figs. 4a–d. The large HVR1 (Fig. 4a), the HVR12 (Fig. 4b), and the small HVR1 (Fig. 4c) datasets support Neanderthal monophyly with a posterior probability of 100%. The small HVR2 (Fig. 4d) datasets support Neanderthal monophyly with a posterior probability of 63%. Analyzing the dataset created for this study shows that the Vindija Neanderthal HVR1 and HVR2 sequences are positioned as a sister group to the contemporary humans with a posterior probability of 100% (Fig. 5). Further, this dataset shows that six sequences of African origin form a sister group to the rest of the contemporary humans. The neighbor-joining analyses of the datasets of Gutiérrez et al. (2002) confirm that the Neanderthal sequences fall outside the sequences of contemporary humans for the large HVR1 dataset. However, the Neanderthal sequences fall within modern human variation for the combined HVR12, the small HVR1, and the small HVR2 datasets (Figs. 6a and b).
Discussion
Recent phylogenetic and population genetic research suggests that any genetic interchange between Neanderthals and anatomically modern humans was very limited during the approximately 10,000 years (10 kyr) they potentially co-occupied the same areas of Europe and Asia (Currat and Excoffier 2004; Serre et al. 2004b) and that the Neanderthals have not contributed to the mtDNA genetic diversity found in present-day humans (Krings et al. 1997, 1999, 2000; Ovchinnikov et al. 2000; Schmitz et al. 2002; Knight 2003). These issues are central to the two main theories of modern human origins: the Out-of-Africa replacement model, where modern humans rapidly replaced archaic forms (e.g., Neanderthals) as they began to spread from Africa through Eurasia and the rest of the world sometime around 100,000 years ago (Stringer and Andrews 1988; Harvati et al. 2003); and the multiregional model, where genetic exchange or even continuity exists between archaic and modern humans (e.g., Wolpoff et al. 1984; Templeton 2002).
In this paper we have investigated the genetic affinities of the Neanderthals to anatomically modern humans. First, we have evaluated whether the first published Neanderthal sequence (Feldhofer I) is erroneous (Gutiérrez et al. 2002). Second, we have investigated whether the Neanderthal sequences are sequence artifacts (Pusch and Bachmann 2004). Finally, with our reflections on the first two questions in mind, we have readdressed the controversial question about the phylogenetic position of the Neanderthals.
Errors in the Feldhofer I Sequence
One explanation for the unresolved position of Neanderthals among anatomically modern humans is that the sequence data might be considered unreliable due to the degraded nature of the Neanderthal specimens and their DNA (Gutiérrez et al. 2002). Biochemical analyses for investigating the preservation condition of excavated Neanderthal bones and teeth indicate that most of the specimens are unlikely to yield any endogenous DNA (Serre et al. 2004b). The majority of samples that have yielded putative Neanderthal DNA have only enabled PCR amplification of mtDNA in the 50-base pair (bp) size range (Serre et al. 2004b; Lalueza-Fox et al. 2005). In addition, it has been difficult to replicate the entire Neanderthal HVR1 sequences in independent laboratories (Krings et al. 1997; Ovchinnikov et al. 2000), suggesting that preservation of Neanderthal fossils is at the edge of what is required for successful DNA studies. It is therefore possible that some of the published Neanderthal DNA sequences might contain errors due to miscoding lesions (Hansen et al. 2001). This type of DNA damage is of particular concern if amplifications start from few template molecules, which appears to be the case at least in the first published Neanderthal study (the Feldhofer I HVR1 sequence [Krings et al. 1997]).
In support of errors in the Feldhofer I HVR1 sequence it has been argued that the most recent Neanderthal specimen (Mezmaiskaya, ∼29 kyr old) shows a shorter genetic distance to contemporary humans than Feldhofer I (which is believed to be the oldest of the Neanderthal specimens) (Gutiérrez et al. 2002). However, the validity of this argument is questionable, as the Feldhofer I fossil has recently been redated to ∼40 kyr (Schmitz et al. 2002), and the young age of the Mezmaiskaya fossil is debated (Skinner et al. 2005). Additionally, it has been noted that the Feldhofer I HVR1 sequence harbors four unique substitutions (positions 107, 108, 111, and 112) possibly due to postmortem damage accumulated during amplification (Caldararo and Gabow 2000; Schmitz et al. 2002; Hansen et al. 2001). Using a maximum damage-based error rate of ∼0.06%, Hofreiter et al. (2001b) reject major errors in the Feldhofer I sequence. However, the rate might be underestimated because they do not take into account the possible presence of damage hotspots in the human D-loop (Gilbert et al. 2003a) and the error rate is calculated from the consensus of only three Neanderthal sequences.
Comparisons of the average APD between the Feldhofer I sequence and the sequences of contemporary humans with the APD of randomly generated “Neanderthal sequences” indicate that the Feldhofer I HVR1 sequence is extreme in its genetic composition. It is therefore likely that the Feldhofer I HVR1 sequence is erroneous and we cannot exclude that at least this sequence is modified due to postmortem damage (see Errors in the Feldhofer I Sequence, under Results).
Artificial Neanderthal DNA
Instead of the Neanderthal sequences being affected by damage, a recent study suggests that their unique substitution patterns are caused by PCR artifacts. Pusch and Bachmann (2004) report that 35 different mitochondrial HVR1 sequences, including a group containing 7 substitutions that are in combination characteristic for the Neanderthals (i.e., clone XXVI; Fig. 2) can be amplified from a single sequence of modern human mitochondrial DNA (matching the Cambridge Reference Sequence; CRS [Anderson et al. 1981]) if the PCR reaction is spiked prior to amplification, with 14 different aDNA extracts of non-Neanderthal origin. The authors thereby indirectly imply that the published Neanderthal sequences could be explained in this manner and may, in fact, not represent “authentic” Neanderthal DNA. However, as shown in Fig. 3, the diverse set of clones (35 ICS) obtained in the experiment of Pusch and Bachmann (2004), including the Neanderthal-like XXVI sequences, is phylogenetically more similar to the reference sequence and to each other than to any of the published Neanderthal sequences. The separate position of the Neanderthals and the ICS is highly supported, with a posterior probability of 100%, and all of the artificially generated sequences are therefore clearly distinguishable from the published Neanderthal sequences.
Another interesting issue is that regular BLAST searches (Altschul et al. 1997) reveal that seven of the different ICSs obtained by Pusch and Bachmann (2004) show 100% match with different human mtDNA GenBank sequences. This strongly suggests that a variety of human contaminants is amplified in the experiment. Furthermore, the higher frequency of transitions than transversions combined with the higher frequency of type 2 than type 1 mutations (see Materials and Methods) among the clone products of Pusch and Bachman (2004) is consistent with the presence of damage-based misincorporation in the template DNA (Hansen et al. 2001; Gilbert et al. 2003a, b, 2005a; Willerslev et al. 2003). This could also explain the high frequency of chimeric sequences recorded in their dataset (>68%). Such chimeric sequences are caused by “jumping PCR” events that frequently take place when the template molecules are damaged (Pääbo et al. 1989; Willerslev et al. 1999) and/or the amplification starts from very similar molecules (von Wintzingerode et al. 1997). In some cases, up to four jumping PCR events are found per ICS in the dataset of Push and Bachmann (Table 2). In this context it is important to keep in mind that six of the seven Neanderthal-characteristic substitutions in the XXVI clone are recorded in contemporary human populations and a chimeric sequence of as few as five contaminant molecules (accession numbers are listed in Table 3) could generate a XXVI-like sequence. Importantly, only 4 of 167 clone sequences (2.4%) used to generate the first published Neanderthal HVR1 sequence contain similar evidence of jumping PCR (clones A2.10, B11.4, B11.8, and B14.9 [Krings et al. 1997]).
Considering these results the retrieval of 35 different ICS (clones I to XXVIII, clones sls and srs [Pusch and Bachmann 2004, Table 2]) from the 14 spiked reactions is not surprising, taking the experimental design and the resulting sequence compositions into account. Exogenous human DNA is present in DNA extracts from museum remains even after extensive cleaning of the specimens (e.g., Malmström et al. 2005; Gilbert et al. 2006) and up to 20 different human sequences have been reported from a single fossil (Hofreiter et al. 2001a). Contaminant human DNA may originate from a variety of sources, including handling of the specimens (often over several decades), previous PCR products, and reagents and tools used for DNA extraction and PCR (Willerslev and Cooper 2005). Even though blank controls are negative, contaminant DNA sequences might be amplified from sample extracts due to sample contamination and/or “carrier effects” (Handt et al. 1994; Cooper and Poinar 2001; Hofreiter et al. 2001a). The 14 different ancient specimens used by Pusch and Bachmann (2004) for DNA extractions can be expected to carry a considerable load of human contamination, some of which is likely to be highly degraded after years of storage. Apparently, their experiments were not conducted using aDNA standards such as an isolated facility for DNA extractions and PCR setup and the cleaning of specimens, reagents, and tools (Cooper and Poinar 2001; Hofrierter et al. 2001a; Pääbo et al. 2004; Willerslev and Cooper 2005), so both sample and laboratory-based contamination are of major concern. Furthermore, the results have not proven reproducible (Serrre et al. 2004a; Beauval et al. 2005), which has become standard in human aDNA work (Cooper and Poinar 2001; Hofreiter et al. 2001a; Willerslev and Cooper 2005), so it is impossible to determine how much of the effect might be specific to the extraction and amplification techniques used.
If, as implied by Pusch and Bachmann (2004), the amplification of artificially “Neanderthal-like” XXVI clone sequences is common in aDNA studies and constitutes 16% of their clone products, it is surprising that similar sequences are not already present in GenBank after more than 50 ancient human mtDNA publications (i.e., PubMed search). In particular, such sequences should have been spotted in recent studies such as that of mtDNA from Cro-Magnons (Caramelli et al. 2003) and the Andaman Islanders (Endicott et al. 2003), where cloning was applied. In a recent study where more than 900 human clone sequences were obtained from 34 Viking specimens (Gilbert et al. 2003a), no sequences were obtained that matched the XXVI clones of Pusch and Bachmann or that had any Neanderthal-like substitution patterns.
Neanderthal Phylogeny
Readdressing the controversy of the phylogenetic position of the Neanderthal sequences (Gutiérrez et al. 2002), we favored the Bayesian inference method (Huelsenbeck and Ronquist 2003) because it assigns a posterior probability to each possible phylogeny rather than just selecting a single best tree, and further, the Bayesian inference method allows comparison of the support for conflicting phylogenies. The method uses a Markov chain Monte Carlo method which allows large sequence datasets to be analyzed in a statistical framework with an adequate model of substitution (Huelsenbeck et al. 2002). The method has been parallelized (Altekar et al. 2004), which has increased the complexity of analytical problems that can be solved.
The Bayesian inference delivers consistent results for all the datasets (the large HVR1 and small HVR1, HVR2, and HVR12 datasets used by Gutiérrez et al. [2002] and our newly constructed HVR12 dataset; see Materials and Methods), which strongly supports the separation of Neanderthals from contemporary humans (Figs. 4 and 5). Although the 95% credibility interval (the Bayesian equivalent of a confidence interval) also includes phylogenies where the Neanderthal sequences do not form a monophyletic group, the support for a monophyletic Neanderthal group is considerable. Altogether the Bayesian phylogenies are in agreement with previous phylogenetic analyses (Krings et al. 1997, 1999, 2000; Ovchinnikov et al. 2000; Schmitz et al. 2002; Knight 2003) and show that the ambiguous results of Gutiérrez et al. (2002) could be due to the inadequacy of the neighbor-joining method with the TN93 model (Tamura and Nei 1993) for the given data. Analyzing the dataset created for this study, we see not only that the Vindija Neanderthal HVR1 and HVR2 sequences are positioned as a sister group to the contemporary humans with posterior probability 100%, but also the ancestral position of six sequences with African origin.
Thus our results are in agreement with the expectations of the Out-of-Africa replacement model for modern human origin. To exclude the possibility that the results are influenced by the possible sequence errors in the Feldhofer I HVR1 sequence (Krings et al. 1997), this sequence is not included in our newly constructed HVR12 dataset.
Conclusion
The evolutionary relationship between Neanderthals and anatomically modern humans is highly debated. Intriguingly, phylogenetic analyses addressing this issue have so far suffered from limited sequence sampling and inadequate methodology causing conflicting results. Large-scale Bayesian analyses strongly support a position of the Neanderthal mtDNA sequences outside that of anatomically modern humans, in agreement with the expectations of the Out-of-Africa replacement model for modern human origin. It is noteworthy, however, that with the limited number of Neanderthal sequences that are available one cannot yet rule out other scenarios (e.g., Nordborg 1998). It was recently estimated that the maximum interbreeding rates between Neanderthals and anatomically modern humans have been <0.1% (Currat and Excoffier 2004). However, the result relies, among other things, on the existence of a very precise relationship between the chemical preservation of amino acids and endogenous DNA and on depurination being the main type of damage limiting the half-life life of DNA in fossil remains (Serre et al. 2004b). Both of these assumptions have been questioned (Collins et al. 1999; Hansen et al. 2006). Another interesting point is the fact that Neanderthal male contribution to anatomically modern humans is not recorded in the maternally inherited mtDNA sequences, and nuclear DNA (nuDNA) from Neanderthals may tell a different story. However, despite technical advantages, such as the creation of metagenomic libraries (a technique allowing for the sequencing of large amounts of genomic DNA without an initial amplification step) from the extinct cave bear (Noonan et al. 2005), it is likely to be difficult to obtain reliable Neanderthal nuDNA sequences with current techniques. This is due to contamination problems (i.e., distinguishing endogenous Neanderthal and contaminant contemporary human nuDNA if the sequences are identical) and the generally poor preservation of nuDNA in fossil remains (Poinar et al. 2003). Thus, the recovery of a well-preserved Neanderthal specimen free of contemporary human contamination or the search for variable sequences of ancient bone proteins (Nielsen-Marsh et al. 2005) might currently be the only realistic approaches for addressing this issue. Although some of the published Neanderthal mtDNA sequences may contain a few sequence errors due to damage in the template molecules for PCR, there is currently no solid evidence for the sequences being a result of PCR artifacts.
References
Adcock GJ, Dennis ES, Easteal S, Huttley GA, Jermin LS, Peacock WJ, Thorne A (2001) Mitochondrial DNA sequences in ancient Australians: implications for modern human origins. Proc Natl Acad Sci USA 98:537–542
Altekar G, Dwarkadas S, Huelsenbeck JP, Ronquist F (2004) Parallel Metropolis coupled Markov chain Monte Carlo for Bayesian phylogenetic inference. Bioinformatics 20:407–415
Altschul SF, Madden TL, Schäffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ (1997) Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 25:3389–3402
Anderson S, Bankier AT, Barrell BG, de Bruijn MH, Coulson AR, Drouin J, Eperon IC, Nierlich DP, Roe BA, Sanger F, Schreier PH, Smith AJ, Staden R, Young IG (1981) Sequence and organization of the human mitochondrial genome. Nature 290:457–465
Beauval C, Maureille B, Lacrampe-Cuyaubére F, Serre D, Peressinotto D, Bordes J-G, Cochard D, Couchoud I, Dubrasquet D, Laroulandie V, Lenoble A, Mallye J-B, Pasty S, Primault J, Rohland N, Pääbo S, Trinkaus E (2005) A late Neanderthal femur from Les Rochers-de-Villeneuve, France. Proc Natl Acad Sci USA 102:7085–7090
Binladen J, Wiuf C, Gilbert MT, Bunce M, Barnett R, Larson G, Greenwood AD, Haile J, Ho SY, Hansen AJ, Willerslev E (2006) Assessing the fidelity of ancient DNA sequences amplified from nuclear genes. Genetics 172:733–741
Caldararo N, Gabow S. (2000) Mitochondrial DNA analysis and the place of Neanderthals in Homo. Ancient Biomol 3:135–158
Cano RJ, Borucki MK (1995) Revival and identification of bacterial spores in 25- to 40-million year-old Dominican amber. Science 268:1060–1064
Caramelli D, Lalueza–Fox C, Vernesi C, Lari M, Casoli A, Mallegni F, Chiarelli B, Dupanloup I, Bertranpetit J, Barbujani G, Bertorelle G (2003) Evidence for a genetic discontinuity between Neanderthals and 24,000–year–old anatomically modern Europeans. Proc Natl Acad Sci USA 100:6593–6597
Collins MJ, Waite ER, van Duin ACT (1999) Predicting protein decomposition: the case of aspartic-acid racemization kinetics. Philos Trans Roy Soc Lond B 354:51–64
Cooper A, Poinar HN (2001) Ancient DNA: do it right or not at all. Science 18:1139
Cooper A, Rambaut A, Macaulay V, Willerslev E, Hansen AJ, Stringer C (2001) Human origins and ancient human DNA. Science 292:1655–1656
Currat M, Excoffier L (2004) Modern humans did not admix with Neanderthals during their range expansion into Europe. PLoS Biol 2:2264–2274
Endicott P, Gilbert MTP, Stringer C, Lalueza-Fox C, Willerslev E, Hansen AJ, Cooper A (2003) The genetic origins of the Andaman Islanders. Am J Hum Genet 72:178–184
Finlayson C (2005) Biogeography and evolution of the genus Homo. TREE 20:457–463
Fish SA, Shepherd TJ, McGenity TJ, Grant WD (2002) Recovery of 16S ribosomal RNA gene fragments from ancient halite. Nature 417:432–436
Gilbert MTP, Willerslev E, Hansen AJ, Barnes I, Rudbeck L, Lynnerup N, Cooper A (2003a) Distribution patterns of post-mortem damage in human mitochondrial DNA. Am J Hum Genet 72:32–47
Gilbert MTP, Hansen AJ, Willerslev E, Barnes I, Rudbeck L, Lynnerup N, Cooper A (2003b) Characterisation of genetic miscoding lesions caused by post-mortem damage. Am J Hum Genet 72:48–61
Gilbert MTP, Cuccui J, White W, Lynnerup N, Titball RW, Cooper A, Prentice MB (2004) Absence of Y. pestis-specific DNA in human teeth from five European excavations of putative plague victims. Microbiology 150:341–354
Gilbert MTP, Bandelt H, Hofreiter M, Barnes I (2005a) Assessing ancient DNA studies. TREE 20:541–544
Gilbert MTP, Shapiro B, Drummond A, Cooper A (2005b) Post mortem DNA damage hotspots in Bison (Bison bison and B. bonasus) provide supporting evidence for mutational hotspots in human mitochondria. J Arch Sci 32:1053–1060
Gilbert MTP, Rudbeck L, Willerslev E, Hansen AJ, Smith C, Penkman KEH, Prangenberg K, Nielsen-Marsh CM, Jans ME, Arthur P, Lynnerup N, Turner-Walker G, Biddle M, Kjølbye-Biddle B, Collins M (2005c) Biochemical and physical correlates of DNA contamination in archaeological human bones and teeth excavated at Matera, Italy. J Arch Sci 32:783–795
Gilbert MTP, Hansen AJ, Willerslev E, Turner-Walker G, Collins M (2006) Insights into the processes behind the contamination of degraded human teeth and bone samples with exogenous sources of DNA. Int J Osteoarch 16:156–164
Gutiérrez G, Sánchez D, Marín A (2002) A reanalysis of the ancient mitochondrial DNA sequences recovered from Neanderthal bones. Mol Biol Evol 19:1359–1366
Handt O, Höss M, Krings M, Pääbo S (1994) Ancient DNA: methodological challenges. Experientia 50:524–529
Handt O, Meyer S, Haeseler A. von (1998) Compilation of human mtDNA control region sequences. Nucleic Acids Res 26:126–129
Hansen AJ, Willerslev E, Wiuf C, Mourier T, Arctander P (2001) Statistical evidence for miscoding lesions in ancient DNA templates. Mol Biol Evol 18:262–265
Hansen AJ, Mitchell DL, Wiuf C, Paniker L, Brand TB, Binladen J, Gilichinsky DA, Rønn R, Willerslev E (2006) Crosslinks rather than strand breaks determine access to ancient DNA sequences from frozen sediments. Available at: 10.1534/genetics.106.057349; accessed April 2, 2006
Harvati K (2003) The Neanderthal taxonomic positions: models of intra- and inter-specific craniofacial variation. J Hum Evol 44:107–132
Hebsgaard MB, Phillips MJ, Willerslev E (2005) Geologically ancient DNA: fact or artefact? Trends Microbiol 13:212–220
Hofreiter M, Serre D, Poinar HN, Kuch M, Pääbo S (2001a) Ancient DNA. Nature Rev Genet 2:353–360
Hofreiter M, Jaenicke V, Serre D, von Haeseler A, Pääbo S (2001b) DNA sequences from multiple amplifications reveal artifacts induced by cytosine deamination in ancient DNA. Nucleic Acids Res 29:4793–47799
Huelsenbeck JP, Ronquist F (2003) MrBayes 3: Bayesian phylogenetic inference under mixed models. Bioinformatics 19:1572–1574
Huelsenbeck JP, Larget B, Miller RE, Ronquist F (2002) Potential applications and pitfalls of Bayesian inference of phylogeny. Syst Biol 51:673–688
Kim S, Soltis DE, Soltis PS, Suh Y (2004) DNA sequences from Miocene fossils: an ndhF sequence of Magnolia latahensis (Magnoliaceae) and an rbcL sequence of Persea pseudocarolinensis (Lauraceae). Am J Bot 91:615–620
Knight A (2003) The phylogenetic relationship of Neanderthal and modern human mitochondrial DNAs based on informative nucleotide sites. J Hum Evol 44:627–632
Krings M, Stone A, Schmitz RW, Krainitzki H, Stoneking M, Pääbo S (1997) Neanderthal DNA sequences and the origin of modern humans. Cell 90:19–30
Krings M, Geisert H, Schmitz RW, Krainitzki H, Pääbo S (1999) DNA sequence of the mitochondrial hypervariable region II from the Neanderthal type specimen. Proc Natl Acad Sci USA 96:5581–5585
Krings M, Capelli C, Tschentscher F, Geisert H, Meyer S, von Haeseler A, Grossschmidt K, Possnert G, Paunovic M, Pääbo S (2000) A view of Neanderthal genetic diversity. Nature Genet 26:144–146
Lalueza-Fox C, Sampietro ML, Caramelli D, Puder Y, Lari M, Calafell F, Martínez-Maza C, Bastir M, Fortea J, de la Rasilla M, Bertranpetit J, Rosas A (2005) Neanderthal evolutionary genetics: mitochondrial DNA data from the Iberian Peninsula. Mol Biol Evol 22:1077–1081
Malmström H, Stora J, Dalen L, Holmlund G, Götherström A (2005) Extensive human DNA contamination in extracts from ancient dog bones and teeth. Mol Biol Evol 22:2040–2047
Nielsen-Marsh CM, Richards MP, Hauschka PV, homas-Oates JE, Trinkaus E, Pettitt PB, Karavanic I, Poinar H, Collins MJ (2005) Osteocalcin protein sequences of Neanderthals and modern primates. Proc Natl Acad Sci USA 102:5594–5599
Noonan JP, Hofreiter M, Smith D, Priest JR, Rohland N, Rabeder G, Krause J, Detter JC, Pääbo S, Rubin EM (2005) Genomic sequencing of Pleistocene cave bears. Science 309:597–599
Nordborg M (1998) On the probability of Neanderthal ancestry. Am J Hum Genet 63:1237–1240
Ovchinnikov IV, Gotherstrom A, Romanova GP, Kharitonov VM, Liden K, Goodwin W (2000) Molecular analysis of Neanderthal DNA from the northern Caucasus. Nature 404:490–493
Pääbo S, Higuchi RG, Wilson AC (1989) Ancient DNA and the polymerase chain reaction. J Biol Chem 264:9709–9712
Pääbo S, Poinar H, Serre D, Jaenicke-Després V, Hebler J, Rohland N, Kuch M, Krause J, Vigilant L, Hofreiter M (2004) Genetic analyses from ancient DNA. Annu Rev Genet 38:645–679
Poinar H, Kuch M, McDonald G, Martin P, Pääbo S (2003) Nuclear gene sequences from a Late Pleistocene sloth coprolite. Curr Biol 13:1150–1152
Pusch CM, Bachmann L (2004) Spiking of contemporary human template DNA with ancient DNA extracts induces mutations under PCR and generates non-authentic mitochondrial sequences. Mol Biol Evol 21:957–964
Rambaut A, Drummond A (2004) Tracer v1.2. University of Oxford, Oxford, UK
Rodríguez F, Oliver JL, Marin A, Medina JR (1990) The general stochastic model of nucleotide substitution. J Theor Biol 142:485–501
Schmitz RW, Serre D, Bonani G, Feine S, Hillgruber F, Krainitzki H, Pääbo S, Smith FH (2002) The Neanderthal type site revisited: interdisciplinary investigations of skeletal remains from the Neander Valley, Germany. Proc Natl Acad Sci USA 99:13342–13347
Serre D, Hofreiter M, Pääbo S (2004a) Mutations induced by ancient DNA extracts? Mol Biol Evol 21:1463–1467
Serre D, Langaney A, Chech M, Teschler-Nicola M, Paunovic M, Mennecier P, Hofreiter M, Possnert G, Pääbo S (2004b) No evidence of Neanderthal mtDNA contribution to early modern humans. PLOS Biol 2:313–317
Skinner AR, Blackwell BAB, Martin S, Ortega A, Blickstein JIB, Golovanova LV, Doronichev VB (2005) ESR dating at Mezmaiskaya Cave, Russia. Appl Radiat Isotopes 62:219–224
Stringer CB, Andrews P (1988) Genetic and fossil evidence for the origin of modern humans. Science 239:1263–1268
Swofford DL (1998) PAUP*: Phylogenetic Analysis Using Parsimony (*and other methods). Sinauer Associates, Sunderland, MA
Tamura K, Nei M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial DNA in humans and chimpanzees. Mol Biol Evol 10:512–526
Tavaré S (1986) Some probabilistic and statistical problems on the analysis of DNA sequences. Lect Math Life Sci 17:57–86
Templeton AR (1992) Human origins and analysis of mitochondrial DNA sequences. Science 255:737
Templeton A (2002) Out of Africa again and again. Nature 416:45–51
von Wintzingerode F, Göbel UB, Stackebrandt E (1997) Determination of microbial diversity in environmental samples: pitfalls of PCR-based rRNA analysis. FEMS Microbiol Rev 21:213–329
Vreeland RH, Rosenzweig WD, Powers DW (2000) Isolation of a 250 million-year-old halotolerant bacterium from a primary salt crystal. Nature 407:897–900
Willerslev E, Cooper A (2005) Ancient DNA. Proc R Soc Lond B 272:3–16
Willerslev E, Hansen AJ, Christensen B, Steffensen JP, Arctander P (1999) Diversity of Holocene life forms in fossil glacier ice. Proc Natl Acad Sci USA 96:8017–8021
Willerslev E, Hansen AJ, Brand T, Binladen J, Gilbert TMP, Shapiro B, Wiuf C, Gilichinsky DA, Cooper A (2003) Diverse plant and animal DNA from Holocene and Pleistocene sedimentary records. Science 300:791–795
Willerslev E, Hansen AJ, Brand TB, Rønn R, Barnes I, Wiuf C, Gilichinsky DA, Mitchell D, Cooper A (2004a) Long-term persistence of bacterial DNA. Curr Biol 14:R9–R10
Willerslev E, Hansen AJ, Poinar HN (2004b) Isolation of nucleic acids and cultures from ice and permafrost. TREE 19:141–147
Wolpoff MH (1989) Multiregional evolution: the fossil alternative to Eden. In: Stringer C, Mellars P (eds) The human revolution. Edinburgh University Press, Edinburgh
Wolpoff MH, Wu X, Thorne AG (1984) Modern Homo sapiens origins: a general theory of human evolution involving the fossil evidence from East Asia. In: Smith FH, Spencer F (eds) The origins of modern humans: a world survey of the fossil evidence. Alan R. Liss, New York, pp 411–483
Acknowledgments
We thank A. Krogh, L. G. T. Jørgensen, and W. Hughes for valuable help and J. P. Huelsenbeck, S. Y. W. Ho, M. Phillips, and J. Haile for helpful discussion. G. Gutierrez was very helpful in providing an electronic version of their data. M.B.H. was supported by the Danish Research Council. C.W. was supported by The Danish Cancer Society. M.T.P.G. was supported by the Marie Curie FP6 Action. H.G. was supported by the Danish National Science Research Council. E.W. was supported by Wellcome Trust, the Carlsberg Foundation, and the Danish Research Council.
Author information
Authors and Affiliations
Corresponding author
Additional information
[Reviewing Editor: Dr. Martin Kreitman]
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Hebsgaard, M.B., Wiuf, C., Gilbert, M.P. et al. Evaluating Neanderthal Genetics and Phylogeny. J Mol Evol 64, 50–60 (2007). https://doi.org/10.1007/s00239-006-0017-y
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-006-0017-y