Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Introduction

The papers which refer to our topic can be classified by seven subjects.

  1. (a)

    Geological description of dynamic problems of thermal evolution of the Earth, plate tectonics and chemical differentiation as well as of the Andean orogeny,

  2. (b)

    Models which are partially kinematic and partially dynamic. In this kind of models, essential features are prescribed in order to gain a large adaptation to geological and geophysical observations.

  3. (c)

    Geochemical models of growth and differentiation of continents which do not contain any dynamic modeling,

  4. (d)

    Dynamic models of the subduction process to understand the physical mechanism behind subduction,

  5. (e)

    Circulation models,

  6. (f)

    Fully dynamic models of subduction in a diamond (cf. Fig. 8), i.e. in a certain 3-D sector of the spherical shell, which represents the mantle, where this diamond is embedded into a realistic 3-D spherical-shell solution,

  7. (g)

    Fully dynamic forward models of spherical-shell convection with chemical differentiation and generation of continents.

Fig. 8
figure 8

The diamond (red), embedded in the global grid, which will be dyadically refined to the model resolution (Modified from [6, Fig. 1]. Of course, we use a further refinement of the grid)

We systematically described the papers of types (a)–(e) in [110]. Therefore, we will not repeat it here. Up to now, there is no paper of type (f). Some supplements will follow: To obtain a more profound analysis of the Andean orogeny, it is important to understand why a plateau-type orogen formed between a purely oceanic lithospheric plate (Nazca plate) and a continent (South America) [71]. The Andean mountain belt belongs to the non-collisional type. Kley and Monaldi [42, 45, 46] found a Cenozoic shortening of 250–350 km whereas Arriagada et al. [4] derived 400 km for the central Andes. In other areas of the Earth, however, the subduction zones at an oceanic-continental plate-boundary site have only little or no shortening and do not show any elevated plateaus. In most cases, the upper plate is characterized by backarc extension. Schellart and Rawlinson [82] discuss some hypotheses of which it is claimed that they explain this exceptional behavior of the Andes.

  • In [66], it is proposed that the young age of the Nazca plate and low negative buoyancy cause this phenomenon.

  • Climatic conditions, high friction and subduction erosion is thought to be the decisive factor [57, 58].

  • Heuret and Lallemand [30] emphasize the eminent role of acceleration of the westward movement of the upper, South American plate. Sobolev et al. [84] find out that the accelerating westward movement of the South American plate is the most important factor for the Andean orogeny.

The third hypothesis generates the question of the mechanism which drives South America westward. The ridge push at the mid-Atlantic ridge and the slab pull at the Lesser Antilles and the Scotia arc have been proposed but it could be that these contributions are too small. Schellart and Rawlinson [82] remarkably mention the slab pull of the Nazca plate. However, we propose that a downwelling of the bulk convection beneath South America caused by the thermal screening of the thick continental lithosphere could play a role. If there is a large upwelling east of South America and a large downwelling of the bulk convection under South America we could understand why the bulk convection current would move the South American plate westward since, because of the thick continental lithosphere, South America is not decoupled from the bulk convection by the asthenosphere. Additionally, we observe a large upwelling of the bulk convection beneath the Pacific. So we can expect that another current of the bulk convection will go eastward, producing the arcs of the Lesser Antilles and Scotia. It is unclear if these speculative arguments are appropriate. However, they show not only the necessity of dynamic, numerical regional models but also that the regional model must be nestled among time dependent boundary conditions determined by a global dynamic model. Evidently, some essential features of the Andean orogenesis cannot be explained by an isolated regional model.

We discuss further new geological and geophysical papers in Sect. 3.2.1. We did that on purpose in order to substantiate why we used certain details in our development of two special (alternative) mechanisms for a regional model of the South American subduction zone and Andean orogenesis. Further new papers are mentioned in Sect. 3.2.2 in order to select an appropriate set of prescribed plate movements in the surrounding spherical-shell convection model which is necessary to embed the regional model.

2 A Spherical-Shell Forward Model and Other Results

Under Sects. 2.12.3, we describe what we have done using the system HP XC4000 of the Steinbuch Center for Computing in Karlsruhe. Under Sect. 2.4, other efforts of us which have some relation to the topic are reported. There is a direct relationship between the Sect. 2.12.3 and the running works which we describe in Sect. 3.2.1 in rather distinct outlines, i.e. we discuss our specific South American dynamical model which is embedded in a second spherical-shell model.

2.1 Spherical-Shell Model: Forward Model

We developed two spherical-shell convection models. The forward model is described here; another model with prescribed plate movements is outlined under Sect. 3.2.2. For both spherical-shell models as well as for the regional models of the Andes (Sect. 3.2.1), we newly derived, in some cases adopted new radial profiles of the relevant physical parameters of the mantle [101]. As a Grüneisen parameter, we calculated an extended acoustic gamma, γ ax , using seismic observations. These observed values are the bulk modulus K, the shear modulus μ, dK ∕ dP and dμ ∕ dP, where P is the pressure. This procedure has the advantage to be based on observable quantities without any assumptions on mineralogy. A further calculated property is the adiabatic gradient. We derived new profiles for the thermal expansivity, α, and the specific heat, c v , at constant volume. For the chemical differentiation we used a simple melting criterion, T > f 3T m , where T is the temperature and T m is the solidus. We estimated the lower-mantle solidus using values of [60] and of other authors continuing them by using our γ ax . In the upper mantle and the transition layer, we took into consideration the water dependence of the solidus. Litasov [61] investigated the influence of different water concentrations on the solidus of peridotite and we took the derived depressions of the solidi into account in the computation of our convection-differentiation model of the mantle’s evolution. So in our new model, the melting temperature is a function of time and position. The most important innovation of the new convection model concerns our newly derived viscosity profile which is based on solid-state physics and seismological results [100, 101]. This viscosity distribution has a resemblance to the viscosity model of [65] although its derivation is totally different. The full set of convection-differentiation equations has been solved using the improved code Terra (see Sect. 2.2). For each run, we obtained lots of parameters, some of which can be compared with observational quantities. For ages from 4,490 Ma to the present time, we received the curves of the laterally averaged heat flow density qob at the surface, the converted continent-tracer mass per Ma, the Urey number Ur, the Rayleigh number Ra, the Nusselt number Nu, the volumetrically averaged mean temperature Tmean of the mantle, showing a very realistic temperature drop when compared with Archean komatiite temperatures, the integrated mass of continents, the kinetic energy of the mantle flow Ekin, the radiogenic heat production Qbar and the laterally averaged heat flow density qcmb at the core-mantle boundary (CMB). Further results are the vector field of the creeping velocity and the temperature distribution for every time step of the Earth’s evolution. Up to now, we varied the melting parameter f 3 and the thermal conductivity k. There are f 3 − k clusters of realistic totalities of solutions. Figure 1 represents an episodic distribution of juvenile additions to the continents which is connected with the episodicity of orogenetic epochs. Figure 2 shows the present-day distribution of continents of the same run. In this example, 42.2 % of the Earth’s surface is covered by continents. The present-day surface heat flow of this run is 81. 39 mW/m2, comparable with the observed value of 90. 185 mW/m2 [12]. The present-day value of qcmb of this run is 20. 20 mW/m2 which seems to be realistic, too.

Fig. 1
figure 1

Juvenile additions to the sum of continental masses acc. to the new convection-differentiation model Run 498, [101]

Fig. 2
figure 2

The distribution of continents (red), oceanic lithosphere (yellow) and oceanic plateaus (black dots) for the present time according to the new convection-differentiation model [101], Run 498, r n  = 0. 5, σ y  = 120 MPa, continental percentage = 42.2 %

We computed the shear viscosity, η, using

$$\eta (r,\theta ,\phi ,t) = 1{0}^{r_{n} } \cdot \frac{\exp (c\ \overline{T_{m}/T_{av}})} {\exp (c\ \overline{T_{m}/T_{st}})} \cdot \eta _{4}(r) \cdot \exp \left [c_{t} \cdot T_{m}\left ( \frac{1} {T} - \frac{1} {T_{av}}\right )\right ]$$
(1)

where r is the distance from the Earth’s mass center, θ the colatitude, ϕ the longitude, t the time and r n the viscosity-level parameter. The quantity r n has been varied to shift the viscosity profile to the left or to the right. So we generated different time-averaged Rayleigh numbers, Ra, varying from run to run. T m is the newly invented melting temperature [101] which additionally depends on water abundance. So T m is a function of time, too. T av is the laterally averaged temperature, T st the initial temperature profile. The quantity η4 denotes the new viscosity profile [101] for the initial temperature and r n  = 0. For MgSiO3 perovskite we should insert c = 14, for MgO wüstite c = 10 according to [113]. Therefore, the lower-mantle value of c should be somewhere between these two values. For numerical reasons, however, we are able to use only c = 7. In the lateral-variability term we use c t  = 1. The temperature is denoted by T.

Figure 3 shows the difference between computed and observed present-day continental surface percentage. The central run with r n  = 0. 5 (corresponding to Ra ≈ 108) and σ y  = 120 MPa has the minimum difference between theory and observation, namely 1.85 %. This run shows also other optimal results, e.g. regarding the temporal distribution of the magmatic-activity episodes (cf. Fig. 1) and the (spectral) distribution of continents (cf. Fig. 2). Further variations of the parameters refer to the factor f 3 of the melting criterion and to the thermal conductivity, k, keeping r n and σ y fixed at the mentioned two values. At a first glance on Fig. 4, we could have the impression that there is a trade-off between k and f 3 since optimal solutions (black circles with an outer ring) cluster along a certain curve. But all the other observable quantities in other f 3 − k plots show that only the three optimum solutions with k = 5. 0 W/(m ⋅K) in the upper right corner of Fig. 4 are realistic. Figure 5 demonstrates, e.g., that only thermal-conductivity values around k = 5. 0 W/(m ⋅K) lead to solutions which are satisfactory for all observables. This value is acceptable also from the physical point of view [115]. Therefore, we varied f 3 in small steps from 1.000 downward, keeping k = 5. 0 W/(m ⋅K). As expected from Figs. 4 and 5 and similar f 3 − k plots we obtained very realistic solutions down to f 3 = 0. 985. However, already f 3 = 0. 983 and f 3 = 0. 981 generates less convincing results. Figure 6, e.g., shows the corresponding present-day distribution of continents.

Fig. 3
figure 3

We compare different runs which are distinguished only by the viscoplastic yield stress, σ y , and the viscosity-level parameter r n . The symbols represent the magnitude of the difference, d c  ∗ , between the computed and observed present-day surface percentage of continents, expressed in percent

Fig. 6
figure 6

Less realistic distributions of continents (red), oceanic lithosphere (yellow) and oceanic plateaus (black dots) for the present time. r n  = 0. 5, σ y  = 120 MPa, k = 5. 0 W/(m ⋅K), f 3 = 0. 983 and f 3 = 0. 981, respectively

Fig. 4
figure 4

Keeping r n  = 0. 5 and σ y  = 120 MPa fixed, we vary the melting-criterion factor, f 3, and the thermal conductivity, k. The symbols represent intervals of the theoretical present-day continental surface, p c , in percent. Note for comparison that 40.35 % of the real present-day Earth is covered by continents and epicontinental seas. The optimal run, shown in Figs. 1 and 2, corresponds to k = 5. 0 W/(m ⋅K) and f 3 = 0. 995

Fig. 5
figure 5

A f 3 − k plot for constant r n  = 0. 5 and σ y  = 120 MPa. The laterally averaged surface heat flow is denoted by qob. The temporal average of qob over the last 900 Ma is called qob 9 and is expressed in mW/m2

2.2 Numerical Improvements

Two submitted publications of our group [55, 69] deal with numerical improvements. Their conclusions will not be discussed here. However, their results are very important for the realization of our Andean model (Sect. 3.2.1).

A big part of our efforts was concentrated on the creation of an essentially improved Terra code, which is necessary to resolve the numerical problems with the newly derived viscosity profiles. We equally apply this new viscosity profile [101] to a spherical-shell mantle convection model [100, 101] as well as to the regional model of the South American subduction slabs (Sect. 3.2.1). In 2009, the Terra developers of the universities Munich (H.-P. Bunge, M. Mohr), Cardiff (H. Davies), Leeds (P. Bollada), Jena (M. Müller, C. Köstler) and of the Imperial College London (R. Davies) and others started a close cooperation in the further development of the Terra code and intensified the collaboration with J. Baumgardner (San Diego, USA), the inventor of Terra. At a first joint meeting in Munich in 2009, the group decided to set up a community svn-repository for further code development, supplemented by trac, a web-based project management and bug-tracking tool, and automated compile/test cycles using BuildBot. From then on the group worked on a common code base using automated tests for every revision of the code. There have been three successive joint meetings in Cardiff in 2010, in Jena in 2011 (see also http://www.igw.uni-jena.de/geodyn/terra2011.html) and in Munich in 2012. The progress being made since the first meeting includes the following items.

  • Enhancement of the code to increase global resolution and maximum number of MPI processes.

  • Further development and integration of the Ruby test framework [68] into the automated BuildBot tests.

  • Implementation of a finite-element inf-sup stabilization using pressure-polynomial projections proposed in [15].

  • Development and implementation of an efficient preconditioner for the variable-viscosity Stokes system [54].

  • Refinement of the Pressure Correction algorithm [54], giving more robust convergence.

  • Restructuring of the code to use language features of Fortran95 and Fortran2003 where possible.

  • Integration of automated code documentation using doxygen.

  • Integration of VTK-support and automated visualization.

  • Significant improvements in the formulation of the free-slip boundary condition on the spherical surface.

Continued effort is spent on several numerical and technical topics as well as on including more realistic physical models. In the following, some important details are given.

  • In [97, 100, 101], two pressure- and temperature-dependent viscosity profiles of the Earth’s mantle are developed and used for the computation of two mantle convection models with chemical differentiation of oceanic plateaus and generation of continents. The new viscosity model includes very strong viscosity gradients at lithosphere-asthenosphere boundary and at 410- and 660-km phase boundaries. Therefore, it is very important to make the code Terra fit for such a strong challenge. Regarding a physically consistent variable viscosity momentum operator, J. Baumgardner, P. Bollada and C. Köstler figured out in which way the code has to be changed to apply a physically consistent A-operator using cell-averaged viscosities. The most significant code change is the switch from nodal based to triangle based operator parts on the sphere. The viscosity-weighted summation over triangular integrals is then done in the application of the operator. We expect that the cost for applying Au will be doubled but a consistent formulation on all grid-levels could pay off for this, especially if we get a better convergence rate of the multigrid algorithm. The implementation of the triangle-based operator formulation is now under way.

  • Exporting Terra-operators in sparse matrix format: When exporting the FE-matrices for the whole grid, they can be analyzed and PETSc or other parallel solver packages can be applied to it. This is intended to provide flexibility and to ensure reliability of future code changes.

  • Multigrid(MG)-implementation: As mentioned in [90], the current MG-implementation in Terra does not fulfill the expectations raised by the performance of a 2-D Cartesian version of Terra, documented in [114]. In [54] it is also identified to be the worst performing part of the iterative solver in Terra. It is only poorly analyzed, and it is not satisfactory documented. M. Mohr is going to analyze and document the currently used matrix-dependent transfer multigrid in detail. The MG-implementation is also to be changed to using cell-based viscosity averages.

  • Free-slip boundary condition and propagator matrix benchmark tests: P. Bollada and R. Davies showed that adding boundary terms to the right hand side of the momentum equation reduces some sort of errors while other kinds of errors still exist. They will continue to figure out the exact cause of that behavior and work on fixing this. With direct access to the radial velocity component, the local spherical coordinate system offers a way to straightforward implement the free-slip boundary condition. J. Baumgardner implemented this in a local copy of Terra, and it is ready to be used. The group has agreed to create a repository branch to continue working on that. If successful, the local spherical coordinate system-version will be merged into the trunk after some months of testing.

  • Adding Ruby tests: The Ruby test framework is ready to be used extensively in testing Terra’s subroutines as individually as possible. It can also be used for debugging by application of subroutines to predefined scalar and vector fields. Still the test coverage of the Terra by Ruby tests is very low and needs to be extended.

2.3 Andean Model

A large expenditure of time of U. Walzer in the last 3 years was the analysis and synopsis of geophysics, geology and geochemistry of the Andean orogenies. This has been done in close cooperation with J. Kley and L. Viereck-Götte. Some geophysical and geological considerations are outlined in Sect. 3.2.1.

2.4 Long-Term Related Works of Our Group

Kley investigated the structural geology of some areas of the central Andes [40] and gave a regional structural analysis and kinematic restoration [41, 47]. He participated in an effort to extend quantitative structural analysis to a transect right across the backarc area. First steps were also taken towards constraining the evolution of strain rates over time [49], leading to the suggestion that the continental strain rates had increased during the Andean orogeny. Kley began to extend kinematic analysis to the entire orogen, employing serial balanced sections and map view restoration techniques [44]. A map view kinematic model of the central Andes [45] formed the basis for comparison of geologically derived, orogen-scale kinematics with GPS and seismologic data [31, 32, 52]. It could be shown that the present-day strain field from satellite geodetic data closely matches the strain field for the last 10 Ma as inferred from geologic evidence. Additional evidence was presented that the strain rate in the South American plate becomes independent of plate convergence rate in the later stages. Using the map-view strain field as input for a numerical model, an attempt was also made to constrain crustal thickness evolution and the flux of crustal material during the Andean orogeny [33]. The studies on variations in structural style along the Andes [42] also triggered a second line of research dealing with the influence of inherited lithospheric heterogeneities on the spatial strain distribution. One important factor here is the widespread occurrence of Mesozoic rift basins [50] that were partially inverted as the thrust front migrated across them. Several case studies from a particularly well-exposed rift system in northern Argentina helped to clarify the importance of fault reactivation and stratigraphic discontinuities in conditioning the mechanical behavior of the upper crust in contraction [43, 46, 51, 67]. The results of these studies were incorporated in [48, 71].

Walzer and cooperators worked on convection-fractionation problems. The thermal evolution of the mantle and the chemical evolution of the principal geochemical reservoirs have been modeled simultaneously by a fractionation mechanism plus 2D-FD thermal convection [9396]. Oceanic plateaus, enriched in incompatible elements, develop leaving behind the depleted parts of the mantle. The resulting inhomogeneous heat-source distribution generates a first feed-back mechanism. The lateral movability of the growing continents causes a second feed-back mechanism [95]. Effects of the viscosity stratification on convection and thermal evolution of a 3D spherical-shell model have been investigated and a viscosity profile of the mantle was developed [103, 104]. The paper [106] presents 2D and 3D thermochemical models of mantle evolution where a self-consistent theory is included using the Helmholtz free energy, the Ullmann-Pan’kov equation of state, the free-volume Grüneisen parameter and Gilvarry’s formulation of Lindemann’s law. In order to obtain the relative variations of the radial factor of the shear viscosity, the pressure, P, the bulk modulus, K, and ∂K ∕ ∂P from the seismic model PREM have been used. The publications [97, 104, 105, 107] present models of self-consistent generation of stable, but time-dependent plate tectonics on a 3D spherical shell. Different types of solutions have been found for different models by systematic variation of parameters [97, 102, 104, 108, 109]. Stirring effects are investigated in [22]. A 3D spherical-shell mantle convection and evolution model with growing continents [97, 100, 101, 108, 109] has been developed. The evolution model equations guarantee conservation of mass, momentum, energy, angular momentum, and of four sums of the numbers of atoms of the pairs 238U-206Pb, 235U-207Pb, 232Th-208Pb, and 40K-40Ar. The pressure- and temperature-dependent viscosity is supplemented by a viscoplastic yield stress. The lithospheric viscosity is partly imposed to mimic the viscosity increase by chemical layering and devolatilization. Stochastic effects [98] are shown to exist especially in the chemical differentiation. Although the convective flow patterns and the chemical differentiation of the oceanic plateaus are coupled, the evolution of the time-dependent Rayleigh number, Ra t , is relatively well predictable from run to run and the stochastic parts of the Ra t (t) – curves are small [97].

Viereck-Götte and cooperators worked not only about the genesis of the Jurassic Ferrar large igneous province in Antarctica but also about the connection between Antarctica, South America and Africa. Plateau forming lavas in the Karoo province of South Africa and in the Ferrar province in Antarctica were emplaced synchronously at about 180 Ma. They thus seem to originate from the same dynamic mantle process. However, both are distinguished by their isotopic characteristics with respect to the Rb/Sr- and Sm/Nd systems: while the Karoo magmas show mantle values, the Ferrar magmas exhibit enriched upper crustal values. However, the boundary between both Jurassic igneous provinces is marked by a large transpressional shear zone (Heimefront SZ) of Pan-African age (600–500 Ma), crossing Dronning Maud Land (S-African side of Antarctica) on the continent side of the Grunehogna craton, a fragment of the W-Gondwana Transvaal craton. This shear zone is interpreted as a reactivated suture of Grenvillean age (1.1 Ga). If Jurassic magmas on either side of this boundary are isotopically different, it must be concluded that

  1. 1.

    This is not a signature in a lower mantle plume.

  2. 2.

    This must be a signature within the subcontinental lithospheric mantle.

  3. 3.

    This signature must be older than Grenvillean in age.

  4. 4.

    It must be introduced into a paleo supra-subduction mantle wedge by subduction processes if it is a crustal isotopic signature.

Their studies concentrated on the timing of the initiation of the Ferrar as a large igneous province as well as on the physicochemical characterization of the melt source region conditions during melt differentiation. Studying the intrusive, extrusive and volcaniclastic rocks:

  • They reconstructed the initiation of a large igneous province to have occurred in several steps of melt pulses within 5 Ma (189–183 Ma ago).

  • They showed initiation to have started with large-volume shallow-level intrusions of low-Ti andesitic melts into wet fluvial sediments (Triassic/Jurassic) associated with diatreme-forming Taalian-type eruptions.

  • The eruptions followed by basaltic andesites in small volume eruptions of partly pillowed lavas from local eruptive centers prior to large volume plateau forming lava extrusions from feeder dikes followed by a final pulse of a large volume andesites high in Ti that had differentiated from a common primary melt under lower pressure, oxygen fugacity and water activity.

All melts belong to the tholeiitic differentiation series, however with orthopyroxene instead of olivine as early fractionating mafic solidus phase. REE, Sr-Nd-isotope and PGE characteristics indicate generation of the primary melts within the spinel-lherzolite zone of an isotopically enriched and sulfur-undersaturated subcontinental lithospheric mantle. Crustal silica enrichment of this source due to an overprint in a supra-subduction environment had already been concluded from Re-Os isotopy. Our Sr-isotope data in plagioclase phenocrysts, however, exhibit decreasing radiogenic character in subsequent melt pulses indicating additional assimilation of crust to decreasing extents during ascent and differentiation. Due to a Cretaceous thermal event, 40Ar/39Ar-ages (ranging from 235 to 90 Ma) and S-isotope characteristics are heavily disturbed, only a few samples exhibit the relict primary δ34S-value of − 19.5.

3 The Andean Subduction Model and Its Embedding

3.1 Some Fundamental Remarks

Our modeling of the Andean slab should be compatible with our basic assumption that plate tectonics is an integral part of the convection of the entire mantle. Therefore, we should aim at a regional model which is embedded into a realistic convective spherical-shell model. It is possible that the mantle convection is characterized by top-down control. But it is also possible that the larger part of the mantle mass essentially determines the movements at the surface since the most important share of the primordial energy is stored there and also the principal share of the radiogenic energy is released there. Here we refer to the absolute value, not to the density of these quantities. On the other hand, for reasons of energy, it is evident that the Earth’s core cannot play a prominent part in controlling mantle convection. It is exactly the reverse. The mantle convection determines the boundary conditions of the hydromagnetic convection in the outer core. For example, in time spans of high activity of mantle convection, the latter one generates lateral temperature differences in D′′ which reduce the number of magnetic reversals or even make them impossible because of the anti-dynamo theorems.

Furthermore, it is well known at the present time that the oceanic lithosphere consists of three (or more) layers which are chemically different and that its lower boundary is characterized by a sharp viscosity jump. At the same time the oceanic lithosphere is also a thermal boundary layer. So, we do not intend to use the old simplified approach that the existence of an oceanic lithosphere can exclusively be explained by a thermal boundary layer. In this case we should expect a gradual decrease of the viscosity at the lower boundary. Furthermore, we conclude that the principal part of the buoyancy of the slab heavily depends on chemistry and that phase transitions, especially the basalt-eclogite transition, play an eminent role.

So, we want to combine a global 3-D spherical-shell convection model with a 3-D regional convection model of South America and the surrounding plates and a model of the orogeny of the Cordilleras. The Andes are an ideal test case for various reasons:

  • They are very large, thus keeping the inevitable problems of different scales at a minimum.

  • They are active and well-studied. A wealth of information constrains the processes of orogenesis.

  • They have a simple large-scale geometry. The plate margin is gently curved and was even straighter in the past. Convergence is orthogonal to the mean trend of the margin. The age structure of the Nazca plate near the contact with South America is symmetric, with the oceanic lithosphere oldest in the center and younging to both sides.

  • Their plate kinematic framework has remained nearly unchanged for the last 50 Ma. On the other hand, there was a prominent switch in the mode of subduction earlier, from presumably steep with backarc extension to low-angle with backarc shortening. In the Andes, both end-member settings can therefore be studied in the same place.

  • There is a clear compositional and rheological contrast between the two converging plates. It is unlikely that large volumes of material are transferred across the plate contact, in strong contrast to continental collision zones.

  • The Andean substrate has a simple geologic history. Much of the South American plate was in place before subduction started some 200 Ma ago. Except for the northern Andes, no terranes were accreted after the Paleozoic.

We aim at a numerical, nearly purely dynamic model with a minimum number of restrictions and additional assumptions. This model is to explain the physical mechanism of the essential features of Andean orogeny.

3.2 A Model of the Andean Subduction

3.2.1 Outlines of the Model

We do not develop an exclusively regional model of the Andean orogenesis since, in this case, the temporally varying boundary conditions are unknown. Therefore, it is often assumed, for reasons of simplicity, that there are no effects from outside of the regional computational domain. However, some changes in the arc volcanism and in the tectonic shortening of the Andes suggest a connection with the 30-Ma Africa-Eurasia collision. Therefore, we embed a regional 3D model into a 3D spherical-shell model. So, we solve the balance equations of momentum, energy and mass in the spherical-shell model using somewhat larger time steps on a coarser whole-mantle grid, coarser than in the regional model. The values of creeping velocity, temperature and pressure, determined in that way and lying at the boundaries of the regional computational domain, then serve as boundary conditions for a computation with smaller time steps for which the balance equations are solved in the regional computational domain.

3.2.2 The Regional Model

Meanwhile, we have developed a numerical tool for several classes of regional models, and it was indeed a great effort. So, we are in no way restricted to the model developed in the following. We could as well compare the effects of different geological proposals. To design the regional model, we developed not only extensive numerical improvements (see Sect. 2.2) in the Terra code but studied also the latest geophysical, geological and geochemical results taking them into consideration in the draft of a physically reliable mechanism which is not only geodynamically probable but also numerically feasible (U. Walzer, J. Kley, L. Viereck-Götte). There are several geologically descriptive model proposals. We mention only [13, 28, 29, 37, 71, 76]. Our designed computable model system ought to solve the following problems.

Fig. 7
figure 7

The geometric starting configuration in our second Andean regional model (Taken from [92])

  • Why do we observe today in some segments of the Andes flat subduction and magmatic lull {Bucaramanga, Peruvian (2–15 ∘ S) and Pampean (27–33 ∘ S) flat slab [75]}, but in other segments normal subduction with dip angles between 30 ∘  and 40 ∘ ? This is amazing since the westward velocity of the South American plate and the eastward velocity of the Nazca plate do not essentially vary alongshore.

  • Why do the flat-slab segments migrate in Cenozoic times along-side the Andes [76]? Ramos and Folguera [77] found an almost continuous belt of flat slabs which migrate southward.

  • Why is the present-time volcanism restricted to segments with a 30–40 ∘  dipping slab [1, 36, 48, 83]?

  • Why does the deformation essentially start in the West and migrate to and finish in the East [71]? We derived some estimations on some other hypotheses. After that we consider the assumption as most promising to answer the mentioned questions by the assumption that oceanic plateaus and aseismic ridges are carried by the conveyor belt, i.e. by the Nazca plate. The plateaus and ridges generate additional positive buoyancy [27]. So the hinge of the slab migrates to the East until the volcanism totally vanishes. However, if we scrutinize a good geological map of South America we notice that the relation between flat-slab segments and ridges are not simple. We can assign the Pampean flat slab to the Juan Fernandez Rigde [2]. Even the southward migration of the Pampean flat-slab zone can be explained by form and movement of the Juan Fernandez Rigde [37]. The southern part of the Peruvian flat slab can be referred to the Nazca Rigde. For the northern part of the Peruvian flat slab we have to introduce the hypothesis of an immersed Inca Plateau [28]. However, east of the Carnegie Rigde there is an abundant volcanism in Ecuador. Michaud et al. [64] show a detailed reconstruction of the eastward movement of the Carnegie Rigde which extends at least 60 km below the South American plate with a continuous plunging slab down to a depth of 200 km. The adakitic signal is proposed to be ridge-induced. In the case of the Iquique rigde we have to assume that there is no eastern continuation of it. Pindell and Kennan [74] show that the flat-slab area in northern Colombia and Venezuela might be considerably larger than the area assumed by Ramos [76].

  • Why should our mechanism of the generation of the Andes be three-dimensional? Several observations suggest the idea that essential features of the Andean orogenesis cannot be explained by 2-D dynamical models:

    1. (a)

      Hindle et al. [33] conclude that a mass balance which is restricted to a cross-section through the Andes leads into contradictions. They show that there has to exist an essential mass transport alongside the Andes. In particular, the displacement of material toward the axis of the bend in the central Andes leads to a significant crustal thickening. This cannot be explained by a two-dimensional model, neither kinematically nor dynamically.

    2. (b)

      Anderson et al. [3], Kneller and van Keken [53] and Barnes and Ehlers [5] show and discuss, for the southern Andean subduction zones, trench-parallel high seismic shear velocities near to the trench and an abrupt transition to trench-perpendicular high seismic shear velocities in the back arc. The Brazilian subcrustal lithosphere beneath the eastern Cordillera, the Interandean and the Subandes are East-West fast whereas the shear-wave velocity under the Altiplano and Puna has maximum values in the North-South direction. This is a hint that a significant three-dimensional flow might be involved in the mechanism.

    3. (c)

      The toroidal component. A pertinent argument for the necessity of 3-D models results from the following considerations and calculations. Already Gable et al. [20] and O’Connell et al. [70] showed the relevance of the toroidal-poloidal partitioning for lithospheric plate motions. The lateral subducting slab movement induces slab-parallel flows and a rollback-generated flow around the slab [81]. Stegman et al. [86] demonstrated that in the case of non-vanishing rollback of the subduction slab for some typical cases, 69 % of the energy of the negative buoyancy of the slab is converted into the toroidal component of the rollback-induced flow whereas 18 % are consumed for the weakening of the plate. These numbers show the importance of a 3-D modeling in a striking way. Only in the very beginning of the Andean-specific modeling we prescribe the velocity of the migrating subduction hinges as a function of time according to [71] in order not to overburden the model. But the other degrees of freedom of the flexible slab should be determined by the differential equations of the model. We vary the lateral extend of the individual slab between 200 and 5,000 km. That is, in a first type of numerical experiments we introduce the individual lobes shown by the distribution of seismicity. In a second numerical experiment we assume an undivided slab for the whole South American continent (Fig. 7), at best with a disconnection at the Chile Rise. For all versions we cut out a spherical diamond (Fig. 8) from the newly improved and inf-sup stable Terra code which is able to solve the convection differential equations in a spherical-shell mantle. Essential parts of the South American plate and the Nazca plate fit into this spherical diamond. The vertical boundary conditions are iteratively taken from a whole-mantle convection model. Cf. Sect. 3.2.2. In contrast with [86], we do not neglect the energy equation. It is evident that the subduction mechanism is possible only assuming a low-viscosity asthenosphere [18] which is less dense than the average oceanic lithosphere. Furthermore, the rheology may not be purely viscous. For this purpose, we [97] prefer a viscoplastic yield stress. The lithospheric-asthenospheric boundary for the viscosity is relatively sharp, also for the oceanic lithosphere. This is a special challenge for our code.

  • By the model, it should be possible to understand essential geochemical observations relevant for South America. The difficult problem of delamination is very probably connected with this issue. It is necessary to clarify what kind of delamination is dominant. Figure 9 shows the gradual increase of the flow of incompatible elements. After each flat-slab period, this rise is abruptly interrupted. Such kind of plots exist not only for the mass ratio La/Yb but also for 87Sr/86Sr and 144Nd/143Nd. These curves describe a slow growth of the abundances of elements with large ionic radii within one cycle since 87Sr is the daughter isotope of 87Rb, 143Nd is the daughter isotope of 147Sm and the other mentioned isotopes are stable. The fluctuations of the whole-rock initial ε Nd of the central Andean arc by DeCelles et al. [13] are evidently related to Fig. 9. DeCelles et al. [13] emphasize that the cyclical changes of the isotopic compositions of arc magmas cannot be explained by changes in the convergence rates. They and also we expect that these changes have to be explained by episodic gravitational foundering. There are two principal possibilities.

    1. (A)

      According to [13], below arc and hinterland, i.e. below the western parts of the South American continent, the eclogitization of the thickening lower continental crust and of the lithospheric mantle causes a density increase and therefore a delamination so that these units sink into the mantle wedge driven by their own weight. Carlson et al. [10] discuss the possibility that the continental mantle above the wedge of the mantle overlying a subducting oceanic plate can become unstable. The detachment from the overlying continental crust can cause major orogenetic episodes. Davidson and Arculus [11] propose a delamination of the cumulate layers below the seismological Moho back into the mantle of the sub-arc wedge. The two-dimensional numerical model by Sobolev et al. [84, 85] is compatible with the geological models mentioned under a) and belongs to b)-type of papers mentioned in the Introduction. They use a viscoelastic rheology supplemented by Mohr-Coulomb plasticity for the layered lithosphere. The drift of the overriding plate and the pulling of the slab is prescribed by the velocities at the boundaries of the 2-D model area and it is not calculated by solution of the balance equations though. What drives Andean orogeny? Sobolev et al. [84, 85] answer this question by numerical experiments using their 2-D model and varying only one influence parameter each. They conclude that the major factor is the westward drift of the South American plate. Paragraph (A) outlines only one way of thinking which we intend to test.

    2. (B)

      Here we propose a second hypothesis to understand the mentioned geochemical observations of [29] and of [13, Fig. 5]. This hypothesis is patronized by the idea of a geochemical marble-cake mantle [97, 99] (see Fig. 10) but composed of irregularly formed parts of a depleted MORB mantle (DMM) with 80–180 ppm H2O and 50 ppm C and another, richer reservoir with 550–1,900 ppm H2O and 900–3,700 ppm C [23, 111, 112]. The mid-oceanic ridge basalt is denoted by MORB. It is possible that the reservoirs are intermixed in smaller quantities so that there is no sharp boundary between the different parts of the mantle [34, 89]. DMM dominates in the smaller depths of the mantle. The deeper the slab dives into the mantle, the higher is the probability to touch regions with a high abundance of 3He and incompatible elements. Pilz [73] investigated 3He/4He ratios in the Puna plateau and at the volcano Tuzgle. He found that these 3He/4He values are higher than in the western Cordillera and in the Salta Basin east of it. Pilz and also we conclude that a provenance by degassing of the slab is not feasible since the mantle’s 3He is primordial. The 3He of the atmosphere cannot be subducted in appreciable quantities.

      Fig. 9
      figure 9

      The La/Yb ratio as a function of time for the igneous rocks of the north Chilean arc (21–26 ∘ S) acc. to Haschke et al. [29]. Note that after the flat-slab stage, the La/Yb suddenly decreases to a low starting value. The periods of orogenetic activity are before these drops

      Fig. 10
      figure 10

      A calculated marble-cake mantle model acc. to Walzer and Hendel [99]. This is an equatorial section showing the present-time state of the chemical evolution of incompatible elements of the Earth’s mantle. We use a modernized reservoir theory. The depleted MORB mantle (DMM) and a mantle which is rich in incompatible elements, yet, are strongly intermixed. Strongly depleted parts of the mantle which include more than 50 % DMM are represented by yellow areas. Relatively rich mantle parts with less than 50 % DMM are orange-colored. In general, the yellow-orange boundary does not correspond to a discontinuity of the abundances of U,Th,K, etc. The cross sections through the continents are red. Black dots represent the oceanic plateaus. The yield stress is 125 MPa, the viscosity-level parameter is − 0.50

      There are different, but related models of chemical layering inside the subducting oceanic lithosphere [21, 80]. There is a water-rich subduction channel above the slab. Because of the sediment dehydration and the oceanic crustal dehydration it is not entirely clear how deep this hydrous channel is extended. Not only the 3He/4He ratio of the Puna plateau [73] but also La/Yb, 87Sr/86Sr and 144Nd/143Nd increase from an age of τ = 4. 5 Ma until the present time [29]. There are different explanations for this phenomenon. One of them would be a rise in an antiparallel direction in hot fingers immediately or in some distance above the subducting slab. Such a suggestion has been offered by Tamura et al. [91] for the NE Japanese arc. Marsh [63] proposed a similar idea of a hydrothermal flow field, in this case immediately above the upper surface of the down-going slab. In this way he explains also the sharp line of volcanoes or the volcanic front which can be observed in the Andes. Pilz [73] concludes from seismic observations that such hot fingers are also in the wedge of the southern central Andes and that these fingers are near the surface of the slab. So we want to develop a numerical model with fluid pathways of hydrothermal fluids, described by a particle approach, not very far from the slab but with an antiparallel flow direction. This idea is corroborated also by Furukawa [19].

We are working about the numerical problem to cut out a spherical diamond, out of the dynamical 3-D spherical-shell model, and to determine the temporally changing boundary conditions at the vertical side walls of the 3-D diamond by the solution of the convection in a spherical shell, but now with prescribed plates at the surface of the sphere. The Nazca plate moves with a (in the first approach) prescribed angular velocity through the margin into the computational domain and dives under the South American continent because of a Rayleigh-Taylor instability which is induced mainly by the transition to eclogite. The oceanic plate has a given sandwich structure of zero-pressure densities, ρ0, [21] and viscosities, η0. The slab is supposed to be freely movable at the surface and floating inside the mantle. It is well-known that it is difficult to detach a spherical-shell plate from the surface of the shell into the mantle in a slab-like manner [8]. As in [97], we want to introduce a viscoplastic yield stress at the near-surface lithospheric region and compare the creeping viscosity, η c , at each position and time with the plastic viscosity, η p , and use the minimum value in the model. This procedure is sensible from the physical point of view. If a piece of the Nazca plate which is thickened by an oceanic plateau or a passive ridge approaches to South America then it pushes the hinge back under the South American plate because of its positive buoyancy. Therefore the volcanic front migrates to the East. The residual slab will become steeper and its tip will touch deeper and deeper parts of the mantle. Therefore, more and more atoms of incompatible elements will rise using the fluid pathways explaining the rising parts of the curve of Fig. 9. We are able to describe the movements of the hydrothermal fluid by a tracer modulus. These particles, describing the migration velocities of higher abundances of incompatible elements, 3He, metals like Cu, Ag, etc., are not carried along with the creeping rock. They can move along a surface antiparallel to the slab which represents a thin layer. A serpentinized layer-like part of the mantle wedge just above the subducting oceanic crust could serve in reality as such a thin layer [25]. The slab movement is (in this model as in reality) an integral part of the solid-state mantle convection. The mentioned quantities ρ0 and η0 of the slab will be additionally described by other tracers which, in this case, will be entrained and carried along by the solid-state creep down to the interior of the mantle. The generation and eastward migration of major Andean ore deposits [29, 38, 39, 79], the eastward migration of the volcanic front, the eastward movement of the deformation ages across the southern central Andes (21 ∘ S) [17, 71] as well as the eastward migration of deformation in the Interandean and Subandean [16, 41] can be dynamically modeled by our approach. When the subducting movement of the down-hanging part of the slab (and the flat-slab part behind) is accelerated, then an elevated activity of the Atacama fault zone, the Peruvian shortening, the Incaic shortening etc. are induced. The sudden interruption of the supply of highly incompatible elements (Fig. 9) must be induced by a radical event. Using the analogous, geologically describing models of [59, Fig. 4] and [35, Fig. 7], it cannot be concluded that many unusual features of the Permian-Jurassic South China fold belt can be explained by shallow subduction with an extensive final foundering of the lower plate combined with a roll-back of a small remnant subducting slab in the Mid-Jurassic. Humphreys [35] concludes for the Mid-Tertiary that an extensive sinking of the flat Farallon slab occurred which caused an uplifting of the continent covering a large area of the western North America. We propose an explanation by an extensive generalized eclogitization of the flat-slab part of the oceanic plate and an extensive delamination due to a Rayleigh-Taylor instability. The latter process can be simulated for the former and present South American flat slabs using a particle approach. The extensive tear-off would entirely interrupt the supplies of incompatible elements, 3He, etc. since the tracer transport path near the upper surface of the low-hanging part of the slab and the flat part of the slab is ripped.

At first glance, (A) and (B) seem to be competing models, but in reality they do not entirely exclude one another. It is evident that the two roughly sketched numerical regional models are very ambitious. Therefore, we could report here only on the numerical and mathematical results of the preparatory efforts, the further development of the code and our geodynamical conception. However, a spherical-shell model is finished (cf. 2.1).

3.2.3 Spherical-Shell Model: Prescribed Angular Velocities of the Lithospheric Plates

To define the time-dependent boundary conditions of the vertical side walls of the spherical diamond containing South America and the Nazca plate, we introduce a spherical-shell convection model with the same radial profiles of the relevant physical parameters as in [100, 101] but with prescribed angular velocities of the plates for the last 200 Ma. This desirable time span is essentially determined by the available observational data, e.g. by Fig. 9. But the most plate-motion models do not go back so far. However, the main difficulty refers to another item. Even though we know all velocities of neighboring plates between each other, we do not know the “absolute” velocities of the plates relative to the highly viscous mid part of the lower mantle. But the net rotation of the averaged lithosphere is important for the kinematic analysis and the dynamic modeling of the slabs, especially the slabs at the margin of South America. The “global tectonic map” of [82] is based on the Indo-Atlantic hotspot reference frame by O’Neill et al. [72] and on the relative plate motion model by DeMets et al. [14]. This map shows a large eastward motion of the Nazca plate, large in comparison to the magnitude of the velocity of the South American plate. In relation to this feature, this map is similar to the deforming, no-net-rotation reference frame model GSRM by Kreemer et al. [56]. In contrast to these two models, the hotspot reference model HS-3 [24] shows a high westward velocity of the South American plate, high in comparison with the amount of the velocity vectors of the Nazca plate. HS-3 is based on the age progressions of ten Pacific ocean islands. Becker [7] and Becker and Faccenna [8] wrote a very good analysis of the problem of “absolute” plate velocities. Becker [7] and Long and Becker [62] try to determine the present-day plate velocities from the convective shearing movements and the seismic anisotropy of the upper mantle. But this procedure is more sensitive to the direction of the velocity vector than to its magnitude. HS-3 contains a large net rotation of the laterally averaged lithosphere relative to the high-viscosity parts of the lower mantle. The majority of the authors derived a smaller real net rotation which is defined as the spherical harmonic degree l = 1 component of the toroidal part of the plate velocities. Ricard et al. [78] estimated 30 % of HS-3, Steinberger et al. [88] calculated 38 %. The azimuthal seismic anisotropy is compatible only with values less than 50 % of HS-3 [7].

Already early, Steinberger and O’Connell [87] linked the hot spot tracks and the movement of oceanic lithospheric plates. Gurnis et al. [26] report on a very practicable open-source system which contains the angular velocities for the plates from 140 Ma to the present time. Each plate has a time-dependent Euler pole. The plates are described by time-dependent closed plate polygons. Each of these plate boundaries has its own, time-dependent Euler pole. The code allows to introduce new interactive plate boundaries [9]. We intend to use two or three seemingly realistic spherical-shell plate-motion models to define the boundary conditions of our regional convective system of South America and its surrounding. The repercussions of the different spherical-shell systems on the regional model should be investigated and compared.

3.2.4 Future Numerical and Technical Improvements

In addition to the joint work within the group of Terra developers, further developments are to be continued regarding the following items.

  • We are going to further improve the parallelization of the particle tracking routines in Terra. Compared to previous Terra versions, there is an extra need for communication among several MPI-processes to figure out connected regions of partial melting in the mantle from which incompatible elements are extracted and transported to the surface. A similar communication is required to define the extent of continental lithospheric plates. With the high number of tracers, it is crucial to compress the required global information locally before it is exchanged among neighboring processors. R. Hendel will continue to reduce the communication overhead for tracking globally connected regions, so that the scalability of the particle routines will be extended to 500 and more processors. Such an optimized way of communicating global regions and features is also needed in modeling the elasticity of the subducting plates in the regional Andean model.

  • We also plan to bring the elasticity model in Terra to work. It has also to be chosen carefully how elasticity is dealt with in the solution of mass and momentum equations. It could be necessary to iterate over the whole Stokes within every time step until an equilibrium between elastic and viscous forces is reached.

  • To run both, regional and global models, with time-dependent boundary conditions at the surface, plate reconstruction data will be imported from the GPlates code [9] (www.gplates.org). The development of the interface between GPlates and Terra will be done together with L. Quevedo, Sydney.

  • Furthermore, the documentation of the code, which is build from source code comments automatically with doxygen, will be enhanced to make it easier for new developers to work on Terra.