A somatic genetic clock for clonal species

Yu, Lei; Renton, Jessie; Burian, Agata; Khachaturyan, Marina; Bayer, Till; Kotta, Jonne; Stachowicz, John J.; DuBois, Katherine; Baums, Iliana B.; Werner, Benjamin; Reusch, Thorsten B. H.

doi:10.1038/s41559-024-02439-z

A somatic genetic clock for clonal species

Article
Open access
Published: 10 June 2024

Volume 8, pages 1327–1336, (2024)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue Submit your manuscript

A somatic genetic clock for clonal species

Download PDF

4538 Accesses
2 Citations
303 Altmetric
43 Mentions
Explore all metrics

Abstract

Age and longevity are key parameters for demography and life-history evolution of organisms. In clonal species, a widespread life history among animals, plants, macroalgae and fungi, the sexually produced offspring (genet) grows indeterminately by producing iterative modules, or ramets, and so obscure their age. Here we present a novel molecular clock based on the accumulation of fixed somatic genetic variation that segregates among ramets. Using a stochastic model, we demonstrate that the accumulation of fixed somatic genetic variation will approach linearity after a lag phase, and is determined by the mitotic mutation rate, without direct dependence on asexual generation time. The lag phase decreased with lower stem cell population size, number of founder cells for the formation of new modules, and the ratio of symmetric versus asymmetric cell divisions. We calibrated the somatic genetic clock on cultivated eelgrass Zostera marina genets (4 and 17 years respectively). In a global data set of 20 eelgrass populations, genet ages were up to 1,403 years. The somatic genetic clock is applicable to any multicellular clonal species where the number of founder cells is small, opening novel research avenues to study longevity and, hence, demography and population dynamics of clonal species.

Rapid evolution with generation overlap: the double-edged effect of dormancy

Article 20 March 2019

Somatic genetic drift and multilevel selection in a clonal seagrass

Article 11 May 2020

Quantifying the effect of genetic, environmental and individual demographic stochastic variability for population dynamics in Plantago lanceolata

Article Open access 30 November 2021

Main

Clonal reproduction is the process of generating (potentially) physically independent multicellular organisms (that is, ramets sensu¹) via mitosis, a widespread life history among animals, plants, macroalgae and fungi². Starting from a single zygote, multipotent somatic cells proliferate to form new ramets via branching or budding, often becoming physiologically independent after a few years when severing from the parental tissue. All modules or ramets stemming from that single zygote represent a genet (or clone). Often, the contribution of sexual and clonal reproduction to local population structure varies among species and localities^3,4,5, resulting in asexual populations of ramets that are nested within the ‘classical’ population of genets^2,6. Coral, algae, seagrass or poplar genets, for example, can reach considerable size and, therefore, age with linear extents of >1 km (refs. ^7,8,9,10,11). The apparent persistence and resilience of asexual ramet populations is astonishing in light of the considerable temporal and spatial variation they may experience over their lifetimes despite little genetic variation (but see refs. ^10,12) and raises questions about these species’ adaptability in a rapidly changing climate¹³.

As a key parameter to evaluate this persistence, genet age/longevity has been inherently difficult to estimate, in particular, when biomass tracing back to an individual’s origin is not preserved, as is the case in non-woody plants¹⁴. For example, a small genet is not necessarily young if episodes of ramet mortality reduced its size in the past. To estimate genet age via molecular genetic methods, somatic genetic variation (SoGV) segregating among ramets has previously been used. However, those attempts lacked resolution, as the SoGV could be estimated at only a few marker loci^9,15.

In this Article, we present a novel approach to estimate genet age on the basis of a somatic genetic clock that uses complete genome information of the focal species. Molecular clocks were initially developed for species-level phylogenies and rely on the neutral theory of molecular evolution¹⁶. Fixed neutral mutations within species accumulate at a constant rate equal to the rate of spontaneous mutations¹⁷, and thus, genetic differences between species increase with absolute time^18,19. If the mutation rate can be derived on the basis of calibration points such as fossil evidence, clock estimates can be extended to phylogenetically related clades²⁰. Recently, fixation of SoGV was demonstrated in clonal species through a process of somatic genetic drift¹². During genet growth via new ramet formation, somatic mutations become fixed in the descendant ramets, essentially because only a few pluripotent cells of the proliferating tissue are recruited to form the new module or ramet^12,21,22 (Fig. 1 and Supplementary Fig. 1). Here, we built upon these findings and introduce the somatic genetic clock that uses the rate of genome-wide, asexual fixation of alleles to estimate the extent of differentiation between the founder and descendant ramets of a genet. In doing so, we can infer the time to the least common ancestor of multiple or pairs of ramets, here the zygote, and derive a ‘somatic genetic clock’ that permits the precise ageing of large plant clones (genets) and, possibly, other clonal animal, macroalgal or fungal species.

**Fig. 1: Dynamics of SoGV in generic clonal organisms.**

Results

A generic somatic genetic clock in clonal species revealed by modelling and simulations

To estimate the time over which fixed SoGV accumulates and segregates under clonal growth, we developed a stochastic, agent-based model of a generic clonal organism that comprises a collection of modules, adapted from population genetics models of cancer evolution²³ (Methods). Within this model, a module is simplified to the stem cell population of a single ramet (all somatic cells are derived from stem cells and, thus, can be ignored). Cells and modules are subject to stochastic update events including cell division, death and the formation of new modules, with new Poisson-distributed mutations occurring at each cell division. We considered a range of scenarios with different types of stem cell division (symmetric versus asymmetric) that characterize stem cell dynamics in clonal species. Specifically, we compared different (founder) stem cell pool sizes, and varying rates and mechanisms for forming new modules (branching versus splitting), attempting to capture possible life history variation in clonal species across the tree of life (Fig. 2). We found that, given sufficient time, any scenario would converge to a constant accumulation rate of fixed SoGV, and thus, the number of fixed SoGV would increase linearly with clonal age (Fig. 3 and Supplementary Figs. 2 and 3) as required for a useful molecular clock.

**Fig. 2: Processes determining fixation rates of SoGV in a generic clonal organism.**

**Fig. 3: Agent-based model predictions for the accumulation of fixed somatic mutations via somatic genetic drift.**

The accumulation rate of fixed SoGV was determined solely by the mutation rate per cell per site per year. While the module formation rate (r) does not directly impact the accumulation rate of fixed SoGV (Fig. 1), it can have a small indirect effect by altering the mutation rate, either as a result of stochasticity or because of different effective mutation rates during homeostasis and growth. This effect is small (Supplementary Fig. 4), and we consider it negligible for biologically relevant parameter ranges. The relative constancy despite different module formation rates, that is, asexual generation times, is equivalent to the classical molecular clock being dependent only on mutation rate and not sexual generation time^17,19,24.

We next explored the duration of the lag phase before linearity is reached and found that it depended upon the size of the stem cell pool per module (N), the number of founder stem cells that are recruited to form new modules (N₀), the ratio of symmetric versus asymmetric cell division, the rate of stem cell division (b), the rate at which new modules are formed (r) and whether they are formed by branching or splitting (Fig. 3a and Supplementary Figs. 5 and 6). Module formation via a small number of founder stem cells (small N₀) reached a linear equilibrium fast for both branching and splitting (Fig. 3a and Supplementary Fig. 6). The duration of the lag phase increased substantially for a large number of founder cells and/or solely asymmetric stem cell divisions. Fixation of SoGV occurs due to the repeated formation of new modules, during which the population of cells that form the module undergoes a bottleneck (Fig. 2). Additionally, fixation can occur due to homeostatic cell turnover within the module if, and only if, there is symmetric cell division, while this cannot occur for purely asymmetric divisions.

Next, we estimated the conditional fixation times for different clonal species’ life histories. Assuming asymmetric cell division, fixation occurs only due to repeated module formation, which can be represented as a modified Wright–Fisher process. We derive the conditional fixation times, which are approximately \(4{N}_{0}\left(1-{N}_{0}/N\right)/r\) (equation (1)) for module splitting and \(4{N}_{0}/\left(1-{N}_{0}^{2}/{N}^{2}\right)r\) (equation (2)) for module branching (see Supplementary Note 1.3 for the derivation using a diffusion approximation). Thus, fixation times may be decreased by reducing N₀, even when N is large (Supplementary Fig. 5). For symmetric cell division, fixation due to homeostatic cell turnover usually dominates, because the cell division rate b is greater than the module formation rate r. The conditional fixation time is therefore better represented by a Moran process, and is approximately \(N/b\) (equation (3), ref. ²⁵). The conditional fixation time can be considered as a lower bound on the lag phase to reach the equilibrium accumulation rate of fixed SoGV. Thus, these equations indicate the absolute timescale over which the somatic genetic clock is applicable for different species life histories.

Finally, we also considered additional complications with respect to the developmental mode of the clonal organisms. Under (1) stochastic quiescence, homeostatic modules move in (and out) of a quiescent state with a fixed rate; while under (2) seasonal quiescence, all modules become quiescent during a winter period (Supplementary Note 1). We also consider the possibility that mutations may occur during the cell lifetime, as well as at cell division. To this end, we introduce a time-dependent mutation rate ξ in addition to the per-cell mutation rate μ (Supplementary Note 1 and Supplementary Fig. 7). The lag phase before linearity is increased for both quiescence regimes (Supplementary Fig. 7a–c), indicating that the average rate of module formation across the population is lowered. However, linearity is still reached in all cases.

Application of the somatic genetic clock in a seagrass

We then applied the somatic genetic clock to the seagrass Zostera marina (eelgrass), an emerging model for evolution in clonal plants. We first examined the structure of the shoot apical meristem (SAM) containing a population of stem cells in higher plants²⁶ via laser confocal microscopy. We were interested in evidence for SAM stratification that determines the spread of SoGV across tissues²⁷, along with the likely number of stem cells (N) and module founder cells (N₀), as well as the stem cell division mode (symmetric or asymmetric) (Supplementary Note 2). We found that the SAM was organized into one-layered L1 (tunica) and underlying L2 (corpus) as in many other monocotyledonous plant species (Supplementary Fig. 8a). No periclinal cell division in L1 was observed during the formation of axillary meristems, indicating a stable boundary between L1 and L2 (Supplementary Fig. 8b–d). In contrast, frequent periclinal cell divisions in L1 were observed during the formation of leaves, which suggested that L1 mostly or exclusively contributed to leaves (Supplementary Fig. 9). A likely number of L1 stem cells is between 7 and 12 with possible both asymmetric and symmetric cell division modes (Supplementary Fig. 10). From this population, about three or four stem cells give rise to cells that form a new module.

Next, we addressed how a SoGV can become fixed throughout the entire tissue of a new module despite meristem stratification. Indeed, we find clear allele fixation at f = 0.5 in variant frequency diagrams (for example, >7,000 with f = 0.5; ref. ¹² and Supplementary Fig. 11). Although shoot meristems are generally stratified in Z. marina as in other angiosperms²⁸ (Supplementary Fig. 8), it cannot be excluded that infrequent periclinal cell divisions occur in the L1 (ref. ²⁹) leading to SoGV fixation in all tissues. Note that leaf tissues that are derived exclusively from L1 (Supplementary Fig. 9) were predominating in the sample used for bulk sequencing. We thus continued by simplifying the fixation dynamics by assuming a one-layer case, enabling the application of our model of a generic clonal organism to eelgrass. However, assuming that cell growth and division frequency is similar across layers³⁰, the model can be applied to any cell layer and derived organs in stratified meristems³¹.

We parametrized the model for eelgrass and focused on the most likely range with N = 7–12 and N₀ = 3–4 (Fig. 3) but also considered more extreme scenarios ranging from the strongest (N = 7, N₀ = 1) to the weakest (N = 12, N₀ = 6) intensity of somatic genetic drift, in combination with branching rates 3–8 yr⁻¹ (refs. ^32,33). The accumulation rate of fixed SoGV remained similar (Fig. 3b and Supplementary Figs. 12a and 13), indicating that mutation accumulation on the size of the SAM and rate of asexual reproduction was negligible.

Using equations (2) and (3), we estimated the conditional fixation times for novel mutations under asymmetric and symmetric cell division, respectively, within an eelgrass clone. For the most likely parameter range, these gave reasonable lower and upper bounds of 2 years (N = 7, N₀ = 3, r = 8 yr⁻¹) and 6 years (N = 12, N₀ = 4, r = 3 yr⁻¹) for asymmetric cell division, and 0.05 years (N = 7, b = 122 yr⁻¹) and 0.1 years (N = 12, b = 122 yr⁻¹) for symmetric division. This suggests that a constant accumulation rate required for the somatic genetic clock will be reached relatively fast in eelgrass, in the order of years or even months. This is verified by our simulations (Fig. 3b) in which we observe very small lag times (\(\lesssim\)1 year) for symmetric cell division. For asymmetric cell division, it took longer to reach an equilibrium, with the time increasing for smaller module formation rate (r) and larger (founder) module size. However, the lag times still appeared in the order of years, rather than decades.

Calibration of the somatic genetic clock

Next, two long-term cultivation experiments with Z. marina genets of known age (4 and 17 years, respectively) allowed for a calibration of the somatic genetic clock. Owing to statistical noise in estimating the true allele frequency via mapped reads at a given locus, differentiating between mosaic and fixed SoGV is inherently difficult. Hence, we developed the variable ‘Variant Read Frequency 50 (R_x)’ (hereafter VRF50(R_x)) as a proxy for the number of fixed SoGV in ramet ‘R_x’ relative to the founder of the genet (Methods and Supplementary Fig. 13). The mean VRF50(R_x) of a ramet population can be used as estimator for its age, that is, the time since founding by a parent genet or a zygote. In order to calibrate the somatic genetic clock for Z. marina, genets of known ages (4 and 17 years) were deep-sequenced (~900× and ~80× for the genets aged 4 and 17 years, respectively) to calculate the accumulation rate of VRF50(R_x). The role of sequencing depth and type of mutation caller were also examined (see Methods for details; Supplementary Tables 1–3). The mean VRF50(R_x) and the age of a genet were used to fit a linear model (Fig. 4a; y = 0.5044x − 1.4641, adjusted R²: 0.9483, P < 0.001). Accordingly, we find a rate of fixed mutation accumulation of 4.6 × 10⁻⁹ per year per site, similar to estimates in Arabidopsis³⁴.

**Fig. 4: Estimating the age of globally distributed eelgrass (*Z. marina*) clonal lineages based on the somatic genetic clock.**

To verify that our data could be used to accurately calibrate the clock, we re-created the sampling strategy for both timepoints, that is, 4 and 17 years, by simulation and estimated the accumulation rate of fixed SoGV (Fig. 3c and Supplementary Fig. 12b). Considering data from 100 simulations for each parameter setting, we observe similar estimated rates in all cases. The difference between the mean estimated rate and ‘true’ rate was between 0.1% and 8%, where the maximum difference is for the most extreme case (N = 12, N₀ = 6, r = 3 yr⁻¹). The standard deviation (s.d.) for each parameter setting ranged between 0.10 and 0.15 (mutations per year). As this was similar in magnitude to other sources of error, we consider that our calibration genets with known ages of 4 years and 17 years can be safely used for calibration. However, increasing the number of samples (more genets at given age/a higher range of genet ages) would probably reduce the error resulting from sampling.

Age estimation of 15 globally distributed Z. marina genets

We then used the calibrated somatic genetic clock to estimate the age of eelgrass genets in a worldwide data set³⁵ (Fig. 4b and Supplementary Data 1). Among the 15 genets with two or more ramets sampled, most were <40 years old (Fig. 4c), while 4 attained >270 years (Fig. 4d), 1 in Estonia (352 years), 2 in Norway (271 and 847 years) and 1 in Finland (1,403 years). All genets >270 years of age were located in higher latitudes (>50° N) in the North Atlantic, indicating that marginal populations were more likely to maintain old genets^4,11,36 and supporting the established geographic parthenogenesis pattern³⁷. Although the evolutionary history in the Pacific is much longer than that in the Atlantic³⁵, Pacific eelgrass genets were young (<40 years). In addition, the old clonal lineages were distributed in the locations that were recolonized by glacial refugia after the Last Glacial Maximum, indicating that clonal reproduction is a particularly successful reproductive mode to rapidly colonize newly opened areas⁴. Note that age estimates based on spatial extent would have been misleading, as genets with small spatial extent were found to be >300 years old. For example, while genet ES_C01 in Estonia contained only three ramets spread over ~20 m (Supplementary Fig. 15), it was estimated to be 352 years old based on the somatic genetic clock.

We also examined the mutational spectra of the four oldest genets detected here to six sexually segregating North Atlantic populations to identify possible differences between germline and asexually generated, mitotic mutations (Supplementary Fig. 16). Both mutational patterns show the well-known predominance of transitions over transversions in plants^34,38. In particular, G:C → A:T transitions contributed 61 ± 4% and 66 ± 3% (mean ± 1 s.d.; n = 6 and 4, respectively) of all single-nucleotide polymorphism (SNPs) in asexual versus sexual populations, respectively, regardless of their trinucleotide context (Supplementary Fig. 16a,b).

Discussion

We present a somatic genetic clock that permits precise age estimates of genets in clonally growing plants, and possibly, many clonal animal, fungal and macroalgal species. The duration of the lag time before the DNA-sequence-based somatic genetic clock approaches linearity decreases for fewer stem cells and founder stem cells; for symmetric rather than asymmetric cell divisions; and for increased rates of new module formation. Hence, an application of the somatic genetic clock is most accurate for estimating clonal age if the stem cell population size N is small and new module formation happens through a small founder cell population N₀ as realized in plant SAMs. In organisms that asexually reproduce through budding, time to linearity will depend on the number of cells contributing to the new bud. Conversely, marine invertebrates or algae that propagate asexually through fission will have an exceedingly long lag time, as essentially half of all body cells comprise the founder cell population N₀. By applying our analytical results (equations (1)–(3)), we are able to estimate the timescale over which the somatic genetic clock is applicable for any given organism.

Once linearity is reached, the rate of the somatic genetic clock is constant across module formation rates and, thus, asexual generation times, which is the hallmark of a valid molecular clock. Similar to the rate constancy despite different generation times in species-level phylogenies^17,24, under a higher module formation rate, fewer mutations are fixed by any single module formation event, but the total number of module formation events is higher (Fig. 1), and vice versa. Our proposed clock is analogous to mitotic evolution in non-modular species, such as humans, specifically the emergence of genetic heterogeneity among healthy and cancerous human somatic tissues within an individual^39,40. Somatic mutations accumulate linearly with age in human stem cells and fixate at a constant rate locally in spatially constrained stem cell populations, for example, in colon crypts or skin^41,42. Similarly, we find that the number of fixed SoGV between founder and descendant ramets also accumulates at a constant rate.

Currently, we cannot distinguish mutations resulting from DNA replication errors during mitotic divisions from those occurring outside cell division. Indeed, recent studies suggest that somatic mutations can also accumulate with age in both plants and animals^41,43. This indicates that, independently of cell division dynamics, other factors such as ultraviolet radiation, transposons or insufficient DNA repair systems could also increase the accumulation of mutations over time. A comparison of mutational spectra (Supplementary Fig. 16a,b) does not suggest that the frequency of a type of transitions commonly associated with environmental stress in plants is increased under long-term clonal growth. Even if this was the case in other species, it would rather enhance the validity of our somatic genetic clock, as it decouples somatic mutation accumulation from developmental processes.

The stem cell population dynamics during module formation are currently unknown for most clonal species other than angiosperms. The latter are complicated due to stem cell stratification into layers²⁸. However, even under these circumstances, the somatic genetic clock can be applied when either the sampled tissue is dominated by one meristematic layer (as is the case in eelgrass Z. marina) or when descendant tissues of a certain stem cell population can be clearly distinguished among the adult plant organs³¹.

Our findings on fixation processes will also apply to an evolutionary epigenetic clock that was recently described for self-fertilizing and clonally reproducing plants⁴⁴. This clock uses the much faster accumulation of neutral gene body (de)methylations of cytosine nucleotides. As an additional step, the identification of genomic regions with clock-like behaviour of (de)methylation is required⁴⁴. The somatic genetic clock proposed here is complementary and will be best suited for slightly longer time intervals of >10 years to potentially tens of thousands of years, and where methylation data are unavailable. Here, we provide the theoretical foundation why both, the somatic genetic clock and the evolutionary epigenetic clock⁴⁴, are ultimately determined by mutation rate, as is the case for general molecular clocks²⁴.

Some of the analogies of our modelled and observed temporal dynamics with classic population genetics are instructive. In our study, the stem cell population size, and the time period between two adjacent branching events, correspond to the population size N_e, and generation time in classic population genetics, respectively. Due to the usually large N_e (>100) in combination with genetic exchange among lineages, classic molecular clocks are limited to macro-evolutionary timescales (~10⁵–10⁸ years). However, the stem cell population size in plants is extremely small (for example, 7–12 for eelgrass, but for other angiosperms often only 3–4 (ref. ²⁶) or even only 2 in some species of ferns⁴⁵), and module formation events often occur multiple times per year, which makes somatic genetic clock solid for recent timescales. Note that the time until stem cell populations are ‘saturated’ with standing genetic variation, resulting from novel mutations, increases with population size N_e, similar to time lags required for a population to reach mutation–drift equilibrium in population genetics⁴⁶.

With increasing availability of full genome data at the population level, our study provides an achievable and accurate method for estimating the age of clonal plants and, possibly, other clonal species in the animal, macroalgal and fungal kingdom². It opens multiple new research avenues to model the demography, resilience and evolution of the many species that are facultatively clonal, and where direct and precise ageing information was previously unavailable.

Methods

Simulating fixed mutation accumulation in a clonal organism

We implemented a stochastic, agent-based model of a clonal organism, adapted from population genetics models of cancer evolution²³. The organism is represented as a population of modules that grows to a fixed size Z by producing new modules via module splitting or branching (Table 1). Modules consist of stem cells and have different dynamics depending on whether they are in growth or homeostasis. During the growth phase, the module grows by cell division, which is implemented by a stochastic pure-birth process with rate b. Once the module reaches size N, it enters homeostasis. Cell divisions are coupled with cell deaths, so that the population size remains constant. This is done either by implementing an asymmetric update (a cell divides producing only one progeny) or a symmetric update (a cell divides producing two progeny and another cell is removed from the module). This symmetric update corresponds to a Moran process. Dividing cells acquire novel, Poisson-distributed mutations with mean μ.

Homeostatic modules produce new modules at rate r. This is done by module splitting or module branching. For module splitting, the parent module donates N₀ cells to the new child module. Both parent and child modules then re-enter the growth phase. For module branching, N₀ cells are sampled without replacement from the parent module and then copied to form the child module that enters a growth phase. The parent module is unchanged. If the population of modules has reached maximum size Z, a randomly selected module is killed whenever a new module is formed to keep the population size constant. The simulation is implemented using a Gillespie algorithm⁴⁷:

1.
Initialize the simulation with one module that is formed of a single cell, t = 0.
2.
Calculate the transition rates for all transitions:
1. a.
  Cell division in a growing module: \({{{r}}}_{{\rm{a}}}={{{bn}}}_{\text{growth}}\)
2. b.
  Symmetric division in a homeostatic module: \({{{r}}}_{{\rm{b}}}=\lambda {{N}}{{{Z}}}_{\text{homeostatic}}\)
3. c.
  Asymmetric division in a homeostatic module: \({{{r}}}_{{\rm{c}}}={{\gamma }}{{N}}{{{Z}}}_{\text{homeostatic}}\)
4. d.
  New module formation: \({{{r}}}_{{\rm{d}}}={{r}}{{{Z}}}_{\text{homeostatic}}\)
Here, \({n}_{\text{growth}}\) is the total number of cells in growing modules and \(Z_{\text{homeostatic}}\) is the number of homeostatic modules. We set λ = b (or b/2), γ = 0 for purely symmetric division and \({{\lambda }}=0,{{\gamma }}=b\) for purely asymmetric division.
3.
Transition i is chosen with probability \({{{r}}}_{i}/\left({{{r}}}_{{\rm{a}}}+{{{r}}}_{{\rm{b}}}+{{{r}}}_{{\rm{c}}}+{{{r}}}_{{\rm{d}}}\right).\) If a cell division occurs during any transition, the newly divided cells acquire \(M\sim \text{Poisson}\left({\rm{\mu }}\right)\) novel mutations. Possible transitions are:
1. a.
  Choose a cell to divide uniformly at random from all cells in growing modules.
2. b.
  Choose a homeostatic module, uniformly at random. From that module, choose a cell to divide and a different cell to remove, uniformly at random (Moran update).
3. c.
  Choose a homeostatic module, uniformly at random. From that module, select a cell to divide, also uniformly at random. One progeny cell remains in the module, and the other is removed (asymmetric division).
4. d.
  Choose a homeostatic module uniformly at random to be the parent module, and if \(Z={Z}_{\max }\), choose a second module to die. A new module is formed from the parent module by (i) splitting or (ii) branching. First, select \({N}_{0}\) cells from parent module without replacement, then,
  1. (i)
    Module branching: copy them to form a new module, leaving the parent module unchanged.
  2. (ii)
    Module splitting: remove them from the parent module to form a new module.
4.
Update the time \(t^\prime=t+\updelta t\), where \(\updelta t \sim \text{Exponential}\left(1/\left({r}_{{\mathrm{a}}}+{r}_{{\mathrm{b}}}+{r}_{{\mathrm{c}}}+{r}_{{\mathrm{d}}}\right)\right)\).
5.
Repeat steps 2–4 until \(t={T}_{\max }\).

Data are generated at discrete time steps for the number of fixed SoGV in each module.

Shoot apex preparation and imaging in laser confocal microscope

Z. marina plants collected in Falckenstein, Kiel Fjord (54.392° N, 10.192° E) were kept at 8–12 °C temperature and 150 μmol quanta s⁻¹ m⁻² light intensity in 800-litre indoor wave tanks, the ‘Zosteratron’, receiving ambient Baltic seawater while rooted in ambient sediment (12 cm deep), with an intake pipe approximately 10 km distant from the collection site. The plants were then either moved immediately to room temperature for 2–3 days and imaged, or the temperature was slowly raised to 16 °C temperature for 7 days to induce growth before imaging. We used the plants at the vegetative phase of development.

For the imaging in the laser confocal microscope, plants were dissected in filtered seawater using tweezers and fine medical needles under a stereo-microscope (Nikon) so that all leaf primordia covering the SAM were cut off. Isolated shoot apices (SAMs with the youngest leaf primordia) and axillary meristems were fixed and prepared for the imaging according to ClearSee-based clearing method⁴⁸. Isolated apices were fixed with 4% paraformaldehyde dissolved in the phosphate-buffered saline buffer (pH 6.9–7.0 adjusted with HCl) for at least 2 h (at the first hour, under vacuum). Apices were washed twice in the phosphate-buffered saline buffer for at least 2 min, and incubated for 7–18 days in the ClearSee solution (2% urea, 10% xylitol and 15% sodium deoxycholate) at room temperature with gentle stirring. The ClearSee solution was changed every 1–2 days. Cell walls were stained with 0.05% Fluorescent Brightener 28 (FB, Sigma) dissolved in the ClearSee solution for at least 30 min, rinsed in the ClearSee solution and washed in fresh water for 1–2 min.

For the imaging, the apices were mounted in small containers filled with 5% of low-melting-point agarose and kept in fresh water. The imaging was performed using an upright confocal laser-scanning microscope (Leica TCS SP8) with long-working distance water-immersion 40× objective. For the FB, excitation and emission 405 nm and 425-475 nm wavelengths were used, respectively. Images were collected at 12 bits. Scanning speed was set at 400 Hz with 512 × 512 or 1,024 × 1,024-pixel frames, zoom at 0.75–2.0 and z-step at 0.3–0.8 µm. The pinhole was set at 1AU (airy units).

Image processing and analysis

Original confocal z-stack images (LIF) were converted using Fiji (https://fiji.sc) to TIFF files, which were then processed with the MorphoGraphX (MGX) v.2.0.1 (ref. ⁴⁹) to obtain top or site views and optical sections. To analyse the structure of apices, a series of optical 2–4-µm-thick sections were performed parallel and perpendicular to the SAM major axis (longitudinal and transverse sections, respectively). Developmental stages of leaf primordia were estimated on the basis of optical transverse sections through the apex. The p1 is the youngest primordium apparent as a bulge at the SAM surface. The successive stages were numbered in ascending order (p2, p3 and so on; Supplementary Figs. 8–10).

To estimate the number of stem cells at the SAM surface, cell clones were analysed (Supplementary Fig. 10). Cell clones (usually containing 4–16 cells) were recognized on the basis of the history of cell divisions at projections of SAM anticlinal cell walls. Specifically, the FB signal was projected in the MGX software from the defined depth (0–3 µm) onto the SAM surface. At these projections, the signal is the most intense in newly formed cell walls corresponding to most recent cell divisions (higher-order divisions). The signal in the oldest cell walls (regarded as clone borders) is the weakest due to a furrow formed over time between descendant cells.

Parameterizing the model for eelgrass

The modelling for eelgrass was focused on layer L1. New module formation was implemented by module branching, reflecting the fact that in eelgrass the new SAM is not directly derived from the stem cells (Supplementary Note 2). The following parameter range was used: b = 122 yr⁻¹ (ref. ²⁶); r = 3–8 yr⁻¹ (refs. ^32,33); N = 7–12; N₀ = 1–7; Z = 1,000; μ = 0.0069. Both symmetric and asymmetric cell division were considered by setting λ = b/2, γ = 0 or λ = 0, γ = b, respectively.

Eelgrass genets for calibration cultured in the lab

Four-year-old eelgrass calibration genets

Three small eelgrass patches, consisting of 17–25 leaf shoots were collected in April 2019 from an eelgrass meadow in Kiel, Germany (Falckenstein, 54.392° N, 10.192° E). To confirm clonal identity, each patch was carefully excavated by divers to examine rhizome connections and additionally genotyped with nine microsatellite loci⁵⁰. In the Baltic Sea, seeds germinate in March or April, while plants become mature at the end of year one. The observed number of shoots can be obtained by branching in the second year. Hence, we infer that the collected eelgrass patches were probably founded by seeds that germinated in 2017 and started branching in 2018. Plants were tagged, planted into 40-litre plastic boxes to a sediment height of 15 cm and placed into 800-litre wave tanks with flow-through ambient Baltic seawater at the GEOMAR Helmholtz Center for Ocean Research Kiel, the ‘Zosteratron’. Leaf shoot number was regularly reduced to allow clones to regrow and branch. In 2022, 3 years after start of the cultivation, one leaf shoot from each of boxes was selected and resequenced to ~900× coverage using a Novaseq 6000 S4 platform (paired end reads of 150 bp). The estimated time between tissue collection and seedling emergence was 4 years (3 years in the lab + 1 year in the field). Sequence data are available at BioProject no. PRJNA1025927, accession no. SRR26321801-804 and SRR26321811-812.

Seventeen-year-old eelgrass calibration genets

Data are from a whole-genome resequencing of two eelgrass genets with a known age of 17 years (ref. ⁵¹). Each genet was initiated by a single shoot collected from Bodega Harbour, California, in July 2004. Before sample collection plants had been kept for 17 years in large, 300-litre outdoor flow-through mesocosms at Bodega Marine Laboratory under ambient light and temperature conditions⁵². Six and five ramets were collected from each genet for genomic analysis in 2021, respectively. The clone assignment was checked on the basis of shared heterozygosity⁵¹. Ilumina sequencing data are available in the National Center for Biotechnology Information (NCBI) sequence read archive (~80×, BioProject no. PRJNA806459, SRA accession nos. SRR18000159–SRR18000170).

Sampled eelgrass genets in the field

ES_C01-ES_C03

We conducted novel whole-genome resequencing for ten leaf shoots collected from an eelgrass meadow in Estonia (Supplementary Fig. 14). They were chosen from a larger sampling based on microsatellite data that suggested they belong to three genets, containing three, four and three ramets, respectively. This was confirmed by whole-genome SNPs. The clonal lineages were named ‘ES_C01’ to ‘ES_C03’ in this study. Data are available in BioProject no. PRJNA1025927, SRA accession nos. SRR26321797–SRR26321810.

YU20_FI

Whole-genome resequencing for 24 ramets of a single large eelgrass genet was conducted in Finland at Ängsö¹². The next-generation sequencing data are available in the NCBI short read archive (~80×, BioProject no. PRJNA557092, SRA accession nos. SRR9879327–SRR9879353).

YU23_C01-YU23_C11

In a large population data set encompassing Pacific and Atlantic sites, 190 ramets from 16 geographic locations were re-sequenced³⁵, which revealed 11 genets in total that comprised 2–13 ramets. Previously, only one ramet per detected genet was included in subsequent phylogeographic analyses. Here, genets were named ‘YU23_C01’ to ‘YU23_C11’, and the respective among-ramet genetic differentiation was used for age determination. Next-generation sequencing data are available in the NCBI short read archive³⁵.

Whole-genome resequencing data of new populations

Bulk DNA of the meristematic region and the basal portions of the leaves was extracted using NucleoSpin Plant II Kit (Macherey-Nagel). DNA concentration was determined using a Qubit Fluorometer (Thermo Fisher Scientific) and Nanodrop Spectrophotometer (Thermo Fisher Scientific), and DNA quality was checked by agarose gel electrophoresis. DNA was sent to Beijing Genomics Institute (Hong Kong) for library construction and sequencing. The libraries were sequenced on either Novaseq 6000 S4 platform (PE150bp) or Hiseq Xten platform (PE150bp).

Mapping the sequencing data to the reference genome

We assessed the quality of the raw reads using FastQC v0.11.7 (https://www.bioinformatics.babraham.ac.uk/projects/fastqc/). BBDuk (https://jgi.doe.gov/data-and-tools/software-tools/bbtools/bb-tools-user-guide/bbduk-guide/) was used to remove adapters and for quality filtering according to the following criteria: (1) sequence downstream with quality <20 was trimmed (trimq = 20); (2) reads shorter than 50 bp after trimming were discarded (minlen = 50); (3) reads with average quality below 20 after trimming were discarded (maq = 20). FastQC was used to do a second round of quality check for the clean reads. Clean reads were then mapped against the Z. marina reference genome v2.1 (ref. ⁵³) using BWA-MEM v0.7.17 (ref. ⁵⁴) with default parameters. The aligned reads were sorted using SAMtools v1.7 (ref. ⁵⁵), and duplicated reads were marked using MarkDuplicates tool in GATK v4.0.1.2 (ref. ⁵⁶). Only properly paired reads (0 × 2) with MAPQ of at least 20 (-q 20) were kept using SAMtools.

Clone assignment check for ramets collected from Estonia

GATK4 was used to conduct joint SNP calling for the ten eelgrass ramets selected at three sites in Estonia. HaplotypeCaller was used to generate a GVCF format file for each individual, and GenotypeGVCFs was used for SNP calling based on the combined GVCF file from CombineGVCFs. After filtering (GitHub), the shared heterozygosity method⁵¹ was used to verify clonemate pairs that had already been pre-selected by microsatellite genotyping of a larger number of ramets (n = 10–15) per site.

Somatic polymorphism calling and calculation of VRF50(X ₁, X ₂)

Eelgrass (Z. marina) is diploid, and ~99.67% of the genome is homozygous. Hence, in most cases, a somatic mutation changes a homozygous to a heterozygous genotype. For SNP detection, the software packages Mutect2 (ref. ⁵⁷) and Strelka2 (ref. ⁵⁸) developed originally for cancer mutation calling were used. Both SNP callers compare the ‘normal’ sample and the ‘tumor’ sample. Here, SNPs were assumed to represent the ancestral ‘normal’ case if homozygous for the reference allele, because most novel mutations will turn a homozygous to a heterozygous site. Accordingly, the ‘tumor’ sample carried the novel alternative allele. For a specific Mutect2/Strelka2 run with X₁ as the ‘normal’ sample and X₂ as the ‘tumor’ sample, we used VRF50(X₁, X₂) to represent the number of somatic mutations in X₂ with a variant read frequency (VRF) ≥0.5. VRF50(X₁, X₂) was calculated as the number of SNPs meeting the following criteria: (1) the coverage of X₁ ≥ 12; (2) the coverage of X₂ ≥ 23; (3) the VRF of X₁ ≤ 0.01; (4) the VRF of X₂ ≥ 0.50.

We also examined the role of sequencing depth on the SNP calling results. Two data sets were compared: (1) three ramets of the oldest Finnish clone sequenced to 1,370× depth using a Novaseq platform versus 80× coverage on an Illumina platform (Supplementary Table 1); (2) randomly reducing a 900× data set to 80× coverage for three 4-year-old calibration genets (Supplementary Table 2).

Analysis of mutational spectra

Mutational spectra of soma and germline mutations were compared. Mutations were extracted from vcf files after SNP calling and classified according to substitution types and one base up- and downstream context into 96 categories. Graphs were produced with the MutationalPatterns R package. We compared germline mutations within six North Atlantic populations derived from the 11,705 core SNP set from ref. ³⁵ to somatic mutations detected in the four oldest genets identified in this paper (cf. Fig. 4b).

Calculation of VRF50(R _x)

During clonal growth, the fixation of SoGV within all the stem cells leads to substitutions compared with the founder ramet (for the eelgrass case, see Supplementary Fig. 1). We defined S(R_x) to represent the number of the fixed SoGV (that is, substitutions) in the ramet R_x compared with the founder seedling/ramet. By definition, the fixed SoGV have an allele frequency of f = 0.5 under diploidy. Based on sequencing data, allele frequency could be estimated by the VRF. In the histogram of VRF, the fixed SoGV form a peak at VRF 0.5 (Supplementary Fig. 1). However, for a normal coverage (<100×), mosaic distribution overlaps with the left-hand part of the fixation distribution. Hence, we focused on only the right-hand part of the fixation distribution, and used VRF50(R_x) as a proxy for S(R_x), which was the number of the fixed SoGV with a VRF ≥0.5.

After a specific time period from the initiation of the clonal lineage, the number of fixed SoGV in a ramet/module R_x, S(R_x), is expected to follow a Poisson distribution, S(R_x) ~ Poisson(λ). For a given S(R_x), the VRF has equal probability to be >0.5 or <0.5 and, thus, VRF50(R_x) is assumed to follow a binomial distribution, VRF(R_x) ~ B(S(R_x), 0.5). The expectation of VRF50(R_x) is 0.5λ.

We used VRF50(R_x)_obs to represent the value of VRF50(R_x) detected from the sequencing data that sufficiently cover a subset of the reference genome. To obtain VRF50(R_x)_obs, the most straightforward logic would be to compare the founder ramet/seedling and the target ramet R_x. However, the founder did not exist anymore after it had divided into two daughter ramets. Thus, we did an indirect calculation of the VRF50(R_x)_obs (Supplementary Fig. 13). For example, to obtain VRF50(R01)_obs, each of the other collected ramets of the same clonal lineage was used as the ‘normal’ sample in SNP calling (Mutect2 or Strelka2), and the maximum value for VRF50(clonemate of R01, R01) was assigned to VRF50(R01)_obs. Both SNP caller packages Mutect2 and Strelka2 were used to calculate VRF50(R_x)_obs for the clonal lineages with known age, and the results were similar (Supplementary Table 3). For the remainder, we used Mutect2 for comparability with older results¹² and as it seems more conservative.

Note that the sequencing data sufficiently cover only a subset of the genome. To estimate the genome coverage, HaplotypeCaller (GATK4) was run for each ramet using BP_RESOLUTION mode (-ERC BP_RESOLUTION). We then counted the number of the nucleotide sites with coverage ≥23 (that is, Size_e). The average VRF50(R_x) for a clonal lineage was calculated as (average VRF50(R_x)_obs)/(average Size_e) × total genome size. The 95% confidence interval of the average VRF50(R_x) was estimated on the basis of the Poisson distribution, that is, average VRF50(R_x) ± 1.96 × sqrt(average VRF50(R_x)) (Supplementary Data Table 1).

Estimating the number of mutations and genet age

The average VRF50(R_x) and the age for the clonal lineages with known age were used to fit a linear model (Fig. 4, y = 0.5044x − 1.4641, adjusted R² = 0.9483, P < 0.001), based on which the age of other clones was estimated (Fig. 4 and Supplementary Data Table 1). The number of fixed mutations that had accumulated in a ramet population of a given genet was calculated as 2 × VRF50(R_x) assuming a symmetric distribution of VRF of fixed somatic SNPs at f = 0.5 (Supplementary Data 1).

Table 1 Model parameters

Full size table

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

All DNA sequence data have been deposited in Genbank (Sequence Read Archive, detailed metadata in Supplementary Data 1). Source data are provided with this paper.

Code availability

Custom-made scripts and intermediate data steps were deposited on GitHub (https://github.com/leiyu37/SomaticGeneticClock (bioinformatics) and https://github.com/jessierenton/somatic-genetic-clock (modeling)).

References

Harper, J. L. in Population Biology and Evolution of Clonal Organisms (eds Jackson, J. B. C., Buss, L. W. & Cook, R. E.) 1–33 (Yale Univ. Press, 1985).
Reusch, T. B. H., Baums, I. B. & Werner, B. Evolution via somatic genetic variation in modular species. Trends Ecol. Evol. 36, 1083–1092 (2021).
CAS PubMed Google Scholar
Eriksson, O. Seedling dynamics and life histories in clonal plants. Oikos 55, 231–238 (1989).
Google Scholar
Eckert, C. G. The loss of sex in clonal plants. Evol. Ecol. 15, 501–520 (2001).
Google Scholar
Orive, M. E. & Krueger-Hadfield, S. A. Sex and Asex: a clonal lexicon. J. Hered. 112, 1–8 (2021).
PubMed Google Scholar
Jackson, J. B. C., Buss, L. W. & Cook, R. E. Population Biology and Evolution of Clonal Organisms (Yale Univ. Press, 1985).
Reusch, T. B. H., Boström, C., Stam, W. T. & Olsen, J. L. An ancient eelgrass clone in the Baltic Sea. Mar. Ecol. Prog. Ser. 183, 301–304 (1999).
Google Scholar
Ally, D., Ritland, K. & Otto, S. P. Aging in a long-lived clonal tree. PLoS Biol. 8, e1000454 (2010).
PubMed PubMed Central Google Scholar
Devlin-Durante, M. K., Miller, M. W., Precht, W. F. & Baums, I. B. How old are you? Genet age estimates in a clonal animal. Mol. Ecol. 25, 5628–5646 (2016).
CAS PubMed Google Scholar
Edgeloe, J. M. et al. Extensive polyploid clonality was a successful strategy for seagrass to expand into a newly submerged environment. Proc. R. Soc. B 289, 20220538 (2022).
PubMed PubMed Central Google Scholar
Pereyra, R. T. et al. Clones on the run: the genomics of a recently expanded partially clonal species. Mol. Ecol. https://doi.org/10.1111/mec.16996 (2023).
Yu, L. et al. Somatic genetic drift and multilevel selection in a clonal seagrass. Nat. Ecol. Evol. 4, 952–962 (2020).
PubMed Google Scholar
Honnay, O. & Bossuyt, B. Prolonged clonal growth: escape route or route to extinction? Oikos 108, 427–432 (2005).
Google Scholar
de Witte, L. C. & Stöcklin, J. Longevity of clonal plants: why it matters and how to measure it. Ann. Bot. 106, 859–870 (2010).
PubMed PubMed Central Google Scholar
Ally, D., Ritland, K. & Otto, S. P. Can clone size serve as a proxy for clone age? An exploration using microsatellite divergence in Populus tremuloides. Mol. Ecol. 17, 4897–4911 (2008).
CAS PubMed Google Scholar
Zuckerkandl, E. & Pauling, L. in Horizons in Biochemistry (eds Kasha, M. & Pullman, B.) 189–225 (Academic Press, 1962).
Easteal, S. Rate constancy of globin gene evolution in placental mammals. Proc. Natl Acad. Sci. USA 85, 7622–7626 (1988).
CAS PubMed PubMed Central Google Scholar
Kimura, M. & Ohta, T. On the rate of molecular evolution. J. Mol. Evol. 1, 1–17 (1971).
CAS PubMed Google Scholar
Bromham, L. & Penny, D. The modern molecular clock. Nat. Rev. Genet. 4, 216–224 (2003).
CAS PubMed Google Scholar
Nei, M. & Kumar, S. Molecular Evolution and Phylogenetics (Oxford Univ. Press, 2000).
Antolin, M. F. & Strobeck, C. The population genetics of somatic mutations. Am. Nat. 126, 52–62 (1985).
Google Scholar
Fagerström, T., Briscoe, D. A. & Sunnucks, P. Evolution of mitotic cell-lineages in multicellular organisms. Trends Ecol. Evol. 13, 117–120 (1998).
PubMed Google Scholar
Williams, M. J. et al. Quantification of subclonal selection in cancer from bulk sequencing data. Nat. Genet. 50, 895–903 (2018).
CAS PubMed PubMed Central Google Scholar
King, J. L. & Jukes, T. H. Non-Darwinian evolution. Science 164, 788–798 (1969).
CAS PubMed Google Scholar
Antal, T. & Scheuring, I. Fixation of strategies for an evolutionary game in finite populations. Bull. Math. Biol. 68, 1923–1944 (2006).
PubMed Google Scholar
Burian, A., Barbier de Reuille, P. & Kuhlemeier, C. Patterns of stem cell divisions contribute to plant longevity. Curr. Biol. 26, 1385–1394 (2016).
CAS PubMed Google Scholar
Klekowski, E. J. Progressive cross- and self-sterility associated with aging in fern clones and perhaps other plants. Heredity 61, 247–253 (1988).
Google Scholar
Klekowski, E. J. Plant clonality, mutation, diplontic selection and mutational meltdown. Biol. J. Linn. Soc. 79, 61–67 (2003).
Google Scholar
Lyndon, R. F. The Shoot Apical Meristem: Its Growth and Development (Cambridge Univ. Press, 1998).
Jackson, M. D. B. et al. Global topological order emerges through local mechanical control of cell divisions in the arabidopsis shoot apical meristem. Cell Syst. 8, 53–65.e53 (2019).
CAS PubMed PubMed Central Google Scholar
Goel, M. et al. The majority of somatic mutations in fruit trees are layer-specific. Preprint at bioRxiv https://doi.org/10.1101/2024.01.04.573414 (2024).
Tomlinson, P. B. Vegetative morphology and meristem dependence—the foundation of productivity in seagrasses. Aquaculture 4, 107–130 (1974).
Google Scholar
Sintes, T., Marbà, N. & Duarte, C. M. Modeling nonlinear seagrass clonal growth: assessing the efficiency of space occupation across the seagrass flora. Estuar. Coasts 29, 72–80 (2006).
Google Scholar
Ossowski, S. et al. The rate and molecular spectrum of spontaneous mutations in Arabidopsis thaliana. Science 327, 92 (2010).
CAS PubMed Google Scholar
Yu, L. et al. Ocean current patterns drive the worldwide colonization of eelgrass (Zostera marina). Nat. Plants 9, 1207–1220 (2023).
PubMed PubMed Central Google Scholar
Rafajlović, M. et al. Neutral processes forming large clones during colonization of new areas. J. Evol. Biol. 30, 1544–1560 (2017).
PubMed Google Scholar
Lynch, M. Destabilizing hybridization, general-purpose genotypes and geographic parthenogenesis. Quart. Rev. Biol. 59, 257–290 (1984).
Google Scholar
Hequan, S. et al. The identification and analysis of meristematic mutations within the apple tree that developed the RubyMac sport mutation. Preprint at bioRxiv https://doi.org/10.1101/2023.01.10.523380 (2023).
Greaves, M. & Maley, C. C. Clonal evolution in cancer. Nature 481, 306 (2012).
CAS PubMed PubMed Central Google Scholar
Martincorena, I. et al. Somatic mutant clones colonize the human esophagus with age. Science 362, 911 (2018).
CAS PubMed PubMed Central Google Scholar
Abascal, F. et al. Somatic mutation landscapes at single-molecule resolution. Nature 593, 405–410 (2021).
CAS PubMed Google Scholar
Blokzijl, F. et al. Tissue-specific mutation accumulation in human adult stem cells during life. Nature 538, 260–264 (2016).
CAS PubMed PubMed Central Google Scholar
Satake, A. et al. Somatic mutation rates scale with time not growth rate in long-lived tropical trees. Preprint at bioRxiv https://doi.org/10.1101/2023.01.26.525665 (2023).
Yao, N. et al. An evolutionary epigenetic clock in plants. Science 381, 1440–1445 (2023).
CAS PubMed Google Scholar
Harrison, C. J., Rezvani, M. & Langdale, J. A. Growth from two transient apical initials in the meristem of Selaginella kraussiana. Development 134, 881–889 (2007).
CAS PubMed Google Scholar
Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge Univ. Press, 1983).
Gillespie, D. T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340–2361 (1977).
CAS Google Scholar
Kurihara, D., Mizuta, Y., Sato, Y. & Higashiyama, T. ClearSee: a rapid optical clearing reagent for whole-plant fluorescence imaging. Development 142, 4168–4179 (2015).
CAS PubMed PubMed Central Google Scholar
Barbier de Reuille, P. et al. MorphoGraphX: a platform for quantifying morphogenesis in 4D. eLife 4, e05864 (2015).
PubMed PubMed Central Google Scholar
Reusch, T. B. H. Microsatellites reveal high population connectivity in eelgrass (Zostera marina) in two contrasting coastal areas. Limnol. Oceanogr. 47, 78–85 (2002).
Google Scholar
Yu, L., Stachowicz, J. J., DuBois, K. & Reusch, T. B. H. Detecting clonemate pairs in multicellular diploid clonal species based on a shared heterozygosity index. Mol. Ecol. Resour. 23, 592–600 (2023).
CAS PubMed Google Scholar
Hughes, A., Stachowicz, J. & Williams, S. Morphological and physiological variation among seagrass (Zostera marina) genotypes. Oecologia 159, 725–733 (2009).
PubMed Google Scholar
Ma, X. et al. Improved chromosome-level genome assembly and annotation of the seagrass, Zostera marina (eelgrass). F1000Research 10, 289 (2021).
CAS PubMed PubMed Central Google Scholar
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
CAS PubMed PubMed Central Google Scholar
Li, H. et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics 25, 2078–2079 (2009).
PubMed PubMed Central Google Scholar
Van der Auwera, G. A. et al. From FastQ data to high-confidence variant calls: the Genome Analysis Toolkit Best Practices Pipeline. Curr. Protoc. Bioinform. 43, 11.10.11–11.10.33 (2013).
Google Scholar
Cibulskis, K. et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat. Biotechnol. 31, 213–219 (2013).
CAS PubMed PubMed Central Google Scholar
Kim, S. et al. Strelka2: fast and accurate calling of germline and somatic variants. Nat. Methods 15, 591–594 (2018).
CAS PubMed Google Scholar

Download references

Acknowledgements

This work has been funded by the Human Frontiers in Science (HFSP), grant number RGP_0042_2020 to I.B.B., B.W. and T.B.H.R. B.W. is also supported by a Barts Charity Lectureship (grant no. MGU045) and a UKRI Future Leaders Fellowship (grant no. MR/V02342X/1). J.K is supported by the Horizon Europe Programme MARBEFES project (grant no. 101060937). M.K. was supported by a fellowship by the Helmholtz School for Marine Data Science (MarDATA, grant no HIDSS-0005). We thank F. Wendt for maintaining seagrass cultures and M. Timmermans for providing access to the confocal microscopy. We are grateful to S. Landis for assistance with preparing the figures.

Funding

Open access funding provided by GEOMAR Helmholtz-Zentrum für Ozeanforschung Kiel.

Author information

These authors contributed equally: Lei Yu, Jessie Renton.

Authors and Affiliations

GEOMAR Helmholtz-Center for Ocean Research Kiel, Marine Evolutionary Ecology, Kiel, Germany
Lei Yu, Marina Khachaturyan, Till Bayer & Thorsten B. H. Reusch
Evolutionary Dynamics Group, Centre for Cancer Genomics and Computational Biology, Barts Cancer Institute, Queen Mary University of London, London, UK
Jessie Renton & Benjamin Werner
Institute of Biology, Biotechnology and Environmental Protection, University of Silesia in Katowice, Katowice, Poland
Agata Burian
Institute of General Microbiology, Kiel University, Kiel, Germany
Marina Khachaturyan
Estonian Marine Institute, University of Tartu, Tallinn, Estonia
Jonne Kotta
Department of Evolution and Ecology, University of California, Davis, CA, USA
John J. Stachowicz & Katherine DuBois
Helmholtz Institute for Functional Marine Biodiversity, University of Oldenburg, Oldenburg, Germany
Iliana B. Baums
Alfred Wegener Institute, Helmholtz-Centre for Polar and Marine Research (AWI), Bremerhaven, Germany
Iliana B. Baums
Institute for Chemistry and Biology of the Marine Environment (ICBM), School of Mathematics and Science, Carl von Ossietzky Universität Oldenburg, Oldenburg, Germany
Iliana B. Baums

Authors

Lei Yu
View author publications
You can also search for this author in PubMed Google Scholar
Jessie Renton
View author publications
You can also search for this author in PubMed Google Scholar
Agata Burian
View author publications
You can also search for this author in PubMed Google Scholar
Marina Khachaturyan
View author publications
You can also search for this author in PubMed Google Scholar
Till Bayer
View author publications
You can also search for this author in PubMed Google Scholar
Jonne Kotta
View author publications
You can also search for this author in PubMed Google Scholar
John J. Stachowicz
View author publications
You can also search for this author in PubMed Google Scholar
Katherine DuBois
View author publications
You can also search for this author in PubMed Google Scholar
Iliana B. Baums
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Werner
View author publications
You can also search for this author in PubMed Google Scholar
Thorsten B. H. Reusch
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.B.H.R., I.B.B. and B.W. designed the project and obtained funding. L.Y. and M.K. prepared the DNA samples for sequencing and performed the bioinformatic analysis under supervision of T.B.H.R. J.R. contributed the modelling and simulations along with B.W. A.B. provided histological analyses and microscopic images, T.B. performed the mutational spectra analyses. J.K., J.J.S. and K.D. provided access to field sites and contributed biological material. J.R., L.Y. and T.B.H.R. wrote an initial draft of the manuscript. All authors interpreted the results, and edited and approved the final manuscript.

Corresponding authors

Correspondence to Benjamin Werner or Thorsten B. H. Reusch.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks Alex Cagan, Kerstin Johannesson, Young Seok Ju and Long Wang for their contribution to the peer review of this work. Peer reviewer reports are available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Supplementary Notes 1 and 2, Figs. 1–16 and Tables 1–3.

Reporting Summary

Peer Review File

Supplementary Data 1

Excel file containing all metadata of all sequenced samples and some basic sequencing statistics.

Source data

Source Data Fig. 4

Source data of figure panels 4a,c,d as separate sheets of a single Excel file.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yu, L., Renton, J., Burian, A. et al. A somatic genetic clock for clonal species. Nat Ecol Evol 8, 1327–1336 (2024). https://doi.org/10.1038/s41559-024-02439-z

Download citation

Received: 30 November 2023
Accepted: 07 May 2024
Published: 10 June 2024
Issue Date: July 2024
DOI: https://doi.org/10.1038/s41559-024-02439-z
Springer Nature Limited

This article is cited by

A clock for clonal organisms
- Long Wang
Nature Ecology & Evolution (2024)

Associated content

A clock for clonal organisms

News & Views Nature Ecology & Evolution 10 June 2024

A somatic genetic clock for clonal species

Abstract

Similar content being viewed by others

Main

Results

A generic somatic genetic clock in clonal species revealed by modelling and simulations

Application of the somatic genetic clock in a seagrass

Calibration of the somatic genetic clock

Age estimation of 15 globally distributed Z. marina genets

Discussion

Methods

Simulating fixed mutation accumulation in a clonal organism

Shoot apex preparation and imaging in laser confocal microscope

Image processing and analysis

Parameterizing the model for eelgrass

Eelgrass genets for calibration cultured in the lab

Four-year-old eelgrass calibration genets

Seventeen-year-old eelgrass calibration genets

Sampled eelgrass genets in the field

ES_C01-ES_C03

YU20_FI

YU23_C01-YU23_C11

Whole-genome resequencing data of new populations

Mapping the sequencing data to the reference genome

Clone assignment check for ramets collected from Estonia

Somatic polymorphism calling and calculation of VRF50(X 1, X 2)

Analysis of mutational spectra

Calculation of VRF50(R x)

Estimating the number of mutations and genet age

Reporting summary

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation

Somatic polymorphism calling and calculation of VRF50(X ₁, X ₂)

Calculation of VRF50(R _x)