1 Introduction

Immunosenescence underlies poor health outcomes in the aging population, including diminished vaccine efficacy (Poland et al. 2010; McElhaney and Dutz 2008; Fleming and Elliot 2008), increased susceptibility to disease (including irregular presentation, intensified symptoms, longer recovery times, increased mortality) (Thomas-Crussels et al. 2012), and a heightened risk of cancer (Ginaldi et al. 2001). This degradative aging process of the human immune system originates from extensive fundamental changes to the size and functionality of immune cell pools, and the structure of lymphatic tissues in which they develop and operate (Salam et al. 2013).

Among the many changes associated with immunosenescence (Globerson and Effros 2000), the T cell compartment is arguably the most damaged (Wick et al. 2000; Gruver et al. 2007). The T cell pool is comprised of subpopulations of antigen-inexperienced naive cells and antigen-experienced memory cells, the latter of which retain immunological record of previous infections. The human immune compartment maintains \(\sim 10^{12}\) T cells in total, of which \(\sim 10^{11}\) are naive (Jenkins et al. 2009; Trepel 1974). During aging, the population of naive T cells declines in overall size, while the population of memory T cells undergoes extensive proliferation, thereby reversing the balance of naive and memory T cells that had persisted at younger ages (Globerson and Effros 2000; Fagnoni et al. 2000). The expansion of memory T cells further enhances immunological memory of previously encountered antigens, reinforcing existent immune protection. The remaining naive pool experiences loss of T cell receptor (TCR) “structural diversity” (Goronzy et al. 2007, 2015b)—the number of distinct TCR complexes present across the entire naive pool. The diversity of T cell clones, or “immunoclones,” characterized by the number of distinct TCR complexes among the cell population, provides the extent of antigen specificity. Unique TCR complexes are generated during T cell development in the thymus, via recombination of genes encoding the V and J domains of the TCR\(\alpha \) chain and the V, J, and D domains of the TCR\(\beta \) chain, along with additional insertion and deletion of nucleotide fragments (Murphy 2012). Combinatorially, a possible \(\varOmega _0 \sim 10^{15}\)\(10^{20}\) unique TCR complexes may be assembled via this rearrangement process (Laydon et al. 2015), but only \(\varOmega \sim (0.05) \times \varOmega _0\) of those rearrangements are functionally viable (Yates 2014), as determined by positive and negative selection tests in the thymus, which screen for appropriate reactivity to self-peptide/MHC molecules. Each TCR is activated by at least one peptide fragment presented via MHC molecules on the surface of an antigen-presenting cell; thus, loss of naive TCR structural diversity limits the number of new antigens to which the full naive T cell pool can respond. Naive cells are also suspected to suffer major functional deficiencies in aging, such as diminished binding affinity and proliferative capacity after antigenic stimulation (Moro-García et al. 2013), which have been studied mostly using murine models to date (Appay and Sauce 2014). Their effects on human immune systems are not yet well understood but nonetheless beyond the scope of this paper.

The total abundance of naive T cells, which inhabit both blood and lymphatic tissue, can be reliably estimated from measurements in small samples (Westermann and Pabst 1990; Bains et al. 2009a). Recently, Westera et al. (2015) estimated an \(\sim 52\%\) decrease in the naive T cell population in aging. In contrast, accurate estimation of full-organism TCR structural diversity is currently impeded by experimental imprecision and the inability to extrapolate small sample data to the full organism (Laydon et al. 2015). Experimentation typically entails DNA sequencing of the TCR\(\alpha \) or—more commonly—\(\beta \) chain, in particular the complementarity-determining region 3 (CDR3), which is the site of TCR binding to antigenic peptide and most significant basis for diversity (Murphy 2012).

Increasingly sophisticated sequencing and analysis methods have improved estimates (Shugay et al. 2015; Oakes et al. 2017) for the lower bound on TCR diversity, but direct estimation of TCR diversity remains a challenge due to various experimental complications, such as the inability to detect rare clonotypes, sequencing errors, and inaccurate measurement of clonotype frequencies resulting from inconsistencies in polymerase chain reaction (PCR) amplification (Laydon et al. 2015). Predicting full-organism TCR diversity from a small sample is typically formulated as an “unseen species problem,” and one of many canonical solutions to such a problem is employed in conjunction with experimental data (Chao 1984; Chao and Lee 1992; Colwell and Coddington 1994), but the true relationship between sample and full diversity is fundamentally elusive.

Despite variations across experimental measurements of TCR diversity, its age-related loss has been consistently observed. An early study conducted by Naylor et al. (2005) predicted a TCR\(\beta \) chain diversity of \(\sim 2 \times 10^7\) that persisted in donors through age 60, before dropping by two orders of magnitude to \(\sim 2 \times 10^5\) at age 70. More recently, Britanova et al. (2014) collected samples from donors of all ages and observed an approximately linear decrease in TCR\(\beta \) CDR3 diversity from \(\sim 7 \times 10^6\) in youth (6–25 years) to \(\sim 2.4 \times 10^6\) in advanced age (61–66 years). Qi et al. (2014) obtained a particularly high lower bound estimate of \(\sim 10^8\) unique TCR\(\beta \) sequences in youth (20–35 years), which declined two- to fivefold in advanced age (70–85 years).

Note that only the TCR\(\beta \) chain is sequenced in these experiments. Sequencing of both the \(\alpha \) and \(\beta \) chains would potentially produce a more accurate measure of TCR diversity, but the same experimental limitations preclude complete analysis. The measurement of diversity is further complicated by the potentially large disparity between structural diversity and “functional diversity”—that is, the number of antigens to which the T cell pool is capable of responding. Due to the potential for crossreactivity, in which one TCR might respond to many structurally similar peptide fragments, it is possible that actual TCR diversity is much higher than structural diversity indicates. It has been speculated that one TCR might respond to as many as \(10^6\) different peptide epitopes (Mason 1998).

To obtain lifetime estimates of TCR structural diversity and develop an informed context for discussion of functional diversity, we introduce a mechanistic mathematical model of the generation and replenishment of the naive T lymphocyte pool from birth through the end of life. Although experimental assessments of full-system information remain challenging, measurements for the dynamics of each component related to the naive T cell population can be found throughout the literature. Our mathematical approach combines the knowledge of these individual components to study their interplay, leading to an understanding of the full-system dynamics. By extending previous model studies of total cell counts (Mehr et al. 1996, 1997; Ribeiro and Perelson 2007; Bains et al. 2009a, b; Hapuarachchi et al. 2013; Murray et al. 2003; Reynolds et al. 2013), our multi-component formulation is able to efficiently track the total number of distinct naive T cell clones, allowing for a full-system assessment of TCR structural diversity.

2 Mathematical Models and Results

We develop our mathematical model by first constructing the equation governing the total population size of the naive T cell pool in Sect. 2.1, through which we quantitatively constrain the primary parameters of our model using experimental measurements found in the previous literature. The model that describes the evolution of immunoclones is derived in Sect. 2.2, allowing us to define and estimate the diversity of the naive T cell population in Sect. 2.3. In Sect. 2.4, we inspect the impact of sampling on the estimate of immunoclone diversity, as in practice it is only possible to extract a small fraction of the entire naive T cell population from a body.

2.1 Total Naive T Cell Population Model

There are three fundamental immunological mechanisms that sustain the naive T cell pool: (1) export of mature naive T cells from the thymus, (2) peripheral proliferation, and (3) cell removal from the naive pool due to death or phenotypic changes. These basic mechanisms constitute a birth–death–immigration process described by the ordinary differential equation:

$$\begin{aligned} \frac{\mathrm{d}N(t)}{\mathrm{d}t} = \gamma (t) + p N(t) - \mu (N) N(t), \end{aligned}$$
(1)

where N(t) denotes the total naive T cell count, \(\gamma > 0\) denotes the rate of thymic output, \(p > 0\) denotes the rate of proliferation, and \(\mu (N) > 0\) denotes the rate of population-dependent regulated cellular death or loss of naive phenotype.

While more complex feedback mechanisms have been proposed  (Mehr et al. 1997), other experiments have shown that thymic export is independent of naive T cell counts (Ribeiro and Perelson 2007; Berzins et al. 1998; Metcalf 1963) and it is well established that the export rate consistently decays throughout the human lifespan (Murray et al. 2003). The lifelong decline of thymic export is caused by thymic involution and leads to the degradation of structural integrity and functional capacity of the thymus with age (Steinmann et al. 1985). The age dependence of the thymic export rate of newly trained naive T cells is often approximated by an exponentially decaying function, \(\gamma (t) = \gamma _0 {\mathrm{e}}^{-at}\), where \(\gamma _0 > 0\) is the maximum rate of thymic output that arises in early years and \(a > 0\) is the rate of decrease in thymic output.

The immune systems of vertebrates maintain a healthy amount of naive T cells through complex homeostatic mechanisms, which include controlled production and distribution of common gamma chain cytokines, particularly IL-7, to the naive pool (Fry and Mackall 2005). IL-7 is secreted by stromal and endothelial cells in the thymus, bone marrow, and lymphatic tissue, providing T cells with necessary survival signals. In lymphoreplete conditions, competition for this limited resource regulates population size (Bradley et al. 2005; Tan et al. 2001; Vivien et al. 2001), but in lymphopenic conditions, high levels of IL-7 resulting from low T cell counts can even stimulate cellular proliferation. While IL-7 concentration may be explicitly formulated in a mathematical model of the peripheral T cell population, as in the work of Reynolds et al. (2013), most models incorporate IL-7 regulation implicitly in the form of carrying capacity, assuming quick equilibration in a state of competition for IL-7 in the presence of a given number of T cells. Such simplification commonly leads to the dependence on total cell counts of both cell proliferation and cell death rates, considering the cytokine’s dual role under lymphoreplete and lymphopenic conditions described above. Our model assumes a cell count dependence in the cell death rate only, focusing on scenarios of healthy aging, i.e., lymphoreplete conditions. We thus assume a regulated N-dependent cell death rate of the form

$$\begin{aligned} \mu (N) = \mu _0 + {\mu _1 N^{\theta }\over N^{\theta } + K^{\theta }}, \end{aligned}$$
(2)

where the first term, \(\mu _0>0\), is the basal rate of cellular death. The second one describes the IL-7-mediated regulation of cell death, with \(\mu _1 > 0\) representing the maximal increase to the death rate as \(N \rightarrow \infty \). The quantity K is analogous to a “carrying capacity” and dictates the population at which signaling-induced death starts to limit the population. We have shown that model predictions are not qualitatively sensitive to the Hill coefficient, so without loss of generality we fix the Hill coefficient \(\theta = 2\).

The constant rate of cellular proliferation p under healthy conditions is supported by recent studies of Westera et al. (2015), showing nearly identical naive proliferation rates at young and old ages during age-related nonlymphopenic loss of naive cells. IL-7-induced proliferation can arise in unhealthy lymphopenic conditions typically found in severe disease of the immune system (Brass et al. 2014), cytotoxic drug use (Gergely 1999), radiation treatment (Grossman et al. 2015), or other abnormal situations. These scenarios are, however, beyond the scope of our analysis.

Our model has six adjustable parameters, \(\gamma _0\), a, p, \(\mu _0\), \(\mu _1\), and K. The first four are biologically inherent to the mechanism of T cell homeostasis and have been measured experimentally in humans and rodents. The last two have to be constrained via parameter sweeps to match relevant experimental observations. To non-dimensionalize Eqs. 12, we use \(a^{-1}\) to rescale t and K to rescale N to find

$$\begin{aligned} \frac{\mathrm {d} N^\prime }{\mathrm {d} t^\prime } = {\gamma _0 \over a K} {\mathrm{e}}^{- t^\prime } + {(p - \mu _0)\over a} N^\prime \left( 1 - {\mu _1\over (p - \mu _0)} \frac{{N^\prime }^2}{{N^\prime }^2 + 1}\right) \end{aligned}$$
(3)

which depends on the three independent parameters, \(\gamma _0/(aK)\), \((p - \mu _0)/a\), and \(\mu _1/ \left( p - \mu _0 \right) \), that control the qualitative behavior of our model. Specifically, the parameter \(\mu _1/(p - \mu _0)\) specifies how quickly the N-dependent death rate approaches its maximal value. Note that p and \(\mu _0\) always appear together in the form of \(\left( p - \mu _0 \right) \) in the model and thus are effectively just one parameter, reducing the number of free parameters by one.

Figure 1a illustrates four qualitatively distinct evolution trajectories of N(t) that may arise from simulations of the model in the presence of a decaying thymic export rate \(\gamma (t)\) (gray dashed–dotted curve). The black dashed curve arises when \(\mu _1/(p - \mu _0) < 1\). In this case, cell proliferation always exceeds cell death, leading to unbounded expansion of the naive T cell population. This scenario is unrealistic, except perhaps during a period of lymphopenia. For \(\mu _1/(p - \mu _0) \ge 1\), cell death is able to balance cell proliferation at a homeostatic carrying capacity \(N = N_{\mathrm{ss}} (\gamma = 0)\), defined by \(\mu (N_{\mathrm{ss}} (\gamma = 0)) = p\), as \(\gamma \rightarrow 0\). As illustrated by the green dotted curve, N(t) rises and asymptotically converges toward \(N_{\mathrm{ss}} (\gamma = 0)\) provided that \(\gamma _{0}/(aK) \ll 1\). We refer to this scenario as being in the “proliferation-driven” regime, given that the cell population is driven to \(N_{\mathrm{ss}} (\gamma = 0)\) primarily by homeostatic proliferation. The model’s behavior makes a transition from proliferation driven to “thymus driven” if we increase \(\gamma _{0}/(aK)\). As shown by the blue solid curve, N(t), driven by increased thymic export, overshoots and approaches \(N_{\mathrm{ss}} (\gamma = 0)\) from above as \(\gamma (t) \rightarrow 0\) asymptotically. In “Appendix A,” we define “direct thymic output” and “proliferation-generated” subpopulations of naive T cells. A thymus-driven description indicates that the lifetime evolution of N(t) is entrained by thymic involution. We show that even in this case, the majority of the naive T cell population is maintained by homeostatic proliferation, while cells directly exported from the thymus only comprise 10–25% of the population, consistent with previous experimental findings in human adults (den Braber et al. 2012.)

Finally, the red dashed–dotted curve arises when \((p-\mu _0)/a \le 0\). In this case, cell death always exceeds cell proliferation as \(\gamma (t) \rightarrow 0\). In this case, the naive T cell population is almost entirely sustained by direct thymic export, and \(N(t) \rightarrow N_{\mathrm{ss}} (\gamma = 0) = 0\). This scenario is consistent with previous experimental findings in mice, where the average lifespan of naive T cells is shorter than cell doubling time, rendering peripheral proliferation of naive T cells highly unlikely in mice (den Braber et al. 2012). As stated earlier, in this paper we focus on scenarios of healthy aging (lymphoreplete conditions) in humans, which immediately rules out the scenarios of unbounded growth (black dashed curve) and complete collapse of the T cell population (the red dotted–dashed curve), effectively constraining our parameters to the physiologically reasonable values \(\mu _1/(p - \mu _0) \ge 1\) and \((p - \mu _0)/a > 0\).

Fig. 1
figure 1

(Color figure online) Qualitative behavior of the total naive T cell population model (Eqs. 12). a The total naive T cell population N(t) as a function of time (in years) for four qualitatively distinct scenarios. Unbounded growth arises when \(\mu _1/(p - \mu _0) < 1\) and the naive T cell population collapses when \((p - \mu _0)/a < 0\). Outside of these two regimes, N(t) converges asymptotically to a positive steady state as \(\gamma (t) \rightarrow 0\). If \(\gamma _{0}/(aK) \ll 1\), N(t) is driven primarily by homeostatic proliferation and increases monotonically toward the constant plateau. Increasing \(\gamma _{0}/(aK)\) leads to a transition from proliferation-driven scenario to thymus-driven populations, in which N(t) reaches a peak value before converging to the steady state. The decaying thymic export rate \(\gamma (t)\) is plotted alongside the N(t) curves as a reference. To quantify the decrease in cell counts with age, we define \({\bar{N}}_{{y}}\) as the average of N(t) between ages 20 and 30, and \({\bar{N}}_{\mathrm{o}}\) between 70 and 80; then, \(\Delta \left( {\bar{N}}\right) = \left( {\bar{N}}_{\mathrm{o}} - {\bar{N}}_{{y}}\right) /{\bar{N}}_{{y}}\) is the relative change in cell counts. The parameter values used are \(\gamma _0 = 1.8 \times 10^{10}\), \(a = 0.044\), and \(K = 10^{10}\) and \(p = 0.022\), \(\mu _0 = 0.017\), and \(\mu _1 = 0.004\) for unbounded growth, \(p = 0.17\), \(\mu _0 = 0.18\), and \(\mu _1 = 0.04\) for the collapse scenario, \(p = 0.18\), \(\mu _0 = 0.17\), and \(\mu _1 = 0.01001\) for the homeostasis-driven case, and \(p = 0.18\), \(\mu _0 = 0.17\), and \(\mu _1 = 0.04\) for the thymus-driven case. The initial value is \(N(1) = 10^{10}\) at \(t = 1\) year. b\(\Delta \left( {\bar{N}}\right) \) as a function of \(\gamma _{0}/(aK)\) and \(\mu _1/(p-\mu _0)\). When \(\gamma _{0}/(aK)\) and \(\mu _1/(p-\mu _0)\) are small, N(t) is driven primarily by proliferation and keeps increasing well into old age, leading to positive \(\Delta ({\bar{N}})\) values. Conversely, for large \(\gamma _{0}/(aK)\) and \(\mu _1/(p - \mu _0)\), thymic export dominates and N(t) peaks at early ages, resulting in negative \(\Delta ({\bar{N}})\). The black dotted curve corresponds to \(\Delta ({\bar{N}}) = - 52\%\) as previously reported by Westera et al. for human adults. At fixed \(\mu _1/(p - \mu _0) = 4\), we are able to reproduce this curve by setting \(\gamma _{0}/(aK) \simeq 41\) (corresponding to \(K = 10^{10}\) for our choice of parameter values). The value of \(\Delta ({\bar{N}})\) increases with decreasing \(\gamma _{0}/(aK)\) and becomes positive when \(\gamma _{0}/(aK) \lesssim 1\). Here, we fixed \((p - \mu _0)/a = 0.2\) and \(a = 0.044\)

We can further quantitatively calibrate the parameter values using experimental measurements for human adults in the literature. The constant peripheral proliferation rate p has been measured by Westera et al. (2015) as 0.05% \(\text {day}^{-1}\), or equivalently \(p = 0.18\)\(\text {year}^{-1}\) in a healthy human. The basal death rate \(\mu _0\) can be estimated from the lifespan of T-cells. Based on data from Vrisekoop et al. (2008), De Boer and Perelson (2013) obtain an average naive human \(\text {CD4}^+\) T-cell lifespan of \(\sim 5\) years and an average naive human \(\text {CD8}^+\) lifespan of \(\sim 7.6\) years. Given the normal \(\text {CD4}^+\text {:CD8}^+\) ratio of 2:1, the average combined naive T-cell clearance rate is \(\mu _0 = \frac{1}{5.9}\)\(\text {year}^{-1}\) = 0.17 \(\text {year}^{-1}\). Thymic involution within an aging human can be quantified by measuring the decrease in thymic epithelial volume (Steinmann 1986), based on which Murray et al. (2003) showed that thymic output decreases by an average of \(4.3\%\) per year between ages 0 and 100, implying a decay factor of \(a = |\ln (0.957)| \simeq 0.044\). The rate of thymic export has recently been measured for young adults (20–25 years old) at \(\sim 1.6 \times 10^{7}\) trained cells daily or equivalently \(5.8 \times 10^{9}\) per year (Westera et al. 2015). Assuming that this rate is \(\gamma (t)\) at \(t = 25\) years, we can back-calculate \(\gamma _0 = (5.8 \times 10^{9}) \times \left( \frac{100}{33.3}\right) \approx 1.75 \times 10^{10} \text { cell exports}/\text {year}\). Note that these values of p, \(\mu _0\), and a satisfy the constraint \(\left( p - \mu _0 \right) /a > 0\) that prevents the human naive T-cell population from completely collapsing once \(\gamma (t) \rightarrow 0\).

While direct experimental measurements of \(\mu _1\) and K are not available in the literature, further inspection of Fig. 1a reveals that \(\mu _1\) and K determine whether thymic export or homeostatic proliferation dominates the evolution of N(t). Through the dimensionless parameters, \(\gamma _{0}/(aK)\) and \(\mu _1/(p - \mu _0)\), the time at which N(t) peaks and how fast it declines from the peak vary with changes to the values of \(\mu _1\) and K. Recently, Westera et al. (2015) reported a \(52\%\) decrease in total naive T-cell counts between young human adults and elderly individuals, which we can use to quantitatively constrain \(\mu _1\) and K. Let us define individuals of an age between \(t = 20\) and 30 years as young adults, and those between \(t = 70\) and 80 as the elderly. Assuming that interpersonal heterogeneity unrelated to age averages out over large sample sizes in clinical data, we may evaluate \({\bar{N}}_{{y}} = \frac{1}{10} \int _{20}^{30} N(t) \mathrm{d}t\) and \({\bar{N}}_{\mathrm{o}} = \frac{1}{10} \int _{70}^{80} N(t) \mathrm{d}t\) as the average naive T-cell counts, respectively, for the young and the elderly, as illustrated by the shaded areas under the thymus-domination curve in Fig. 1a. The relative change in the naive T-cell count between young and elderly adults can thus be evaluated as

$$\begin{aligned} \Delta ({\bar{N}}) = {({\bar{N}}_{\mathrm{o}} - {\bar{N}}_{{y}}) \over {\bar{N}}_{{y}}}. \end{aligned}$$
(4)

Figure 1b plots \(\Delta ({\bar{N}})\) as a function of \(\gamma _{0}/(aK)\) and \(\mu _1/(p - \mu _0)\), with \(a = 0.044 \text { year}^{-1}\) for converting the dimensionless time to years to compute \({\bar{N}}_{{y}}\) and \({\bar{N}}_{\mathrm{o}}\). When \(\gamma _{0}/(aK) \lesssim 1\) and \(\mu _1/(p - \mu _0) \lesssim 2\), \(\Delta ({\bar{N}}) > 0\). Note that the homeostatic carrying capacity when \(\gamma (t) = 0\) is \(N_{\mathrm{ss}} (\gamma = 0) = K \left( \mu _1/(p- \mu _0) - 1 \right) ^{-1}\). A small \(\gamma _{0}/(aK)\) value represents a relatively low thymic export rate, and the carrying capacity increases rapidly as \(\mu _1/(p - \mu _0) \rightarrow 1\), both of which make it challenging for thymic output to fill up the naive T-cell pool to carrying capacity before \(\gamma (t)\) considerably decays within \(t \sim a^{-1}\). As a result, N(t) does not reach a peak value at a young age and continues increasing into old age. The \(\approx 52\%\) decrease in naive T-cell counts reported by Westera et al. (2015) is depicted by the black dotted curve, which exhibits an abrupt turn around \(\mu _1/(p-\mu _0) \approx 4\), suggesting that the value of \(\mu _1/(p-\mu _0)\) may most likely be around or above four.

In “Appendix A,” we further find that increasing \(\mu _1/(p-\mu _0)\) leads to a higher fraction of the naive T cell population coming from direct thymic export. For \(\mu _1/(p-\mu _0) \ge 10\), this fraction stays consistently above \(25\%\) throughout most of an adult human life, which exceeds previous experimental observations of 11–23% (den Braber et al. 2012), suggesting that 10 may be an upper bound on \(\mu _1/(p-\mu _0)\). Without loss of generality, we set \(\mu _1/(p-\mu _0) = 4\), yielding \(K = 10^{10}\) by calibrating our model to reproduce this decrease in the cell count (\(\gamma _{0}/(aK) \simeq 41\) with \(\gamma _0 = 1.8 \times 10^{10}\) and \(a = 0.044\)). In contrast, \(K = 10^{12}\) yields \(\gamma _{0}/(aK) \simeq 0.41\), leading to an increase in the cell count (\(\Delta ({\bar{N}}) \simeq 0.63\)). In between, \(K = 10^{11}\) results in a moderate decrease in the cell count (\(\Delta ({\bar{N}}) \simeq -0.33\)). For the rest of the paper, we fix \(K = 10^{10}\) and \(\mu _1/(p-\mu _0) = 4\), or equivalently \(\mu _1 = 0.04\) given that \(p = 0.18\) and \(\mu _0 = 0.17\), so that the age-related decline of N(t) in our model is consistent with Westera et al. (2015).

Fig. 2
figure 2

(Color figure online) Comparison of thymic export and cell population evolution time scales. a Plots of N(t) and \(N_{\mathrm{ss}}\) show discrepancy. The \(\gamma (t)\) dependence makes \(N_{\mathrm{ss}}\) decline monotonically with the exponentially decaying thymic export, and \(N_{\mathrm{ss}}\) approaches a small positive value as \(\gamma (t) \rightarrow 0\). The solution N(t) evolves toward \(N_{\mathrm{ss}}\) but never catches up with it because of a slower evolution time scale. b Comparison of timescales of thymic atrophy and cell population evolution. Thymic atrophy is the faster mechanism for most choices of the system’s parameters. Increasing \(\mu _1\) shortens the time scale of clone evolution, indicating that the steady-state solution can be a reasonable approximation to the fully time-dependent solution at very large \(\mu _1\) and very small \(p - \mu _0\). Here, varying \(N_{\mathrm{ss}}\) within the range \([10^{10}, 10^{12}]\) yields almost identical results, and the values of \(\gamma _0\) and K, chosen within the reasonable parameter regime, do not affect the results significantly. Parameter values used are \(\gamma _0 = 1.8 \times 10^{10}\), \(a = 0.044\), \(p = 0.18\), \(\mu _0 = 0.17\), \(K = 10^{10}\), \(\varOmega = 10^{16}\). In panel a, \(\mu _1 = 0.04\), and the initial condition is \(N(1) = 10^{11}\)

Note that there exist two intrinsic timescales in Eq. 1; thymic export decays at a rate a, while the homeostatic time scale is controlled primarily by p, \(\mu _0\), and also by \(\mu _1\) to a lesser degree. If homeostasis is much faster than thymic involution, the solution N(t) will quickly converge to the quasisteady-state solution as \(\gamma (t)\) evolves. We compare these two solutions in Fig. 2a, where the quasisteady-state solution is obtained by solving for the steady-state solution \(N_{\mathrm{ss}}\) of Eq. 1 with fixed \(\gamma (t)\) at each time t, and \(N_{\mathrm{ss}} (\gamma (t))\) (black dashed curve) decreases monotonically with age due to the continuous decline of \(\gamma (t)\). In contrast, N(t) (blue solid curve) slowly rises from the initial condition \(N(1) = 10^{11}\) and does not approach the quasisteady-state level until age \(\approx 20\) years. The trajectory of N(t) then overshoots the declining \(N_{\mathrm{ss}} (\gamma (t))\), reaches a peak value, and reverses course to go after \(N_{\mathrm{ss}} (\gamma (t))\). However, N(t) never catches up with \(N_{\mathrm{ss}} (\gamma (t))\) before the latter reaches a steady state of very low cell counts. That N(t) keeps lagging behind \(N_{\mathrm{ss}} (\gamma (t))\) indicates that the timescale for the full model solution to converge to the steady state is slower than the evolution of the nonautonomous term \(\gamma (t)\). The results here suggest that steady-state solutions cannot adequately describe the temporal evolution of the naive T-cell population in the biologically relevant range of parameter values that we have implemented. It is necessary to numerically compute the time-dependent solutions for the full nonautonomous equation.

Indeed, we find a disparity in the rates at which thymic export decays and the steady-state solutions evolve. The latter is provided by the inverse of the eigenvalue of Eq. 1 linearized around \(N = N_{\mathrm{ss}} (\gamma (t))\). The eigenvalue takes the form \(\lambda _1 = p_0 - (\mu _0 + \mu _1((3 N_{\mathrm{ss}}^2 K^2 + N_{\mathrm{ss}}^4)/((K^2+ N_{\mathrm{ss}}^2)^2))\). Simulations in Fig. 2b show that for the biologically relevant parameter values we have implemented, the cell population evolution timescale, \(|\lambda _1|^{-1}\) (red solid curve), is generally longer than the timescale of thymic involution (\(a^{-1} \simeq 22.7\) years for \(a = 0.044\) as denoted by the horizontal black dotted line). Hence, the nonautonomous solutions N(t) are expected to lag behind the thymus-driven steady-state solutions \(N_{\mathrm{ss}}\). For N(t) to be reasonably approximated by \(N_{\mathrm{ss}}\), the cell population has to evolve much faster than thymic involution, corresponding to the regime of very large \(\mu _1\), as indicated by the blue dashed–dotted curve, where cell death is extremely sensitive to the cell population size. However, \(\mu _1\) is bounded above by experimental observations, as discussed previously in parameter calibration. Thus, our conclusions derived from Fig. 2a and b should hold for parameter values within the biologically relevant range.

2.2 Clonotype Abundance Distributions

Quantification of the populations of individual clonotypes would require analysis of models that track the population dynamics of naive T-cells of each TCR type. Assuming the same population dynamics for each T-cell clonotype i, which may be appropriate in certain scenarios, the evolution of the expected cell count \(n_i(t)\) may be deduced from Eq. 1 and take the following generalized form:

$$\begin{aligned} \frac{\mathrm{d}n_i}{\mathrm{d}t} = \frac{\gamma (t)}{\varOmega } + p n_{i}-\mu (N)n_{i}, \end{aligned}$$
(5)

where \(\gamma (t)/\varOmega \) represents thymic export of naive T-cells of each clonotype (the total thymic export rate normalized by the total number of viable TCR combinations \(\varOmega \)), and \(N(t) = \sum _i n_i(t)\). Within the framework of these “neutral” models, basic qualitative behaviors of T-cell population dynamics have been investigated, particularly for scale-invariant properties that can be studied in a reduced system (Lythe et al. 2016; Desponds et al. 2015). Indeed, the total numbers of T-cell clonotypes \(\varOmega \) in rodent or human bodies are prohibitively large for direct numerical simulations of the full system using Eq. 5. It is thus common to reduce the full system to a more manageable size with the assumption that the phenomena under investigation are scale invariant. However, it is sometimes difficult to assert whether a certain property really does not change in a rescaled system, as nonlinear phenomena, such as the Allee effects, often arise in population dynamics and cast doubt on the scalability of the system. Moreover, some properties, such as the thymic export rate \(\gamma (t)\), are naturally scale dependent. It is not always clear how these quantities should be rescaled in a reduced system, and they have usually been omitted by simplification arguments in previous models, which limits the applicability of these models.

In particular, thymic involution is known to be associated with the age-related loss of naive T-cell diversity. Without the explicit inclusion of the thymic export rate, such loss of naive T-cell diversity cannot be properly investigated. To facilitate a more manageable full-system model, we consider a formulation that tracks how the expected number of clones of a given size changes with time. By focusing on clone count rather than the explicit cell count of each distinct clonotype, we are able to effectively reduce the number of tracked variables and thus the dimension of the model. This representation was used by Ewens (1972) in population genetics, by Goyal et al. (2015) in the context of hematopoietic stem cell population dynamics, and by Desponds et al. (2017) in the context of T-cells. We define \({\hat{c}}_k(t)\) to be the number of clones represented by exactly k naive T-cells in the organism at time t:

$$\begin{aligned} {\hat{c}}_k(t) = \sum _{i = 1}^{\varOmega } \delta _{n_i(t),k}, \end{aligned}$$
(6)

where the Kronecker delta function \(\delta _{x,y} = 1\) when \(x = y\) and 0 otherwise. By lumping clonotypes of the same cell count into one single variable \({\hat{c}}_k\), this alternative formulation can efficiently describe changes to the TCR clone diversity in the full system, albeit at the expense of the ability to distinguish each specific clonotype (Morris et al. 2014; Mora and Walczak 2016). Individual clone information is lost, and \(n_i(t)\) cannot be recovered from \({\hat{c}}_k(t)\) after the transformation in Eq. 6. Nonetheless, the amount of computation can be significantly reduced by truncating \({\hat{c}}_k(t)\) at a reasonably large k, as few large clones exist in realistic scenarios, and \({\hat{c}}_k(t)\) for large k is negligible. Letting \(c_0(t)\equiv \langle {\hat{c}}_{0}(t)\rangle \) denote the expected number of all possible (thymus-allowed) clonotypes unrepresented in the periphery at time t, and \(c_k(t)\equiv \langle {\hat{c}}_{k}(t)\rangle \) the expected number of clones of size k at time t, a set of equations governing the evolution of \(c_{k}(t)\) consistent with Eq. 5 can be derived in the mean-field limit. Below, we provide a heuristic derivation and leave the more formal development to “Appendix B.” The mean-field equation for the expected clone counts can be written as

$$\begin{aligned} \frac{\mathrm{d}c_k(t)}{\mathrm{d}t}&= \frac{\gamma (t)}{\varOmega } \left[ c_{k-1} - c_k\right] + p\left[ (k-1)c_{k-1} - k c_k\right] + \mu (N) \left[ (k+1)c_{k+1} - k c_k\right] , \end{aligned}$$
(7)

where \(N(t) = \sum _{i}^{\infty } n_{i}(t) = \sum _{\ell =1}^{\infty }\ell c_{\ell }(t)\). The expected values \(c_{k}(t)\) are also called species abundances in the ecology literature. The number of unrepresented clones is \(c_0 = \varOmega - \sum _{k=1}^\infty c_k\), and summing Eq. 7 multiplied by k over \(k = 1, 2, \ldots \) recovers Eq. 1. The mean-field assumption is articulated in terms such as \(\mu (\sum _{\ell } \ell {\hat{c}}_{\ell }) {\hat{c}}_{k}\) that involve higher-order products of \({\hat{c}}_{k}\) rather than correlations of products of \({\hat{c}}_{k}\).

In Eq. 7, the terms in the forms of \((\gamma (t)/\varOmega )c_k\), \(p k c_k\), and \(\mu (N) k c_k\) represent, respectively, the effect of thymic export, homeostatic proliferation and cell death on a naive T-cell clone already represented by k cells in the peripheral blood. Adding one cell via thymic export or homeostatic proliferation moves one clone from the \(c_k\)-compartment to the \(c_{k+1}\)-compartment, while the death of one cell shifts one clone from the \(c_k\)-compartment to the \(c_{k-1}\)-compartment. We approximate the proliferation rate, p, as a constant, at which rate all cells of all clones of size k replicate via homeostatic proliferation. Proliferation reduces \(c_k\) and increases \(c_{k+1}\). Terms of the form \(\mu (N) k c_k\), where the IL-7 regulated death rate \(\mu (N)\) is given by Eq. 2, reduce \(c_k\) and increase \(c_{k-1}\).

In a recent study, we found that the mean-field approximation breaks down only when \(\gamma /\mu < 1/\varOmega \ll 1\); under these circumstances, the total population is proliferation driven and the quasistatic configuration is \(N \sim K\) and all \(c_{k} \sim 0\) except \(c_{N}\) (Xu and Chou 2018). Thus, we reasonably assume that \(\gamma (t) > \mu /\varOmega \) allowing the use of the mean-field equations 7 in the rest of this paper.

For a healthy aging human adult, the naive TCR repertoire is mostly comprised of small clones with the probability of finding large clones decreasing with clone size k. To numerically solve Eq. 7, we thus truncate the model at a maximum clone size \(M \gg 1\), beyond which the probability of finding a clone is assumed negligible. For our implementation of the truncation, please see “Appendix C.” In Fig. 3a, we examine the effect of the truncation clone size M, showing sufficient convergence of \(c_{10}\) at \(t = 40\) and 70 to fixed values when \(M \gtrsim 30\), which indicates that further inclusion of clones beyond \(c_{30}\) has little effect on the solution for \(t \lesssim 70\) years. For numerical simulations of Eq. 7 in this paper, we set \(M = 200\) to ensure minimal truncation errors.

Fig. 3
figure 3

Simulations of Eq. 7. a Effect of numerical truncation. We plot \(c_{10}(40)\) and \(c_{10}(70)\) as functions of M for \(10 \le M \le 100\). Compartment sizes are effectively fixed when \(M \gtrsim 30\). b Temporal evolution of \(c_k (t)\). We plot \(c_2 (t)\), \(c_{19} (t)\), and \(c_{59} (t)\). Each \(c_k (t)\) curve rises to a peak value and subsequently decreases. As k increases, \(c_k (t)\) decreases in magnitude, and the time at which it reaches the peak value is pushed back. Parameter values: \(\gamma _0 = 1.8 \times 10^{10}\), \(a = 0.044\), \(p = 0.18\), \(\mu _0 = 0.17\), \(\mu _1 = 0.04\), \(K = 10^{10}\), \(\varOmega = 10^{16}\). Initial values \(c_1(1) = 10^{11}\), \(c_0(1) = \varOmega - 10^{11}\), \(c_k(1) = 0\) for all \(k \ge 2\)

Figure 3b shows the temporal evolution of \(c_k (t)\) for \(k = 2\), 19, and 59. As k increases, the overall magnitude of the \(c_k (t)\) curve decreases, and the age at which \(c_k (t)\) peaks increases. For example, \(c_2 (t)\) peaks around \(t \lesssim 20\) years, and there are many fewer clones of exactly two cells in old age than at young ages. In contrast, \(c_{19}(t)\) peaks around age 55, and the numbers of clones that have exactly 19 cells are roughly the same between old and young ages, whereas the number of clones that have exactly 59 cells (\(c_{59} (t)\)) keeps increasing into old age.

The relatively earlier decline of \(c_{k}(t)\) with smaller k is expected, considering that rare clones are introduced into the peripheral circulation primarily by the thymus, which started to involute after birth. With increasing k, the influence of thymic export on \(c_{k}(t)\) decreases, whereas the dependence on homeostatic proliferation increases. Recalling that the rate of thymic involution is faster than the time scale for homeostasis to drive the clonal population toward equilibrium, the fast decline of the rare clone population leaves room for larger clones to expand.

To accompany the steady state \(N_{\mathrm{ss}}\), we compute analogous fixed-\(\gamma _0\) steady-state values of the full system, \(c_k^\mathrm{ss}\), in “Appendix D.” The steady states satisfy \(c_k^{\mathrm{ss}} \rightarrow 0\) as \(\gamma _0 \rightarrow 0\) for all \(1 \le k \le M\). We further show that in spite of the fact that \(c_k^{\mathrm{ss}} \rightarrow 0\), Eq. 7 asymptotically yields a positive total cell count \(N = \lim _{M \rightarrow \infty } \sum _{k=1}^M k c_k^{\mathrm{ss}} > 0\) as \(M \rightarrow \infty \), qualitatively consistent with Eq. 1. Moreover, we prove in “Appendix E” that solutions \(c_{k}(t)\) of the full nonautonomous system satisfy \(c_{k}(t) \rightarrow 0\) for all \(k \le M\), with arbitrarily large M, as \(t \rightarrow \infty \). This result is completely independent of the assumed functional forms of the proliferation and death rates, suggesting that manipulation of homeostatic regulatory mechanisms cannot prevent the extinction of small T-cell clones caused by decaying \(\gamma (t)\). We thus conclude that thymic involution dictates the age-related decline of the TCR diversity of the naive compartment.

2.3 Diversity of the Naive T-cell Repertoire

By computing the functions \(c_k\) that track the number of clones consisting of k cells, we should have sufficient information to evaluate the variation in naive TCR structural diversity over a lifetime. Expected naive TCR structural diversity or “richness” is the total number of distinct naive T cell clones present in the immune compartment, for which we define a threshold naive TCR richness diversity

$$\begin{aligned} R_q(t) = \sum _{k \ge q} c_k(t), \end{aligned}$$
(8)

where \(q \in \mathbb {N}\) is a threshold, so that the quantity \(R_q(t)\) represents the number of clones of size at least q present in the immune compartment at time t. \(R_q(t)\) is a generalization of \(R_1(t)\), which is typically defined as the richness of naive TCR diversity. A higher threshold \(q > 1\) may arise because of immune surveillance, in which small clones may evade detection, or effectiveness of antigen detection, in which small clones may have an insufficient probability of encountering their specific antigens.

Fig. 4
figure 4

(Color figure online) Simulation of threshold richness diversity. a\(R_q(t)\) as a function of t, for \(q = 1, 2, 3\). \(R_q\) peaks at later times as q increases. b\(\Delta ({\bar{R}}_q(t))\) for varying q, \(\mu _1\). Higher \(\mu _1\) correspond to more severe loss of T-cell clones in advanced age. c\(\Delta ({\bar{R}}_q)\) for varying q, K. Small values of q result in a lifetime decrease to \(R_q\), but larger values result in a lifetime increase. This is due to the fact that \(R_q\) peaks at later times as q increases. d\(\Delta ({\bar{R}}_1)\) for varying \(\mu _1\), K. Initial values \(c_0(1) = \varOmega - 10^{11}\), \(c_1(1) = 10^{11}\)\(c_k(1) = 0\) for \(k \ge 2\). Parameter values, when not varying: \(\varOmega = 10^{16}\), \(K = 10^{10}\), \(p_0 = 0.18\), \(\mu _0 = 0.17\), \(\mu _1 = 0.04\), \(a = 0.044\), \(\gamma _0 = 1.8 \times 10^{10}\)

As shown in Fig. 4a, \(R_q (t)\) increases at young ages, peaks at a mature age, and declines afterward. For our previous parameter values, the peak age of \(R_1(t)\) is approximately \(t \sim 16\). Higher q leads to older peak ages of \(R_q(t)\), consistent with the results in Fig. 3b, in which the number of larger clones peaks in old age.

To compare \(R_q (t)\) between the elderly and young, we adopt the same criterion as with total cell counts and compute window-averaged values of \(R_q(t)\) between ages 20 and 30 for the young and between ages 70 and 80 for the elderly. By defining \({\bar{R}}_{{y}}(q) \equiv \frac{1}{10} \int _{20}^{30} R_q(t) \mathrm{d} t\), \({\bar{R}}_{\mathrm{o}}(q) \equiv \frac{1}{10} \int _{70}^{80} R_q(t) \mathrm{d} t\), we quantify the loss of richness by computing its relative change:

$$\begin{aligned} \Delta ({\bar{R}}_{q}) \equiv {({\bar{R}}_{\mathrm{o}}(q) - {\bar{R}}_\mathrm{y}(q))\over {\bar{R}}_{{y}}(q)}. \end{aligned}$$
(9)

Using the same parameter values as shown in Fig. 4a, we plot \(\Delta ({\bar{R}}_{q})\) with respect to \(\mu _1\) and q in Fig. 4b and c. In Fig. 4b, \(\Delta ({\bar{R}}_{q})\) decreases monotonically with increasing \(\mu _1\), suggesting that upregulated death rate exacerbates the age-related loss of richness, and the impact is more significant for larger q. Figure 4c shows that when \(K=10^{10}\), \(\Delta ({\bar{R}}_{q}) < 0\) for \(q \le 4\). This decreasing trend of \(R_{q}\) generally agrees with the loss of diversity observed in recent experiments where measurements were available across multiple ages (Qi et al. 2014; Britanova et al. 2014). For \(q = 5, 6\), \(\Delta ({\bar{R}}_{q}) \approx 0\), and \(R_{q}\) is nearly unchanged between youth and advanced age. For \(q \ge 7\), \(\Delta ({\bar{R}}_{q}) > 0\), indicating higher \(R_{q}\) in old age. Generally, the lifetime decrease in \(R_q(t)\) occurs with small q, whereas for large q, the trend is reversed, in agreement with our discussion of Figs. 3b and 4a regarding peak ages. This phenomenon indicates that loss of diversity is primarily due to the extinction of rare clones, which is consistent with the observation made by Naylor et al. (2005). In contrast, the number of larger clones increases over time, leading to the lifetime increase to \(R_q(t)\) at higher q.

Recent TCR-\(\beta \) sequencing studies have attempted to estimate the change in the repertoire richness of the naive T-cells with age. Despite the difference in orders of magnitude regarding the total number of circulating naive T-cell clones, these studies agreed quantitatively in the ratio of the age-related loss of richness. For example, Britanova et al. (2014) estimated \(\sim 7 \times 10^6\) clonotypes in youth (ages 6–25) and \(\sim 2.4 \times 10^6\) in aged individuals (ages 61–66), a roughly 66% drop from the youth figure. Similar measurements were also reported by Qi et al. (2014), in which a two- to fivefold decline (i.e., a 50–80% drop) between youth (ages 20–35) and advanced age (ages 70–84) was observed. These results are quantitatively consistent with our computation of \(\Delta ({\bar{R}}_1)\) for \(K = 10^{10}\)\(10^{11.5}\) and \(0.03 \le \mu _1 \le 0.05\) in Fig. 4d, whereas the decline of \(R_{q}\) for \(q \ge 2\) is not as pronounced as in these experimental observations.

Also note that the loss of clonal richness is more severe than the decrease in the total cell count between young and aged individuals. In Fig. 4a, \(\Delta ({\bar{R}}_{1})\) changes between \(\sim - \,66\%\) and \(\sim - \,76\%\) for \(0.03 \le \mu _1 \le 0.05\) and \(K = 10^{10}\). In contrast, Fig. 1b shows that for the same parameter range, \(\Delta ({\bar{N}})\) varies from \(\sim - \,30\%\) to \(\sim -\, 62\%\). However, the figures also reveal that richness is relatively less sensitive to changes to the cellular death rate, compared to the total cell count. This outcome reflects the fact that homeostatic cellular death is uniformly random across the entire naive T-cell population. The drop in richness is due to cell death within small clones that drives these clones to extinction, as observed by Naylor et al. (2005). Increases to the cellular death rate do not cause as much additional clonal extinction as they do additional cellular extinction, as many surviving clones are too large to wipe out by the death of a few cells.

2.4 Sampling Statistics

Considering that naive T-cell richness is often assessed via small blood samples, let us next use the same framework to examine the relation between the detected clone sizes in small samples and the true clone sizes in the full organism. As before, denote by N the total number of naive T-cells in the human’s immune compartment and \(Y \le N\) the number of cells collected during sampling from among the N total. We assume that the N total cells consist of R distinct clones, which we number from 1 to R. In this section, we denote by \(c_k^N\) the mean number of clones of size k from among the N total cells in the full organism (denoted by \(c_k\) in the previous simulations) and by \(c_k^Y\) the mean number of clones of size k in the sampling of Y cells taken from the N total cells. Then the expectation of \(c_k^Y\), denoted by \({\mathbb {E}}[c_k^Y]\), is:

$$\begin{aligned} {\mathbb {E}}\left[ c_k^Y\right] = \sum _{j=1}^R j P\left( c_k^Y = j\right) , \end{aligned}$$
(10)

where \(P\left( c_k^Y = j\right) \) represents the probability that there are precisely j clones of size k in the sampling. Then \({\mathbb {E}}[c_k^Y]\) may be expressed explicitly in terms of the \(c_k^N\) as:

$$\begin{aligned} {\mathbb {E}}\Big [c_k^Y\Big ]&= \sum _{l=k}^R \frac{1}{\left( {\begin{array}{c}N\\ Y\end{array}}\right) } c_l^N \left( {\begin{array}{c}l\\ k\end{array}}\right) \left( {\begin{array}{c}N-l\\ Y-k\end{array}}\right) . \end{aligned}$$
(11)

(See “Appendix F” for the detailed proof.) The collection of expressions given by Eq. 11 for \(k = 1, 2, \ldots , R\) yields a linear system of equations solvable for \(c_k^N\), using sampled data for the quantities \({\mathbb {E}}[c_k^Y]\). More specifically, if we define the vectors \(\mathbf {{\widehat{E}}} := ({\mathbb {E}}[c_1^Y],{\mathbb {E}}[c_2^Y],\ldots ,{\mathbb {E}}[c_R^Y],)\) and \({\mathbf {E}} := (c_1^N,c_2^N,\ldots ,c_R^N)\), Eq. 11 can be written as \(\mathbf {{\widehat{E}}} = \mathbf {A E}\), where \({\mathbf {A}}\) is a constant matrix that has nonzero elements only in the upper triangle, with nonzero diagonal entry \(\frac{1}{\left( {\begin{array}{c}N\\ Y\end{array}}\right) } \left( {\begin{array}{c}N-k\\ Y-k\end{array}}\right) \) in position (kk). The equation can always be solved uniquely for \({\mathbf {E}}\) given \(\mathbf {{\widehat{E}}}\). Thus, the full size distribution \({\mathbf {E}}\) can be uniquely reconstructed from the expected mean sample size distribution \(\mathbf {{\widehat{E}}}\) measured experimentally, provided that the latter can be reliably estimated through a sufficient number of repeated samplings.

In Fig. 5a, we use Eq. 11 to compute \({\mathbb {E}}[c_k^Y]\) from simulated \(c_k^N\), comparing the predicted sampling results of the richness \(R_1(t)\) for varying choices of Y. The results indicate that each decrease by one order of magnitude to the sample size results in a decrease in \(R_1(t)\) by roughly the same order of magnitude to the predicted diversity, except between the full sample and one-tenth of the sample (\(f = 10^{-1}\)), where the decrease in \(R_1(t)\) is less than one order of magnitude. Predictions of diversity vary with sample size, and small samples do not result in accurate measurements of diversity.

In Fig. 5b, we examine how sampling may affect the diagnosis of the age-related TCR richness decline \(\Delta \left( {\bar{R}}_q \right) \) defined in the previous subsection. We find \(\Delta \left( {\bar{R}}_q\right) \), which is negative, increasing with decreasing sampling fraction f, revealing that sampling causes an underestimate of the richness decline. As previously discussed, the decline of TCR richness in old age is primarily due to the extinction of small clones. Since small clones often evade detection during sampling, their extinction is largely unaccounted for, leading to lessened reduction in the richness measure. When f is very small, most of the small clones have escaped detection; thus, decreasing f further does not change \(\Delta \left( {\bar{R}}_q\right) \). Moreover, we note that \(\Delta \left( {\bar{R}}_1\right) \), which represents the case in Fig. 5a and is the most straightforward measure for age-related loss of TCR richness, converges from \(-\,73\%\) for the full sample, to \(-\,59\%\) for a sampling fraction \(f \lesssim 10^{-2}\), which is close to the value of \(\Delta \left( {\bar{R}}_3\right) \) for the full sample. This reaffirms our discussion in the previous subsection that a threshold \(q > 1\) may arise during the process of sampling. The results here indicate that when only a small fraction of a T-cell population is used to measure \(\Delta \left( {\bar{R}}_1\right) \), clones fewer than three cells largely evade detection, yielding a result equivalent to \(\Delta \left( {\bar{R}}_3\right) \) of the full sample, which underestimates the actual decrease in the TCR richness. Also note that the convergence of \(\Delta \left( {\bar{R}}_1\right) \) for \(f \lesssim 10^{-2}\) corresponds to the proportional downscaling of \(R_1(t)\) in Fig. 5a with decreasing f. For larger q, \(\Delta \left( {\bar{R}}_q\right) \) does not converge until f is lower, indicating that \(R_q(t)\) does not downscale proportionally until the sample fraction is very small.

Fig. 5
figure 5

(Color figure online) Comparison of actual and sampled richness. a True lifetime \(R_1\), as well as the expected \(R_1\) that result from extracting 10\(\%\), 1\(\%\), and 0.1\(\%\) of the total cell count for sampling. (\(Y = f \times N\), with \(f = 10^{-1}\), \(10^{-2}\), \(10^{-3}\).) Except between the full sample and \(f = 10^{-1}\), each decrease in the sample size by one order of magnitude results in a decrease to the expected \(R_1\) by approximately one order of magnitude. b The ratio of age-related TCR richness decline \(\Delta \left( {\bar{R}}_q\right) \) as a function of sampling fraction f for clone size thresholds \(q = 1\)–5. As f decreases, the value of \(\Delta \left( {\bar{R}}_q\right) \) increases, indicating a lower estimate of the TCR richness decline. When f is very small, \(\Delta \left( {\bar{R}}_q\right) \) becomes insensitive to further decreases to f. Parameter values used: \(\gamma _0 = 1.8 \times 10^{10}\), \(a = 0.044\), \(p = 0.18\), \(\mu _0 = 0.17\), \(K_0 = 10^{10}\), \(\varOmega = 10^{16}\), \(\mu _1 = 0.04\). Initial values \(c_0(0) = \varOmega \), \(c_k(0) = 0\) for \(k \ge 1\)

3 Discussion

We have formulated a model of lifetime human naive T cell population dynamics, which traces T cell lineages on the level of individual clones. It accounts for exponentially decaying lifetime thymic export, a constant rate of cellular proliferation, and variable cellular death rate that adjusts to present cell counts and the availability of survival resources. It depicts the generation of the naive T cell pool in early life via thymic export, and long-term maintenance of the population via peripheral turnover after thymic export has waned. Values of most of the model’s parameters can be found in the previous literature on humans, while the few exceptions are obtained by fitting some basic results of the model, such as age-related T cell loss, to experimental observations. Our analysis serves two important purposes: to map the thymic machinery, identifying which components do and do not contribute to age-related cellular loss, and then to interpret the nuanced role of that cellular loss in immunosenescence. While our results are intended for describing human aging, our approach can be adapted and interpreted to mice where thymic output plays a larger role in sustaining the naive T cell pool and where we might expect the diversity to be more immediately sensitive to changes in thymic output rates. In either case, we have found that if thymic export is assumed to decay exponentially to zero, then all compartments \(c_k(t)\) (with \(1 \le k \le M\)) deplete as \(t \rightarrow \infty \), independent of essentially any restrictive assumptions about the homeostatic proliferative mechanism in the periphery. Concretely, for any choice of proliferation and death rates \(p(N), \mu (N)\), that satisfy \(p(0), \mu (0) > 0\) and the choice \(\gamma (t) = \gamma _0 {\mathrm{e}}^{-at}\) with \(\gamma _0, a > 0\), there exists a sufficiently small \(\delta > 0\) guaranteeing \(c_k(t) \rightarrow 0\) as \(t \rightarrow \infty \) for all \(1 \le k \le M\), provided that \(\sum |c_k(1)| \le \delta \). Although this result only guarantees that trajectories \(c_k(t)\) started sufficiently close to zero converge to zero, simulation indicates that the basin of attraction to this “zero state” is actually quite large. In fact, for the typical initial conditions used throughout this paper, simulation suggests convergence of all compartments \(c_k\) to zero in infinite time.

Although it takes an extremely long time to deplete all \(c_k\) compartments for \(1\le k\le M\), the initial phase of this process can still cause significant loss of T cell diversity in aging individuals within a human lifespan. Most importantly, we find that the T cell loss driven by exponentially diminishing thymic export alone is robust against any assumptions about the homeostatic proliferative mechanism in the periphery that depend uniformly on the population size N, as this outcome is universal for all functional forms of \(p(N), \mu (N)\). Even a particularly strong homeostatic mechanism (say, one with \(p(0) \gg \mu (0)\)) cannot rescue a plunging diversity. This, in turn, suggests that in searching for treatments of age-induced loss of diversity, efforts should be directed at the thymus, in particular to maintaining thymic productivity into advanced age. In reality, heterogeneity can arise in the rates of peripheral proliferation and thymic output for naive T cells of distinct TCR expressions, due to differentiated responses to various growth factors in the periphery (Desponds et al. 2015; Lythe et al. 2016) and disparate sequencing frequencies in the thymus (Marcou et al. 2018). Even for naive T cells of identical TCR expressions, the peripheral proliferation rate decreases due to telomere shortening with each cell division (Weng et al. 1995; Hodes et al. 2002). Heterogeneity among different TCR expressions may provide a fitness advantage for certain clones and allow them to survive relatively longer than others when the TCR diversity is plunging. Telomere shortening likely makes older clones more easily replaced by newer ones, increasing the turnover rate of distinct TCR clones. Nevertheless, we do not expect such heterogeneity to qualitatively change our results here and rescue the diminishing TCR diversity caused by thymic involution.

Moreover, we compare the real-time simulations and the quasisteady-state solutions of the total cell count, as well as the number of distinct clones, over the course of age-related thymic output erosion. We find that our simulation results keep lagging behind the quasi steady-state solutions, suggesting that the erosion time scale of thymic output is faster than the time scale for the population dynamics to relax toward a steady state. Mathematically, this result reveals that the evolution of the T cell population within the human lifespan is a rather dynamical phenomenon, which may not be well described by quasistatic solutions, requiring evaluation of the fully nonautonomous system. Biologically, our results indicate that the loss of T cell diversity is a delayed response to thymic involution, and assessment of thymic function may predict the health of the immune system.

Although peripheral division cannot salvage the T cell population on a long time scale, higher basal proliferation rates may at least delay the erosion of the T cell compartment, sustaining acceptable effectiveness of the immune system within the human lifespan (Naylor et al. 2005). We assumed a constant lifetime rate of cellular proliferation, but alternative research suggests that proliferation rates may increase with age (Naylor et al. 2005). In light of this finding, we briefly inspect the effect of increased proliferation rates at advanced ages on cellular and clonal loss by modifying p(N) and \(\mu (N)\) in Eq. 7. To prevent unbounded growth caused by p(N) exceeding \(\mu (N)\) as \(N \rightarrow +\infty \), we adopt a logistic growth rate, \(p(N,t) = p(t)(1-N/K)\), where growth is bounded by the negative term; a discrete increase in the proliferation rate is incorporated in \(p(t) = p_0(1 + r H(t-T))\), with \(p_0 > 0\) the early-life basal cellular proliferation rate, and H(t) the Heaviside function, with T the age at which the rate increases. The constant r specifies the increase to the proliferation rate. The death rate is set to a constant value (\(\mu (N) = \mu _0 > 0\)) for simplicity, omitting the N-dependent term in Eq. 2 that practically becomes negligible compared to the negative term of p(N) as \(N \rightarrow +\infty \). By varying r, simulation under these alternate hypotheses indicates that increased basal proliferation rates do lead to notably higher total cell counts (Fig. 6a), but have little effect on diversity (Fig. 6b). These results further affirm that expansion of peripheral proliferation is unlikely to rescue the eroding naive T cell diversity, despite the increased cell count. If diversity loss is the main cause of immunosenescence (still a debatable topic in the medical community), peripheral proliferation may not be the sensible target of treatments.

The increased N(70) and nearly unchanged \(R_{1}(70)\) in Fig. 6 imply that the decline of T cell diversity in old age may appear more dramatic if the diversity is measured in terms of the frequency of distinct TCR sequences among the cycling cells, which corroborates the explanation that an increase in the proliferation rate in old age leads to a sharp decrease in T cell diversity (Naylor et al. 2005). Previous models have shown that even sharper decline of T cell diversity can be induced by fitness selection, where certain clonotypes increase their fitness in old age possibly due to higher avidity to self-antigens (Johnson et al. 2012, 2014; Goronzy et al. 2015a).

Although the boosts to the total cell count through artificial expansion of the proliferative mechanism are unable to replenish the declining TCR diversity in the naive T cell pool, it is possible that the impact is less severe than the decaying richness would have indicated, considering that most of the extinct clones are originally small clones, which may be much less effective than larger clones. In this regard, the viability of treating immunosenescence by expanding peripheral proliferation depends on the elucidation of the T cell pool’s effectiveness clone size—that is, the size a clone must have attained to effectively guarantee activation of the clone when its cognate antigen infiltrates the organism. The effectiveness clone size is intrinsically linked to true functional TCR diversity; if we can identify a threshold integer \(q^*\), such that clones of size at least \(q^*\) are reliably activated in the presence of their cognate antigen(s), but that smaller clones are not, then \(R_{q^*}(t)\) is naturally the most useful measure of diversity, because it accounts for precisely those clones actively participating in the adaptive immune mechanism. The larger the “correct” choice of \(q^*\) is, the more effective treatments to boost cellular proliferation in the periphery will be. Our model directly yields the number of clones of a particular size, making it straightforward to include or exclude clones below a certain cell count, should such a threshold exist and be identified.

Fig. 6
figure 6

(Color figure online) Total cell count and richness with rise in proliferation. Simulation of Eq. 7 with exponentially decaying thymic export, and peripheral homeostasis described by time-varying logistic growth. We use the thymic export rate \(\gamma (t) = \gamma _0 {\mathrm{e}}^{-at}\), peripheral death rate \(\mu (N) = \mu _0 > 0\), and peripheral proliferation rate \(p(N,t) = p(t)(1 - (N/K))\), with \(p(t) = p_0 (1 + r H(t-T))\). Here, H(t) represents the Heaviside function with jump at \(t=0\). The constant r determines the magnitude of the increase to the basal proliferation rate, and T represents the time at which the jump occurs. We take the jump to occur at varying ages. a\(\Delta ({\bar{N}})\) with jump at ages \(T=30\) and 70, for varying r. (Curve corresponding to \(T=50\) is omitted due to close similarity to \(T=30\) curve.) Raising the basal proliferation rate diminishes cellular loss in advanced age, with sufficiently high values of r producing a lifetime increase in total cell counts. The positive steady-state solution of the autonomous total cell ODE, \(\mathrm{d}N/\mathrm{d}t = \gamma _0 + p_0(1 - N/K) - \mu _0 N\), is given by \(N^* = (K/2) (1 - \mu _0/p_0 + \sqrt{(1 - \mu _0/p_0)^2 + 4 \gamma _0 / K p_0})\) and can be seen to satisfy \(\partial N^*/ \partial p_0 > 0\) if \(\gamma _0 < K \mu _0\), suggesting that increases to the basal proliferation rate are likely to increase the total cell count. b\(\Delta ({\bar{R}}_1)\) with \(T=30, 50,\) and 70, for varying r. Increases to the basal proliferation rate do mitigate diversity loss, but the effect is minor and potentially insignificant. Increases to the basal proliferation rate increase \(c_{k+1}\) due to a decrease in \(c_k\), preserving additional diversity, but the lifetime diversity loss is still observed, even when proliferation rates are high enough to generate a lifetime increase to the total cell count. Fixed parameter values: \(\gamma _0 = 1.8 \times 10^{10}\), \(a = 0.044\), \(p_0 = 0.18\), \(\mu _0 = 0.17\), \(K_0 = 3 \times 10^{11}\), \(\varOmega = 10^{16}\). Initial values: \(c_0(1) = \varOmega - 10^{11}\), \(c_1(1) = 10^{11}\)\(c_k(1) = 0\) for \(k \ge 1\). Equation 7 is truncated at \(k=200\)

The effectiveness clone size is also significant to the question of whether diversity loss is the driving factor in immunosenescence. Using the parameter values that we found in the literature, \(R_q(t)\) decreases for \(q \le 4\) from youth to advanced age, stays nearly constant for \(q = 5, 6\), and increases for \(q \ge 7\). The extinction of small clones allows the surviving clones to expand in size, leading the richness of large clones to increase in old age. If the minimal size for a T cell clone to effectively respond to antigens is large, the diversity of such “effective” clones may actually increase with age, strengthening the immune response. Therefore, either the minimal clone size required for effective immune response is low, or the weakened immune response in old age is caused primarily by other mechanisms. For example, functional deficiencies acquired by naive T cells in aging are one possible alternative cause of the weakened immune response. Such functional deficiencies have been studied heavily in mouse models, but research in humans is still lacking (Appay and Sauce 2014). Diminished naive T cell effector responsiveness and proliferative capacity have been observed in aged mice (Moro-García et al. 2013). It is possible that similar changes occur in humans. Conversely, experiments on mice have directly shown that loss of TCR diversity does have an actively detrimental effect on immune responsiveness (Yagger et al. 2008), supporting the notion that loss of TCR diversity is a significant contributor to immunosenescence.

Our model illustrates the feasibility of several different scenarios, in which loss of naive T cell diversity contributes to immunosenescence on drastically different levels. While we consider only the naive T cell population, memory T cells expand upon encountering antigens over the lifespan of an individual, eventually outnumbering naive T cells at around 30–40 years of age (Saule et al. 2005). Memory T cells rely on a mixture of IL-7 and IL-15 as survival signals (Rubinstein et al. 2008), which may reduce the amount of IL-7 available to naive T cells. However, memory T cells are observed to down-regulate expression of IL-7 receptor CD127 when the local concentration of IL-7 is low, thus preserving the naive repertoire (Surh and Sprent 2008). In addition, distinct population dynamics have been observed among various subsets of memory T cells, such as CD4\(^+\)/CD8\(^+\) central memory/effector memory/terminally differentiated T cells (Saule et al. 2005), as well as among memory T cells expressing naive phenotypes that accumulate with aging but do not contribute to the capacity to respond to new infections (Pulko et al. 2016). While a more complete picture of T cell population dynamics would include the memory compartment, our model may serve as a first step in that direction, in which each subset of memory T cells may be added. Moreover, our model indicates that the effectiveness clone size and crossreactivity in vivo are valuable pieces of missing information, the elucidation of which would allow for the identification of effective options to treat immunosenescence.

4 Summary and Conclusions

We have simulated the time evolution of the functions \(c_k(t)\), which represent the number of naive T cell clones of size k present in a human’s immune compartment at time t. We determined that under essentially any realistic assumptions about homeostatic proliferation and death, all clones deplete in infinite time if thymic export is assumed to decay exponentially. This implicates thymic export as a fundamental cause of age-associated diversity loss. We simulated our model under the assumption that a carrying capacity is regulated by homeostatic proliferation and death through N-dependent rates. We found that the manipulation of homeostatic proliferation and death rates, which may notably raise the carrying capacity and thus the total cell count, was unable to save falling diversity as an individual ages. It affirms the vital role of thymic output in age-related diversity loss and indicates that boosting the proliferation rate is unlikely an effective solution. However, if only clones of large size are sufficiently effective in the immune response, boosting proliferation rates might raise average clone sizes and help to mitigate the effects of lost diversity. We simulated “threshold richness diversity,” \(R_q(t)\), which counts the total number of clones of size q or larger. We found that by increasing q, the trajectory of \(R_q(t)\) changes from decreasing to increasing over a human lifetime. From this trend, we concluded that if only large clones are effective, the effective richness would actually increase with age, suggesting that it is important to identify the minimal effective clone size in order to determine whether the loss of TCR diversity is the primary driving mechanism of the immune dysfunction seen in advanced age. Lastly, we derived a one-to-one mapping between the full-sample diversity \(c_k^N\) of N cells and the expected measurement of diversity \({\mathbb {E}}[c_k^Y]\) in samples of Y cells. We found that the probability of detecting small clones shrank significantly with small sample sizes, which could potentially skew small sample statistics. In particular, we show that small samples tend to underestimate the age-related loss of T cell richness diversity. Our formulation provides a rigorous method for accurately inferring the statistical distribution of clonal sizes from small sample measurements.