1 Introduction

1.1 Size effect and the capital asset pricing model

The size of a stock is measured by its market capitalization, or market cap: current stock price multiplied by the number of shares. For a stock portfolio, its market cap is defined as weighted sum of market caps of constituent stocks, with weights equal to the portfolio weights. The size is a very important fundamental characteristic of a stock or a portfolio.

It is observed that small stocks have higher returns but higher risk than large stocks. An explanation is that small companies are in their dynamic growth phase and they have higher growth potential relative to large mature companies, but small companies are more vulnerable to failure and bankruptcy. Some researchers claim that even after adjusting for risk, small stocks have higher returns than large stocks. This adjustment can be made rigorous within the framework of the Capital Asset Pricing Model (CAPM). Take a portfolio of stocks with total returns (including dividends, not inflation-adjusted) Q(t) during time t. Here, we operate in a discrete-time setting. Compare it with risk-free returns from short-term Treasury bills R(t). An investor deserves premium reward for taking risk and investing in stocks rather than in safe Treasury bills. We calculate this equity premium P(t) by subtracting \(P(t) = Q(t) - R(t)\). Next, we compute this equity premium \(P_0(t)\) for a market portfolio, used as a benchmark. An example of such benchmark is the Standard & Poor (S&P) 500, a widely used benchmark for large U.S. stocks. Run a linear regression:

$$\begin{aligned} P(t) = \alpha + \beta P_0(t) + \varepsilon (t). \end{aligned}$$
(1)

The parameter \(\beta \) shows market exposure, how much risk the portfolio is exposed to because of fluctuations in the benchmark. The parameter \(\alpha \) shows excess return, how much can one earn from this portfolio on top of this market return. They are often called by their Greek names: beta and alpha. The residual \(\varepsilon (t)\) is called unsystematic risk which can be eliminated by diversification. According to the CAPM, \(\alpha = 0\) and the only risk which deserves rewards is the systematic risk (due to market exposure) since other risk can be diversified away. The CAPM was proposed in the classic article Sharpe (1964). Subsequent research cast doubt on the consistency of CAPM with actual market data. In particular, Banz (1981) found that taking a portfolio of small stocks generates positive \(\alpha \). That is, small stocks have higher returns than large stocks even after adjusting for market exposure (which is greater than 1 for small stocks). Subsequent classic article Fama and French (1993) confirmed this. Further research on the size effect can be found in Semenov (2015) and van Dijk (2011) and references therein. See also critique of CAPM in Fama and French (2004).

1.2 Our model

We study dependence of \(\alpha , \beta , \sigma \) (the standard deviation of \(\varepsilon (t)\)) on the size, measured by the market cap S(t), or, more precisely, by relative size to \(S_0(t)\):

$$\begin{aligned} C(t) = \ln \frac{S_0(t)}{S(t)}. \end{aligned}$$
(2)

We would like to find functions \(\alpha , \beta , \sigma \) of C such that for standardized white noise terms Z(t), with \(\mathbb E[Z(t)] = 0\) and \({\mathbb {E}}[Z^2(t)] = 1\):

$$\begin{aligned} P(t) = \alpha (C(t)) + \beta (C(t))P_0(t) + \sigma (C(t))Z(t). \end{aligned}$$
(3)

This allows us to quantify how exactly \(\alpha \) and \(\beta \) (and \(\sigma \) the standard deviation of unsystematic risk) depend on the relative size measure. We then consider a version of the Eq. (3) in which equity premia are replaced by price returns, that is, returns due to price changes (or, equivalently, market cap changes). That is, we replace P(t) and \(P_0(t)\) with \(\ln (S(t+1)/S(t))\) and \(\ln (S_0(t+1)/S_0(t))\), ignoring for now risk-free rate and dividends. This gives us:

$$\begin{aligned} \ln \frac{S(t+1)}{S(t)} = \alpha (C(t)) + \beta (C(t))\ln \frac{S_0(t+1)}{S_0(t)} + \sigma (C(t))Z(t) \end{aligned}$$
(4)

with C(t) from (2). This Eq. (4) includes only market caps of benchmark \(S_0(t)\) and the portfolio S(t). This time series equation or its continuous-time version, a stochastic differential equation, allows us to model S(t) and \(S_0(t)\) separately from dividends and risk-free returns. On top of this, we add the Eq. (3). Surprisingly, market data gives us almost the same functions \(\alpha , \beta , \sigma \) in (3) and (4), but with differing coefficients. Moreover, the white noise terms in (3) and (4) are almost perfectly (more than 99%) correlated. This does not follow from any theoretical considerations, and seems simply a piece of good luck which simplifies analysis.

We also adapt (3) and (4) for continuous time: Equity premium P(t) becomes then \(\mathrm {d}\ln V(t)\), where V(t) is the wealth process adjusted for risk-free returns. More precisely, \(V(t) = U(t)/U^*(t)\), where U(t) is the wealth accumulated from investing \(U(0) = 1\) in stock portfolio and reinvesting dividends, while \(U_*\) is a similar wealth process from investing in Treasury bills. Then (3) takes the form

$$\begin{aligned} \mathrm {d}\ln V(t) = \alpha (C(t))\,\mathrm {d}t + \beta (C(t))\,\mathrm {d}\ln V_0(t) + \sigma (C(t))\,\mathrm {d}W(t), \end{aligned}$$
(5)

where \(V_0\) is the adjusted wealth process for the benchmark, and W is a Brownian motion: A real-valued continuous process with \(W(t) - W(s) \sim {\mathcal {N}}(0, t-s)\) independent of \(W(u),\, 0 \le u \le s\), for all \(0 \le s < t\). This Brownian motion can be viewed as a zoomed out random walk with very small but very frequent steps. For simplicity, we assume that \(\ln V_0(t)\) is also a Brownian motion with positive drift (which captures the tendency of long-term stock returns to be greater than risk-free returns). Although equity premia have heavy tails and thus are not well-described by the Gaussian distribution, the Brownian motion provides a simple first approximation. Similarly, Eq. (4) becomes

$$\begin{aligned} \mathrm {d}\ln S(t) = \alpha (C(t))\,\mathrm {d}t + \beta (C(t))\,\mathrm {d}\ln S_0(t) + \sigma (C(t))\,\mathrm {d}W(t), \end{aligned}$$
(6)

with C(t) from (2). By the above remark, we can assume that the driving Brownian motion W from (5) and (6) is the same.

1.3 Dependence upon the size measure

What is the dependence of market exposure \(\beta \), excess return \(\alpha \), and standard deviation of unsystematic risk \(\sigma \) upon relative size?

Our statistical analysis does not yield conclusive results. The white noise tests for Z unfortunately fail. Thus we cannot claim that a model (3),  (4) passes goodness of fit tests. The most resonable guess for \(\alpha , \beta , \sigma \) seems to be:

$$\begin{aligned} \alpha (c), \ \beta (c),\ \sigma (c) = {\left\{ \begin{array}{ll} \alpha _+c,&{}\quad 1 + \beta _+c,\ \sigma _+c,\quad c \ge c_+\\ -\alpha _-|c|^{1/2},&{}\quad 1 + \beta _-c,\ \sigma _-|c|^{1/2},\quad c \le -c_-. \end{array}\right. } \end{aligned}$$
(7)

Here, \(\alpha _{\pm }, \beta _{\pm }, \sigma _{\pm }, c_{\pm }\) are positive constants. Our data analysis does not allow us to suggest these functions in a neighborhood of zero. This is due to the fact that observed C(t) in our data do not come very close to zero.

1.4 Stochastic portfolio theory

The continuous-time version is useful because we can use stochastic calculus and immerse these models in Stochastic Portfolio Theory (SPT). This is a framework for stock market modeling which does not depend on particular models. It suggests overweighing small stocks and continuous rebalancing. This means investing in small stocks in proportion greater than their market cap would dictate. One example of this is taking the equal-weighted portfolio. The central result of SPT: under mild conditions, diversity (no stock dominates the entire market) and sufficient intrinsic volatility, such portfolios outperform the market portfolio, which invests in each stock in proportion to its market cap; see Banner and Fernholz (2008), Fernholz and Karatzas (2005), Fernholz et al. (2005). This theory has solid theoretical basis, and is consistent with observed data. For references, see the book Fernholz (2002) and a more recent survey Fernholz and Karatzas (2009). SPT is based on the same observation as above: small stocks have higher return and risk than large stocks.

Although we mentioned above that SPT is model-independent, there are some SPT models which attempt to capture this observation: competing Brownian particles, where logarithms of market caps evolve as Brownian motions with drift and diffusion coefficients dependent on their current ranks relative to other particles, Banner et al. (2005), Banner et al. (2011), Karatzas and Sarantsev (2016); their generalizations with jumps, or with dependence on both name and rank (so-called second-order models): see articles Banner et al. (2011), Barnes and Sarantsev (2020), Shkolnikov (2011); volatility-stabilized models, where \(\ln S(t)\) are modeled by stochastic differential equations (SDE) with volatility inversely proportional to S(t), Pal (2011), and their generalizations, Pickova (2014). As the number of stocks tends to infinity, the limiting behavior of competing Brownian particles and volatility-stabilized models is studied in Chatterjee and Pal (2010), Jourdain and Reygner (2015), Shkolnikov (2012), Shkolnikov (2013).

In this article, we recognize the difference between price and total returns, and model them separately as (3) and (4) for discrete time or (5) and (6) for continuous time. SPT is much more developed for continuous-time diffusive models based on SDE than for discrete-time, see Pal and Wong (2016). Thus it is reasonable to switch to continuous time in (5) and (6).

Stochastic Portfolio Theory deals with diversification benefits in the form of excess growth rate, and functionally generated portfolios. The standard assumption in SPT is that there are no dividends; that is, price and total returns are the same. But here this is no longer true. To model separately price returns (which drive the market capitalization processes) and total returns (which drive the wealth processes) requires an extension of the SPT, in particular the concept of functionally generated portfolios. This is left for future research.

1.5 Data analysis

We take real U.S. market data from Kenneth French’s Data Library online: market cap, price and total returns for equal-weighted portfolios made from size deciles (stocks split into top 10%, next 10%, etc. according to their size). This library contains data processed from original raw data from the Center for Research in Securities Prices (CRSP) at the University of Chicago, July 1926–June 2020 (84 years), monthly data.

We study models of n portfolios and the benchmark with market caps \(S_0, \ldots , S_n\), relative size measures \(C_0, \ldots , C_n\), and wealth processes \(V_0, \ldots , V_n\). The corresponding Brownian motions \(W_1, \ldots , W_n\) from (6) and (5) (recall that these two equations have the same Brownian motions) are assumed to be i.i.d. for simplicity (although our analysis shows this to be inconsistent with actual data). Consider market weights:

$$\begin{aligned} \mu _i(t) = \frac{S_i(t)}{S_0(t) + \cdots + S_n(t)},\, i = 0, \ldots , n, \end{aligned}$$

representing the proportion of the ith stock in the overall market. Small stocks have smaller market weights. The market weight vector \({\mu } = (\mu _0, \ldots , \mu _n)\) is a Markov process on the n-dimensional simplex \(\triangle _n\). If this market weight vector converges to a unique stationary distribution as \(t \rightarrow \infty \) in the total variation norm (see definitions in Sect. 4), then we call the system stable. In this article, we state and prove that our market model is stable under certain conditions on \(\alpha , \beta , \sigma \). We also investigate whether collisions: \(S_i(t) = S_j(t)\) happen (when small stocks grow and overtake large stocks). Finally, at each time rank weights from top to bottom:

$$\begin{aligned} \mu _{(0)}(t) \ge \cdots \ge \mu _{(n)}(t). \end{aligned}$$

The capital distribution curve is the plot of ranked weights vs their ranks on the double logarithmic scale:

$$\begin{aligned} \left( \ln k, \ln \mu _{(k)}(t)\right) ,\quad k = 0, \ldots , n. \end{aligned}$$

For the actual market data, this plot is linear, except at the endpoint. See Fernholz (2002), Figure 5.1 for the capital distribution curve for the CRSP stock universe for 8 days, chosen to be the last days (December 31) of the eight decades – December 31, 1929; December 31, 1939; ...December 31, 1999. Strikingly, these 8 curves almost completely coincide. In other words, this curve is stable in time. Previously mentioned competing Brownian particles and volatility-stabilized models reproduce this feature, see Chatterjee and Pal (2010) and Pal (2011) respectively. In this article, we establish this property for our model using both theoretical analysis and simulations.

1.6 Our contributions

We propose a new model within SPT which is consistent with CAPM and captures the size effect. This model is consistent with long-term market data (although not fully consistent) and captures its important features. We study its properties: long-term stability, collisions of particles, market weights, and the capital distribution curve.

1.7 Organization of the article

In Sect. 2, we describe our data in detail and analyze it. This provides a motivation for continuous-time modeling. In Sect. 3, we introduce a system of SDE as in (6) and (5), and prove existence and uniqueness of the solution. We also discuss applicability of SPT, since now size and wealth processes are different. In Sect. 4, we show stability under certain conditions and see how they are compared with data analysis in Sect. 2. In Sect. 5, we replicate the linear capital distribution curve. Finally, Sect. 6 is devoted to conclusions and suggestions for future research. The “Appendix” contains some data analysis for existing exchange-traded funds based on size. The code and data can be found on GitHub: asarantsev/CAPM-SPT.

2 Data analysis

2.1 Data description

Our main data source, as mentioned in the Introduction, is the CRSP database at the University of Chicago, taken from Kenneth French’s online data library. We take from this library price and total monthly returns for equally-weighted portfolios made of stock in each decile from July 1926 to June 2020. We analyze only the top 8 deciles, corresponding to large-cap, mid-cap, and small-cap stocks. The two bottom deciles are micro-cap stocks which we exclude. Deciles are created based on the market cap at the end of each June. After a year, these deciles are reconstituted. The data library also has average market capitalizations for the end of each month for each decile. Risk-free monthly returns are computed as \(R(t) = \ln (1 + r(t)/12)\), where r(t) are monthly data for short-term Treasury bills taken from the Federal Reserve Economic Data (FRED) website: January 1934–June 2020 series TB3MS and July 1926–December 1933 (discontinued) series M1329AUSM193NNBR.

Stock and portfolio returns can be computed in two ways: arithmetic A and geometric G, which are related as follows: \(G = \ln (1 + A)\). Arithmetic returns are quoted regularly for their practicality in computing portfolio returns such as in the data library which we used as our data source. The arithmetic return of a portfolio is equal to the weighted average of arithmetic returns of constituent stocks, but here we convert arithmetic returns to their geometric versions according to the above formula.

The advantage of geometric over arithmetic returns for our research is apparent when combining compound interest rates. For example, 20% and 30% arithmetic returns combined gives 56% returns, whereas geometric returns combine for 50% as expected.

Our time unit is equal to a month, of total \(T = 12\cdot (2020-1926) = 1128\) months, \(t = 0\) corresponding to June 1926, \(t = T\) corresponding to June 2020.

Fig. 1
figure 1

Beta \(\beta _k(n)\) vs \(C_k(n)\) for top-decile benchmark

For a decile k (with \(k = 1\) for the top decile, \(k = 8\) for the bottom decile), its average market cap at end of month t is denoted by \(S_k(t)\), the price returns are \(Q_k(t)\), and the equity premium (total returns including dividends minus risk-free returns) is \(P_k(t)\).

2.2 Beta analysis for price returns

Split \(T = 1128\) months into \(N = 47\) two-year, \(K = 24\)-month time periods. For each period \(n = 1, \ldots , N\) and each decile \(k = 1, \ldots , 8\), except the top one, which we use as the benchmark (this top decile roughly corresponds to Standard & Poor 500 constituent stocks), regress

$$\begin{aligned} Q_k(t) = \alpha _k(n) + \beta _k(n)Q_1(t) + \delta _k(t),\quad k = 2, \ldots , 8, \end{aligned}$$
(8)

where t is in this period, and \(\alpha _k(n), \beta _k(n)\) are intercept and slope (excess return and market exposure), found using ordinary least squares; and \(\delta _k(t)\) are residuals. Thus we compute beta \(\beta _k(n)\) for each decile from 2nd to 8th and each of 47 two-year periods. For size measure of the kth decile vs top decile, take

$$\begin{aligned} C_k(n) = \ln \frac{S_1(Kn)}{S_k(Kn)}, \end{aligned}$$

where \(S_1(Kn)\) and \(S_k(Kn)\) are average market capitalizations for the beginning of nth period. Then we plot \(\beta _k(n) - 1\) vs \(C_k(n)\) and \((\beta _k(n) - 1)/C_k(n)\) in Fig. 1. We plot \(\beta _k(n) - 1\) instead of \(\beta _k(n)\) since our benchmark for beta is 1: The beta for the top decile (which coincides with the benchmark) is 1.

2.3 Model suggestions for price returns

From Fig. 1(a), we get the suggestion that

\(\beta _k(n) - 1\) is proportional to \(C_k(n)\). This is confirmed by Fig. 1(b). This transformation helps make variance constant. We still cannot claim that these normalized quantities are i.i.d. white noise, since they fail white noise tests. Still, let us extract the trend in \(\beta _k(n)\) by replacing it with \(1 + \gamma C_k(n)\) for some coefficient. Next, comparing with (8), consider the residual \(Q_k(t) - (1 + \gamma C_k(n))Q_1(t)\). We make it dependent on n (the overall two-year period), not individual months t in this period, by taking the sum of geometric price returns \(Q_k(t)\) over t in this two-year period. This sum is equal to overall price returns \({\overline{Q}}_k(t)\) in this two-year period. Then we let:

$$\begin{aligned} \varepsilon _k(n) := {\overline{Q}}_k(n) - (1 + \gamma C_k(n)){\overline{Q}}_1(n),\quad k = 2, \ldots , 8;\ n = 1, \ldots , N. \end{aligned}$$
(9)
Fig. 2
figure 2

Residual \(\varepsilon _k(n)\) vs \(C_k(n)\) for top decile benchmark

We then plot \(\varepsilon _k(n)\) vs \(C_k(n)\) in Fig. 2(a), together with \(\varepsilon _k(n)/C_k(n)\) vs \(C_k(n)\) in Fig. 2(b). We see again in Fig. 2(a) that \(\varepsilon _k(n)\) depends on \(C_k(n)\) linearly, and in Fig. 2(b) that the variance becomes constant. Again, we cannot claim that \(\varepsilon _k(n)/C_k(n)\) are i.i.d. since white noise tests fail. But if we model this as i.i.d. \({\mathcal {N}}(\mu , \rho ^2)\), we get:

$$\begin{aligned} \frac{\varepsilon _k(n)}{C_k(n)} = \mu + \rho Z_k(n), \end{aligned}$$
(10)

where \(Z_k(n)\) are i.i.d. standard normal random variables. A generalization of this could be:

$$\begin{aligned} (Z_2(n), \ldots , Z_8(n)) \sim {\mathcal {N}}_7({\mathbf {0}}, \Sigma ) \end{aligned}$$
(11)

i.i.d. multivariate normal with mean zero vector and (not identity) covariance matrix with units on the main diagonal (that is, \(\mathbb E[Z_k(n)] = 0\) and \({\mathbb {E}}[Z_k^2(n)] = 1\)). Combining (9) and (10), we get:

$$\begin{aligned} Q_k(n) = (1 + \gamma C_k(n))Q_1(n) + \mu C_k(n) + \rho C_k(n) Z_k(n). \end{aligned}$$
(12)

We can estimate \(\gamma = 0.0045, \mu = 0.0069, \rho = 0.052\). This gives us (7), the top row.

Fig. 3
figure 3

8th decile (bottom) benchmark

2.4 Bottom decile as benchmark

If we repeat this analysis in previous subsection with benchmark 8th decile (which in our research is the bottom decile since we ignore the 9th and 10th deciles), we get: For \(\beta _k(n)\) in (8), we plot \(\beta _k(n) - 1\) vs \(C_k(n)\) in Fig. 3(a). In this case, dependence is also linear. Next, take \(\gamma \), the mean of these quantities. Create residuals similarly to (9):

$$\begin{aligned} \varepsilon _k(n) = Q_k(n) - (1 + \gamma \sqrt{|C_k(n)|})Q_8(n),\, k = 1, \ldots , 7,\, n = 1, \ldots , N. \end{aligned}$$
(13)

The plot of these residuals vs \(C_k(n)\), normalized by dividing by \(\sqrt{|C_k(n)|}\), is shown in Fig. 3(b). Making white noise test for \(\varepsilon _k(n)\), we fail to reject this hypothesis. Assume \(\varepsilon _k(n) \sim \mathcal N(\mu , \rho ^2)\). Combining this with (13), we get:

$$\begin{aligned} \begin{aligned} Q_k(n)&= (1 + \gamma \sqrt{|C_k(n)|})Q_8(n) + \mu \sqrt{|C_k(n)|} + \rho \sqrt{|C_k(n)|} Z_k(n),\\&\quad k = 1, \ldots , 7,\quad n = 1, \ldots , N;\\ Z_k(n)&= (\varepsilon _k(n) - \mu )/\rho \sim {\mathcal {N}}(0, 1)\quad \text{ i.i.d. } \end{aligned} \end{aligned}$$
(14)

This verifies (7), the bottom row. From (14), we see that linear functions from (12) are not the only possible and reasonable functions for \(\beta \). In Sect. 4, we state and prove results for general \(\alpha , \beta , \rho \). Our coefficients are \(\gamma = 0.12, \mu = 0.0055, \rho = 0.090\).

2.5 Data analysis for equity premia

Repeating similar analysis for equity premia instead of price returns, we get similar functions for both cases: top and bottom deciles as benchmarks. Moreover, point estimates for price returns and equity premia are close; \(\gamma = 0.045, \mu = 0.0017, \rho = 0.052\) for the top decile benchmark, and \(\gamma = 0.12, \mu = 0.0024, \rho = 0.088\) for the 8th decile benchmark. Thus functions \(\beta \) and \(\sigma \) for price returns and equity premia are the same. Unfortunately, for \(\alpha \) this is not true. We will have separate functions for price returns and equity premia. As noted in Sect. 1, white noise terms \(Z_k(n)\) for price returns and equity premia are almost perfectly correlated, since the Pearson correlation coefficient is greater than 99%. However, price returns and equity premia for the benchmark are not perfectly correlated. Thus noise terms for size and wealth processes are different.

2.6 Conclusions

Data analysis in previous subsections implies that excess return for small stocks is positive, and market exposure for small stocks is greater than 1. Large stocks have negative excess return and market exposure less than 1. In other words, small stocks are riskier than larger stocks, but they have higher returns even after adjusting for market exposure. We will accommodate this for continuous time in the next section.

3 A continuous-time model

3.1 Model description

Take a filtered probability space \((\Omega , {\mathcal {F}}, (\mathcal F_t)_{t \ge 0}, {\mathbb {P}})\) with the filtration satisfying the usual conditions (each \({\mathcal {F}}_t\) contains all \({\mathbb {P}}\)-null sets; that is, \(A \in {\mathcal {F}}_t\) with \({\mathbb {P}}(A) = 0\) and \(B \subseteq A\) implies \(B \in {\mathcal {F}}_t\); and the filtration is right-continuous, that is, \({\mathcal {F}}_t = \cap _{s > t}{\mathcal {F}}_s\) for all \(t \ge 0\)). All stochastic processes \(X = (X(t),\, t \ge 0)\) below are adapted, that is, X(t) is \(\mathcal F_t\)-measurable for every \(t \ge 0\). We say that a stochastic process \(B = (B(t),\, t \ge 0)\) is a standard Brownian motion if \(B(t) - B(s) \sim {\mathcal {N}}(0, t-s)\) is independent of \(\mathcal F_s\) for every \(0 \le s < t\).

Replace white noise terms \(Z_k(n)\) in (12) or (14) by standard Brownian motions \(W_k(t)\), or, more exactly, its differential \(\mathrm {d}W_k(t)\). Replace price returns \(Q_k(n)\) by \(\mathrm {d}\ln S_k(t)\). Recall that \(S_k(t)\) is the average market capitalization of the kth portfolio at time t. Replace equity premia \(P_k(n)\) with \(\mathrm {d}\ln V_k(t)\), where \(V_k(t)\) is the wealth (including reinvested dividends) of the kth decile, starting from \(V_k(0) = 1\), divided by wealth similarly computed from risk-free returns. Set three functions: \(\alpha , \beta , \sigma \) of \(C_k(t)\) (alpha, beta, and standard deviation of unsystematic risk). Set a separate function \(\alpha _*\) of \(C_k(t)\) for equity premia instead of price returns. We shall not set separate functions for \(\beta \) and \(\sigma \) since from our data analysis we found that these functions are the same for price returns and equity premia. We have n portfolios indexed by \(1, \ldots , n\), and the benchmark indexed with 0 (that is, total \(n+1\) portfolios). We write separate functions for price returns and equity premia (which we denote by asterisks). Consider the following system of SDE:

$$\begin{aligned} \mathrm {d}\ln S_k(t)= & {} \alpha (C_k(t))\,\mathrm {d}t + \beta (C_k(t))\,\mathrm {d}\ln S_0(t) + \sigma (C_k(t))\,\mathrm {d}W_k(t), \end{aligned}$$
(15)
$$\begin{aligned} \mathrm {d}\ln V_k(t)= & {} \alpha _*(C_k(t))\,\mathrm {d}t + \beta (C_k(t))\,\mathrm {d}\ln V_0(t) + \sigma (C_k(t))\,\mathrm {d}W_k(t), \end{aligned}$$
(16)

where \(C_k(t)\) is the relative size measure defined in the Introduction:

$$\begin{aligned} C_k(t) = \ln \frac{S_0(t)}{S_k(t)}. \end{aligned}$$
(17)

We also model \((S_0, V_0)\) using two-dimensional geometric Brownian motion: \((\ln S_0, \ln V_0)\) has drift vector \((g_S, g_V)\) and covariance matrix

$$\begin{aligned} \varvec{\Sigma }_{(S, V)} = \begin{bmatrix} \sigma _S^2 &{}\quad \rho _0\sigma _S\sigma _V\\ \rho _0\sigma _S\sigma _V &{}\quad \sigma _V^2 \end{bmatrix} \end{aligned}$$

That is, for \(t > s\) we have: \(\ln S_0(t) - \ln S_0(s) \sim \mathcal N(g_S(t-s), \sigma _S^2(t-s))\) and \(\ln V_0(t) - \ln V_0(s) \sim {\mathcal {N}}(g_V(t-s), \sigma ^2_V(t-s))\); here, \(\rho _0\) is the correlation between these two increments. Thus

$$\begin{aligned} \begin{aligned} \ln S_0(t)&= \ln S_0(0) + g_St + \sigma _SW_S(t),\\ \ln V_0(t)&= \ln V_0(0) + g_Vt + \sigma _VW_V(t), \end{aligned} \end{aligned}$$
(18)

where \(W_S\) and \(W_V\) are (correlated) standard Brownian motions. We assume \(W_1, \ldots , W_n\) are independent of \((B_S, B_V)\). Note that \(W_1, \ldots , W_n\) can be dependent of each other standard Brownian motions. We assume that \(W = (W_1, \ldots , W_n)\) is an n-dimensional Brownian motion with zero drift vector and covariance matrix \(\varvec{\Sigma }_W\) with units on the main diagonal.

Definition 1

For functions \(\alpha , \beta , \sigma : {\mathbb {R}} \rightarrow {\mathbb {R}}\), real numbers \(g_S, g_V\), \(2\times 2\) covariance matrix \(\varvec{\Sigma }_{(S, V)}\), and \(n\times n\) correlation matrix \(\varvec{\Sigma }_W\), the system of Eqs. (15), (16), (17), (18) is called a CAPM-size market model of \(N+1\) portfolios, indexed by \(0, \ldots , N\). The process \(S_k\) is called market size, or market cap, of the kth portfolio; and the process \(V_k\) is called the wealth process for this kth portfolio. The portfolio indexed by \(k = 0\) is called the benchmark. The functions \(\alpha \) and \(\beta \) are called by their Greek names. The function \(\sigma \) is called the standard error (of the diversifiable risk).

3.2 Existence and uniqueness

We can rewrite (15) using (17):

$$\begin{aligned} \mathrm {d}C_k(t) = -\alpha (C_k(t))\,\mathrm {d}t + (1 - \beta (C_k(t))\,\mathrm {d}\ln S_0(t) + \sigma (C_k(t))\,\mathrm {d}W_k(t). \end{aligned}$$
(19)

Remark 1

Each kth Eq. (15) is independent of other equations: Does not contain \(S_l\) for other \(l = 1, \ldots , n\). It depends only on \(S_k\) and \(S_0\).

Definition 2

Define the explosion time as follows:

$$\begin{aligned} {\mathcal {T}} := {\mathcal {T}}_1\wedge \ldots \wedge {\mathcal {T}}_n, \end{aligned}$$
(20)

where \({\mathcal {T}}_k\) is the explosion time for (15), or, equivalently, for (19).

On \([0, {\mathcal {T}})\), Eqs. (15) and (16) have a unique solution.

Theorem 1

  1. (a)

    If \(\alpha , \beta , \sigma , \alpha _* : {\mathbb {R}} \rightarrow {\mathbb {R}}\) are measurable and locally bounded, there exists a unique strong solution to the system of SDE (15), (16), (17), until the explosion time \({\mathcal {T}}\).

  2. (b)

    If, in addition, we have the following linear bound:

    $$\begin{aligned} |\alpha (c)| + |\beta (c)| + |\sigma (c)| + |\alpha _*(c)| \le K(1 + |c|), \end{aligned}$$

    then the explosion time is infinite: \({\mathcal {T}} = \infty \).

Proof

(a) First, let us show existence and uniqueness in (19). The diffusion part in this equation is given by \({\tilde{\sigma }}(C_k(t))\,\mathrm {d}{\tilde{W}}_k(t)\), where

$$\begin{aligned} {\tilde{\sigma }}^2(c) := \sigma ^2_S(1-\beta (c))^2 + \sigma ^2(c), \end{aligned}$$

and \({\tilde{W}}_k\) is a standard Brownian motion which is a combination of \(W_S\) and \(W_k\). The drift part is given by \({\tilde{\gamma }}(C_k(t))\,\mathrm {d}t\), where

$$\begin{aligned} {\tilde{\gamma }}(c) := -\alpha (c) + g_S(1 - \beta (c)). \end{aligned}$$
(21)

Thus we can rewrite (19) as follows:

$$\begin{aligned} \mathrm {d}C_k(t) = {\tilde{\gamma }}(C_k(t))\,\mathrm {d}t + {\tilde{\sigma }}(C_k(t))\,\mathrm {d}{\tilde{W}}_k(t). \end{aligned}$$
(22)

These functions \({\tilde{\gamma }}\) and \({\tilde{\sigma }}\) are measurable, and we apply Zvonkin (1974) to prove strong existence and pathwise uniqueness of the solution \(C_k\) to this SDE, at least until the explosion time \({\mathcal {T}}_k\). Here we use Remark 1: the system of n equations \(C_1, \ldots , C_n\) consists of n independent one-dimensional SDE. Next, we can reconstruct \(S_1, \ldots , S_n\) from \(C_1, \ldots , C_n\) and \(S_0\): \(S_k(t) = S_0(t)e^{-C_k(t)}\). Since \(S_0\) is well-defined for infinite time horizon (as geometric Brownian motion), the strong solution \(S_k\) for (15) exists and is pathwise unique, too, at least until the explosion time \({\mathcal {T}}_k\). Finally, strong existence and pathwise uniqueness for (16) can be shown as follows. Let

$$\begin{aligned} {\mathcal {T}}_{k, m} := \inf \{t \ge 0\mid |C_k(t)| = m\},\, k = 1, \ldots , n;\ m = 1, 2, \ldots \end{aligned}$$

Since \(\alpha _*, \beta , \sigma \) are locally bounded, they are bounded on \([-m, m]\). We can rewrite (16) as

$$\begin{aligned} \mathrm {d}\ln V_k(t) = {\overline{\gamma }}(C_k(t))\,\mathrm {d}t + {\overline{\sigma }}(C_k(t))\,\mathrm {d}{\overline{W}}_k(t), \end{aligned}$$
(23)

with \({\overline{W}}_k\) a standard Brownian motion, and

$$\begin{aligned} {\overline{\gamma }}(c) := \alpha _*(c) + g_V\beta (c),\quad {\overline{\sigma }}^2(c) := \sigma ^2_V\beta ^2(c) + \sigma ^2(c). \end{aligned}$$
(24)

These two functions from (24) are bounded on \([-m, m]\). Rewriting (23) as

$$\begin{aligned} \ln V_k(t) = \int _0^t {\overline{\gamma }}(C_k(s))\,\mathrm {d}s + \int _0^t{\overline{\sigma }}(C_k(s))\,\mathrm {d}{\overline{W}}_k(s), \end{aligned}$$

we see that (strong) solution until \(t < {\mathcal {T}}_{k, m}\) is well-defined and pathwise unique, because the integrals (both usual and stochastic) are finite. As \(m \rightarrow \infty \), we have: \(\mathcal T_{k, m} \uparrow {\mathcal {T}}_k\) almost surely. Thus (16) have a pathwise unique strong solution until \({\mathcal {T}}_k\). Combining this with (20), we complete the proof of (a).

Next, (b) follows from (19) and (23) and estimates of linear growth for \({\tilde{\gamma }}\) and \({\tilde{\sigma }}\):

$$\begin{aligned} |{\tilde{\gamma }}(c)| + |{\tilde{\sigma }}(c)| \le K_*(1 + |c|), \end{aligned}$$

following from similar estimates for \(\alpha , \alpha _*, \beta , \sigma \). \(\square \)

4 Main results

4.1 Long-term stability results

Recall the definition of market weights.

Definition 3

For the market model in (15), we define market weights as follows:

$$\begin{aligned} \mu _i(t) = \frac{S_i(t)}{S_0(t) + \cdots + S_n(t)},\quad i = 0, \ldots , n,\, t \ge 0. \end{aligned}$$

These market weights sum up to 1 for every \(t \ge 0\), and are in one-to-one correspondence with \(C_1(t), \ldots , C_n(t)\): There exists a bijection \(\Phi : \triangle _n \rightarrow {\mathbb {R}}^n\) such that

$$\begin{aligned} \Phi : (m_0, \ldots , m_n) \mapsto \left( \ln \frac{m_0}{m_1}, \ldots , \ln \frac{m_0}{m_n}\right) . \end{aligned}$$

The process \({\mathbf {C}} = (C_1, \ldots , C_n)\), as its each component, is a Markov process. The same is true for the market weight vector \(\mu = (\mu _0, \ldots , \mu _n)\).

Definition 4

We call a probability measure \(\pi _{\mu }\) on \(\triangle _n\) a stationary distribution if \(\mu (0) \sim \pi _{\mu }\) implies \(\mu (t) \sim \pi _{\mu }\) for all \(t \ge 0\). The market is called stable if this market weight vector has a unique stationary distribution \(\pi _{\mu }\), and when we start from another initial distribution \(\mu (0)\), then the distribution of \(\mu (t)\) converges to \(\pi _{\mu }\) as \(t \rightarrow \infty \) in the total variation (TV) norm:

$$\begin{aligned} \sup \limits _{D \subseteq \triangle _n}|{\mathbb {P}}(\mu (t) \in D\mid \mu (0) = x) - \pi _{\mu }(D)| \rightarrow 0\quad \text{ as }\quad t \rightarrow \infty . \end{aligned}$$

We can similarly (and equivalently) define stability for the process \({\mathbf {C}} = (C_1, \ldots , C_n)\), which is \({\mathbb {R}}^n\)-valued. These definitions of stability are equivalent, since there is a one-to-one continuous mapping between C and \(\mu \).

Theorem 2

Under assumptions of Theorem 1, suppose

$$\begin{aligned} \varliminf \limits _{c \rightarrow \infty }\left[ \alpha (c) + g_S(\beta (c) - 1)\right] > 0\quad \text{ and }\quad \varlimsup \limits _{c \rightarrow -\infty }\left[ \alpha (c) + g_S(\beta (c) - 1)\right] < 0. \end{aligned}$$
(25)

Then the market system is stable.

Proof

We apply the method of Lyapunov functions. As a Lyapunov function, take the following infinitely differentiable function \(V : {\mathbb {R}} \rightarrow [0, \infty )\):

$$\begin{aligned} V(c) := {\left\{ \begin{array}{ll} |c|, &{}\quad c \ge 2;\\ 0, &{}\quad c \le 1.\end{array}\right. } \end{aligned}$$

This function can be constructed by smoothing kernel and convolution. The generator for each \(C_k\) in (15) is given by

$$\begin{aligned} {\mathcal {L}}{\mathcal {f}}(c) := {\tilde{\gamma }}(c)f'(c) + \frac{1}{2}{\tilde{\sigma }}^2(c)f''(c). \end{aligned}$$

Recall (21): \(\varlimsup \limits _{c \rightarrow \infty }{\tilde{\gamma }}(c) < 0\) and \(\varliminf \limits _{c \rightarrow -\infty }{\tilde{\gamma }}(c) > 0\). Combining this with (25), we get:

$$\begin{aligned} \varlimsup \limits _{|c| \rightarrow \infty }{\mathcal {L}}{{V}}(c) < 0. \end{aligned}$$

From the articles Meyn and Tweedie (1993a, 1993b), we get tightness of each \(C_k\). That is, \(\sup _{t \ge 0}{\mathbb {P}}(|C_k(t)| \ge c) \rightarrow 0\) as \(c \rightarrow \infty \). The same is true for the vector \({\mathbf {C}} = (C_1, \ldots , C_n)\). Combining this observation with the following property: for each i, \({\mathbb {P}}(a< C_i(t) < b\mid C_i(0) = x) > 0,\quad x, a, b \in {\mathbb {R}}\), we complete the proof. \(\square \)

Remark 2

We also have convergence

$$\begin{aligned}&\sup \limits _{g \in {\mathcal {G}}}|{\mathbb {E}}[g(C_k(t))] - (\pi _C, g)| \rightarrow 0,\, t \rightarrow \infty ,\\&{\mathcal {G}} := \{g : {\mathbb {R}} \rightarrow {\mathbb {R}}\mid \sup \limits _{z \in {\mathbb {R}}}\frac{|g(z)|}{1 + |z|} < \infty \}. \end{aligned}$$

In particular, we have \({\mathbb {E}}[C_k(t)] \rightarrow m_C\), where \(m_C\) is the mean of the distribution \(\pi _C\).

Example 1

If \(\beta (c) = 1 + \gamma c\), \(\alpha (c) = \mu c\), then \(\alpha (c) + g_S(\beta (c) - 1) = (\mu + g_S\gamma )c\), and (25) is equivalent to

$$\begin{aligned} \varGamma := \mu + g_S\gamma > 0. \end{aligned}$$
(26)

Example 2

From the data analysis in Sect. 2, let

$$\begin{aligned} \alpha (c) := {\left\{ \begin{array}{ll} \alpha _+|c|^{\gamma _+},&{}\quad c \ge 0;\\ -\alpha _-|c|^{\gamma _-},&{}\quad c \le 0; \end{array}\right. } \ \beta (c) := 1 + {\left\{ \begin{array}{ll} \beta _+c,&{}\quad c \ge 0;\\ \beta _-c,&{}\quad c \le 0; \end{array}\right. } \ \sigma (c) := {\left\{ \begin{array}{ll} \sigma _+|c|^{\gamma _+},&{}\quad c \ge 0;\\ \sigma _-|c|^{\gamma _-},&{}\quad c \le 0. \end{array}\right. } \end{aligned}$$

Here, \(\alpha _{\pm }, \beta _{\pm }, \gamma _{\pm }, \sigma _{\pm } > 0\). Let us find conditions for (25): For the first condition, if \(\gamma _+ < 1\), we have \(g_S\beta _+ > 0\); if \(\gamma _+ = 1\), we have \(\alpha _+ + g_S\beta _+ > 0\); if \(\gamma _+ > 1\), we have \(\alpha _+ > 0\). Similarly for the second condition in (25). The actual estimates from Sect. 2 satisfy these conditions.

4.2 Hitting times

We wish to allow \(S_i\) and \(S_0\) to exchange ranks. That is, we want to allow for \(S_i(0) < S_0(0)\) but \(S_i(t) > S_0(t)\) for some \(t > 0\), or vice versa. This is consistent with real world market behavior, when portfolios exchange ranks based on size. In terms of relative size measures \(C_i\), we wish that \(C_i\) can move from positive half-line to negative half-line. In particular, it must hit zero with positive probability.

This is not true for Example 1 if \(\sigma (c) = \rho c\) for \(\rho > 0\). Indeed, then \(C_i\) is a geometric Brownian motion with drift \(-\varGamma \) and thus converges to 0 almost surely as \(t \rightarrow \infty \). Thus the limiting stationary distribution is the delta measure at the origin, \(\delta _{(0, \ldots , 0)}\). The corresponding limiting distribution for \(\mu \), the market weight vector is \(\delta _{(1/(n+1), \ldots , 1/(n+1))}\). If \(\sigma (c)\) is bounded away from zero, this changes the behavior of \({\mathbf {C}}\).

Theorem 3

Under conditions of Theorem 2, assume

$$\begin{aligned} \sigma (c) \ge \sigma _* > 0\quad \text{ for } \text{ all }\quad c \in {\mathbb {R}}. \end{aligned}$$
(27)
  1. (a)

    For every \(c \in {\mathbb {R}}\) with positive probability there exists a \(t > 0\) such that \(C_i(t) = c\).

  2. (b)

    The stationary distribution \(\pi _C\) for \({\mathbf {C}} = (C_1, \ldots , C_n)\) has support on \({\mathbb {R}}^n\). The stationary distribution \(\pi _{\mu }\) for \(\mu \) has support on \(\Delta _n\).

Proof

Return again to the Eq. (22), which is a simplified Eq. (19).

  1. (a)

    Compute the scale function s for the diffusion \(C_k\): Its derivative is

    $$\begin{aligned} s'(c) = \exp \left[ -2\int _0^c\frac{{\tilde{\gamma }}(u)}{{\tilde{\sigma }}^2(u)}\,\mathrm {d}u\right] . \end{aligned}$$

    There exist \(\gamma _* > 0\) and \(c_* > 0\) such that

    $$\begin{aligned} {\tilde{\gamma }}(u) \le -\gamma _*,\, c \ge c_*;\quad {\tilde{\gamma }}(u) \ge \gamma _*,\, c \le -c_*. \end{aligned}$$

    Moreover, \({\tilde{\sigma }}(u) \ge \sigma (u) \ge \sigma _*\) for all \(u \in {\mathbb {R}}\). Thus for \(c \ge c_*\),

    $$\begin{aligned} s'(c) \ge s'(c_*)\exp \left[ 2\int _{c_*}^c\frac{\gamma _*}{\sigma _*^2}\,\mathrm {d}c\right] = s'(c_*)\exp \left[ 2(c-c_*)\frac{\gamma _*}{\sigma _*^2}\right] . \end{aligned}$$
    (28)

    A similar estimate is true for \(c \le -c_*\):

    $$\begin{aligned} s'(c) \le s'(-c_*)\exp \left[ 2(|c|-c_*)\frac{\gamma _*}{\sigma _*^2}\right] . \end{aligned}$$
    (29)

    The speed measure has bounded Lebesgue density \(1/(s'(c)\sigma ^2(c))\), as shown in (28), (29), (27). Apply Feller’s test from Karatzas and Shreve (1998), Chapter 5, Section 5 to complete the proof.

  2. (b)

    The statement for \(\pi _C\) follows from ellipticity of the elliptic partial differential equation governing the Lebesgue density of this stationary distribution. The statement for \(\pi _{\mu }\) follows from one-to-one mapping between \({\mathbf {C}}\) and \(\mu \).

\(\square \)

It seems to us that (27) is a reasonable assumption, since we would want to allow for exchange of ranks of portfolios. This does not contradict our statistical analysis, since we can observe only \(C_i(t) > c_+\) or \(C_i(t) < -c_-\). Our suggested functions \(\sigma (c)\) do give \(\sigma (0) = 0\) if we extend them to \([-c_-, c_+]\) as is. But we could not observe \(C_i(t)\) in a neighborhood of zero, thus we can extend it as a piecewise function.

Under assumptions of Theorem 3, the stationary distribution for each \(C_i\) has (after normalization) density as above: \(1/(s'\sigma ^2)\), which is supported on the whole real line but is bounded. This is different from the case \(\sigma (c) = \sigma _0c\) discussed above in Example 1, when the stationary distribution is concentrated at one point, but the components of the stationary distribution for the overall vector C are not independent since the SDE for individual relative size measures are dependent.

4.3 Lack of propagation of chaos

For interacting particle systems, sometimes dependence (as a process itself, or stationary distribution) vanishes as the number of particles tends to infinity. This phenomenon is called propagation of chaos, since the system becomes less interdependent and more chaotic. Such results were shown for competing Brownian particles and volatility-stabilized models (see citations in the Introduction). But it is unreasonable to expect this for the current system since all particles are dependent. The limiting density, if it exists, will likely be a solution to a stochastic partial differential equation. To derive this large system limit is left for future research.

5 Capital distribution curve

5.1 Modified plots

Let us study the capital distribution curve \((\ln k, \ln \mu _{(k)}(t))\) in this model. We solve the system of stochastic differential equations explicitly. We use the values of \(C_k(t)\) to plot the capital distribution curve:

$$\begin{aligned} C_k(t) = \ln \frac{S_0(t)}{S_k(t)} = \ln \frac{S_0(t)}{S(t)} - \ln \mu _k(t). \end{aligned}$$

Thus the ranking of \(C_k(t)\) reverts the ranking of ranked market weights:

$$\begin{aligned} 0 \le C_{(1)}(t) \le \cdots \le C_{(n)}(t). \end{aligned}$$

Thus we can plot the modified curve \((\ln k, C_{(k)}(t))\). If this curve is linear, the same can be said for the original capital distribution curve. Now we shall study the system of stochastic differential equations (19) and plot \((\ln k, C_{(k)}(t))\) for fixed t.

5.2 Degenerate case

Even if the system is stable, under conditions of Theorem 2, capital distribution curve can be degenerate, equal to one point: \((1/(n+1), \ldots , 1/(n+1))\). This is the case when \(C_k(t) \rightarrow 0\) a.s. as \(t \rightarrow \infty \) for each \(k = 1, \ldots , n\). Indeed, in this case \(\mu _k(t) \rightarrow 1/(n+1)\) a.s. as \(t \rightarrow \infty \). In particular, this is true in Example 1 with \(\sigma (c) = \rho c\) for \(\rho > 0\). Below, we consider the case when the capital distribution curve is not trivial.

5.3 Linear case

This is the case when

$$\begin{aligned} \alpha (c) = \mu c,\quad \beta (c) = 1 + \gamma c,\quad \sigma (c) = \rho . \end{aligned}$$
(30)

Then the system (19) is linear:

$$\begin{aligned} \mathrm {d}C_k(t) = -\mu C_k(t)\,\mathrm {d}t - \gamma C_k(t)(g_S\,\mathrm {d}t + \sigma _S\,\mathrm {d}W_S(t)) + \rho \,\mathrm {d}W_k(t). \end{aligned}$$

As shown in Karatzas and Shreve (1998), Chapter 5, Section 6.C we can solve this system explicitly:

$$\begin{aligned} C_k(t)&= Z(t)\left[ C_k(0) - \int _0^tZ^{-1}(u)(\rho \,\mathrm {d}W_k(u) - \gamma \sigma _S\rho \,\mathrm {d}u)\right] ;\\ Z(t)&= \exp \left[ -(\mu + \gamma g_S)t - \gamma \sigma _S\,W_S(t) - \frac{1}{2}\gamma ^2\sigma _S^2t\right] . \end{aligned}$$

Below, we show the simulation results for \(n = 100\), with initial conditions \(C_k(0) = 0\), \(k = 1, \ldots , n\). We take estimated values \(\mu = 0.0069\), \(\gamma = 0.0045\). For \(\rho \), we take the value 0.1, which is consistent with estimates. Finally, estimates for mean \(g_S\) and standard deviation \(\sigma _S\) of monthly price returns for the benchmark (top decile) are given by \(g_S = 0.0044\) and \(\sigma _S = 0.0541\). We simulate until \(t = 100\), assuming all \(W_1, \ldots , W_n, W_S\) are independent Brownian motions.

6 Conclusions

We developed a model in this article which can be viewed as an enhancement of the Capital Asset Pricing Model, which stresses dependence of stock portfolios upon the overall market. The portfolios are based on size (market cap), and the quantities \(\alpha \), \(\beta \), and standard error \(\sigma \) (of diversifiable idiosyncratic risk) depends on size (more precisely, size of portfolio relative to the size of benchmark). Thus we wrote a system of stochastic differential equations.

We write separately systems of equations for equity premia (total returns, including dividends, minus risk-free returns), and for market size (that is, price returns). They are very similar and the idiosyncratic risk can be taken the same. The equity premium and price returns of the benchmark are driven by different random processes (although correlated) (Fig. 4).

Fig. 4
figure 4

Capital distribution curve \((\ln k, C_{(k)})\)

Using CRSP 1926–2020 monthly data of size deciles, we find reasonable guesses for \(\alpha \), \(\beta \), \(\sigma \) as functions of relative size. Our statistical analysis is not fully rigorous, because it fails white noise tests. However, we do find some reasonable results.

On the theoretical side, we prove long-term stability results: under some conditions on \(\alpha , \beta , \sigma \), the vector of relative size measures converges as \(t \rightarrow \infty \) to a stationary distribution.

Finally, an important feature of real-world markets: stability and linearity of the capital distribution curve, is reproduced in our model by numerical simulation.

Future research can include making more sophisticated time series models which take into account autocorrelations, or non-Gaussian fluctuations of the market. It seems important to develop the SPT for the case of dividends, when price and total returns (and by extension market capitalizations and wealth processes) are different. Finally, we would like to derive a large system limit.