1 The Capital Asset Pricing Model

One of the most important results of the previous chapter is the last line in Eq. 27.31. It can be written in the form

$$\displaystyle \begin{aligned} \widehat{\mathbf{R}}={\boldsymbol\beta}_{m}\widehat{R}_{m}\;.{} \end{aligned} $$
(28.1)

Based on this equation, the expected excess returns of all assets are explained solely by their betas with respect to the optimal portfolio and the return of the optimal portfolio itself. This is very remarkable. Within the framework of classical Mean-Variance portfolio theory there is not much freedom for the individual assets. The optimal portfolio drives everything. The question now remains, what this optimal portfolio exactly is. The following argument leads to an identificationFootnote 1:

If all investors have the same information on all assets (in particular concerning expected returns, variances and covariances) and if all investors behave optimally (in the sense that they invest in such a way that their mean/variance-ratio is maximal), then every investment will lie on the capital market line. If Inequality 26.59 holds, than each investor will invest in a mixture of the optimal portfolio V m and the risk free money market account. Investments with a variance lower than σ m are long in the risk free investment while investments with a variance higher than σ m are leveraged, i.e., they borrow from the money market. Adding up all investments of all market participants, the total amount borrowed must of course equal the total amount of lending, i.e., the grand total money market position of all market participants must equal zero. Therefore, the grand total of all investments of all market participants is the optimal portfolio. On the other hand, the grand total of all investments of all market participants is “the market” itself. Thus, the optimal portfolio is the market itself, which is why it is also called the market portfolio .

One can also look at this from a different angle: you can always do a linear regression for the (historical) asset excess returns \(\widehat {r}_{i}\) on (historical) market excess returns, i.e.

$$\displaystyle \begin{aligned} \widehat{r}_{i}=\alpha_{i,m}+\beta_{i,m}\widehat{r}_{m}+\epsilon_{i} \ \text{for } i=1,\ldots,M\;,{} \end{aligned} $$
(28.2)

where the 𝜖 i are the error terms which by definition have zero expectation and are uncorrelated with \(\widehat {r}_{m}\). Calculating expected values and comparing this with Eq. 28.1, we immediately see that the CAPM requires the alpha of each asset to be zero. In the CAPM, no additional return is expected for any asset above that produced by the asset’s beta and the market!

The same holds for any portfolio of assets, since in the regression everything is linear.

$$\displaystyle \begin{aligned} \underset{\widehat{r}_{V}}{\underbrace{\sum_{i}w_{i}\widehat{r}_{i}}} & =\underset{\alpha_{V,m}}{\underbrace{\sum_{i}w_{i}\alpha_{i,m}}} +\underset{\beta_{V,m}}{\underbrace{\sum_{i}w_{i}\beta_{i,m}}}\widehat{r} _{m}+\underset{\epsilon_{V}}{\underbrace{\sum_{i}w_{i}\epsilon_{i}}} \;,\\ \widehat{r}_{V} & =\alpha_{V,m}+\beta_{V,m}\widehat{r}_{m}+\epsilon_{V}\;.{} \end{aligned} $$
(28.3)

Since the α i,m should all be zero, so must α V,m. Thus, in the CAPM, the alpha of any portfolio is zero. No portfolio can beat the market in the long run. This is not a new insight really, but in fact trivial: We constructed the optimal portfolio to have the best possible mean/variance ratio. Then we argued that the optimal portfolio should have the same position weights as the market. Thus, the market has the best possible mean/variance-ratio. Therefore, no other portfolio has a better mean/variance ratio because if it did we would have taken this portfolio as the optimal portfolio.

The “market” contains everything one could possibly invest in: equity, debt, commodities, real estate, etc., even art. Most often, this is not a feasible concept for practical situations, so instead, “the market” is divided into several (sub-)markets like the German Bond Market, the US Mid Cap Market or the Japanese Real Estate Market, to name a few of many hundreds or even thousands. But even these smaller markets are usually too large to handle, so Indices are defined as representatives of those markets. Examples of such Indices are the S&P 500, the German DAX, the Japanese Nikkei, etc. They contain a limited number of assets (with weights defined and calculated by the respective index provider) and serve as a proxy of the market they are supposed to represent. Indices are often used as benchmarks to measure the performance of a portfolio. In the following we will use the subscript B for the benchmark. The benchmark portfolio V B is supposed to be a proxy for the market portfolio V m (of whatever market we are interested in), but it is usually not identical to the market portfolio. In the following, we will carefully distinguish between benchmark and market portfolio properties using subscripts B and m, respectively.

2 Theory of Efficient Markets

At the heart of active portfolio management lies the hope of a portfolio manager that he can outperform the market, despite the fact that the CAPM says this is impossible. That does not necessarily mean, however, that the portfolio manager does not believe in the CAPM. He rather doubts the assumptions of the CAPM, in particular the one that everybody has the same information. The information edge the manager may believe to have on some assets causes him to cast asset return expectations that differ from the “consensus returns” of Eq. 28.1. This then results in different position weights, even if the manager uses the same mean/variance optimization as in the CAPM. If his return estimates turn out right he will indeed outperform the market. However, efficient market theory (at least in its strong form) states that this is still not possible in the long run. Efficient market theory comes in three varieties:

  • The weak efficient market theory states: Investors cannot outperform the market using historical price and volume data, only.

  • The semi strong efficient market theory states: Investors cannot outperform the market using publicly available information, only (i.e., historical price and volume data, fundamental data, recommendations published by analysts, etc.)

  • The strong efficient market theory states: Investors can never outperform the market. Market prices contain all relevant information.

A couple of observations support the view that at most the weak efficient market theory is in fact valid in the real world. For example, the long-term success of some value investors apparently excludes the semi-strong and strong theory. Also, not all market participants have equal access to all available informations, otherwise insider trading and manipulation of market indices (like in the Libor scandal) wouldn’t be possible. In addition, we know from behavioural finance that even professional asset managers oft act quite irrational. Finally, institutional investors are bound to obey regulatory requirements, which sometimes stands in the way of rational decisions.

3 Benchmarking Against an Index

Nonetheless the main goal of so-called active asset management is “to beat the market”, i.e., to gain a positive alpha against a benchmark (in most cases an index) which represents the market to be beaten. To assess how well an asset has been doing compared to the benchmark one can do a regression of the historical asset returns against the historical benchmark returns as in Eq. 28.2.

$$\displaystyle \begin{aligned} \widehat{r}_{i}=\alpha_{i,B}+\beta_{i,B}\widehat{r}_{B}+\epsilon_{i} \ \text{for } i=1,\ldots,M\;. \end{aligned}$$

Taking the expectation (e.g., as a historical average) we get the relation between asset and benchmark returns that serves as the definition of α B, the vector of the assets alphas with respect to the benchmark:

$$\displaystyle \begin{aligned} \widehat{\mathbf{R}}={\boldsymbol\alpha}_{B} +{\boldsymbol\beta}_{B}\widehat{R}_{B}\;.{} \end{aligned} $$
(28.4)

If the benchmark perfectly represented the market and if the CAPM were perfectly true, this would collapse to Eq. 28.1, i.e., the assets alphas would all vanish.

The asset betas with respect to the Benchmark are given by the matrix C and the position weights w B of the benchmark portfolio as in Eq. 26.23.

$$\displaystyle \begin{aligned} {\boldsymbol\beta}_{B}=\frac{1}{\sigma_{B}^{2}}\mathbf{Cw}_{B}=\frac {\mathbf{Cw}_{B}}{{\mathbf{w}}_{B}^{T}\mathbf{Cw}_{B}}\;.{} \end{aligned} $$
(28.5)

Equations 28.4 and 28.5 directly implyFootnote 2 that the following combination of α B, β B and C is be zero:

$$\displaystyle \begin{aligned} {\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}=0\;.{} \end{aligned} $$
(28.6)

The same considerations that lead us to Eq. 28.4 also work for the portfolio’s expected excess return

$$\displaystyle \begin{aligned} \widehat{R}_{V}={\mathbf{w}}^{T}\widehat{\mathbf{R}}=\underset{\alpha_{V,B} }{\underbrace{{\mathbf{w}}^{T}{\boldsymbol\alpha}_{B}}}+\underset{\beta_{V,B} }{\underbrace{{\mathbf{w}}^{T}{\boldsymbol\beta}_{B}}}\widehat{R}_{B}\;.{} \end{aligned} $$
(28.7)

Here, the portfolio betaFootnote 3β V,B with respect to the benchmark follows from the regression analysis in the usual wayFootnote 4 given by Eq. A.20:

$$\displaystyle \begin{aligned} \beta_{V,B}\equiv\frac{\operatorname*{cov}(r_{V},r_{B})}{\operatorname*{Var} (r_{B})}=\frac{\operatorname*{cov}(\widehat{r}_{V}, \widehat{r}_{B})}{\operatorname*{Var}(\widehat{r}_{B})}\;.{} \end{aligned} $$
(28.8)

The portfolio alpha α V,B can also be determined by such an regression of the historical portfolio excess returns \(\widehat {r}_{V}\) against the historical benchmark excess returns \(\widehat {r}_{B}\). In this way one gets an ex post estimation of how the portfolio did in the past. The historical portfolio returns are of course not only a result of market movements but also of trading, i.e., position changes done by the portfolio manager. Therefore, for the estimation of the impact of portfolio changes, it is important to get ex ante estimates for alpha and beta over the next holding period of the current portfolio with its current holdings. To get these, we take the historical information of the assets and the current position weights in the portfolio to produce ex ante estimates of portfolio properties, like for instance alpha and beta. This can be done by using the historical alphas and betas of the assets with respect to the benchmark together with the current weights of those assets in the portfolio:

$$\displaystyle \begin{aligned} \alpha_{V,B}=\sum_{i=1}^{M}w_{i}\alpha_{i,B}={\mathbf{w}}^{T}{\boldsymbol\alpha} _{B}\;,\quad \beta_{V,B}=\sum_{i=1}^{M}w_{i}\beta_{i,B} ={\mathbf{w}}^{T}{\boldsymbol\beta}_{B}\;.{} \end{aligned} $$
(28.9)

The historical alphas and betas of the assets are determined via the regressions given in Eq. 28.2. One might also use other estimates for the asset alphas and betas in this ex ante estimation of the portfolio alpha and beta (e.g., an asset manager’s personal opinion).

In the context of active portfolio management, not only alpha and beta but usually all quantities (return, risk, etc.) are defined relative to the benchmark: if the portfolio loses money but the benchmark performs even worse, the portfolio manager is still happy in this relative framework (this is not necessarily true for the investor). This is in stark contrast to the previous sections, where we considered the excess return and the risk of the portfolio itself, an approach sometimes called Total Return Management . Still, all of the above considerations in the Total Return framework can easily be recovered within the benchmark-relative framework by simply defining the benchmark to be the money market account.

There are two different ways commonly used by market participants to define everything relative to the benchmark: using active quantities and residual quantities. Let’s start with the active quantities.

3.1 Active Portfolio Properties

The active position is defined as the deviation of the portfolio weights from the benchmark weights

$$\displaystyle \begin{aligned} \overline{\mathbf{w}}\mathbf{=w}-{\mathbf{w}}_{B}\;.{} \end{aligned} $$
(28.10)

Similarly, the difference between the cash positions of the portfolio and the benchmark is called the active cash . From Eq. 26.64, the active cash is simply

$$\displaystyle \begin{aligned} \text{Active Cash}=\left(1-{\mathbf{w}}^{T}\mathbf{1}\right) -\left(1-{\mathbf{w}}_{B}^{T}\mathbf{1}\right) =-\overline{\mathbf{w}}^{T}\mathbf{1} \end{aligned}$$

The active return is defined as the difference between the portfolio return, Eq. 28.3, and the benchmark return.

$$\displaystyle \begin{aligned} \overline{r}_{V}:=\overline{\mathbf{w}}^{T}\widehat{\mathbf{r}}=\widehat {r}_{V}-\widehat{r}_{B}=\alpha_{V,B}+(\beta_{V,B}-1)\widehat{r}_{B} +\epsilon_{V}\;. \end{aligned}$$

The expected active return is defined as the expected return of the portfolio with weights given by the active position

$$\displaystyle \begin{aligned} \overline{R}_{V} & =\text{E}\left[ \overline{r}_{V}\right] =\overline {\mathbf{w}}^{T}\widehat{\mathbf{R}}\\ & ={\mathbf{w}}^{T}{\boldsymbol\alpha}_{B}+({\mathbf{w}}^{T}{\boldsymbol\beta} _{B}-1)\widehat{R}_{B}\\ & =\alpha_{V,B}+(\beta_{V,B}-1)\widehat{R}_{B}{} \end{aligned} $$
(28.11)

and the active risk is defined in terms of the variance of active returns:

$$\displaystyle \begin{aligned} \overline{\sigma}_{V}^{2}:=\delta t\operatorname*{var}\left[ \overline{r} _{V}\right] =\delta t\operatorname*{var}\left[ \epsilon_{V}\right] +(\beta_{V,B}-1)^{2}\sigma_{B}^{2}\;.{} \end{aligned} $$
(28.12)

The last equation holds since the errors 𝜖 V are by construction uncorrelated all other parameters. The term (β V,B − 1) appearing here is called active beta . The active risk is also called Tracking Error since it is a measure of how well the portfolio tracks the benchmark. We can of course also write the tracking error in the form of Eq. 26.17 as the risk of a portfolio with position weights \(\overline {\mathbf {w}}\):

$$\displaystyle \begin{aligned} \overline{\sigma}_{V}^{2}=\delta t\operatorname*{var}\left[ \overline{r} _{V}\right] =\delta t\operatorname*{var}\left[ \overline{\mathbf{w}} ^{T}\widehat{\mathbf{r}}\right] =\overline{\mathbf{w}}^{T}\mathbf{C} \overline{\mathbf{w}}\;. \end{aligned}$$

The same holds for the expected active return in Eq. 28.11 when compared to the expected portfolio return in Eq. 26.3. Indeed, many concepts introduced in the previous sections carry over to the benchmark-relative framework by simply replacing the weights w by the active weights \(\overline {\mathbf {w}}\). For instance, in analogy to Eq. 26.19 we can define the marginal active risk as

$$\displaystyle \begin{aligned} \frac{\partial\overline{\sigma}_{V}}{\partial\overline{\mathbf{w}}} =\frac{\mathbf{C}\overline{\mathbf{w}}}{\sqrt{\overline{\mathbf{w}} ^{T}\mathbf{C}\overline{\mathbf{w}}}}=\frac{\mathbf{C}\overline{\mathbf{w}} }{\sqrt{\delta t\operatorname*{var}\left[ \epsilon_{V}\right] +(\beta _{V,B}-1)^{2}\sigma_{B}^{2}}} \end{aligned}$$

and attribute the active risk to the individual positions as in Eqs. 26.20 or 26.21.

$$\displaystyle \begin{aligned} \overline{\sigma}_{V}=\overline{\mathbf{w}}^{T}\frac{\partial\overline{\sigma }_{V}}{\partial\overline{\mathbf{w}}}=\sum_{i=1}^{M}\overline{w}_{i} \frac{\partial\overline{\sigma}_{V}}{\partial\overline{w}_{i}}\;. \end{aligned}$$

The amount of active risk attributed to the ith asset is thus

$$\displaystyle \begin{aligned} \overline{w}_{i}\frac{\partial\overline{\sigma}_{V}}{\partial\overline{w}_{i} }=\frac{\overline{w}_{i}^{2}\sum_{k=1}^{M}C_{ki}}{\sqrt{\delta t^{2} \operatorname*{var}\left[ \epsilon_{V}\right] +(\beta_{V,B}-1)^{2}\sigma _{B}^{2}}} \ \ \text{for } i=1,\ldots,M\;. \end{aligned}$$

The percentage of active risk attributed to the ith asset is this number divided by \(\overline {\sigma }_{V}\).

3.2 Residual Portfolio Properties

While the above active quantities focus on the differences between the portfolio and the benchmark itself, the residual quantities focus on difference between portfolio properties and the properties implied by the CAPM. To be specific, the residual (excess) return is defined as the difference between the portfolio excess return and its excess return implied by the CAPM via Eq. 28.1.

$$\displaystyle \begin{aligned} \widetilde{r}_{V}:=\widehat{r}_{V}-\beta_{V,B}\widehat{r}_{B}=\alpha _{V,B}+\epsilon_{V}\;.{} \end{aligned} $$
(28.13)

Correspondingly, the residual risk , is defined as the volatility of the residual return:

$$\displaystyle \begin{aligned} \widetilde{\sigma}_{V}^{2}:=\delta t\operatorname*{var}\left[ \widetilde {r}_{V}\right] =\delta t\operatorname*{var}\left[ \epsilon_{V}\right]\;.{} \end{aligned} $$
(28.14)

According to Eq. 28.4, the expectation of the residual return is equal to the portfolio’s alpha .

$$\displaystyle \begin{aligned} \widetilde{R}_{V}=\text{E}\left[ \widetilde{r}_{V}\right] =\mathbf{w} ^{T}{\boldsymbol\alpha}_{B}=\alpha_{V,B}\;.{} \end{aligned} $$
(28.15)

The Information Ratio is defined as the expected residual (excess) return per residual risk and is therefore the benchmark-relative analogue to the Sharpe Ratio, Eq. 27.22.

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{V}\equiv\frac{\widetilde{R}_{V}}{\widetilde{\sigma}_{V} }=\frac{\alpha_{V,B}}{\sqrt{\delta t\operatorname*{var}\left[ \epsilon _{V}\right]} }\;.{} \end{aligned} $$
(28.16)

The residual position weights producing the above residual return and residual risk are

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}\mathbf{=w}-\beta_{V,B}{\mathbf{w}}_{B}\;.{} \end{aligned} $$
(28.17)

With these weights, we can write

$$\displaystyle \begin{aligned} \widetilde{R}_{V}=\widetilde{\mathbf{w}}^{T}\widehat{\mathbf{R}} \quad \text{and}\quad \widetilde{\sigma}_{V}^{2}=\widetilde{\mathbf{w}}^{T}\mathbf{C} \widetilde{\mathbf{w}}\;.{} \end{aligned} $$
(28.18)

and define the marginal residual risk in analogy to Eq. 26.19 as

$$\displaystyle \begin{aligned} \frac{\partial\widetilde{\sigma}_{V}}{\partial\widetilde{\mathbf{w}}} =\frac{\mathbf{C}\widetilde{\mathbf{w}}}{\sqrt{\widetilde{\mathbf{w}} ^{T}\mathbf{C}\widetilde{\mathbf{w}}}}=\frac{\mathbf{C}\widetilde{\mathbf{w}} }{\sqrt{\delta t\operatorname*{var}\left[ \epsilon_{V}\right]} }\;.{} \end{aligned} $$
(28.19)

In analogy to Eq. 26.20, the residual risk attributed to the ith position is

$$\displaystyle \begin{aligned} \widetilde{w}_{i}\frac{\partial\widetilde{\sigma}_{V}}{\partial\widetilde {w}_{i}}=\frac{\widetilde{w}_{i}^{2}\sum_{k=1}^{M}C_{ki}}{\sqrt{\delta t\operatorname*{var}\left[ \epsilon_{V}\right]} } \quad \text{for}\quad i=1,\ldots,M\;.{} \end{aligned} $$
(28.20)

The percentage \(\widetilde {A}_{i}\) of residual risk attributed to the ith asset is this number divided by \(\widetilde {\sigma }_{V}\).

4 Benchmark and Characteristic Portfolios

We have shown that attributes and characteristic portfolios prove to be very useful tools in the “total return” framework. Motivated by this we will now define some attributes and their characteristic portfolios in the benchmark-relative framework. In Chap. 27 we analyzed the properties (i.e., leverage, return, variance, etc.) of the characteristic portfolio for any arbitrary vector a of asset attributes. We will now add to this list the characteristic portfolio’s residual properties as well as its alpha and beta with respect to the benchmark.

The ex ante estimators for alpha and beta are given by Eq. 28.9 with the vectors of asset alphas and betas as in Eq. 28.4. With the weights of the characteristic portfolio given by Eq. 27.4, the characteristic portfolio’s alpha and beta with respect to the benchmark are

$$\displaystyle \begin{aligned} \alpha_{a,B} & \equiv{\mathbf{w}}_{a}^{T}{\boldsymbol\alpha}_{B}\mathbf{=} \frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}{\mathbf{a} ^{T}{\mathbf{C}}^{-1}\mathbf{a}}\\ \beta_{a,B} & \equiv{\mathbf{w}}_{a}^{T}{\boldsymbol\beta}_{B}\mathbf{=} \frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\mathbf{a} ^{T}{\mathbf{C}}^{-1}\mathbf{a}}\;.{} \end{aligned} $$
(28.21)

Let’s look now at the residual properties of the characteristic portfolio. With Eq. 27.4, the residual weights defined in Eq. 28.17 are

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{a} & \mathbf{=}{\mathbf{w}}_{a}-\beta_{a,B} {\mathbf{w}}_{B}\\ & =\frac{{\mathbf{C}}^{-1}\mathbf{a}}{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}\mathbf{a} }-\frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\mathbf{a} ^{T}{\mathbf{C}}^{-1}\mathbf{a}}{\mathbf{w}}_{B}\\ & =\frac{{\mathbf{C}}^{-1}\mathbf{a}}{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}\mathbf{a} }-\frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\mathbf{a} ^{T}{\mathbf{C}}^{-1}\mathbf{a}}\frac{{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B} }{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}\;,{} \end{aligned} $$
(28.22)

where in the last step we have used the implied benchmark weights which will be introduced below in Eq. 28.28. According to Eq. 28.18, residual return and residual risk are then

$$\displaystyle \begin{aligned} \widetilde{R}_{a} & \equiv\widetilde{\mathbf{w}}^{T}\widehat{\mathbf{R} }=\widehat{R}_{a}-\sigma_{a}^{2}\frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1} {\boldsymbol\beta}_{B}}{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B} }{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}},\\ \widetilde{\sigma}_{a}^{2} & \equiv\widetilde{\mathbf{w}}^{T}\mathbf{C} \widetilde{\mathbf{w}}=\sigma_{a}^{2}\left[ 1-\frac{\left(\mathbf{a} ^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}\right) ^{2}}{\left({\boldsymbol\beta} _{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}\right) \left(\mathbf{a} ^{T}{\mathbf{C}}^{-1}\mathbf{a}\right)} \right]\;,{} \end{aligned} $$
(28.23)

where we have used Eqs. 27.6 and 27.7. Note that the Cauchy-Schwarz inequality, Eq. 26.37, implies that \(\widetilde {\sigma }_{a}^{2}\) is always larger than zero except for those attributes which are multiplesFootnote 5 of β B.

With the above results, Information Ratio and marginal residual risk contributions can easily be calculated from their respective Definitions 28.16 and 28.19 for any characteristic portfolio.

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{a} & =\frac{\widetilde{R}_{a}}{\widetilde{\sigma}_{a} }=\frac{\sigma_{a}^{2}}{\widetilde{\sigma}_{a}}{\mathbf{a}}^{T}\mathbf{C} ^{-1}\widehat{\mathbf{R}}-\frac{\sigma_{a}^{2}}{\widetilde{\sigma}_{a}} \frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{{\boldsymbol\beta}_{B} ^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\boldsymbol\beta}_{B}\mathbf{C} ^{-1}\widehat{\mathbf{R}}\\ \frac{\partial\widetilde{\sigma}_{a}}{\partial\widetilde{\mathbf{w}}} & =\frac{\mathbf{C}\widetilde{\mathbf{w}}}{\widetilde{\sigma}_{a}}=\frac {\sigma_{a}^{2}}{\widetilde{\sigma}_{a}}\mathbf{a-}\frac{\sigma_{a}^{2} }{\widetilde{\sigma}_{a}}\frac{{\mathbf{a}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta} _{B}}{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\boldsymbol\beta} _{B}\;.{} \end{aligned} $$
(28.24)

The residual risk attributions follow directly from definition 28.20.

4.1 The Fully Invested Minimal Risk Portfolio

As a first example, we will apply the above results to the characteristic portfolio for the attribute L=1 introduced in Sect. 27.2, i.e., for the fully invested minimal risk portfolio. Its alpha and beta with respect to the benchmark are

$$\displaystyle \begin{aligned} \alpha_{L,B}\mathbf{=}\frac{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B} }{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\mathbf{1}}\;,\quad \beta_{L,B} \frac{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{\mathbf{1} ^{T}{\mathbf{C}}^{-1}\mathbf{1}}\;. \end{aligned}$$

Its residual weights are

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{L}=\frac{{\mathbf{C}}^{-1}\mathbf{1}}{\mathbf{1} ^{T}{\mathbf{C}}^{-1}\mathbf{1}}-\frac{{\mathbf{1}}^{T}{\mathbf{C}}^{-1} {\boldsymbol\beta}_{B}}{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\mathbf{1}}\frac {{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{{\boldsymbol\beta}_{B}^{T}\mathbf{C} ^{-1}{\boldsymbol\beta}_{B}}\;. \end{aligned}$$

and the portfolio’s residual properties are

$$\displaystyle \begin{aligned} \widetilde{R}_{a} & =\widehat{R}_{L}-\sigma_{L}^{2}\frac{\mathbf{a} ^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{{\boldsymbol\beta}_{B}^{T}\mathbf{C} ^{-1}{\boldsymbol\beta}_{B}}{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}\widehat {\mathbf{R}}\\ \widetilde{\sigma}_{L}^{2} & =\sigma_{L}^{2}\left[ 1-\frac{\left({\mathbf{1}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}\right) ^{2}}{\left({\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}\right) \left({\mathbf{1}}^{T}{\mathbf{C}}^{-1}\mathbf{1}\right)} \right]\;. \end{aligned} $$

Information Ratio and marginal residual risk contributions are

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{L} & =\frac{\sigma_{L}^{2}}{\widetilde{\sigma}_{L} }{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}-\frac{\sigma_{L}^{2} }{\widetilde{\sigma}_{L}}\frac{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta} _{B}}{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}\boldsymbol{\beta} _{B}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}\\ \frac{\partial\widetilde{\sigma}_{a}}{\partial\widetilde{\mathbf{w}}} & =\frac{\sigma_{L}^{2}}{\widetilde{\sigma}_{L}}\mathbf{1-}\frac{\sigma_{L}^{2} }{\widetilde{\sigma}_{L}}\frac{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta} _{B}}{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\beta}_{B}}{ \boldsymbol\beta}_{B}\;. \end{aligned} $$

As usual, the residual risk attributions now follow directly from definition 28.20.

We will postpone the discussion of the other two important characteristic portfolios already introduced above, namely the portfolio for the attribute \(\mathbf {A}=\widehat {\mathbf {R}}\) of Sect. 27.3 and the market portfolio of Sect. 27.4 until we have introduced two new portfolios, namely the characteristic portfolios for alpha and beta.

4.2 The Characteristic Portfolio for Beta

Let the attribute vector b be the beta of the assets with respect to the benchmark as in Eq. 28.4

$$\displaystyle \begin{aligned} b_{i}:=\beta_{i,B}\equiv\frac{\operatorname*{cov}(\widehat{r}_{i},\widehat {r}_{B})}{\operatorname*{var}(\widehat{r}_{B})}=\frac{\delta t}{\sigma_{B} ^{2}}\sum_{k=1}^{M}w_{B,k}\underset{C_{ik}/\delta t}{\underbrace {\operatorname*{cov}(\widehat{r}_{i},\widehat{r}_{k})}} \end{aligned}$$

or in vector notation

$$\displaystyle \begin{aligned} \mathbf{b=\boldsymbol\beta}_{B}=\frac{\mathbf{Cw}_{B}}{{\mathbf{w}}_{B}^{T}\mathbf{Cw}_{B}}{} \end{aligned} $$
(28.25)

According to Eq. 27.1, the exposure of any portfolio V  to this particular attribute is

$$\displaystyle \begin{aligned} b_{V}={\mathbf{w}}^{T}\mathbf{b}=\frac{{\mathbf{w}}^{T}\mathbf{Cw}_{B}} {{\mathbf{w}}_{B}^{T}\mathbf{Cw}_{B}}\;. \end{aligned}$$

Comparing this with the portfolio beta β V,B with respect to the benchmark, i.e., with

$$\displaystyle \begin{aligned} \beta_{V,B} & \equiv\frac{\operatorname*{cov}(\widehat{r}_{V},\widehat {r}_{B})}{\operatorname*{var}(\widehat{r}_{B})}=\frac{\delta t}{\sigma_{B} ^{2}}\sum_{k=1}^{M}w_{B,k}\operatorname*{cov}(\widehat{r}_{V},\widehat{r} _{k})\\ & =\frac{\delta t}{\sigma_{B}^{2}}\sum_{k=1}^{M}w_{B,k}\sum_{i=1}^{M} w_{i}\underset{C_{ik}/\delta t}{\underbrace{\operatorname*{cov}(\widehat {r}_{i},\widehat{r}_{k})}}=\frac{{\mathbf{w}}^{T}\mathbf{Cw}_{B}}{\mathbf{w} _{B}^{T}\mathbf{Cw}_{B}}{} \end{aligned} $$
(28.26)

we find that the exposure of any portfolio V  to the attribute b is equal to the portfolio beta with respect to the benchmark:

$$\displaystyle \begin{aligned} b_{V}={\mathbf{w}}^{T}\mathbf{b}=\beta_{V,B}\;. \end{aligned}$$

Let’s investigate now the characteristic portfolio V b for this particular attribute, i.e., the minimum risk portfolio with unit exposure b V = 1. According to Eqs. 27.4 and 28.25, this portfolio has the following weights

$$\displaystyle \begin{aligned} {\mathbf{w}}_{b}=\frac{{\mathbf{C}}^{-1}\mathbf{b}}{{\mathbf{b}}^{T}\mathbf{C} ^{-1}\mathbf{b}}=\left({\mathbf{w}}_{B}^{T}\mathbf{Cw}_{B}\right) \frac{{\mathbf{C}}^{-1}\left(\mathbf{Cw}_{B}\right)} {\left(\mathbf{w} _{B}^{T}\mathbf{C}\right) {\mathbf{C}}^{-1}\left(\mathbf{Cw}_{B}\right)} ={\mathbf{w}}_{B}\;. \end{aligned}$$

Thus, the characteristic portfolio for the attribute “beta to a benchmark” is the benchmark itself:

$$\displaystyle \begin{aligned} {\mathbf{w}}_{b}\mathbf{=w}_{B}\;.{} \end{aligned} $$
(28.27)

Therefore, among all portfolios with β V,B = 1, the benchmark itself has minimum risk.

Equation 28.27 shows a way how to imply the weights of the individual assets in the benchmark, if those weights are not known to the investors: From historical time series determine the asset betas with respect to the benchmark and the asset’s covariances. Then the implied benchmark weights are given by

$$\displaystyle \begin{aligned} {\mathbf{w}}_{B}=\frac{{\mathbf{C}}^{-1}\boldsymbol{\beta}_{B}}{\boldsymbol{\beta}_{B} ^{T}{\mathbf{C}}^{-1}\boldsymbol{\beta}_{B}} {} \end{aligned} $$
(28.28)

The portfolio risk and return and the asset betas follow from Eqs. 27.6, 27.7 and 27.9 as usual:

$$\displaystyle \begin{aligned} \sigma_{b}^{2}=\frac{1}{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}\boldsymbol{\beta} _{B}}\;,\quad \widehat{R}_{b}=\frac{{\boldsymbol\beta}_{B}^{T} {\mathbf{C}}^{-1}\widehat{\mathbf{R}}}{{\boldsymbol\beta}_{B}^{T}\mathbf{C} ^{-1}{\boldsymbol\beta}_{B}}\;,\quad {\boldsymbol\beta}_{b}={\boldsymbol\beta}_{B}\;.{} \end{aligned} $$
(28.29)

Finally, the marginal risk contribution, Eq. 27.10, of each asset and the percentage of risk attributed to each asset, Eq. 27.11, are:

$$\displaystyle \begin{aligned} \frac{\partial\sigma_{b}}{\partial w_{b,i}}=\sigma_{b}\beta_{i,B}\;,\quad A_{b,i}=w_{b,i}\beta_{i,B}~\quad \text{for } i=1,\ldots,M\;. \end{aligned}$$

According to Eqs. 28.21 and 28.6, the portfolio’s alpha and beta with respect to the benchmark are simply zero and one

$$\displaystyle \begin{aligned} \alpha_{b,B}=0\;,\quad \beta_{b,B}=1\;.{} \end{aligned} $$
(28.30a)

Since the portfolio is the benchmark itself, all above properties also describe the benchmark portfolio V B. Moreover, the residual weights all vanish. This can also be verified explicitly by setting a=β B in Eq. 28.22. Therefore residual risk and return are both zero, and the Information Ratio is undefined.

4.3 The Characteristic Portfolio for Alpha

Let’s now choose an attribute vector a to be the alpha of the assets with respect to the benchmark, i.e., according to Eq. 28.4

$$\displaystyle \begin{aligned} \mathbf{a}:={\boldsymbol\alpha}_{B}=\widehat{\mathbf{R}}-{\boldsymbol\beta} _{B}\widehat{R}_{B}\;.{} \end{aligned} $$
(28.31)

According to Eqs. 27.1 and 28.9, the exposure of any portfolio V  to this particular attribute is the (ex ante estimate for the) portfolio alpha:

$$\displaystyle \begin{aligned} a_{V}={\mathbf{w}}^{T}\mathbf{a}=\sum_{i=1}^{M}w_{i}\alpha_{i,B}=\alpha_{V,B}\;. \end{aligned}$$

The characteristic portfolio V a for this particular attribute is the minimum risk portfolio with unit exposure a V = 1, i.e., with a portfolio alpha of 1. According to Eqs. 27.4, 27.6 and 27.9, the characteristic portfolio for the attribute a has the following weights, variance, return and betas:

$$\displaystyle \begin{aligned} {\mathbf{w}}_{a} & \mathbf{=}\frac{{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B} }{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}\mathbf{=} \sigma_{a}^{2}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}\\ \sigma_{a}^{2} & =\frac{1}{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1} {\boldsymbol\alpha}_{B}}\\ \widehat{R}_{a} & =\sigma_{a}^{2}{\boldsymbol\alpha}_{B}^{T}\mathbf{C} ^{-1}\widehat{\mathbf{R}}=\frac{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1} \widehat{\mathbf{R}}}{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1}\boldsymbol{\alpha} _{B}}\\ \frac{\mathbf{Cw}_{a}}{\sigma_{a}^{2}} & ={\boldsymbol\beta}_{a}=\boldsymbol{\alpha} _{B} \;.{} \end{aligned} $$
(28.32)

Thus, the asset betas with respect to this characteristic portfolio V a are equal to the asset alphas with respect to the benchmark portfolio V B. By contrast, the portfolio alpha and beta of V a with respect to the benchmark are simply one and zeroFootnote 6 as follows directlyFootnote 7 from Eqs. 28.21 and 28.6

$$\displaystyle \begin{aligned} \alpha_{a,B}\mathbf{=}1\;,\quad \beta_{a,B}=0{}\;. \end{aligned} $$
(28.33)

Since β a,B = ρ a,Bσ aσ B this implies that portfolio V a is totally uncorrelated with the benchmark:

$$\displaystyle \begin{aligned} \rho_{a,B}=0\;. \end{aligned}$$

With the asset betas as in Eq. 28.32, the marginal risk contribution, Eq. 26.25, of each asset and the percentage of risk attributed to each asset, Eq. 26.26, are:

$$\displaystyle \begin{aligned} \frac{\partial\sigma_{a}}{\partial w_{a,i}}=\sigma_{a}\alpha_{i,B} \,,\quad A_{a,i}=w_{a,i}\alpha_{i,B}~ \quad \text{for}\quad i=1,\ldots,M\;.{} \end{aligned} $$
(28.34)

Let’s look now at the residual quantities of this portfolio. Since β a,B = 0 we have

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{a}\mathbf{=w}_{a}-\beta_{a,B}{\mathbf{w}}_{B} ={\mathbf{w}}_{a} \end{aligned}$$

and therefore

$$\displaystyle \begin{aligned} \widetilde{R}_{a}\mathbf{=}\widehat{R}_{a} \ \text{und }\ \widetilde{\sigma} _{a}^{2}=\sigma_{a}^{2} \;.{} \end{aligned} $$
(28.35)

Moreover, the marginal residual risk contribution of each asset and the percentage of residual risk attributed to each asset are the same as in Eq. 28.34.

$$\displaystyle \begin{aligned} \frac{\partial\widetilde{\sigma}_{a}}{\partial\widetilde{w}_{a,i}}=\sigma _{a}\alpha_{i,B} \,,\quad \widetilde{A}_{a,i}=w_{a,i}\alpha_{i,B}~ \quad \text{for}\quad i=1,\ldots,M\;. \end{aligned}$$

Note that because of Eq. 28.15 we have

$$\displaystyle \begin{aligned} \widetilde{R}_{a}=\alpha_{a,B}={\mathbf{w}}_{a}^{T}{\boldsymbol\alpha}_{B}=1{} \end{aligned} $$
(28.36)

where the last step follows because V a is the characteristic portfolio for α B. Thus, portfolio V a not only has α a,B = 1 but it also has expected excess return \(\widehat {R}_{a}=1\). It therefore has the same expected excess return as the characteristic portfolio V A for attribute \(\widehat {\mathbf {R}}\) discussed in Sect. 27.3. However, both portfolios are not necessarily the same: among all portfolios with \({\mathbf {w}}^{T}\widehat {\mathbf {R}}=1\) there are some which also fulfill w Tα B = 1. However, among the other portfolios with \({\mathbf {w}}^{T}\widehat {\mathbf {R}}=1\) there could well be some with less risk than portfolio V a. Thus, portfolio V a is not necessarily the minimum risk portfolio with excess return equal one, i.e., not necessarily equal to V A.

Because of Eqs. 28.36, 28.35, and 28.32, the information ratio of the characteristic portfolio V A is

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{a}=\frac{\widetilde{R}_{a}}{\widetilde{\sigma}_{a}} =\frac{1}{\widetilde{\sigma}_{a}}=\frac{1}{\sigma_{a}}=\sqrt{\boldsymbol{\alpha} _{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}\;.{} \end{aligned} $$
(28.37)

This can also be read in the following way: the total risk as well as the residual risk of this portfolio are both equal to the reciprocal of the Information Ratio.

Portfolio V a has minimum risk, and because of Eq. 28.35, it also has minimum residual risk among all portfolios V  with α V,B = 1. Thus, portfolio V a maximizes the Information ratio Eq. 28.37 among those portfolios. To show that portfolio V a has maximal Information Ratio among all portfolios (and not only among those with α V,B = 1), we will now show that for any arbitrary portfolio V  with weights w, there is a portfolio with the same information ratio and a portfolio alpha equal to one. We find this portfolio by constructing its weights: the residual position weights of portfolio V  are given by Eq. 28.17. Because of Eq. 28.18, the Information Ratio (defined in Eq. 28.16) for any portfolio V λ with residual weights \(\lambda \widetilde {\mathbf {w}}\) is the same for any λ > 0. The weights w λ of this portfolio follow directly from the requirement \(\widetilde {\mathbf {w}}_{\lambda }\mathbf {=} \lambda \widetilde {\mathbf {w}}\):

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{\lambda} & \mathbf{=}\lambda\widetilde{\mathbf{w} }\\ {\mathbf{w}}_{\lambda}-\beta_{\lambda,B}{\mathbf{w}}_{B} & =\lambda\left(\mathbf{w}-\beta_{V,B}{\mathbf{w}}_{B}\right) \\ {\mathbf{w}}_{\lambda} & =\lambda\mathbf{w}+(\beta_{\lambda,B}-\lambda \beta_{V,B}){\mathbf{w}}_{B}\;.{} \end{aligned} $$
(28.38)

We can adjust λ such that the alpha of portfolio V λ becomes one:

$$\displaystyle \begin{aligned} 1\overset{!}{=}\alpha_{\lambda,B}=\widetilde{R}_{\lambda}=\lambda\widetilde {R}=\lambda\alpha_{V,B} \,\Longrightarrow\lambda=\frac{1}{\alpha_{V,B} }\;. \end{aligned}$$

Thus, for any given portfolio V  with weights w there is indeed a portfolio V λ with the same information ratio and a portfolio alpha equal to one. The weights of this portfolio can be obtained by solving the equationFootnote 8

$$\displaystyle \begin{aligned} {\mathbf{w}}_{\lambda}=\frac{1}{\alpha_{V,B}}\mathbf{w}+(\beta_{\lambda,B} -\frac{\beta_{V,B}}{\alpha_{V,B}}){\mathbf{w}}_{B}\;. \end{aligned}$$

This proves that portfolio V a, having maximal Information Ratio among all portfolios V  with α V,B = 1, has indeed maximal Information Ratio among all portfolio.

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{\text{max}}=\widetilde{\gamma}_{a}=\frac{1}{\widetilde {\sigma}_{a}}=\frac{1}{\sigma_{a}}\;.{} \end{aligned} $$
(28.39)

This Equation is the benchmark-relative analogue to Eq. 27.23.

Choosing the arbitrary portfolio V  in Eq. 28.38 to be the characteristic portfolio V a itself (and observing Eq. 28.33) we get the most general form a portfolio with the same (i.e., maximal) Information Ratio as V a may have:

$$\displaystyle \begin{aligned} {\mathbf{w}}_{\lambda} & =\lambda{\mathbf{w}}_{a}+\beta_{\lambda,B} {\mathbf{w}}_{B} \ \ \text{mit } \lambda>0{}\\ & \Longrightarrow\widetilde{\gamma}_{\lambda}=\widetilde{\gamma}_{a}\;. \end{aligned} $$
(28.40)

Equation 28.40 provides a handy method for constructing portfolios with maximum Information ratio using the characteristic portfolios V a and V b = V B. Furthermore, it provides a powerful tool to check whether a given portfolio has maximum IR. We will come back to this point later.

To complete this section, we will now show that a relation similar to Eq. 27.25 for Sharpe Ratios also holds in the benchmark-relative framework for Information Ratios. With Eq. 28.32 and the general Eq. 28.9, the alpha of any portfolio with weights w can be expressed in terms of covariances with the characteristic portfolio V a as

$$\displaystyle \begin{aligned} \alpha_{V,B}={\mathbf{w}}^{T}{\boldsymbol\alpha}_{B}=\frac{{\mathbf{w}}^{T} \mathbf{Cw}_{a}}{\sigma_{a}^{2}}=\frac{\delta t\operatorname*{cov}(r_{V} ,r_{a})}{\sigma_{a}^{2}}=\frac{\delta t\operatorname*{cov}(\widehat{r} _{V},\widehat{r}_{a})}{\sigma_{a}^{2}}=\frac{\delta t\operatorname*{cov} (\widetilde{r}_{V},\widetilde{r}_{a})}{\sigma_{a}^{2}}\;, \end{aligned}$$

where the second-to-last step is trivial, since subtracting a constant r f doesn’t change the covariances. The last step, however, is not trivial and only holds because portfolio V a has zero beta with respect to the benchmark.Footnote 9 With this alpha, the IR of portfolio V  can be written asFootnote 10

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{V}=\frac{\widetilde{R}_{V}}{\widetilde{\sigma}_{V}} =\frac{\alpha_{V,B}}{\widetilde{\sigma}_{V}}=\frac{\delta t\operatorname*{cov} (\widetilde{r}_{V},\widetilde{r}_{a})}{\widetilde{\sigma}_{V}\sigma_{a}^{2} }=\underset{\widetilde{\gamma}_{a}}{\underbrace{\frac{1}{\widetilde{\sigma} _{a}}}}\underset{\operatorname*{corr}(\widetilde{r}_{V},\widetilde{r}_{a} )}{\underbrace{\frac{\delta t\operatorname*{cov}(\widetilde{r}_{V} ,\widetilde{r}_{a})}{\widetilde{\sigma}_{V}\widetilde{\sigma}_{a}}}}\;, \end{aligned}$$

where in the second step we have used Eq. 28.15 and in the last step we have used Eqs. 28.35 and 28.37. Therefore, the IR of any portfolio V  is given by the (maximum) IR of the characteristic portfolio V a and the correlation between the residual returns of those two portfolios:

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{V}=\widetilde{\gamma}_{a}\operatorname*{corr} (\widetilde{r}_{V},\widetilde{r}_{a})\ \ \ \text{for all portfolios } V\;.{} \end{aligned} $$
(28.41)

This equation is the benchmark-relative analogue to Eq. 27.25.

5 Relations Between Sharpe Ratio and Information Ratio

To find out how the Sharpe Ratio used in Total Return Management is related to the Information Ratio used in the benchmark-relative framework, we will analyze the two characteristic portfolios with maximum Sharpe Ratio, i.e., portfolios V A of Sect. 27.3 and portfolio V m of Sect. 27.4, in the benchmark framework.

5.1 The Market Portfolio

We will begin by applying Eq. 28.40 to check if the market portfolio of Sect. 27.4 has maximum IR. To bring its weights w m into a form similar to Eq. 28.40, we write the asset returns in Eq. 28.4 with Eqs. 28.32 for α B and 28.29 for β B:

$$\displaystyle \begin{aligned} \widehat{\mathbf{R}}={\boldsymbol\alpha}_{B}+{\boldsymbol\beta}_{B}\widehat{R} _{B}=\frac{\mathbf{Cw}_{a}}{\sigma_{a}^{2}}+\frac{\mathbf{Cw}_{B}}{\sigma _{B}^{2}}\widehat{R}_{B} \;.{} \end{aligned} $$
(28.42)

Expressing now the asset returns on the left hand side in terms of the market portfolio as in Eq. 27.31, we get

$$\displaystyle \begin{aligned} \frac{\mathbf{Cw}_{m}}{\sigma_{m}^{2}}\widehat{R}_{m}=\frac{\mathbf{Cw}_{a} }{\sigma_{a}^{2}}+\frac{\mathbf{Cw}_{B}}{\sigma_{B}^{2}}\widehat{R}_{B}\;. \end{aligned}$$

Solving for w m shows that the market portfolio can be written as a combination of the benchmark portfolio V B and the characteristic portfolio V a.

$$\displaystyle \begin{aligned} {\mathbf{w}}_{m} & =\frac{\sigma_{m}^{2}}{\sigma_{a}^{2}}\frac{1}{\widehat {R}_{m}}{\mathbf{w}}_{a}+\frac{\sigma_{m}^{2}}{\sigma_{B}^{2}}\frac{\widehat {R}_{B}}{\widehat{R}_{m}}{\mathbf{w}}_{B}\\ & =\frac{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B} }{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}}~{\mathbf{w}}_{a} +\frac{{\boldsymbol\beta}_{B}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}} {{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}}~{\mathbf{w}}_{B}\\ & =\underset{\lambda}{\underbrace{\frac{{\boldsymbol\alpha}_{B}^{T} {\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}{{\mathbf{1}}^{T}{\mathbf{C}}^{-1} \widehat{\mathbf{R}}}}}{\mathbf{w}}_{a}+\underset{\beta_{m,B}}{\underbrace {\left({\boldsymbol\beta}_{B}^{T}{\mathbf{w}}_{m}\right)} }{\mathbf{w}}_{B}\;,{} \end{aligned} $$
(28.43)

Here we have used Eqs. 27.31, 28.29 and 28.32 in the second step and Eq. 27.30 in the last step. Although it looks very similar, λ is not the portfolio alpha of the market portfolio.Footnote 11 Nonetheless, Eq. 28.43 has indeed the form of Eq. 28.40 with λ > 0 as long as \(\widehat {R}_{m}>0\). Therefore, as is always the case when the market portfolio is involved, everything only works if Inequality 26.59 holds.Footnote 12

If Inequality 26.59 holds, then the market portfolio has maximal Information Ratio (in addition to having maximal Sharpe Ratio).

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{m}=\widetilde{\gamma}_{a}=\widetilde{\gamma}_{\text{max}}\;.{} \end{aligned} $$
(28.44)

Because of Eq. 28.41, this means that the residual returns of the market portfolio are fully correlated with the residual returns of portfolio V a.

$$\displaystyle \begin{aligned} \operatorname*{corr}(\widetilde{r}_{m},\widetilde{r}_{a})=1\;. \end{aligned}$$

From the above expression for w m, we find that the residual weights of the market portfolio can be expressed in terms of the weights of portfolio V a as

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{m}\boldsymbol{\equiv}{\mathbf{w}}_{m}-\beta_{m,B}\mathbf{w} _{B}=\frac{{\boldsymbol\alpha}_{B}^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B} }{{\mathbf{1}}^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}}{\mathbf{w}}_{a} =\frac{\sigma_{m}^{2}}{\sigma_{a}^{2}}\frac{1}{\widehat{R}_{m}}{\mathbf{w}}_{a}\;.{} \end{aligned} $$
(28.45)

With these weights, the residual risk of the market portfolio becomes

$$\displaystyle \begin{aligned} \widetilde{\sigma}_{m}=\sqrt{\widetilde{\mathbf{w}}_{m}^{T}\mathbf{C} \widetilde{\mathbf{w}}_{m}}=\frac{\sigma_{m}^{2}}{\sigma_{a}^{2}}\frac {1}{\widehat{R}_{m}}\sqrt{{\mathbf{w}}_{a}^{T}\mathbf{Cw}_{a}}=\frac{\sigma _{m}^{2}}{\sigma_{a}\widehat{R}_{m}}\;.{} \end{aligned} $$
(28.46)

Similarly, the residual return of the market portfolio is

$$\displaystyle \begin{aligned} \widetilde{R}_{m}=\widetilde{\mathbf{w}}_{m}^{T}\widehat{\mathbf{R} }\boldsymbol{\equiv}\frac{\sigma_{m}^{2}}{\sigma_{a}^{2}}\frac{1}{\widehat{R}_{m} }{\mathbf{w}}_{a}^{T}\widehat{\mathbf{R}}=\frac{\sigma_{m}^{2}}{\sigma_{a}^{2} }\frac{\widehat{R}_{a}}{\widehat{R}_{m}}\;.{} \end{aligned} $$
(28.47)

Inserting these results into Eq. 28.16 we find the Information Ratio of the market portfolio to be the same as the Sharpe Ratio of portfolio V a.

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{m}=\frac{\widetilde{R}_{m}}{\widetilde{\sigma}_{m}} =\frac{\widehat{R}_{a}}{\sigma_{a}}=\gamma_{a} \end{aligned}$$

or, written with \(\widetilde {\gamma }_{\text{max}}\) from Eq. 28.44:

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{\text{max}}=\widetilde{\gamma}_{m}=\gamma_{a}\;.{} \end{aligned} $$
(28.48)

where in the last step we have used Eq. 28.44. Therefore, the (maximum) Information Ratio of portfolio V a is equal to its Sharpe Ratio. However, this is not the maximum Sharpe Ratio. The relation between the maximum Sharpe Ratio γ m (see Eq. 27.33) and the maximum Information Ratio \(\widetilde {\gamma }_{m}=\widetilde {\gamma }_{a}\) (see Eq. 28.37) is

$$\displaystyle \begin{aligned} \frac{\gamma_{\text{max}}}{\widetilde{\gamma}_{\text{max}}}=\frac{\gamma_{m} }{\widetilde{\gamma}_{a}}=\frac{\sigma_{a}\widehat{R}_{m}}{\sigma_{m}} =\frac{\sigma_{m}}{\widetilde{\sigma}_{m}}\;, \end{aligned}$$

where we have used Eq. 28.46 in the last step. The maximum Sharpe Ratio is as much larger than the maximum Information Ratio as the total risk of the market portfolio is larger than its residual risk.

$$\displaystyle \begin{aligned} \gamma_{\text{max}}=\frac{\sigma_{m}}{\widetilde{\sigma}_{m}}\widetilde {\gamma}_{\text{max}} \;.{} \end{aligned} $$
(28.49)

With \(\gamma _{\text{max}}=\gamma _{m}=\widehat {R}_{m}/\sigma _{m}\) and \(\widetilde {\gamma }_{\text{max}}=\widetilde {\gamma }_{m}=\widetilde {R} _{m}/\widetilde {\sigma }_{m}\), this means that the ratio of excess and residual return of the market portfolio equals the ratio of the squares of its risk and residual risk:

$$\displaystyle \begin{aligned} \frac{\widehat{R}_{m}}{\widetilde{R}_{m}}=\frac{\sigma_{m}^{2}}{\widetilde {\sigma}_{m}^{2}}=\frac{\operatorname*{var}\left[ r_{m}\right] }{\operatorname*{var}\left[ \epsilon_{m}\right]} \;,{} \end{aligned} $$
(28.50)

where in the last step we have used the original definition 28.14 of the residual risk.

The marginal contributions of the assets to the residual risk of the market portfolio are by definition

$$\displaystyle \begin{aligned} \frac{\partial\widetilde{\sigma}_{m}}{\widetilde{\mathbf{w}}_{m}} =\frac{\mathbf{C}\widetilde{\mathbf{w}}}{\sqrt{\widetilde{\mathbf{w}} ^{T}\mathbf{C}\widetilde{\mathbf{w}}}}=\frac{\mathbf{C}\widetilde{\mathbf{w}} }{\widetilde{\sigma}_{m}}\;. \end{aligned}$$

With the above results 28.45 and 28.46, this can be written as

$$\displaystyle \begin{aligned} \frac{\partial\widetilde{\sigma}_{m}}{\widetilde{\mathbf{w}}_{m}}=\frac {\sigma_{m}^{2}}{\sigma_{a}^{2}}\frac{1}{\widehat{R}_{m}}\frac{\mathbf{Cw} _{a}}{\widetilde{\sigma}_{m}}=\frac{\mathbf{Cw}_{a}}{\sigma_{a}}\;. \end{aligned}$$

We can now apply Eqs. 28.32 to write the marginal residual risk contributions to the market portfolio in several useful forms.

$$\displaystyle \begin{aligned} \frac{\partial\widetilde{\sigma}_{m}}{\widetilde{\mathbf{w}}_{m}}=\sigma _{a}{\boldsymbol\beta}_{a}=\sigma_{a}{\boldsymbol\alpha}_{B}\;.{} \end{aligned} $$
(28.51)

With Eq. 28.37 for σ a we get a particularly insightful result:

$$\displaystyle \begin{aligned} {\boldsymbol\alpha}_{B}=\widetilde{\gamma}_{\text{max}}\frac{\partial \widetilde{\sigma}_{m}}{\widetilde{\mathbf{w}}_{m}} \;.{} \end{aligned} $$
(28.52)

The asset alphas are proportional to the assets’ marginal residual risk contributions to the market portfolio, with the maximum Information Ratio being the constant of proportionality.

5.2 The Characteristic Portfolio of the Excess Return

Let’s now analyze the excess return’s characteristic portfolio V A with its properties given by Eq. 27.24. Since V A is closely related to the market portfolio via Eq. 27.29, we can directly use Eq. 28.43 to express V A in terms of the characteristic portfolios for alpha and beta as

$$\displaystyle \begin{aligned} {\mathbf{w}}_{A} & =\frac{\sigma_{A}^{2}}{\sigma_{L}^{2}}\widehat{R} _{L}{\mathbf{w}}_{m}=\frac{\sigma_{m}^{2}}{\sigma_{L}^{2}}\frac{\widehat{R}_{L} }{\widehat{R}_{m}}\left[ \frac{\sigma_{A}^{2}}{\sigma_{a}^{2}}\mathbf{w} _{a}+\frac{\sigma_{A}^{2}}{\sigma_{B}^{2}}\widehat{R}_{B}{\mathbf{w}}_{B}\right]\\ & =\frac{\sigma_{A}^{2}}{\sigma_{a}^{2}}{\mathbf{w}}_{a}+\frac{\sigma_{A}^{2} }{\sigma_{B}^{2}}\widehat{R}_{B}{\mathbf{w}}_{B}=\frac{{\boldsymbol\alpha}_{B} ^{T}{\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}{\widehat{\mathbf{R}}^{T} {\mathbf{C}}^{-1}\widehat{\mathbf{R}}}~{\mathbf{w}}_{a}+\underset{\beta_{A,B} }{\underbrace{\left({\boldsymbol\beta}_{B}^{T}{\mathbf{w}}_{A}\right) } }{\mathbf{w}}_{B}\;, \end{aligned} $$

where Eqs. 27.18, 27.24, 28.29 and 28.32 have been used. This again has the form of Eq. 28.40 with λ > 0 for any positive definite Matrix C. Thus, the excess return’s characteristic portfolio has maximal Information Ratio (in addition to having maximal Sharpe Ratio).

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{A}=\widetilde{\gamma}_{a}=\widetilde{\gamma}_{\text{max}}\;. \end{aligned}$$

This holds in every situation, even if Inequality 26.59 does not hold. The residual weights of portfolio V A can now be expressed in terms of the weights of portfolio V a as

$$\displaystyle \begin{aligned} \widetilde{\mathbf{w}}_{A}\boldsymbol{\equiv}{\mathbf{w}}_{a}-\beta_{A,B}\mathbf{w} _{B}=\frac{\sigma_{A}^{2}}{\sigma_{a}^{2}}{\mathbf{w}}_{a}\;. \end{aligned} $$

With these weights, the residual variance of the portfolio becomes

$$\displaystyle \begin{aligned} \widetilde{\sigma}_{m}=\sqrt{\widetilde{\mathbf{w}}_{A}^{T}\mathbf{C} \widetilde{\mathbf{w}}_{A}}=\frac{\sigma_{A}^{2}}{\sigma_{a}^{2}} \sqrt{{\mathbf{w}}_{a}^{T}\mathbf{Cw}_{a}}=\frac{\sigma_{A}^{2}}{\sigma_{a}^{2} }\sigma_{a}\;. \end{aligned} $$

Similarly, the residual return of the portfolio is

$$\displaystyle \begin{aligned} \widetilde{R}_{A}=\widetilde{\mathbf{w}}_{A}^{T}\widehat{\mathbf{R}} =\frac{\sigma_{A}^{2}}{\sigma_{a}^{2}}{\mathbf{w}}_{a}^{T}\mathbf{R} =\frac{\sigma_{A}^{2}}{\sigma_{a}^{2}}\widehat{R}_{a}\;. \end{aligned} $$

Inserting these results into Eq. 28.16, we find the Information Ratio of the portfolio to be the same as the Sharpe Ratio of portfolio V a.

$$\displaystyle \begin{aligned} \widetilde{\gamma}_{A}=\frac{\widetilde{R}_{A}}{\widetilde{\sigma}_{A}} =\frac{\widehat{R}_{a}}{\sigma_{a}}=\gamma_{a}\;. \end{aligned} $$

This is no surprise since we already established \(\widetilde {\gamma } _{A}=\widetilde {\gamma }_{a}\) and we know from Eq. 28.48 that \(\gamma _{a}=\widetilde {\gamma }_{A}\).

The relation between the maximum Sharpe Ratio γ A (see Eq. 27.23) and the maximum Information Ratio \(\widetilde {\gamma }_{A}=\widetilde {\gamma }_{a}\) (see Eq. 28.37) is

$$\displaystyle \begin{aligned} \frac{\gamma_{A}}{\widetilde{\gamma}_{a}}=\sqrt{\frac{\widehat{\mathbf{R}} ^{T}{\mathbf{C}}^{-1}\widehat{\mathbf{R}}}{{\boldsymbol\alpha}_{B}^{T} {\mathbf{C}}^{-1}{\boldsymbol\alpha}_{B}}}=\frac{\sigma_{a}}{\sigma_{A}}\;. \end{aligned} $$

Thus the relation between the maximum Sharpe Ratio and the maximum Information Ratio can also be expressed in terms of the two volatilities σ a and σ A. Together Eq. 28.49, the relationships between maximum Sharp Ratio and maximum Information Ratio are in summary:

$$\displaystyle \begin{aligned} \frac{\sigma_{m}}{\widetilde{\sigma}_{m}}=\frac{\gamma_{\text{max}} }{\widetilde{\gamma}_{\text{max}}}=\frac{\sigma_{a}}{\sigma_{A}}\;.{} \end{aligned} $$
(28.53)

The first equality, involving the market portfolio, is only true if Inequality 26.59 holds, while the second equality holds in every situation.