Keywords

1 Introduction

The central purpose of this study is to empirically test whether a penalty exists for pursuing an ethical approach to investing in Australia. In particular, we address two research questions. First, we investigate whether daily returns of Australian equity socially responsible investment funds have different tail risk exposure in the return distribution compared to that of matched conventional equity funds. We ask this question as Copp et al. (2010) identify that assessment of risk in SRIs is an area which is yet to be subjected to empirical investigation. Second, we compare two downside risk measures across socially responsible and matched conventional funds: Value-at-Risk and expected shortfall.

The literature on ethical investing is well established, as is the idea of costs/benefits incurred by investors in SRI funds, in search for an answer to the question of whether it is possible “to do well while doing good” as postulated by Hamilton et al. (1993). Traditionally, financial researchers assume that investors sole objective is to maximise returns for a given level of risk. This search for the holy grail of a mean-variance efficient portfolio necessitates holding a fully diversified portfolio of assets. This requirement of holding a well-diversified portfolio contravenes investing in SRI funds as they often impose negative screens restricting the opportunity set available for investing, resulting in the potential exclusion of entire industries (Humphrey and Lee 2011). Therefore, investing in SRI funds and having a fully diversified portfolio is simply not achievable (Hong and Kacperczyk 2009) implying further the likelihood of investors facing a penalty for following their social conscience. However, Humphrey and Lee (2011) state that the number of positive and negative screens has very little impact on returns but finds evidence to suggest the positive screening reduces risk. The literature fails to find conclusive support for the above argument with many empirical investigations reporting results that demonstrate that it is not necessary to sacrifice returns in order to pursue ethical considerations. We aim by way of an empirical investigation into SRI funds in Australia to fill this gap.

Research in this area traditionally focusses on whether SRI criteria for funds have any effect on performance compared to their conventional counterparts where performance may be measured by balance sheet or share price performance of the stocks in their investment portfolios. This type of analysis usually focuses on returns or excess returns as in, for example, Bauer et al. (2006) or Becchetti and Ciciretti (2009). We contribute to the literature by explicitly comparing the risk characteristics of SRI and conventional funds by studying tail risk exposures. This is a worthwhile question as conventional wisdom suggests that reduced diversification opportunities of SRI funds may lead to greater volatility in their portfolios. However, the less volatile nature of ethical investments could lead to steadier, more sustainable returns. If there is a difference in terms of risk, it is of interest to investors who wish to invest ethically without incurring a financial penalty.

Specifically, we look at two risk measures; Value-at-Risk (hereafter VaR) and expected shortfall (hereafter ES).Footnote 1 VaR is a threshold for the worst possible loss over a target horizon with a given level of confidence. It was famously created as a response to the financial crises at the end of the twentieth century, and the need for an easily calculated, all-encompassing risk measure that could some summarise all the risk of a trading book in a single number. VaR is widely used throughout the financial industry and is the recommended risk measure in the Basel II and Basel III accords. ES was created as a response to several criticisms of VaR both mathematically and conceptually. Whereas VaR only provides an upper limit for the worst possible loss, ES tells us the expected loss once VaR is exceeded. Unlike VaR, ES is a subadditive risk measure, i.e., ES of a portfolio is less than the sum of ES from its constituent assets. Subadditivity is important as it encourages diversification. Like VaR, ES can summarise the risk of a large portfolio of several different assets in a single number. However, ES is still not as widely used as VaR and much less is known about its performance and modelling.

Recent media exposure (Collett 2013; Liew 2012)Footnote 2 has attracted greater flow of funds into this type of investment as investors are attracted by the opportunity of benefiting from financial gains associated with investing in a portfolio that is more consistent with their social conscience (Lee et al. 2010). An argument has also been made that, given the ethical considerations which drive socially responsible investments (SRIs), investors might be willing to accept lower financial returns (Statman 2011); that is, incur a penalty for pursuing ethical investments.

The Australian funds management industry provides an interesting setting within which to study the risk exposure of SRI funds. Much of the research, which has tended to concentrate on performance-related issues and the impact of screening, has been conducted on the US market. The Australian funds management industry has one of the largest and fastest growing funds management sectors in the world. This growth is underpinned by Australia’s government-mandated retirement scheme. Further strengthening this sector is the sophistication of Australia’s investor base (Nordkvelde et al. 2013) which has resulted in the need for greater regulation and forced disclosure by fund managers within the SRI space. Additionally, Australia has a resilient economy, a world-class regulatory environment, and a multilingual skilled workforce who demand choice with respect to their investment opportunities.

Australia’s funds management industry is the largest in the Asia-Pacific region. Its size and sophistication reflects the nation’s strengths in having the regions: largest pension fund industry; the largest share market (ex-Japan) measured by free-float market capitalisation; the fastest growing foreign exchange market; and third largest high-net-worth market after Japan and China.Footnote 3 Despite this, growth fund managers within the Australian market still have fewer investment opportunities, than US counterparts, that satisfy ESG criteria as stipulated by the Social Investment Forum (SIF).Footnote 4 Therefore, as previously identified by Humphrey and Lee (2011), it is possible that Australian SRI funds performance and hence risk exposure through lack of diversification opportunities may actually be worse than that of SRI funds in the US or other developed markets.

We find evidence that investors do not pay a penalty (in terms of higher risk) to invest ethically and hence fund managers of ethical funds are performing as well as fund managers of more conventional funds. Socially responsible investment (SRI) typically refers to a style of investment that aspires to consider both financial return and social good. SRI strategies are usually monitored according to (a) environment, (b) social justice, and (c) corporate governance criteria, or ESG for short. The most common SRI approaches include the positive or negative screening of investments based on their ESG performance, and the integration of ESG factors in financial analysis where these factors represent a core driver of both value and risk in companies and assets.Footnote 5 Our results will be of interest to SRI investors in other countries other than the USA where limited investment opportunities that meet ESG criteria are available and hence impact directly on the risk associated with such investment practices.

The rest of the paper is organised as follows. Section 12.2 presents a literature review. Section 12.3 describes the methods of risk measurement, detailing VaR, ES, modelling assumptions, and our backtesting approach. Section 12.4 describes the data and methodology. Section 12.5 presents the empirical findings. Section 12.6 concludes.

2 Literature Review

The Social Investment Forum, a national not-for-profit organisation charged with promoting the concept, practice, and growth of socially responsible investing (SRI), describes socially responsible investing as “an investment process that considers the social and environmental consequences of investments, biota positive and negative, within the context of rigorous financial analysis”.

The early work within the SRI space trace back to the seminal work by Hamilton et al. (1993) who present three alternative hypotheses with respect to the relative returns of SRI shares compared with that of conventional companies. First, the authors postulate expected returns of socially responsible stocks are equal to the expected returns of conventional stocks. In such a world, supply and demand of such securities is matched resulting in no movement of the share price. This is termed the “no effect” hypothesis. The second hypothesis is the “doing good but not doing well” hypothesis. Here, the returns for SRI shares are lower than the expected returns of conventional shares. It is deduced there is a penalty for investing ethically.

The last hypothesis is the “doing well while doing good” hypothesis which assumes expected returns for SRI shares are higher than those of conventional shares. Proponents of such a way of thinking promote the benefit of social screening and argue that this enhances financial performance by eliminating companies of questionable business practices.

In more recent times, the literature has started to investigate these issues associated with SRI as it relates within the funds management industry. Much of this research has emerged post-2000Footnote 6 and has tended to concentrate on performance and in particular the effect of negative screening on performance (see, Lee et al. 2010; Humphrey and Lee 2011). Negative screening involves rejecting investment opportunities due to the nonsatisfaction of ESG criteria (e.g. shares are often excluded that invest in gambling, tobacco, and pornography industries amongst others), and the effect that this has on investment returns.

Theories advocating ESG propose that corporate social responsibility (CSR) increases NPVs and also acts as a signalling mechanism used by companies for indicating prosperity which in turn results in superior subsequent performance (Heal 2005; Fisman et al. 2006). These theories, however, are at odds with traditional economic thinking which states the imposition of noneconomic values by trustees of managed funds is inappropriate and that “the social responsibility of business is to increase profit” (Friedman 1970). Two main hypotheses are established within the SRI literature that associates SRI to share price (see, Derwall et al. 2011). First, the “shunned-stock hypothesis” which infers SRI leads to excess demand for CSR leader shares, and shortage of demand for CSR laggard shares, resulting in excess returns for the latter. Secondly, the “errors-in-expectations hypothesis” which implies that positive screens in favour of highly ranked CSR shares result in outperformance due to CSR signals not being priced correctly. Derwall et al. (2011) reconcile the two phenomena to coexist and show that outperformance of highly ranked CSR shares are eventually arbitraged away after longer time horizons where information contained in CSR is eventually impounded into share price.

It has also argued that a reflexive effect could occur with investors diverting funds away from polluting companies which in turn cause companies to alter behaviour (Heinkel et al. 2001). From a financial perspective, yet an alternative theory predicts a cost in performance arises due to reduced diversification opportunities that are a direct cost of screening practices in place (Herzel and Nicolosi 2013).

Humphrey and Lee (2011) argue that due to the constraints in place for eliminating “sinful” industries from inclusion within their portfolios, it is logical to postulate that SRI funds are likely to underperform compared with both the broader market and unconstrained fund managers. Indeed, Ooi and Lajbcygier (2013) provide evidence that sinful industries outperform SRI both on average and with economic significance about 4% per annum.

Like the theoretical literature, much of the empirical literature is similarly split in its opinion of how much SRI penalises investors by way of lower returns or benefits an investment portfolio as a resulting of adhering to ESG criteria. In our review of this literature, we focus on studies which concentrate on the impact of SRI performance rather than on the strand of research that is concerned with investor behaviour.Footnote 7 Guerard (1997) finds no statistical difference exists in share returns when comparing ethically screened and unscreened universes. Similarly, Kurtz (1997) reports mixed evidence as to whether social factors such as environmental policies, employee relations, and research and development (R&D) spending is associated with abnormal returns. In contrast, Statman (2006, 2007) find evidence of higher SRI returns but also higher tracking errors of SRI portfolios compared to conventional benchmarks. Becchetti et al. (2008) look at performance of shares in the Domini Social Index and find that companies going into this index report higher return on equity and higher sales and productivity.

In two separate studies, Bauer et al. (2005, 2006) apply four-factor models to compare performance of ethical mutual funds against conventional counterparts. In both cases, the studies document that, after controlling for common factors (such as market, size, book-to-market, and momentum), there is no penalty in being an ethical investor. Interestingly, they observe a learning period where the performance of ethical fund managers gradually improves to catch up with the performance of conventional funds. Becchetti and Ciciretti (2009) investigate SRI performance comparing socially responsible portfolios versus a control sample and find no evidence of difference in excess returns. Renneboog et al. (2008) find evidence of underperformance of SRI funds and mixed evidence of a “smart money” effect in the case of SRI investors who are able to identify poorly performing funds but not outperforming funds. The link between SRI and book-to-market ratios is investigated by Galema et al. (2008) who find a negative effect exists which also affects alpha negatively. Hong and Kacperczyk (2009) look at sin shares from alcohol, tobacco, and gaming industries. They find less analyst coverage and institutional ownership which results in a significant price effect of 15–20%.

They argue these shares tend to be cheaper as a result of higher litigation risk and being subject to more scrutinised accounting and regulation.

We now shift our attention to empirical evidence which investigates SRI within the Australian market. In an early study, Bauer et al. (2006) investigates the risk-adjusted performance of 25 Australian retail ethical funds and finds SRI underwent a significant catching up phase throughout the period 1992–1996, after which delivering returns similar to those of conventional funds in the later period 1996–2003. In a separate study which investigates the return performance of 89 SRI funds, Jones et al. (2008) document that ethical funds significantly underperform the market. In more recent times, Copp et al. (2010) find that systematic risk of SRI both in Australia and internationally increases more than that of conventional funds during economic downturns. Humphrey and Lee (2011) extend the work of Jones et al. (2008) in a study which investigates 24 Australian equity SRI funds. They find no significant difference between SRI and conventional counterparts with respect to return and postulate no penalty exists in terms of risk-adjusted returns for pursing a socially conscious investment strategy.

When examining the impact of SRI on market risk, one must consider that less-diversified SRI funds subjected to a restricted investment universe will be subject to more idiosyncratic risk. However, many SRI investors with would certainly be of the view that their investments are long term, sustainable, and above all else benefiting the world in which we live. Such an investment would be less volatile, less subject to corporate scandal, and offered more protection by the government. It is observed that ethical funds are less exposed to market variability than conventional funds (Bauer et al. 2005, 2006) and are more value-orientated than growth-orientated funds. Also, SRI portfolios exhibit lower conditional volatility and more robustness to shocks (Becchetti et al. 2008) after fitting GARCH(1,1) and APARCH(1,1) models. Prior research shows that negative screening significantly increases market risk and reduces funds abilities to form diversified portfolios (Humphrey and Lee 2011). Bollen (2007) compares volatility of monthly flows into SRI and conventional funds and finds SRI flows to be less volatile. He finds that US SRI fund flows are less sensitive to past negative returns than are conventional funds, but the flows of SRI funds are more sensitive to past positive returns. A similar study is carried out by Benson and Humphrey (2008) on monthly fund flows and the flow–performance relationship. They find more persistence in the case of SRI flows to that of conventional fund flows with SRI investors less sensitive to lagged returns and more likely to reinvest in similar shares. Renneboog et al. (2011) find that younger and smaller SRI funds have less volatile returns tending to attract more fund flows than other SRI funds but can find no evidence of superior performance.

It is important to consider how to measure risk. The volatility of returns is a standard tool used by investors to evaluate investments, for example, in technical analysis or Sharpe ratios. Despite this, however, volatility is not completely adequate because it does not contain information about tail behaviour, i.e. the extreme returns that can greatly affect an investments value. VaR and ES have provided potential solutions and EVT was developed with this practical application in mind. It is generally accepted that a risk measure should have the property of subadditivity (McNeil et al. 2005). Subadditivity requires that the act of merging portfolios has no escalation in risk, i.e., it promotes diversification. ES demonstrates this property, as does VaR in most cases.

3 Methods for Estimating Value-at-Risk and Expected Shortfall

VaR and ES are risk measures used to determine expected losses with a given probability. Specifically, given a set of returns on an investment, the (1 − p)% VaR, i.e. the VaR at the (1 − p)% confidence level, is the p-quantile of the return distribution. VaR thus measures the maximum an investment can lose over a certain time horizon with probability p. One criticism of VaR is that it does not provide information on the shapes of the tails above or below the (1 − p)% confidence level. Thus, we assign to each VaR an associated (1 − p)% ES which measures the expected loss of the investment on the condition that the loss is greater than VaR. This study adopts the approach detailed in McNeil et al. (2005) and Jorion (1997) to measure and ES.Footnote 8 For our particular work, we study the 1-day VaR and ES for three confidence levels, 95, 99, and 99.5%. We are mainly interested in returns less than a certain amount, i.e. losses. Losses are generally described as the “left” tail of the distribution. If we define X t as the time series of negative log returns, we can define the (1 − p)% VaR at time t, denoted by VaR t,p , as

$$ \text{P}\left( {X_{t + 1} \le - {\text{VaR}}_{t,p} } \right) = p, $$
(1)

with the associated (1 − p)% ES given by

$$ ES_{t,p} = {\text{E}}\left[ {X_{t + 1} \left| {X_{t + 1} > {\text{VaR}}_{t,p} } \right.} \right] $$
(2)

where E[X|Y] represents the expectation of X conditional on Y.

We present three approaches for calculating the VaR in expression (1) and ES in expression (2).

The GARCH models are historically used to account for heteroscedasticity through a time-varying volatility. In this connection, we use the EGARCH which is more flexible than a standard GARCH as it can capture the asymmetric reactions of volatility to positive and negative shocks, i.e. the leverage effect. In addition to this, the volatility process should always be positive. The EGARCH model is characterised by the following return series dynamics:

$$ X_{t} = \mu + \sigma_{t} Z_{t} $$
(3)

where the innovations {Z t } t≥0 are stationary i.i.d. and

$$ \log \sigma_{t}^{2} = w + \alpha \left| {\frac{{X_{t - 1} - \mu }}{{\sigma_{\text{t - 1}} }}} \right| + \beta \,\log \sigma_{t - 1}^{2} + \gamma \frac{{X_{t - 1} - \mu }}{{\sigma_{\text{t - 1}} }} $$
(4)

The model is stationary for β < 1; ω, α, and γ are parameters to be calibrated. We estimate the EGARCH model parameters using the semi-parametric approach of a quasi-maximum-likelihood estimation.

As in McNeil and Frey (2000), we then standardise the residuals and calculate one-day VaR and ES as

$$ {\text{VaR}}_{t,p} = \mu_{t + 1} + \sigma_{t + 1} {\text{VaR}}_{p} (Z) $$
(5)

and

$$ {\textit{ES}}_{t,p} = \mu_{t + 1} \sigma_{t + 1}{\textit{ES}}_{p} (Z). $$
(6)

As {σ t }t ≥ 0 is decided by the EGARCH calibration, the difference in modelling choices thus lies within the calculation of VaR p (Z) and ES p (Z). Different values will be obtained depending on the distributional assumptions our three modelling approaches make on (Z t ).

3.1 The Historical Approach

This approach estimates quantiles from the historical distribution of returns (Linsmeier and Pearson 2000). The underlying assumption of the historical approach is that the past distribution is a suitable proxy for the future.

For a sample of 1-day returns of length T, the (1 − p)% VaR is simply the estimated (1 − p)% quantile. ES can also be estimated empirically. We can write the ES of the marginal distribution of the residuals as

$$ {\text{ES}}_{p} (Z) = {\text{VaR}}_{p} (Z) + e\left( {{\text{VaR}}_{p} (Z)} \right) $$
(7)

where e(u) is the average loss in excess of a threshold, u, conditional on the threshold having been exceeded.

3.2 The Gaussian Approach

A common approach is to assume that the marginal distribution of the residuals is a standard Gaussian (Jorion 1997). The (1 − p)% VaR is then simply given by the expression

$$ {\text{VaR}}_{p} \left( Z \right) = N^{ - 1} \left( p \right), $$
(8)

where N (−) is the cumulative distribution function of the standard Gaussian distribution. In addition, ES is known in closed form for Gaussian models. In particular, it is well known that

$$ {\text{ES}}_{P} (Z) = \frac{{N^{\prime } ({\text{VaR}}_{p} (Z))}}{p} $$
(9)

.

3.3 The EVT Approach

Many financial time series exhibit heavy-tailed distributions. For this reason, it is desirable to have as much flexibility in modelling tail behaviour as possible. EVT was developed with this application in mind (McNeil and Frey 2000). Indeed, tail behaviour may be modelled directly using EVT without making assumptions on the return distribution as a whole. We adopt, in particular, the peaks over thresholds (POT) method where we identify extreme observations that exceed a high threshold u and model these “exceedances” separately from nonextreme observations.

For a sample of exceedances of size N u the exceedance magnitude is the size of an exceedance over the threshold and is given by Y i  = Z i  − u for i = 1,, N u . We then let F (−) be the marginal distribution function of Z t for each t ≥ 0. The probability distribution of {Y t }t ≥ 0 for a given threshold u defined by

$$ \begin{aligned} Fu(y) & = {\text{P}}\left( {Zt - u \le y\left| {Zt} \right. > u} \right) \\ & = \frac{{F\left( {y + u} \right) - F\left( u \right)}}{{1 - F(u)}} \\ \end{aligned} $$
(10)

may then be written as

$$ F(z) = \left[ {1 - F(z)} \right]F_{u} (y) + F(u). $$
(11)

EVT provides the theory to describe the limiting distribution of (11) as u → z +, where z + denotes the upper (possibly infinite) limit of the distribution of Z t . Indeed, Balkema and Haan (1974) and Pickands (1975) show that, for u sufficiently high, F u (−) is approximated by the generalised Pareto distribution (GPD). The cumulative distribution function of the GPD is given by

$$G_{\xi,\beta}(z) =\left\{\begin{aligned}&1- {\left(1+ \frac{\xi z}{\beta}\right)}^{-1/{\xi}} &&\text{if}\,{\xi}\,\ne\,0 \\ &1- \text{exp}(-z/\beta) &&\text{if}\, {\xi}\,=\,0,\end{aligned} \right.$$
(12)

where ξ ∈ R and β > 0 are called the shape and scale parameters, respectively. The parameters of the GPD can be found by a maximum-likelihood estimation (see, e.g. Embrechts et al. 1999). A value of ξ > 0 corresponds to heavy-tailed distributions.

For a data set of size T with N u exceedances, we observe that F (u) may be approximated empirically as

$$ F(u) = 1 - \frac{{N_{u} }}{T}. $$
(13)

For sufficiently high u, Eq. (11) thus simplifies to

$$ F(z) = 1 - \frac{{N_{u} }}{T}\left( {1 + \frac{\xi (z - u)}{\beta }} \right)^{ - 1/\xi } $$
(14)

Expression (14) is a tail estimator. It can be used to estimate VaR by observing that, by definition, we have

$$ F\left( {{\text{VaR}}_{t,p} } \right) = p. $$
(15)

By inverting this expression, we obtain

$$ {\text{VaR}}_{p} (Z) = u + \frac{\beta }{\xi }\left( {\left( {\frac{pT}{{N_{u} }}} \right)^{ - \xi } - 1} \right) $$
(16)

Additionally, the ES can also be calculated in closed form through expression (7) to obtain

$$ {\text{ES}}_{p} (Z) = \frac{{{\text{VaR}}_{p} (Z) + \beta - \xi u}}{1 - \xi }. $$
(17)

One downside of the POT analysis lies in the ad hoc determination of the threshold u. A certain balance needs to be struck between choosing a value of u high enough such that EVT is applicable and low enough such the number of exceedances is statistically significant. One method for choosing u is by analysis of the mean-excess function (MEF) plot (cf. Embrechts et al. 1999). The MEF for a GPD should be linear, and it may be possible to choose u such that the MEF is linear for all points above u. However, this method is time-consuming and often researchers will choose u such that the number of exceedances N u is equal to some small, fixed percentage (e.g. 5%) of the size of the data sample.

3.4 Backtesting

Following, for example, Christoffersen (1998) and Berkowitz and O’Brien (2002), the relative performance of each VaR approach is determined by a violation ratio. The idea is that we compare the VaR estimated at time t with the actual return observed at time t + 1. A violation occurs if a realised loss is greater than the estimated VaR on a given day. The violation ratio is then defined as the total number of violations divided by the total number of estimated VaRs.

The motivation behind this is as follows. Given a VaR number at a certain confidence level (1 − p)%, we can expect that approximately p% of the time it will be exceeded by the next days return. This follows from the definition of VaR as a p-quantile. The more accurate the model, the closer the violation ratio is to p. A violation ratio higher (lower) than the expected one indicates that the model consistently under (over-)-estimates the risk.

4 Data Description

4.1 Sample Selection

For the purpose of the empirical study within this chapter, we use daily time series return data sourced from Morningstar Direct for Australian open-ended equity SRI and matched conventional funds for the period January 1998–November 2013. The matching process is discussed in detail in Sect. 4.2. Our final data set consists of daily returns for 13 SRI and 13 matched conventional funds. In order to concentrate our analysis on the difference between SRI and conventional investing, we ensure that all funds satisfy the following two criteria; first, all funds must have at least 75% of their equity holdings in Australian markets; and secondly, we require at least 4 years of daily return data. We impose the minimum restriction of 75% because we wish to ensure that any difference in performance between SRI and conventional funds is driven by the SRI attribute and not by altering asset allocation. The requirement to have minimum of 4-year daily data is driven by the requirement to have sufficient data observations to enable us to be able to carry out EVT.

In addition to the daily returns of both the conventional and SRI funds sourced through Morningstar, we also access other variables such as base currency; share class; inception date; investment area; domicile; and assets undermanagement (AUM). For the sample period 1998–2013, there exist 92 equity funds that have a socially conscious classification and 2896 conventional funds. However, the same funds can be duplicated with different share classes of the same fund. To avoid duplication, we conduct analysis on a distinct portfolio basis by keeping only the parent share class. We concentrate solely on Australian equity funds and, to keep focus on issues relating to SRI, we eliminate funds that have less than 75% of their holdings in domestic equity. We remove funds with missing AUM, and we study funds with at least four years of daily data to allow sufficient analysis of tail behaviour. This leaves us with 13 parent class, domestic Australian equity-focused funds.

4.2 Matching Methodology

For each SRI fund, we select the conventional fund with the lowest distance score as its matched fund to compare VaR and ES risk measures. In order to identify conventional funds with the best possible fit to the 13 SRI funds included within our final sample, we conduct the following matching algorithm. Our prematched final data set consisting of 92 SRI and 2896 conventional funds reduces to 13 SRI and 269 conventional funds. We run a matching algorithm to pair each of the 13 SRI funds with a conventional fund based on fund style, age, and size. The matching algorithm is based on the Carhart (1997) 4-factor model. The factors used are provided by the authors of Gray et al. (2014) who calculate market minus risk-free asset (RMRFT), small minus big (SMB), high minus low (HML), and momentum (MOM) factor returns using data for ASX-listed stocks over the 1990–2012 period.

The Australian factors are constructed in the spirit of Fama and French (1993) with modifications to reflect distribution of market cap amongst Australian stocks. Following the work of Brailsford et al. (2012), the Gray et al. (2014) 4-factor construction uses a modification regarding cut-offs. Brailsford et al. (2012) noted that the distribution of size and book-to-market ratios in Australia was heavily skewed and therefore justified the modification to the standard Fama and French methodology of using median market cap to partition into small and big stocks was not adequate. Gray et al. (2014) identify a portfolio of approximately 300 “large” stocks by taking the stocks that contribute the top 90% of total market capitalisation and a portfolio of about 1700 “small” stocks that contribute the remaining 10%.

In line with Fama and French (1993), small minus big (SMB) and high minus low (HML) factors are constructed by averaging across portfolios. For example, SMB is the difference in return between the small and large portfolios. A momentum (MOM) factor is also constructed using six portfolios double-sorted on size and prior 12-month returns. This procedure provides a time series of monthly returns to SMB, HML, and MOM factor-mimicking portfolios. Each of the SMB, HML, and MOM portfolios is constructed for the Australian market.

We run the Carhart 4-factor regressions using the fund monthly returns using the following equation:

$$ \begin{aligned} r_{t} - r_{f,t} & = \alpha + \beta_{\text{MKT}} \left( {r_{{{\text{MKT,}}t}} - r_{f,t} } \right) + \beta_{\text{SMB}} r_{{{\text{SMB}},t}} \\ & \quad + \beta_{\text{HML}} r_{{{\text{HML,}}t}} + \beta_{\text{MOM}} r_{{{\text{MOM,}}t}} + E_{t} \\ \end{aligned} $$
(18)

where r t represents the monthly returns of the fund; r MKT,t , the monthly returns of the market; r SMB,t , the monthly returns of the SMB portfolio; r HML,t , the monthly returns of the HML portfolio; r MOM,t , the monthly returns of the MOM portfolio; and r f,t , represents the risk-free rate. In addition, α is the risk-adjusted performance estimate for the fund; β MKT, β SMB, β HML, and β MOM are the regression betas of the fund with respect to the market MKT, SMB, HML, MOM portfolios, respectively; and E t are a series of i.i.d. innovations.

Our matching procedure is then similar to that of Bollen (2007) and Renneboog et al. (2011) who calculate a matching “score” for candidate funds that are no more than 2 years older or younger than the SRI fund and match whether they charge load fees or not. Specifically, for each SRI fund i and candidate fund j we calculate the score S i,j defined as

$$ \begin{aligned} S_{i,j} & = \frac{{\left( {{\text{AUM}}_{i} - {\text{AUM}}_{j} } \right)^{2} }}{{\sigma_{\text{AUM}}^{2} }} + \frac{{\left( {\beta_{{{\text{MKT}},i}} - \beta_{{{\text{MKT,}}j}} } \right)^{2} }}{{\sigma_{\text{MKT}}^{2} }} + \frac{{\left( {\beta_{{{\text{SMB}},i}} - \beta_{{{\text{SMB}},j}} } \right)^{2} }}{{\sigma_{\text{SMB}}^{2} }} \\ & \quad + \frac{{\left( {\beta_{{{\text{HML}},i}} - \beta_{{{\text{HML,}}j}} } \right)^{2} }}{{\sigma_{\text{HML}}^{2} }} + \frac{{\left( {\beta_{{{\text{MOM}},i}} - \beta_{{{\text{MOM}},j}} } \right)^{2} }}{{\sigma_{\text{MOM}} }} \\ \end{aligned} $$
(19)

where AUM k are the assets under-management of fund k. We also scale the difference in betas for variance where σ 2MKT , σ 2SMB σ 2HML , and σ 2MOM the cross-sectional variances of each beta across the total sample of funds for each of the MKT, SMB, HML, and MOM portfolios, respectively.

4.3 Summary Statistics

Table 12.1 reports descriptive statistics for daily returns for each SRI fund (refer Panel A) and each matched conventional fund (refer Panel B). For the SRI funds, the full sample sizes range from 1294 days for OnePath to 3035 days for the BT Ethical Share Fund, and for the matched conventional funds, the sizes range from 2113 of ANZ Australian Equity Share Fund to 3711 for Hyperion Small Growth Companies. On average conventional funds (0.04%) outperformed conventional funds (0.03.5%) although the SRI fund Perpetual Wholesale Ethical has the highest average return overall with 0.06%. In general, the funds have high standard deviations (volatility), are negatively skewed, and exhibit excess kurtosis indicating that log returns do not follow a Gaussian distribution. This helps to motivate the use of historical and EVT-based VaR estimates in our analysis. Indeed, the lowest kurtosis estimates are 3.83 for the SRI AMP Leaders Australian Share Fund and 3.80 for the conventional fund Advance Australian Smaller Companies which indicates heavy-tailed behaviour. It is clear that the Hyperion Small Growth Companies fund has the most extreme kurtosis.

Table 12.1 Descriptive statistics for SRI and conventional funds

In Table 12.2, we present the summary statistics for the residuals of each SRI fund and matched conventional fund, respectively. The residuals are obtained after fitting the EGARCH model described in Sect. 12.3. We can see from the kurtosis that in general the residuals show a reduction in heavy-tailed behaviour. We fit a GPD to each set of residuals and report the resulting parameter estimates (also in Table 12.2). For each fund, we chose the threshold u such that the number of exceedances N u would be 5% of the total length of the time series. We also report the threshold parameter for transparency. As discussed, the parameter ξ indicates heavy-tailed behaviour in the range ξ > 0. On average, the funds in our data set appear to be heavy tailed, although this is not true in general. The “heaviness” of each tail is also not especially strong as values of ξ = 0.5 are not uncommon in financial markets (see, e.g. Fong Chan and Gray 2006) compared to our highest value of ξ = 0.37 for the AMP Sustainable Future Australian Share fund. Lastly, we note that there appears to be no noteworthy difference in the ξ parameter between SRI and their conventional funds.

Table 12.2 Summary statistics for EGARCH residuals

5 Results

5.1 The Relative Risk of SRI and Conventional Funds

We now present the main result of our paper. We calculate the 95, 99, and 99.5 percentile 1-day VaR and ES for each SRI and matched conventional fund using the full data sample. Results are reported in Tables 12.3 and 12.4. We compute rolling VaR and ES using Eqs. (5) and (6). The methodology that we adopt to obtain our rolling estimates is as follows. We utilise a sliding window that is 70% of the size of the data set for each fund. The method is adaptive in that each model, VaR and ES are re-estimated as the window rolls through the data points. In this type of exercise, it is impractical to optimally determine a threshold value for each window as it progresses through the data set via examining the mean-excess function. Therefore, at each step we choose the number of exceedances to be equal to the upper 5% of ranked losses (negative returns). The literature is not clear in terms of what constitutes an appropriate threshold (cf. Hult et al. 2012). We selected 5% quantile of returns following a visual inspection of the mean-excess plots (essentially plots of exceedances verses u, see Hult et al. 2012) of a random sample of funds. It was found that this heuristic approach left a suitably ample sample size.

Table 12.3 Estimated VaR of SRI and conventional funds
Table 12.4 Estimated ES of SRI and conventional funds

Table 12.3 presents the 1-day VaR for each SRI and matched conventional fund using the historical, Gaussian, and EVT estimates of VaR. The VaR estimates show that the SRI and matched conventional counterparts exhibit similar overall risk characteristics, e.g. a 95% VaR around 1.66. This lack of discernible significant difference is consistent when conducting sensitivity analysis at 95, 99, and 99.5% confidence levels. We can see similar results in Table 12.4 for the 1-day ES measure. Roughly half of the sample of SRI funds exhibit higher risk and roughly half exhibit lower risk than their matched conventional counterparts although the differences are slight. On this evidence, we are not able to reject the hypothesis that there is no significant difference in risk between SRI and conventional funds.

It is possible to draw very similar conclusions from both the results of the rolling VaR and ES estimations when observed graphically as in Fig. 12.1. Thus, to save space we have elected to present only the results for the 95% VaR.Footnote 9 The VaR numbers that we provide here are estimated using the Gaussian approach, except for the Hyperion Small Growth Companies fund where we use the historical approach. This is consistent with the backtesting procedure introduced in Sect. 12.3, the results of which are discussed below.

Fig. 12.1
figure 1

Rolling 1-day Value-at-Risk at the 95% confidence level calculated with the Gaussian approach for each SRI fund (blue) and its matched conventional fund (red-dashed). We also give the differences (orange) with 95% confidence levels (orange-dotted) estimated via bootstrapping

Fig. 12.2
figure 2

QQ plots for the AMP FLI-AMP Sustainable Future Australian Shares fund overlaid with a Gaussian distribution

Figure 12.1 therefore presents our key result, that is, the dynamic difference between risks as measured for SRI and conventional funds. We provide the 95% VaR for the SRI and conventional funds, the difference between the two, and 95% confidence levels for the value of this difference as calculated via bootstrapping. The main conclusion we can draw from this analysis is that there does not seem to be a consistent penalty of greater risk for SRI investors. In particular, the difference between risk measures is rarely above or below zero with 95% confidence.

On closer inspection, it appears that just after the beginning of 2011, many funds do enter into a period of decoupling, with regard to risk, from their matched counterparts. Once again the SRI funds are neither consistently more nor less risky than the conventional funds, with the actual ratio being near to 61% in favour of a decrease (SRI less risky). However, the average difference in risk does seem to be significantly different from 0. Observing the graphs again we may be convinced that this decoupling has something to do with a large increase in VaR numbers that also occurs at the beginning of 2011. This period in time actually corresponds to a decrease in Australian stock prices and volatility of the ASX share index. We therefore propose that the increase in VaR numbers is due to funds increasing their exposure during the low volatility period. The difference in risk numbers between SRI and conventional funds may be due to the different mechanics involved in rearranging the holdings of each style of fund. For example, in an SRI fund one must consider screening results and the inherent sinfulness of investments before acting. However, we possess the data regarding the number of positive and negative screenings that each SRI fund uses and can see no correlation between these and the increase or decrease in the risk difference to conventional funds in 2011. However, we also noticed the five funds with the largest AUM; BT Ethical Share, Perpetual Wholesale Ethical, Perennial Socially Responsive Shares, Hunter Hall Australian Value Trust, and Australian Ethical Smaller Companies all increased their risk to a lesser extent than their conventional counterparts. This may be due to their size making the mechanics of increasing risk more difficult. Coincidentally each of these funds have a large number of negative screens (6–8).

5.2 Robustness Checks

As discussed in Sect. 3.4, in order to check the validity and the robustness of the result to the model specification, backtesting procedures are conducted. Table 12.5 contains the violation ratios for each confidence level of VaR for each of the historical, Gaussian, and EVT approaches for the SRI and conventional funds. For a VaR with confidence levels (1 − p)%, we select the model that has a violation ratio closest to p%. In the event of a tie, we decided to favour the Gaussian due to its simplicity and on the strength of its overall performance.

Table 12.5 Violation ratios for SRI and conventional funds

We can see that the Gaussian approach proves more often to be the appropriate risk measure, especially for the lower confidence levels. In fact the Gaussian approach can be considered to perform better in the backtesting for every fund except for Hyperion Small Growth Companies at the 95% confidence level where the historical approach seems best. This result may be surprising since the Gaussian distribution is often considered to be an inadequate description of asset returns. However, on closer examination of QQ plots generated by the log returns,Footnote 10 we can see that our fund data are not as heavy tailed as various other financial time series. For example, the electricity markets or emerging markets studied in Fong Chan and Gray (2006) and Gençay and Selçuk (2004).

The violation ratio test also shows us that all three approaches seem to be consistently over estimating risk. This could be because the sliding window generally covers the credit crunch event and associated crises of 2008–2011 and is thus calibrated to a “riskier” state of the world. In order to test this, we repeated the analysis with a smaller rolling window such that we would not include the whole crisis period in every calibration. However, the numbers obtained were similar to those presented in this study.

5.3 Performance Metrics of SRI and Conventional Funds

In this section, we compare risk-adjusted monthly fund performance metrics for our sample of Australian SRI and matched conventional funds. Fund mangers are mostly interested in overall performance comparison using monthly time horizons. Table 12.6 shows monthly Sharpe (1964), Treynor and Mazuy (1966), and Sortino and Forsey (1996) ratios for the funds. Along with the standard Sharpe ratio (returns minus risk-free rate divided by standard deviation), two other “modified Sharpe” ratios are calculated with the usual excess returns divided by VaR and Expected Shortfall. Treynor ratios are calculated as the excess return divided by CAPM beta, and Sortino ratios are calculated as excess returns divided by downside deviation (standard deviation of negative returns). The risk-free rate and CAPM market returns for the performance metrics are used from the series used in the matching process outlined in Sect. 4.2.

Table 12.6 Monthly risk performance metrics for SRI and conventional funds

The results in Table 12.6 show slight outperformance (a paired t test for 65 metric differences shows significance at 1%) with the conventional funds, suggesting although SRI funds are not significantly riskier on a daily basis, they potentially can suffer in terms of performance over a 10-yr period. These reasons could be due to the costs that SRI funds face compared to conventional funds. These are reduced diversification opportunities but also the cost of researching, reviewing, and maintaining the portfolios meet corporate and socially responsible criteria. These aforementioned criteria can also mean costs adapting to changes where the threshold for exclusion should be set are constantly evolving, e.g. proposal and adoption of new screens such as exclusion of utilities due to fossil fuel exploration.

6 Conclusion

The main purpose of this paper is to contribute to the SRI literature by investigating if investors pay a penalty (in terms of higher risk) for pursuing ethical investment strategies. We do this by evaluating the performance of 13 selected SRI equity-managed funds in Australia using daily returns to see whether these funds have different tail risk exposures in the return distribution compared to that of matched conventional equity funds. The motivation for this work is that the assessment of risk in SRIs is an area still in its infancy with higher moments and the tails of the return distribution yet to be subjected to empirical investigation.

We show that overall risk is unlikely to differ between an SRI and a conventional portfolio. We have analysed the theoretical underpinning that suggests that fund managers who opt for reduced diversification opportunities as a result of positive and negative screens (which are common place with SRI funds) are likely to face greater volatility in their portfolios. However, we show that investors who wish to invest ethically do so without incurring a financial penalty in terms of tail risk. Even if an SRI manager avoids sin stocks, he or she is still likely to be able to hold at least 30–40 stocks together and therefore get most of the benefits of a Markowitz diversified portfolio. We show evidence therefore that a SRI-constrained investment universe is unlikely to affect risk and even more unlikely that any differences will manifest themselves in extreme returns in the tail(s) of the distribution.

We do observe slight outperformance by conventional funds compared to that of SRI across all of the performance metrics reported in this study. Our study is one of the number of studies for SRI funds in Australia. Cummings (2000), Tippet and Leung (2001), and Humphrey and Lee (2011) all find that the performance of SRI funds is similar to that of conventional funds in Australia. Our findings contradict these earlier studies but find support for Jones et al. (2008) who like us find that Australian SRI funds underperform conventional funds in comparison with the market benchmark. Like Jones et al. (2008) who examined the sample period 1986–2005 we also use a sample period exceeding 15 years. In contrast the previous studies which found no differences in return performance between SRI and conventional funds used shorter time periods within their analysis. As SRI is relatively in its infancy, the comparable performance of SRI versus conventional over a long term is still open for further debate.

There are several interesting possible directions for future research. Firstly, it is of great interest to further investigate the screening practices and sizes of SRI funds and determine whether the number of positive and negative screens has an impact on risk. A large cross section of SRI funds across different countries would thus represent a significant contribution to the literature. Secondly, it would be interesting to investigate the riskiness of sustainability indices as compared to the composite indices globally across different markets. This could be done using various models such as the variance-covariance, historical simulation, and extreme value theory approach to forecast VaR and ES, respectively. Finally, a closer look at performance measures on indices, such as Dow Jones Sustainability Index (DJSI) and DJSI Australia Index, that incorporate risk and downside risk, such as Sharpe, Sortino, and Treynor ratios, would further aid the discussion on the financial value of SRI.