Introduction

It is increasingly common for shareholders to have “social” as well as monetary preferences. A 2014 survey by US Trust cited in the Financial Times Footnote 1 found that 75 percent of investors under 35 “consider the social and environmental impact” when making financial investments. In the US alone, certified socially responsible assets amount to $2 trillion (Kitzmueller and Shimshack 2012). The presence of socially responsible shareholders has implications for firm governance and incentives alignment within the firm. In this paper, we study the compensation contract offered by socially responsible shareholders to the firm’s manager.

In the neo-classical framework, investors have preferences for profits, and the objective of the firm consists in maximizing profits subject to a set of constraints imposed by law and the markets. Corporate decisions are guided by the principle of shareholder value maximization. The agency problem which emanates from the separation between ownership and control can then be solved by giving the manager a stake in the firm’s profits—for example in the form of a fraction of the firm’s equity—which will encourage the manager to increase profits.

When investors are also socially responsible, their objective function becomes bidimensional: indeed, they now jointly maximize firm profits and corporate social performance (CSP). They may then also want to encourage some corporate actions that increase CSP, even at the expense of profits. Given the evidence that a substantial number of investors and investment funds do have this type of preference (Sparkes and Cowton 2004), it would make sense to give firm managers incentives to increase both firm profits and CSP, as emphasized by Baron (2008). However, in practice managerial incentives typically take the form of stocks and stock options (Frydman and Jenter 2010), and the structure of CEO compensation in socially responsible firms is not significantly different than in other similar firms (Frye et al. 2006). This might be considered puzzling: why not also incentivize managers to increase CSP? This paper will provide an answer to this apparent inconsistency: in the presence of socially responsible investors, equity-based compensation will lead a manager to jointly maximize profits and CSP according to investors’ preferences.

We use a stylized principal–agent model in which socially responsible shareholders design the contract of a manager who must then determine the allocation of firm resources toward profit maximization and CSP enhancement. Following this managerial decision, the shareholders set the stock price of the firm according to their preferences on a competitive market for the firm’s shares. Crucially, the stock price reflects both expected firm profits and CSP: socially responsible shareholders are willing to pay more for shares in a socially responsible firm, ceteris paribus. We show that the provision of managerial incentives is then surprisingly simple, and reflects current practices: it is optimal to give short-term and long-term equity holdings to the manager. As a result, the manager is concerned with the consequences of his allocation of firm resources for the value of the firm’s equity. Given that the stock price aggregates the preferences of shareholders on profits and CSP, giving equity-based incentives to a manager ensures that he will jointly maximize profits and CSP according to shareholders’ preferences. That is, even though the objective function of shareholders is bidimensional, it is sufficient to give managerial incentives along one dimension only, namely the market value of the firm’s equity. Contrary to the neo-classical framework, maximizing the market value of the firm’s equity is not equivalent to maximizing profits when shareholders value CSP.

This framework allows to derive a number of related results. First of all, we show that a decrease in social responsibility requires an increase in equity holdings: if managers derive less utility from “doing good,” the sensitivity of their pay to their performance must be increased for incentive purposes. In addition, when social preferences are heterogeneous, we show that a firm will hire a manager whose social preferences matches its average shareholder’s. Indeed, a congruence on this dimension facilitates the provision of incentives, and any discrepancy will result in inefficiencies. This suggests that firms should screen managers based on their personal preference for social responsibility (see also Besley and Ghatak 2005; Brekke and Nyborg 2008). Overall, our results suggest that socially responsible firms will offer compensation packages similar (although not identical) to other firms,’ but that they will hire more socially responsible managers.

The model also shows that more socially responsible firms will tend to have a higher valuation but lower subsequent stock returns, ceteris paribus. This distinction helps reconcile some apparently contradictory empirical findings, as reviewed for example in Margolis et al. (2007). It also emphasizes that more socially responsible firms have a lower cost of equity capital.

These results have important implications for the debate on CSR and managerial incentives. The debate on CSR was largely framed by the view of Friedman (1970) that it would be illegitimate for managers to pursue any objective other than profit maximization, because managers are the agents of shareholders. On the other side, some proponents of CSR argue that firms should not be run exclusively in the interests of shareholders, because in some instances it is desirable to depart from profit maximization (e.g., Kolstad 2007). By contrast with these two perspectives, our model shows that a firm managed in the interests of its socially responsible shareholders will maximize not only its profits but also its CSP. In addition, compensating managers based on the firm’s equity value will lead them to devote resources both to profit-enhancing activities and to CSP-enhancing activities. This result contrasts with the view summarized in Tirole (2001) that equity-based compensation “encourage[s] management to devote most of its effort to enhancing profitability and favor this objective when trading off the costs and benefits of alternative decisions.” Because of this multitask problem, Tirole (2001) then argues that social responsibility is likely to be best promoted with flat compensation rather than performance-based incentives, in contrast with the results presented in this paper. In case a measure of CSP is available, the literature argues that compensation should be contingent both on profits and on this CSP measure (e.g., Baron 2008), for example via CSP audits contingent on observing high profitability as in Sinclair-Desgagné (1999). By contrast, we show that equity-based compensation can play a useful role in fostering CSP, to the extent that shareholders are socially responsible.

Finally, the paper provides a legal basis for Corporate Social Responsibility (CSR) in the US and the UK,Footnote 2 where managers and directors have a fiduciary duty to act in the best interests of shareholders. Indeed, the model emphasizes that shareholders will optimally write contracts that induce the manager to allocate firm resources to CSP, which benefits shareholders and is consistent with the maximization of shareholder wealth.

The emerging literature on CSR is reviewed in Kitzmueller and Shimshack (2012). Our contribution to this literature is to study the incentives alignment problem in the presence of socially responsible investors. A closely related paper is Graff Zivin and Small (2005). In their model, shareholders with preferences for consumption and for charitable giving can choose between investing in a socially responsible firm, or making donations directly. With these shareholder preferences, a share in a socially responsible firm is a bundle that delivers both financial and social benefits. Graff Zivin and Small (2005) study when investments in CSP by firms or direct charitable donations are substitutes, but they do not study incentive alignment between managers and shareholders, which is the focus of this paper. Another closely related paper is Baron (2008), who builds a very rich model which relates CSP incentives to the social preferences of the firm’s consumers and shareholders. One crucial difference is that managerial compensation can be contingent on profits and CSP in Baron (2008), whereas it can be contingent on the (endogenously determined) stock price in this paper.

The rest of the paper is organized as follows. The next section presents the model. The following two sections present the main results of the paper, and revisit the model under the assumption that preferences for CSP are heterogeneous across shareholders. The last section discusses the results and concludes. The Appendix includes the proofs of the main results and provides additional technical details.

A Model of Corporate Social Responsibility

This section develops a model to study the provision of equity-based incentives to managers when shareholders value CSP. As managerial incentives can depend on the stock price, which is determined by investors, it is important to complement a model of managerial contracting by a model of portfolio choice. We rely on the existing literature to develop a stylized model which combines these two features, and which also takes into account the fact that agents do not only derive utility from wealth, but also have social preferences. We make a number of simplifying assumptions to keep a model that combines all these elements as tractable and simple as possible.

We consider a publicly listed firm run by a manager and initially owned by n shareholders. Thus, we focus on firms for which a stock price is available, and in which there is a separation between ownership and control. At \(t=0\), the manager can exert effort or not: \(e \in \{0,1\}\). A manager who exerts effort incurs a private cost worth C > 0 in monetary terms. Without effort (e = 0), profits at t = 2 are \(\tilde{\epsilon }\), where \(\tilde{\epsilon }\) is a normally distributed random variable with zero mean and variance \(\sigma ^2\) realized at t = 2, and CSP at t = 2 is 0. A manager who exerts effort (e = 1) “unlocks” the firm’s potential, and must then at t = 0 choose the allocation \(a \in [0,1]\) of firm resources given that profits at t = 2 are \(\sqrt{1-a}+\tilde{\epsilon }\), and CSP at t = 2 is \(\phi \sqrt{a}\). For the moral hazard problem to be realistic and nontrivial, we assume that neither effort e nor the resource allocation a is contractible. This is a multitasking problem in the spirit of Holmström and Milgrom (1991). As is standard in the literature, “CSR manifests itself in some observable and measurable behavior or output” (Kitzmueller and Shimshack 2012), which we define as CSP. There are decreasing returns to increasing either profits or CSP, so that it is generically socially optimal to allocate resources to both.

The parameter \(\phi \ge 0\) captures the differences in firms’ technologies and opportunities which are such that CSP is relatively more valuable in some firms and some industries than in others. This captures the notion that the capacity to undertake socially beneficial actions varies across firms and industries, for example because some firms have developed a particular expertise which is relevant for CSP while others have not. The level of \(\phi\) thus determines the “bang (in terms of CSP) for the buck” for investments in CSP. For example, a typical high \(\phi\) firm may be able to substantially reduce its polluting emissions at a very low cost; in this case, a dollar invested in CSP will generate of lot of “social good,” i.e., \(\phi\) is high. A typical medium \(\phi\) firm is able to improve its CSP by undertaking activities unrelated to its business, such as making donations. In a typical low \(\phi\) firm, resources allocated to CSP are often wasted, for example because they are used inefficiently or because they tend to be diverted (think about funds allocated to support local green entrepreneurship which are actually used to fund the ailing high-tech venture of an employee’s relative, say). With \(\phi =0\) (respectively \(\phi \rightarrow \infty\)), it is worthless to allocate any resource to CSP (resp. to profit maximization).

The important assumption that managerial “effort” is privately costly captures the divergent interests between the manager and shareholders. As in standard principal–agent models, it justifies the use of explicit monetary incentives (as will be clear in the next section, with \(C \rightarrow 0\) the contract would simply serve a risk-sharing purpose). The assumption that profits are normally distributed helps keep the portfolio choice problem tractable, but is otherwise not crucial.

At t = 1, shareholders observe a and \(\phi\), and the stock price p is established on a market for the firm shares. (We show in the “Unobservable actions” section in the Appendix that the main results of the paper are unchanged if instead shareholders observe noisy signals which are informative about expected profits and CSP but do not reveal the manager’s action.) That shareholders observe both a and \(\phi\) (or signals about firm profits and firm CSP) is crucial, because otherwise the stock price would not reflect either the financial or the social performance of the firm. At t = 1, each shareholder must then (re)allocate his wealth \(\omega\) to a portfolio that includes z shares in the firm and an investment of \(\omega -zp\) at the risk-free rate \(r_f\)—which is for simplicity normalized to zero: \(r_f=0\).Footnote 3 Given shareholders’ optimal demands for the firm stock at t = 1, the stock price is determined by the standard market clearing constraint. To simplify and alleviate calculations, we normalize the number of shares outstanding to 1, so that the equity holdings of a shareholder may also be viewed as the fraction of the firm that he owns, and firm value at t = 1 is simply equal to the stock price.

The agents (the manager and the shareholders) are risk-averse expected utility maximizers, with preferences defined over money and CSP as in Graff Zivin and Small (2005). This important assumption is consistent with the results of Andreoni and Miller (2002), who demonstrate that “altruism is rational,” in the sense that observed altruistic behavior can be generated by a standard utility function. The risk aversion assumption is essential for the portfolio choice problem and for the derivation of the stock price. For tractability, we assume that the agents have preferences with Constant Absolute Risk Aversion (CARA), with an absolute risk aversion of \(\rho\), so that their utility writes as \(U(X)=-\exp \{-\rho X \}\) (the assumption of CARA utility combined with normally distributed profits allows to derive a simple expression for the stock price). Crucially, the argument X of the utility function is a function of financial and social returns: for an agent who from t = 1 to t = 2 owns z shares in the firm with resource allocation a in which the manager exerts effort, the argument X(za) is the following function of t = 2 profits and CSP:

$$\begin{aligned} X(z,a) = z \left( \sqrt{1-a}+\tilde{\epsilon } + u_s \phi \sqrt{a} \right) + W(z) \end{aligned}$$
(1)

where W(z) is the non-firm wealth of the agent (as a function of z), for example the investment at the risk-free rate minus the payment to the manager for shareholders, or the fixed wage minus the effort cost for the manager. It is important to note that \(u_s\) is the weight that agents place on CSP relative to profits. This parameter essentially captures the intensity of the preference for CSP. With \(u_s=0\), agents only care about “money.” In this paper, it is critical that shareholders value both the financial performance and the social performance of the firm. The empirical evidence suggests that shareholders are “socially responsible” and value CSP, at least to some extent (e.g., Sparkes and Cowton 2004), which is captured by assuming \(u_s>0\). This is also in line with the view of Reinhardt et al. (2008) that “some shareholders may gain utility from the knowledge that their profits have been invested in socially responsible projects.” (we allow for heterogeneous preferences in that regard in a subsequent section).

The actions that contribute to CSP in this model are activities and practices that contribute to social goods at the expense of firm profits, as in Graff Zivin and Small (2005). For example, Tirole (2001) mentions that the firm could refrain from “bribing officials in less developed countries” or from “polluting when pollution taxes or permits are not yet put in place.” In Baron (2008), “the expenditures could represent redistribution through corporate philanthropy, human rights policies, or paying a living wage.” Other examples include transferring technologies and skills to a charity, or educating people in the local community. There is anecdotal evidence that some firms allocate resources for these purposes rather than exclusively to increase profits. It is important to note that, in this model, CSP represents actions that, while socially desirable, reduce firm profits. These actions have been referred to as “altruistic CSR” by Lyon and Maxwell (2008). Thus, we consider only the subset of CSR policies for which there is a nontrivial tradeoff between improving the firm’s social performance and its financial performance (actions that improve both the firm’s social and financial performance, such as “strategic CSR,” should be undertaken in any case). This paper will study this tradeoff, and in particular how it affects the stock price, resource allocation, and managerial incentives.

There are n ex-ante identical shareholders, each of whom owns the same fraction of the firm at t = 0 (it follows that shareholders have the same preferences at this stage). At t = 0, shareholders optimally design the manager’s contract, which must be such that it is accepted by the manager and that it induces effort. The contract will have consequences for the resource allocation chosen by the manager, which shareholders also take into consideration. The manager has a reservation utility of \(\bar{U}\), which means that he will not accept the contract offered by shareholders if it is associated with a level of expected utility less than \(\bar{U}\). For the problem to be nontrivial, we assume that the cost of effort C is sufficiently small for effort to be optimal, which notably necessitates \(C<\sqrt{1-a}+ u_s \phi \sqrt{a}\).

We begin the analysis by studying the outcome in the absence of agency problems, i.e., with no separation between ownership and control, by assuming for simplicity that there is only one shareholder (n = 1) who also manages the firm (this analysis and in particular Claim 1 remain valid if we consider the same problem with n shareholders—for a formal proof we refer to the “First-best with heterogeneous preferences” section in the Appendix with \(u_s^i = u_s\) for any shareholder i). Given that effort is optimal, the allocation of firm resources that maximizes the shareholder’s expected utility is the value of a that maximizes

$${\mathbb {E}} \left[ U \left( \sqrt{1-a}+\tilde{\epsilon } + u_s \phi \sqrt{a}-C \right). \right]$$
(2)

In this setting, this is equivalent to maximizing the certainty equivalent of X(za), which writes as \(\sqrt{1-a}+ u_s \phi \sqrt{a} - \frac{\rho }{2} \sigma ^2-C\).Footnote 4 The resource allocation that maximizes this criterion is the allocation in the absence of agency problems, i.e., the “first-best” allocation, denoted by \(a^{\text{FB}}\). Consequently, we have:

Claim 1

The first-best optimal resource allocation is

$$\begin{aligned} a^\mathrm{FB} = \frac{u_s^2 \phi ^2 }{1 + u_s^2 \phi ^2 } \end{aligned}$$
(3)

The proof immediately follows from the maximization of the shareholder’s certainty equivalent with respect to a (as the certainty equivalent is concave in a, the optimum is given by the first-order condition). In particular, this result implies that it is optimal to devote more resources to CSP than to profit maximization if and only if \(u_s \phi >1\). This first-best resource allocation \(a^{\text{FB}}\) will be our benchmark in the subsequent analysis.

In an economy populated by n agents, say, this first-best optimal resource allocation \(a^{\text{FB}}\) is implicitly based on the preferences of all agents in the model: without an agency problem, the equilibrium portfolio choices are such that any agent in the economy holds the same fraction of the firm (see the “Portfolio choice” section in the Appendix), which implies that it is sufficient to consider the preferences of the firm’s shareholders in an analysis of the first-best (with \(C \rightarrow 0\), which can be interpreted as the limit case in which there is no agency problem, we show in (11) below that the manager has the same equity holdings as any other shareholder). With incomplete markets and imperfect risk sharing, an analysis of the first-best would also need to consider the preferences of agents in the economy who are not shareholders of the firm.

Corporate Social Performance in Equilibrium

In this section, we analyze the agency relationship between the shareholders and the manager. We derive the optimal managerial contract designed by shareholders at t = 0, the resource allocation optimally chosen by the manager, and the stock price established on the market for the firm’s stocks at t = 1.

Stock Price

We solve the model by backward induction, starting with the portfolio choice problem at t = 1. Shareholders choose their portfolio by allocating their wealth to the firm stock and to a risk-free investment to maximize their expected utility. As shown in the Appendix (see the “Portfolio choice” section for more details), the resulting stock price p established at t = 1 as a function of the long-term equity holdings \(z_m^{\text {LT}}\) of the manager and of the resource allocation a is

$$\begin{aligned} p = \sqrt{1-a} + u_s \phi \sqrt{a} - \frac{1-z_m^{\text {LT}}}{n} \rho \sigma ^2 \end{aligned}$$
(4)

The stock price is increasing in expected profits, in the utility associated with owning shares in a socially responsible firm, and decreasing in the variance of profits.

Proposition 1

The resource allocation \(a^{\star }\) that maximizes the stock price corresponds to the first-best optimal allocation: \(a^{\star }=a^{\text{FB}}\).

The proof of this important result is in the Appendix. The resource allocation that maximizes the expected utility of shareholders, which is by definition the first-best resource allocation \(a^{\text{FB}}\), is also the one that maximizes the stock price. This is because the stock price reflects the valuation of the firm by shareholders, and this valuation is maximized by taking actions preferred by shareholders. In this model, socially responsible shareholders derive utility from CSP, and will therefore be willing to pay more to hold shares in a socially responsible firm, all else equal.

Managerial Incentives

We now derive the managerial contract, as designed optimally by the shareholders at t = 0. We consider contracts which consist of a fixed wage w, \(z^{ST}_m\) shares in the firm that the manager must sell at the t = 1 stock price, and \(z_m^{\text {LT}}\) shares that the manager must hold until t = 2. We refer to these two latter components as “short-term equity holdings” and “long-term equity holdings.” This is a major difference with respect to Baron (2008), who assumes that managerial compensation can only depend on profits and social expenditures. Thus, the contract \(\xi\) in this model can be described by \(\{w,z_m^{\text {ST}},z_m^{\text {LT}} \}\). Denoting by \(Y(\xi ,a)\) the argument in the manager’s utility function, a manager who exerts effort \((e=1)\) chooses the resource allocation a to maximize

$$\begin{aligned} \mathbb {E} \left[ U \left( Y(\xi ,a)\right) |e=1 \right] = \mathbb {E} \left[ U \left( w + z_m^{\text {ST}} p +z_m^{\text {LT}} \left( \sqrt{1-a} + \tilde{\epsilon } + u_s \phi \sqrt{a} \right) - C \right) \right] \end{aligned}$$
(5)

At the time of contracting (t = 0), there are n shareholders, each of whom owns the same fraction of the firm. Any shareholder will accordingly bear a fraction 1 / n of the cost of managerial compensation, which consists in the fixed wage w and the liquidation value \(z_m^{\text {ST}} p\) of short-term equity holdings at t = 1. In addition, each shareholder will own the same fraction \(\frac{1-z_m^{\text {LT}}}{n}\) of the firm from t = 1 to t = 2 (see (20)), given that the manager will own a fraction \(z_m^{\text {LT}}\) of the shares. Shareholders choose the managerial contract to maximize their expected utility:

$$\max _{w,z_m^{\text {ST}},z_m^{\text {LT}},a_m} {\mathbb {E}} \left[ U \left( \frac{1-z_m^{\text {LT}}}{n} \left( \sqrt{1-a_m} + \tilde{\epsilon } + u_s \phi \sqrt{a_m} \right) - \frac{1}{n} \left( w+z_m^{\text {ST}}p \right) \right) \right] ,$$
(6)

subject to the following constraints:

$$\begin{aligned} {\mathbb {E}}[U(Y(\xi ,a_m))|e=1]\ge & {} {\mathbb {E}}[U(Y(\xi ,a))|e=0] \end{aligned}$$
(7)
$$\begin{aligned} {\mathbb {E}}[U(Y(\xi ,a_m))|e=1]\ge & {} {\mathbb {E}}[U(Y(\xi ,a))|e=1] \qquad \forall a\in [0,1] \end{aligned}$$
(8)
$$\begin{aligned} {\mathbb {E}}[U(Y(\xi ,a_m))|e=1]\ge & {} \bar{U}\end{aligned}.$$
(9)

The following Proposition describes the equilibrium contract and the associated resource allocation:

Proposition 2

In equilibrium, the manager receives the following short-term and long-term equity holdings:

$$\begin{aligned} z_m^\mathrm{ST} = \max \left\{ \frac{C}{\sqrt{1-a^\mathrm{FB}} + u_s \phi \sqrt{a^\mathrm{FB}} } - \frac{1}{1+n} , 0\right\} \qquad \mathrm{and} \qquad z_m^\mathrm{LT} = \frac{1}{1+n} \end{aligned}$$
(10)

In addition, the fixed wage w is set so that the expected utility of the manager is equal to his reservation utility. For any set of parameter values, the resource allocation optimally chosen by the manager is the first-best allocation aFB.

In this setting, short-term and long-term equity holdings are perfect substitutes in terms of incentives. This is because maximizing the stock price is achieved by choosing the resource allocation preferred by shareholders, while a manager with long-term equity holdings only would choose the resource allocation that he prefers as a shareholder (given that he is also socially responsible). Since shareholders and the manager have the same social preferences u s (an assumption which is relaxed in the next section), these two allocations of resources are identical and equal to the first-best resource allocation a FB.

However, short-term and long-term equity holdings have different implications for risk sharing. At the optimum, long-term equity holdings are set at the level which ensures first-best optimal risk sharing. As the manager and shareholders are risk averse with CARA utility, it is well known that it is optimal to allocate the same fraction of the risk \(\tilde{\epsilon }\) to each agent, even absent any moral hazard problem (e.g., Demange and Laroque 2006). In particular, with \(z_m^{\text {ST}}=0\) (which is the case with \(C \rightarrow 0\)), we have \(z_m^{\text {LT}}=\frac{1}{1+n}\) according to Proposition 2. Using (20), each shareholder then owns the following fraction of the firm:

$$\begin{aligned} z = \frac{1-z_m^{\text {LT}}}{n} = \frac{1-\frac{1}{1+n}}{n} = \frac{1}{1+n} \end{aligned}$$
(11)

In this case, the manager and each shareholder hold the same fraction of the firm. By contrast, Baron (2008) assumes universal risk neutrality, so the managerial contract has no risk-sharing purpose in his model.

In addition, when the cost of effort C is sufficiently high, short-term equity holdings adjust upward as needed to provide enough effort incentives (the equilibrium level of \(z_m^{\text {ST}}\) is increasing in the cost of effort C). Given that the stock price is set prior to the realization of uncertainty, exposing the manager to the short-term stock price does not expose him to any risk. Finally, the level of pay adjusts so that the manager accepts the contract (the equilibrium fixed wage is increasing in the manager’s reservation utility \(\bar{U}\)). The fixed wage w is earned in any circumstances, so that it does not have any effect on incentives, but it adjusts so that the manager is at his reservation level of utility.

The degree of social responsibility \(u_s\) affects not only the resource allocation, but also the short-term equity holdings of the manager:

Claim 2

The short-term equity holdings of the manager \(\left( z_m^{\text{ST}}\right)\) are decreasing in the preference for CSP, as measured by \(u_s\).

This result is due to the combination of two effects. First, the direct effect of a higher social responsibility \(u_s\) is a greater intrinsic motivation of the manager to exert effort, for a given resource allocation. This in turn reduces the need for explicit incentives. Second, a higher \(u_s\) also increases the equilibrium resource allocation a, which results in a higher firm value conditional on the manager exerting effort (relative to the case with a lower \(u_s\)). It is then possible to preserve effort incentives while reducing the sensitivity of managerial compensation to firm value, because there is now a larger wedge between firm value when the manager exerts effort and firm value when he does not. Thus, the model predicts that short-term equity-based incentives will be lower when social responsibility is stronger.

The “standard” principal–agent model in which shareholders must simply induce a manager to maximize profits corresponds to the special case of \(\phi =0\):

Corollary 1

For \(\phi =0\) , the efficient resource allocation is \(a^{\text{FB}}=0\) , and the managerial equity holdings are

$$\begin{aligned} z_m^\mathrm{ST} = \max \left\{ C - \frac{1}{1+n} , 0\right\} \qquad \mathrm{and} \qquad z_m^\mathrm{LT} = \frac{1}{1+n} \end{aligned}$$
(12)

This result directly follows from Claim 1 and from Proposition 2 with \(\phi =0\).

The contract could in principle also depend on firm profits and on CSP. However, the contract described in Proposition 2 elicits the first-best resource allocation, does not give any rent to the manager, and is associated with first-best optimal risk sharing. That is, there is no need to use any measure besides the stock price for incentives provision. Thus, even if CSP were contractible, making managerial incentives directly contingent on CSP (in addition to other variables) would not improve efficiency. Furthermore, in practice, making incentives contingent upon CSP is difficult and could result in distortions, as discussed further in Sect. 4.

Alternative incentive schemes could be feasible in theory, but none would improve efficiency in a strict sense, and most would suffer from a number of flaws. For example, the manager’s effort could be monitored by a designated monitor, but Tirole (2001) notes that these schemes are rarely observed in practice. He attributes this to the fact that it would often be in the interests of the manager and the monitor to collude. By contrast, “A market has more integrity. (...) with a market (cum insider trading rules) it becomes much harder for the entrepreneur to capture the passive monitoring process.” (Tirole 2001). The manager and the shareholders could also report the manager’s actions to a court of law, as discussed in Laffont and Martimort (2001). However, this mechanism faces problems of equilibrium selection (Laffont and Martimort 2001, p.251), and it may be costly to implement due to the existence of legal costs or communication costs. It may even be infeasible in practice due to the impossibility to measure effort on a clear scale, or because of restrictions on the ability to impose punishments in case of deviation—either due to limited liability or to imperfections in the judicial system (Laffont and Martimort 2001, Chap. 9).

Valuation and Returns

The model predicts that firms with different capacities for CSP, as captured by the parameter \(\phi\) in our model, will systematically differ in terms of valuation and stock returns (where the stock return from t = 1 to t = 2 is the financial return from buying the stock at t = 1 and receiving the firm profits at t = 2). Indeed, assuming that parameters are such that the stock price and the expected stock return are positive,Footnote 5 we have:

Proposition 3

In equilibrium, firm valuation at t  = 1 is increasing in the firm’s capacity for CSP, as measured by \(\phi\) , and the expected stock return from t = 1 to t = 2 is decreasing in \(\phi\).

Note that firms with a greater capacity \(\phi\) for CSP also invest more in CSP: \(a^{\text{FB}}\) is increasing in \(\phi\) (see Claim 1), so that firms with a higher \(\phi\) are also more “socially responsible.”

The first part of Proposition 3 is less obvious than it seems: all else equal, it is clear from Eq. (4) that a higher capacity for CSP (\(\phi\)) translates into a higher stock price. However, Proposition 3 states that firm valuation is increasing in the firm’s capacity for CSP (\(\phi\)) in equilibrium, once the effect of \(\phi\) on the resource allocation a is taken into consideration. The second part of Proposition 3 is also less obvious than it seems. Indeed, even though allocating more resources to CSP decreases firm profits at t = 2, the stock price at t = 1 is determined endogenously by shareholders who already take this into account. Instead, stock returns from t = 1 to t = 2 are decreasing in the firm’s capacity for CSP (\(\phi\)) because shareholders derive utility from CSP, so that they drive the stock price over and above what they would be willing to pay for expected firm profits, hence the lower expected (future) return. In other words, socially responsible investors value a socially responsible firm more, and are willing to sacrifice monetary returns to enhance CSP.

This result has two important implications. First of all, Proposition 3 emphasizes that devoting some firm resources to CSP rather than profit maximization can increase the market value of the firm’s equity. Indeed, socially conscious shareholders are willing to invest more money into socially responsible firms per unit of expected profit generated, or equivalently the expected return on equity is lower for firms that invest more in CSP, as established in Proposition 3. This in turn gives these firms an advantage, namely a lower cost of equity capital and a higher equity value, which notably enhances their ability to raise external funding via equity issues. This advantage could compensate for the greater profitability of other firms (with a lower \(\phi\)). An important implication is that the lower profitability of firms that invest more in CSP does not necessarily threaten their survival.

Proposition 3 also shows that the relation between CSP and financial performance will be different if the latter is measured with valuation metrics or with return metrics. This result is all the more important that recent research on the relation between CSP and financial performance often does not distinguish between stock returns and firm valuation—for example, the review of Margolis et al. (2007) lumps these two types of financial performance together. Although most studies establish correlations rather than causation, the extant empirical evidence is mostly consistent with these predictions of the model. On the one hand, the event studies referenced in Kitzmueller and Shimshack (2012) find a positive stock price reaction to “positive social news,” and the studies referenced in Margolis et al. (2007) generally find that CSP and firm value are positively related. On the other hand, Margolis et al. (2007) acknowledge that this effect becomes weaker once the effect of CSP on subsequent financial performance is examined, and Brammer et al. (2006) find that CSP is negatively related to subsequent stock returns. While these findings may at first glance seem paradoxical or even contradictory, our framework rationalizes them and emphasizes the importance of distinguishing between the valuation and stock return effects of CSP. This should facilitate studies of the relation between CSP and corporate financial performance, thereby responding to Ruf et al. (2001) and van Beurden and Gössling (2008), who argue that inconsistencies across empirical studies may be due to a lack of theoretical foundations.

Finally, we argue that the results in Proposition 3 continue to hold in a model with several investment periods. In a multiperiod model in which consecutive generations of investors are socially responsible in every period, the stock price will be high not only when investors purchase the stock, but also when they sell it afterwards. It is then unclear whether the expected (monetary) return of more socially responsible firms (with a higher \(\phi\)) is still lower. To address this concern, in the “Stock returns over time” section of the Appendix, we sketch a simple model with several periods and overlapping generations of socially responsible investors, and we show that Proposition 3 still holds. The intuition is that, for markets to clear at the beginning of every period, the stock of a more socially responsible firm must have a lower expected (monetary) return, otherwise it would be in excess demand. This principle continues to hold if the stock has a resale value which is increasing in \(\phi\).

Heterogeneous Preferences

We now extend the model by considering heterogeneous preferences for CSP across the population. Andreoni and Miller (2002) find that a quarter of subjects in their experiments are “selfish money-maximizers,” while others display varying degrees of altruism. Thus, social preferences are not uniform in the population. In this section, we study the consequences of this heterogeneity for contracting, the matching between managers and firms, the stock market equilibrium, and the efficient resource allocation, which notably enables us to establish the robustness of the main results of the previous section.

The preference for CSP of shareholder i (respectively manager j) is denoted by \(u_s^i\) (resp. \(u_s^j\))—instead of \(u_s\) for every agent in the baseline model. We assume that the population of potential managers is sufficiently diverse that shareholders can hire a manager of any type, i.e., they can choose the level of \(u_s^j\) (in de Bettignies and Robinson 2013, the firm can also choose the level of social responsibility of its manager). To keep the model tractable and simple, we assume that managers only differ on this dimension. Thus, this extension of the model enables us to address a new issue, namely to determine the type of manager that each type of firm will attract.

We consider the case where the shareholder structure does not change over time, by assuming that the shareholder structure—considering the set of shares held by shareholders—at t = 0 is the same as at t = 1 (we would otherwise need to consider the implications of shareholders having different investment horizons, which would considerably complexify the model and is not the focus of the paper).Footnote 6 Note that due to optimal risk sharing across risk-averse shareholders, a firm will typically be owned by several shareholders. At t = 0, the shareholders design the compensation contract to maximize their collective surplus. Specifically, we use the criterion proposed in Grossman and Hart (1979), who show that when side-payments between shareholders are allowed, “the firm’s objective function is a weighted sum of shareholders’ private valuations of the firm’s future production stream.” This assumption is natural if a compensation contract is to satisfy the Pareto criterion. It also has the advantage of approximating each shareholder group’s bargaining power.

At t = 1, the stock price \(\bar{p}\) as a function of the resource allocation a, of the contract \(\xi \equiv \left\{ \bar{w},\bar{z}_m^{\text {ST}},\bar{z}_m^{\text {LT}} \right\}\), and of the set \(\{ u_s^i \}\) of shareholder preferences (cf. the “Portfolio choice with heterogeneous preferences” section in the Appendix) is

$$\begin{aligned} \bar{p} = \sqrt{1-a} + \frac{\sum _{i=1}^n u_s^i}{n} \phi \sqrt{a} - \frac{1-\bar{z}_m^{\text {LT}}}{n} \rho \sigma ^2 \end{aligned}$$
(13)

The stock price aggregates the (heterogeneous) preferences of shareholders across the profits and CSP dimensions. As in the baseline model, the resource allocation \(\bar{a}^{\star }\) that maximizes the stock price is the same as the first-best allocation: \(\bar{a}^{\star }=\bar{a}^{\text{FB}}\) (technical details are in the “First-best with heterogeneous preferences” section in the Appendix). In what follows, we consider two cases.

First, suppose that the firm hires a manager j such that \(u_s^j = \frac{1}{n}\sum _{i=1}^n u_s^i\), i.e., the preference for CSP of the manager is the same as the average shareholder preference for CSP. Then the optimal contract is similar to the one described in Proposition 2—the only difference is that the preference for CSP \(u_s\) is “replaced” by the average shareholder preference for CSP, \(\frac{1}{n}\sum _{i=1}^n u_s^i\):

Proposition 4

A manager with same preference for CSP as the average shareholder receives the following short-term and long-term equity holdings:

$$\begin{aligned} \bar{z}_m^{\text {ST}} = \max \left\{ \frac{C}{\sqrt{1-\bar{a}^\mathrm{FB}} + \frac{\sum _{i=1}^n u_s^i}{n} \phi \sqrt{\bar{a}^\mathrm{FB}}} - \frac{1}{1+n} , 0\right\} \qquad \mathrm{and} \qquad \bar{z}_m^\mathrm{LT} = \frac{1}{1+n} \end{aligned}$$
(14)

In addition, the fixed wage \(\bar{w}\) is set so that the expected utility of the manager is equal to his reservation utility. For any set of parameter values, the resource allocation optimally chosen by the manager is the first-best allocation \(\bar{a}^{\text{FB}}\).

In this case, as in the previous section, risk sharing is socially optimal, and the manager is at his reservation level of utility (i.e., he does not derive any rent). That is, hiring a manager with the same preference for CSP as the average shareholder allows to obtain the first-best optimal resource allocation, to achieve socially optimal risk sharing, and to maintain the manager at his reservation level of utility (cf. Proposition 4). On the contrary, if the firm does not hire a manager j whose preference for CSP matches the average shareholder’s, then inefficiencies are unavoidable. This leads us to this new result:

Proposition 5

It is optimal for a firm to hire a manager with the same preference for CSP \(u_s^j\) as the average shareholder’s: \(u_s^j=\frac{1}{n}\sum _{i=1}^n u_s^i\).

Thus, the model predicts assortative matching along the preference for CSP dimension between shareholders and managers: shareholders will if possible hire managers whose preference for CSP corresponds to their own average preference. For example, firms in which shareholders only care about financial returns will hire a purely self-interested manager, whereas firms in which shareholders are extremely (respectively moderately) socially responsible will hire an extremely (resp. moderately) socially responsible manager. This matching may be facilitated by the fact that job seekers prefer to work for organizations with “values similar to their own,” as indicated by the survey-based evidence reported in Kitzmueller and Shimshack (2012).

This result differs from previous justifications for matching intrinsically motivated agents with similar firms. In Brekke and Nyborg (2008), firms practice CSR to screen intrinsically motivated agents, which then allows them to offer less explicit incentives. In Besley and Ghatak (2005), extrinsic incentives and intrinsic motivation are substitutes, so that matching socially conscious agents and principals allows to reduce bonus payments. In our model, if the firm hires a manager whose social preferences differ from those of shareholders, then either there is no risk sharing between the manager and the shareholders (if \(\bar{z}_m^{\text {LT}}=0\)), or the manager chooses an allocation of firm resources that differs from the first-best allocation.

The determination of the shareholder structure is not the focus of this paper, but the model nevertheless gives interesting insights. Indeed, the ownership of a given shareholder is increasing in the difference between the preference for CSP of this shareholder and the preference for CSP of the average shareholder, and it is increasing in the CSP of the firm if this difference is positive (cf. equation (68)). In addition, we established that a firm whose shareholders have a strong preference for CSP will invest relatively more in CSP, in the sense that \(\bar{a}^{\text{FB}}\) is higher. That is, the ownership structure of a firm not only affects but is also affected by the firm’s CSR policy.

Discussion and Conclusion

The main message of the paper is that, in the presence of socially responsible investors, the market mechanism and equity-based compensation are not necessarily inconsistent with social objectives, and can even be instrumental in driving the firm toward these objectives. This result, however, relies on some crucial assumptions. In particular, it is important that shareholders are socially responsible, in the sense that they derive additional utility from owning shares in a socially responsible firm. This in turn implies that they are willing to pay more to own shares in a socially responsible firm, ceteris paribus. Should shareholders not be socially responsible, or should their preferences on this dimension not translate into a higher willingness to pay, equity-based compensation would then not fully play its role. More generally, the level of the parameter \(u_s\), which represents the sensitivity of shareholders’ certainty equivalent utility to CSP, must be sufficiently high for the stock price to adequately reflect CSP. Existing empirical studies suggest that \(u_s\) is positive, but it is still unclear to what extent shareholders are willing to sacrifice financial returns for social returns.

In that regard, it is important to underline that we defined CSP as the extent to which the firm sacrifices profits in the social interest. This perspective on CSP is in line with the definition of CSR in McWilliams (2000): “Actions of firms that contribute to social welfare, beyond what is required for profit maximization, are classified as Corporate Social Responsibility (CSR).” Likewise, McWilliams and Siegel (2001) characterize corporate social responsibility as “actions that appear to further some social good, beyond the interests of the firm and that which is required by law.” Our approach could arguably be categorized as an “integrative theory” of CSR according to the typology of Garriga and Melé (2004), although in this case the corporation integrates the (social) preferences of its shareholders.

The analysis developed in this paper challenges the view that a firm can either benefit shareholders by maximizing profits or deliberately not act in the best interests of its shareholders. For example, according to Kitzmueller and Shimshack (2012) “Within this narrow neoclassical firm paradigm, CSR expenditures could only be a manifestation of moral hazard towards shareholders.” Likewise, Margolis et al. (2007) assert that the firm can either maximize shareholder wealth or pursue social objectives. By contrast, with socially responsible investors, we have shown that the market value of a firm is (up to a point) increasing in the amount of resources invested to improve its CSP rather than its profits. This is because the market value of a company measures not only its ability to generate cash flows (i.e., its financial performance) but also its social performance, so that the firm can benefit shareholders by jointly maximizing profits and CSP. This distinction is important, especially regarding the legal basis for allocating firm resources to CSP, which is at best unclear in the US and UK according to Reinhardt et al. (2008), where directors and managers have a fiduciary duty to shareholders (in other countries such as France, management and directors must also take into consideration the “social interest,” as specified for example in Rapport Viénot (1995)). In addition, with socially responsible investors, we have shown in Claim 1 that a firm managed in the interests of its shareholders will invest some resources to improve its CSP rather than its profits. This establishes that devoting resources to CSP can still be consistent with the fiduciary duty of managers and directors if shareholders are socially responsible (see Reinhardt et al. (2008) for a discussion of the legality of CSR in different countries).

The main result that equity holdings give the manager incentives to jointly maximize firm profits and CSP according to shareholders’ preferences is especially important in light of the fact that providing specific CSP incentives is notoriously difficult. This is both because of measurement and aggregation problems. While it is straightforward to measure firm profits, it is often harder to measure firm CSP, even on only one dimension. Firm spending on socially responsible activities could be measured in some instances, but measuring inputs rather than outputs may not fully solve the moral hazard problem. In addition, while aggregating monetary sums is simple, it is more arduous to aggregate measures of CSP across different dimensions. As is already well known from the agency literature, providing incentives based on an imperfect measure of performance may have undesirable side effects, notably because it might encourage agents to game the system or to manipulate this performance measure. For example, rewarding time spent at the office rather than achievements could encourage a manager to spend a lot of unproductive time at the office; likewise, rewarding the money spent on CSP rather than actual CSP could encourage the manager to simply give money to badly managed charities rather than carefully study the opportunities available to the firm.

The purpose of this paper was to study the provision of managerial incentives when a firm can invest to enhance its financial performance (profits) and its social performance (CSP), and when shareholders value both aspects of the firm’s performance. The analysis relied on a number of critical assumptions, which we now discuss. First, there is a separation of ownership and control: aligning interests between the owner(s) and the manager of a firm would be irrelevant if the owner of the firm were also its manager. Second, to have a meaningful agency problem, the manager must exert “costly effort,” as already discussed in Sect. 1 of the paper. Third, the manager must decide the allocation of firm resources toward the improvement of its financial or social performance, and this resource allocation is observable but uncontractible. The uncontractability of actions reflects their complex nature, and the ability of agents to game attempts at measuring and rewarding actions or inputs rather than outcomes or outputs, as already discussed in the previous paragraph. That actions are uncontractible is thus a common assumption in the moral hazard literature. That either the resource allocation or informative signals about the firm’s profits and CSP (cf. the “Unobservable actions” section in the Appendix) are observable is crucial: if neither were observable, then the firm’s CSP would not be reflected in the stock price. In practice, it seems likely that investors receive some information about the firm’s CSP and profits, i.e., they receive signals about these two dimensions of firm performance. Fourth, the assumption that shareholders are risk averse is realistic and important for portfolio choice and the resulting stock price. (Risk-neutral shareholders would buy or sell infinite numbers of shares as long as the stock price differs from their valuation of the stock, which would be a problem when shareholders have heterogeneous preferences for CSP.) This said, the level of shareholder risk aversion is not restricted in the model, and it could be arbitrarily low. Fifth, there must be a market for firm shares on which the stock price is established, i.e., the firm must be publicly listed. The use of a stock market-based mechanism for incentive provision is obviously impossible if firm shares are not traded and valued on a stock market. Sixth, shareholders must value the social performance of the firm. This assumption relies on some aforementioned empirical evidence. As already noted, this is crucial for the stock price to reflect firm profits and firm CSP, which is in turn necessary for equity-based incentives to encourage the manager to improve both the financial and the social performance of the firm.

In addition to the crucial assumptions enumerated in the preceding paragraph, the analysis in this paper also relies on a number of simplifying assumptions made for tractability. In this paragraph, we discuss the results’ robustness to different assumptions. First, the effort by the firm’s manager is binary. This allows us to consider only one incentive constraint in the analysis, and to focus on the allocation of firm resources to profit maximization and CSP rather than on the level of managerial “effort.” This also means that the level of effort to be induced by the manager is fixed (at e = 1), i.e., it does not depend on other parameters. As argued by Edmans et al. (2009), if the benefits of effort are multiplicative in firm size, then the highest level of effort will be optimal for sufficiently large firms (this is the “maximum effort principle”), i.e., the level of effort to be implemented is fixed, as is the case in our model. In future research, it would be interesting to study the optimal level of managerial effort that the optimal contract should induce, and in particular the interaction between the preference for CSP and the optimal level of effort—this would be especially relevant for smaller firms. Second, by considering normally distributed firm profits, we do not study the impact of the asymmetry of the distribution or the fatness of its tails on the optimal contract. In our model, the shape of the distribution of firm profits can potentially affect the level of the stock price (for example, prudent investors would be willing to pay more to hold a stock with a positive skewness), and the optimal risk-sharing rule (i.e., the allocation of long-term equity holdings across agents). A change in the level of the stock price would affect the manager’s fixed wage, which does not play a crucial role in the analysis. Moreover, with CARA utility, the optimal risk-sharing rule is for each agent to bear the same fraction of the risk, independently of the shape of its distribution (Gollier 2001, p. 58). The main results of the paper would thus be qualitatively robust to alternative distributional assumptions. Third, to derive the stock price, we assumed a simplified portfolio choice problem in which investors allocate their wealth to the firm stock and a risk-free asset. As long as shareholders have monetary and social preferences, the stock price would still reflect firm profits and firm CSP even if the set of assets were expanded, so this assumption is not crucial. Fourth, by postulating CARA utility, we take into account the risk aversion of the manager and investors, but we do not let their level of absolute risk aversion be decreasing in wealth (“DARA”). With DARA, the optimal risk-sharing rule would depend on the level of wealth of the different agents, with more wealthy agents bearing a higher fraction of firm risk. For example, a manager who is more wealthy than investors would hold more long-term equity holdings than a single investor for risk-sharing purposes. In turn, higher managerial long-term equity holdings could decrease the managerial short-term equity holdings necessary for incentive purposes, because short-term and long-term equity holdings are substitutes in incentive provision, as in Proposition 2. In sum, the optimal contract described in Proposition 2 would keep similar features, but the relative importance of short-term and long-term managerial equity holdings would depend on the distribution of wealth across agents in the economy. Fifth, to focus on the heterogeneity of preferences for CSP across agents, we assumed for simplicity that all agents in the model have the same degree of risk aversion. If agents were also heterogeneous in terms of risk aversion, the optimal risk-sharing rule would be altered, with less risk-averse agents bearing a higher fraction of firm risk (as discussed above). The analysis could thus be extended in several directions.

Finally, this paper should be viewed as a starting point for further research in this emerging area. A modified version of this model could be used to address a number of important related questions. For example, it would be interesting to study the dynamics of a firm’s shareholder structure and its interaction with contracting. Indeed, a more socially responsible firm would likely attract more socially responsible shareholders, who would in turn elicit a higher level of CSP (the diversification motive would admittedly mitigate the tendency of shareholders with different preferences for CSP to hold different portfolios). Note that the notion that shareholders adjust their financial investments according to their preference for CSP is consistent with the existence of indices and mutual funds that apply CSR screenings, thus allowing investors to invest in CSP-strong companies (investors with no preference for CSP can on the contrary invest in mutual funds that hold “sin stocks”). A careful analysis of these mechanisms could presumably contribute to explain the diversity of CSR strategies across firms, the diversity of their ownership structures, and the diversity of their managers.