1 Introduction

The analysis of consumption decisions in multi-person households dates back to the pioneering work of (Becker 1973, 1974, 1981), which introduced the insight that individuals consume, not households. He was the first to show how marriage markets influence individual consumption within a marriage. He also proposed the Rotten Kid theorem, where an altruistic dictator made all the decisions on behalf of the household members, to explain how multi-person households could end-up behaving in a unitary way. These seminal papers lead to a burgeoning literature on individual consumption in marriage. For example, Grossbard (1976); Grossbard-Shechtman (1984) considered each member of the household as separate decision-makers whose interactions did not necessarily lead to an equilibrium allocation. Others assumed spouses allocated resources through a cooperative or non-cooperative bargaining game (e.g., Manser and Brown (1980); Lundberg and Pollak (1993).

More recently, the collective household model, which assumes that decision-makers achieve efficient outcomes, has become one of the main paradigms through which much of the research on consumption by multi-person households is conducted. One reason that explains its widespread use is that the model yields testable restrictions even though it rests upon a very small set of assumptions (Browning and Chiappori 1998; Chiappori and Ekeland 2006).

One particular set of falsifiable restrictions proposed in the literature to test efficiency focuses on the effects of so-called distribution factors.Footnote 1 There are three different ways distribution factors can be used to test the efficiency hypothesis. First, the proportionality condition states that the ratio of the marginal effect of two distribution factors must be equal across demand equations (Bourguignon et al. 1993; Browning et al. 1994; Bourguignon et al. 2009). Second, the z-conditional demand condition requires the effects of the remaining distribution factors to vanish once the demand equations are conditioned on the demand for some other good and upon substituting out one of the distribution factors (Bourguignon et al. 2009). Finally, the rank condition posits that the impact of the distribution factors must be at most of size one (Chiappori and Ekeland 2006).

The collective model is hardly ever rejected when using tests based on distribution factors, assuming given price levels. Some have thus been led to question the restrictive nature of the constraints imposed by the efficiency hypothesis. Others have raised concern over the statistical validity or power of the tests they imply. For instance, the proportionality condition implies a nonlinear restriction across equations which is generally tested by means of a Wald test (Bourguignon et al. 1993; Quisumbing and Maluccio 2003; Bobonis 2009). Yet, Wald tests are not invariant to algebraically equivalent nonlinear parameterization of the null hypothesis (Dagenais and Dufour 1991). Tests based on z-conditional demands are potentially more powerful because they boil down to testing single equation exclusion restrictions. Nevertheless, z-conditional demand equations include endogenous right-hand side variables and are typically estimated using an instrumental variables approach. The omitted distribution factors are natural instruments, but can prove to be weak. Finally, it is well-known that rank condition tests may suffer from poor statistical power in small samples (Camba-Mendeza and Kapetanios 2009).

Irrespective of the above issues, under-rejection of the efficiency hypothesis may arise for a more practical matter. In their seminal theoretical contribution, Bourguignon et al. (2009) (henceforth BBC2009) assume that at least one distribution factor (locally) affects each demand equation.Footnote 2 This assumption is however hardly ever satisfied empirically. To the best of our knowledge, among all the papers using either the proportionality or the z-conditional demands restriction to test the collective model, only two (Chiappori et al. 2002; Attanasio and Lechene 2014) satisfy the BBC2009 assumption. Thus, the vast majority neglects it. Yet, the standard testing approach used in the literature to investigate the efficiency hypothesis based on these restrictions requires the latter assumption to hold. Such inconsistency between the statistical procedures and their underlying assumptions, we argue, is a plausible candidate to explain under-rejection of the collective model.

However, the assumption of BBC2009 is perhaps too restrictive since distribution factors, by definition, need only affect two (or more) demand functions.Footnote 3 This is the starting point of our paper. We propose a falsifiable restriction of the efficiency hypothesis, which extends BBC2009’s approach insofar as it does not require a distribution factor to (locally) affect each demand equation. The basic intuition is that, even under collective rationality, it is possible that the demands for a subset of goods (e.g., heating, electricity, lodging) may not be affected by the relative bargaining power of the household members, at least locally. In the case of two-member households, expenditures on these goods will then be independent from all distribution factors. On the other hand, the demands for those goods that are influenced by the spouses’ bargaining power must depend on all distribution factors. This provides an alternative all or nothing testable restriction. We derive a set of testable conditions that takes this restriction into account and fully characterize collective rationality, absent price variations. In our approach, the proportionality condition, as well as the z-conditional demand condition, apply only to goods that depend on all the distribution factors.

As with z-conditional demands, the new falsifiable restriction boils down to testing an exclusion restriction in each single equation. Because it rests upon unconditional demand functions, endogeneity of right-hand side variables is not an issue for testing this constraint. Furthermore, contrary to the proportionality or the z-conditional demands restrictions, it does not require any of the distribution factors to be continuous. This is a great advantage since the most convincing distribution factors are random policy treatments. Finally, we show how our approach can be extended to households comprising potentially more than two decision makers.

To illustrate both our test procedure and how under-rejection of efficiency may arise, we use a field survey conducted by one of the co-authors of this paper to collect information on the decision process in very poor households from rural Burkina Faso. The social and customary environments in which these households evolve are likely to impede enforcement of efficient marriage contracts as these are deeply rooted in traditions that dictate expected behavior from both spouses. We investigate the efficiency of outcomes both in monogamous and bigamous households.Footnote 4

The empirical analysis is based upon the widely used and flexible QUAIDS demand system. For both monogamous and bigamous households our data clearly reject the efficiency assumption using our test procedure. We also compute a test of Chiappori and Ekeland (2006)’s rank condition. This latter is asymptotically equivalent to our (simpler) test procedure. Concern with sample size leads us to bootstrap the rank condition test using a recent procedure proposed by Portier and Delyon (2014). Results from this test are consistent with our own test as it rejects efficiency for both monogamous and bigamous households. We next test the proportionality condition despite the fact that the assumption of BBC2009 is not satisfied in our data, as is customarily done in the literature. Our results show that collective rationality is then (falsely) not rejected for monogamous households. We also show that the z-conditional demands approach cannot be implemented to test the efficiency of our Bukinabé households by lack of strong instruments, a problem we suspect is more common than what is usually reported.

Since the all or nothing restriction boils down to testing an exclusion restriction in each unconditional demand function, it is straightforward to review the empirical literature supporting collective rationality based on distribution factors.Footnote 5 Of the ten papers we reviewed, only four are consistent with collective rationality based on our test. Much of the evidence in favour of collective rationality is therefore open to criticism.

The rest of the paper is organized as follows. The next section presents our generalization of BBC2009’s collective model to households comprising respectively two and potentially more decision-makers. We discuss our new procedure to test the model in each of these cases. We also illustrate how the standard approach to test BBC2009’s model may yield misleading inference. Section 3 describes the socio-economic specificities of monogamous and polygamous households in Burkina Faso and discusses the choice of distribution factors. We also present the design of our survey and the main samples characteristics. Section 4 presents our estimation results and provides various tests of our generalized model for both monogamous and polygamous households. Section 5 concludes.

1.1 Theoretical restrictions of the collective model based on distribution factors

Our theoretical approach is based on the collective model in the absence of price variation as in BBC2009. We generalize their approach in two directions. First, we relax one of their crucial assumptions, namely that at least one distribution factor (locally) influences all the demand functions. Second, given that part of our empirical analysis focuses on bigamous households, we allow households to comprise more than two adult members.Footnote 6

Consider a household with I + 1 members. Each member i draws his/her well-being from the consumption of N market commodities, which we represent by the vector x. Each commodity may be consumed privately or publicly by household members. All prices are normalized to 1 so that the household budget corresponds to ιx = m, where ι is a unit vector of dimension N and m is the level of exogenous household expenditures.Footnote 7

Each member i has his own preferences U i(x) over (private and public) goods consumed in the household. No restrictions are imposed on the nature of the preferences. They can be egotistic or altruistic and may involve externalities or other types of preference interactions.Footnote 8 We assume that U i(x) is strongly concave, twice differentiable in x and increasing in each of its arguments.

Under rationality, the outcomes of the household decision process are assumed to be Pareto-efficient. This means that the household chooses a vector x such that no other feasible vector could make all members at least as well off and at least one member strictly better off. The collective model also allows the possibility for exogenous variables, called distribution factors, to influence the household’s decisions. These variables are denoted by the vector z of dimension K.

The influence of these factors can be understood within a bargaining framework where each member has an outside option. The poorer his/her outside options, the more he/she will be willing to compromise and thus the lower will be his/her bargaining power. As a result, the less the consumption decisions will correspond to his/her preferences.

Outside options can vary across individuals and cultures. For example, members could behave non-cooperatively in case of minor disagreements (Lundberg and Pollak 1993; Chen and Woolley 2001) and eventually separate in case of major disagreements (Manser and Brown 1980; McElroy 1990). In the latter case, the state of the marriage market as proxied by the sex ratio (Chiappori et al. 2002), the nature of divorce laws (Gray 1998; Chiappori et al. 2002) and the relative contribution of the spouse to the household income (Browning and Chiappori 1998; Dauphin et al. 2011) have been considered as distribution factors in the literature. In the context of developing countries, Haddad and Kanbur (1992) stress the possibility for women to return to their native families in case of disagreement, and discrimination against women in the market place as potential distribution factors.

The setting of collective rationality is equivalent to stating that there exists a vector μ(m, z) of I non-negative Pareto weights such that x is the solution to the following program:

$$\begin{array}{l}\\ {\rm{Max}}\,{\boldsymbol{\mu }}\left( {m,{\bf{z}}} \right)\prime \left[ {{U^1}\left( {\bf{x}} \right), \ldots ,{U^I}\left( {\bf{x}} \right)} \right] + {U^{I + 1}}\left( {\bf{x}} \right)\\ {\rm{subject}}\,{\rm{to}}\,\iota \prime {\bf{x}} = m.\\ \end{array}$$

Thus the household pseudo-utility function to be maximized is a weighted sum of the individual utility functions. The Pareto weight associated with the preferences of member i (for i ≠ I + 1) can be interpreted as the importance attached to these, relative to those of the (I + 1)th member, in the household decision process. If the Pareto weight of a given member is equal to zero, the household does not take into account that member’s preferences in the decision process, other than via the possible caring preferences of the other members. The I Pareto weights can therefore be viewed as the distribution of decision power within the household and the number of decision-makers as the number of strictly positive Pareto weights plus one.

The Pareto weights might be functions of distribution factors and of household expenditures, in which case they are assumed to be twice continuously differentiable in (m, z). It should be noted that some weights may be (locally) constant while others may respond to distribution factors. Furthermore, the non-constant weights may not all (locally) respond to the same distribution factors. When all the weights are constants, the household is said to behave rationally in a unitary way because the objective function can be interpreted as representing a unique utility function. When some of the weights are non-constants, the household is said to behave rationally in a collective way because the objective function cannot be interpreted as representing a unique set of preferences.

The demand system obtained from solving the above program for x can be written as: \({\bf{x}} = {\hat {\bf x}}\left( {m,{\boldsymbol{\mu }}\left( {m,{\bf{z}}} \right)} \right)\), with \(\iota \prime {\hat {\rm x}}\left( {m,{\boldsymbol{\mu }}\left( {m,{\bf{z}}} \right)} \right) = m\) from the adding-up restriction. This shows that the distribution factors influence household consumption choices only through the non-constant Pareto weights. This follows from the fact that the distribution factors do not affect the Paretian frontier of the household consumption possibilities, but only the household’s location on it. Clearly, since the Pareto weights are unobservable so is the structural demand system. Yet, it is still possible to test whether the reduced form of the latter, x(m, z), satisfies:

$${\bf{x}}\left( {m,{\bf{z}}} \right) \equiv {\hat {\bf x}}\left( {m,{\boldsymbol{\mu }}\left( {m,{\bf{z}}} \right)} \right).$$
(1)

Proposition 2 of BBC2009 assumes that at least one distribution factor (locally) affects each demand function. Yet, distribution factors need not locally influence more than two of the latter to yield falsifiable restrictions. We thus start by relaxing this assumption and derive the appropriate test procedure. Next, we generalize the test to the case where a household comprises more than two potential decision-makers.

1.2 Generalization of proposition 2 of BBC2009 - two decison-makers

Consider a partition \({\bf{x}} \equiv \left[ {{{\bf x \prime}}_J ,{\bf{x}}_{ - J \prime} } \right]\prime\) of the demand system and a partition \({\bf{z}} \equiv \left[ {{{\bf z \prime}}_J ,{\bf{z}}_{ - J \prime} } \right]\prime\) of the set of distribution factors, with x J and z J having the same dimension J. Given such a partition, (1) can be written as:Footnote 9

$${{\bf{x}}_J} = {{\bf{x}}_J}\left( {{{\bf{z}}_J},{{\bf{z}}_{ - J}}} \right) \equiv {{\hat {\bf x}}_J}\left( {{\boldsymbol{\mu }}\left( {{{\bf{z}}_J},{{\bf{z}}_{ - J}}} \right)} \right),$$
(2)
$${{\bf{x}}_{ - J}} = {{\bf{x}}_{ - J}}\left( {{{\bf{z}}_J},{{\bf{z}}_{ - J}}} \right) \equiv {{\hat {\bf x}}_{ - J}}\left( {{\boldsymbol{\mu }}\left( {{{\bf{z}}_J},{{\bf{z}}_{ - J}}} \right)} \right).$$
(3)

If the sub-system of reduced-form demand functions in (2) has continuous first partial derivatives and is such that \({D_{{{\bf{z}}_J}}}{{\bf{x}}_J}\left( {{{\bf{z}}_J},{{\bf{z}}_{ - J}}} \right)\) is non-singular at a point P = (z j ,z j ), then we can use the Implicit Function Theorem to invert x j and z j in some open neighborhood of P to get the following local inverse function:

$${{\bf{z}}_J} = {{\bf{z}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right),$$

which has continuous first partial derivatives. Upon substituting the latter into (2) and (3) we get:

$${{\bf{x}}_J} = {{\bar {\bf x}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) \equiv {{\bf{x}}_J}\left( {{{\bf{z}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right),{{\bf{z}}_{ - J}}} \right) \equiv {{\hat {\bf x}}_J}\left( {{\bf{\mu }}\left( {{{\bf{z}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right),{{\bf{z}}_{ - J}}} \right)} \right),$$
(4)
$${{\bf{x}}_{ - J}} = {{\bar {\bf x}}_{ - J}}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) \equiv {{\bf{x}}_{ - J}}\left( {{{\bf{z}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right),{{\bf{z}}_{ - J}}} \right) \equiv {{\hat {\bf x}}_{ - J}}\left( {{\boldsymbol{\mu }}\left( {{{\bf{z}}_J}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right),{{\bf{z}}_{ - J}}} \right)} \right).$$
(5)

The (local) sub-system of demands x −J in (5) is written as a function of the sub-system of (local) demands x J and of the distribution factors z −J .These are the so-called z-conditional demands proposed by BBC2009. The sub-system x −J is said to be z J -conditional, since it is conditional on the inversion of z J . We further assume that μ(z) and \({\hat {\bf x}}\left( {{\boldsymbol{\mu }}\left( {\bf{z}} \right)} \right)\) are differentiable at the point P. The first generalization is as follows:

Proposition 1

A system of N ≥ 2 demand functions of a household with I + 1 = 2 members is locally compatible with rationality if, and only if, distribution factors, z, either do not influence the demand system (unitary rationality):

$${D_{\bf{z}}}{x_n}\left( {\bf{z}} \right) = 0\quad \forall n = 1, \ldots ,N,$$
(7a)

or influence it in the following way when K ≥ 2 (collective rationality):Footnote 10

$${D_{\bf{z}}}{x_n}\left( {\bf{z}} \right) = 0\,or\,{D_{\bf{z}}}{x_n}({\bf{z}}) \ne \ne 0\quad \forall n = 1, \ldots ,N.$$
(7b)

Moreover, the demands for which \({D_{\bf{z}}}{x_n}\left( {\bf{z}} \right) \ne \, \ne {\bf{0}},\) denoted \(x_m^*\left( {\bf{z}} \right)\) also satisfy:

$$\frac{{\partial x_m^{\rm{*}}\left( {\bf{z}} \right)/\partial {z_1}}}{{\partial x _m^{\rm{*}}\left( {\bf{z}} \right)/\partial {z_k}}} = \frac{{\partial x_1^{\rm{*}}\left( {\bf{z}} \right)/\partial {z_1}}}{{\partial x_1^{\rm{*}}\left( {\bf{z}} \right)/\partial {z_k}}} \ne 0\quad \forall k = 2, \ldots ,K,m = 2, \ldots ,M$$
(7c)

and, equivalently,

$${D_{{{\bf{z}}_{ - 1}}}}\bar x_m^{\rm{*}}\left( {x_1^{\rm{*}},{{\bf{z}}_{ - 1}}} \right) = 0\quad \forall m = 2, \ldots ,M,$$
(7d)

where 2 ≤ M ≤ N.

The proof is available in a web Appendix.Footnote 11 The proposition states that the demand system of a two-person household is compatible with rationality if and only if it either complies with unitary rationality, (7a), or with collective rationality, (7b), (7c), and, equivalently to the latter, (7d). According to Restriction (7a), a demand system is compatible with unitary rationality if and only if none of its demand functions is influenced by distribution factors. Our new all or nothing Restriction (7b) stresses that a demand system that responds to at least two distribution factors is compatible with collective rationality if each of its demands either does not respond to any of the distribution factors or responds to all of the distribution factors. Equation (7c) further restricts the manner in which the distribution factors impact the demand functions that respond to all of the distribution factors (denoted \(x_m^{\rm{*}}\)): the ratio of the marginal effects of any two distribution factors must be equal across the latter. Finally, Restriction (7d) is equivalent to (7c). It states that the demand functions \(x_m^{\rm{*}}\) are compatible with collective rationality only if they no longer respond to the distribution factors once they are conditioned on any of them, i.e., are transformed into their z 1 -conditional form. Many empirical applications investigate the efficiency hypothesis using (7c). In most cases, the tests ignore Restriction (7b). Yet, the two go hand-in-hand. We will argue later on that focusing on (7c) while ignoring (7b) partly explains why the efficiency assumption is hardly ever rejected.

The intuition behind Proposition 1 is the following. In a rational household composed of two members there is only one Pareto weight. The distribution factors, if they exist, can only exert their effect on consumption choices through this weight. If unitary rationality holds, then the Pareto weight is constant. Each of its demand, x n , must therefore satisfy (7a). On the other hand, if collective rationality holds and distribution factors do exist, then the single Pareto weight must respond to all of them. If a given demand function, x n , say, does not locally respond to the Pareto weight, it will respond to none of the distribution factors (D z x n (z) = 0). Conversely, if x n does respond to the Pareto weight, it will be sensitive to all the distribution factors (D z x n (z)≠≠0. Each demand x n stemming from a collectively rational household must therefore satisfy the all or nothing Restriction (7b). Furthermore, since distribution factors exert their effects through a single weight, the demand functions that respond to it, \(x_m^{\rm{*}}\), must be such that the ratio of the marginal effect of any two distribution factors is equal to the ratio of the marginal effect of the two distribution factors on the weight, and this ratio must be different from zero. Therefore, the ratio of the marginal effect of any two distribution factors is equal across \(x_m^{\rm{*}}\) demands as stated by (7c). Finally, conditioning a given demand function, \(x_m^{\rm{*}}\), by another, \(x_1^{\rm{*}}\) say, is equivalent to maintaining \(x_1^{\rm{*}}\) constant. In order to maintain \(x_1^{\rm{*}}\) constant, z 1 must compensate for the variations in z −1 in such a way that the variations in the weight cancel out. Restriction (7d) must thus hold for this z 1 -conditional demand.

An important corollary to the all or nothing restriction is that a system in which some demands respond to a subset of distribution factors while other demands respond to another subset of distribution factors is not compatible with collective rationality when I + 1 = 2. The all or nothing restriction is absent from Proposition 2 of BBC2009 because it is assumed at the outset that one of the distribution factors (locally) affects all the demand functions. Since there is a single Pareto weight, this amounts to assuming that all the demand functions respond to the weight and therefore that they all respond to all the distribution factors affecting the weight. This is equivalent to assuming that all the demand functions satisfy (7b), and more precisely its second part, that is D z x n (z)≠≠0. Hence, in the BBC2009 framework our Proposition 1 boils down to Restriction (7c) and, equivalently to the latter, (7d).

Contrary to the proportionality and the z-conditional demands restrictions, the all or nothing condition does not in fact require any of the distribution factors to be continuous. This is a great advantage since the most convincing distribution factors are random policy treatments. The all or nothing and the z-conditional demands restrictions boils down to testing an exclusion restriction in each single equation. This type of restriction is likely to be more powerful than the proportionality restriction since we can use single-equation estimation methods and since single-exclusion tests are more robust than tests of equality of parameters across equations. Furthermore, because the all or nothing restriction rests upon unconditional demand functions, endogeneity of right-hand side variables is not an issue for testing this constraint. It thus appears that restriction (7b) is likely to be more powerful to reject collective rationality than restrictions (7c) and (7d).

1.3 Generalization of proposition 2 of BBC2009—multiple decision makers

Proposition 1 is valid for households in which it can legitimately be assumed that there are at most two decision-makers. Many household configurations (extended families, adult children, polygamous households, etc.), though, may potentially have more than two decision-makers.Footnote 12 It is relatively straightforward to extend Proposition 1 to multiple potential decision maker households.

Proposition 2

A system of N ⩾ I + 1 demand functions of a household with I + 1 members is locally compatible with rationality if, and only if, the distribution factors z either do not influence the demand system ( unitary rationality ):

$${D_{\bf{z}}}{x_n}\left( {\bf{z}} \right) = 0\quad \forall n = 1, \ldots ,N.$$
(8a)

or influence it in the following way when K ≥ I+ 1 ( collective rationality ): there exists a non-negative J ≤ I − 1 for which Footnote 13

$$\begin{array}{l} {D_{{{\bf{z}}_{ - J}}}}{{\bar x}_n}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) = 0\,or\, \\ {D_{{\bf{z}}_L^{\rm{*}}}}{{\bar x}_n}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) \ne \, \ne 0\,and\,{D_{{\bf{z}}_{ - L}^{\rm{*}}}}{{\bar x}_n}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) = 0\quad n = J + 1, \ldots ,N,\\ \end{array}$$
(8b)

where \({{\bf{z}}_{ - J}} \equiv \left[ {{\bf{z}}_L^{*\prime },{\bf{z}}_{ - L}^{*\prime }} \right]\prime\) with 1 ≤ LKJ. Moreover, when 2 ≤ L, the demand functions that satisfy \({D_{{\bf{Z}}_L^*}}{\bar x:n}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right) \ne \, \ne {\bf{0}},\) denoted \(\bar x_m^*\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right)\), must also satisfy:

$$\frac{{\partial \bar x_m^{\rm{*}}({{\bf{x}}_J},{{\bf{z}}_{ - J}})/\partial z_1^{\rm{*}}}}{{\partial \bar x_m^{\rm{*}}({{\bf{x}}_J},{{\bf{z}}_{ - J}})/\partial z_l^{\rm{*}}}} = \frac{{\partial \bar x_1^{\rm{*}}({{\bf{x}}_J},{{\bf{z}}_{ - J}})/\partial z_1^{\rm{*}}}}{{\partial \bar x_1^{\rm{*}}({{\bf{x}}_J},{{\bf{z}}_{ - J}})/\partial z_l^{\rm{*}}}} \ne 0\quad \forall l = 2, \ldots ,L\quad m = 2, \ldots ,M$$
(8c)

and, equivalently,

$${D_{{{\bf{z}}_{ - \left( {J + 1} \right)}}}}\bar x_m^{\rm{*}}\left( {{{\bf{x}}_J},x_1^{\rm{*}},{{\bf{z}}_{ - \left( {J + 1} \right)}}} \right) = 0\quad \forall {\it{m}} = 2, \ldots ,M,$$
(8d)

where 2 ≤ M ≤ N−J.

The proof is provided in a Web Appendix.Footnote 14 This proposition states that the demand system of an I + 1-person household is compatible with rationality if and only if it either complies with unitary rationality (8a) or with collective rationality (8b), (8c), and, equivalently to the latter, (8d). Restriction (8a) is identical to Restriction (7a). Restrictions (8b) and (8c) are equivalent to (7b) and (7c) but they involve z-conditional rather than unconditional demand functions. Restriction (8b) states that the demand system of an I + 1-person household influenced by at least I + 1 distribution factors is compatible with collective rationality if there is a non negative J ≤ I − 1 for which each demand contained in x −J , once conditioned on x J , either does not respond to any of the remaining distribution factors (z −J ) or responds to a subset of them, denoted \({\bf{z}}_L^{\rm{*}}\).Footnote 15 This subset must be the same for all the z J -conditional demand functions. In other words, those z J -conditional demands that still respond to distribution factors must respond to the same subset of them. The latter may include all the remaining distribution factors or a subset of them. Hence, having some z J -conditional demand functions responding to some distribution factors and other z J -conditional demand functions responding to other distribution factors is incompatible with collective rationality. Restriction (8c) further states that the z J -conditional demand functions \(\left( {\bar x:m^{\rm{*}}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right)} \right)\) responding to distribution factors \({\bf{z}}_L^{\rm{*}}\) are compatible with collective rationality only if the ratios of the marginal effect of any two distribution factors included in \({\bf{z}}_L^{\rm{*}}\) are equal across them. Finally, and equivalently to Restriction (8c), Restriction (8d) stresses that the \(\bar x:m^{\rm{*}}\left( {{{\bf{x}}_J},{{\bf{z}}_{ - J}}} \right)\) demand functions are compatible with collective rationality only if they no longer respond to the distribution factors once they are conditioned on one more demand influenced by \({\bf{z}}_L^{\rm{*}}\) (say \(x_1^{\rm{*}}\)).

Chiappori and Ekeland (2006) (henceforth CE2006) provide another generalization of Proposition 2 of BBC2009, which we present here in a slightly modified manner.

Proposition 3

(Chiappori and Ekeland 2006 ) A system of N ⩾ I + 1 demand functions of a household with I + 1 decision-makers is compatible with rationality if, and only if, distribution factors, z , either do not influence the demand system ( unitary rationality ):

$$rank\left[ {{D_{\bf{z}}}{\bf{x}}\left( {\bf{z}} \right)} \right] = 0$$
(9a)

or satisfies the following condition whenever K ⩾ I + 1 ( collective rationality ):

$$0 < rank\left[ {{D_{\bf{z}}}{\bf{x}}\left( {\bf{z}} \right)} \right] \le I.$$
(9b)

See CE2006 for the proof. Restriction (9a) is identical to our restrictions (7a) and (8a). Restriction (9b) states that if there exists K ⩾ I + 1 distribution factors, then the rank of the matrix D z x(z) must be greater than zero, but no greater than I. Intuitively, there can be no more than I Pareto weights under collective rationality. Since the distribution factors only impact the demand system through the latter, if there are fewer weights than there are distribution factors, their effects on the demand functions must necessarily be linearly dependent. Propositions 2 and 3 are equivalent. If a demand system satisfies (8b), (8c) and (8d), it will also satisfy (9b) and vice versa. However, if (8b) is satisfied, but restrictions (8c) and (8d) do not apply because L < 2, then (9b) may not be satisfied. This is because (8b) is only necessary. However, under collective rationality, and as long as K + 1 ≥ I, it will always be possible to partition \({\bf{x}} \equiv \left[ {{\bf x \prime}_J ,{\bf x}_{ - J \prime}} \right]\prime\) and \({\bf{z}} \equiv \left[ {{\bf z \prime}_J ,{\bf{z}}_{ - J \prime} } \right]\prime\) in such a way that L ≥ 2.Footnote 16

The two propositions provide falsifiable restrictions for an overall test of rationality. In the case of Proposition 2, the first step is to test whether all the demands satisfy Restriction (8a), i.e., to test whether households behave rationality in a unitary way. If this hypothesis is rejected, the next step is to test collective rationality. The general formulation of the null hypothesis corresponding to collective rationality is H 0:J ≤ I − 1 vs. H 1:J > I − 1. Since H 0 is a composite hypothesis, a sequential approach can be followed. One should thus start by testing whether all the demand functions satisfy Restriction (8b) when J = 0. If this hypothesis is rejected, the Restriction (8b) has to be tested for J = 1 and so on until it is not rejected for J ≤ I − 1. If the Restriction (8b) is not rejected for J ≤ I − 1, the Restriction (8c) or (8b) should be tested for those demand functions that are influenced by distribution factors. If the latter hypothesis is not rejected, then testing stops and collective rationality is not rejected. Conversely, if Restriction (8c) or (8d) is rejected, then Restriction (8b) should be tested again for a higher J. If the restrictions (8b) and (8c) or (8d) are successively rejected for all J ≤ I − 1, then collective rationality as well as overall rationality must be rejected. The same sequential approach must be used with Proposition 3. The first step is to test whether the rank of D z x(z) is equal to zero. If not so, then the next step is to test whether it is equal to 1 and so on until I is reached. If the rank is not found to be equal or inferior to I, then overall rationality is rejected. If the rank is found to be of any value greater than 1 but inferior or equal to I, then testing stops and collective rationality is not rejected.

1.4 Empirical investigations of household collective rationality

Household collective rationality is the object of much research in the empirical literature. In what follows, we present a brief and informal meta-analysis that illustrates how our propositions may partly explain why the standard testing approach is likely to under-reject the collective model based upon distribution factors.

Most papers focus on households composed of two adults, which is the concern of our Proposition 1. Demand systems are estimated using data from developed as well as developing countries and are based on a variety of functional forms (AIDS, QUAIDS, etc.). Likewise, a rich set of distribution factors are used to proxy spouses’ relative bargaining power (e.g., relative income, age and assets at marriage).

As shown earlier, for collective rationality of two-person households to be satisfied, the all or nothing Restriction (7b) needs to hold and Restriction (7c) and, equivalently (7d), only applies to the subset of demand functions that are responding to all the distribution factors. Yet, it is customary in the literature to neglect Restriction (7b) and to test collective rationality by means of Restriction (7c), or Restriction (7d), over the full set of demands, thus including those that do not respond to distribution factors as well as those who respond to a subset of them. This, we argue, partly explains why collective rationality is likely to be under-rejected. In short, the failure to reject the collective model may be an artifact of the testing procedure.

For illustration, we report in Table 1 a series of papers referred to in Chapter 5 of Browning et al. (2014) and in Naidoo (2015) whose empirical results are interpreted as generally supportive of the collective model. We see first in column 5 that among the ten papers using the proportionality condition or the z-conditional demands to test the collective model, only two of them (that is, Chiappori et al. 2002 and Attanasio and Lechene 2014) satisfy the BBC2009 assumption. Recall that this condition assumes that at least one distribution factor affects each demand equation. Second, among the ten papers, four of them (that is, Quisumbing and Maluccio 2003, Bayudan 2006, Bobonis 2009, Attanasio and Lechene 2014—columns 6 and 7) conclude that the collective model is not rejected though it is rejected, based on our all or nothing test. Note also that, according to our test, the unitary model is not rejected in Thomas et al. (1999) and Vermeulen (2005) as none of the distribution factors are statistically significant. This leaves only four papers which are consistent with collective rationality according to our test (Bourguignon et al. 1993; Thomas and Chen 1994; Browning and Chiappori 1998; Chiappori et al. 2002).Footnote 17

Table 1 Papers testing the collective model on 2-person households with proportionality condition or z-conditional demands

For example, let us focus on the paper by Quisumbing and Maluccio (2003). It uses data from various developing countries to estimate household demand systems. Assets of spouses at marriage and spouses’ schooling are treated as distribution factors. Results for Bangladesh show that the demand for food significantly responds only to the husband’s assets, that the demand for education responds only to the wife’s assets and the husband’s schooling and that the demand for child clothing responds only to the husband’s schooling. This is not compatible with collective rationality as it violates the all or nothing restriction. Indeed, in a demand equation where one distribution factor is statistically significant, all other distribution factors should also be significant. Results for Ethiopia show that food responds to the two assets distribution factors, but alcohol and tobacco only to the wife’s assets. This is not compatible with collective rationality either. In both cases, Restriction (7c) is tested with a joint test over all demands and is not rejected, and hence nor is collective rationality.

The latter study is representative of how most papers test collective rationality. In what follows, we use data from rural Burkina Faso to investigate rationality within monogamous and bigamous households. All or nothing Restriction (7b) (monogamous) or (8b) (bigamous) is first tested. If satisfied, we next move on to test (7c) or (8c) as the case may be. We also test collective rationality using the asymptotically equivalent test of CE2006 rank condition. We next use the same approach as in the above papers and test Restriction (7c) irrespective of whether the all or nothing Restriction (7b) holds.

2 The burkinabé family

Burkina Faso is one of poorest countries in the world. In 2014, the country ranked 181th out of 185 countries, with a life expectancy of 56.3 years, an adult literacy rate of 28.7% and a GDP per capita of 1 602 PPP US$ (UNDP 2014).Polygamy is quite prevalent. It is estimated that up to 22% of men and 42% of women are in a polygamous union (INSD 2010).

2.1 Distribution factors

According to the anthropological literature on Burkina Faso, spouses tend to behave non-cooperatively when a disagreement arises, at least initially. The husband “refuses to give cereals, money and gifts to his wife and will favor another wife. The wife, in return, will refuse to carry out her domestic and conjugal duties […] The wife may thus refuse to fetch water from the well for him, to heat it up for him, to wash his clothes and give him food that she has herself produced or bought”.Footnote 18 The more financially independent the wife is, the greater will be her bargaining power. The wife’s contribution to the household income could thus qualify as a distribution factor.

Over the years , the husband may gradually accumulate enough wealth to pay for the dowry of an additional wife. The husband’s threat of a co-wife may thus become more and more credible over time, thereby gradually reducing the bargaining power of the first wife, ceteris paribus. The duration of the marriage should thus qualify as a distribution factor in monogamous households. In the case of polygamous households, the anthropological literature has also highlighted that a wife’s bargaining power depends on the number of years since marriage and on her rank. The wives “must submit to an internal hierarchy conditioned by age and the length of the marriage: although negligible when less than a decade separates their birth or their union, it is perceptible beyond that. […] Furthermore, the first wife has authority on the other wives”.Footnote 19 This suggests to use the duration of the first wife’s marriage relative to the duration of the second’s wife marriage.

Two important remarks are in order. First, choosing appropriate distribution factors is difficult in that they are required to only affect allocations, not preferences, which is difficult to test. Thus the wife’s contribution to the household (labor) income can be considered as a distribution factor if we assume separability between leisure and consumption. We are not aware of any study that tests the collective model with distribution factors that does not make this assumption. We acknowledge that it is more problematic in the context of self-consumption as in the present case, but assuming non-separable preferences would greatly complicate the empirical strategy. Given that the main purpose of our empirical analysis is to illustrate the testing procedure and to show that under-rejection might occur when the BBC2009 assumption is overlooked, we follow standard practice and assume separability.Footnote 20 More importantly we do not account for household production and its effect on the virtual household income and consumption decisions. In addition, while our model assumes that the marriage status of the household is exogenous, the above discussion suggests that it could be endogenous as it may partly be determined by the relative bargaining power of the (first) wife. Indeed, the higher the bargaining power of the wife is, the more likely she will be able to impose her (presumed) preference for monogamy. This may be the source of a selection bias in our estimators as our econometric analysis is conditional on the household marriage contract. A natural approach would involve modeling the marital status of the household. This however would greatly complicate the analysis while being peripheral to the main point we wish to underline in this article. On the other hand, it must be acknowledged that such a selectivity problem applies to almost all the empirical literature on collective rationality since the formation and dissolution of households is itself endogenous.

2.2 The survey

With an aim at testing household efficiency in Burkina Faso, Anyck Dauphin, one of the co-authors of this paper, conducted a field survey between January and March 1999 under the auspices of the Centre canadien d’étude et de coopération internationale (CECI). The information on the income of the different spouses was collected indirectly. Since most households live off of agriculture, and since agricultural production survey are very complex, the survey focused on household expenditures which can be considered as a good indicator of their permanent income. For each spouse, data were collected on expenditures on food and non-food products, durable goods and self-consumption.

The survey was conducted in the Province of Passoré which has a population of approximately 322,000Footnote 21, primarily because the CECI had been involved in the region for a long time and had established close links with the local institutions. The province is divided into nine administrative regions. In order to minimize cost, the survey was limited to the five regions that were deemed the most representative of the economic and social fabric of the province. These include Dakiégré, Pelegtanga, Rallo, and Sectors 1 and 5 of the City of Yako (Yako-1, Yako-5).

To be included in the sample a household had to meet the following two conditions: (1) The (male) household head as well as his spouse(s) had to be less than 70 years of age and; (2) They all had to live permanently on the same compound. Prior to sampling, a census was conducted in each of the five regions to identify married households and to determine eligibility. Over 125 married households were then randomly selected among those eligible in each region, except for the village of Dakiégré where all 111 households were selected.Overall, as many as 551 households out of 611 were interviewed (response rate = 90%).

2.3 Sample characteristics

Sample size for monogamous and bigamous households are 392 and 117, respectively. The main characteristics of these samples are presented in Table 2, which is divided into three separate panels. The first shows that more than 30% of monogamous households are Muslim, a percentage that increases to 44% in bigamous households. Monogamous husbands are on average 42 years old, that is 7 years younger than polygamous males. Wives from monogamous households are on average 33 years old, somewhat in-between the age of first and second wives of bigamous households. Not surprisingly then, monogamous wives with an average of 3.5 children have fewer (more) children than the first (second) wife of bigamous households. The estimated budget for monogamous and bigamous households over a two month period covered by the survey are respectively 117 620 CFA and 216 743 CFA.

Table 2 Sample characteristics

The ability to assess the impact of distribution factors on expenditures is greatly enhanced if the survey focuses on assignable goods. These may be consumed by more than one household member but individual consumption is observed in the data. A priori, distribution factors that favour a particular household member should have a noticeable impact on his/her share of a given assignable good. The field survey was thus designed to collect information on the main assignable goods consumed by each member of the household. Survey pretesting indicated that clothing and hairdressing were the two most important items that could qualify as assignable goods in the rural Burkinabé context. Expenditures on these two goods were aggregated into a single category which we refer to as “Personal Care”. Each spouse in the household was thus surveyed about the expenditures made on these goods for his/her own purpose, and for those of the other spouses and their children. The second panel of the table reports the average share of the household budget devoted to the clothing and hairdressing of the husband (PC-Husband), his wives (PC-Wife1 and PC-Wife2) and their respective children (PC-Child1 and PC-Child2). The shares of personal care accruing to the wives are larger than those of husbands and children in both monogamous and bigamous households. In monogamous households the wives’ share amounts to 9.95% while it represents about 4% for both wives in bigamous households. The main staple food, millet, represents between 7.3 and 10.5% of the household budget, whereas the remaining food items (O-Food) account for slightly more than 40%.

The last panel of the table focuses on distribution factors. In our data, a monogamous wife contributes on average to approximately 23% of total household income. In bigamous households, the first and second wives’ shares are 17 and 14%, respectively. Finally, monogamous households had lasted approximately 14 years and bigamous households had formed 22 years prior to our survey. The time lapse between first and second marriages is about 11 years.

3 Estimation results

Our estimation strategy is threefold. For both monogamous and bigamous households we estimate a QUAIDS demand system. Given the relatively small size of our samples, we thought this parametric model would be a good approximation to a full-fledged non-parametric one. We first test rationality with Proposition 1 for monogamous households and with Proposition 2 for bigamous households. Second, we use the test of the rank condition proposed by CE2006. Finally, we test collective rationality for monogamous households using the restrictions proposed by Bourguignon et al. (2009), irrespective of whether one of the distribution factors locally affects each demand of the system.

3.1 Monogamous households

The demand system is composed of six non-durable goods, of which three are assignable: PC-Husband, PC-Wife, PC-Children, Millet, Other Foods and expenditures on remaining nondurable goods. Only the first five demands are estimated due to the adding-up constraint. Individual shares are regressed against the log of total expenditures on non-durable goods (lnExp) and its square (lnExp2). The two distribution factors are the log of the wife’s share of household income (SIncomeW) and the duration of marriage (DMar). We also control for location, religion, age of spouses and the number of children under 16 years of age (xn). The budget shares functions (for n = 1,…,5) are written as:

$${w_n} = {\bf x\prime}_n {{\boldsymbol \alpha }_{\boldsymbol{n}}} + {\beta _n}\, {\rm ln}\,Exp + {\theta _n}{\left( {{\rm{ln}}\,Exp} \right)^2} + {\delta _n}SIncomeW + {\gamma _n}DMar + {\varepsilon _n}$$
(6)

To account for the possibility that total expenditures on non-durable goods are endogenous, we included the residuals of the auxiliary regressions of the log of total expenditures and its square on a set of instruments in the QUAIDS specification in (6) as control functions. The instruments used in the first stage equation were the log of total income, its square and the sum of the number of years of education of the husband and his wives. Since households in rural Burkina Faso have next to no savings and spend very little on durable goods, not surprisingly, these instruments were found to be very strong. Given that the residuals proved to be not significant, we did not reject the exogeneity of the total expenditures in any regression. Therefore Table 3 focuses on the OLS estimation results. Several parameter estimates are statistically significant. Ethnic groups and location appear to be important determinants of expenditure shares. Husbands’ age is negatively related to both PC-Husband and PC-Wife but positively related to Other Foods. Likewise, the number of children has a negative impact on PC-Husband and PC-Wife but a positive one on PC-Children, as expected. The log of expenditures and its square are not individually significant for any share, but are jointly significant for PC-Children, Millet and Other Foods.

Table 3 OLS estimation of the QUAIDS demand system monogamous households

Interestingly, the wife’s share of total income (SIncomeW) impacts negatively PC-Husband and positively PC-Children. These results are consistent with a larger share of income translating into a larger bargaining power. Notice also that DMar usually appears to be unfavorable to the wife presumably because the likelihood of the husband contracting a new marriage increases.

3.2 Rationality tests based on proposition 1

Proposition 1 provides restrictions that are gradually more restrictive. This allows us to adopt a sequential approach. The first step is to test unitary rationality (7a), which assumes away the existence of distribution factors. It is straightforward to test this restriction based on point hypotheses through simple t-tests.Footnote 22 According to Table 3, this restriction must be rejected since the two distribution factors we consider are statistically significant in various demand functions.

We thus move on to test collective rationality using Restriction (7b). This all or nothing restriction is a necessary condition requiring that each demand function either do not respond to any of the distribution factors or respond to all of the distribution factors. Again, simple t tests of significance can be used.Footnote 23 As shown in Table 3, the shares of PC-Wife and PC-Children respond significantly only to one out of two distribution factors. Therefore, collective rationality must be rejected.

3.3 Rationality tests based on CE2006

As stressed earlier, the CE2006 rank condition test is asymptotically equivalent to the test of our Proposition 1. Their test is carried out sequentially, starting with the Restriction (9a). The null hypothesis H 0:rank[D z x(z)] = 0 is computed using an F statistic to test that both distribution factors are simultaneously statistically significant in all the demand functions. The F(10,1870) statistic is equal to 4.28 and has an associated P-value of 0.00001. Clearly, the unitary rationality is rejected. We thus move on and test H 0:rank[D z x(z)] = 1. Since our sample is relatively small, we implement a recent constrained bootstrap method proposed by Portier and Delyon (2014) to insure both our proposition and that of CE2006 have similar statistical properties.Footnote 24 More precisely, we use the test statistic proposed by Li (1991) and we estimate its bootstrap distribution under the null hypothesis that rank[D z x(z)] = 1 as suggested by Portier and Delyon (2014). The sampling value of the statistic is equal to 0.00577 and has a P-value of 0.0001. The CE2006 test is thus consistent with our own test in rejecting collective rationality.

3.4 Rationality tests based on BBC2009

Recall that Proposition 2 of BBC2009 requires that all the demands function respond to at least one common distribution factor. As stressed earlier, this condition is hardly ever satisfied. It is certainly not in our data. We nevertheless omit this condition and carry on testing collective rationality. Restriction ii of Proposition 2 of BBC2009 stipulates that for collective rationality to hold the ratio of the marginal effects of the two distribution factors must be equal across the demand functions, i.e., H 0: δ 1/γ 1 = δ 2/γ 2 = δ 3/γ 3 = δ 4/γ 4 = δ 5/γ 5. Based on the parameter estimates of Table 4, we get a test statistic of χ 2(4) = 7.02 with an associated P-value of 0.135. In other words, the null assumption can not be rejected and so neither is collective rationality.

Table 4 OLS estimation of the demand system—bigamous households

Obviously, this test is fundamentally flawed because it is based on a false premise. Indeed, both distribution factors are not statistically different from zero in three demand functions. Hence, the ratios of the marginal effects are essentially zero in most cases. As a matter of fact, a joint test that all the ratios are equal to zero, i.e., H 0: δ 1/γ 1 = δ 2/γ 2 = δ 3/γ 3 = δ 4/γ 4 = δ 5/γ 5 = 0, yields a test statistic of χ 2(5) = 9.06 with an associated P-value of 0.107. In other words, we can not reject the null assumption that all the ratios are equal to zero. A naive application of the BBC2009 test would thus lead us to (falsely) not to reject collective rationality.

The Proposition 2 of BBC2009 provides another restriction stated in terms of z-demand functions which is equivalent to the Restriction ii. This restriction, like our Restriction (7d), requires that a given demand function be inverted relative to one of its distribution factors and that the latter be substituted into the remaining demand functions. It states that the resulting z-conditional demand functions must no longer respond to the distribution factors. One difficulty with this approach is that the conditioning demand function must be instrumented to obtain consistent estimators. The literature suggests the substituted distribution factor be used as an instrument. We thus considered the four possibilities provided by Table 3, that is inverting PC-Husband or PC-Children on SIncomeW and inverting PC-Husband or PC-Wife on DMar. For each of these possibilities, we tested whether the distribution factor constituted an adequate instrument. In each case, the Cragg-Donald Wald F statistic, which is equal to the effective F statistic in the just-identified case, was found to be well below the three critical values suggested in the literature. These are the rule of thumb of Staiger and Stock (1997), the Stock and Yogo (2005) critical value for 10% maximal IV size distortion, and the critical value associated to a bias of Nagar of 10% as proposed by Montiel and Pflueger (2013). Furthermore, despite our best efforts, we could not come up with a single set of satisfactory instruments in an overidentified context. All proved to be weak. Consequently, we elected not to investigate Restriction iii of BBC2009 further.Footnote 25

3.5 Bigamous households

The demand system includes the same items as with monogamous households but in addition includes personal care of the second wife (PC-Wife2) and her children (PC-Children2). The system is composed of eight non-durable goods, five of which are assignable. The first seven demand functions are estimated using the QUAIDS system. The three distribution factors include the log of the share of income of each wife relative to total household income as well as the share of the duration of the first wife’s marriage relative to the total duration of the marriage of the two wives, i.e., SDMarW1/The other explanatory variables are the same as with monogamous households but also include the age of the second wife as well as her number of children aged under 16. The budget shares (for n = 1,…,7) are written as follows:

$$\begin{array}{rcl}{w_n} = {\bf x \prime}_n {{\boldsymbol{\alpha }}_{\boldsymbol{n}}} + {\beta _n}\,{\rm{ln}}\,Exp + {\theta _n}{\left( {{\rm{ln}}\,Exp} \right)^2} + {\delta _n}SIncomeW1 \\ + {\rho _n}SIncomeW2 + {\gamma _n}SDMar W1 + {\varepsilon _n}\end{array}$$
(7)

Table 4 reports the OLS estimation results. Several parameter estimates are statistically significant. As with monogamous households, ethnic origin and region of residence are important determinants of spending patterns. The log of total expenditures and its square are not statistically significant.

The distribution factors SIncomeW1 and SIncomeW2 are statistically significant in two demand functions and have the expected signs. The former proxies the bargaining power of the first wife and interestingly is shown to have a negative impact on both PC-Husband and PC-Wife2. The latter proxies the second wife’s bargaining power and positively affects PC-Wife2 and PC-Children2, as expected. Finally, SDMarW1 proxies the seniority of the first wife and is associated with her having greater bargaining power. Results show it has a negative impact on PC-Children2 and a positive one on Millet and O-Food. In short, the marginal effects are intuitively appealing and are consistent across the demand system.

3.6 Rationality tests based on proposition 2

Proposition 2 is a generalization of Proposition 1 to multi-person households. Testing begins with Restriction (8a), which implies that the three variables SIncomeW1, SIncomeW2 and SDMarW1 must have no influence on the demand system. Because the distribution factors are statistically significant in various demand functions we reject that bigamous households behave in a unitary way.

We next investigate whether collective rationality holds using the necessary Restriction (8b). The first step is to test whether (8b) holds when J = 0, i.e., that each individual demand function either does not respond to any of the distribution factors or responds to all of them. This restriction is clearly rejected since none of the demand functions responds to all three distribution factors. We thus move on to test whether Restriction (8b) is satisfied when J = 1. According to Restriction (8b), any given demand function that responds to the distribution factors may be inverted relative to one of the latter. Upon substituting the distribution factor, the remaining conditioned demand functions must either all be insensitive to the remaining distribution factors, or only be sensitive to a common subset. From Table 4, there are six possible inversions: PC-Husband and PC-Wife2 relative to SIncomeW1, PC-Wife2 and PC-Children2 relative to SIncomeW2 and finally, PC-Children2 and O-Food relative to SDMarW1. For each of these possibilities, we tested whether the distribution factor constituted a weak instrument. The Cragg-Donald Wald F statistic is reported in Table 5. Based on the rule of thumb of Staiger and Stock (1997), the variable SIncomeW2 is not a weak instrument in PC-Wife2 and PC-Children2. However, based on the Stock and Yogo (2005) critical value for the 10% maximal IV size (=16.38), we do not reject that SIncomeW2 is a weak instrument of PC-Wife2. This instrument is also weak accordingly to the critical value associated to a 10% bias of Nagar as suggested by Montiel and Pflueger (2013) (=23.1). Because of these mixed results, and in order to show how to implement Restriction (8b) with J = 1, we carry out the analysis and invert SIncomeW2 and PC-Wife2. We next substitute out the latter in the unconditional demand functions that respond to the distribution factors, namely PC-Husband, PC-Children2 and O-Food. The resulting z-conditional demand functions were estimated by 2SLS and the results are reported in Table 6.

Table 5 Tests based on proposition 2–bigamous households
Table 6 2SLS estimation of z1-conditional demands - polygamous households

The parameter estimates of the z 1-conditional demand functions are very similar to their unconditional counterpart. The distribution factor SIncomeW1 remains statistically significant in PC-Husband while the same holds for SDMarW1 in PC-ChildrenW2 and OFood. This is at odds with Restriction (8b) when J = 1. Therefore, conditionally on the assumption that SIncomeW2 is not a weak instrument of PC-Wife2, collective rationality is rejected.Footnote 26

3.7 Rationality tests based on CE2006

As with monogamous households, we begin by testing unitary rationality. Restriction (9a) posits that rank[D z x(z)] must be equal to zero for this to hold. The test yields F(10,1420) = 4.85 with a very small P-value. Unitary rationality is thus strongly rejected. We next test H 0:rank[D z x(z)] = 1 using the bootstrapped Li (1991) statistic. The sample value of the statistic is 0.1018 with a P-value of about 0.000. Once again, the null assumption must be rejected. This is consistent with our previous result based on Restriction (8b) for J = 0. Finally, we move on to test H 0:rank[D z x(z)] = 2. This is equivalent to testing there are three decision-makers in the household. The sample statistic is equal to 0.017 and its P-value is about 0.000. Rejection of the hypothesis thus implies rejection of collective rationality, which is consistent with the result from our Proposition 2.

4 Conclusion

The collective household model has become one of the main paradigms to conduct empirical research on consumption decisions by multi-person households. There are good reasons for that: it is based on relatively innocuous assumptions, and assuming these hold, it allows investigating intrahousehold impacts of numerous policies. Yet, one can not help but be concerned about the falsifiability of the model as it is seldom if ever rejected in the empirical literature. Such overwhelming evidence in favor of the collective model may eventually stray many from investigating the foundations of the model toward assuming it holds, irrespective of the environment under investigation.

We suspect the under-rejection of the collective model is primarily due to the manner in which its underlying theoretical restrictions are translated into statistical restrictions. Indeed, most papers investigate the collective model using a test procedure that was proposed by BBC2009. Yet the procedure requires that in any given demand system all functions respond to at least one common distribution factor. This implies that all distribution factors impact on all demand functions. This assumption is scarcely met in the empirical literature. Most papers simply ignore this and thus conduct tests based on a false premise.

In this paper, we provide a new falsifiable restriction which extends BBC2009’s approach insofar as it does not require a distribution factor to affect each equation of a demand system. In the case of a two-person household, this (all or nothing) restriction imposes that each demand function is either affected by all distribution factors or by none of them. We derive a set of testable conditions and fully characterize collective rationality, assuming no variations in prices. Moreover, our approach is generalized to households comprising potentially more than two members. We provide a brief and informal meta-analysis that suggests that much of the evidence in favour of the collective rationality in the empirical literature fails to satisfy our new restriction.

We illustrate the usefulness of our approach by investigating efficiency in allocation of consumption within monogamous and bigamous households in rural Burkina Faso. Social and cultural environments as well as institutional arrangements are likely to impede the enforcement of efficient marriages. We thus do not expect, a priori, outcomes to be efficient. Based on our proposed test procedure and on a test of Chiappori and Ekeland (2006)’s rank condition, rationality is found not to hold for monogamous and polygamous households alike. We next proceed to test rationality for monogamous households using the test procedure of Bourguignon et al. (2009) while neglecting the fact that none of the distribution factors are statistically significant in every demand functions. Collective rationality is then (falsely) found to hold for monogamous households.

An important conclusion that can be drawn from this paper is that the collective model is empirically falsifiable: its underlying assumptions translate into non-trivial constraints that may be rejected in particular cases. Appropriately accounting for the latter may reveal that far fewer households behave efficiently than what the current literature suggests.

Recent work (e.g., Lechene and Preston 2011) has shown that non-cooperative models with public goods or externalities may impose restrictions on household behavior. A natural extension to our paper would be to develop and take to data a general model that has the collective and the non-cooperative (Nash) models as particular cases. Rigorous testing of competing sets of constraints would enhance our understanding of household behavior (see Naidoo 2015).