1 Introduction

This study explores the effect of welfare programs on work incentives and the adult labor supply in developing countries. The analysis builds on the experimental evaluations of three programs implemented in rural areas in Latin America: Mexico’s Programa Nacional de Educacion, Salud y Alimentacion (PROGRESA), Nicaragua’s Red de Protección Social (“Social Protection Network”) (RPS), and Honduras’ Programa de Asignación Familiar (“Family Allowance Program”) (PRAF). The study takes advantages of the random assignment of localities to program deployment and control groups and presents comparable estimates of impacts on the adult labor supply and remuneration levels. These estimates are based on homogeneous datasets and were arrived at through the use of a common estimation methodology.

The impact of welfare and income-support programs on labor supply has been widely studied in developed countries (Moffitt 2002; Meghir and Phillips 2008; Moffitt and Scholz 2009). This literature has pointed out the existence of work disincentives among recipient households, and these and other considerations have prompted recent reforms that have incorporated sophisticated measures to mitigate these negative effects (Moffitt 2003a; Blundell and Hoynes 2004; Dickens et al. 2004; Michalopoulos et al. 2005). The programs under study here are conditional cash transfer (CCT) programs, which combine monetary benefits with incentives for curbing child labor and fostering the accumulation of human capital. Benefit receipt is subject to a series of verifiable conditions, such as school attendance, vaccination, and regular medical checkups, among others. The results of a number of evaluations in Latin America indicate that cash transfers, especially when combined with conditionalities, have proved successful in increasing welfare and human capital accumulation in recipient households and in reducing child labor (see the reviews by Rawlings and Rubio (2003, 2005) and Fiszbein and Schady (2009)).

Unlike their recent counterparts in the USA and Europe, however, these programs do not incorporate measures to guard against potential negative impacts on the adult labor supply. Moreover, there is very little consistent, systematic evidence regarding this aspect, despite the existence of a wealth of empirical analyses concerning their intended outcomes. This study attempts to establish whether these cash transfers have any incentive effects on the labor supply of adults in recipient households, on non-eligible individuals, and on the broader labor-market equilibrium.

The main contribution made by this study is the systematic clear-cut evidence that it provides concerning the labor-supply effects of welfare programs in developing countries. Despite the crucial role played by such programs in the income-generation process among poor segments of the population, there is limited evidence concerning labor-supply decisions in this context. Existing studies point to the presence of complex interactions among public policy, work incentives, and labor allocation within households (see, for instance, Ardington et al. 2009). Moreover, the systematic evidence presented below is derived from experimental evaluation designs,Footnote 1 which have clear advantages over the policy and natural experiments underlying most previous studies of welfare programs and labor outcomes (Angrist and Krueger 1999; Blundell and MaCurdy 1999; Imbens et al. 2001; Eissa et al. 2008). These evaluation strategies have also overcome some of the shortcomings of previous randomized experiments, such as those of the negative income tax of the 1960s and 1970s in the USA (Ashenfelter and Plant 1990; Moffitt 2003b).

Comparable results for the three countries indicate that the effects of these programs on the labor supply of participating adults are, while primarily negative, small and nonsignificant. Even though they provided considerable transfers, the programs did not reduce the labor supply substantially in the short term. However, the evidence also reveals the presence of other effects on labor markets. In the case of PROGRESA, there was a small positive effect on the number of hours worked by female beneficiaries, a sizeable increase in wages, especially among male beneficiaries, and a resulting increase in household labor income after the program had been in operation for 2 years. These impacts can be attributed to changes in the labor supply of adults in eligible households and to the increased amount of time available to women as a result of higher school enrollment rates among children. Moreover, PROGRESA seems to have reduced the female labor-force participation rate in ineligible households in the localities randomly assigned to the program. This mechanism is related to recent findings on the indirect impact of CCTs on ineligible households (Angelucci and De Giorgi 2009) and implies that large-scale interventions can have broader equilibrium effects. This additional layer of complexity should be considered in the design and evaluation of future interventions. These equilibrium effects also have important consequences for the interpretation of results from randomized controlled trials (see Moffitt 2003b; Duflo et al. 2008; Heckman 2008; and the debate between Deaton 2009 and Imbens 2010).

The study is organized as follows. Section 2 discusses the theoretical underpinnings of the potential impact of cash transfers on labor supply and presents a review of the empirical evidence for countries in Latin America. Section 3 briefly reviews the programs and their evaluation strategies and then goes on to describe the estimation and inference procedures. Section 4 presents the empirical results on labor-market outcomes for adults in the three programs. Section 5 provides an overview of the results and some conclusions.

2 Labor markets and conditional cash transfer programs

2.1 Potential impacts of CCTs on labor markets

CCT programs combine short-term poverty alleviation (through cash transfers) with long-term outcomes that are achieved through the use of incentives for human capital accumulation (school attendance, health checkups, improved nutrition, and the reduction of child labor).Footnote 2 With the exception of minor training components in some programs, the overall design of the CCTs in Latin America is not directly related to the employment of adults in beneficiary households. There are no restrictions on work, and, unlike previous workfare-like initiatives in developing countries, CCTs do not use low-wage jobs as targeting mechanisms (Besley and Coate 1992; Kanbur et al. 1994). Most importantly, earned labor income does not reduce benefit levels. In this sense, CCTs constitute a simpler policy instrument than welfare programs in developed countries: as a pure subsidy (as far as adults are concerned—although some of the conditionalities might imply some costs in terms of time), CCTs do not induce steep replacement rates as traditional welfare programs do, nor do they entail the complexity of welfare-to-work initiatives such as the Earned Income Tax Credit in the US (Eissa and Liebman 1996) or the UK’s Working Families’ Tax Credit (Meghir and Phillips 2008).Footnote 3

The lack of work requirements does not mean, however, that the programs are neutral in terms of adult labor supply and work incentives. The income-support component and the conditionalities relating to children’s health and education might still have affected these outcomes. The economic theory suggests several ways in which CCTs can affect work decisions within recipient households. In a standard static model of choice between consumption and leisure, the components of CCTs may play a role through at least four channels.

Firstly, the cash transfer component of the program generates an increase in unearned non-labor income. As such, it induces a pure income effect, which loosens the budget constraint of the recipient households. The rise in unearned income could reduce the number of hours of work if leisure is a normal good for beneficiaries, but the presence of fixed hour or money costs, such as commuting or childcare (Cogan 1981; Bhattarai and Whalley 2003; Kluve and Tamm 2012), may induce an increase in labor supply as a result of the lump-sum transfer (Ralitza and Wolff 2011).

Conditionalities constitute the second channel through which CCTs may induce behavioral responses in the adult labor supply. The requirements related to children’s human capital accumulation may have an impact on a household’s allocation of time: the positive impact of CCTs on children’s school attendance could free up time previously spent on childcare (Blau and Tekin 2007; Baker et al. 2008; Mörk et al. 2013), thereby further reducing the cost of work. There is some evidence of reduced participation of women in domestic work in the case of PROGRESA (Parker and Skoufias 2000).

The third channel is related to the potential decrease in household income associated with the reduction in child labor (Basu and Hoang Van 1998). This effect diminishes the net impact of cash transfers in households where children are induced to reduce their participation in work activities and could therefore mitigate the transfer’s potential disincentive in terms of the adult labor supply.Footnote 4

Finally, the fourth channel operates through different types of spillovers. On the one hand, there may be indirect effects: Angelucci and De Giorgi (2009), for instance, find that PROGRESA has had an impact on the consumption of ineligible households in program communities, and Bobonis and Finan (2009) report substantial spillovers in terms of secondary school enrollment decisions for the same program. On the other hand, changes in the labor supply schedule of beneficiaries may affect aggregate wage levels and thus remunerations for both recipients and non-recipients. In the presence of such effects, the identification strategy based on the random allocation of the program would be partially compromised owing to a violation of the stable unit treatment value assumption (Angrist et al. 1996). In terms of the labor supply, equilibrium effects reduce the scope for the interpretation of reduced form estimates as simple labor supply elasticities with respect to unearned income.

The combination of these four channels implies that the overall effect of CCTs on labor market outcomes for adults is ambiguous from a theoretical point of view. The presence of any impact, and its direction, is ultimately an empirical question.

2.2 The impact of CCTs on labor markets: previous findings for Latin America

Most of the literature on the impact of CCTs focuses on the programs’ intended outcomes. While results vary from country to country, program evaluations reveal, to some degree, a positive effect on years of schooling, reductions in child labor, and improvements in some key health indicators (Rawlings and Rubio 2003, 2005; Bouillon and Tejerina 2006; Fiszbein and Schady 2009), as well as other related unintended effects on, for instance, fertility (Todd et al. 2011).

Effects on the adult labor supply have been partially analyzed for PROGRESA and RPS. The significant reduction in child labor found in the case of PROGRESA (Skoufias and Parker 2001) contrasts with the absence of an impact on labor market outcomes for adults in beneficiary households, according to results from Parker and Skoufias (2000) and Skoufias and Di Maro (2008). Both studies, based on probit estimations, find no significant program effects on adult labor-force participation within eligible households in program localities. Also in the context of PROGRESA, Angelucci and De Giorgi (2009) find that household equivalent labor earnings for adults are not affected by the program. This study provides further results from estimations that include fixed effects at the household or individual levels. While these controls may not be strictly necessary in the context of a randomized controlled trial, they allow for better controls for baseline differences in employment (see Table A2 and Behrman and Hoddinott 2005) and may also induce a gain in precision (Duflo et al. 2008). The results include these additional controls and confirm the robustness of previous findings.

The impact of Nicaragua’s RPS program on the adult labor supply is analyzed in detail by Maluccio (2007). While the studies on PROGRESA referred to in the previous paragraph concentrated on individual labor-force participation and household earnings, Maluccio (2007) studies the effect of RPS on total hours of work at the household level. The results, obtained by means of a random effects model, indicate that the program has had a small but significant negative effect on total household hours of work, with most of the negative impact relating to the amount of time spent in agricultural activities. These effects and their causes are discussed in detail below, with the evidence being presented here pointing to a household composition effect rather than a direct effect on hours worked.

There are fewer papers that draw upon evaluation data for Honduras’ PRAF program. Galiani and McEwan (2012) developed an original evaluation strategy based on census data instead, which was collected shortly after the program was implemented. They report no significant effects for PRAF on the labor supply of adult women and only a small decrease (1 %) for adult males, although this estimate is not robust to alternative specifications.

The analysis presented in this study provides comparable results for the three programs. They are based on a common procedure for processing the original datasets, which leads to homogeneous definitions for dependent and independent variables. Moreover, the estimates for the three programs are derived from the same methodology and allow for the same type of controls for randomization imbalances and other issues by including individual and household fixed effects. Finally, while evaluations of PROGRESA and PRAF have concentrated on individual participation and those of RPS on household hours, the results detailed below allow for further disaggregation in order to look at participation, hours of work, sector allocation, household labor earnings, and wages (when possible) for all programs.

3 Experimental evaluation strategies and estimation methodology

3.1 The programs and their evaluations

The data used in this analysis are drawn from ad hoc longitudinal surveys carried out in order to evaluate each specific intervention. The three programs share a common evaluation methodology, with baseline and follow-up data collection being conducted in localities that were randomly assigned to program deployment and in those that were selected into the control group. The three data sources were harmonized on the basis of a common set of criteria in order to achieve maximum comparability using the methodology described in CEDLAS (2012).

The three interventions targeted rural areas in poor regions of the respective countries. The following paragraphs briefly describe the three programs’ evaluation strategies,Footnote 5 based on PROGRESA’s experimental design, which randomized program deployment at the locality level.

In 1997, Mexico began implementing the first phase of PROGRESA. It was geographically targeted by locality. From an initial group of the 506 localities that were selected for the first round, 320 were randomly selected to participate in the program, which was not deployed in the remaining 186 localities. Households in the latter localities were still subject to the data collection process and thus constituted the control group for the program’s evaluation. The intervention also included a targeting rule based on a proxy means test: only qualifying households in treatment localities were eligible to participate.

The data employed in this study are drawn from the PROGRESA Evaluation Survey. The estimates discussed below are based on the initial baseline survey and on three follow-up roundsFootnote 6 conducted at 6-month intervals following program implementation. The surveys collected sociodemographic and labor-market information for all households and individuals in both treatment and control communities.

Honduras’ PRAF was implemented in a set of 50 randomly selected municipalities of a total of 70, with the 20 remaining municipalities forming the control group. The data used in this study correspond to a baseline survey carried out in the second half of 2000 and a follow-up survey in 2002. In contrast with PROGRESA, where all households in treatment and control localities were interviewed, the PRAF surveys covered only a representative sample of households. The corresponding sampling weights are used in the empirical work outlined below.

For the case of Nicaragua’s RPS, half of the countries’ poorest 42 localities were randomly assigned to the treatment group. The data used in this study are drawn from the initial baseline survey carried out in the third quarter of 2000 and the first and second follow-up surveys conducted in October 2001 and October 2002, respectively. As with the PRAF evaluation data, the survey consists of a representative sample of the population in treatment and control localities, and sampling weights are used for the estimations.

Finally, although the programs have a number of characteristics in common, it should be noted that there were significant differences in the average size of the cash transfers provided by each of the initiatives. Imputing transfer values from each program’s eligibility rules to the evaluation samples used in this analysis, the transfers represented about 4 % of total household consumption for PRAF, 20 % for RPS, and 40 % for PROGRESA.Footnote 7 The potential effect of these differences is discussed in the section that covers the empirical results.

3.2 Estimation and inference with random assignment by locality

In view of the random assignment of localities in the context of the three programs under study and the availability of repeated observations, a differences-in-differences (DD) estimation technique is the most suitable one of exploiting the evaluation design and identifying the causal effects of the programs. A standard DD model with controls takes the form of

$$ Y_{ist} =A_s +B_t +cX_{ist} +\beta I_{st} +\varepsilon_{ist} $$
(1)

where \(Y_{ist}\) denotes the outcome variable of interest for individual (or household) i in group (or village) s at time t, \(I_{st}\) is an indicator variable representing treatment status for group s in time t (or alternatively, an interaction between a treatment group indicator and time effects), \(A_{s}\) and \(B_{t}\) are group and time effects, respectively, \(X_{ist}\) is a matrix of individual characteristics, and \(\varepsilon _{ist}\) is an error term. The estimate of the program impact is the coefficient \(\beta \). Without the \(X_{ist}\) controls and with two time periods, the estimate of \(\beta \) by ordinary least squares (OLS) is simply the difference in changes in mean outcomes between the treatment and control groups between the two time periods. The more general case, with more than two time periods, adds a full set of time controls and interactions to account for differential evolutions over time.

The canonical DD model of Eq. 1 without including individual controls \(X_{ist}\) provides estimates of \(\beta \) that amount to differences in the outcomes at the locality level. The evaluation of PROGRESA, PRAF, and RPS, however, collected repeated household and individual observations, which means that a much richer set of information is available and can be exploited (Wooldridge 2001, 2007). Specifically, the inclusion of individual (or household) fixed effects in the estimation of Eq. 1 permits the identification of program effects while controlling for some of the pretreatment differences between localities (see Table 1, discussed below, and in the Electronic Supplementary Material). Moreover, this allows for a potential gain in precision (Duflo et al. 2008). While these individual fixed effects were not accounted for in the studies of labor supply reviewed in the previous section, they are routinely included in evaluations of CCT impacts on other outcomes (for instance, in Gertler’s 2004 evaluation of PROGRESA’s effect on health, among many others). The results discussed below provides two sets of estimates for each outcome based on Eq. 1: one with a full set of individual controls \(X_{ist}\), and one with a full set of individual fixed effects but no time invariant \(X_{ist}\) variables.

Table 1 Descriptive statistics, by program

With respect to the estimation methodology, the empirical results presented below are based on linear models—either OLS or fixed effect (FE) estimations of Eq. 1—for binary dependent variables such as labor-force participation and for continuous variables such as hours of work, wages, and income. As pointed out by Angrist and Pischke (2008), linear probability model estimates do not differ substantially from those of probit or logit regressions. Moreover, coefficients for the indicator and interaction variables in Eq. 1 have a straightforward causal interpretation for linear estimates.

All the results presented below give estimates of \(\beta \) in Eq. 1 over the full treatment and control samples, which correspond to intention-to-treat (ITT) coefficients. In the case of PROGRESA, the dataset contains a multidimensional targeting score which was used as a proxy means test for participation within program localities, thus making it possible for the eligibility status of each household to be known. For this reason, PROGRESA’s results are also computed as differences between eligible households in treatment and control localities (average treatment effect—ATE)Footnote 8 and differences between ineligible households in the two sets of localities. The latter estimates correspond to Angelucci and De Giorgi’s (2009) indirect treatment effects (ITE).Footnote 9 To account for potential heterogeneous effects of the programs, the estimations are also computed by conditioning on the gender of the individual or the household head, as an alternative to the inclusion of multiple interactions (Djebbari and Smith 2008).

Finally, the standard errors in the estimations need to account for the structure of the programs’ evaluation and implementation processes. In the context of the three CCTs under study, random assignment did not apply directly over beneficiary households or individuals. The allocation was instead done at the geographical level. In terms of the equation above, randomization occurs at the group (village) level (s) instead of the individual or household level (i). Since eligibility for the program is defined at the group level, the standard errors of the DD estimates should account for the likely intra-cluster correlation to avoid a potential bias. Donald and Lang (2007) attribute this bias to the fact that many of the outcomes analyzed in the literature are serially correlated, which is not usually controlled for in DD estimations (see the discussion in Bertrand et al. (2004, BDM henceforth)). This issue may be particularly important in the case of the labor-market outcomes covered in this study. A failure to account for this correlation across the randomization groups makes the usual OLS standard errors inconsistent and leads to erroneous inferences of the program’s causal effects.

BDM propose two methods to correct the standard errors of estimates in Eq. 1:Footnote 10 (a) taking into account the serial correlation of the outcome variable in each group s (this is known as cluster-robust variance estimation (CRVE) and is implemented by clustering observations by the assignment groups (e.g., localities)) and (b) estimating standard errors using block bootstrap with replacement. The first method was used to arrive at the empirical results presented below; the standard errors are virtually equal to those obtained from block bootstrapped standard errors.Footnote 11 These corrections to the covariance matrix yield unbiased estimates of household- or individual-level outcomes in geographic targeting settings, thus accounting for potential serial correlation across groups.

4 CCTs and labor-market outcomes for adults

4.1 Descriptive statistics and random assignment processes

This section discusses the empirical evidence regarding the effect of CCT programs on labor outcomes for adults using experimental evaluation data from the three interventions detailed above: PRAF (Honduras), PROGRESA (Mexico), and RPS (Nicaragua).

Table 1 presents a series of descriptive statistics compiled at the time of the baseline survey for both treatment and control localities for these three programs. These statistics make it possible to verify the nature of the balance between the treatment and control groups in terms of observables. As expected in a rural setting in developing countries, household size in all three programs is fairly large, with an average of more than six individuals per household. About 70–80 % of these households include two spouses. The calculations in the tables show that treatment and control households are not significantly different in terms of their demographic composition, with only a few small significant differences for some variables.

Table 1 also presents average educational levels for the treatment and control localities in each program. Since the programs are targeted at poor areas in each country, the distribution of educational outcomes is concentrated in lower levels of attainment, with about 5 years of education for PROGRESA, 2.2–2.3 for RPS, and 3.4–2.9 for PRAF. The differences in educational achievement and enrollment rates between the treatment and control groups are small, except in the case of PRAF.

Finally, with respect to labor-market outcomes, the program datasets allow for nothing more than a simple definition of participation: individuals report if they work or if they do not. Employment ranges from 51.2 to 53.1 (PROGRESA) to 56.2–57.7 (RPS) and to 66.3–64.5 (PRAF). It is substantially higher for men than for women in the samples for the three countries (a difference of about 55 percentage points in PRAF and of about 70 percentage points in PROGRESA and in RPS). Employment is also higher in households with children and in single-headed households.

The unconditional means of socioeconomic and demographic statistics indicate some preprogram differences between treatment and control groups at the individual and household levels.Footnote 12 These results are generally in line with preexisting reports on these programs, which also found some significant differences between treatment and control localities (see Behrman and Todd 1999, for PROGRESA; Glewwe and Olinto 2004, for PRAF; and Maluccio and Flores 2005, for RPS). Given the nature of the random assignment process in the three programs, implemented at the locality level, these differences probably arise because of the small number of effectively randomized units (localities). These differences reflect the composition of the resulting samples rather than their selection into treatment. In any case, the estimations discussed below control for individual characteristics or, alternatively, for individual fixed effects to account for the ex post differences in the treatment and control samples.

4.2 The effect of CCTs on labor-market outcomes for adults

The analysis of labor-market outcomes for adults in the three programs is restricted to a common sample selection criterion which includes individuals between 15 and 80 years old. Estimates of household-level outcomes are restricted to household heads in the same age range.

The original evaluations focused primarily on each program’s intended outcomes, such as children’s health and education. The evaluation surveys have a much smaller set of labor-market indicators than larger periodic surveys use. In the three data sources employed in this study, the adult population can be divided into two alternate categories of labor-market status: those who work outside the home and those who do not.Footnote 13 The discussion refers to work, employment and labor supply interchangeably.

There are other labor-market outcomes of interest, besides employment status, that can be explored using these evaluation datasets: the number of hours worked in all occupations in a week (for those with positive hours); an indicator for employment in agricultural activities (for those employed); and the total hours worked in the household by members from 15 to 80 years of age (this variable is computed and estimated at the household level for households with positive hours).

As stated in the previous section, the results correspond to two alternative specifications for each outcome of interest. On the one hand, the tables report OLS estimates \(\beta \) in Eq. 1 with a series of controls:

  • Controls for individual characteristics: gender (if applicable), household size, an indicator for two-parent households, number of children, age of the individual, age squared, and educational indicators (complete primary through complete university).

  • Controls for household characteristics: the gender of the household head (if applicable), household size, an indicator for two-parent households, number of children of the head of household, a dummy variable indicating if at least one child in the household attends school, and indicators for the household head’s educational level.

The FE estimations, on the other hand, do not include any individual controls, since most of those listed above are time invariant or have low variability. All estimations include time effects, treatment indicators, interactions between the two, and locality controls, with standard errors clustered at the locality level. Finally, the results present estimates for the ITT for the three programs and for males and females separately. For the specific case of PROGRESA, the availability of eligibility status data means that average treatment effects (ATE) and ITE can also be computed. The tables only report the relevant coefficient for the treatment effects (the coefficient \(\beta \)).

The estimates for PRAF correspond to the simple two-period case (baseline in second half of 2000, follow-up in May–August 2002), while estimates for RPS and PROGRESA include multiple consecutive follow-up surveys.Footnote 14 The RPS baseline was established in August–September 2000, with a first follow-up in October 2001 and a second one in October 2002. For PROGRESA, the baseline corresponds to September 1997–March 1998, while the follow-up data were collected in November 1998, March 1999, and November 1999.

The evidence concerning the main theoretical question—the impact of each program on employment—is presented in Tables 2 and 3, which show the estimated coefficient of the treatment period/treatment status interaction in Eq. 1. Table 2 presents the results for Honduras’ PRAF and Nicaragua’s RPS programs. None of the estimates of the programs’ effects on employment are statistically different from zero at standard significance levels. The estimates range from −0.5 to −1.8 percentage points for PRAF and from −0.2 to −2.3 percentage points for RPS (with positive effects of 0.6–0.9 points for males in the first follow-up survey). In all cases, the effects are higher in terms of absolute value for females than for males.

Table 2 Program effect on employment: PRAF and RPS
Table 3 Program effect on employment: PROGRESA

Table 3 presents the results for PROGRESA. The coefficients on employment, estimated jointly for males and females, are also negative and are in a similar range to those reported in Table 2 (from about −0.3 to −2.6 percentage points). Despite some statistically significant coefficients, there does not appear to be a consistent pattern of significant results for all three follow-up periods or for both OLS and FE estimations. The overall and average treatment effects are compatible with a setting in which income effects are either small or counterbalanced by other forces, as discussed in Section 2.

However, the ITE estimates (effects on individuals in ineligible households) exhibit a higher degree of significance, which seems to be driven mostly by a large fall in employment among ineligible females in the third follow-up survey (about 3.5–4.9 percentage points, for FE and OLS estimations, respectively). While this result may be a statistical artifact, additional results regarding type of employment, hours and household labor income indicate that there may be composition effects within households and between eligible and ineligible individuals. These overall effects are discussed in detail below following the presentation of the rest of the empirical evidence.

Besides their effect on overall employment, the programs may also affect occupational choice. For instance, Skoufias et al. (2008) find that the Programa de Apoyo Alimentario (“Food Support Program) (PAL) program in Mexico induced workers to move away from agricultural work, which supports the idea that this kind of work acts as food insurance. Tables 4 and 5 present the results for regressions for employed individuals, in which the dependent variable is an indicator equal to one if they work in agricultural occupations and is zero otherwise.

Table 4 Program effect on agricultural employment: PRAF and RPS
Table 5 Program effect on agricultural employment: PROGRESA

The coefficients reported in Table 4 indicate that neither PRAF nor RPS induced a substantial shift in labor allocation to agricultural or other sectors at the aggregate level. The coefficients for the overall population are negative for PRAF and their sign for RPS depends on the estimation method that is used, but they are not statistically significant for either of the two programs. Table 5, however, indicates a positive, significant and large effect on agricultural employment in Mexico for ineligible males in the second and third follow-up rounds (of 6.1 and 4.5 percentage points, respectively), although these effects are statistically different from zero only for the FE estimates. In contrast, the average treatment effect for males (the effect on those eligible) is substantially closer to zero (ranging from −0.8 to 1.5 percentage points) and not significant at standard levels for any of the three treatment periods. This reinforces the existing evidence of the presence of composition effects by household and by eligibility status.

A more detailed picture of labor-market effects emerges from the analysis of Tables 6 and 7, which present regressions in which the dependent variable is the number of hours worked for individuals with strictly positive reported hours. The estimates for PRAF are consistently positive and small (from about 0.5 h to about 1.9 h per week), while those for RPS are consistently negative (from about −1.5 h to about −5.7 h) and are higher in terms of absolute values for women (−3 h to −5.7 h, depending on the follow-up and estimation method). However, none of the estimates for PRAF and RPS in Table 6 are significantly different from zero.

Table 6 Program effect on individual hours of work per week: PRAF and RPS
Table 7 Program effect on individual hours of work per week: PROGRESA

The estimates for PROGRESA (Table 7) are available for the first and third follow-up (information on hours worked was collected only for these surveys). The effects for all adults are substantially smaller than they are for PRAF and RPS, and they are not statistically significant for the full sample or for eligible and ineligible males.Footnote 15 However, there is a small but consistently significant positive average treatment effect of about 0.4 additional hours worked per week by female beneficiaries for the two available follow-up periods (ATE with OLS and FE estimates) and a smaller but still significant ITT estimate of 0.18–0.36 h in the third follow-up. These results are for working individuals and indicate a small adjustment in the intensive margin of labor supply for women, which is compatible with the idea that beneficiaries have more time available than before because of the increase in children’s school enrollment documented for PROGRESA.

While the former effect refers to the impact of the programs on individual hours of work, Tables 8 and 9 present the effects on total hours of work by adults in the household, per adult (these are household, not individual, estimates). The results for PRAF in Table 8 are similar to those shown in Table 6, with small and positive coefficients (in female-headed households, the coefficients are larger for OLS estimations), but the overall effects on number of hours worked are not significant. The results for RPS are also similar to those given in Table 6: there are larger negative effects in terms of the number of hours worked per adult, which was higher by the time of the first follow-up survey (a year after the baseline survey), but these estimates are not significantly different from zero. The results for RPS, however, differ from those reported by Maluccio (2007), who finds a significant fall in the hours worked by adults. The difference between Maluccio’s (2007) estimates and those presented here is that the dependent variable in the regressions reported in Table 8 is the total number of hours worked by adults in the household per adult, while Maluccio (2007) uses total overall hours for the household. Replicating Maluccio’s (2007) estimates indicates that there is indeed a negative and significant effect on total hours at the household level, but this is driven by a household composition effect: the number of adults in households fell significantly in female-headed households in RPS.Footnote 16 The results for PROGRESA in Table 9 indicate small and not statistically significant results for this household aggregate, even in female-headed households.

Table 8 Program effect on hours worked by adults in the household, per adult: PRAF and RPS
Table 9 Program effect on hours worked by adults in the household, per adult: PROGRESA

The overall results indicate that the programs did not introduce substantial disincentives to work, with no significant effects on the intensive or the extensive margin of labor supply for individuals and households in treatment localities. The small but significant increase in hours of work for female beneficiaries in PROGRESA is compatible with the presence of other factors that counterbalance the income effects, as discussed in Section 2.

However, the empirical results indicate that PROGRESA led to a substantial reduction in employment levels for ineligible women and a shift among ineligible men toward work in agricultural activities. These results are compatible with Angelucci and De Giorgi’s (2009) evidence on PROGRESA’s effects on ineligible individuals. The following section deals with the effect of PROGRESA on wages and labor income and provides a fuller picture of the program’s effect on labor-market outcomes for adults.

4.3 The effect of PROGRESA on wages and household labor income

The discussion in Section 2 highlighted the possibility that cash transfer programs such as PROGRESA can have equilibrium effects by, for instance, shifting the aggregate labor supply curve by withdrawing children from the labor market, by freeing up adults’ time, or by changing the latter’s willingness to work through an income effect. It may also change relative remuneration levels, for instance by changing the sector allocation balance between beneficiaries and non-beneficiaries, as observed for PROGRESA in Table 5. The PROGRESA dataset provides a basis for an analysis of the program’s effects on individual hourly wages and on household labor income per adult (this information was not collected by the PRAF and RPS evaluation surveys).

The results of the regressions presented in Table 10 indicate that PROGRESA had a sizeable effect on hourly wages in the treatment localities, although this effect seems to be driven mainly by eligible males (none of the coefficients for females are statistically significant, and none of the indirect treatment effect estimates are either). The ITT estimates indicate an increase in hourly wages of about 5.7 % by the time of the third follow-up survey, with a higher average treatment effect coefficient of about 6.9 % (both coefficients are significant only for the FE estimates). When restricting the sample to males, the ITT and ATE FE estimates indicate an effect of about 7.5 and 9.8 %, respectively.

Table 10 Program effect on individual log hourly wages: PROGRESA

Finally, these higher hourly wages are partially reflected in higher levels of household labor income per adult. This effect is reported in Table 11, which indicates an increase of about 3.9–4.6 % (for FE estimations, ITT, and ATE, respectively), which is concentrated in the third round of the follow-up (2 years after the baseline survey) and among male-headed households. These effects, however, are statistically significant only at the 10 % level and then only for fixed-effects ITT and ATE estimates. This evidence on labor income for adults is not incompatible with Angelucci and De Giorgi’s (2009) results for monthly adult equivalent labor earnings in PROGRESA.Footnote 17 These effects on individual wages and household earnings are discussed in the context of the overall results for PROGRESA in the following section.

Table 11 Program effect on log household labor income, per adult: PROGRESA

5 Discussion and conclusions

This study of the effect of welfare programs on work incentives and the adult labor supply in developing countries is based on estimates derived from the experimental evaluations of three programs implemented in rural areas: PROGRESA in Mexico, RPS in Nicaragua, and PRAF in Honduras.

The empirical results indicate that none of the three CCT programs has had any major impact on labor-market outcomes for adults, with no discernible effects on any of the outcomes considered for PRAF and RPS being detected, but with more complex and nuanced patterns of response emerging in the case of PROGRESA. The overall results indicate that the programs have not introduced any substantial disincentives to work and that they have had no significant effect on the intensive or the extensive margin of labor supply for individuals or households in treatment localities. The finding that substantial monetary transfers have not had an impact on employment is compatible with a setting in which income effects (assuming that leisure is a normal good for beneficiaries) are either small or are counterbalanced by some of the other factors discussed in Section 2. This seems to be the case, for instance, with regard to the small (about 0.4 h per week) but significant increase in the number of hours of work for female beneficiaries in PROGRESA. This increase in the intensive margin of the labor supply for working women is compatible with the existence of some of the channels that may link CCTs with an increasing labor supply. More specifically, this effect can be associated with the program’s positive effect on the school enrollment rate for children in eligible households, which in turn may have increased the amount of time that women have available to devote to paid employment. Indeed, previous evidence for the program found that it did, in fact, reduce women’s participation in domestic work (Parker and Skoufias 2000).

The results for PROGRESA, however, point to the presence of more complex effects on labor-market outcomes. Angelucci and De Giorgi (2009) document the various types of positive welfare effects that PROGRESA has had on ineligible households in program localities; for instance, they report higher consumption, which seems to be mediated by credit markets, gifts and the overall effect of the program on the local economy. This study confirms the presence of these spillover and equilibrium effects and provides additional evidence in the form of the impact in terms of the labor-market outcomes for adults. The indirect treatment effects signal a reduction in employment, which is mostly driven by ineligible females. This contrasts with the increase in the number of hours worked by eligible females, although the latter effect is too small to suggest that eligible women fully displaced their ineligible counterparts from their jobs. The lack of a substantial effect with regard to the number of hours worked or household labor income per adult for the ineligible population indicates the presence of some reallocation of labor within ineligible households. Moreover, there was a substantially greater shift toward agricultural employment on the part of ineligible males than there was among their eligible counterparts. On the other hand, the program substantially increased the wages and labor income of males in eligible households. This evidence, taken as a whole, suggests that PROGRESA allowed eligible males to move away from agricultural work and toward higher-paying employment, with neutral sector reallocation at the locality level (as indicated by the lack of significant aggregate effects on agricultural employment). The evidence concerning the relative shift of ineligibles toward agricultural labor is consistent with the finding of Skoufias et al. (2008) that the PAL program in Mexico induced workers to move away from agricultural work. It also further supports the idea that employment in agriculture acts as food insurance. Eligible individuals in PROGRESA seemed to be able to take advantage of more risky but potentially more rewarding nonagricultural employment opportunities or at least to do so more than their ineligible counterparts could.

Taken as a whole, the evidence indicates that, while CCT programs in poor rural areas with high benefit levels do not create a major disincentive to work, they may still have effects in terms of employment-related outcomes and may nonetheless influence the equilibrium of the labor market. The presence of equilibrium effects in this context should not be surprising: for example, in the first stage of PROGRESA’s implementation, about half of the households in treatment localities received a transfer equivalent to 40 % of their income. The aggregate effect of such large, widespread transfers must have had an impact at the community level, above and beyond their effect on eligible households, and this is reflected in Angelucci and De Giorgi’s (2009) finding that PROGRESA has had a positive effect on the consumption levels of ineligible households.

These results have important implications for the evaluation, design, implementation and scope of future programs. Equilibrium effects complicate the interpretation of reduced form estimates from randomized controlled experiments, which is a long-standing discussion in regard to the analysis of welfare programs (see, for instance, Browning’s (1971) critique of Orcutt and Orcutt’s (1968) randomized negative income tax experiments based on feedback effects of wages). In terms of the empirical results presented here, the programs’ impacts can be attributed to shifts in sector allocation and access to better income-generating opportunities for males in eligible households and to the increase in the amount of time available to women associated with higher school enrollment rates for children. It is not possible, however, to rule out further effects linked to program-induced aggregate changes in labor demand. Moreover, in the case of CCTs whose intended outcomes span multiple dimensions, indirect effects on labor-market outcomes could be confounded with the direct impact of the transfers and the programs’ conditionalities. The results suggest that labor-market effects, apart from work disincentives, should be taken into account in the design of welfare programs in developing countries and that the evaluations of such programs should seek to disentangle the underlying mechanisms that are at work.