1 Introduction

In Ethiopia, the Agricultural Growth Program (AGP) is a multifaceted investment program by the government and development partners, with a focus on agricultural development. Since its inception in 2011, the AGP has implemented activities to strengthen the capacity of farmer organizations and their service providers to scale up best practices and support marketing and processing of selected agricultural outputs through engagement with the private sector. In addition, the program supported the construction, rehabilitation and management of small-scale rural infrastructure and increased the efficiency of key value chains through improved access to markets. The AGP aimed to improve agricultural productivity, enhance production efficiency and support commercialization of smallholder farmers. Growth in agricultural productivity and enhancing commercialization can increase and stabilize food supplies, and increase the ability to purchase food, with implications on household food security (Mainuddin & Kirby, 2009).

In smallholder agriculture, productivity is generally defined in terms of technical change and improvement in the efficiency with which inputs are transformed into outputs in the production process (Chambers, 1988). Technical change refers to increases in the set of production possibilities that come about through increased research and development, while technical efficiency enhancement refers to increases in output–input ratios made possible by developments in the production process (Kumbhakar & Lovell, 2000). Hence, in the context of AGP, technical progress would indicate the impact of shifts in production technology from the use of irrigation, high yielding varieties of seeds, modern agricultural machinery, fertilizers, pesticides, and other inputs. It would also capture the effects of improved utilization of land and labor resources, and changes in cropping pattern in favor of high value-added crops.

There are several theoretical channels by which such types of agricultural interventions affect agricultural production and household food security (Sileshi et al., 2019; Majumder et al., 2016). Agricultural programs may: (i) impact on farmer input use decisions through changing the relative prices of agricultural inputs and outputs (Serra et al., 2005); (ii) change investment and on-farm labor supply decisions through an income effect (Young & Westcott, 2000; Serra et al., 2005); and (iii) enhance an insurance effect for possible agricultural risk mitigation (Burfisher & Hopkins, 2003). Consistent with the theoretical justification, specific benefits expected from the AGP include improved productivity, value addition and market opportunities. In these ways the program is expected to increase income, employment opportunities and the food security of smallholder households.

The income and insurance effects may shift the working motivation of farmers (such as in terms of quality and quantity of on-farm labor supply decisions), adoption of farm technologies and allocations of inputs (Zhu & Lansink, 2010). It is expected that these effects will change farmers’ production capacity. The observed variation among individual farmer decisions can be associated with variations in their risk aversion behavior (Binswanger, 1980). If farmers are risk averse which is usually the case for smallholder farmers in developing countries, agricultural support policies will directly affect farm investment decisions and on farm production in the presence of uncertainty (Hennessy, 1998; Yesuf & Bluffstone, 2009).

The income effect of agricultural programs on efficiency and productivity is unknown a priori. If the agricultural program supports farmers with the required finance for technology adoption or to invest in efficiency-improving on-farm activities, then the technical efficiency level of farmers is expected to increase. On the other hand, technical efficiency may decline if farmers are less motivated to do well due to more income stemming from the program. Hence, the actual impact of the AGP on farmer performance is worth empirical investigation.

There are several studies that focus on the impacts of agricultural policy reforms on farm economic performance using efficiency and productivity analyses (e.g., Coelli et al., 2002; Hadley, 2006; Murillo et al., 2007). Furthermore, there is also extensive literature on total factor productivity (TFP) that usually decomposes TFP into the contributions of technical efficiency, technical change, and scale efficiency (Kumbhakar & Lovell, 2000; Karagiannis et al., 2004; Majumder et al., 2016). Technical change is the rise in the maximum output that can be achieved from a given level of inputs (a shift in the production frontier) while technical efficiency shift is the change in a farmer’s capacity to produce maximum output given its set of production factors (how close it is to the production frontier). Scale efficiency change is the degree to which a farm household is enhancing the scale of its operations.

The objectives of this study were to evaluate the roles of technical progress, technical efficiency and scale efficiency as determinants of TFP and investigate whether farm households in AGP implementation districts were more likely to increase TFP and its components compared to those in non AGP districts. More specifically, this study examined the extent to which AGP impacts the smallholder’s technological progress, farm technical and scale efficiency, and TFP.

The extent to which the AGP has affected farmers’ performance in terms of change in efficiency and productivity is an interesting policy issue. Thus, by carefully examining the constituent parts of agricultural productivity and analyzing the drivers of productivity and efficiency change, this study shed light on the ways AGP can affect agricultural growth and evaluate the pathways to productivity. Furthermore, from the policy point of view, it is imperative to identify whether technological progress has stagnated over time or whether technology has been used in ways to achieve its full potential. Because technical advances and efficiency change constitute different sources of TFP growth, different policies tailored to the different parts may be required to increase household food security.

1.1 Methodological framework

1.1.1 Stochastic Frontier Analysis (SFA) and TFP components

In the case of smallholder agriculture where multiple-outputs and factors of production prevail, TFP can simply be expressed as the ratio of aggregate outputs to aggregate inputs. The general stochastic production frontier model for panel data can be written as (Battese & Coelli, 1992):

$$ {y}_{it}=f\left(t,{z}_{it},\beta \right).\exp \left({v}_{it}\right).\exp \left(-{u}_{it}\right),\kern0.5em {u}_{it}\ge 0 $$
(1)

where yit and zit are respectively the vectors of outputs and production factors used in the farm in year t (t = 1,. ., T) for farm household i (i = 1, 2, 3,. .., N) where t is a time trend index that serves as a proxy for technical change; β is the vector of parameters to be estimated. The vectors vit and uit represent different error components that are assumed to be independently distributed. vit refers to the random part of the error, while uit is a downward deviation from the production frontier that can be shown by the negative sign and the restriction uit > 0. Thus, f(t, zit, β). exp(vit) represents the stochastic frontier of production and vit has symmetrical distribution to capture the random effects of measuring errors and shocks that cause the position of the deterministic part of the frontier to vary from farmer to farmer. The systematic error term, uit, is associated with output-orientated technical inefficiency. The level of technical efficiency (TE) in production for the ith farm households at the tth year, that is the ratio of observed output to potential output (which is given by the frontier), is captured by the component exp(−uit) (Coelli et al., 2002), and, therefore, 0 < TEit < 1.

An index of technological change (TCit) for the ith farm households can be directly calculated from the estimated parameters of the stochastic production frontier (Coelli et al., 2002). TC is mathematically obtained by taking the first partial derivatives of the production function with respect to the time variable, where TC < 0 is technological regression and TC > 0 indicates technological progress. Assuming a translog production technology with six inputs (see Table 2 for the description of factors of production), the model can be expressed in the following way:

$$ \ln {y}_{it}={\beta}_0+\sum \limits_{k=1}^6{\beta}_k\ln {z}_{kit}+\frac{1}{2}\sum \limits_{k=1}^6\sum \limits_{j=1}^6{\beta}_{kj}\ln {z}_{kit}\ln {z}_{jit}+{\beta}_tt+\frac{1}{2}{\beta}_{tt}{t}^2+\sum \limits_{k=1}^6{\beta}_{tk}t\ln {z}_{kit}+{v}_{it}-{u}_{it} $$
(2)

Kumbhakar (2000) has suggested a productivity change decomposition analysis that accounts for divisions of productivity changes into technical efficiency, technical innovation and scale effects. The components of Total Factor Productivity (TFPg) can be obtained from algebraic manipulations of the deterministic part of eq. (2).

The rate of TFP change can be specified as:

$$ {TFP}_g={TC}_{it}-u+\left( RTS-1\right)\sum \limits_{k=1}^6{\lambda}_{zk}.{g}_{zk}. $$
(3)

where RTS denotes returns to scale with \( RTS=\sum \limits_{k=1}^6{\varepsilon}_{zk} \); and λzk = εzk/RTS is defined as shares of the kth elasticity of production.

Thus, TFP can be decomposed into three elements: technical progress (TC), change in technical efficiency, denoted by \( \dot{u} \) and scale efficiency (SE), given by \( \left( RTS-1\right)\sum \limits_{k=1}^6{\lambda}_{zk}.{g}_{zk} \).

Scale efficiency, which is traditionally considered as a measure inherently related to the returns to scale of a technology in the production process, shows how close an observed farmer is to the optimal scale (Coelli et al., 2002).

The effect of each component of TFP depends on the relative changes of each part. If technical inefficiency does not change over time, it does not have any impact on the rate of change of productivity. The same is true with technological change. However, the role of economies of scale depends both on the relative strength of technology as well as on factor intensities. The scale effect source of productivity change is present only if the production technology is characterized by the variable returns to scale (Coelli et al., 2002).

1.1.2 Estimation of impact of AGP on TFP, TC, SE and TE

We employ a regime switching regression with the stochastic frontier model to investigate change in TFP, TC, TE and SE due to AGP participation. Given that participation in the AGP is important for increasing productivity, it is intuitively reasonable to consider the econometric issue of the heterogeneity of the impacts of the program among farm households in districts where the AGP was implemented and non-AGP districts. The reduced form of the stochastic production function in eq. (2) was estimated separately for households in AGP districts and non-AGP districts. The standard econometric method of assessing the effects of program participation is to use a dummy indicator variable for the program participation over a pooled sample of observations. This assumes program participation could have only an intercept shifting effect and there are common slope coefficients for farm households in AGP and non-AGP districts. However, the set of variables that affect productivity could depend on whether one participates in the program or not. If this is true, then estimation of eq. (2) on a sample of farm households that pools across the program participation classification may lead to biased estimates. I used the Chow test (Chow, 1960), which rejected the hypothesis of equality of the slope coefficients for the two groups [χ2(35) = 116.49 with p value = 0.000]. This result confirmed that controlling for input factors and time variable, farm productivity is structurally different between farm households in AGP and non-AGP districts.

The gaps between AGP and non-AGP participant in TE, SE, TC and TFP may be decomposed into a part accounted for by differences in household heterogeneities and a part accounted for by differences in coefficients. The former is interpreted as AGP participation differences in TE, SE, TC and TFP attributable to differences in underlying socio-economic characteristics between households in AGP and non-AGP districts.

Consider the following exogenous switching regression model specified by regressing the household specific indices of TE, TC, SE and TFP against a set of farm and household specific characteristics for the group AGP = 0, 1 (non-participant and participant, respectively):

$$ {\displaystyle \begin{array}{cc}{I}_{it,1}={\alpha}_{0,1}+{\alpha}_{t,1}t+{\alpha}_1{x}_{it,1}+{c}_{i,1}+{\varepsilon}_{it,1}& \mathrm{if}\ \mathrm{AGP}=1\\ {}{I}_{it,0}={\alpha}_{0,0}+{\alpha}_{t,0}+{\alpha}_0{x}_{it,0}+{c}_{i.o}+{\varepsilon}_{it,0}& \mathrm{if}\ \mathrm{AGP}=0\end{array}} $$
(4)

where Iit is the TE, SE, TC and TFP index, xit is a column vector of covariates hypothesized to be correlated with efficiency and factor productivity; α is unknown parameters to be estimated. The time trend variable t captures temporal changes in efficiency and factor productivity; ci is unobserved individual specific random disturbance which is constant over time; εit is an idiosyncratic error term which varies across time and individuals and is independently and normally distributed with mean zero and common variance σ2.

Because all the dependent variables are censored (they are limited between 0 and 1), the parameters for TFP, TC, TE and SE are obtained through maximum likelihood estimation (MLE) by applying a two-tailed correlated random effects (CRE) Tobit procedure or the Mundlak–Chamberlain device, following Mundlak (1978) and Chamberlain (1984). The Tobit approach has been used in several studies to evaluate the factors influencing farm inefficiencies and technology adoption (e.g., Fernandez-Cornejo, 1994; Wadud, 2003). To implement the CRE framework in eq. (4), I included a vector of variables containing the means for household i of all time-varying covariates. These variables have the same value for each household in every year but vary across households. One benefit of the CRE estimator is that the assumption of independence between the covariates and unobserved individual specific random term can be relaxed by including the vector of time-averaged variables. This can control for time-constant unobserved heterogeneity as with fixed-effects while avoiding the problem of incidental parameters in nonlinear models and allows for measurement of the effects of time-constant independent variables (Wooldridge, 2002).

The conditional expectations for each outcome variable were computed by manipulating model (4) in the actual and counterfactual scenarios:

$$ E\left({I}_1\left| AGP=1\right.\right)={\delta}_1t+{\alpha}_1{x}_1 $$
(5a)
$$ E\left({I}_0\left| AGP=1\right.\right)={\delta}_0t+{\alpha}_0{x}_1 $$
(5b)
$$ E\left({I}_0\left| AGP=0\right.\right)={\delta}_0t+{\alpha}_0{x}_0 $$
(5c)
$$ E\left({I}_1\left| AGP=0\right.\right)={\delta}_1t+{\alpha}_1{x}_0 $$
(5d)

The “actual” AGP and Non-AGP scenarios are the ones actually observed in the data (5a and 5c, respectively). The “counterfactual” scenarios show what the TE, SE, TC and TFP outcomes for households in AGP districts (AGP = 1) would be, if they had had the same characteristics as the households in non-AGP districts, and vice versa (5b and 5d). Alternatively, what the TE, SE, TC and TFP outcomes of households in non-AGP districts (AGP = 0) would be if the responses (coefficients) to their characteristics had been the same as the current returns to the characteristics of households in the AGP districts, and vice versa. Using these conditional expectations and considering the AGP participation variable as a “treatment” variable, the average TE, SE, TC and TFP expected values in the real and hypothetical scenarios are presented in Table 1 and their definitions are given below the tableFootnote 1:

Table 1 Conditional expectations, Agricultural Growth program (AGP) effects and heterogeneity effects

The AGP participation gap in TE, SE, TC and TFP for households in the AGP districts is defined if households in the AGP districts had had the same characteristics as they do now, but the same returns to those characteristics as households in the non-AGP districts have now. This is given as the difference between (5a) and (5b):

$$ {ATT}_I=E\left({I}_1\left| AGP=1\right.\right)-E\left({I}_0\left| AGP=1\right.\right)=\left({\delta}_0t+{\alpha}_0{x}_1\right)=\left({\delta}_1-{\delta}_0\right)t+\left({\alpha}_1-{\alpha}_0\right){x}_1 $$
(6)

Similarly, the expected change in TE, SE, TC and TFP for households in the non-AGP districts if they had had the same returns to their characteristics as the households in the AGP districts have now, is given as the difference between (5c) and (5d):

$$ {ATU}_I=E\left({I}_1\left| AGP=0\right.\right)-E\left({I}_0\left| AGP=0\right.\right)=\left({\delta}_1t+{\alpha}_1{x}_0\right)-\left({\delta}_0t+{\alpha}_0{x}_0\right)=\left({\delta}_1-{\delta}_0\right)t\left({\alpha}_1-{\alpha}_0\right){x}_0 $$
(7)

Equations (6) and (7) are equivalent to the average treatment effect on the treated (ATT) and on the untreated (ATU), respectively. In this study, the ATT and ATU indicate what the TE, SE, TC and TFP outcomes that households in the AGP districts would have had if the factors facing them had been the same as those currently facing households in the Non-AGP districts, and vice versa. The difference between (6) and (7) can also be used to compute the composition or heterogeneity effects, for instance, due to resource quality difference, managerial skill, and differential access to services. The composition (heterogeneity) effects show, respectively, what the difference would have been if all households had had the current AGP households’ responses and the current non-AGP households’ responses to the observable characteristics. This provides information on whether the AGP participation gap on TE/TFP is larger or smaller due to characteristics of AGP households or non-AGP households. Households in the AGP and non-AGP districts do in fact have different observable characteristics, and this would have an impact even if their responses to the characteristics had been the same.

1.2 The AGP program, data and study areas

The AGP aims primarily to increase agricultural productivity and market access for key crop and livestock products in targeted areas with increased participation of women and youth. The program has the following components (Berhane et al., 2013). The first aims to sustainably increase the productivity of crop and livestock value chains, improve access to markets, and enhance agricultural commercialization. Crop production and productivity improvement is targeted through measures such as scaling up of best practices, promoting use of fertilizers, increasing the availability and adoption of improved inputs, agronomic practices, and increased use of small-scale irrigation. In the livestock sub-sector, increased growth is expected to be achieved through improving availability of feed both in quantity and quality, provision of veterinary services, improved breeds and market infrastructure, and enhancing value addition. The second component aims to finance and provide technical support for rural infrastructure. Priority was given to water development for small-scale irrigation. The expected results from this component are improved small-scale irrigation infrastructure to increase productivity and enhance the adaptive capacity of farmers to extreme climatic shocks, market infrastructure for enhanced commercialization, and capacity of public institutions for enhanced service delivery.

The models developed in the previous sections were applied to three rounds of panel datasets spanning a 5-year period, 2011–2017. The data used for this study was obtained from a farm household survey in AGP participant and non-AGP districts. A total of 93 districts, out of which 61 AGP participant and 32 non-AGP districts were selected from the four regional states of the country, Amhara, Tigray, Oromia and SNNP. A randomly selected 7927 farm households participated in the baseline survey. The attrition rate for the whole period was about 10%.

The household survey collected agriculture-specific production data, household characteristics and community features. As a first step, I distinguished the main factors of production considered in the stochastic frontier model: land, labor, seed, animal power, fertilizer and agro-chemicals. Land was defined as the total hectares of land cultivated by the household, and labor input was the sum of family and hired labor time measured in adult equivalent person-days. Animal power was the number of oxen owned by the household which was used as a proxy for animal traction power. Farmers produce different crop types on different plots. But for the purpose of this study, I aggregated crop output and included all seasons. This variable was measured in monetary value (‘000 Birr/ha). The current district level average price was used to aggregate the values of the crops. The summary statistics for inputs and output variables for the entire sample and disaggregated by year and AGP participation status are presented in Table 2. Similarly, Table 2 provides the definitions and descriptive statistics of the prospective explanatory variables. The selection of the empirical specification is grounded on a theoretical behavioral hypothesis and draws on previous similar empirical technical efficiency and total factor productivity literature (Huang & Kalirajan, 1997; Coelli et al., 2002; Wadud, 2003; Zhu & Lansink, 2010; Majumder et al., 2016; Sileshi et al., 2019).

Table 2 Descriptive statistics and variable description

Based on these empirical works and economic theory, household, farm, and location characteristics were summarized in the empirical specifications. These include household composition, education, wealth (including livestock ownership) and other sources of income such as participation in off-farm activities; social capital and network (membership in formal and informal organizations), current shocks/stresses experienced on crop production (such as plot level disturbances and rainfall shocks), land tenure security, and agro-ecology of the location. A wide range of plot-specific attributes such as soil fertility, depth, slope, farm size in hectares, farm fragmentation, distance of plot from residence and detailed agricultural practices were also considered in the empirical specification.

2 Empirical results

2.1 Hypothesis tests

I estimated the pooled frontier model for the crop farms with AGP participation as a dummy variable and examined whether AGP participation status specific frontier function was a better representation than the pooled frontier production function. The Chow test result (χ2(35) = 116.49 with p value <0.0000) suggested that the production technology specification for AGP participant and non-participant are structurally different and hence the hypothesis that the pooled frontier function is a correct presentation of AGP participant and non-participants’ production technology was rejected. Therefore, the subsequent presentation will be based on the estimation of the results separately for the AGP participant and non-participant specific production frontier function.

I conducted a formal specification test to determine the chosen model specification fitting to the data. The first null hypothesis that the Cobb–Douglas production function is a good representation for the data (H0 : βkj = 0, k, j = 1, …, 6) was strongly rejected indicating that the Translog production function is the preferred model.Footnote 2 The second null hypothesis, that there is no technological change over time (H0 : βt = βtt = βtj = 0, j = 1, …, 6) was also strongly rejected by the data, indicating that technological change exists in the study area.Footnote 3 The magnitude and direction of technological change will be determined and discussed later.

The result provides statistical confirmation that the distribution of the random variable, uit, has a non-zero mean and is truncated below zero. The null hypothesis η =0 which suggests time invariant technical inefficiency model, was statistically rejected in both AGP participants and non-participants’ models. This showed that technical efficiency levels vary significantly over time. η equals 0.135 and 0.052 in the AGP participant and non-participant models respectively, showing that the technical inefficiency of crop production increased by 14% for participants and 5% for non-participants in the period 2011 to 2017. The test of significance in both AGP participant and non-participant models indicated the rejection of the null hypotheses of no technical inefficiency effects.

2.2 Econometric estimates – Stochastic Frontier model

Here I provide results of the parameter estimates of the stochastic production functions and technical efficiency, technological change, returns to scale and their roles to the observed change in productivity. The parameter estimates for the Translog stochastic frontier production function are shown in Table 3. Out of 35 coefficients, 21 coefficients from the AGP-districts and 16 from the non AGP-districts estimations were significantly different from zero at least at the 10% level indicating the importance of some of the interactions and non-linearity among variables. All the five inputs considered in the estimations were found to be the major determinants of output growth in multiple ways: in isolation or when interacted with the other inputs, or both. These factors of production, however, have shown heterogeneous effects on output growth for farm households in the AGP and non-AGP districts. This result confirms the presence of structural differences of production functions among households with and without AGP.

Table 3 Stochastic production functions estimations for crop productions in AGP and Non-AGP districts (Dependent variable: Farm revenue)

The indicator for returns to scale was obtained by summing the elasticities of the five factors of production included in the model. Table 4 indicates that the average farm household in the AGP districts had decreasing returns to scale (0.604) and so did the average households in the non-AGP districts (0.227). The null hypothesis of constant returns to scale was statistically rejected at 1% significance level in favor of decreasing returns in both AGP and non-AGP districts. Overall, the result suggests that an increase in the amounts of productive factors leads to a less than proportionate output growth; and the scale elasticity for farm households in AGP districts was statistically lower than households in the AGP districts. This means that even if the scale effect on productivity change could be high in both AGP and non-AGP farm households, relative output change is higher for farm households in non-AGP districts than households in the AGP districts.

Table 4 Estimated Scale Elasticity and Scale Efficiency

Table 3 also shows that there is significant technological bias for both groups of households. The negative and significant coefficient of the time-trend variable in the model indicates that there was an initial technological regress on production; however, it increased over time as indicated by the significant positive coefficients of the squared terms. The frontier is shifting downwards at a rate of 16% for households in the AGP districts and 11% for households in the non-AGP districts. However, the non-linearity of technical change implies that there was technological progress over time that moved the production frontier upwards. Technological progress for farm households in the AGP districts is about 10%. This was statistically higher than the technical progress for households in the non-AGP districts, by about 2%. Moreover, based on a joint significance test of inputs, the null hypothesis of neutral technological progressFootnote 4 (H0 : βtj = 0, j = 1, …, 6 inputs) was generally rejected at a 1% significance level. The next section examines whether the changes in productivity are caused mainly by changes in the scale of production, or by differences in rates of technological change or technical efficiency change.

2.3 TFP and its decomposition

Table 5 presents the average TFP scores and corresponding decomposition results by year and based on AGP participation status. The TFP index is decomposed into technical change (TC), technical efficiency change (TE) and scale efficiency (SE). Overall, there are substantial variations in the growth parameters and their components across years and across households in AGP and non-AGP districts. There was substantial variation in average technical efficiencies over time among households in AGP and non-AGP districts.

Table 5 Summary statistics of Total Factor Productivity (TFP), Technical Change (TC), Technical Efficiency (TE) and Scale Efficiency (SE) index by time and AGP participation status

This result also shows a consistent decline in the technical efficiency of farmers over time from 68% in 2011 to 57% in 2017 for farm households in AGP districts; and from 70% in 2011 to 68% in 2017 for households in non-AGP districts (Table 5). While technical efficiency of farm households was decreasing over time in both AGP and non-AGP districts, the average annual rate of decrease for households in AGP districts (4%) was statistically higher than for households in non-AGP districts (1%). This implies that farmers in AGP districts have become more technically inefficient over time during the AGP implementation period.

Scale efficiency in this study was small across years and for households in both AGP and non-AGP districts. Table 5 shows that the average scale efficiencies were 20% for farmers in the AGP districts and 25% in non-AGP districts. This implies that these farmers could have further increased their farm output by 75 to 80% if they had adopted an optimal scale of operation. The results also show that about 39% of farm households in the non-AGP districts and 49% of those in the AGP districts showed increasing returns to scale (Table 4). These farmers produce at a suboptimal level, i.e., the output levels of these groups of farmers were lower than the optimal levels and they should expand their operation to reach the optimal scale. The scale efficiency of these farmers is noticeably lower than the average 20% (about 10–12% for both groups of households) and the average returns to scale was higher than unity (1.7–2.6). Only about 4 % of the farmers in the sample were characterized as operating at an optimal scale, while 47% of the AGP and 57% of the non-AGP groups had decreasing returns to scale.

Table 6 presents the average results of the TFP decomposition for farm households in AGP and non-AGP groups. In aggregate, TFP increased at an average rate of 7.8% per annum during the AGP implementation period (2011–2017). This growth rate was slightly higher for farm households in AGP districts than households in non-AGP Districts (8.3% versus 6.6%). The rate of change in TFP for farm households in both AGP and non-AGP districts appeared to be relatively heterogeneous over the two sub periods (2011 to 2013 and 2013 to 2017) of the AGP implementation period. In the AGP districts, annual growth of TFP was positive but decreasing gradually — increasing by about 14% during 2011–2013 and by about 3 % during 2013–2017. Similarly, the annual growth of TFP for farm households in non-AGP districts showed a declining trend over the two sub periods from 9.5% during the first sub period to 3.8% during 2013–2017.

Table 6 Decomposition of total factor productivity (TFP) annual change, 2011–2017

The substantial share of TFP growth is related to technical progress (increasing at an average annual rate of 6.8%) and increases in scale efficiency (3.4% of annual growth). In both groups of farm households, the contribution of technological change for increasing factor productivity appeared to be positive over the two sub-periods of the AGP implementation period, although the change in technological progress over the two periods was not constant for farm households in the AGP districts (12.8 and 2.8%) and households in the non-AGP districts (it varied from 6.7 and 2.8).

Technical change contributed about 88% of the increase in productivity between 2011 and 2014, and 83% between 2014 and 2017. While scale efficiency showed a positive annual growth rate of about 3 % (but with a declining trend over time from 5% to 2%), its contribution to the growth of TFP also increased between the two sub periods from 39% to 42%. In contrast, the technical efficiency had a diminishing growth rate over time in both groups of households. However, the rate of decline of technical efficiency was low and constant over time (1% annually) for farm households in non-AGP districts compared with households in the AGP districts, (it declined annually from 3% to 5%). This technical inefficiency contributed to a decline of TFP growth rate by about 5% annually.

Figure 1 also shows the effect of TC, SE and TE on TFP more clearly. The relationship between TFP and TE was not linear for both groups of farm households. TFP declines at lower levels of TE. The positive contribution of TE to the growth of TFP was observed at higher levels of TE. However, as described above, Fig. 1 (Panel b) also shows an almost linear relationship between TC and TFP. Similar to the pattern of TE and TFP, Fig. 1 (Panel c) shows a U-shaped relationship between SE and TFP. Compared to ‘Panel a’ of Fig. 1 (the effect of TE on TFP), the positive contribution of SE on TFP starts at an early stage of SE, before the farm household attains about 50% scale efficiency.

Fig. 1
figure 1

Cumulative distribution for impact of AGP participation on technical efficiency (TE) by quantiles of TE

2.4 Impacts of AGP on TE, SE, TC and TFP

The findings on TE, SE, TC and TFP so far is based on simple averages of the outcomes among farm households in AGP and non-AGP districts. However, this approach does not disentangle whether the change in TE, SE, TC and TFP is due to program effects or other households and farm heterogeneities. Accordingly, as described above, I estimated the CRE regression of TE, SE, TC and TFP indices for households separately in the AGP participant districts and non-AGP districts. The estimation results of the outcomes (TE, SE, TC and TFP) equations will be discussed in the next section. However, several variables in the model showed significant correlation with the outcome variables and there were differences between the outcome equations coefficients among households in AGP and non-AGP districts. This illustrates the heterogeneity in the sample with respect to TE, SE, TC and TFP.

From the regression estimates, the average AGP effect on the population (ATE) was derived, as well as the average AGP effect on the AGP participant (ATT) and the average treatment effect on the non-participants (ATU). The unconditional average effect is presented in Table 7. The unconditional average effects indicate that the average annual TE, SE, TC and TFP indices for farm households in the AGP districts were significantly different from those of farm households in the non-AGP districts. This presents an 11%, 15% and 32% TFP, TC and SE increment, respectively, and a 16% decline in TE due to AGP. This expected effect is different from the percentage difference in the simple sample means between farm households in the AGP and non-AGP districts shown in Table 5. The simple contrast with unconditional average effect among participant and non-participant groups may provide misleading conclusions because the approach does not take in to consideration the difference in the outcome variables that may be caused by observed and unobserved factors.

Table 7 Unconditional AGP participation effects on total factor productivity, technological progress, technical efficiency and scale efficiency (results from correlated random effects estimation)

The result of the decomposition analysis for the switching regression with CREs model for TFP, TC, TE and SE are presented in Tables 8, 9, 10 and 11. As in Teklewold et al. (2013), I analyzed the differences in TFP, TC, TE and SE among AGP participant and non-participant households into two parts: the first part is attributable to differences in underlying socio-economic characteristics (such as household and farm characteristics) called “composition or heterogeneity effect”, while the second is attributable to the “response” to these characteristics called “program effects”. To determine the average effects of the program on the participants, the expected outcome variables (TFP, TC, TE and SE) for AGP participant were compared with their counterfactual - what they would have been if they had not participated in AGP - by comparing columns A and B of Tables 8, 9, 10 and 11. This is shown in column C computed as the difference between columns A and B.

Table 8 Average expected total factor productivity (TFP) and conditional AGP participation effects (results from correlated random effects estimation)
Table 9 Average expected technical progress (TC) and conditional AGP participation effects (results from correlated random effects estimation)
Table 10 Average expected technical efficiency (TE) and conditional AGP participation effects (results from correlated random effects estimation)
Table 11 Average expected scale efficiency (SE) and conditional AGP participation effects (results from correlated random effects estimation)

Participation in AGP lead to a 13% higher TFP (i.e., 0.848 to 0.716); on average, compared to non-participation (Table 8). The difference was statistically significant. In the counterfactual cases, farm households who actually participated in AGP would have generated less TFP if they did not participate (see first row of column B in Table 8). Similarly, farm households that did not participate would generate 13% more TFP (i.e., 0.811 to 0.682) if they did participate (see second row of column A in Table 8) than if they did not participate in the program. Generally, under both conditions, TFP was higher for AGP participant farm households than non-participant households.

The result from the transitional heterogeneity gaps indicates that TFP of AGP participants and non-participants increased by 3.7% (i.e., 0.848 to 0.811) and 3.4% (i.e., 0.716 to 0.682), respectively, if TFP is conditioned by AGP participant characteristics than AGP non-participant characteristics. The program effects alone do not explain the overall difference in TFP status between farm households in AGP and non-AGP districts. In this case, while both AGP participation and non-participation would show an improvement in TFP status from using the AGP participant resources and characteristics, the benefit for non-participants is larger compared to the participants; the increase in TFP is higher for the former by 0.4%.

Table 9 presents the contribution of AGP on TC. Participation in AGP significantly contributed to household level TC. Both the ATT (average AGP effect of TC on the participant) and ATU (average AGP effect of TC on the non-participant) shows that AGP participation leads to a higher TC, on average, compared to non-participation. Under both conditions, the difference was statistically significant. Farm households who actually participated in AGP would have generated less TC if they did not participate (see first row of column B in Table 9).

The results from the transitional heterogeneity gaps indicate heterogeneity of TC due to a composition effect. Thus, AGP participation and non-participation increased TC by 0.4% (i.e., from 0.149 to 0.145) and 1.6% (i.e., from 0.054 to 0.048), respectively, if TC was conditioned by AGP participants’ characteristics than characteristics of non-participants. The program effect does not alone give the overall AGP difference in TC between farm households in AGP and those in non-AGP districts. Similar to other components of TFP, the negative heterogeneity effects indicate that while AGP participation and non-participation would show an improvement in TC status from using the resources of AGP participants and characteristics of households, non-participants benefit more compared to the participants.

I also estimated AGP differences in TE (Table 10). In general, the result provides further evidence on AGP difference in TE status due to program effects and due to the overall composition effect (differences in observed socio-economic characteristics). The average AGP gap in TE status of AGP participation compared to non-participation conditional on characteristics of AGP participants represents a 10% decline in TE (from 0.680 in AGP participation to 0.577 in non-participation). Similar to this, the AGP gap conditional on characteristics of non-participants showed a decline in TE of AGP participation by about 12% compared to non-participation (0.546 versus 0.669). There were also significant transitional heterogeneity effects. The TE status would decrease by about 3 % (i.e., from 0.577 to 0.546) and increase by 1 % (i.e., from 0.680 to 0.669) respectively for AGP participation and non-participation, if the respective TE was conditioned by the resources and characteristics of AGP participants rather than those of non-participants. Thus, AGP participation and non-participation show an improvement in TE status from using resources and characteristics of the participants. Participants in AGP benefit more from TE compared to non-participants due to the program.

Results on the average causal effect of AGP on SE are in Table 11. The results reveal that the SE differential is also apparently caused by differences in household and farm characteristics. AGP non-participants could have been more scale efficient had they had the resources and characteristics of AGP participants. The results also indicate that there were some sources of heterogeneity due to program effects that makes AGP participation more scale efficient than non-participation under both AGP participants’ and non-participants’ characteristics. Column A of row 1 and column B of row 2 in Table 11 shows that the SE of non-participant households is lower (0.343 versus 0.253), on average, than the AGP participant households. However, with the counterfactual condition (column A of second row) where the non-participants have AGP participants’ response coefficients, the difference will be reduced to about −1% (0.343 versus 0.354).

Column C presents the treatment effects of AGP on SE. Non-participation results in lower SE. With the counterfactual conditions (column of B of row 1) that the AGP farm households had the non-AGP households’ characteristics and returns, the SE would still be 8 % lower. Similarly, in the counterfactual case that the AGP non-participant households had the characteristics of AGP households, the mean SE will be 10% less than what it is now.

Finally, I examined the evolution of TE and TFP across households in AGP and non-AGP districts by quantiles of the TE and TFP distributions to see the patterns of distributions between AGP and non-AGP households at the extreme values. Figure 2 shows the cumulative distributions for the effect of AGP participation on TE by quantiles of TE. The result indicates that in the fourth quartile (i.e., the upper tails of the TE distribution) the cumulative distribution of TE for farm households in the AGP districts dominates that of farm households in the non-AGP districts. This is shown by the cumulative distribution functions (CDF) of TE of farm household in the AGP districts being constantly below that of households in the non-AGP districts (Fig. 2, Panel d). In the middle part (the second and third quantiles) of TE distributions, the cumulative distribution of TE for farm households in the non-AGP districts dominated that of farm households in the AGP districts (Fig. 2, Panel b and Panel c). The result may suggest that although participation in AGP has a role in improving the efficiency and productivity level of farm households, there is a limited contribution of AGP in lifting those farm households who are found at the lower tail of efficiency and productivity distribution.

Fig. 2
figure 2

Cumulative distribution for impact of AGP participation on total factor productivity by quantiles of Total Factor Productivity (TFP)

3 Discussions

The results indicate that most of the production inputs used in the AGP in Ethiopia had positive first-degree coefficients and negative quadratic terms. This suggests that the impacts of the different factors of production on agricultural output growth were heterogeneous. For instance, keeping all other inputs constant, a 1% increase of fertilizer usage would result in a 0.5–1% increase in output, but with diminishing returns. The non-linear effect of fertilizer as well as its significant interaction with other inputs suggested that use of fertilizer has a positive output elasticity for farm households in the AGP districts and negative elasticity in the non-AGP districts. For AGP districts, the highest output elasticity is due to the complementarity of multiple farm inputs. The result was consistent with earlier studies on the use of multiple farm inputs for higher output elasticity (Villano et al. (2015).

The findings also suggest that land remains the most important factor of production for all farm households with an output elasticity of 1.5 for participants in the AGP and 1.0 for non-AGP participants. The high elasticity of land area is not surprising because in the presence of small size farms, as in Ethiopia, this factor can be considered a quasi-fixed input (Alvarez & Arias, 2004; Madau, 2007). The direct effect of labor had a positive output elasticity between 0.18–0.31, but that result was not statistically significant. However, with an increase in labor, labor has statistically significant negative output elasticity at −0.03 for farm households in the AGP districts.

Technical change normally involves the contraction/expansion of inputs and outputs and changes the marginal rates of technical substitution among the inputs (Coelli et al., 2002). The interaction terms of technology and farm inputs in the model capture the neutral and non-neutral technical change component of the total factor productivity change. Technological progress was associated with labor supply, fertilizer use and seed saving in both AGP and non-AGP districts. Moreover, while technological progress involved use of land and agro-chemicals in AGP districts, it was neutral to these inputs in non-AGP districts. The results also reveal no technological progress in the use of animal power in AGP districts, but animal power saving in non-AGP districts. The finding suggests greater scope of technological progress through AGP and the effective use of farm inputs.

On average, farmers in the AGP and non-AGP districts are respectively 57 and 68% efficient in using technologies. The average technical efficiency estimates from this study are comparable with technical efficiency of smallholder farmers recorded by some previous studies such as Wondimu (2016) and Abrar and Morrissey (2006) who reported 57–61% technical efficiencies for Ethiopia. Technical efficiency improvement refers to increases in output–input ratios made possible by improvement in the production process with the existing technology and resources (Kumbhakar, 2000; Coelli et al., 2002). The result generally suggests that improving technical efficiency such as through technical advice and extension services has the potential to increase output by about 32% for farmers in the non-AGP districts and 43% for farmers in the AGP districts.

The declining trend of technical efficiency over time in this study suggests that farm households in both AGP and non-AGP districts have a slow rate of output growth. This supports the assertion that farmer technical inefficiency is one of the most important elements affecting factor productivity in smallholder farming (Kumbhakar, 2000; Wadud, 2003). On the other hand, the results suggest that scale inefficiency is obtained mainly because the farms operate at a suboptimal scale, with increasing returns to scale. Wondemu (2016) suggests that lack of a competitive land market may be the primary factor that prevents farmers from fully exploiting economies of scale in Ethiopia and suggested further land consolidation as one potential source of productivity growth.

The significant positive growth rate of TFP in this study implies that technical progress has compensated for the minimal contribution of scale efficiency and the negative contribution of technical inefficiency to TFP growth in Ethiopia. The reason for the technological progress may be the favorable investment effect from the AGP through measures such as providing finance and technical support to scale up of best practices, increasing the availability and adoption of improved inputs (seeds and fertilizers), agronomic practices, and increased use of small-scale irrigation (Berhane et al., 2013). Thus, farm households in AGP districts benefit from increasing factor productivity relative to farm households in districts not included in the program That is, findings in this study support the notion that the AGP may influence agricultural productivity by encouraging investment on land and adoption of new technologies (Hadley, 2006; Villano et al., 2015).

4 Conclusions and policy implications

This study assessed the technical and scale efficiency, technical progress and total factor productivity growth in smallholder crop agriculture in Ethiopia using a stochastic frontier approach. The effects of the agricultural growth program and household and farm characteristics on total factor productivity and its components were evaluated. The econometric results confirmed that households in the AGP districts, on average, had an increased total factor productivity and technical change, but a decreased technical efficiency compared with households in non-AGP districts. This difference can be partly explained by the differences in observable endowments; the switching regression treatment effects established a significant difference between ATT and ATU, showing significant heterogeneity effects. This indicates that although the AGP program resulted in a positive effect on total factor productivity, technical change and scale efficiency, there were important additional sources of total factor productivity, technical change and scale efficiency changes due to differences in observed characteristics.

There are important policy implications from the analysis in this study. It is evident from the analysis that the critical factor affecting total factor productivity is the rate of technological progress. Thus, any policy change which affects the use of modern agricultural inputs directly or indirectly would have a role in the growth of agriculture in Ethiopia. While research and development are important in determining the country’s agricultural production potential and increasing household food security status, some issues need to be taken care of. In the long run, the growth in output must rely on improvements in technical efficiency. But, the principal difficulty in the 5-year period covered by the Agricultural Growth Program lies in the slow or negative rate of increase in technical and scale efficiency. This indicates that there is a growing urgency for sustained improvements in technical advice and extension services, which requires a more active role for the public sector in research and extension activities in collaboration with farmers to bring inefficient farmers closer to the frontier over time. Emphasis should be on the communication of research results to farmers in usable forms and the establishment of regional and national means to enhance research-extension-farmer linkages and the efficiency and relevance of technology generation and transfer. In addition, as indicated by the locational variability in efficiency, the potential for efficiency improvements in different locations is substantial.