Keywords

1 Introduction

As pointed out in other sources [1,2,3], Brazilian agriculture is highly concentrated. Only five hundred thousand farmers, 11.4% of the total, produced 87% of the total production value in 2006 (last agricultural census). These data motivate studies that identify factors of importance for public policies leading to productive inclusion in agriculture in Brazil. Indeed, the major (state) agricultural research company in Brazil defines “productive insertion and poverty reduction” as one of the impact axes in its strategic planning map. Access to technology is the main cause of production concentration and, very likely, of poverty in the fields. We see, in this context, that the agricultural sector demands proper public policies in order to improve access to technology and to increase productive insertion and reduce rural poverty.

As emphasized in Souza and Gomes [3], market imperfections are the main cause of inhibition of the access of farmers to technology and, therefore, to productive inclusion. Market imperfections are the result of asymmetry in credit for production, infrastructure, information availability, rural extension and technical assistance, among others [4].

The lack of physical infrastructure and education make it difficult for the rural extension to fulfill its role and, therefore, gain proper access to technology. Another point to be emphasized is related to the imperfection of the production markets. Souza et al. [5] highlight that small farmers sell their products at lower values and buy inputs at higher prices. The large scale producers are able to negotiate better input and output prices and the existence of these different prices characterizes a market imperfection. Unfavorable negotiation may lead to higher prices for the adoption of better technologies and thus lead to difficulties in achieving higher economic efficiency.

We contribute to this literature modeling production value as a function of several aggregates, reflecting, on a municipal level, the input usage, environment control, technical assistance and the effect of market imperfection variables on the technical efficiency of production. The modeling process postulates a Cobb-Douglas representation in a typical stochastic frontier approach and is carried out under the assumption of endogeneity of technical assistance. The models we used follow the basic lines of Karakaplan and Kutlu [6] and Karakaplan [7]. We extend Karakaplan’s approach to the truncated normal and the exponential distributions. Alternatives to the approach are also suggested, considering non-linear models with the Murphy and Topel [8] variance correction. In this context we allow the use of fractional regressions [9, 10]. Our results extend Souza and Gomes’ [3] findings.

2 Data

The data sources for this article are the Brazilian agricultural census of 2006, the Brazilian demographic census of 2010, and municipal databases on education and health.

We follow the approach of Souza et al. [2, 5] in the definition of production and contextual variables.

Production (inputs and output) is defined using monetary values. The source is the agricultural census of 2006 [11]. The output variable is the value of production and the inputs are expenses on labor (labor), land (land) and technological inputs (techinputs), which includes machinery, improvements in the farm, equipment rental, value of permanent crops, value of animals, value of forests in the establishment, value of seeds, value of salt and fodder, value of medication, fertilizers, manure, pesticides, expenses with fuel, electricity, storage, services provided, raw materials, incubation of eggs and other expenses. Value of permanent crops, forests, machinery, improvements on the farm, animals and equipment rental were depreciated at a rate of 6% a year (machines – 15 years, planted forests – 20 years, permanent cultures – 15 years, improvements – 50 years, animals – 5 years). Farm data from the agricultural census were aggregated to form totals for each county. A total of 4,965 counties (almost 90% of the total) provided valid data for our analysis.

The contextual variables we chose are a performance municipal index of social development (social), an index of demographic development (demog), the proportion of farmers who received technical assistance (techassist), the proportion of non-degraded areas (ndareas) and the proportion of forested areas (forest). The last two are proxies for environment control. Market imperfections are mainly associated with the social index.

The demographic index captures the population dynamics that tend to follow rural development. The variables considered in this dimension of development are the migration index (rural to urban areas), average number of farm dwellers, aging rate (total municipal rural population over rural population over 60), dependency rate (ratio of the rural population with age in the bracket 15–59 over the rural population with age in the bracket of 0–14 plus over 60), ratio of urban to rural population in the municipality. The source is the demographic census of 2010 [12] in general, and the 2000 and 2010 census for the migration index. The demographic score was computed using the ranks of these measurements, weighted by the relative multiple correlation coefficient.

The index of social development reflects the level of well-being, favored by factors such as the availability of water and electric energy in the rural residences, and level of education, health and poverty in the rural households. It was computed as a weighted average of normalized ranks of the following variables: education (illiteracy rate), poverty index, average gross per capita income of rural households, proportion of farms with access to electricity and water, index of basic education, index of performance of the public health system and vulnerability of children up to five years old. These indicators were obtained from the Brazilian demographic census 2010 [12], from the Brazilian agricultural census 2006 [11], and from the databases of the National Institute of Research and Educational Studies (INEP), referring to education in 2009 [13], and of the Ministry of Health 2011 data [14]. The social score was computed using the ranks of these measurements, weighted by the relative multiple correlation coefficient.

3 Methodology

Our approach to assess production and efficiency of production follows along the lines of Karakaplan and Kutlu [6] and Karakaplan [7]. The structural model for our application is defined by (1) for municipality i, where techassist is assumed endogenous and \( y_{i} \) is the log of gross income.

$$ \begin{aligned} & y_{i} = b_{0} + b_{1} \log \left( {labor_{i} } \right) + b_{2} \log \left( {land_{i} } \right) + b_{3} \log \left( {techinputs_{i} } \right) + b_{4} \left( {forext_{i} } \right) \\ & \,\,\,\,\,\,\,\,\, + b_{5} \left( {ndareas_{i} } \right) + b_{6} \left( {techassist_{i} } \right) + v_{i} - u_{i} \\ & v_{i} ,u_{i} {\text{ independent}} \\ & v_{i} \sim N\left( {0,\sigma^{2} } \right) \\ \end{aligned} $$
(1)

The \( u_{i} \) are non-negative inefficiency components and the \( v_{i} \) are a random sample of an idiosyncratic error component. We assume three possible distributions for the inefficiency component: half-normal, exponential and truncated normal.

For the half-normal we have \( u_{i} \sim N^{ + } \left( {0,\sigma_{{u_{i} }}^{2} } \right) \) and \( \sigma_{{u_{i} }}^{2} = \exp \left( \begin{aligned} & b_{7} + b_{8} \log \left( {labor_{i} } \right) + b_{9} \log \left( {land_{i} } \right) + b_{10} \log \left( {techinputs_{i} } \right) + b_{11} \left( {forext_{i} } \right) + \\ & b_{512} \left( {ndareas_{i} } \right) + b_{13} \left( {social_{i} } \right) + b_{14} \left( {demog_{i} } \right) \\ \end{aligned} \right) \). For the exponential \( u_{i} \sim \exp \left( {\zeta_{i} } \right),\zeta > 0 \), we assume the variance \( \sigma_{{u_{i} }}^{2} = \zeta^{ - 2} \) with the same representation as the half-normal. Finally, for the truncated normal \( u_{i} \sim N^{ + } \left( {\mu_{i} ,\sigma_{u}^{2} } \right) \) and \( \begin{array}{*{20}l} {\mu_{i} = b_{7} + b_{8} \log \left( {labor_{i} } \right) + b_{9} \log \left( {land_{i} } \right) + b_{10} \log \left( {techinputs_{i} } \right) + b_{11} \left( {forext_{i} } \right) + } \hfill \\ {b_{512} \left( {ndareas_{i} } \right) + b_{13} \left( {social_{i} } \right) + b_{14} \left( {demog_{i} } \right)} \hfill \\ \end{array} \).

Endogeneity in Karakaplan and Kutlu [6] and Karakaplan [7] means correlation of a variable with \( v_{i} \). This assumption invalidates the classic stochastic frontier analysis. A classic approach for handling this issue is to use two stage least squares or the general method of moments (GMM), as suggested in Amsler et al. [15]. On the other hand, Karakaplan and Kutlu [6] and Karakaplan [7] suggest the use of instrumental variables in a context of maximum likelihood estimation, resembling classical frontier analysis. In our application, we follow this approach and the instruments considered for techassist are the exogenous variables plus demographic and social indicators. The instrumental variable regression is assumed to be linear but the idea can be easily generalized to non-linear specifications \( techassist_{i} = f\left( {z_{i} ,\delta } \right) + \varepsilon_{i} \). In this formulation, \( z_{i} \) is a vector of instrumental variables and \( \varepsilon ' = \left( {\varepsilon_{1} , \ldots ,\varepsilon_{n} } \right) \) has mean zero and variance matrix \( \sigma_{\varepsilon }^{2} I \). Heteroskedastic formulations are possible assuming a general variance matrix \( \Upomega \). This formulation also allows for the Bernoulli specification described in Papke and Wooldridge [9], which is particularly convenient if one is dealing with fractions. In this instance, the model can be estimated assuming \( f\left( . \right) \) to be a distribution function.

Karakaplan [7] in its ‘sfkk’ module in the Stata software makes use of the half-normal distribution and the linear instrumental variable regression.

Let \( \rho \) be the correlation between \( \varepsilon_{i} \) and \( v_{i} \). Endogeneity means \( \rho \ne 0 \). We assume the bivariate normal distribution as in (2).

$$ \left[ {\begin{array}{*{20}c} {\tilde{\varepsilon }_{i} } \\ {v{}_{i}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {{{\varepsilon_{i} } \mathord{\left/ {\vphantom {{\varepsilon_{i} } {\sigma_{\varepsilon } }}} \right. \kern-0pt} {\sigma_{\varepsilon } }}} \\ {v{}_{i}} \\ \end{array} } \right]\sim N\left( {\left[ {\begin{array}{*{20}c} 0 \\ 0 \\ \end{array} } \right],\left[ {\begin{array}{*{20}c} 1 & {\rho \sigma } \\ {\rho \sigma } & {\sigma^{2} } \\ \end{array} } \right]} \right) $$
(2)

Using a Cholesky decomposition we may write (3) and, therefore, we have (4).

$$ \left[ {\begin{array}{*{20}c} {\tilde{\varepsilon }_{i} } \\ {v{}_{i}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} 1 & 0 \\ {\rho \sigma } & {\sigma \sqrt {1 - \rho^{2} } } \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} {\tilde{\varepsilon }_{i} } \\ {\tilde{w}{}_{i}} \\ \end{array} } \right] $$
(3)
$$ \begin{aligned} & y_{i} = b_{0} + b_{1} \log \left( {labor_{i} } \right) + b_{2} \log \left( {land_{i} } \right) + b_{3} \log \left( {techinputs_{i} } \right) + b_{4} \left( {forext_{i} } \right) \\ & \,\,\,\,\,\,\,\,\,\, + b_{5} \left( {ndareas_{i} } \right) + b_{6} \left( {techassist_{i} } \right) + \eta \tilde{\varepsilon }_{i} + w_{i} - u_{i} \\ & w{}_{i} = \sigma \sqrt {1 - \rho^{2} } \tilde{w}_{i} \\ & \mu = \rho \sigma \\ \end{aligned} $$
(4)

Therefore, when the residual variance is constant, the component \( \eta \tilde{\varepsilon }_{i} \) is the correction term for bias. The test of \( \eta = 0 \) is an endogeneity test. The model is estimated by maximum likelihood.

For the half-normal distribution, the likelihood function is given by (5).

$$ \begin{aligned} \log L\left( \theta \right) = & \sum\limits_{i = 1}^{n} {\left\{ {\frac{{\ln \left( {{2 \mathord{\left/ {\vphantom {2 \pi }} \right. \kern-0pt} \pi }} \right) - \ln \left( {\sigma_{Si}^{2} } \right) - \left( {{{e_{i} } \mathord{\left/ {\vphantom {{e_{i} } {\sigma_{Si}^{2} }}} \right. \kern-0pt} {\sigma_{Si}^{2} }}} \right)}}{2} + \ln \Upphi \left( {\frac{{\lambda_{i} e_{i} }}{{\sigma_{Si}^{{}} }}} \right)} \right\}} \\ \, & + \, \sum\limits_{i = 1}^{n} {\left\{ {\frac{{\ln 2\pi - \ln \sigma_{\varepsilon } - \sum\nolimits_{i = 1}^{n} {\left( {{{\varepsilon_{i}^{2} } \mathord{\left/ {\vphantom {{\varepsilon_{i}^{2} } {\sigma_{\varepsilon }^{2} }}} \right. \kern-0pt} {\sigma_{\varepsilon }^{2} }}} \right)} }}{2}} \right\}} \, \\ \end{aligned} $$
(5)

Here \( \lambda_{i} = {{\sigma_{ui} } \mathord{\left/ {\vphantom {{\sigma_{ui} } \sigma }} \right. \kern-0pt} \sigma } \) and \( \sigma_{Si}^{2} = \sigma_{ui}^{2} + \sigma^{2} \). Notice that \( e_{i} \) is defined by (6).

$$ e_{i} = y_{i} - \left( \begin{aligned} & b_{0} + b_{1} \log \left( {labor_{i} } \right) + b_{2} \log \left( {land_{i} } \right) + b_{3} \log \left( {techinputs_{i} } \right) + b_{4} \left( {forext_{i} } \right) \\ & + b_{5} \left( {ndareas_{i} } \right) + b_{6} \left( {techassist_{i} } \right) + \eta \tilde{\varepsilon }_{i} \\ \end{aligned} \right) $$
(6)

For the exponential model, the likelihood function becomes (7) and for the truncated normal it is defined as in (8), where \( \gamma = {{\sigma_{u}^{2} } \mathord{\left/ {\vphantom {{\sigma_{u}^{2} } {\sigma_{S}^{2} ,\,\,\sigma_{S}^{2} = \sigma_{u}^{2} + \sigma^{2} }}} \right. \kern-0pt} {\sigma_{S}^{2} ,\,\,\sigma_{S}^{2} = \sigma_{u}^{2} + \sigma^{2} }} \).

$$ \begin{aligned} \log L\left( \theta \right) & = \sum\limits_{i = 1}^{n} {\left\{ { - \ln \left( {\sigma_{u} } \right) + \frac{{\sigma^{2} }}{{2\sigma_{u}^{2} }} + \ln \Upphi \left( {\frac{{ - e_{i} - {{\sigma^{2} } \mathord{\left/ {\vphantom {{\sigma^{2} } {\sigma_{u} }}} \right. \kern-0pt} {\sigma_{u} }}}}{\sigma }} \right) + \frac{{e_{i} }}{{\sigma_{u} }}} \right\}} \\ \, & \,\,\,\,\,\, + \, \sum\limits_{i = 1}^{n} {\left\{ {\frac{{\ln 2\pi - \ln \sigma_{\varepsilon } - \sum\nolimits_{i = 1}^{n} {\left( {{{\varepsilon_{i}^{2} } \mathord{\left/ {\vphantom {{\varepsilon_{i}^{2} } {\sigma_{\varepsilon }^{2} }}} \right. \kern-0pt} {\sigma_{\varepsilon }^{2} }}} \right)} }}{2}} \right\}} \, \\ \end{aligned} $$
(7)
$$ \begin{aligned} \log L\left( \theta \right) = & \sum\limits_{i = 1}^{n} {\left\{ \begin{aligned} & - \frac{{\ln \left( {2\pi } \right)}}{2} - \ln \sigma_{S} - \ln \Upphi \left( {\frac{{\mu_{i} }}{{\sigma_{S} \sqrt \gamma }}} \right) + \ln \Upphi \left( {\frac{{\left( {1 - \gamma } \right)\mu_{i} - \lambda e_{i} }}{{\sigma_{S} \sqrt {\gamma \left( {1 - \gamma } \right)} }}} \right) \\ & - \frac{1}{2}\left( {\frac{{e_{i} + \mu_{i} }}{{\sigma_{S} }}} \right)^{2} \\ \end{aligned} \right\}} \\ \, & + \, \sum\limits_{i = 1}^{n} {\left\{ {\frac{{\ln 2\pi - \ln \sigma_{\varepsilon } - \sum\nolimits_{i = 1}^{n} {\left( {{{\varepsilon_{i}^{2} } \mathord{\left/ {\vphantom {{\varepsilon_{i}^{2} } {\sigma_{\varepsilon }^{2} }}} \right. \kern-0pt} {\sigma_{\varepsilon }^{2} }}} \right)} }}{2}} \right\}} \, \\ \end{aligned} $$
(8)

Karakaplan and Kutlu [6] suggest an alternative to estimation easier to implement, which can be extended to accommodate fractional regression models in the instrumental regression. The idea is to perform the estimation in two steps. Firstly, one fits the instrumental variable regression and computes residuals \( \hat{\varepsilon }_{i} = techassist - f\left( {z_{i} ,\hat{\delta }} \right) \) and then runs the standard stochastic frontier model (9).

$$ \begin{aligned} y_{i} \, = & \,b_{0} + b_{1} \log \left( {labor_{i} } \right) + b_{2} \log \left( {land_{i} } \right) + b_{3} \log \left( {techinputs_{i} } \right) + b_{4} \left( {forext_{i} } \right) \\ \, & + b_{5} \left( {ndareas_{i} } \right) + b_{6} \left( {techassist_{i} } \right) + \eta \hat{\varepsilon }_{i} + w_{i} - u_{i} \\ \end{aligned} $$
(9)

The process will not produce the same results as the full maximum likelihood estimation. Greene [4] names it limited information maximum likelihood. The variance matrix of the estimator requires the Murphy and Topel [8] correction. Let \( \hat{\delta } \) be the maximum likelihood estimate obtained from the instrumental variable regression with variance matrix \( \hat{V}_{1} \). The likelihood function is \( \ln \left( {techassist,z,\delta } \right) \). Let \( \hat{\theta } \) be the maximum likelihood of the resulting frontier model obtained when \( \delta = \hat{\delta } \). The variance matrix is \( \hat{V}_{2} \) and the likelihood function is \( \ln f_{2} \left( {y,x,\hat{\delta },\theta } \right) \), where the vector \( x \) includes inputs, technical assistance, non-degraded areas, forests, and the residual from the instrumental variable regression. Following Greene [16], we may define the matrices (10) and (11).

$$ \hat{C} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {\frac{{\partial \ln f_{2i} }}{{\partial \hat{\theta }}}} \right)} \left( {\frac{{\partial \ln f_{2i} }}{{\partial \hat{\delta }'}}} \right) $$
(10)
$$ \hat{R} = \frac{1}{n}\sum\limits_{i = 1}^{n} {\left( {\frac{{\partial \ln f_{2i} }}{{\partial \hat{\theta }}}} \right)} \left( {\frac{{\partial \ln f_{1i} }}{{\partial \hat{\delta }'}}} \right) $$
(11)

The estimated variance matrix of the limited information maximum likelihood estimator is defined as in (12).

$$ \hat{V} = \frac{1}{n}\left( {\hat{V}_{2} + \hat{V}_{2} \left( {\hat{C}\hat{V}_{1} \hat{C}' - \hat{R}\hat{V}_{1} \hat{C}' - \hat{C}\hat{V}_{1} \hat{R}'} \right)\,\hat{V}_{2} } \right) $$
(12)

In our exercise we used both methods, that is, full likelihood estimation as well as the two-step procedure. Regression in the first step used both the fractional approach of Papke and Wooldridge [9] and linear regression.

4 Statistical Results

Following the standard literature in stochastic frontier analysis we fitted 11 models to the data described in Sect. 2, using the approaches of Sect. 3. The models considered are: Case 1 – The full information maximum likelihood approach under the half-normal and exponential inefficiency distributions, and the correspondent limited information maximum likelihood for the best model under linear and fractional instrumental variables regressions. The only inefficiency effect considered is the social indicator; Case 2 – The limited information maximum likelihood assuming both instrumental variables’ regression assumptions, including as efficiency effects all independent factors for the half-normal and truncated normal. Tables 1 and 2 show the goodness of fit measures considered for model choice.

Table 1. Fit statistics: Case 1 – Social indicator is the only inefficiency effect.
Table 2. Fit statistics: Case 2 – All exogenous variables are inefficiency effects.

We experienced convergence problems with some of the assumptions for the inefficiency distribution, depending on the assumption itself and on the number of variables included in the efficiency effect. The full information maximum likelihood with all exogenous variable included in the effect did not converge, inhibiting the application of the standard likelihood approach to test nested hypothesis. We see from Tables 1 and 2 that the best fit is the full information estimator under the half-normal distribution, reducing the set of inefficiency factor effects to the social indicator. The models fitted in two stages using the linear and the non-linear binomial Papke and Woodridge [12] assumptions indicate similar results, with a slight superiority for the fractional regression. Correlations between actual and estimated values for the instrumental regressions are, respectively, 80.1% and 80.4%. Programming was carried out using Stata 14 and SAS 9.2 software.

Table 3 shows statistical estimation for full information half-normal model including a social effect for the inefficiency component. Table 4 shows the fractional regression for technical assistance. Table 5 shows the limited information maximum likelihood with the Murphy-Topel variance correction [8], under the binomial specification for the instrumental variable regression.

Table 3. Full information maximum likelihood estimation. Half-normal stochastic frontier under endogeneity of technical assistance. Stata output.
Table 4. Instrumental variable fractional regression. SAS output.
Table 5. Limited Information maximum likelihood estimation. Half-normal stochastic frontier under endogeneity of technical assistance and fractional regression. SAS output.

In the context of the full information maximum likelihood estimation correlation between actual and predicted values of the frontier model, including efficiency effects, is 88.6%. The component technical assistance affects significantly and positively the response variable (log income). There is no evidence of endogeneity (p-value = 0.1858).

Table 6 summarizes the relative importance of production factors, including returns to scale. We see that technology dominates, followed by labor and land. The technology shows decreasing returns to scale. These results fairly agree with Souza et al. [5].

Table 6. Relative elasticities and returns to scale.

Technical assistance, non-degraded areas and the proportion of forested areas are all statistically significant (Table 4). The former act favoring production and the latter have a negative effect.

Table 7 shows 5-number summaries for technical efficiency. Figure 1 shows box plots for the normalized ranks of the efficiency measurements. Efficiency differs significantly by regional classification. There is a clear domination of South, Southeast, and Center-West.

Table 7. Normalized rank of technical efficiency – 5-number summary.
Fig. 1.
figure 1

Box plots of technical efficiency by region.

The social indicator positively affects technical efficiency, as reported in Table 4. Regions that are to benefit the most from improvements in the social indicators are the North and Northeast. The instrumental variable regression indicates a strong dependence of technical assistance on the environment, demographics and the social conditions. The increased population dynamics makes the presence of technical assistance unnecessary, implying, therefore, a negative effect of the demographic index. The other indices are positively related to technical assistance.

The limited information maximum likelihood estimation agrees, in general, with the full information maximum likelihood results. There is no evidence of technical assistance endogeneity. See Table 5. The main difference regards the standard error of the estimated coefficient of the social indicator in the inefficiency variance (Table 5). The Murphy-Topel correction inflates the variance, forcing non-significance. However, the coefficient values are similar. The fractional instrumental variables regression indicates positive relation to the social indicator and to non-degraded areas (Table 4). The demographic index is negatively related to the response and the proportion of forested areas is not significant.

Limited information estimation, including all instrumental variables as technical efficiency effects, is clearly inferior to the full information model estimated, including only the social indicator (Tables 1 and 2). The interesting feature of these models is the similarity of the results obtained with the linear and non-linear instrumental regression, suggesting robustness of the linear instrumental regression.

5 Concluding Remarks

We fitted a stochastic frontier under endogeneity to municipal data using the Brazilian agricultural census of 2006 – the last one available. The objective of this study, besides assessing input elasticities, was to investigate effects of market imperfection variables on production. Market imperfections come from different realities in production experienced by small and large farmers. They relate to infrastructure, level of education and access to credit, implying in different input and output prices for small and large farmers. The presence of market imperfection makes it harder for rural extension and technical assistance to promote productive inclusion.

For public policy decision-making, the identification of production function components elasticities is of importance to guide rural governmental assistance. This is critical for reducing poverty in the fields and for increasing production. We conclude that technology is the main input factor for increasing income in rural Brazil. The social indicator is the key variable to reducing inefficiency. The indicator is relatively too low for the Northern and Northeastern regions. Values are less than half of the corresponding values of other regions. Public policies should be oriented to improve this indicator particularly in these regions.

Technical assistance is an important part of rural extension and has a direct positive effect on income. Improvement of the social indicator will tend to facilitate the access of technical assistance creating, in this way, a synergic positive effect on income.

Environment in our study was measured in two ways: non-degraded areas and the proportion of forested areas. Keeping non-degraded areas relates to technology and has a positive impact on production. On the other hand, keeping a relative large area of uncultivated land in the farm will have a negative effect on income. Extension and technical assistance may be the key factor to extract value from forests and properly preserve these areas.

Finally, we emphasize the fact that the use of limited information maximum likelihood estimation indeed eases convergence in the stochastic frontier models. The linear instrumental regression seems to be robust, but it produces inferior fits when compared with fractional regressions. The Murphy-Topel variance matrix correction may change the significance of important variables relative to the full information maximum likelihood estimation.