1 Introduction

Empirical studies on productivity and efficiency at the micro-level have found large heterogeneity across firms or plants, even within narrowly defined industries (see, e.g., Syverson 2010; Dosi et al. 2011). Differences in performance between production units have mainly been attributed to variations in management skills, human capital, innovation, types of ownership, firms’ international exposure and size, together with factors which are external to the firms, like technological spillovers and the regulatory environment. Similarly, the decision about which phases of production to keep inside the firm and which to leave ‘outside’ (i.e., the control of vertical links of production) is another factor related to a firm’s productive performance, which has been widely investigated in the economics and management literature. Several models have been proposed to explain the existence of firms with different degrees of vertical integration, referring to a variety of factors such as transaction and agency costs, market-power and firms’ specific capabilities (for a comprehensive survey on the determinants of the vertical scope of the firm, see Lafontaine and Slade 2007).

From a theoretical point of view, the link between vertical integration and efficiency may really be the result of two-way causality, i.e. from productive efficiency to the type of vertical organization or vice versa. Although theoretical models have tried to explain the existence and functioning of each direction of causation, no theory contemporaneously contemplates both directions, and in the last few years, this lack has led to the flowering of a vast empirical literature, the results of which are still inconclusive.

In light of these facts, we empirically examine the link between firms’ efficiency and vertical integration in a novel panel dataset comprising about 500 Italian machine tool (MT) builders, examining and assessing both directions of causation. The MT industry is a strategic sector in most industrial countries (Carlsson 1989) and gathers together all the producers of metal working machinery and component. It is a natural candidate for this analysis given that its vertical structure has taken on various configurations since the 1950s (see Rolfo 1993). At the present time, the MT industry is characterized by the coexistence of various types of organizational forms and heterogeneity in productive efficiency. Our empirical analysis is structured in two steps: first, we implement a stochastic production frontier model (SFM) to estimate firms’ technical efficiency; second, we investigate the relationship between the degree of vertical integration and technical efficiency, by means of two equations for the two directions of causality.

We find that, once we have controlled for firms’ unobserved heterogeneity and an important set of time-variant characteristics, inefficiency levels have a positive effect on the degree of vertical disintegration, i.e., more efficient firms choose vertically integrated structures, whereas less efficient firms choose disintegrated organizations. This result, which is robust to control for the endogeneity of inefficiency in the relationship, indicates that an ex ante selection mechanism is at work in the industry; conversely, no significant effect is found from the organizational mode to the firm’s productive performance, in a sort of ‘adaptive’ mechanism.

The contribution of this work runs in two main directions: first, it sheds light on the relationship between the control of vertical links of production and firms’ performance, identifying the main direction through which the effect works; second, it attempts to describe the functioning of the MT industry in Italy, which is a key sector of small and medium enterprises (SMEs) which has usually been seen as central for the country’s industrialization and development after the second world war. The paper is structured as follows: Sect. 2 presents the related literature on the link between vertical integration and efficiency; Sect. 3 describes the empirical strategy; Sect. 4 illustrates data; Sect. 5 shows results; and Sect. 6 adds some robustness checks. Lastly, Sect. 7 draws some conclusions.

2 Vertical integration and efficiency: theory and evidence

2.1 Theory

In a simplified setting, in order to be produced, a final good needs two inputs: an intermediate input, and a resource/input which is available to the final good producer. The manufacturer must decide either to buy the intermediate input from an external supplier (i.e., to outsource it) or make it ‘in-house’, vertically integrating. The two organizational forms are alternative ways of producing, which practically appear and coexist not only among various industries but also within them.

However, the wide heterogeneity of vertical boundaries among firms in the same industry is a compelling issue: why should firms adopt different degrees of vertical integration in a ‘common’ environment? And is there any relationship between this choice and firms’ productive performance? As noted previously, the causation between the vertical organization of production and efficiency is not a one-way phenomenon. In this respect, we can refer to different approaches which explain heterogeneity in vertical integration choices: from these approaches, alternative views on the causal relationship have been put forward.

The competitive markets approach, which predicts a self-selection mechanism by heterogeneous firms into different modes of production, has been adopted in several models at the crossroads of industrial organization and international trade. Following this approach, Elberfeld (2001) demonstrates that vertically integrated and disintegrated firms may coexist in the same industry in equilibrium: integrated firms incur higher fixed costs but save on marginal costs. In the model by Antras and Helpman (2004), which rests on a property-rights setting (Grossman and Hart 1986), vertically integrated firms face higher fixed organizational costs; different variable costs depend on decisions about outsourcing production of an intermediate input, and in which country to do so. Heterogeneity in productivity is also introduced in the model: in an industry characterized by relatively higher organizational fixed costs for vertically integrated firms, only the most efficient firms are expected to choose an integrated structure.

The alternative direction of causation, i.e. from the vertical scope of the firm to its productive performance, is analyzed by different approaches in the literature. Market-power theories usually predict a positive effect from vertical integration on firms’ productive efficiency which is linked to the avoidance of double marginalization or other practices that are inefficient (Perry 1989). The strategic management literature moves instead from the fact that firms may have different capabilities of managing vertical links of production. Thus, heterogeneity in the vertical scope reflects the adaptation of firms’ organizational form to their capabilities. An agent-based model proposed by Jacobides (2008) illustrates how firms with heterogeneous capabilities choose different modes of vertical organization, according to the transaction costs they face. Firms can later invest in new capabilities in order to reduce transaction costs; this evolutionary process implies that firms shift from integrated to disintegrated structures and vice versa.Footnote 1 Differences in productive performance emerge ex post as the result of the selective pressure of the market, and causality moves from the organizational choice to the level of productive efficiency. Other authors have focused on coordination issues which may be related to the vertical scope of the firm: on the one hand, vertically integrated organizations may benefit of greater coordination along the production chain (Kogut and Zander 1996), while on the other hand, a greater focus on ‘core competences’ (thorough vertical disintegration/outsourcing), may lead firms to gain in average efficiency.

Overall, theories neither say a final word on the prevailing direction of causality, nor do they predict clear-cut effects; this fact has recently generated a significant amount of empirical research.

2.2 Evidence

The empirical evidence of firms’ efficiency as a determinant of the vertical organization choice has grown in the last few years. Tomiura (2007), analyzing a representative sample of manufacturing firms in Japan, finds that the most productive firms establish international vertical links of production (through foreign direct investments), whereas less productive firms choose outsourcing; a similar result is found by Castellani and Zanfei (2007) in a representative sample of Italian firms; Federico (2010) shows a systematic positive relationship between productivity and vertical integration (either at home or abroad) for Italian manufacturing; and Bakhtiari (2011), in an unbalanced panel of Australian manufacturing firms, shows that the least efficient firms resort to outsourcing in order to save overhead costs associated with integrated structures. All these studies assess the self-selection mechanism of the most productive firms into vertically integrated organizations, but they cannot exclude the other-way-round effect.

As regards market-power-based theories, evidence of effects of vertical integration on firms’ efficiency is fragmented; however, data seem to support the fact that the efficiency gains of vertical integration outweigh anti-competitive effects (Kerkvliet 1991; Chipty 2001). Furthermore, empirical works on the effects of vertical disintegration/outsourcing on firms’ productive performance has gained momentum in the last 10 years, mainly looking at the international side of the phenomenon. Girma and Görg (2004) use establishment-level data in the chemicals, electronics and engineering industries in the UK, finding a positive effect of outsourcing on total factor productivity in the latter industry only, while Görg et al. (2008) find evidence of positive effects from outsourcing of services on the productivity of Irish manufacturing firms which, however, only holds for exporters. Heshmati (2003) and Olsen (2006) offer two surveys of studies on the relationship between (national and international) outsourcing and efficiency, with particular reference to service outsourcing, from which, however, no clear-cut effects emerge.

Thus, empirical studies have not led to any definite picture of the link between the vertical organization of production and firms’ performance. In addition, only a few studies have explicitly looked at effects stemming from both directions of causality.Footnote 2

3 Empirical strategy

In order to assess the link between vertical integration and efficiency in the Italian MT industry, we structure our empirical analysis in two steps. We first implement a SFM for estimating firms’ technical inefficiency; then, we examine the relationship between the degree of vertical integration and the resulting inefficiency scores, using two equations and instrumental variables to control for endogeneity. Sections 3.1 and 3.2 detail the steps of the empirical analysis.

3.1 First step: stochastic frontier models and unobserved heterogeneity

A simple SFM for panel data can be written, in log-linear form, as

$$ y_{it}={f\left({\bf x}_{{\bf it}},\varvec{\beta}\right)}+\epsilon_{it}=\alpha+\varvec{\beta}'{\bf x}_{{\bf it}}+\epsilon_{it}, $$
(1)

where y it denotes the output of the ith firm in the tth time period, x it is the vector of N inputs used by the producer, \(\varvec{\beta}\) is the vector of technology parameters, and \(\epsilon_{it}\) is the composed error term, where:

$$ \epsilon_{it}= v_{it}-u_{it}. $$
(2)

Equations 1 and 2 combine to give:

$$ y_{it}=\alpha+\varvec{\beta}'{\bf x}_{{\bf it}}+v_{it}-u_{it}, $$
(3)

where

$$ v_{it}\sim {\rm i.i.d.} \, N(0,\sigma^{2}_{v}),\, \hbox{and}, \, u_{it}\sim {\rm i.i.d.} \, N^{+}\,(0,\sigma^{2}_{u}). $$
(4)

The composed error consists of a normally distributed component v it , which accounts for random variations of the frontier across firms (due to factors which are not under their control) and measurement errors in y it , and a component u it , which accounts for the difference between the actual level of production and the maximum attainable level, i.e., technical inefficiency, which is assumed to be half-normally distributed.Footnote 3 The estimation is usually performed via maximum likelihood (ML) methods to obtain consistent and efficient estimates of vector \(\varvec{\beta}\) and variance parameters σ 2 v and σ 2 u . Estimates of \(\epsilon_{it}\) are directly recoverable as \(\widehat{\epsilon}_{it}=y_{it}-\widehat{\alpha}-\widehat{\varvec{\beta}}'{\bf x}_{{\bf it}},\) and the estimator developed by Jondrow et al. (1982) can then be used to obtain estimates of the inefficiency scores:

$$ \widehat{u}_{it}=E\left(u_{it}|\epsilon_{it}\right)=\frac{\sigma_{v}\sigma_{u}}{\sigma}\left[\frac{\phi\left(\frac{\epsilon_{it}\lambda}{\sigma}\right)}{1-\Upphi\left(\frac{\epsilon_{it}\lambda}{\sigma}\right)}-\left(\frac{\epsilon_{it}\lambda}{\sigma}\right)\right], $$
(5)

where \(\sigma=\sqrt{\sigma^{2}_{v}+\sigma^{2}_{u}},\,\lambda=\sigma_{u}/\sigma_{v},\) and \(\phi(\cdot)\) and \(\Upphi(\cdot)\) denote, respectively, the density function and the cumulative function of the standard normal distribution.

The specification contained in Eqs. 3 and 4, which has been adopted in a number of works (see, e.g., Kumbhakar 1990; Battese and Coelli 1995) does not take into account the panel nature of the data, and treats them much as a pooled set of observations. This raises an important point: when differences among observations are confined to the vector of x it , the u it elements in Eq. 3 are intended to capture all and only the time-variant firms’ inefficiency; conversely, if there are firm-specific time-invariant effects—which may be correlated to inputs—and they are not tackled in the model, this fact would lead to biased estimates in the \(\varvec{\beta}\) parameters. This ‘pure’ heterogeneity would thus affect overall residuals \(\widehat{\epsilon}_{it}, \) leading to an incorrect statement of technical inefficiency (see Greene 2008, p. 173).Footnote 4

Greene (2005) proposed two SFM which contemplate both unobserved heterogeneity and time-variant inefficiency: the ‘true’ fixed effects (TFE) and ‘true’ random effects (TRE) models. The TRE model may be written as:

$$ y_{it}=\alpha+\omega_i+\varvec{\beta}'{\bf x}_{{\bf it}}+v_{it}-u_{it}, $$
(6)

where ω i is the random term which is specific to each firm and assumed to be uncorrelated with inputs, and the other variables and parameters are defined as in Eqs. 3 and 4. The model has a two-part error component, ω i which should capture the unobserved heterogeneity, and \(\epsilon_{it} = v_{it}-u_{it}, \) which has asymmetric distribution. However, if firm-specific effects are correlated with the vector of inputs chosen by the firm, the TRE model may be prone to the heterogeneity bias.

A useful solution for correcting the TRE model, accounting for this correlation, is to adopt the adjustment proposed by Mundlak (1978), inserting the within-group means of inputs in the main frontier function as follows (Abdulai and Tietje 2007)Footnote 5:

$$ y_{it}=\alpha+\varvec{\beta}'{\bf x}_{{\bf it}}+\varvec{\delta}'\overline{{\bf x}}_{{\bf i}}+\overline{z}_i+v_{it}-u_{it}, $$
(7)

where:

$$ \omega_i=\varvec{\delta}'\overline{{\bf x}}_{{\bf i}}+\overline{z}_i, $$
(8)

\(\overline{{\bf x}}_{{\bf i}}=\frac{1}{T_{i}}\sum^{T_{i}}_{1}{\bf x}_{{\bf it}}\) are the within-group means of inputs and \(\overline{z}_i \sim N(0,\sigma^{2}_{\overline{z}})\) is the orthogonal-to-inputs part of the firm-specific component ω i ; the other variables and parameters are defined as in Eqs. 3 and 4.

The resulting inefficiency scores may be interpreted as in deviation from the firm’s average output level (average inefficiency),Footnote 6 given that the overall residual is equal to:

$$ \widehat{\epsilon}_{it}=y_{it}-\widehat{\alpha}-\widehat{\varvec{\beta}'}{\bf x}_{{\bf it}}-{\widehat{\delta}'}\overline{{\bf x}}_{{\bf i}}, $$
(9)

and the inefficiency scores may be estimated, following the Jondrow formula, as \(\widehat{u}^{\rm TREMU}_{it}=E\left(u_{it}|\widehat{\epsilon}_{it}\right).\)

In order to estimate the technical inefficiency of Italian MT builders, taking unobserved heterogeneity into account, we adopted the TREMU; however, as robustness checks, we compared estimates of the TREMU model with those of the pooled stochastic frontier (PSF) in Eq. 3, the TFE and TRE models, reporting the obtained estimates and inefficiency scores in Sect. 6.1, and presenting proper statistical tests in order to support the preferred TREMU specification. For estimating the parameters of the TREMU model via maximum simulated likelihood Footnote 7 we adopt a translog specification with three inputs:

$$ \begin{aligned} y_{it}&=\alpha+\sum_{n}\beta_{n}\cdot(x_{nit})+\frac{1}{2}\sum_{n}\sum_{p}\beta_{np}\cdot(x_{nit}x_{pit}) &+ \sum_{n}\delta_{n}\cdot(\overline{x}_{ni}) +\frac{1}{2}\sum_{n}\sum_{p}\delta_{np}\cdot(\overline{x_{ni}x_{pi}})\\ &+\overline{z}_i+\tau_{t}+v_{it}-u_{it}, \end{aligned} $$
(10)

where np = (capitallaborintermediates); we also control for factors affecting all firms in the same way in a given year by including (t − 1) year dummies τ t .

3.2 Second step: vertical integration and efficiency

In the second step of the analysis, we use the inefficiency scores recovered in the first step (by means of TREMU), and we empirically model their relationship with the degree of vertical integration, taking both directions of causality into account. In order to test whether firms characterized by different levels of efficiency self-select into different organizational forms, we have estimated variants of the following equation:

$$ \ln(\rm VDIS_{it})=\gamma_0 + \gamma_1 \widehat{u}^{\rm TREMU}_{it} + \varvec{\lambda}'{\bf Z} + \eta_i + w_{it}, $$
(11)

where VDIS it is a measure of the degree of vertical disintegration, \(\widehat{u}^{\rm TREMU}_{it}\) are the estimated inefficiency scores, Z is a vector of time-variant controls. η i is a vector of firm dummies which should capture the effect of time-invariant firms’ unobserved characteristics, and w it is the i.i.d., normally distributed error component. The γ1 is the most important coefficient, which captures the percentage change in the degree of vertical disintegration which is due to a 1% change in the inefficiency level, ceteris paribus.

Conversely, the vertical organization of production may influence the firm’s efficiency and, in order to assess the extent of this effect, we ran variants of the following equation:

$$ \widehat{u}^{\rm TREMU}_{it} =\theta_0 + \theta_1 \ln(\rm VDIS_{it}) + \varvec{\varphi}'{\mathbf{Z}} + \eta_i + \xi_{it}, $$
(12)

where \(\widehat{u}^{\rm TREMU}_{it}, {\rm VDIS}_{it}, \eta_i\) and Z are defined as in Eq. 11, and ξ it is the i.i.d., normally distributed error component: of particular interest is coefficient θ1, which captures the percentage change in the inefficiency level due to a 1% change in the degree of vertical disintegration. We are aware that the estimation of Eq. 12 may have limitations, such as omitted variable bias and inconsistency with respect to the first step of the analysis.Footnote 8 For this reason, as a robustness check, we also estimated the parameters of the production frontier (in Eq. 10) and those of Eq. 12 following a one-step approach: the results of the one-step and the reasons for preferring the two-step estimation procedure are listed in Sect. 6.2.

The next section details the variables which were included in the frontier model and in Eqs. 11 and 12.

4 Data and descriptive analysis

This study uses an original dataset, compiled by recovering data from several sources: the list of MT producers comes from the Italian Machine Tools, Robots and Automation Manufacturers Association (UCIMU), balance sheet information are from Bureau Van Dijk’s AIDA dataset and sectoral deflators for output and inputs come from the Italian National Institute of Statistics (ISTAT). The data Appendix A explains in detail how the dataset was built and cleaned.

4.1 Description of variables

4.1.1 Variables in the frontier equation

Output (Y) is measured by the amount of revenues from sales and services at the end of the year, net of inventory changes and changes to contract work in progress; labor input (L) is measured as the total number of employees at the end of the year; capital stock (K) in a given year is proxied by the nominal value of tangible fixed assets; and intermediate inputs (M) are measured as the sum of (i) costs of raw materials consumed and goods for resale (net of changes in inventories) plus (ii) cost of services. All monetary measures are expressed in thousands of euros and have been deflated by the proper industry-level index.Footnote 9

We are aware that replacing the quantity or real measures of output and inputs with monetary values deflated by an industry-level index may generate the so-called omitted price bias (Katayama et al. 2009), while failing to account for firm-level deviations from industry-level prices may result in bias estimates of inefficiency. This fact may constrain the reliability of the estimated inefficiency scores, and a note of caution is warranted. However, in the SFM, we partially controlled for time-invariant firms’ unobserved characteristics by introducing firms’ effects. If firm-level deviations from the industry-level output and input price indexes can be considered as being time-invariant in the period under analysis (10 years), an empirical model with firms’ effects would eliminate them.Footnote 10 A similar argument may be made to compare firms with different capacities of negotiating input prices, and the subsequent acceptability of the deflated measures of capital and intermediates.Footnote 11 Summing up, although we cannot exclude the possibility that inefficiency scores partially reflect firm-specific prices in the output and input markets, we coped with this problem in the econometric framework.

All inputs and the output were normalized by mean correction before including them in logs in the production frontier; first-order coefficients of the translog production function can thus be interpreted as output elasticities for the average unit considered.

4.1.2 Vertical disintegration

We build a measure of vertical disintegration VDIS as the ratio of intermediate inputs (M) over total costs of production for the year. For the ith firm in the tth time period, this may be written as:

$$ {\rm VDIS}_{it}=\frac{C_{{\rm RM},it}+C_{{\rm S},it}}{C_{{\rm RM},it}+C_{{\rm S},it}+C_{{\rm L},it}+C_{{\rm K},it}+C_{{\rm O},it}} $$
(13)

where C RM,it is the cost of raw materials consumed and goods for resale (net of changes in inventories), C S,it is the cost of services, C L,it total personnel costs, C K,it total depreciation, amortization and write-downs (which may be interpreted as the figurative cost of capital) and C O,it is a (negligible) residual class. This ratio is an indicator of the relative share of the factors of production acquired from other firms, over all factors of production including labor and capital.Footnote 12 This measure is related to that proposed by Adelman (1955), i.e., the ratio of value added to sales, but the main advantage of our measure with respect to the Adelman index is its lower sensitivity to differences in output prices.Footnote 13 However, caution is required. First, the VDIS measure is prone to suffering from the different input prices which may be faced by MT producers. The problem should be less severe for labor and capital prices: in fact, due to the well-known salary rigidities in the Italian labor market, the focus on a single sector and the geographical agglomeration of the Italian MT builders in a few Northern Italy regions (see the data Appendix A), it is not unreasonable to assume that w it  = w jt (common salary for the same type of worker) for all firms i ≠ j; as for capital, it is reasonable to assume that variations in C K,it among firms mainly depend on the amount of machinery and equipment acquired.Footnote 14 Nonetheless, for a given level of vertical integration, finding a high level of the VDIS variable may be due to the fact that firms’ suppliers enjoy a lot of market power in selling intermediates, and we cannot control explicitly for that. However, as it has been explained in the previous section, the employment of firms’ effects in the first and second step of the empirical analysis should lessen this problem if suppliers’ market power can be assumed as time-invariant in the considered period. Second, we acknowledge that the variable may capture differences in labor-intensity across firms; however, in the second step of the empirical analysis, we have included a measure of size (scale) and a measure of the average wage, which should partially control for this issue.

The VDIS measure was included in logs in the regressions performed.

4.1.3 Control variables

Equations 11 and 12 also include a vector Z of control variables. These variables come from the theoretical literature on vertical integration and outsourcing.

Standard theory generally suggests that the decision about keeping some stages of the production process in-house or relocating them ‘outside’ (outsourcing) depends, all else being equal, on the possibility of saving on labor costs (Abraham and Taylor 1996). We therefore introduced a measure of the average wage for the ith firm in the tth time period, \(\overline{\rm WAGE}_{it}, \) as the ratio of total personnel costs over the number of employees at the end of the year. The possibility of achieving scale economies in the production of the intermediate input may also affect the decision about vertical integration; thus, we included a measure of firm size, SIZE it , defined as the total number of employees at the end of the year.Footnote 15

The literature on transaction costs and property rights suggested other determinants of the vertical integration choice, such as the degree of asset specificity and environmental uncertainty. Unfortunately, we have no information on the degree of specificity of single inputs, and we follow Antonietti and Cainelli (2007) by including the ratio of total debts to total assets at the end of the year, ASS_UNS it , which should be negatively related to the average degree of specificity of all the firm’s assets.Footnote 16 We also included a measure of volume uncertainty in the downstream market: following Lieberman (1991), volume uncertainty is measured as the sum of squared residuals between time t and time t − x of the following regression:

$$ y_{it} = \psi_{0} + \psi_{1}(t)+ \psi_{2}(t^{2})+ \psi_{3}(t^{3})+ \upsilon_{it}, $$
(14)

where y it is (the log of) the output measure and \(t=(1,\dots,10)\) is an integer increasing in each year. The measure may be defined as:

$$ {\rm UNCE}_{it}=\frac{1}{x+1}\sum^{t}_{t=t-x}\widehat{\upsilon}^{2}_{it}, $$
(15)

where x = 2 for years which go from 2000 to 2007, x = 1 for 1999 and x = 0 for 1998. Lastly, because the MT industry is characterized by cycles in the aggregate demand for MTs by its customers (such as producers of automobiles, aircraft and home appliances), as suggested by Wieandt (1994, p. 427), we introduced into the regressions a dummy for the years showing a downward trend in the aggregate value of production (DCYCLE), i.e., 2002, 2003 and 2004.

All controls (except for DCYCLE) have been included in logs in the regressions.

4.2 Descriptive statistics and industry overview

The original database contains 3,875 observations (corresponding to 505 firms) with information on output and inputs for the period 1998–2007, which were used to estimate the parameters of the frontier model and recover the inefficiency scores. It also contains 2,973 observations (401 firms) with full information on all relevant variables for the same period; this smaller sample was used for estimating Eqs. 11 and 12. The data Appendix A explains how samples were obtained.

The figures from Table 1 are in line with general statistics on the industry appearing in technical reports (see UCIMU 2007). The vast majority of producers of MTs are SMEs, in which almost 75% of producers invoices less than 13 million euros, and the top 10% invoices (at least) twice that amount. The Italian MT industry is indeed characterized by the coexistence of a small group of large firms, and a large tier of smaller firms. As emphasized by Rolfo (1993), Italian MT builders are basically single-product firms, and almost all types of products reveal the existence of niches, in which the ability to solve customers’ specific problems is essential (Wengel and Shapira 2004). The two largest product specializations are metal-cutting machinery such as machining centers and lathes, and metal-forming machinery such as presses and sheet metal deformation machinery, as confirmed by our dataset (Table 2).

Table 1 Descriptive statistics, 1998–2007
Table 2 Breakdown of firms by type of production

Table 1 indicates that Italian MT producers show high levels of vertical disintegration (0.67) on average. This evidence is in line with general patterns characterizing the broader Italian manufacturing industry, as shown by Arrighetti (1999). The comparison of the standard deviation of the VDIS measure (almost 0.12) with its average value stresses the high heterogeneity of MT producers with respect to their vertical organization choices.

5 Econometric results

5.1 First step: the SFM

The estimationFootnote 17 of the TREMU production modelFootnote 18 is presented in Table 3.

Table 3 The TREMU SFM

The λ parameter is approximately equal to 1.75, thus revealing that inefficiency actually resides in the data and supporting the adequacy of the frontier model with respect to an average production function which does not take into account the existence of inefficiency, i.e. u i,t  = 0 for all it. We can conduct some generalized likelihood ratio tests of the form \({\rm LR}=-2\left[\ln L(H_{0})-\ln L(H_{1})\right]\sim\chi^{2}_{J}\) on the estimated parameters of the TREMU model. First, we can check for the adequacy of the translog specification against the more parsimonious Cobb–Douglas form: the first row of Table 4 supports the choice of the more flexible form. Second, we can conduct a joint test of the significance of the vector of year dummies, τ t : the second row of Table 4 ensures the significance of the τ t vector. Descriptive statistics on the estimated inefficiency scores (obtained via the Jondrow estimator) for the TREMU model are presented in the last row of Table 4. MT builders are ‘on average’ quite efficient, showing a percentage of inefficiency of almost 7%.

Table 4 Generalized LR tests on parameters of the TREMU model and inefficiency scores

After having recovered the inefficiency scores, \(\widehat{u}^{\rm TREMU}_{it}, \) we use them as the measure of performance to investigate the relationship between firm’s efficiency and vertical integration in the next section.

5.2 Second step: vertical integration and efficiency

In order to investigate the relationship between firms’ efficiency and vertical integration, we start from estimating variants of Eq. 11, in which efficiency determines the vertical integration choice by means of OLS.

Bearing in mind that (see Sect. 3.1) the \(\widehat{u}^{\rm TREMU}_{it}\) scores must be interpreted as in deviation from a firm’s mean level of output (inefficiency), and in order to control for time-invariant firm characteristics which may be correlated both with the level of inefficiency and the degree of vertical disintegration,Footnote 19 we adopt the fixed-effects transformation of Eq. 11 (see Wooldridge 2002, p. 267), by inserting all other variables as deviations from their firm’s average: estimates are listed in Table 5. In specification A1, the degree of vertical disintegration is regressed on the estimated level of inefficiency and the other firms’ characteristics. Results reveal a positive relationship between the firm’s inefficiency and its degree of vertical disintegration. Higher inefficiency levels lead firms to adopt more disintegrated structures for their production processes; in particular, a 1% increase in the inefficiency level leads to a 0.59% increase in the chosen degree of vertical disintegration.Footnote 20

Table 5 Efficiency as a determinant of vertical integration

The value of coefficients referring to other variables is also worthy of comment. The relationship between size and vertical disintegration turns out to be negative, indicating that the larger MT builders are, the more vertically integrated they are. Firms with higher average wages show a lower degree of vertical disintegration, although the relationship is not statistically significant. The degree of asset specificity appears to be positively correlated with vertical integration (we recall that ASS_UNS it , the debt-to-assets ratio, is a proxy for the degree of ‘un-specificity’ of the firm’s assets), which matches previous empirical works adopting the transaction costs perspective (see, e.g., Lyons 1995). The estimated coefficient of the UNCE it variable reveals a negative relationship between the level of uncertainty in the final demand and the degree of vertical disintegration. More uncertainty leads MT producers, on average, to control a greater part of their production processes, which is consistent with the prediction of transaction cost economics (see Lafontaine and Slade 2007, p. 657). Lastly, years characterized by a downward trend in aggregate demand are also characterized by a lower degree of vertical disintegration.

Thus, after controlling for a relevant set of firms’ characteristics, higher inefficiency levels are systematically related to higher degrees of vertical disintegration. Thus, the coefficient of the \(\widehat{u}^{\rm TREMU}_{it}\) scores suggests that more integrated organizations are advantaged.

Although the results capture a systematic pattern of how firms’ efficiency levels map into different degrees of vertical integration, this cannot be interpreted as a causal effect: the results may still suffer from problems of endogeneity and reverse causation. We implemented two robustness checks to deal with this problem. First, we estimated the above specification, using the 1-year lagged values of inefficiency instead of contemporaneous values, which should reduce the endogeneity problem. Coefficient γ l1 in specification A2 is smaller than that of contemporaneous scores, γ1 in A1, although the positive and significant relationship is confirmed. Once we introduce both contemporaneous and lagged inefficiency levels, in specification A3, the former ones show a much higher coefficient, which captures almost the entire effect of inefficiency on vertical disintegration. Second (specification A4), we instrumented the current and 1-year lagged levels of inefficiency with the inefficiency level at the beginning of the period, \(\widehat{u}^{\rm TREMU}_{i,1998}, \) and the current and 1-year lagged inefficiency scores from the pooled SFM in Eq. 3, \(\widehat{u}^{\rm PSF}_{it}. \) These variables, which are well correlated with \(\widehat{u}^{\rm TREMU}_{it}\) and \(\widehat{u}^{\rm TREMU}_{i,t-1}\) (see Table 9 in Sect. 6.1), should not in fact be correlated with the VDIS it variable after demeaning. The generalized method of moments (GMM) estimator is implemented. The P value of Kleibergen-Paap rk LM test rejects the null hypothesis, thus reassuring us about the identification of the model; the Kleibergen–Paap Wald rk weak-identification test confirms that the relationship between the instruments and the potentially endogenous regressors is strong, showing a remarkably high F statistic (135.28Footnote 21); and the instruments are valid, as the Hansen J statistic, with a P value of 0.17, indicates that overidentifying restrictions are not rejected. However, the exogeneity test rejects the null hypothesis that both \(\widehat{u}^{\rm TREMU}_{it}\) and \(\widehat{u}^{\rm TREMU}_{i,t-1}\) are exogenous (P value = 0.0000); thus, the OLS estimates are inconsistent and the IV estimates must be preferred, although they show a smaller but still significant effect from inefficiency to vertical disintegration.

Summing up, these results suggest that, after controlling for the endogeneity of firms’ inefficiency, a positive effect from the latter to the degree of vertical disintegration is at work; all else being equal, more inefficient firms select more disintegrated structures. The effect amounts to an elasticity of 0.16%.

Endogeneity may be due either to unobserved time-variant characteristics related to vertical integration and efficiency, or to a true reverse causality from the vertical organization of production to the performance of the firm. In order to assess whether a reverse effect is at work, we estimate the variants of Eq. 12, after the fixed-effects transformation; the results are listed in Table 6. In specification B1, we regress levels of inefficiency on contemporaneous degrees of vertical disintegration, controlling for firms’ unobserved heterogeneity and the other characteristics. A 1% change in the degree of vertical disintegration leads to a 0.13% change in the inefficiency level, but given the endogeneity of the VDIS variable in the relationship, this evidence is only suggestive and we need further checks to be able to asses something which is nearer to a causal effect.

Table 6 Effect of vertical organization on firms’ productive efficiency

Consequently, we first regress the current levels of inefficiency on 1-year lagged degrees of vertical disintegration, VDIS i,t−1, finding that the positive relationship decreases in magnitude (specification B2).Footnote 22 Second, we instrument VDIS it with 1-year lags of the proxy of asset specificity, the measure of uncertainty and firm size. The underidentification (P value) and weak-identification tests show that the equation is identified and that the instruments are well correlated with VDIS i,t − 1 (F statistic = 22.39). The Hansen J test gives a P value higher than 0.15, indicating that the implemented instruments are valid. Given the low P value of the exogeneity test, the OLS estimates are not consistent and the IV estimates should be preferred to them, indicating an effect which is not significant from VDIS it to \(\widehat{u}^{\rm TREMU}_{it}\) in the empirical model.Footnote 23 Thus, an effect from the organization of production to firms’ efficiency is not supported by the data.

Overall, once firms’ unobserved heterogeneity, firm size, average wage, degree of asset specificity, proxies for demand uncertainty, economic cycle, and the endogeneity in the relationship have all been controlled for, the evidence provided above clearly indicates the self-selection mechanism of the most efficient firms in vertically integrated structures, while the evidence of an effect from the organization of production to efficiency is not supported by the data.

But why should the most efficient builders of MTs select more integrated structures? One explanation has been provided by the competitive markets’ approach with heterogeneous firms, such as the works by Elberfeld (2001) and Antras and Helpman (2004) in which the authors assume higher organizational fixed costs for vertically integrated firms, and higher variable costs for the intermediates faced by disintegrated firms. Holding on to this approach, the most efficient builders of MTs may choose integrated structures, and less efficient ones may choose to outsource part of their production process by buying intermediate inputs from other firms, thus reducing fixed costs but bearing higher marginal costs and staying in the market.Footnote 24 Although results may be consistent with this explanation, due to the unavailability of data on input prices for the Italian MT builders, we are not able to perform a direct and rigorous test on the mechanism suggested by Elberfeld (2001) and Antras and Helpman (2004), for example, by estimating a cost function in levels, and we simply look at that theoretical framework as a plausible interpretation for our results.

6 Robustness checks

6.1 Alternative SFMs

As mentioned in Sect. 3.1, other production models may be adopted to estimate the technical inefficiency of the Italian MT producers. We compared the TREMU model with the average production function, the PSF model in Eq. 3, the TFE and TRE model: estimates are listed in Table 7 and generalized likelihood ratio tests \({\rm LR}=-2\left[\ln L(H_{0})-\ln L(H_{1})\right]\sim\chi^{2}_{J}\) were performed to select the model which minimizes misspecification biases (Table 8). Specification M1 reports the technology parameters of an ‘average production function’ estimated via OLS (u i,t  = 0 for all it). This model can be tested against a PSF model, which explicitly takes technical inefficiency into account. The LR test of σ u  = 0 generates a χ 2 = 1.65 which supports σ u being different from 0 at the 10% level; the introduction of firms’ effects in the frontier model changes the results. The TFE model, which may be specified as

$$ y_{it}=\alpha_i+\varvec{\beta}'{\bf x}_{{\bf it}}+v_{it}-u_{it}, $$
(16)

where

$$ v_{it}\sim {\rm i.i.d.} \, N(0,\sigma^{2}_{v}), \,u_{it}\sim {\rm i.i.d.} \, N^{+}(0,\sigma^{2}_{u}), $$
(17)

and α i is a vector of firm dummies, is estimated via ML and reported in the third column of Table 7. This model allows time-invariant heterogeneity (firms’ effects) to be correlated with inputs, but the vector of firm dummies creates an incidental parameter problem (Lancaster 2000): with small T, estimates of α i are inconsistent and subject to small sample bias, and given that \(\widehat{\epsilon}_{it}=y_{it}-\widehat{{\alpha_i}} -\widehat{\varvec{\beta}}'{\bf x}_{{\bf it}},\) this bias may directly affect the estimated inefficiency scores.

Table 7 Alternative SFMs
Table 8 Generalized LR tests on parameters of SFM

The LR test in the second row of Table 8 strongly rejects the possibility that firms’ effects are jointly not significant in the model. In addition, a test of significance of the σ u parameter in the M3 model reveals that inefficiency actually resides in the data (third row of Table 8).

An alternative way of including firms’ effects in the frontier is by implementing the TRE model (specification M4); however, as stated in Sect. 3.1, TRE assumes that firm-specific heterogeneity is uncorrelated with inputs. Both TFE and TRE models may be chosen to estimate the technical inefficiency of the Italian MT builders, and we need a test in order to select the model which fits the data best. However, to our knowledge, there are no direct ways of testing between the TFE and TRE models in the context of stochastic frontiers, and we rest on an ‘indirect’ test. We estimated the TREMU in specification M5, and a strong rejection of the null hypothesis that the Mundlak terms are jointly equal to zero should be viewed as favoring TFE over TRE. Actually, this result is borne out by the test reported in the fourth row of Table 8.

Overall, the TREMU model is our favorite specification for several reasons: it allows us to control for the heterogeneity of firms, separating it from time-variant technical inefficiency, thus lessening the heterogeneity bias which affects the PSF model; like the TFE model, TREMU allows for part of the individual effect to be correlated with inputs, thus overcoming a major problem in the TRE model; this is also confirmed by the fact that technology parameters of the TREMU model result to be much closer to the ones of the TFE model than to those of the TRE model; finally, unlike TFE, which produces biased estimates of inefficiency scores, TREMU does not suffer from the incidental parameter problem.

Table 9 lists Pearson’s and Spearman’s rank correlation coefficients among estimates of the inefficiency scores. Interestingly enough, when we compute within-group deviations from the firms’ average inefficiency obtained via the PSF model, \(\widetilde{\widehat{u}^{\rm PSF}_{it}}, \) and calculate correlation coefficients with the other sets of scores, the \(\widetilde{\widehat{u}^{\rm PSF}_{it}}\) are much more in line with the inefficiency scores from the TFE, TRE and TREMU models than with those from the PSF. This evidence supports the idea (Sect. 3.1) that the inefficiency scores from TFE, TRE and TREMU should be interpreted, as being in deviation from the firm’s average inefficiency level.

Table 9 Correlation coefficients among sets of scores

6.2 The one-step approach

The one-step estimation is frequently adopted in empirical works which analyze the determinants of inefficiency, by modeling the parameters of the inefficiency distribution with third variables. Given that the estimation of Eq. 12 in a separate second step may have limitations, we also estimated the TFE model parameterizing the variance of the inefficiency distribution (half-normal) with the measure of vertical disintegration and the controls, jointly estimating the frontier parameters and the effects of third variables on inefficiency via ML estimationFootnote 25; results are shown in Table 10. The magnitudes of coefficients are not directly comparable with those of specifications B1–B3, and they should be interpreted as correlations with the variance of the inefficiency distributions; the signs are consistent with those obtained from two-step estimations. The main result is that once the (1-year) lagged measure of VDIS is included in the regression, the effect on the variance of the inefficiency distribution disappears (specification C2), which is consistent with the other-way-round effect, stemming from a self-selection mechanism. This fact reassures us about the results obtained via the IV-GMM approach.

Table 10 Effect of vertical organization on firms’ productive efficiency: TFE, one-step estimation

Moreover, the two-step estimation procedure allows us to control for the endogeneity in the relationship between vertical integration and efficiency and to compute conditional marginal effects in the second step of the analysis, which are both central issues in the paper.

7 Concluding remarks

In this paper, we study the relationship between vertical integration and firm efficiency in the Italian MT industry. The link may definitely be the result of a two-way causality, which has been neither comprehensively included in a single theoretical framework nor systematically assessed in empirical works. We empirically ground our analysis on a sample of about 500 Italian MT producers and, in order to disentangle the direction of causation, we develop the analysis in a two-step econometric framework. In the first step, we estimate technical inefficiency via a SFM, taking the unobserved heterogeneity among firms into account, in light of the latest frontier models for panel data suggested by Greene (2005). In the second step, we investigate the relationship between the degree of vertical integration and technical efficiency, by means of two equations using instrumental variables to control for endogeneity. Once firms’ unobserved heterogeneity, firm size, average wage, degree of asset specificity, a proxy for demand uncertainty and the economic cycle are controlled for, the evidence indicates a self-selection mechanism of the most efficient firms in vertically integrated structures; conversely, the evidence of an effect from the organizational mode to efficiency is not supported by the data. The result of the self-selection mechanism is robust to the use of instrumental variables in a GMM approach, which takes the endogeneity of inefficiency into account, showing an effect which amounts to an elasticity of 0.16%.

The results are relevant and may be interpreted in the light of models proposed in the literature on industrial organization and international trade (Elberfeld 2001; Antras and Helpman 2004): the most efficient builders of MTs choose integrated structures, while less efficient firms choose to outsource part of their production process by buying intermediate inputs from other firms. The most efficient firms may exploit their advantage in order to control a greater part of the production chain, maybe resting on a greater coordination among different phases (Kogut and Zander 1996), a deeper control over the innovation process and tailored intermediate inputs (Grossman and Hart 1986). This organizational form may be mostly advantaged in uncertain environments (Lafontaine and Slade 2007) characterized by fluctuations in the aggregate demand, like the Italian MT industry. Leaving some phases of the production process to ‘outside’—which has been documented as one of the most frequent business practices in the last few decades—may seem to be a rational choice for less efficient firms in order to deal with some segments of demand.

Overall, this paper contributes to better understanding the link between the vertical organization of production and firms’ efficiency, exploring—to our knowledge for the first time—both directions of causality. It also contributes to a better understanding of the functioning of the Italian MT industry, in which heterogeneous firms characterized by different levels of efficiency and organizational forms coexist.