1 Introduction

Since studies by Solow (1956, 1957), economists have agreed on the importance of technological progress as a source of growth and several analyses have been carried out at different levels of data aggregation (firms, industries, and countries) in order to empirically evaluate the role of innovative efforts. The basic idea is that a higher level of Research and Development (R&D) investments allows the introduction of new processes and/or new products or the improvement of the existing ones, so enhancing profits and/or reducing costs and, as a consequence, increasing productivity. With regards to the Italian case, technological competitiveness is still quite a way from achieving the Lisbon strategy objectives, according to which R&D investments should reach the target level of 3% of GDP. In 2007, OECD countries invested 2.3% of GDP in R&D, while the percentage was just 1.2 in Italy.

One of the main concerns in this field of research is the determination of a good proxy for technological stock. It is commonly argued that technology is a quasi-public good and that innovative activities undertaken by a firm may generate spillovers and benefit other firms. Therefore, the technology available to each firm is a result not only of its own innovative activities, but also of innovative activities undertaken by others. If, on one hand, the relevance of R&D externalities has been principally analysed in aggregate growth models (see, i.e., Romer 1990; Benhabib and Jovanovic 1991), on the other hand, some micro-econometric studies have recently started to deal with technological spillovers (Cincera 2005; Harhoff 2000; Jaffe 1988; Los and Verspagen 2000; Wakelin 2001; Medda and Piga 2004; Adams and Jaffe 1996; Aiello and Pupo 2004; Aiello et al. 2005; Aiello and Cardamone 2005, 2008). Results mainly show that R&D spillovers positively affect firms’ output, although the magnitude of the impact varies from one study to another.Footnote 1

All previous studies use the technological capital (or R&D investments) of other firms to determine the stock of R&D spillovers for each firm and mainly consider the Cobb-Douglas production function. The only exception in the use of the Cobb-Douglas specification is provided by Aiello and Cardamone (2008), who consider a linear translog production function on the basis of the assumption of a technology with constant returns to scale.

The main advantage of using the translog production function is that it does not impose the elasticity of substitution among inputs to be constant. With respect to the previous work which uses the translog specification (Aiello and Cardamone 2008), this paper relaxes the strict assumption of constant returns to scale. In fact, if R&D spillovers work as a public good, then they introduce positive externalities on firms’ costs. From a methodological perspective, the relaxing of the assumption of constant returns to scale implies dealing with a nonlinear translog production function in which the parameter relating to returns to scale may be estimated directly. To the best of our knowledge, this paper is the first attempt to estimate a nonlinear translog specification in order to analyse the role of technological spillovers.

Another relevant issue regards how R&D spillovers are measured. Following the method adopted in related literature (Griliches 1979, 1991), we consider the technological capital of other firms to be the subject of the technological transfer; we further assume that firms are not able to absorb all external technology and that absorptive capacity differs from one firm to another (Cincera 2005; Harhoff 2000; Jaffe 1988; Los and Verspagen 2000).Footnote 2 All this implies that technological spillovers can be determined as a weighted sum of other firms’ technological capital. The weighting system used in this paper is based on a similarity index which is computed on a set of firm specific variables, as was carried out in Aiello and Cardamone (2008). However, in this paper, the determination of the spillover indicator is further improved by using an index of technical efficiency computed through the Data Envelopment Analysis approach (DEA):Footnote 3 we assume that the transfer of technology across firms is related to the technical efficiency of each firm and, in so doing, we also address the issue of the relevance of direction in technological diffusion. The assumption is that the more technically efficient a firm is, the greater its capacity to absorb external technology is. We also verify whether R&D spillovers matter for firms’ production when considering the opposite assumption, i.e. there are higher R&D intensity flows between firms with lower technical efficiencies. Finally, firms’ geographical proximity is considered as another key-factor in the transmission of technology.

Using a panel of 1,203 manufacturing firms over the period 1998–2003, a system of equations, given by the nonlinear translog specification and cost shares equations, is adopted since it allows multicollinearity among regressors to be reduced and the efficiency of estimators to be improved. The equation system is estimated through the nonlinear three stage least square estimator (N3SLS) in order to take into account the endogeneity of regressors. Moreover, a two step instrumental method (IV) is used in order to correct for sample selection bias (Wooldridge 2002): in the first step a probit model which describes the firm’s decision to invest in R&D is estimated; in the second step the fitted probabilities obtained in the first step are used as instrumental variables in the translog estimation.

Overall, empirical results show that returns to scale are increasing and output elasticity with respect to R&D spillovers is always positive and significant. Furthermore, it mainly emerges that, for Italian manufacturing firms, the internal and external stocks of R&D capital are highly substitutes.

The paper is organised as follows. Section 2 introduces the procedures used to determine the different R&D spillover indicators. Section 3 discusses the production function specification and the estimation method used. Section 4 describes data. Section 5 presents the econometric results of output elasticities and elasticities of substitution between inputs. Finally, Sect. 6 concludes.

2 The determination of R&D spillovers

From an empirical perspective, one of the main problems in the analysis of the role of R&D spillovers regards the determination of technological flows between firms.

The most common approach used to determine R&D spillovers is to consider a weighted sum of other firms’ R&D capital stock. This approach requires the determination of a weighting system Ω in which each element ω ij indicates the proportion of technology produced by firm j and used by firm i. Two assumptions are made (Griliches 1979, 1991): (a) it is likely that ω ij increases when the technological distance between i and j decreases, and (b) technological distance does not depend on economic transactions.

An initial attempt to determine a weighting system using a measure of technological distance is provided by Jaffe (1986, 1988), who considers the firm’s position in a technological space. This space is divided into k technological sectors regarding patent classifications. Using the patent distribution, one way to determine the weighting system is to consider the uncentered correlation metric (see Eq. 1) assuming that the more similar the patent distribution of two firms is, the higher the technological flow between them will be. The effectively absorption of external technology for each firm depends on its capacity to identify the new available technology and to assimilate and use it in the production process, that is on the absorptive capacity. The technological absorptive capacity is the capacity of each firm to absorb new external technology and to use it in order to introduce new products and/or processes or improving the existing ones. It may depend on the set of technological opportunities, i.e. the amount of technological resources, available to each firm (Cohen and Levinthal 1989, 1990).

Many authors (Jaffe 1986, 1988; Griliches 1979, 1991; Cincera 2005; Harhoff 2000; Kaiser 2002; Aldieri and Cincera 2009) agree that absorptive capacity depends on technological proximity: the closer two firms are in technological space, the more they benefit from each other’s research efforts. The technological proximity of each pair of firms depends on how similar the firms are in terms of technology adopted and efforts made to adopt new technology. As suggested by others, these efforts are related to many factors, such as the sector in which each firm operates, the size of the firm, the number of skilled employees with respect to unskilled ones, R&D investments and expenditure on Information and Communication Technology (ICT).

Bearing in mind all previous considerations, in order to measure technological proximity between each pair of firms, we consider the uncentered correlation metricFootnote 4 computed using a set of variables. For each pair of firms (i, j) the uncentered correlation is defined as follows:

$$ \omega_{ijt} = \frac{{{\mathbf{X}}_{it} {\mathbf{X}}_{jt}^{\prime } }}{{\left( {\left( {{\mathbf{X}}_{it} {\mathbf{X}}_{jt}^{\prime } } \right)\left( {{\mathbf{X}}_{jt} {\mathbf{X}}_{jt}^{\prime } } \right)} \right)^{1/2} }} $$
(1)

where X is the set of variables which define the technological similarity between firms and t is time (1998–2003). Index \( \omega_{ijt} \) ranges from zero to one. It is zero when firm i and firm j are not related at all, while it is unity if the k-variables in X it and X jt are identical.Footnote 5 Firms which are relatively close with respect to the variables used to compute Eq. 1 are assumed to benefit more from each other’s R&D investments than firms at a greater distance from each other. In order to determine proximity in the technological space, we use variables which should be strictly related to the firms’ innovative activities, such as the numbers of skilled (with at least a high school level of education) and unskilled (just primary school) employees, investments in ICT,Footnote 6 internal and external (e.g. using university laboratories) R&D investments, and, finally, the sectoral mark-up.Footnote 7 The latter variable is included in order to consider the characteristics of the industry in which each firm operates as well as the fact that the flow of technology between firms in the same sector should be higher than that between firms belonging to two different sectors. All variables are normalised with respect to their average in order to take into account the different scales and units used in their measurement. The values of variables are expressed at 2000 real prices.Footnote 8 Table 4 of the Appendix presents the average values of technological flow intensities between each pair of sectors for the year 2003, as obtained by using Eq. 1. Average values show that firms belonging to the sectors of leather, wood, petroleum and non-metallic mineral products mainly use technology produced by other firms in the same sector. Relatively high intensities of technological flows are observed among firms belonging to the sectors of petroleum, basic metal and non-metallic mineral products, while lower technological flow intensities are found in the electrical machinery and motor vehicle sectors. Table 5 presents an example of the proximity measure given by Eq. 1 computed for twenty Italian manufacturing firms in 2003. The table shows high variability in the index of technological similarity among firms as, excluding diagonal values, ω ij ranges from 0.24 to 0.99. Moreover, it should be noted that there are relatively high intensities of technological flows among firms belonging to the same sector.

As Tables 4 and 5 show, the similarity index yields a symmetric matrix of weights, i.e. ω ijt  = ω jit . This means that the intensity of the technological flows from firm i to firm j is equal to that observed from firm j to firm i. This property of the index contrasts with the evidence that direction matters in determining how technology circulates from one firm to another. Therefore, we consider an asymmetric transformation of the similarity index based on an index of technical efficiency obtained from an application of the DEA (Data Envelopment Analysis). In other words, the similarity index is combined with the technical efficiency of each firm, measured in terms of distance from the technological frontier. This makes asymmetric the similarity index because the technical efficiency index differs across firms. The DEA is implemented bearing in mind a problem orientated to the maximisation of output and assuming variable returns to scale. The output indicator is the firm’s value added while the inputs considered are employees, book value of total assets and technological capital, as determined by using the perpetual inventory methods based on R&D investments and assuming a depreciation rate of 15%.Footnote 9 For each year, we compute four different frontiers, dividing the sample according to the Pavitt (1984) classification. Technical efficiency is then computed for each firm in the sample over the period 1998–2003.Footnote 10

The index of technical efficiency obtained from DEA is multiplied by the similarity index (Eq. 1). The underlying hypothesis is that the more efficient a firm is, the more it is able to absorb external technology. In other words, it is assumed that a firm which is close to the efficiency frontier uses technological factors properly in the productive process allowing the firm to absorb and use a higher amount of external technology. Thus, the weighting system which combines both technological similarity and technical efficiency is defined as follows:

$$ \tilde{\omega }_{ijt} = \omega_{ijt} \cdot TE_{it} $$
(2)

where TE it indicates the technical efficiency of firm i at time t. \( \tilde{\omega }_{ijt} \) is equal to 1 if the two firms, i and j, are technological similar and firm i is efficient, while it tends to zero if firms i and j are not similar or firm i is not efficient.

As a robustness check, we also consider that firms which are far from the efficient frontier would benefit more from the absorption of external technology, i.e. we compute:

$$ \tilde{\omega }_{ijt}^{1 - TE} = \omega_{ijt} \cdot (1 - TE_{it} ) $$
(3)

If proximity to the efficient frontier improves the absorption and utilisation of external technology in the production process, then, in the estimation results, we should find that spillovers computed with the weighting system given by Eq. 2 affect firms’ production more than those determined by using weights in Eq. 3. On the other hand, if distance from the efficient frontier determines more benefits in terms of adoption and utilisation of technology produced by others, then estimates should show that spillovers computed considering Eq. 2 affect firms’ production less than spillovers obtained when considering weights in Eq. 3.

Finally, since a large number of papers deal with the theoretical issues of the nexus between spatial agglomeration and knowledge spillovers (Romer 1986; Arrow 1962; Orlando 2000; Audretsch and Feldmann 2004; Koo 2005; Bottazzi and Peri 2003; Aldieri and Cincera 2009), we include the geographical dimension among factors which determine the technological diffusion.

A simple way of weighting the diffusion of innovation among firms located in different areas is to take into account the geographical distance between them. In this paper, the distance is measured using the great circle system (Maurseth and Verspagen 2002), which is based on geographical coordinations and is defined as the shortest distance between any two points on the surface of a sphere.Footnote 11 Denoting the geographical distance between the provinces where firms i and j operate by d ij , a weight of geographical proximity can be computed as follows:

$$ g_{ij} = {1 \mathord{\left/ {\vphantom {1 {\left( {1 + d_{ij} } \right)}}} \right. \kern-\nulldelimiterspace} {\left( {1 + d_{ij} } \right)}} $$
(4)

which is unity when the pair (i, j) is in the same province and tends to zero when the two firms are located in distant provincial capitals.Footnote 12

An effective indicator of technological flow intensities needs to take all of the determinants of technological diffusion, such as technological similarity, technical efficiency and geographical proximity, into account jointly. Since the closer and more similar firms are, the more they should benefit from each other’s technology, we average the indexes \( \tilde{\omega }_{ijt} \) and \( g_{ij} \):

$$ \nu_{ijt} = \frac{{\tilde{\omega }_{ijt} + g_{ij} }}{2} $$
(5)

We also consider other weighting systems which combine asymmetric technological and geographical proximities. To be more precise, we also compute a combination of measures [2] and [4] given by:

$$ \nu^{\prime}_{ijt} = \frac{{2\tilde{\omega }_{ijt} + g_{ij} }}{3} $$
(6)

and

$$ \nu^{\prime\prime}_{ijt} = \frac{{\tilde{\omega }_{ijt} + 2g_{ij} }}{3} $$
(7)

The indices are asymmetric and range from zero to one.Footnote 13 They are zero when both \( \tilde{\omega }_{ijt} \) and \( g_{ij} \) are equal to zero, i.e. firm i and firm j are both geographically distant and technologically dissimilar (or firm i is not technically efficient). Moreover, as \( \tilde{\omega }_{ijt} \) and \( g_{ij} \) cannot be greater than one, indices given by Eqs. 57 are unity if both \( \tilde{\omega }_{ijt} \) and \( g_{ij} \) are equal to one, that is when the closeness of the pair (i,j) is unity in both dimensions (technology and geography). This range ensures that firm i cannot absorb more technology than that produced by firm j and that the technological flow from firm i to firm j is not negative. With measure given by Eq. 5, we assume that asymmetric technological similarity and geographical proximity affect the flow of technology between two firms with the same intensity. When using measures given by Eq. 6 and 7, we assume that technological flows are driven by asymmetric technological similarity and geographical proximity, respectively. If both asymmetric technological similarity and geographical proximity matter, independently of the weight used for their combination, then we should find that the R&D spillovers determined by using the weights in 5, 6 and 7 have a similar impact on firms’ production.

All the weighting systems can be used to determine technological spillovers. For the ith firm and time t, the stock of R&D spillovers (Spill it ) is the weighted sum of R&D capital of the other N − 1 firms, that is:

$$ Spill_{it} = \sum\limits_{{\begin{array}{*{20}c} {j = 1} \\ {j \ne i} \\ \end{array} }}^{N} {\upsilon_{ijt} CT_{jt} } \quad {\text{with}}\;i = 1,2, \ldots ,N\quad {\text{and}}\quad \, t = 1,2, \ldots ,T $$
(8)

where υ ijt denotes a generic weighting system. Bearing in mind all previous considerations, seven stocks of R&D spillovers are computed. First of all, the spillover stock is computed considering the symmetric and asymmetric similarity approaches, i.e. \( \upsilon_{ijt} = \omega_{ijt} \), \( \upsilon_{ijt} = \tilde{\omega }_{ijt} \) and \( \upsilon_{ijt} = \tilde{\omega }_{ijt}^{1 - TE} \). Secondly, flows of innovation are weighted using geographical proximity (\( \upsilon_{ijt} = g_{ij} \)). Finally, the combinations of geographical and technological proximity (\( \upsilon_{ijt} = v_{ijt} \), \( \upsilon _{{ijt}} = v_{{ijt}}^{\prime } \) and \( \upsilon_{ijt} = v_{ijt}^{\prime \prime } \)) are considered.Footnote 14 The decision to consider just these weighting systems is due to the fact that, as we indicated above, the unweighted sum of other firms’ technological capital (i.e. υ ijt  = 1) cannot represent the true intensities of technological diffusion among firms.

3 The translog production function

This section describes the production function used to estimate the impact of technological spillovers on output. As indicated above, the Cobb Douglas production function is the most commonly used functional form. This functional form imposes that the elasticity of substitution between inputs is constant. In this paper, a translog production function (Christensen et al. 1973) is considered, and a test is carried out to see whether this choice is confirmed as being correct by data. The specification considered is that proposed by Chan and Mountain (1983) and successively corrected by Kim (1992). This specification does not require returns to scale to be constant since the relative parameter θ is directly estimated. The translog production function considered is the following:

$$ \begin{aligned} \ln Y_{it} = & \theta (\alpha + \alpha_{L} \ln L_{it} + \alpha_{K} \ln K_{it} + \alpha_{Ct} \ln CT_{it} + \alpha_{Sp} \ln Spill_{it - 1} + \xi_{T} t \\ + \frac{1}{2}\beta_{LL} \left( {\ln L_{it} } \right)^{2} + \frac{1}{2}\beta_{KK} \left( {\ln K_{it} } \right)^{2} + \frac{1}{2}\beta_{CtCt} \left( {\ln CT_{it} } \right)^{2} \\ + \frac{1}{2}\beta_{SpSp} \left( {\ln Spill_{it - 1} } \right)^{2} + \frac{1}{2}\delta_{TT} \left( t \right)^{2} \\ + \beta_{LK} \ln L_{it} \ln K_{it} + \beta_{LCt} \ln L_{it} \ln CT_{it} + \, \beta_{LSp} \ln L_{it} \ln Spill_{it - 1} \\ + \beta_{KCt} \ln K_{it} \ln CT_{it} \, + \beta_{KSp} \ln K_{it} \ln Spill_{it - 1} + \beta_{CtSp} \ln CT_{it} \ln Spill_{it - 1} \\ + \gamma_{LT} \ln L_{it} \cdot t + \gamma_{KT} \ln K_{it} \cdot t + \gamma_{CtT} \ln CT_{it} \cdot t + \, \gamma_{SpT} \ln Spill_{it - 1} \cdot t) \\ + \eta_{\text{s}} dp_{s} + \eta_{g} da_{g} + \varepsilon_{it} \\ \end{aligned} $$
(9)

for i = 1,…,N firms and t = 1,…,T years, where Y is output, L is labour, K is physical capital, CT is technological capital, Spill is the R&D spillover stock and t is a temporal index. Furthermore, dp s , with s = 2, 3, 4, are industrial dummies in accordance with the Pavitt (1984) classification, da g , with g = 1, 2, 3, are territorial dummies, and \( \varepsilon_{it} \) is the error term.Footnote 15 We consider the usual assumption of symmetry in the translog production function (Christensen et al. 1973; Berndt and Christensen 1973), so that \( \beta_{ij} = \beta_{ji} \).

Output is measured by firms’ value added. Physical capital is measured by the book value of total assets. Labour is given by the number of employees in head counts. Furthermore, for each firm the stock of technological capital is determined by current and past investments in R&D. This stock of capital is used to determine the stock of R&D spillovers that is available to each firm (equation [8]). Moreover, the stock of spillovers is 1-year lagged in order to take into account the plausible assumption that there is a temporal lag between when new knowledge becomes available and when it is applied to the production process by the firm.Footnote 16

In order to verify the validity of the choice of the translog production function rather than the Cobb-Douglas, the joint significance of parameters β, γ and δ is tested. If these parameters are jointly significant then the use of the Cobb-Douglas production function is not adequate. The contrary holds.

Following Berndt and Christensen (1973) and May and Denny (1979), Eq. 9 is estimated in conjunction with the cost-share equations. This is because the system of equations allows us to use additional information without increasing the number of parameters to be estimated (Antonioli et al. 2000). Furthermore, it improves the efficiency of estimations and reduces the multicollinearity which is suspected to be present in the Eq. 9 (Feser 2004; Lall et al. 2001; Goel 2002).

Given the assumption of profit maximising firms, the cost share equations of labour S L , physical capital S K , technological capital S CT and R&D spillover stock S SP are the following:

$$ S_{L,it} = \alpha_{L} + \beta_{LL} \ln L_{it} + \beta_{LK} \ln K_{it} + \beta_{LCt} \ln CT_{it} + \beta_{LSp} \ln Spill_{it - 1} + \gamma_{LT} \cdot t + u_{L,it} $$
(10)
$$ S_{K,it} = \alpha_{K} + \beta_{LK} \ln L_{it} + \beta_{KK} \ln K_{it} + \beta_{KCt} \ln CT_{it} + \beta_{KSp} \ln Spill_{it - 1} + \gamma_{KT} \cdot t + u_{K,it} $$
(11)
$$ S_{Ct,it} = \alpha_{Ct} + \beta_{LCt} \ln L_{it} + \beta_{KCt} \ln K_{it} + \beta_{CtCt} \ln CT_{it} + \beta_{CtSp} \ln Spill_{it - 1} + \gamma_{CtT} \cdot t + u_{Ct,it} $$
(12)
$$ S_{Sp,it} = \alpha_{Sp} + \beta_{LSp} \ln L_{it} + \beta_{KSp} \ln K_{it} + \beta_{CtSp} \ln CT_{it} + \beta_{SpSp} \ln Spill_{it - 1} + \gamma_{SpT} \cdot t + u_{Sp,it} $$
(13)

We assume a sum of input cost shares equal to one and homogeneity of grade θ, in formulae \( \sum\nolimits_{i} {{{\partial \ln Y} \mathord{\left/ {\vphantom {{\partial \ln Y} {\partial \ln X_{i} }}} \right. \kern-\nulldelimiterspace} {\partial \ln X_{i} }}} = \theta \) with \( \sum\nolimits_{i} {\alpha_{i} } = 1 \), \( \sum\nolimits_{j} {\beta_{ij} } = 0 \) and \( \sum\nolimits_{i} {\gamma_{iT} } = 0 \). As a consequence, the sum of the error terms in the Eqs. 1013 is unity for each observation, and, hence, the error variance–covariance matrix is singular. Thus, estimation of the equation system 912 yields estimates for all of the parameters.Footnote 17

4 Data source

Data used in the empirical analysis come from the 8th and 9th “Indagine sulle imprese manifatturiere” (IMM) surveys carried out by Capitalia. These two surveys cover the period 1998–2003, contain standard balance sheets and collect a great deal of qualitative information from a large sample of Italian firms. The 8th survey covers the period 1998–2000 while the 9th survey refers to the period 2001–2003. Each survey considers more than 4,500 firms and includes all Italian manufacturing firms with more than 500 workers and a representative sub-sample of firms with more than 10 workers (the stratification used by Capitalia considers location, size and sector of the firm). 1,650 firms figure in both surveys, but, after data cleaning, we obtain a balanced panel of 7,218 observations, with large N (1,203 cross sections) and small Y (6 years).

Table 1 shows a breakdown of the sample of firms in 2003. We only present data regarding the last year available as the distribution of firms by size, sector and location is time-invariant. We split the sample into R&D performing firms and non-R&D performing firms. The first group is comprised of firms with positive R&D capital. In the sample, there are 557 R&D performing firms and 646 non-R&D performing firms. With regards to the geographical location of firms, we find that about two-thirds were located in northern Italy (445 in the north west and 382 in the north east). At the 2-digit ATECOFootnote 18 industry level, the sample is dominated by firms in the textiles, basic metals and non-electrical machinery industries, while the petroleum refinery industry is represented by just 6 firms. In the case of R&D performers, most firms are located in northern Italy and are active in the textiles, non-electrical and electrical machinery industries. As far as size is concerned, a large number of firms are of small and medium size (Table 1).

Table 1 Breakdown of the sample of firms, labour productivity and factor intensity in Italian manufacturing firms by industry, area and size in 2003 (weighted average)

Table 1 also presents 2003 labour productivity and physical and technological capital intensities. Labour productivity is measured as the ratio of value added with respect to the number of employees, whereas capital factor intensity is expressed as the ratio of physical (or R&D) capital to value added. Data are the 6-year weighted average.Footnote 19

It is worth pointing out that the average value of labour productivity is 72,000 euros for the entire sample of firms and 63,000 euros for R&D performing firms. Furthermore, output per worker differs with geographical area: it ranges from 100,000 euros, for the firms operating in the centre of Italy, to 61,000 euros in the northern regions.Footnote 20 With regards size, the highest labour productivity is found in large firms, while, as far as sectors are concerned, the most productive firms belong to the paper and petroleum industries. Finally, the leather industry accounts for the lowest labour productivity.

Physical capital intensity is 1.20 for the total sample of firms and 1.25 for R&D performers; moreover, in the case of the entire sample, it is relatively high for firms located in the north-east and the south, and high physical capital intensity is also observed for R&D performing firms located in the north-east and in the centre of Italy. With regards size and industry, larger firms, firms in the food, rubber and plastic industries and R&D performers in the paper sector register relatively high values of physical capital intensity.

Bearing in mind the specific aim of this paper, the analysis of R&D capital intensity is of great interest. It is computed as the share of technological capital (CT) with respect to value added (Y). At a national level, it is 0.35 for all R&D performers; firms operating in the north west of Italy register a value (0.47) which is higher than the national average, while, in other areas, R&D intensity is lower (0.30 in north east, 0.26 in the centre and 0.09 in the south). R&D intensity differs greatly when one considers firm size: it is 0.41 for firms with more than 250 employees, 0.29 for small (11–50 workers) and 0.25 for medium sized firms (51–250 employees). Finally, intensity is high in the chemical (0.96), electrical (0.67) and non-electrical (0.40) sectors and low in the wood (0.03) and paper (0.04) sectors (Table 1).Footnote 21 To sum up, it seems, from the descriptive analysis, that there is no clear relationship between R&D expenditure and a firm’s productivity, except for the analysis based on size which indicates that larger firms seem to invest more in R&D. The lack of a marked relationship between productivity and R&D might be due to the fact that firms which operate in different industries carry out different R&D activities with different intensities. Even the divergence between North and South in terms of technological capital intensities seems less evident in terms of labour productivity.

5 Estimation method and econometric results

This section presents results regarding the output elasticities with respect to each input. Results are obtained by estimating the equation system 912. The equation system is nonlinear because of the nonlinearity of Eq. 9. It is estimated by using a nonlinear three stage least square estimator (N3SLS). Moreover, bias of the sample selection is taken into account. The sample selection issue arises because the stock of R&D capital is determined using R&D investments and, in many cases, firms do not invest in R&D (zero-investment values). As a consequence, we have a sub-sample of firms with positive values for R&D capital and a sub-sample of firms with zero values for R&D capital. The log-linearisation of Eq. 9 restricts the sample of firms to the R&D performing firms, and in so doing, forces us to work with a sample which is no longer random because it ignores the underlying process which leads each firm to invest, or not, in R&D. It can be shown that, if this underlying process is correlated with the primary equation, i.e. the translog specification, then estimates obtained disregarding this issue are biased. The selection process can be modelled by using a treatment effect model where the sample is separated into the treated (the units that participate in a programme, in this case the firms which invest in R&D) and the untreated (those which do not invest in R&D) firms, where the treatment (investing in R&D) is an endogenous process. Following Wooldridge (2002), this issue is addressed by using a two-step IV method: in the first step a probit model is considered which explains the decision to invest in R&D, and in the second step the translog production function is estimated using as instruments the fitted probabilities \( \left( {\hat{G}_{it} } \right) \) derived from the first step. Whereas all firms (R&D performing and non-R&D performing) are used in the first stage, in the second stage (Eqs. 912), only the R&D performing group is considered. This procedure is suitable for two main reasons. First of all, the usual standard errors and test statistics are asymptotically valid and, secondly, no particular specification of the probit model has to be set up (Wooldridge 2002).Footnote 22

The dependent variable of the probit model is unity if the i-th firm invests in R&D and is zero if it does not. The probit model regressors are the same as the explanatory variables of the production function (Eq. 9), plus the key determinants of the decision to invest in R&D, which are selected following the literature on this subject (Leo 2003; Becker and Pain 2003; Gustavsson and Poldhal 2003; Bhattacharya and Bloch 2004). The determinants considered are human capital, cash flow, investments in ICT, a dummy equal to unity if firm i exports and a set of dummies measuring the geographical location and the economic sector of each firm.Footnote 23

The probit estimation results are presented in Table 6 of the Appendix. Results show that the probability of investing in R&D is positively correlated with human capital and investments in ICT, as well as exports. Furthermore, the effect of R&D spillovers on the probability of investing in R&D is not significant when considering the asymmetric similarity index of Eq. 2 and the geographic proximity through Eq. 4, while it is significantly negative in the case of spillovers computed by considering higher weights for less efficient firms (Eq. 3). The latter result is in line with those obtained by Cardamone (2010) according to which R&D spillovers negatively affect the probability of introducing a product or process innovation within Italian manufacturing firms. Moreover, when considering spillovers computed in Eq. 3, the coefficient of the squared variable (lnspill 2) is significantly positive. Hence, in that case, it seems that the diffusion of technology is a disincentive in the decision to invest in R&D until a given level of spillovers is reached; after the effect becomes positive.

As regards the estimation of the translog production function, there could also be an endogenity problem, due to the fact that inputs and output will be probably determined simultaneously, as in all estimation of a production function (see, among the others, Mairesse and Hall 1996). In order to take into account endogeneity of regressors, as well as the fitted probabilities obtained in the first step, we consider the 1-year lagged value of endogenous variables (labour, physical and technical capital and their squared values) as instrumental variables in the second step.Footnote 24

From a theoretical point of view, the estimated parameters of a translog are not interpretable and, hence, only the implied output elasticities with respect to each input are discussed.Footnote 25 These elasticities are obtained as a combination of estimated translog coefficients and the average of input values (Verspagen 1995; Saal 2001).

Several tests are carried out in order to verify whether the specification chosen and the estimation method employed are appropriate. An initial test concerns the joint significance of coefficients relative to squared and interaction variables.Footnote 26 A second test looks at the constant returns to scale hypothesis. In particular, the null hypothesis \( H_{0} :\theta = 1 \) is tested against the alternative hypothesis that θ is different from one. Finally, a Breusch-Godfrey test on the serial correlation of error terms is also carried out. Results are presented in Table 2. The diagnostic tests show, in all estimations, the absence of first and second order serial correlation. Furthermore, the F-Fisher test indicates that the use of the Cobb-Douglas production function is not adequate since coefficients of the interaction and squared variables are jointly significant; the t-Student test computed on the θ coefficient also shows that returns to scale are always significantly higher than one, except for one case in which they are significantly decreasing (column 3). These findings greatly support our decision to relax the hypothesis of constant returns to scale and shed some light on the fact that R&D spillovers act as a quasi-public good that generates positive externalities.

Table 2 Output elasticities for Italian manufacturing firms. N3SLS estimations (1998–2003)

5.1 Output elasticities

The econometric results for the full sample of firms are summarised in Table 2. In column 1, elasticities are estimated by considering the symmetric index of technological similarity to be the weighting system of technological flows (see Eq. 1). Columns 2 and 3 refer to asymmetric similarity indices which combine technological similarity and technical efficiency (Eqs. 2 and 3, respectively). Column 4 regards the outcomes obtained using the index of geographical proximity (Eq. 4). The final output elasticities (columns 5, 6, 7) are obtained using the three combinations of the asymmetric technological similarity index and geographical proximity measure (Eqs. 57, respectively).

One of the first things to emerge is that all the output elasticities are positive and highly significant. As for conventional inputs, the output elasticities relative to labour and physical capital are similar to those derived from a neoclassical production function. To be more precise, output elasticity relative to labour ranges from 0.56 to 0.58 while output elasticity relative to physical capital varies from 0.17 to 0.24. Similar results are also obtained in the several estimations for the internal stock of R&D capital: output elasticity is highest and equal to 0.16 when considering asymmetric technological spillovers with higher weights for firms which lie far from the efficient frontier (column 3), while it is lowest, equal to 0.11, when considering asymmetric spillovers with higher weights for firms that are close to the efficient frontier. These findings suggest that the elasticities of output wth respect to the traditional inputs (labour, physical and technological capital) are robust to the different measures of spillovers used. Moreover, they are similar to those already presented in some papers aimed at assessing the impact of R&D capital on firms’ production (Mairesse and Hall 1996; Hall and Mairesse 1995; Harhoff 1998; Aiello et al. 2005).

The magnitude of the impact of R&D spillovers on firms’ production is high, except for cases of higher weights for firms that are far from the efficient frontier (0.036, column 3). This result suggests that firms which are not efficient would only benefit marginally from the technology produced by others, as they would need to develop an adequate absorptive capacity. The output elasticity relative to R&D spillovers is 0.15 if we consider geographical proximity (column 4), and 0.44 if we determine spillovers by using the asymmetric index of technology (column 2). It is between 0.39 and 0.43 when combining asymmetric technological spillovers and geographical proximity (columns 5,6,7). It may be noticed that the McElroy R-squared, which measures the goodness of fit of the equation system, is slightly higher in the case of estimation with asymmetric technological spillovers of Eq. 2 (column 2) than it is in the other estimations. Thus, it seems that using the asymmetric technological index which combines technological proximity and technical efficiency improves the model specification.Footnote 27

The estimates obtained in this analysis are similar to those obtained by Cincera (2005) for a sample of 625 large firms throughout the world over the period 1987–1994, while Los and Verspagen (2000) obtain a higher value (equal to 0.56) for a sample of 680 U.S. manufacturing firms over the period 1977–1981. It should also be noted that results obtained when considering different weights of asymmetric technological and geographical spillovers (columns 5,6,7) are not substantially different. This means that the method used to combine technological and geographical proximities when measuring spillover intensities does not seem to affect output elasticities. To sum up, these results confirm the hypothesis that elasticities vary according to the procedure used to weight technological flows, i.e. to the various assumptions made regarding the capacity to absorb technology produced by other firms. Elasticity of geographical spillovers is substantially lower than that observed for technological spillovers (except in the case of higher weights for firms that are far from the efficient frontier) and this finding suggests that Italian manufacturing firms seem to benefit more from technological diffusion stimulated by proximity in technological space than from that encouraged by proximity in the geographical sense. Finally, it should also be noted that returns to scale are generally increasing, except in column 3 where we observe decreasing returns to scale and a substantially lower elasticity of production to spillovers. This could be due to the fact that technological diffusion can only marginally help firms which are distant from the efficient frontier to benefit from external technology and achieve increasing returns to scale. This result suggests that, for Italian manufacturing firms, proximity to the efficient frontier may mean that there are greater benefits, in terms of production, to be had from absorbing external technology.

5.2 Elasticity of substitution

The degree of substitution among inputs is evaluated by considering the technical elasticities of substitution.Footnote 28 The determination of elasticities of substitution is limited to cases of the asymmetric technological and geographical indices (columns 2, 3, 4, 5 in Table 2).

Table 3 shows the technical elasticities of substitution, which are computed by considering the average of the variables. Furthermore, a test which verifies the null hypothesis that elasticities of substitution are equal to one is reported. Test results show that elasticities of substitution are significantly different from one.

Table 3 Technical (TES) elasticity of substitution (as a mean average of the sample) over the period 1998–2003

From Table 3 it may be noticed that a decrease of 1% in the use of labour implies an increase of 2–3% in the use of physical capital and 4–5% in the use of technological capital. These are the highest elasticities of substitution observed when considering traditional inputs. Moreover, a decrease of 1% in the use of physical capital implies an increase of 1.4–1.5% in the use of technological capital. With regards R&D spillovers computed when considering more efficient firms, a decrease of 1% in technology absorbed from other firms implies an increase of 2.6% in the use of physical capital and of 4% for technological capital. This also means that if the absorption of external R&D increases, physical capital and technological capital substantially decrease. These elasticities do not significantly change when considering the R&D spillovers computed by using the combination of geographical and technological proximities. This finding suggests that the absorption of external technology may yield large benefits if firms are technological similar and also technical efficient.

Moreover we found very different elasticities of substitution when considering higher asymmetric spillovers for less efficient firms. In particular, technological capital and R&D spillovers are only marginally substitutes, given that the relative coefficient is relatively low. This means that if the stock of external R&D decreases, the internal technology increases but only marginally. Moreover, a decrease of 1% in the use of labour, physical capital or technological capital determines an increase in the use of spillovers of 15.8, 6.6 and 4.5%, respectively. This result suggests that, in the case of contingent difficulties, inefficient firms could be stimulated to improve their absorptive capacity in order to obtain more benefits from external technology.

6 Concluding remarks

The aim of this paper is to assess the impact of R&D spillovers on the production of Italian manufacturing firms. With respect to the related literature, it introduces two main improvements in the empirical specification. The first deals with the functional form to be used in modelling the impact of R&D and the second concerns the use of an index of technical efficiency to refine the determination of R&D spillovers.

As far as functional form is concerned, we use a nonlinear translog production function which allows the estimation of returns to scale. With regards to R&D spillovers, we propose an asymmetric transformation of the uncentered correlation based on a technical efficiency index, determined by considering the DEA approach. The underlying assumption is that the flow of innovation between firms is related to the technical efficiency of each firm. Moreover, we also take into account the technological diffusion among firms due to geographical proximity.

Using a data panel of 1,203 manufacturing firms over the period 1998–2003, we estimate the nonlinear translog specification through the 2-step IV estimator in order to take into account both sample selection and endogeneity issues. In the first step, the selection process that leads firms to invest or not in R&D is modelled. In the second step, a system of the nonlinear translog and cost-share equations is estimated using the nonlinear 3SLS estimator.

Results show that returns to scale are mainly increasing. Furthermore, it emerges that output elasticity to R&D spillovers is always positive and that different methods of measuring of spillovers bring about different effects of internal and external R&D stocks on the firm’s output. Even though this analysis does not allow it to be established which indicator best represents technological flow between firms, it is worth noting that if we consider the asymmetric similarity index determined by combining technological proximity and technical efficiency, the McElroy R-squared is slightly higher than in the other cases. If technological flows are weighted by the average of the asymmetric technological index and the geographical proximity measure, the output elasticity relative to R&D spillovers is equal to 0.4. Aiello and Cardamone (2008), who adopted a linear translog specification, found similar results with regards output elasticity relative to R&D spillovers. However, they found that output elasticity relative to traditional inputs is very sensitive to the spillover indicator considered. Their result is probably due to the assumption of constant returns to scale which means that the sum of output elasticities has to be equal to one. Results also show that the effect of R&D spillovers is greater if the absorptive capacity is assumed to be higher for more efficient firms. This outcome suggests that technological absorption capacity could be improved by technical efficiency in enterprises’ production processes.

As far as elasticities of substitution between inputs are concerned, in the case of higher R&D flows for efficient firms, the technical elasticity of substitution between technological capital and R&D spillovers is relatively high. Hence, from results regarding output elasticities and elasticities of substitution, it seems that the more efficient and technological similar firms are, the more they are able to absorb and use technology produced by others in their production processes.

In terms of policy implications, the low level of R&D intensity observed in Italy suggests that public intervention in the promoting of R&D activities is required. Indeed, lowering R&D costs would cause an increase in production and the adoption of technology and, therefore, an improvement in firms’ performances. Hence, policy interventions aimed at stimulating Italian R&D investments and innovation could help Italian firms narrow the competitive gap with respect to other developed countries.