1 Introduction

Modern economies adjust to the new and continuously changing technological and knowledge-based environment mainly through knowledge creation and innovation. Therefore, today, innovation is a key reason of economic and regional growth. The production of new knowledge at regional level is inspired and facilitated not only by the main R&D inputs, researchers and R&D expenditures, but also by the global knowledge. First, the productivity of a Regional Innovation System (RIS) increases with knowledge being transferred by agents who compose the particular RIS, intra-regional spillovers. Marshall (1890) supports that specialization of regional economic activity, clustered in the same industry, empowers knowledge diffusion between firms. On the other side, Jacobs (1969) supports that developments coming from different but complementary fields of knowledge enhance knowledge diffusion and, consequently, innovation is facilitated by an industrially diverse environment. Second, literature has also examined the inter-regional knowledge spillovers, through the role of physical (geographical or spatial) and technological proximity between regions on knowledge spillovers.

Spatial proximity has been verified by literature as a significant factor in intra regional (Jaffe (1989) Jaffe et al. (1993), Acs et al. (1994) and inter regional knowledge spillovers (Bottazi and Peri 2003). Technological proximity, agents neighboring in the technology space, is important factor in the explanation of cross firms (Jaffe 1986) and cross regions knowledge spillovers (Maurseth and Verspagen 2002; Bottazi and Peri 2003).

The aim of this paper is to investigate the effect of intra-regional and inter-regional knowledge spillovers on a regional knowledge production function, using data for 210 EU NUTS-2 regions for the period 1995–2006. We adopt Jaffe’s (1988) concept of technological space at cross regional level analysis to capture the impact of technological proximity and the physical contiguity to measure the impact of geographical proximity on inter-regional spillovers. This paper moves forward the previous analysis with the inclusion of a synergetic term of these two measures in order to put into further scrutiny the role of inter-regional spillovers. We argue that the simultaneous consideration of the geographical and technological proximity significantly influences the marginal effects of these independent variables on knowledge creation. Finally, we construct an interaction term, consisted of the regional technological diversity and the cross regional technological proximity, in order to investigate how the diversity of regional innovation production influences the role of technological proximity on knowledge transfer across regions.

Our findings indicate that both diversity and specialization does matter. More precisely, when a region is geographically close to strong innovative systems then the knowledge spillovers are stronger when the pair of regions innovates in different technological fields. But when a region is spatially close to weak innovative systems then the knowledge spillovers are stronger when the two regions innovate in similar technological fields. Our results also indicate that a region with low technological diversity receives more knowledge from regions contiguous in the technological space.

The rest of the paper is organized as follows. In Section Two, we discuss the theoretical issues concerning the diffusion of knowledge and we present the research hypotheses. Section Three gives a short description of the methodology and reports the data. Section Four, presents the empirical results and the discussion. Finally, Section Five presents a conclusion along with some policy implications.

2 Theoretical background and hypothesis setting

Region has increasingly been the unit of the economic analysis for new knowledge production. The main reason for this trend is the existence of the agglomeration economies, which usually cannot be counted at national level. The existence of similar or diverse activities at regional level may significantly influence the creation and diffusion of knowledge. Besides, at European Union level, knowledge agents located in a region may exchange knowledge with agents in neighbouring regions of a different country. Therefore, the examination of the impact of technological proximity, technological diversity and geography is more essential at regional rather than at national level.

Doloreux (2002) defined the regional innovation system (RSI) as a social system which involves interactions between sets of agents (private and public) in a systematic way in order to increase and enhance the localized learning capabilities of a region. He underlined four basic elements comprising an RSI: firms, institutions, and knowledge infrastructure and innovation policy. Autio (1998) partitioned the RSI into two sub-systems: The sub-system consisting of those who generate and diffuse knowledge (universities, public research institutions, liaison offices, etc.) and the sub-system consisting of those who apply and exploit knowledge (industrial companies and their customers, collaborators, contactors, and competitors). Communication between the two sub-systems includes among others the transfer of knowledge.

The output of the innovative activity of an innovative unit i (firm, region or country) at time t, X it , is a function of the inputs (Z it ) supplied to the unit i and the stock of knowledge (K t ) available in the public domain at time t (Griliches 1979):

$$ X_{it} = f\left( {Z_{it} ;K_{t} } \right);Z_{it} = [R_{i,t - k} ,H_{it} ] $$
(1)

where f(.) is the knowledge production function (KPF). The vector of inputs Z it includes the stream of R&D expenditures (R i,tτ ) during the past τ years (T ≥ τ≥0); and the stock of human resources (H it ) (e.g. scientists) at time t. The stock of knowledge K t available to all innovative units at time t, is the cumulative output of the knowledge production function up to time t − 1:

$$ K_{t} = \sum\limits_{i = 1}^{N} {\sum\limits_{\tau = 1}^{T} {X_{i,t - \tau } } } $$
(2)

where Ν is the number of innovative units and K t constitutes the public domain available to all innovative units. K t measures the knowledge spillovers.

Based on concept of the knowledge production function, empirical literature has assumed that the production of innovation at regional or firm level is the result of inputs and the knowledge spillovers from other regions or businesses. Using three basic inputs of innovation (R&D expenditures, human capital and social capital or tacit knowledge), Tappeiner et al. (2008) found that these inputs exert a statistically significant and economically substantial impact on the level of innovation of a region. Guellec and Pottelsberghe de la Potterie (2004), estimated the contribution of several sources of knowledge (business R&D, public R&D and foreign firms R&D) in productivity growth.

The literature argues that the production of innovation in a region or in a business is not only a result of inputs mentioned before, but also a result of knowledge which diffused by other regions or businesses. Romer (1990), assumed that the increase in knowledge level A j of an agent j depends on the human capital accumulated by the agent and the stock of knowledge A available in the society:

$$ \dot{A}_{j} = \delta H^{j} A $$

where \( \dot{A}_{j} \) is the creation of new knowledge by the agent j, H j is the human capital of the agent j and A is the aggregate stock of knowledge. In this paper j stands for regions. Hence, the creation of new knowledge by a region j depends on the aggregate stock of knowledge. The aggregate stock of knowledge consists of the stock of knowledge accumulated in region j, H j, and the stock of knowledge accumulated by the other regions, \( \mathop \sum \limits_{k = 1}^{N} A_{k} \), \( k \ne j \). Romer, in his article, made the assumption that A is non-rivalrous and non-excludable and consequently assigned equal weights to all A k . However, knowledge diffusion at the very least involves a cost related to knowledge assimilation. As stated by Hischleifer and Riley (1992, p. 276) “Only rarely does mere disclosure suffice to convey a message; and there are types of information, whose nature and value are not transparent, that are hard for the users to absorb even when offered freely.” Thus, an agent may not assimilate new knowledge produced by another agent if its absorptive capacity is insufficient. Absorptive capacity is the ability of an agent to identify, assimilate and exploit knowledge from the environment (Cohen and Levinthal 1989) and then adapt it into the processes and routines that are specific to the agent (Criscuolo and Narula 2002). Thus, we adjust Romer’s equation to incorporate weights which are, firstly, related to the absorptive capacity of the receiving region and, secondly, to the communication channel of the transfer of knowledge from the sender to the recipient region: \( \mathop \sum \limits_{k = 1}^{N} w_{jk} A_{i} ,\quad k \ne j \) where k is the sender region and j is the recipient. If \( w_{jk} = 1 \) the recipient region j has the appropriate absorptive capacity to receive knowledge from the region k and the communication channel between the two regions does not produce errors or noise. If \( w_{jk} = 0 \) the recipient region j does not receive any knowledge from region k either because its absorptive capacity is not considerable or the communication channel is closed or both.

At regional level knowledge spillovers could be classified into two types: (a) intra-regional spillovers, spillovers between agents located in the same region, reflecting the structure of the regional innovation system and (b) inter-regional spillovers, spillovers between agents located in different regions.

We define the impact of a region’s own stock of knowledge on regional innovation output as intra-regional spillovers and the impact of the global stock of knowledge, \( \mathop \sum \limits_{k = 1}^{N} w_{jk} A_{i} ,\quad i \ne j \), as inter-regional spillovers. Intra-regional knowledge spillovers are present when agents, e.g. firms and research institutes, located in the same region exchange knowledge. Two theoretical tracks have been developed to explain the intra-regional knowledge spillovers: the first track is based on the Marshall (1890) analysis and the second follows the Jacob’s (1969) analysis. The difference between the two tracks is associated with the impact of diversification of the regional technological or economic activity on knowledge creation.

According to the first theoretical track, firms belonging to the same sector, efficiently decode and exploit sectoral knowledge. Therefore, regional specialization or concentration of the innovative activity in few (or one) technological fields amplifies the knowledge spillovers and subsequently the new knowledge production. Porter (1990), supportive to Marshallian theory, argued that knowledge spillovers are enhanced within specialized and locally clustered industries. Marshall and Porter considered the knowledge diffusion across different industries almost trivial.

Jacobs (1969), on the other hand, argues that a wide range of industries in a region or city increases the knowledge transfer between agents of different industries and leads to the growth of the regional innovative activity. For Jacobs diversity facilitates innovation through the positive externalities of knowledge spillovers.

The previous empirical testing of these theoretical tracks does not provide clear evidence, even though the majority of the findings support diversity. Glaeser et al. (1992) and Feldman and Audretch (1999), using data from the USA, found that localized industries with diverse but complementary activities are closer to new ideas and innovations than specialized industries. Audretsch and Feldman (1996), also using data from the USA, found that innovative output is located and clustered in industries where knowledge spillovers and new economic knowledge are prevalent. Henderson (1997) argued that even though a big number of US industries are clustered in large cities, these cities’ innovative activity is supported by diversity.

Paci and Usai (2000), found a positive relationship between industry specialization and innovative output at NUTS-1 regional level of European Union and Greunz (2004) reached at the same conclusion using data at NUTS-2. On the other hand, Ouwersloot and Rietveld (2000) and Van Oort (2002), using regional data from the Netherlands, recognized the positive correlation between regional diversity and innovative activity. Fritsch and Franke (2004) investigated the role of knowledge spillovers and R&D cooperation on innovation in three German regions. Comparing the R&D expenditures between manufacturing firms in the same industry and in different industries, they found that the production of innovation made by firms in the same industry is the only important source of knowledge spillovers. Moreno et al. (2006) tested for the diversity impact on the regional innovative activity for 175 European regions. They found that specialization is more conducive to agglomeration of innovation in the same sectors than diversity. Fritsch and Slavtchev (2010) come to balance the two conflicting tracks as they investigated the relationship between regional innovation efficiency and diversity across German regions, using a Gini index to measure diversity. They found an optimum level of industrial diversity beyond which an increase in diversity may have a negative impact on innovative efficiency. Finally, Ponds et al. (2010) investigated the impact of regional academic research spillovers on regional innovation activity in Netherlands. They found that the collaborating networks between universities and firms facilitate knowledge spillovers even though these spillovers are geographically bounded.

We assume that each technological field is a system comprising of agents and inter-relations between them. At regional level, these technological fields constitute sub-systems of the regional innovation system. Sometimes, the solutions to technological, and only problems, could not be found inside the sub-system of the technological field. The answers to problems that arise in a sub-system could not come either from its agents per se or from the inter-relations between these agents. In such cases, the answers may reside outside the sub-system, in the meta-system (in the terminology of L. Wittgenstein). Thinking beyond the boundaries of the firm and of the technological field may be more creative in dealing with technological problems. Therefore, the regional technological diversity offers to the agents of the various technological fields (sub-systems) the ability to assimilate knowledge from other sub-systems and then adapt it into the processes and routines that are specific to the agent. Thus, we posit the first hypothesis:

Hypothesis 1

The higher the regional innovation diversity the higher the regional innovation productivity.

The investigation of the knowledge transfer across agents located in different regions (inter-regional spillovers) includes the analysis of the factors that influence this transfer. Significant part of the literature found that knowledge spillovers are strongly affected by the geographical proximity between the knowledgeable agents. According to Doloreux (2002), the spatial proximity is important for knowledge spillovers because: (a) the spatial agglomerations of firms enhance the regional interactive learning, (b) the cost of knowledge and information transfer falls since the communication channels between the agents produce limited errors or noise, and (c) the tacit knowledge is transferred without semantic difficulties since the agents share a common social and economic environment. Maurseth and Verspagen (2002), using data on patent citations, analyzed the knowledge transfer in a sample of European regions. They found that patent citations increase with geographical proximity between the inventors of the citing and cited patent. Bottazi and Peri (2003) estimated the impact of R&D externalities on generating profitable new ideas using R&D private inputs available in region i and global knowledge coming from region i and other regions. They found that cross regional knowledge spillovers increase with spatial proximity between regions. Tappeiner et al. (2008) examined different levels of innovation and their spatial distribution in 51 European regions. They found that the higher the spatial proximity of innovation resources the higher innovation activity. Consequently, all these studies implicitly assumed that the weight w in inter-regional spillovers depends on spatial proximity.

However, geographical proximity it is not sufficient to enhance knowledge spillovers across regions. If the regional agents are not sufficiently knowledgeable the absorption and exploitation of the knowledge coming from outside would be limited or in the extreme case negligible. Cohen and Levinthal (1990) introduced the concept of individual firm’s absorptive capacity as an important factor influencing knowledge spillovers. Each region builds up its capability to decode and exploit the new knowledge created in other regions. This capability is close to the concept of absorptive capacity. Caragliu and Nijkamp (2012) extended the concept of the absorptive capacity at country and regional level and they found that if a region faces a low absorptive capacity then the region receives knowledge diffused only by its surrounding areas.

A region absorbs knowledge not only when it is created in physically contiguous regions but also in regions, which innovate in similar technological fields. If regions innovate in different technological fields it might not advance for knowledge diffusion. Maurseth and Verspagen (2002), following a similar methodology to Jaffe (1988) at regional level, found that that the patents of pairs of regions that innovate in similar technologically fields are more frequently cross cited than other pairs. Greunz (2003) investigated knowledge diffusion in a two-dimensional space, geographical and technological proximities, on the new knowledge creation in 153 European regions. Her results show that when technological proximity is weighted by a geographical decay factor the technological knowledge spillovers increase. Moreno et al. (2005) investigated the spatial distribution of innovative activity and the importance of knowledge spillovers in a KPF context across 175 European regions. They constructed a contiguity and distance matrix weighted by the technological distance matrix. Their results show clearly that innovative activity is getting enforced with the simultaneous existence of technological and geographical proximity.

Regions have higher capacity to absorb the knowledge produced by their technologically neighbor regions and therefore we expect that the inter-regional knowledge spillovers increase with technological proximity. But the simultaneous consideration of physical and technological proximity may change the impact of technological proximity on inter-regional knowledge spillovers. The knowledge recipient regions which are contiguous to strong innovative sender regions absorb more knowledge the higher (or lower) the technological distance between them.

On the other hand, when a recipient region is contiguous to weak innovative sender regions, it absorbs more knowledge from the sender the lower (or higher) the technological distance between them. Thus, we posit the following hypotheses.

Hypothesis 2a

The cross regional spatial proximity has a positive effect on regional innovative activity.

Hypothesis 2b

The cross regional technological proximity has a positive effect on regional innovative activity.

Hypothesis 3a

If the region is spatially contiguous to weak innovative regions then it absorbs knowledge from other regions that are technologically distant or neighbors.

Hypothesis 3b

If the region is geographically contiguous to strong innovative regions then it absorbs knowledge from regions that are technologically distant or neighbors.

In the case of intra-regional knowledge spillovers, we argue that the correlation between technological diversity and knowledge transfer is positive, as suggested by Jacobs. In the case of inter-regional spillovers does the same argument applies too? In other words does diversity or specialization enforces the technological knowledge spillovers across regions? Or this regional diversity is important in the knowledge spillovers between regions that are technologically close or distant? If two regions innovate in different (similar) technological fields, does technological diversity or specialization of the recipient region advance the inter-regional knowledge transfer? Thus, we posit our fourth hypothesis.

Hypothesis 4

The impact of the technological distance on knowledge transfer between a recipient and a sender region is conditioned on the technological diversity (specialization) of the recipient region.

3 The model and data

To test our hypotheses, we estimate the knowledge production function (KPF) (Eq. 1) at regional level. Based on Griliches (1990) arguments, patents have been extensively used as a proxy for the output of knowledge production function. Besides, Acs et al. (2002) compared the literature-based innovation output measures at regional level (as they were published in trade and technical journals) and the “patent” as direct indicators of “innovation”. They found that the number of patents is a reliable output measure of the innovative activity. Bottazi and Peri (2003), also note that “Although not all inventions are patented, the patented ones have to fulfill minimal standards of novelty”, originality and potential use. Therefore, patents can be considered as a good approximation to ideal data on “economically profitable ideas” which one would like to have for testing theories on innovation” (p. 692).

However, in the relative literature, we found some limitations and reservations as regard the safety of this measure. For instance, According to Pakes and Griliches (1980, p. 378), “patents are a flawed measure (of innovative output) particularly since not all new innovations are patented and since patents differ greatly in their economic impact.” Hall et al. (2001) argued that patents are good indicators of new technology creation but they do not measure the economic value of these technologies. In this paper we discuss more extensively the drawbacks and limitations in choosing patents as a measure of innovative activity in the data section below.

The vector of inputs consists of the number of researchers in science and technology and the expenditures in research and development. We use annual data on Human Resources in Science and Technology (HRST) as percentage of total population as a proxy of the first input and the total intramural R&D expenditure (GERD) measured as euro per inhabitant as a proxy of the second input. The vector of global knowledge, K t , is used to measure the intra-regional and inter-regional spillovers.

The hypothesis 1 is tested by using the Shannon’s entropy index as a measure of the technological diversity. We calculate the Normalized Shannon’s Entropy Index (NEI) (Kumar et al. 1986):

$$ NEI_{j} = \frac{1}{lnN}\mathop \sum \limits_{i}^{N} p_{ij} log\left( {\frac{1}{{p_{ij} }}} \right) $$

where p ij is the patents share of the technological field i in the total innovation activity of region j. N is the number of technological fields, i = 1, 2, …, 121, j is the number of regions, j = 1, 2, …, 210. The NEI takes values from 0 to 1. If NEI j  = 0 the region produces innovations in one field only. If NEI j  = 1, the regionally produced innovations are equally distributed across the 121 technological fields. If the first hypothesis is valid then we expect that the NEI (technological diversity) is positively correlated with the regional innovation activity (number of patents).

Hypotheses 2a and 2b refer to inter-regional knowledge spillovers. Hypothesis 2a is tested using the geographical proximity between the j regions, which has been extensively used in literature. We construct a quadratic matrix GD for the 210 regions. Each element GD jk of the GD represents the geographical proximity between regions j and k. It takes the value 1 if the two regions are neighbors and 0 otherwise. Next, we multiply the GD matrix with the vector of patents P for each year. The product of this multiplication gives the vector of the global knowledge for region j weighted by the geographical proximity for year t. This variable is denoted as (GDS) in our dataset and varies across regions and time. Bottazi and Peri (2003) used a similar index, the matrix of geographical distances multiplied with the vector of ln(R&D) resources. We regard patents as a more immediate measure of knowledge than the R&D expenditures because R&D expenditure is an input in KPF whereas a patent is the output (Griliches 1990). GDS j indicates whether the region j is contiguous to strong innovative regions (GDS j takes high values) or to weak innovative regions (GDS j takes low values). Since geographical proximity has been used extensively by the literature as a proxy for the tacit knowledge spillovers and research collaboration between institutes closely located, we expect that these knowledge spillovers are significant mainly in the case where the geographically contiguous regions are strongly innovative. Therefore, the estimated coefficient is expected to be statistically significant and positive.

The second hypothesis (2b) is tested using the Romer’s A, the global stock of knowledge, weighted by technological proximity. We use the technological proximity (TP) between regions as measured by the Jaffe’s (1988) technological distance calculated using the patent applications to EPO, disaggregated into the 121 technological fields of the IPC:

$$ TP_{jk} = \left( {f_{j} \times f_{k} '} \right)\left( {f_{j} \times f_{j} '} \right)\left( {f_{k} \times f_{k} '} \right)^{ - 1/2} $$

where \( f_{j} = \frac{{P_{i}^{j} }}{{\mathop \sum \nolimits P_{i}^{j} }} \) and \( f_{k} = \frac{{P_{i}^{k} }}{{\mathop \sum \nolimits P_{i}^{k} }} \), \( P_{i}^{j} \) is the number of patents of technological field i in region j and \( P_{i}^{k} \) is the number of patents of technological field i in region k. Since the number of technological fields is N, there are two vectors: \( \left[ {N \times 1} \right]_{j} \) and \( \left[ {N \times 1} \right]_{k} \) and the angular separation between \( f_{j} \) and \( f_{k} \) leads to the quadratic matrix TP. The element TP jk of TP is a positive correlation index which takes values in [0, 1]. If TP jk tends to 0, the regions j and k contact research in different technological fields. On the other hand, if TP jk tends to 1, the regions j and k conduct research in similar technological fields. Finally, if TP jk  = 1, the regions j and k conduct research in exactly the same fields.

We multiply TP with the vector of patents P, the global stock of knowledge. The result is the vector TPS, whose elements could be regarded as each region’s global knowledge weighted by technological proximity. In Cohen and Levinthal (1989) terminology, we consider TPS as a proxy for the regional absorptive capacity. A TPS j  = 0 is an indication of low absorptive capacity since the region conducts research completely different from those regions which produce high volume of knowledge. In such a case, the “global” knowledge for the region j is usually lower than the global knowledge (the Romer’s A), the unweighted sum of the elements of the vector P. A high TPS j is an indication that the region j is technologically contiguous to strongly innovative regions and, therefore, the region j will absorb more knowledge from the public domain. Hence, the estimated coefficient of TPS is expected to be statistically significant and positive.

To test the third hypothesis, that is the synergetic effect between technological and geographical proximity, we use the interaction term (INT 1 ), the product of GDS and TPS. If INT 1 is statistically significant we might conclude that an interaction effect exists. Finally, the fourth hypothesis is empirically tested via the use of a second interaction term (INT 2 ), the product of NEI and TPS.

Concluding, the knowledge production function of a region j is represented by the following Cobb-Douglas type KPF:

$$ Q_{j} = E \cdot RS_{j}^{{a_{1} }} \cdot RD_{j}^{{a_{2} }} \cdot e^{{a_{3} \cdot DIV + a_{4} \cdot \log TPS + a_{5} GDS + a_{6} INT_{1} + a_{7} INT_{2} }} $$
(3)

The time index is omitted for simplicity reasons. The variable Q j is the total number of patents in region j, RS is the number of researchers in science and technology, RD is the research and development expenditures, DIV is the regional technological diversity measured by NEI, TPS is knowledge spillovers weighted by technological proximity and GDS is knowledge spillovers weighted by geographical proximity. The variables DIV, TPS and GDS are used to capture the various spillover effects for region j. DIV captures intra-regional spillovers whereas TPS and GDS capture inter-regional spillovers. Finally, E could be treated as an estimation of the regional leveraging capacity-efficiency (LCE). Taking logarithms, equation (3) becomes:

$$ \log Q_{j} = \log E + a_{1} \log RS_{j} + a_{2} \log RD_{j} + a_{3} \log DIV_{j} + a_{4} \log TPS_{j} + a_{5} \log GDS_{j} + a_{6} INT_{1j} + a_{7} INT_{2j} $$
(4)

which is the general model to be estimated.

We use a panel data set of 210 European regions at the NUTS-2 level for the period 1995–2006. Our dataset offers a highly diverse landscape since some countries are consisted only of two regions (Bulgaria) and others, such as Germany, are consisted of 37 regions. The innovative activity’s distribution is uneven since some regions present high innovative activity and some others very low, as it is shown in Table 1 (minimum and maximum values of patents). However, almost all the 210 regions increased their innovative activity in the period of 12 years of our sample. Therefore, since there is a variation of innovative activity both across time and across regions, we decide to use panel data analysis.

Table 1 Descriptive statistics

The data are provided by Eurostat. NUTS-2 regions are administratively independent from each other; they may all participate independently in the programmes of the European Union and may even apply their own tax and economic policies.

We refer to the patents according to their application date rather than their issue date. The application date represents the time at which new knowledge is actually made public. Moreover, in most patent offices, there is a usually long and random time delay between application and issuing of a license (Jaffe 1986). The regional distribution of patent applications is assigned according to the inventor’s country or region of residence. If an application has more than one inventor, it is divided equally among all inventors and, subsequently, among their countries of residence. Thus, double counting is avoided. When a patent corresponds to multiple technological fields, it is assigned to the first code of the International Patent Classification (IPC) indicated in the patent.Footnote 1 One of the drawbacks of using patent applications at EPO may be that, since EPO acts as single intermediary to participating countries, the cost of double patenting (national and European) may prevent some small firms from demanding an extension of the national patent to European level. On the other hand, as stated by Maurseth and Verspagen (2002), patents filed at the EPO are more likely to have higher average quality. Furthermore, as pointed out by Cefis and Orsenigo (2001), EPO uses a highly homogeneous procedure, so the data are readily comparable to perform comparative analysis. Also, some large corporations with subsidiaries in the EU tend to patent in the USPTO and off shore IP holding companies. Of course, they may have significant knowledge spillovers in terms of new management practices, technologies etc. Also EU patent activity may also reflect the national state support systems since some national agencies co-finance the patent applications by firms increasing that way the number of patent applications. By and large, it is clear that patent is not the perfect measure as an output of a knowledge production function. However, it still remains a fairly good indicator of innovative activity.

Table 1 reports some descriptive statistics and Table 2 reports the correlation between the explanatory variables. Since there is no strong positive or negative correlation between any pair of independent variables, there are no multicollinearity issues in our estimations.

Table 2 Correlation matrix

4 Empirical results and discussion

The dataset is balanced since every region is steadily observed for every single year. Estimates are carried out by applying panel data methodology. Due to the lack of space, we only present the fixed effects (FE) estimations as suggested by the Hausman and Breush-Pagan tests.

The results of the estimates are shown in Table 3. We present six different fixed effects models all of which include the main KPF inputs, the researchers and the R&D expenditures, but differ in the inclusion of the knowledge spillovers and the interaction terms. Model 1 presents the base regional KPF with the two inputs only, R&D expenditures (RD) and researchers (RS), without spillover effects. Model 2 introduces the impact of intra-regional spillovers into KPF by including the variable NEI. Model 3 introduces the impact of inter-regional spillovers by including the TPS and GDS. Model 4 includes the interaction term INT 1 . Model 5 incorporates the simultaneous impacts of intra and inter-regional spillovers. Finally, the second interaction term, INT 2 , is included in Model 6.

Table 3 Panel Estimation results

4.1 Control variables

The dependent variable, the logarithm of patents, is positively correlated with the KPF inputs, RS and RD. The elasticity of RD is statistically significant in all models and ranges from 0.53 (Model 1) to 0.38 (Model 6). However, RS is not statistically significant in all models. This finding is similar to previous findings. Bottazi and Peri (2003), measuring R&D (employment and real spending (private and public) in research and development) in 86 European regions, find a positive and statistically significant correlation between R&D and patents production in each region. Tappeiner et al. (2008) found that the traditional factors of innovative activity, such as expenses on R&D and human capital, in spatial clusters increase the patenting activities. Fritsch and Slavtchev (2010), willing to investigate the efficiency of regions by relating regional R&D inputs with output (patents), they found this relation explicitly positive.

The introduction of knowledge spillovers reduces the impact of R&D expenditures on innovation output since the estimated coefficients for both inputs are much higher in Model 1 than in the other five models. This is an indication of the role that knowledge spillovers play in new knowledge production. Not only are the regional resources (inputs) necessary for new knowledge production but also the global knowledge. Hence, the establishment of a mechanism to facilitate the knowledge transfer both inside the regional innovation system and across the regions will improve the efficiency of the RIS by assimilating a higher share of the global knowledge. The influence, however, of the knowledge spillovers on the two inputs estimated coefficients is not similar. The estimated coefficient of the RD falls from 0.53 in Model 1 to 0.35 and 0.38 in Models 5 and 6 correspondingly. On the other hand, the coefficient of the RS falls from 0.74 in Model 1 to 0.15 and 0.30 in Models 5 and 6 correspondingly. This difference might be attributed to the fact that global knowledge may actually be a closer substitute to researchers than to R&D expenditures. However, the estimated coefficient of the RS in model 6 suggests that this substitutability is not perfect. The degree of substitutability between R&D expenditures and global knowledge is smaller since the region saves funds from knowledge already existing in the global public domain.

4.2 Returns to scale

The returns to scale with respect to RD and RS, the main inputs of the KPF, are increasing in Model 1, are almost constant in Model 2 and decreasing in the models with knowledge spillovers. The returns to scale are equal to 0.68 in the last model after the introduction of both intra and inter regional spillovers. Hence, the average European regional innovation system operates under decreasing returns to scale for R&D expenditures and researchers. However, if we consider the global knowledge as a third input, the total returns to scale are increasing. But we should emphasize to the importance of the global knowledge in order to achieve increasing returns which otherwise are decreasing. What is important from this finding is that if a RIS plans to spend more resources, in order to increase the output of the KPF, this policy will be efficient only in the case the RIS has, or plans, a mechanism to absorb global knowledge. If not, our findings suggest that more resources may not be as efficient as its planners believed and, therefore, the RIS may waste scarce resources that could be used more efficiently.

4.3 Hypothesis 1

Hypothesis 1 is verified by using the variable NEI as a proxy for regional technological variation, which is also positively correlated with patents. This finding implies that technological diversity facilitates the new ideas and knowledge production and increases the innovation productivity. We note that non-linearities between technological diversity and innovation performance, as Fritsch and Slavtchev (2010) found for Germany, are not supported by our data. The positive sign of the NEI coefficient suggests that the intra-regional spillovers increase with technological diversity. Our findings support the argument of the Jacob’s track of literature that technological diversity facilitates the creation of new knowledge. Furthermore, our findings suggest that in the case where the regional innovation activity is concentrated in few technological fields (and in the extreme case in one field) the region attracts new innovative agents belonging to these technological fields or to the neighboring ones. Therefore, innovative activity increases as long as the agents innovate in these few fields. On the other hand, if a region innovates in a diversity of fields, the attraction of more and diverse agents may contribute to the increase of the regional innovative activity. In the lights of these results, the policy implications are significant. Technological policy usually takes two forms. The first is concentrated in the innovation of specific strategic sectors. The second, which is more decentralized, does not focus on specific sectors. We argue that European regions should apply the second type of technological policy since it facilitates the intra regional knowledge spillovers and consequently the exploitation of the resources devoted to innovation is more efficient.

One of the basic principles of Regional Smart Specialization called specialized diversification (OECD 2013) comes to complement the upper suggestion. This substantial principle is revealed mainly by the Smart Specialization’s Strategies which involve the adoption, dissemination and adaption of General Purpose Technologies (GPTs) across a wide range of sectors. In the context of these strategies, European Commission laid emphasis on the “related variety”. This idea arose out of debates as to whether innovation is more likely to occur in regions that are diversified rather than specialized because there are more opportunities afforded, in the former, for knowledge spillovers between different sectors. However, such spillovers are only likely to occur where there is some form of “relatedness” between sectors. Related variety exists where sectors share some form of complementary competences. (Commission of Europe 2013). Additionally, according to Boschma (2014) related variety can boost diversification and true economic renewal in regions by making new recombination between industries. The importance of related variety for regional growth and regional diversification has been confirmed in a number of recent studies (e.g. Frenken et al. 2007; Boschma and Iammarino 2009; Neffke et al. 2011; Boschma et al., 2012; Rigby 2012).

Audretsch and Feldman (1996) using survey data from Levin et al. (1987) to discern the disciplines that form a common science base that contributes to the basis for cross-industry increasing returns, find that the presence of diverse industries within the same science-base in a city leads to increased innovation. Therefore, we can argue that “it is not diversification per se that is important for growth, but the patterns of specialized diversification across related technologies that are important for growth” (Mccann and Ortega-Argilés 2013, p. 7).

4.4 Hypothesis 2

Hypothesis 2a is verified by measuring the estimated coefficient of GDS which quantifies the direct impact of global knowledge weighted by geographical proximity on new knowledge production. The coefficient is positive and statistically significant in all models. Thus, when a region is spatially contiguous with strong innovative regions the spillovers due to geographical proximity are positive. Hypothesis 2b is verified by measuring the estimated coefficient of TPS which quantifies the direct impact of the global knowledge weighted by technological proximity on regional knowledge production. The coefficient is positive and statistically significant in all models and its value is relatively stable. That is, as a region is technologically contiguous to high innovative regions the cross regional spillovers have a greater influence on regional new knowledge production.

4.5 Hypothesis 3

The coefficient of the interaction term (INT 1 ) is statistically significant and negative. For the analysis below, we use the estimated coefficients of Model 5 as it is representative because incorporates both the intra and inter-regional spillovers:

$$ l\left( {Patents} \right) = C + 0.15\,l\left( {RS} \right) + 0.35\,l\left( {RD} \right) + 0.94\,NEI + 0.43\,l\left( {TPS} \right) + 0.008\,GDS - 0.00075\left[ {l\,\left( {TPS} \right) \times GDS} \right] $$

The first order partial derivatives of this function with respect to TPS and GDS are:

$$ \frac{\partial l(Patents)}{\partial l(TPS)} = 0.43 - 0.00075\,GDS $$
(5)
$$ \frac{\partial l(Patents)}{\partial GDS} = 0.008 - 0.00075\,TPS $$
(6)

According to Hirschberg and Lye (2010), the interaction term in a regression measures the marginal effect of an independent variable depending on the values of another independent variable. The partial derivatives (5) and (6) are the marginal effects of TPS and GDS. The sign of these marginal effects depends on the values of the other independent variable entering in each marginal effect equation, namely GDS for the Eq. (5) and TPS for the Eq. (6). It is evident from these two equations that the simultaneous consideration of the geographical proximity reduces the impact of the knowledge spillovers due to technological proximity and vice versa. However, it is not clear for what value of GDS (TPS) these marginal effects change sign. Therefore, it is necessary to construct the confidence intervals for these variables following the Fieller theorem as applied by Hirschberg and Lye (2010). We denote as β4 = 0.43, β5 = 0.007 and β6 = −0.00075 from Eqs. (5) and (6).

The sign of the partial derivative \( \frac{\partial l(Patents)}{\partial l(TPS)} \) changes at GDS* = −β46 and the sign of the partial derivative \( \frac{\partial l(Patents)}{\partial GDS} \) changes at TPS* = −β56. The confidence intervals for GDS* and TPS* with 100(1 − α) = 5 % are the roots of the quadratic equations \( a{\text{GDS}}^{2} + 2bGDS + c = 0 \) and \( a{\text{TPS}}^{2} + 2bTPS + c = 0 \) correspondingly. Where \( a = \beta_{6}^{2} - t_{a/2}^{2} \sigma_{6}^{2} \), \( b = - \left( {t_{a/2}^{2} \sigma_{46} - \beta_{4} \beta_{6} } \right) \) and \( c = \beta_{4}^{2} - t_{a/2}^{2} \sigma_{4}^{2} \) for the first equation and \( a = \beta_{6}^{2} - t_{a/2}^{2} \sigma_{6}^{2} \), \( b = - \left( {t_{a/2}^{2} \sigma_{56} - \beta_{5} \beta_{6} } \right) \), and \( c = \beta_{5}^{2} - t_{a/2}^{2} \sigma_{5}^{2} \) for the second equation. \( \sigma_{i}^{2} \) are the estimated variances of \( \beta_{i} \) (i = 4, 5, 6), \( \sigma_{46} \) and \( \sigma_{56} \) are the covariances between \( \beta_{4} {\text{ and }} \beta_{6} \) and \( \beta_{5} {\text{ and }} \beta_{6} \) respectively. \( t_{a/2}^{2} \) is the α % critical value of t-distribution with T − k degrees of freedom.

The two roots of the first quadratic equation are \( (GDS_{L} , GDS_{U} ) = \left( { - b \pm \sqrt {b^{2} - ac} } \right)/a \), and the two roots of the first quadratic equation are \( (TPS_{L} , TPS_{U} ) = \left( { - b \pm \sqrt {b^{2} - ac} } \right)/a \). The two roots define the confidence bounds for GDS* and TPS* respectively. Solving for the roots of the first equation we get GDSU = 1,215 and GDSL = 350 and for the second equation TPSU = 11.5 and TPSL = 10. The subscripts U and L denote the upper and lower bound of the confidence interval.

The total impact of the TPS on patents by taking into consideration the influence of the geographical proximity has been determined by the corresponding confidence interval. The confidence intervals of the two interacted variables determine the bounds in which the sign of partial derivatives \( \frac{\partial l(Patents)}{\partial l(TPS)} \) is uncertain. For values 350 < GDS < 1,215 the partial derivative is statistically zero and consequently TPS does not influence new patents production. Above and below the upper and lower bounds the signs are definite.

For values GDS < 350 the partial derivative \( \frac{\partial l(Patents)}{\partial l(TPS)} \) takes positive values but its value is smaller than 0.43, the direct impact. This means that when a region is geographically contiguous to weak innovative systems then the impact of the global knowledge spillovers weighted by technological proximity is still positive. In other words, when a region j is contiguous to a weak innovative region k, then knowledge spillovers from k to j are stronger the lower the technological distance between them. The knowledge spillovers are stronger when the two regions innovate in similar technological fields. In this case inter-regional technological specialization does matter. This result gives an explicit answer to the scenarios presented in hypothesis 3a.

The partial derivative \( \frac{\partial l(Patents)}{\partial l(TPS)} \) takes negative values for GDS > 1,215. That is when a region is geographically contiguous to regions with high innovative activity, the inter-regional spillovers weighted by technological proximity are increasing when the region receives knowledge from technologically distant regions. In other words, when a region j is contiguous to a strong innovative region k, the knowledge spillovers from k to j are stronger the higher the technological distance between them. The knowledge spillovers are stronger when the two regions innovate in different technological fields. It seems that even in cross regional knowledge spillovers inter-regional technological diversity does matter. This result gives an explicit answer to the scenarios presented in hypothesis 3b.

4.6 Hypothesis 4

The partial derivative \( \frac{\partial l(Patents)}{\partial GDS} \) has positive sign in all models, implying that when a region is spatially contiguous to strong innovative regions the global knowledge spillovers weighted by geographical proximity have a positive impact on regional innovation activity. However, the simultaneous consideration of the technological proximity reduces the impact of the GDS on patents. The partial derivative takes negative values for TPS > 11.5 and positive values for TPS < 10. But the TPS takes much larger values than 11.5 and, hence, only the first case TPS > 11.5 is valid. Thus, the sign of the partial derivative \( \frac{\partial l(Patents)}{\partial GDS} \) is clearly negative for every TPS value. The second interaction term (INT 2 ) is a statistically significant variable with a negative coefficient. We measure the first order partial derivative of the KPF with respect to the TPS using the estimated coefficients of the Model 6.

$$ l\left( {Patents} \right) = C + 0.30l\left( {RS} \right) + 0.38l\left( {RD} \right) + 12.11NEI + 0.53l\left( {TPS} \right) + 0.00047GDS - 1.25\left[ {l\left( {TPS} \right) \times NEI} \right] $$
$$ \frac{\partial l(Patents)}{\partial l(TPS)} = 0.53 - 1.25 {\text{NEI}} $$
(7)

Following again Hirschberg and Lye (2010), the marginal effect of TPS on patents depends on NEI. We construct the confidence bounds of \( \frac{\partial l(Patents)}{\partial l(TPS)} \) function. We denote as β4 = 0.53, and β6 = −1.25 from equation (7).

The sign of the partial derivative \( \frac{\partial l(Patents)}{\partial l(TPS)} \) changes at NEI* = −β46. The confidence interval for NEI* with 100(1 − α) = 5 % is determined by the roots of the quadratic equation \( a{\text{NEI}}^{2} + 2bNEI + C = 0 \) where \( a = \beta_{6}^{2} - t_{a/2}^{2} \sigma_{6}^{2} \), \( b = - \left( {t_{a/2}^{2} \sigma_{46} - \beta_{4} \beta_{6} } \right) \) and \( c = \beta_{4}^{2} - t_{a/2}^{2} \sigma_{4}^{2} \). \( \sigma_{i}^{2} \) is the variance of \( \beta_{i} \) (i = 4, 6), \( \sigma_{46} \) is the covariance between \( \beta_{4} {\text{ and }}\beta_{6} \), and \( t_{a/2}^{2} \) is the α % critical value of the t-distribution with T − k degrees of freedom.

Solving for the roots we found that \( (NEI_{L} , NEI_{U} ) = \left( { - b \pm \sqrt {b^{2} - ac} } \right)/a \) where the subscripts U and L denote the upper and lower limit of the confidence interval for NEI*. We find that NEIU = 1.1 and NEIL = 0.3. The partial derivative \( \frac{\partial l(Patents)}{\partial l(TPS)} \) has a positive sign when NEI < 0.3 which means that as NEI takes lower values region’s j diversity gets lower and then the inter regional knowledge spillovers are higher the lower the technological distance between regions. In other words, when a region j exhibits low technological diversity then the inter-regional knowledge spillovers from region k are higher when the two regions innovate in similar technological fields. For values of NEI between 0.3 and 1 the impact of the global spillovers weighted by technological proximity (TPS) is statistically zero. NEI, by its structure, cannot take values higher than 1.

4.7 Resources leveraging efficiency

The estimated constant E could be considered as an index of the RIS ability to efficiently leverage resources. E, as an index number, does not provide any information unless it varies across time or across regions. In this paper, the E varies across regions and is time invariant. Therefore, the estimated vector E of the regional idiosyncratic constants provides information about the differences between the leveraging capacity-efficiency (LCE) of the 210 EU regional innovation systems. Since we have assumed that all regions use the same knowledge production function, the E is a “shift” parameter: the KPF of the regions with higher E are located higher in the inputs space than the KPF of regions with lower E. If, for example, a region has E k = 1 and another region has E j = 2, it could be an indication that the second region can leverage the same resources twice as much as the first one.

We report some regional idiosyncratic LCE in Table 4. The first and third columns present the top thirty European regions in terms of patents production and LCE correspondingly. The second and fourth columns report the number of patents and the LCE correspondingly. The LCE index is normalized to unity using as benchmark the highest regional LCE (Noord-Brabant region).

Table 4 The most efficient regional innovation systems

From Table 4 we conclude that there are regions which, although they are very high classified in terms of patents production, do not exhibit high LCE. For instance, the Île de France region, the region with the highest production of patents, is not present in the list of the 30 regions with the highest LCE. This finding might indicate that these regional innovation systems produce high knowledge output by reserving a lot of resources without, however, being supported by the required LCE. A possible reason for this phenomenon could be policies which tended to spread “knowledge investment” (e.g. high education and vocational training, public and private R&D) too thinly, not making much of an impact in many areas. Since entrepreneurship and innovation tends to be higher in regions with large market potential, some regions may waste a lot of sources in order to cover many fields of innovative activity.

On the other hand, there are regions with high LCE but with lower knowledge output. For example, Rheinhessen-Pfalz region is one of the most efficient regions in terms of LCE despite the fact that it is not classified in the top regions with the highest patents production. Mittelfranken, Freiburg and Emilia-Romagna exhibit the same behavior. Regions like the mentioned before, probably have identified more accurately the fields in which they intend to innovate so that they do not consume unnecessary sources. The Smart Specialization concept forms one of the main pillars of the reformed EU Cohesion Policy, which highlights the need for regions to identify and select their own potential sources of innovation and economic growth. In that respect, regions should proceed to self-assessment of their knowledge assets, capabilities and competences and the knowledge agents between whom knowledge is exchanged (Mccann and Ortega-Argilés 2013, p. 10).

Table 5 illustrates the twenty regions with the lower LCE. These regions belong to the South and Eastern Europe. Because these regions have the lower LCE it seems that their RIS is characterized by low competences to coordinate and leverage resources allocated to innovation activity. However, the policy implications of our findings suggest that a policy which intents to allocate more resources to innovation activities without improving the coordination and leveraging capacity of the RIS will not be as productive as the policy makers might expect. Technological policy should aim at promoting a strong partnership between businesses, public entities and knowledge institutions.

Table 5 The less efficient regional innovation systems

5 Concluding remarks and policy implications

Aim of this paper was to investigate the determinants of the regional innovative activity. Knowledge spillovers gleaned from the innovative process in and out of a region play a major role in patents production. Intra-regional knowledge spillovers defined and influenced by the degree of diversification of the regional innovative activity. Inter-regional knowledge spillovers defined and influenced by the geographical and technological proximity between each pair of regions. We estimated a regional knowledge production function using data for 210 European regions for a 12 years period (1995–2006).

The dependent variable, the logarithm of patents, is positively correlated with the KPF inputs, RS and RD. The introduction of knowledge spillovers reduces the impact of these inputs. The influence, however, of the knowledge spillovers on the two inputs estimated coefficients is not similar. The estimated coefficient of the RD falls more than the coefficient of the RS. This difference might be attributed to the fact that global knowledge may actually be a closer substitute to researchers than to R&D expenditures. The degree of substitutability between the R&D expenditures and global knowledge is smaller since the region saves funds from knowledge already existing in the global public domain. The returns to scale with respect to RD and RS, the main inputs of the KPF, are increasing in the models without spillovers and decreasing in the models with knowledge spillovers. Hence, the average European regional innovation system operates under decreasing returns to scale for R&D expenditures and researchers. However, if we consider the global knowledge as a third input, the total returns to scale are increasing.

Technological diversity facilitates the new ideas and knowledge production and increases the innovation productivity. Our findings support the argument of the Jacob’s track of literature that technological diversity facilitates the creation of new knowledge. In the light of our findings, we argue that European regions should apply a decentralized technological policy because it facilitates the intra regional knowledge spillovers and makes the exploitation of the resources devoted to innovation more efficient. Diversity of innovative activity in every region is related positively with the output of the regional knowledge production function. This means that intra-regional knowledge spillovers are enhanced when a region innovates in different technological fields.

The estimated coefficient of TPS is positive and statistically significant. That is as a region is technologically contiguous to high innovative regions the cross regional spillovers have a greater influence on regional new knowledge production. The estimated coefficient of GDS is also positive and statistically significant. Thus, when a region is spatially contiguous with strong innovative regions the spillovers due to geographical proximity are positive. However, the simultaneous consideration of the geographical and technological proximity significantly influences the marginal effects of these independent variables on new knowledge creation. Thus, when a region is geographically contiguous to weak innovative systems then the knowledge spillovers are stronger when the two regions innovate in similar technological fields. When a region is geographically contiguous to regions with high innovative activity the knowledge spillovers are stronger when the two regions innovate in different technological fields.

The structure of the recipient’s region innovative activity determines the impact of the global knowledge weighted by technological proximity on regional innovation productivity. When the regional innovation activity is diverse then the impact of the TPS is zero or the outcome is not so clear. When the NEI is smaller than 0.3 the impact of the TPS is positive. That is when the regional innovation activity is concentrated in few technological fields the interregional spillovers increase when the region is technologically contiguous to strong innovation regions. The global knowledge is produced in regions which are technologically closed to this region.

This position gleaned from our estimations, supported also from the context of Regional Smart Specialization. One of the basic principles of Regional Smart Specialization is the specialized diversification. Since, each region is characterized by specific structure, it is able to take advantage of its domains where specific technologies are dominant and to promote diversity of activities in these existing technological fields. Therefore, we can argue that regions which apply specialized diversification across related technologies can achieve growth (Mccann and Ortega-Argilés 2013).

Besides, “self-discovery” (Hausmann and Rodrik 2003) is critical for regions’ success. The smart specialization suggests that the policy recommendations at regional level may be very different in different places, depending on the region’s technological profile, industrial structure and geography. The S3 platform aims to assist regions and the EU Member States in developing, implementing and reviewing regional smart specialization strategies, and help regions identify high-value added activities which offer the best chances of strengthening their competitiveness (Commission of Europe 2010).

In the light of our results, the “one size fits all” technological policy as applied both by member states and the EU has not proved fruitful. The European regions do not all fit into the same category, since they comprise, for example, peripheral regions, old industrial regions and fragmented metropolitan regions (Todtling and Trippl 2005). The EU should apply a “tailor made” technological policy to increase each region’s absorptive capacity. In addition, national and regional authorities should be more committed to applying a potent research and innovation policy and providing good quality research infrastructure at the regional level. According to Feldman and Florida (1994), the existence of technological infrastructure explains the regional concentration of innovative activity in the US. Among other bodies, this technological infrastructure includes research institutions (universities) which provide sources of technological opportunity, as well as networks between universities, industry and business service providers.

The joint policy action of the EU, involving both national and regional authorities, is based on the complementarity principle, and is aimed to expand the stock of knowledge of these regions in the public domain, reduce their technological gap with the strong innovative regions, and enhance their ability to assimilate more knowledge of an increasingly advanced nature.