1 Introduction

The presence of research spillovers has been recognized by inventors, firms, policymakers, and academics for some time (Besen and Raskind 1991; Lerner 2002). Some spillovers are authorized by inventors when they license their technologies, while others, such as from reverse engineering and from trade, are somewhat unavoidable and taken to be costs of doing business or of being innovative. Yet a third type is unauthorized spillovers, where competitors or counterfeiters use inventions without permission/authorization while trying to enter markets or to strengthen their competitive positions. However, little is formally known about the impact of the unauthorized spillovers and the present research attempts to make headway in this regard.

Research spillovers are indicative of lax intellectual property rights (IPR) protection and more generally of weak institutions. Firms investing in R&D face uncertainty about payoffs and this uncertainty is enhanced when spillovers are widespread. Some of this weak IPR protection might be structural, as in a lack of attention to plugging loopholes or covering both hard and soft technologies by the government, while other types of weaknesses in IPR protection might be engineered or structural—via corruption for instance. A better understanding of the effects of the impact of weak IPR protections is potentially informative to policymakers trying to design policies to bolster technological change.

An earlier informative review of the literature on the economics of research spillovers is due to Griliches (1992) and Feldman (1999) provides a nice compilation of related empirical studies. The presence of research spillovers has been documented by various scholars, both within nations and internationally (Coe and Helpman 1995; Coe et al. 2009; Goel et al. 2016).

This paper examines the impact of IPR infringement crimes on the input (R&D spending) and output (patents) of the research process. R&D and patents are sequential stages of the process of technological change, with the former being the input and the latter being the output. The extant literature proxies IPR enforcement via composite indices or measures of institutional quality, whereas this paper employs a direct (hard) measure of IPR crimes. Equally important, we are able to compare the effects of the IPR crimes with those of weakening (endogeneity) of institutions via corruption (Goel 2003).

Using data across U.S. states, results show that IPR crimes reduce research spending but do not impact patenting. Upon comparison with a broader measure of weak institutional quality (proxied by corruption), we find that greater corruption has a robust negative effect on patenting. In terms of magnitudes, the elasticities of R&D spending with respect to IPR crimes are greater than those of patents with respect to corruption. Policy implications of these results are discussed.

The organization of the rest of the paper includes theoretical background and the model in the next section, followed by data and estimation, results and conclusions.

2 Theoretical background and model

The extant literature on research spillovers is largely theoretical, in part due to the difficulties with quantifying research spillovers (see Griliches 1992). Even the theoretical studies on the research spillovers are often vague, taking spillovers to be some fraction of research rewards, without allowing for spillovers to be different across recipients or that spillovers may be uncertain (d’Aspremont and Jacquemin 1988).

Some spillovers of research knowledge are unavoidable when they occur through trade, FDI and reverse engineering (Krammer 2014, 2015). When firms trade with others or license technologies, there is some learning involved, which unravels some of the mysteries of the underlying innovation. Such spillovers are almost impossible to guard against.

Weak intellectual property rights adversely affect the incentives to innovate (see Denicolò and Franzoni 2012 for some related theoretical arguments; Cohen et al. 2002 provide a comparison of Japan and the United States; also see Goel 1999; Mohnen 2009). Weak intellectual property rights might take the form of cumbersome or vague patent filing procedures, lax or delayed enforcement, or a narrow scope of such rights—e.g., when soft technologies are not covered.

As mentioned, some of the lax IPR protection might be structural—where government’s overall institutional setup is weak. In such cases, even when IP-protecting statutes look good on paper, a lack of effective complementary institutions renders overall IPR protection ineffective.

Yet another impact on institutions might be more deliberate where through corruption and bribery, market participants can undermine institutions.Footnote 1 In the case of technological change, this undermining might take several forms, including bribing to have lower punishment or escape detection for IPR infringement, firms might be offering bribes/campaign contributions to have the scope of their inventions widely defined, or offering bribes to have international trade barriers erected, etc. Goel (2003) formally models some of these aspects in a game-theoretical setup. His results show that greater rent-seeking by rival firms lowers a firm’s profit-maximizing research and rent-seeking activity.Footnote 2

Whereas, there is recognition that research spillovers exist (both formal and informal), evidence and formal research on their effects is scarce and it is in this respect that the present study attempts to make a contribution.

Based on the above discussion, we formulate the following hypotheses that we shall test empirically:

H1

Greater IPR crimes would, ceteris paribus, reduce R&D spending.

H2

Greater IPR crimes would, ceteris paribus, reduce patenting.

H3

Greater corruption would, ceteris paribus, reduce R&D spending.

H4

Greater corruption would, ceteris paribus, reduce patenting.

To test the above hypotheses, we formulate a set of equations that we will estimate with data from U.S. states.

With the dependent variables, PATENT and R&D spending, capturing sequential stages of the process of technological change, our estimated equations take the following general form (with subscript i denoting a state).

$$ {{\text{R}}\&}{\text{D}}\;{\text{output}}\;\left( {{\text{PATENT}}_{{\text{i}}} } \right){\text{ }} = {\text{ }}f\left( {{\text{IPRcrime}}_{{\text{i}}} ,{\text{ CORRUPT}}_{{\text{i}}} ,{\text{Institutions}}_{{\text{k}}} ,{\text{ Z}}_{{{\text{im}}}} } \right) $$
(1)

and

$$ {{\text{R}}\&}{\text{D}}\;{\text{input}}\;\left( {{{\text{IndustryR}}\&} {\text{D}}_{{\text{i}}} ;{{\text{TotalR}}\&} {\text{D}}_{{\text{i}}} } \right) = g\left( {{\text{IPRcrime}}_{{\text{i}}} ,{\text{ CORRUPT}}_{{\text{i}}} ,{\text{ Institutions}}_{{{\text{ik}}}} ,{\text{Z}}_{{{\text{im}}}} } \right) $$
(2)
$$ {\text{i}} =1, 2, \ldots, 51 $$
$$ {\text{k}} = {\text{EconFREE}},{\text{ Regulation}} $$
$$ {\text{m}} = {\text{RGDP}},{\text {POP}}, {\text{URBAN}}, {\text{ EDU}},{\text {UNEMP}} $$

In our sample, the average number of patents granted per capita across U.S. states was 0.0003 (average of 2014–2015), the average industry R&D per 1000 dollars of state GDP $13.92, and the average total R&D (industry plus academic R&D) per 1000 dollars of state GDP was $17.67. There was, however, considerable variation in research spending across individual states with TotalR&D being the highest in Massachusetts ($51.61 per $1000 of state GDP) and the lowest in Wyoming ($2.85 per $1000 of state GDP). The variation in patents granted was even more pronounced, with a high of 40,661 (California in 2014) to a low of 40 (Alaska in 2015).

The main control variable of interest, and one that is unique to this study, is related to IPR/copyright and counterfeit crimes in a state. This is a direct measure that is pertinent to research investments and patenting (as opposed to composite indices of patent protection (Park 2008) that have been used in the literature). In our sample, IPR crimes ranged from 1 to 277.

As a broader measure of weak institutional quality and potential spillover leakages, we include corruption prosecutions in a state (CORRUPT). Other things being the same, states with greater corruption would signify greater uncertainty about appropriability of research benefits—both in the research input and research output stages (Goel 2003).Footnote 3 In a broader context, the consideration of corruption can be seen as adding to the literature on the effects of corruption (see Dimant and Tosato 2018 for a recent literature review), or corruption can be indicative of weak governance institutions (Krammer 2015).

Fewer regulations and tax complexities are likely to make research and patenting more desirable by lowering the transactions costs, although some regulations might be tied to IPR safeguards. We include a broader measure of economic freedom (EconFREE) and a narrower measure of regulation (Regulation) to see their effects on research spending and patenting.

The set of control variables that we employ are denoted by the vector Z and these include economic prosperity (RGDP), state size (POP), urbanization (URBAN), literacy (EDU) and unemployment (UNEMP). Greater economic prosperity can be tied to greater opportunity and better institutions in wealthier states, whereas larger states would have larger potential markets for invented products (although at the same time, there would be greater challenges to guard proprietary information in larger states). The degree of urbanization is related to networking opportunities and information flows, and greater literacy would empower both research spending and patenting. Finally, greater unemployment provides a ready availability of labor input, although the purchasing power of the unemployed is limited.

Within this overall setup, the satisfaction of Hypotheses 14, would imply negative and statistically significant coefficients on IPRcrime and CORRUPT in Eqs. (1) and (2), respectively.

We turn next to a discussion of the data employed and the estimation procedure used to test the above equations.

3 Data and estimation

3.1 Data

The main and unique variable in this study is the number of IPR and copyright violations per state in the United States. However, the availability of this variable is limited to a single year (2015—see www.ic3.gov) and this limits our analysis to a cross-section. This limitation also constrains our ability to consider relevant aspects like lags in research and patenting (Goel and Saunoris 2016). Yet, as a first formal attempt to discern the impacts of IPR crimes, the present exercise seems worthwhile.

Given that there could be lumpiness in patent grants and in corruption prosecutions (i.e., some years with unusually high or low occurrences), we take the average of 2 years (2014 and 2015) to smooth out such effects. Total R&D is the sum of academic and industrial R&D at the state level.

The data are from reliable sources that are routinely used in other contexts. Details about the variables used, including definitions, summary statistics and data sources are provided in Table 1. Table 2 reports pairwise correlations among the key variables. As expected, both R&D spending measures, IndustryR&D and TotalR&D, have a high correlation with PATENT. Interestingly, the correlation between IPRcrime and CORRUPT is a modest 0.34 which suggests that while the two measures are positively related, they are picking up somewhat different aspects and that our use of a direct measure in the form of IPRcrime will yield some new insights. Finally, all the correlations of IPRcrime are statistically insignificant and the formal econometric analysis will reveal if some of these relations turn out to be statistically significant when other relevant factors are accounted for.

Table 1 Variable definition, summary statistics and data sources
Table 2 Correlation matrix of key variables

3.2 Estimation

Given the cross-sectional nature of the data, we are somewhat limited in the estimation techniques that can be employed. Tables 3 and 4 report OLS estimation results, with t-statistics based on robust standard errors. To address issues with possible outliers and to as a robustness check, Table 5 reports results of a robust regression. All estimations were conducted using the STATA software.

Table 3 Effects of IPR infringement: baseline models
Table 4 Effects of IPR infringement: additional considerations
Table 5 Effects of IPR infringement: robust regression

4 Results

4.1 Baseline models

Our baseline results are reported in Table 3. The R2 is at least 0.36 and the RESET test shows an absence of significant specification issues in all cases.

With regard to the impact of IPR crimes, we find that greater IPR crimes reduce research spending, but do not have a significant effect on patenting. The effect on research spending holds for both industrial R&D and total R&D (which also includes academic research spending). The insignificant effect on patents can be seen in light of the fact that firms at the patenting stage are farther along the research process (with many fixed and sunk costs) and therefore greater appropriation uncertainty, as signified by greater IPR crimes, has relatively less impact on patents (compared to R&D spending when firms may have greater flexibility in changing research course or abandoning projects altogether).

With respect to the effect of corruption—that captures weak institutions (both research-related and others like tax and licensing), greater corruption lowers both research spending and patenting. This is consistent with corruption having short-term as well as long-term effects. Viewed differently, IPR crimes capture specific issues with appropriability, and corruption address more general related issues. In the following sections, we test the validity of these findings by using an alternate set of determinants and an alternate estimation technique.

As expected, greater literacy (EDU) boosts both patenting and R&D—the resulting coefficient is statistically significant in all instances. Greater literacy thus mitigates the negative effects of IPR infringement and weakness in institutions (as captured by corruption). Greater urbanization, on the other hand, positively affects research spending but not patenting. This finding can be seen as being consistent with urbanization being associated with informal networking and information flows that might assist research spending decisions; whereas, with patenting, inventors are seeking exclusivity and then the insignificant influence of urbanization seems plausible.

The effects of economic prosperity, state size (as denoted by population),Footnote 4 and economic freedom are statistically insignificant in all the models in Table 3.

4.2 Robustness check 1: additional considerations

In Table 4, we consider a set of additional control variables to see whether they affect patenting and R&D spending. Specifically, the additional variables we consider are the unemployment rate (UNEMP) and Regulation, which is a more specific index of the economic freedom index (EconFREE) employed earlier.

Since the findings for IndustryR&D and TotalR&D were quite similar in Table 3, we drop IndustryR&D as a dependent variable here (and also in Table 5). Furthermore, since education and unemployment would be related, EDU is dropped as a regressor.

The R2 is at least 0.32 in all the models estimated and the RESET test, with the exception of Model 4.3, signifies an absence of major specification issues.

The coefficients on both Regulation and UNEMP are all negative but statistically insignificant. More importantly, however, the pattern for the main variables of interest—IPRcrime and CORRUPT—is similar to Table 3. In other words, greater IPR crimes only negatively impact total R&D spending, while greater corruption negatively affects both patenting and R&D spending. We turn next to an alternate estimation technique that addresses robustness issues related to estimation.

4.3 Robustness check 2: alternate estimation

Table 5 replicates Table 3 with robust regression estimation. This technique allows us to account for disproportionate influences of possible outliers. The F-value is statistically significant in all the models.

We find a sharper distinction with robust regression regarding the effects of IPR crimes and corruption—the former has a negative effect on research spending, while the latter negatively affects patenting. IPR crimes can be seen as increasing ex-ante uncertainty about the research, while greater corruption can be seen as adding to ex-post uncertainty.

The pattern of other findings is largely similar qualitatively to what was reported in Table 3. In particular, the effects of population and economic prosperity (RGDP) are again statistically insignificant.

Greater literacy positively impacts patenting, but has no effect on research spending. This can be seen as literacy aiding the transaction or filing costs associated with patenting. Urbanization, contrary to Table 3, has greater statistical support for patenting than for R&D spending. This suggests that perhaps outliers were affecting the findings with respect to the impact of urbanization in Table 3. Interestingly, greater economic freedom has a negative and significant impact on research spending in Model 5.2—perhaps signifying that greater economic freedom involves less government vigilance against IPR abuse.

4.4 Relative magnitudes of effects

In Table 6 we summarize the magnitude of effects by reporting elasticities evaluated at respective sample means.

Table 6 Summary of elasticities

From Table 3 (with OLS regression), the elasticity of research spending with respect to IPR crimes is significantly greater than that of patents with respect to corruption (− 0.5 versus − 0.3, respectively). IPR crimes have a more pronounced (negative) effect on research spending than greater corruption does on patenting. This pattern holds true in Table 5 (with robust regression) where the elasticities are − 0.4 and − 0.2, respectively. Thus, a ten percent increase in IPR crimes would lower research spending by four percent, while a similar increase in corruption would lower patenting by 2% (Table 6). Given that research spending and patenting are sequential stages, a lowering of research spending via greater IPR crimes would in due course lower patenting also.

4.5 Additional consideration: accounting for the sequential nature of R&D and patents using mediation analysis

Since R&D and patents are sequential in nature, with R&D preceding patents, it would be instructive to study the extent of the influence of IPRcrime and CORRUPT on patents that passes through R&D. For this purpose, we employ mediation analysis and the results are reported in Table 7.Footnote 5

Table 7 Mediation analysis of the effects of IPRcrime and CORRUPT on PATENT

The estimation setup of the direct and indirect effects captured through mediation analysis, following the framework and notation of Eqs. (1) and (2), can be outlined as

$$ {\text{PATENT}}_{\text{i}} = \,\upalpha_{0} + \,\upalpha1\left[ {{\text{IPRcrime}}_{\text{i}} \;{\text{or}}\;{\text{CORRUPT}}_{\text{i}} } \right] \, + \,\upalpha_{2} {{\text{TotalR}}\&} {\text{D}}_{\text{i}} + \,\upalpha_{3} {\text{Institutions}}_{\text{ik}} + \,\upalpha_{4} {\text{Z}}_{\text{im}} + \,\upvarepsilon_{\text{a }} $$
(3a)

and

$$ {{\text{TotalR}}\&} \text{{D}}_{\text{i}} =\upbeta_{0} +\upbeta_{1} \left[ {{\text{IPRcrime}}_{\text{i}} \;{\text{or}}\;{\text{CORRUPT}}_{\text{i}} } \right] \, +\upbeta_{1} {\text{Institutions}}_{\text{ik}} + \,\upbeta_{2} \text{Z}_{\text{im}} + \,\upvarepsilon_{\text{b}} $$
(3b)

where εa and εb are error terms.

The total effect of IPRcrime and CORRUPT, respectively, on PATENT is the sum of the direct effect (via influence on PATENT in (3a)) and the indirect effect (via influence on TotalR&D in (3b)).Footnote 6

Results from the mediation analysis, with TotalR&D as the mediator variable, show that both the indirect effects of the mediator variable on patenting are negative and significant—the impacts of IPR crimes and corruption passing through total R&D are negative on patents. However, only the direct effect of corruption on patents is negative and significant (Model 7B). Consequently, the total effect of corruption on patents is negative and significant, while that of IPR crimes, albeit negative, is statistically insignificant.Footnote 7,Footnote 8 These findings support what was reported in Table 5 with robust regression and provide insights into the mechanics of the effects.

Overall, we find support for Hypotheses H1 and H4, but not robust support for H2 and H3. The concluding section follows.

5 Concluding remarks

Adding formal insights into the effects of IPR infringement (or more broadly unauthorized transfer of technologies), this paper examines the impact of IPR infringement crimes on the input (R&D spending) and output (patents) of the research process. The extant empirical literature trails theoretical contributions and proxies IPR enforcement via composite indices or measures of institutional quality, whereas this paper employs a direct (hard) measure of IPR crimes.

Using data across U.S. states, results show that IPR crimes reduce research spending but do not impact patenting. Upon comparison with a broader measure of weak institutional quality (proxied by corruption), we find that greater corruption has robust negative effect on patenting (but not necessarily on research spending). Quantitatively, the elasticities of R&D spending with respect to IPR crimes are greater than those of patents with respect to corruption. Given that research spending and patenting are sequential stages, a lowering of research spending via greater IPR crimes would in due course lower patenting also. The mediation analysis conducted provides insights into the channels of influence and supports earlier results. Specifically, with TotalR&D as the mediator variable, the total effect of corruption on patents is negative and significant, while that of IPR crimes is statistically insignificant (Table 7).

Insights into the drivers of R&D spending and patenting are of obvious importance for policymakers looking to boost economic growth via research. The results suggest that studies that proxy IPR crimes via other measures are likely underestimating their (negative) impacts on technological change. Underestimation with proxies has implications for R&D subsidies and for IPR protection policies. Another insight is that urbanization and literacy are able to somewhat counter the negative influences of IPR infringement and weak institutions on R&D spending and patenting. With appropriate data in due course, related time series dimensions may be considered.