1 Introduction

Generation of new technological knowledge lies at the heart of economic growth (Romer 1986; Lucas 1988). Technological inventions have become the most important source of growth, replacing land, energy, and raw materials.Footnote 1 Producing new knowledge, however, is not cheap and highly concentrated (Audretsch and Feldman 1996). One way to avoid replicating others’ ideas and improve the allocation of technology in an economy is by generating tradable intellectual property rights, for example, patents. Specifically, the market of patents provides incentives to develop innovations (Spulber 2008)—the inventor can simply sell the patent to a specialized producer and shifts his efforts to a new invention—and reinforces the value of current technology. For example, companies, such as Intel, spend considerable resources explicitly encouraging the external development of complementary technology (Gawer and Cusumano 2002). In addition to their contribution in increasing the rate of innovation, patent transactions improve the geographic distribution of technology. As knowledge production is concentrated in space, the market of patents facilitates the stretch of a patented idea in space, as potential distant buyers may, too, purchase the innovation.

Moreover, patent transactions act as a channel of acquiring technological knowledge. The idea that patents promote diffusion and creation of technology is not new and dates back to Arrow (1962). Transactions involving technology packages (patents, patent licensing, and other intellectual property and know-how) can entail knowledge exchange between transacted agents. In particular, buying a patent, is not just buying a legal piece of paper. Businesses buy patents to use the technology covered by the patent, which could be vital for their own line of knowledge or goods production. The willingness-to-pay of potential buyer depends on the technological knowledge contained in the patent (Anton and Yao 1994). In using a patent, sometimes the buyer could develop connections with the seller in order to implement the patented technology. Such contacts could also involve transferring the necessary ‘how-to’ knowledge to the firm, which buys the patent. Consequently, technological learning could be inferred from patent transactions.Footnote 2

An important question, in this respect, is whether geographic factors shape the trade of patents and, therefore, affect the allocation of technology across regions and accompanied technological flows. In a globalized word, ideas generated in one place can travel to another very distant one, at an unprecedented rate, via the Internet, phones and other means of communications and boundaries may play little role. Counterarguments, however, suggest that geographic nearness still plays an important role and strongly influences the diffusion of knowledge. Distance may still matter if face-to-face interactions are important, even sometimes in high-tech sectors as knowledge is tacit and hard to codify. Further, more globalization relates to more industry specialization and, therefore, less to learn from one another.

This paper aims to examine whether geographic nearness, state border and distance, shape the trade of patented ideas and, consequently, knowledge flows exemplified by this formal market-based channel for the US over the period 1995–2010. Using detailed and recently developed data on patent reassignments of US-based entities, we explore whether the trade of ‘weightless’ ideas is confined by geographic factors even within the same country and whether the importance of geography has changed over time due to technological developments.

In general, the literature has documented the localization of market transactions and potential knowledge diffusion generated. In particular, the trade-growth literature infers learning by analyzing trade flows (Coe and Helpman 1995; Keller 2002) between industries and countries. According to this literature, importing a foreign intermediate good allows a recipient country to learn from the R&D-, or ‘technology’-content embodied in the traded good and merchandise trade acts as an important conduit of market-based knowledge flows across industries and countries. A separate strand of literature, documents evidence of knowledge flows via the mobility of highly skilled personnel, the inventors of patents (Marx et al. 2009). Embodied knowledge in inventors can be spread out via face-to-face interaction between talented individuals (Arrow 1962). These two channels of knowledge flows generated either via the trade of goods and mobility of innovators involve movements of goods and people, respectively. Consequently, geography could still pertain a nuance role and this is indeed what both strands document. Despite the significant decrease in transportation costs and development of communication technology, distance has not decayed in importance (Disdier and Head 2008).

However, knowledge flows can be also generated via non-market mechanisms. Undoubtedly, one of the most important contributions in the past twenty years in measuring knowledge spillovers is the work of Jaffe et al. (1993), which introduces patent citations as a metric for knowledge spillovers. Since then, studies have shown that geographic proximity significantly affects both intra-national and international knowledge spillovers (Thompson 2006; Aldieri 2011) and such knowledge flows increase innovation output of nearby regions (Peri 2005). With respect to the significance of geography over time in knowledge diffusion, evidence from the patent-citation literature is rather mixed.Footnote 3 For instance, Thompson (2006) finds that intra-national knowledge flows in the US have become less localized, while international knowledge flows have remained constant. The studies of Griffith et al. (2011) and Aldieri (2011), on the other hand, provide support that there is a fall in the national home bias for patent citations; nonetheless, the overall home bias effect is still significant in comparison with the decrease observed in recent years.

Despite the importance of patents in economic growth and welfare, there is scant evidence so far on the impact of geography on patent trades and potential formal knowledge flows generated, primarily due to lack of comprehensive data. To our knowledge, there are two studies that undertake such investigation. A recently published study by Burhop and Wolf (2013) documents historical evidence on geographic effects of patent transactions to also infer knowledge diffusion across German regions during 1884–1913. An investigation of patent licenses, as opposed to patent transfers, as a potential channel of knowledge flows, and whether geography shapes the latter, is performed by Mowery and Ziedonis (2001). The authors find that formal knowledge flows, operating by academic licensing, are more bounded by geographic distance compared to informal flows exemplified by patent citations. However, as the authors state, their sample is small, consisting only of four US universities, and focuses only on academic patents.

This paper contributes to the aforementioned emerging literature by adding novel insights on how geography shapes the trade of patents and formal knowledge flows generated by this type of trade. The empirical investigation of the reach of traded patented ideas, both at aggregate and sectoral level, in the US—one of the most innovative countries in the world—consists the first attempt in the literature. To get a sense of our estimates, we also examine the geographic reach of patent citation flows, and contrast the findings.

At the outset of our paper we would like to note that patent trade flows, as a measure of knowledge flows, is susceptible to criticisms also applied to other measures of knowledge flows namely patent citations, trade of goods, and inventors’ mobility as to whether they capture ‘actual’ knowledge flows or noise, too.Footnote 4 Firms can also buy patents for defensive reasons—to help defend the patents the company already owns by acquiring similar technology. Consequently, patent trades could proxy more things than knowledge flows between buyer and seller. To capture actual knowledge transmitted between involved agents, one could, for instance, employ the citations a patent receives from the perspective buyer before the transaction, i.e., firms that acquire a patent could also previously cite the patent. The size and nature (e.g., start-up firms, established) of a firm could also be relevant, as small-sized firms tend to buy patents for technology purposes, while large ones for strategic reasons. Unfolding, however, the reason behind the patent transaction, i.e., whether it is pure strategy or knowledge acquisition, has not been easy thus far, due to data unavailability and still unmatched databases. The use of patent licenses, instead of patent trades, could provide an alternative more appropriate proxy of knowledge exchange due to the frequency of contacts between the patent licensor and licensee. Patent licensing data are, however, proprietary and constructed via surveys for a limited number of firms or US universities (Mowery and Ziedonis 2001). Finally, patent trades, as a metric of knowledge flows, capture a sub-set of knowledge diffusion—the patented knowledge—ignoring, for instance, knowledge, which is not patentable.

Our empirical analysis is developed around two key questions: First, to what extent do we observe similar localization in patent transfers as in the well-documented case of patent citations? Second, does the significance of geographic factors for patent transactions change over time?

Our empirical analysis proceeds as follows: We estimate a gravity-like equation in order to assess the role of nearness on patent transactions. To elaborate more on our findings we provide evidence from state level and sector level analysis, performing a number of robustness tests. To examine whether border and distance have declined or still persisting over time, the econometric specifications account for an early time period (1995–2002) and a recent time period (2003–2010) and examine the differences. Lastly, to get a sense of the size of our patent trade estimates, we also examine time and geographic effects on citation flows and contrast the evidence.

Our results clearly support strong localization of patent transaction flows as states tend to involve more in exchanging patents and citations within their borders than with other states or countries, but there is no nuance evidence, however, for higher significance of nearness over time. Citation flows also appear to be less geographically confined than patent transaction flows and eventually knowledge flows based on citations exchange are more far stretched in space than their counterparts based on patent trade. This is quite apparent, especially, when it comes to very large distances in the US, for instance, exchange of patents or citations between East and West Coast. While distance is a strong hurdle for both flows, its effect is more dramatic on patent transactions. Sector level analysis of patent transaction and citation flows confirms state level findings. Intra-sectoral flows are also found to be very localized with Mechanical sector to exhibit the most geographically confined knowledge flows, while flows in information technology sectors, i.e., Electronics and Computers, are the most far reached compared to the knowledge flows from the rest of the sectors, both in the US and abroad. We also explore the role of agglomeration in shaping the geographic localization of patent trade. Our resultss do not support a strong evidence between agglomeration and trade of ideas across space and time.

The implications of our findings for the growth literature are potentially relevant. Although theoretical studies (Rivera-Batiz and Romer 1991) emphasize the important consequences of disembodied knowledge flows over knowledge embodied, there has been little effort, on the empirical side, to explore all mechanisms of disembodied knowledge diffusion. Along with other important studies, this paper makes an effort towards analyzing disembodied knowledge diffusion via the channel of patent trades, complementing, thus, important contributions in the field of disembodied knowledge diffusion.

The remainder of the paper is laid out as follows. Section 2 introduces the framework of our analysis and discusses the data. Section 3 presents the results. Section 4 summarizes our findings and concludes.

2 Methodology

2.1 Model specification

We use a simple gravity-like equation, proposed and used in the trade literature (Krugman 1991) and economics of innovation and spillover literature (Peri 2005), to model bilateral patent transaction flows as a function of geographic and time factors:

$$\begin{aligned}\tau _{ijt} =& \beta _{0} + \beta _{1} State\, Border_{ij} + \beta _{2} Nearby\,States\,[500\, {\text{miles}}]_{ij} \\ & +\beta _{3}Distance\,[500{-}1{,}500\,{\text{miles}}]_{ij} + \beta _{4}Distance\,[>\!\!1{,}500\,{\text{miles}}]_{ij} \\ & +\beta _{5} Out\,Of\,Country_{ir} +\beta _{6} Dummy_{ijt} + \beta _{7} Z_{ijt} + \epsilon _{ij} \end{aligned}$$
(1)

where \(\tau _{ijt}\) is patent transaction flows between two states \(i\) (destination) and state \(j\) (origin); \(State\, Border\) takes the value of 1 for flows exchanged between states \(i\) and \(j\) that share a common border and 0 otherwise; \(Nearby\,States\,[500\, {\text{miles}}]\) takes the value of 1 for flows exchanged between states that do not share a common border and their geographical centers are located within a distance of 500 miles, and 0 otherwise; \(Distance\) [500–1,500 miles] is a distance class of 500–1,500 miles and takes the value of 1 for flows exchanged between states \(i\) and \(j\) that are located within 500–1,500 miles, and 0 otherwise; \(Distance\,[>\!\!1{,}500\,\hbox {miles}]\) is a distance class of larger than 1,500 miles and takes the value of 1 for flows exchanged between states \(i\) and \(j\) that are located in a distance larger than 1,500 miles and 0 otherwise; \(Out\,Of\,Country\) is a dummy that takes the value of 1 if states involve in international patent trade and 0 if they do not; \(Dummy\) is a time dummy variable which takes the value of 0 if a patent is traded in the period 1995–2002 and 1 if it is traded in the period 2003–2010; the vector \(Z\) contains time and distance interaction terms, and \(\epsilon \) is an iid error term.

The coefficients \(\beta _{1}\) till \(\beta _{5}\) provide a characterization of how pure geographic factors shape the flows of patent trade. By model construction, each geographic coefficient captures the difference between knowledge flows diffused in geographic space to knowledge flows within a state. Consequently, the coefficient of the first dummy, \(State\,Border\), captures how much patent transaction and possible knowledge exchange takes places between states that share a common state border compared to in-state level of exchange. Irrespective of the border effect, the second dummy, \(Nearby\,States\,[500\,\hbox{miles}]\), captures the differences in patent trade exchanged between states located in a distance of 500 miles compared to in-state transactions. The coefficients of the (rest of the) distance dummies, examine whether states that have been located in various distance classes exhibit different patent trade interactions in comparison to in-state level of interactions. We opted for this distance taxonomy because the longest distance in miles between two neighboring states is approximately 500 (517,705 miles to be precise), which is the distance between the centers of Colorado and Oklahoma. Then, we proceed with batches of 1,000 miles till the distance between East and West Coast is exhausted. In addition, the coefficient of \(Out\,Of\,Country.\) allows to explore international aspects of patent trade transactions. It captures the difference in patent trade of state \(i\) that has with the rest of the world (denoted as ‘region’ \(r\)) compared to in-state transactions. One would expect that larger geographic distance would reduce exchange of patents and consequently knowledge flows across the states due to the presence of spatial transaction and other information costs, signaling that knowledge flows are bounded in space and characterized by spatial declining effect. Strong localization effects occur when almost all patent trade activity takes place in-state rather than across states (or abroad).Footnote 5

Further, the time dummy, \(Dummy\) captures whether more recent trade is more (or less) localized than earlier one. If the role of nearness declines over time, then one would expect for \(\beta _{6}\) to be negative and statistically significant. Lastly, to explore time patterns of nearness and international patent transactions, we also include time-distance interaction terms, in the vector \(Z\). If their coefficients are negative and statistically significant, then this would be an indication of possibly higher significance of distance over time, i.e., strong “still persisting” distance effect.

2.2 Estimation technique

Patents and citations are ’count data’—and some states record zero information on either patents or citations. To handle this type of data, we use a generalization of the Poisson model, known as negative binomial estimator, which is frequently used in similar to ours contexts (Peri 2005; Perkins and Neumayer 2011; Furman and Stern 2011) and does not rely on the restrictive assumption that the conditional means to be equal the conditional variances, as Poisson does.Footnote 6 Another advantage of using negative binomial its ability to accommodate ’zero’ outcomes of the dependent variable.

To assess the geographic and time effects on patent transaction flows across states of the US, we estimate Eq. (1) for different sub-samples and alternative definitions of patent transaction flows, i.e., \(\tau _{ijt}\) in Eq. (1) could be either all flows, or first flow or last flow of patent transactions or even citation flows in order to compare our evidence of patent transactions with that of patent-citation literature.

2.3 Data description and analysis

Our empirical analysis is based on 50 states of the US for the period 1995–2010.

The primary source of patent data is a recently compiled dataset by the office of the chief economist of the United States Patent and Trademark Office (USPTO), kindly provided to us, referred as Patent Assignment Dataset, which contains assignments (transactions) of US issued patents between entities registered at the USPTO.Footnote 7 A typical assignment is characterized by a unique identifier (i.e., reel frame), the names of the buyer (i.e., assignee), the seller (i.e., assignor), the date that the transaction agreement was signed (execution date), and the patent numbers or patent applications that are traded per assignment.Footnote 8 In constructing our patent dataset, we faced two main challenges when employing assignment data. The first relates to the fact that entities are not required to disclose transactions to the USPTO. However, for legal and perhaps accounting reasons, they have incentives to do so.Footnote 9 A challenge associated with using assignment data is that it is still likely that a number of transactions have not been disclosed to the USPTO due to negligence or to strategic behavior. In any case, we do not expect this to be systematic for aggregated transactions across geographic areas. An additional challenge is associated with excluding ‘routine’ transactions. In the US, only an individual can file for a patent application. Subsequently, this individual may re-assign the patent application (or patent) to her firm or institution where she is employed. These transactions are also included to the dataset. Thus, the challenge here is to isolate the economically meaningful re-assignments and discard otherwise. Taking these two challenges into consideration, we end up having 96,478 patents issued between 1995 and 2010 that have been traded from US located entities and are associated with 102,876 transactions for which we have location information for both the assignor and assignee.

There is a many-to-many relationship between patents and transactions. That is, transactions may contain more than one patents and a patent may be transacted more than once. To construct the flows of patents, we aggregate the number of patents that have been traded from entities located in origin state to entities in destination state for any year. For patents, which are traded more than once, each transaction is registered as a new transaction and, therefore, counted accordingly.Therefore for patents traded more than once, and for robustness purposes, we construct two alternative measures of patent flows. The first is called, ’first flow’ and considers, for each patent, only its first assignment, ignoring the rest of its transactions. The second measure is called ’last flow’ and for each patent excludes all the intermediate transactions and records only the assignment between the first and last entity. For example, for a certain patent which is sold from California to New York state and then from New York state to Texas, the measure ’all flows’ registers both transactions, while the other two measures register only one transaction: ’first flow’ registers the transaction between California to New York state, and ’last flow’ registers the one between California to Texas.

Citations of traded patents data originate from the work of Lai et al. (2011), which is publicly available. The database contains citations of all US issued patents up until 2010. We construct bilateral citation flows between states for the period 1995–2010 by considering citations made from 1995 to 2010 to patents issued between 1995–2010. To construct bilateral citation flows across states, we consider the location of the lead inventor (written on the patent document wrapper) for all (citing and cited) patents.

Finally, the geographic distance (in miles) between two states is the distance between each state’s geographic center calculated as the crow flies. This information is obtained from Google Maps.Footnote 10 Information on a state’s R&D expenditure is extracted from the National Science Foundation Science and Engineering State Profiles.

Table 1 below provides the descriptive statistics of the variables included in our model.

Table 1 Summary statistics

According to Table 1, for the period under investigation, states, on average, trade about 3–4 patents per year and exchange about 47 citations per year. On average, each state pair is 8 % likely to be neighboring with each other. Furthermore, 11 % of all possible pairs of states are closer than 500 miles and do not share common state border, 50 % are located in a distance of 500–1,500 miles, and 26 % in a distance of larger than 1,500 miles.

Table 8 in the Appendix reports summary statistics per state. States like California (CA), New York (NY), Illinois (IL), Massachusetts (MA), Michigan (MI), and Texas (TX) are top patent producers, traders and citers of patents. These states also enjoy high R&D spending. In the opposite side of the spectrum are the states of Alaska (AK), Hawaii (HI), the Dakotas (SD and ND) and Wyoming (WY). Overall, summary statistics per state reveal a large variety of patterns.

Below, Fig. 1 shows the production of innovation in the US. As it is apparent, intense innovation activity is concentrated in few states in the US. More than 60 % of production of patents takes place in five states, California (CA) and New York (NY), which stand out among the top producers, followed by Texas (TX), Illinois (IL), and New Jersey (NJ). The least involved states in producing innovation are Alaska (AK), Hawaii (HI), North Dakota (ND), South Dakota (SD), and Wyoming (WY). Moreover, states which are patent production leaders are also top performers in patent citations and patent trades.

Fig. 1
figure 1

Patent production per state

Figure 2 shows the distribution of patent transactions across states in the US.

Fig. 2
figure 2

Patents traded per state

As one observes, states that produce large volumes of patents (see Fig. 1, above) are the ones greatly involved in patent transactions.

Lastly, Fig. 3 visualizes our data and depicts the decay of patent transaction flows moving out of a state, out of the nearby neighbor, and out by steps of 1,000 miles. Bold line represents early period flows, 1995–2002 and light, dashed line late period flows, 2003–2010.

Fig. 3
figure 3

Decay of patent transaction flows due to geographical barriers

The evidence from the figure above confirms the significance of nearness in shaping patent transaction flows. The decay effect is apparent: state border and distance restricts the trade of patents. There is a sharp decrease of flows and only a small share crosses 500 miles. As the distance becomes large enough—more than 1,500 miles—they pick up again due to the intensity of ideas exchange between East and West (California). Finally, flows for distance between 500 and 1,500 miles show a pick, which is probably due to flow exchange of a state with with Illinois, Texas, and Michigan. Prior to 2002, flows (bold line) are smaller in volume compared to ex post 2002 flows (dashed line). Neighboring patent trade is roughly similar between the two time periods, implying that in-state patent trade has increased in recent years.

3 Empirical results

This section presents our results. We first examine whether geography shapes patent transactions across the states of US. Then, we proceed by providing more disaggregated evidence from six technological sectors. We further relate to past studies by bringing evidence on the effect of geographic nearness on citation flows, which is the most investigated type of knowledge flow in the literature. Finally, we derive implications on a possible relationship between agglomeration and geographic localization of trading patents across the US.

3.1 How important is geography for patent transactions across states?

We investigate patent transaction flows originating from an average state and then flows originating from the most and less innovating states of the US. Accordingly, we present our findings.

3.1.1 Patent transactions of average state

Table 2 depicts the results. All columns report negative binomial estimates from Eq. (1). Column (i) is the basic specification and shows estimates for all traded patent flows (\(All \, Flows\)). For robustness purposes, we also report estimates based on first traded patent flow (\(First \, Flows\)) and last traded patent flow (\(Last \, Flows\)) in columns (ii) and (iii), respectively. Furthermore, columns (iv), (v), and (vi) report patent flows in accordance with columns (i), (ii), (iii), but for a slightly different sample: we exclude patents issued between 1995 and 2002, but were traded later between 2003 and 2010. Having kept them in the sample, any decay of importance of nearness could be attributed also to such patents which take many years to be transferred, and as they are known for quite long time, they may be transferred further away.Footnote 11

Recent evidence from the citations literature gives support to this factor, that in addition to a time dimension there might be an aging dimension in which geographic proximity becomes less important (Li 2009). Heteroscedastic robust standard errors are provided in parentheses.

As Table 2 shows, estimates are similar across alternative definitions of patent transaction flows and samples. Each geographic coefficient in Table 2 captures the difference between knowledge flows diffused in geographic space to in-state knowledge flows, which is our benchmark by model construction. To convert each value to percentage change, one needs to use the exponential formula. We begin our discussion with estimated coefficients of column (i), which is the basic specification.

The coefficient of \(State\, Border\) implies that states, which are neighbors and, therefore, share common border, exchange about 93.3 % (\(= 1-e^{-2.697}\)) less patents to what they would exchange within their borders. In other words, on crossing a state border, transactions of patents diminish to 6.7 % (\(=e^{-2.697}\)) compared to in-state level of transactions. Irrespective of the border, distance also shapes patent transactions. States that are nearby, in the vicinity of 500 miles, but do not share common borders, exchange 96.6 % (\(= 1- e^{-3.392}\)) less patents than what they would exchange within themselves as the coefficient of \(Distance\,[<500\,\hbox {miles}]\) indicates. Put it differently, the volume of transactions when crossing non-adjacent states, diminishes to 3.4 % (\(=e^{-3.392}\)) to its in-state level of transaction. Further, patent transactions between states located in an area between 500 and 1,500 miles decrease by 97.6 % (\(e^{-3.709}\)) compared to what takes place in-state, as the coefficient of \(Distance\,[500{-}1{,}500\,miles]\) indicates. In other words, only 2.4 % of the patent trade takes place within a distance of 500–1,500 miles. An interesting finding, however, emerges when distance between states becomes long enough, specifically bigger than 1,500 miles. Although (long) distance exerts a heavy toll on the volume of patent trade, the reduction of traded patent volume, however, is smaller compared to the previous distance interval. This seemingly controversial finding is due to the ‘California effect’. Despite its distance from a typical state, California is an exceptional producer and trader of patents. In sum, geographic factors (border and distance) comprise a serious limitation of patent transactions across the states of the US. The trade of patents and, consequently, generated knowledge based on these transactions, is limited mainly by the state border, which is an important hurdle to out-of-state patent transactions and by geographic distance, where significant knowledge reduction takes place already within a district of 500 miles and any further increase of distance has, practically, no additional reduction effect. States trade 60 % less patents with other countries than what they would trade within their state borders, as the coefficient of \(Out\ of \ Country\) enters negatively and statistically significant.

Table 2 Geographical effects of patent transaction flows (average state)

Next, we turn into examining whether there are any time effects shaping the pattern of patent transactions across the States of the US. As our results show, the coefficient of time dummy, \(Dummy\), is positive and statistically significant, indicating that over time, or after 2002 in particular, in-state patent exchange has risen by 88.5 % compared to in-state patent trade prior to 2002. This finding does not necessarily translate into increased importance of distance for patent trade over time. As there is an increase, over time, of patent transactions across states in the US, a higher share of this trade takes place in-state. Concerning the time and distance interaction terms, states seem to involve in less patent trade with their neighbors ex post 2002 than ex ante as, over time, there is an additional decrease of patent transactions across states compared to in-state level of transactions.Footnote 12 In contrast, there is a noticeable increase of patent trade between very distant states, more than 1,500 miles, compared to before-2002 as it is evident from from the coefficient of \(Distance\,[>\!\!1{,}500\,{\text{miles}}] \times Dummy\) mainly due to the California effect and its increased patent production and transaction over time. Lastly, over time, there is no concrete evidence on the pattern of patent transaction localization at national this time level, as the coefficient of the interaction term, \(Out \ of\, Country \times Dummy\), is not statistical significant. Based on the evidence provided by the interactions terms we cannot argue over greater or lower importance of distance over time as their coefficients appear to be statistically insignificant.

To sharp the robustness of our results, we have performed several checks. We first consider alternative definitions of patent transaction flows. More precisely, we estimate the same specification as in column (i), but with different dependent variables: \(First\,Flow\) in column (ii) and \(Last\,Flow\) in column (iii). Estimates in both columns are in the vicinity of those reported in column (i). Further robustness for aging effects, replicates the analysis in columns (i) to (iii), but excluding patents granted early but transacted after 2002. The estimated effects, reported in columns (iv) to (vi), remain virtually the same.Footnote 13

Concluding, geographic nearness in terms of border and distance, exerts a strong influence on patent transactions across states of the US. The general and well-documented finding of geographic restriction of flows reported in the literature, is corroborated by our results. In particular, local bias on the state level appears to be quite sturdy in all alternative definitions of patent transactions. The very significant impact of distance on traded patents across states of the US is at first sight quite surprising and puzzling: unlike goods, patented ideas are weightless, and distance cannot just proxy only transportation costs.Footnote 14 Further, state border appears to confine trade of ideas as it does with trade of goods, and certainly this effect cannot be attributed to higher transportation costs when a traded patent crosses a state border. Instead, distance and border could be seen as informational barriers, and therefore can serve as proxies for all types of informational frictions: agents within a state tend to know much more about each other and each other’s business and technologies, either because of direct interactions between their citizens or because of better media coverage. In consequence, distance and border act as barriers to interactions, various micro-cultural affinities, and networking of economic agents (Saxenian 1994). It appears that trade of patents requires intensive information to take place and this is reflected on the size of both border and distance coefficients.

The localization robustness of weightless ideas documented in our study, matches also findings from quite different lines of research. For example, studies in the financial trade literature, using ’gravity-like’ models have investigated whether geographic distance imposes a hurdle on financial asset transactions, which are weightless compared to goods. In fact, Portes et al. (2001) and Portes and Rey (2005) examine the determinants of cross-border assets (corporate bonds equities and treasury bonds) and show that gravity model explains financial asset transactions at least as well as goods trade transactions and further document a very strong negative effect of distance on all asset flows.

Our estimates confirm the picture emerged in Fig. 3 as they capture this significant and sharp drop when flows cross state border and distance of 50 miles and then their path remains pretty much stable with some spikes.

3.1.2 Patent transactions of top and least innovator states

In this section, as a further robustness check, we examine whether the geographic scope of patent transaction flows from top innovator states is wider than the average state flows (Peri 2005).

To explore this aspect, we consider knowledge flows originating only from the top innovator states when estimating Eq. (1).

We select the top innovators to be states with the highest R&D spending. The states of California (CA), Massachusetts (MA), Michigan (MI), New Jersey (NJ), and New York, (NY), Texas (TX), Illinois (IL), Pennsylvania (PA), Maryland (MD), Washington (WA), and Ohio (OH) are among the top 10 states in R&D spending and account for about 70 % of the total US R&D activity in our sample. Therefore, they may act as innovation leaders.

Table 3 depicts the results. Columns in Table 3 are in accordance with those of Table 2. The only difference is that the most innovative states, the top innovators, are the only source, \(j\), of the patent transaction flows in Eq. (1). Again, one should read the estimated coefficient in the same way as explained in the previous section.

Estimates show that top leaders’ patent transaction flows are about equally geographically localized, both in terms of state border and distance, as those of the average state flows. Consequently, our results do not confirm the broader reach of leaders’ traded patent flows and potential generated knowledge from the leaders via patent exchange. The analysis of leaders’ flows confirms that independent of the origin of a state, i.e., whether the state is an average or innovator state, the spatial reach of patent exchange, and consequent knowledge flows generated, is strongly geographically confined.

Table 3 Geographical effects of patent transaction flows in the US (top innovators)

Next, we consider knowledge flows originating from the rest forty states, excluding flows originating from the top innovators. Estimates are displayed in Table 4. Results show that patent transaction flows originating from the lower innovative states are slightly less localized than the average state and top innovative state. In addition, comparing the \(Dummy\) coefficient between Tables 3 and 4, we observe that it is smaller in the case of patent flows originating from the less innovative states compared to patent flows originating from the top innovative states. Moreover, the estimate of \(Dummy\) for the case of the less innovative states is negative in columns (4) and (5), though borderline significant.

Table 4 Geographical effects of patent transaction flows in the US (excl. top innovators)

The above comparison implies that flows originating from top innovators are on average more localized than flows originating from the less innovative states. This finding indicates that top innovators trade more within the state than less innovative states, which potentially seek for buyers outside the state. This difference becomes even stronger, in more recent years as the coefficient of \(Dummy\) shows, where top innovators trade more within the state compared to less innovative states.

3.2 How important is geography for patent transactions across US technological sectors?

The estimates of the previous section showed a strong degree of localization of patent transactions. In this section, we dig deeper and investigate whether localization of patent exchange is still the case across six different technology fields namely, chemical (\(Chemical\)), computer and communication (\(Computers\)), drugs and medical (\(Drug\)), electrical and electronic components, (\(Electronics\)), mechanical machinery (\(Mechanical\)), and all other (\(Others\)) sector.Footnote 15

Estimates are reported in Table 5 below and read in the same way as explained in the previous section.

Two noteworthy findings emerge from Table 5: first, there is no much variability of geographic and time effects across sectors, as estimates are similar across sectors and, second, sector-specific estimates are close and similar in statistical significance to aggregate (state level) estimates.

As sector-specific estimates show, state border continues to remain an important hurdle for patent transactions across all sectors, with \(Drug\) and \(Others\) to be the sectors most strongly affected as 94 % of the patent trade in these sectors takes place in-state than across states, whereas the information technology sectors, \(Electronics\) and \(Computers\), appear to be the least affected as 93 % of the patent trade in these sectors takes place in-state than across states. The impact of geographic distance on trade of patents across US technology fields is even stronger. \(Mechanical\) and \(Others\) sectors are highly restricted by distance as, on average, distant states exchange 98 % less patents compared to what they trade in-state. Patent exchange in \(Computers\) and \(Electronics\) is mildly more stretched in space compared to the rest of the sectors as distant states exchange about 97 % less patents between each other in these sectors compared to what they trade within their state border. Therefore, inter-state patent trade is by far much smaller compared to in-state trade across all sectors and the former is massively restricted by state border and geographic distance.

Table 5 Geographical effects of patent transactions across technological sectors (\(All \,Flows\))

Overall, our sector-specific findings emphasize that patent transactions in all sectors are highly geographically confined. Among sectors, the geographic spread of patent transactions, and knowledge generated via traded patents, in the information technology sectors is the most far reached. A high degree of international standardization in computer language and computer procedures as well as in electronics, patent transactions in these sector reach farther in states and countries. \(Mechanical\) sector’s technologies, on the other hand, are more linked to the specificities of regional economies and probably less engaged into globalization. Therefore, knowledge flows generated via patent transactions in this sector is less far reaching. Our findings about the localization of knowledge flows generated in US sectors are consistent with evidence provided from the (patent-citation) literature (Peri 2005; Griffith et al. 2011).

3.3 Comparison with citation flows

To get a better sense of the size of our coefficients—as there is no study so far that has explored the geographic effects of patent transactions in the US—we compare, in this section, our patent trade geographic estimates with those of citations’. To this purpose, we estimate Eq. (1) for citation (of traded patents) flows across states in the US. Table 6 below presents our results based on state level (column (i)) and sector level (columns (ii) to (vii)) analyses. The coefficients are read in the same manner as before.

The state level estimates of citation flows in column (i) are somewhat smaller than that of their patent transaction counterparts reported in Table 2. The border and (average) distance effects on citation flows are about 2 and 5 %, respectively, smaller than those of patent transaction flows. A further interesting geographic aspect of citation flows is that they are less restricted compared to patent transaction flows, when distance between states becomes more than 1,500 miles, as the California effect is at work. We cannot draw, however, any further conclusion over the time pattern of localization of citation flows, neither at a state nor at a national level as the time dummy and time-distance interaction terms are all statistically insignificant.

Moving to sector level analysis for citation flows performed in columns (ii) to (vii), the sector-specific border estimates are equal (\(Drugs\)) or up to 4–5 % (\(Mechanical\) & \(Others\)) smaller than their patent transaction counterparts reported in Table 5. Compared to exchanging patents, neighboring states exchange more citations in these sectors. Therefore, state border confines citation flows less than it confines traded patent flows. Furthermore, citation flows are more far stretched in space by 4 % (\(Chemical\), \(Drugs\), and \(Electronics\)) up to 7 % (\(Computers\)) compared to traded patent flows. The California effect is also apparent for the sectoral citation flows—quite strong particularly for \(Computers\) and \(Electronics\). Again, we cannot elaborate more on time patterns of sectoral citation flows—with the only exemption of Electronics sector where citation flows between a state and a foreign country have increased over time—as time dummy and interaction terms are, in all other cases, statistically insignificant.

Overall, we find that citation flows across states of the US are less geographically localized than flows of patent trade. Geographical distance severely reduces the volume of traded patents between East and West (California) Coast, whereas, it has a milder effect on citation flows. The potential need for the patent buyer to maintain contact with the inventor attaches a more nuance role to geography in shaping patent trade than shaping citations and consequent knowledge generated. To the extent that purchased patented technology plays a role in the buyer’s production, distance and border could act as informational barriers, and serve as proxies for all types of informational frictions. Agents within a state tend to know much more about each other and each other’s business and technologies, either because of direct interactions between their citizens or because of better media coverage. Therefore, distance and border also act as barriers to social connectedness (micro-cultural affinities) and networking of economic agents. Such social proximities have been identified in the literature (Saxenian 1994) as important factors for knowledge exchange.Footnote 16 Consequently, informal knowledge flows that operate via non-market mechanisms, such as exchange of citations, transcend easier a state border and are more diffused in space than formal knowledge flows that originate from market channels, such as patent transactions, as in the former case social proximities may play a less significant role.Footnote 17

Table 6 Geographical effects of citation flows across states in the US

Our evidence is in line with that documented in Mowery and Ziedonis (2001), the single study in the literature that examined the geographic reach of knowledge flows based on citation as well as on licensed university patent flows. Mowery and Ziedonis (2001) find that geographic distance matters more for university-generated knowledge flows that operate through the market of contracts (patent licenses) to university-generated knowledge flows that operate through non-market channels. Contrary to some arguments stated in the literature (Audretsch and Feldman 1996, p. 651), we too find that geographic proximity is more essential to the operation of market contracts (trade of patents) compared to the operation of non-market flows based on citation flows.Footnote 18

3.4 Agglomeration and geographic localization

At this point, we would like to add some additional consideration of the potential role of industry (sector) agglomeration in the localization effect in the trade of patents found earlier.Footnote 19 Perhaps, patent transfer is more likely within a state simply because the majority of firms in an industry are also clustered in that state. Therefore, in this section we would like to further explore whether some technological sectors are more geographically concentrated than other sectors and whether the innovation status of a state (i.e., top vs. low innovator) also relates to agglomeration issues.

In general, the terms “agglomeration” or “cluster” are used to refer to various forms of geographic concentrations (Fujita and Thisse 2002; Martin and Sunley 2003). The degree of concentration provides insights into the relative agglomeration or dispersion of a particular sector and into the respective levels of concentration between sectors. The literature offers several indices to measure the spatial concentration of activities, namely the concentration ratio, the Hirshman-Herfindhal index, the locational Gini coefficients, the Ellison and Glaeser (1997) concentration index, the location quotient among others.Footnote 20

We focus here on the most widely used measures, the locational Gini coefficient, the location quotient (LQ), and the Hirshman-Herfindhal Index (HHI). The output of a sector or state is the number of patents it produces and therefore we examine whether such production is concentrated in some sectors and some states. We begin with the locational Gini coefficient, which is defined as the maximum difference between the patent production (\(S_{i}^{j}\)) in sector \(i\) (\(i = 1,\ldots , 6\)) in state \(j\) (\(j = 1,\ldots , 50\)) and the total patent production (\(T_{i}\)) across all sectors in a given state. Formally,

$$\begin{aligned} Gini_{i}^{j} = 1/2 \big (\sum \nolimits _{j=1}^{50}\big |S_{i}^{j}\Big /\sum \nolimits _{i=1}^{6} S_{i}^{j} - T_{i}\Big /\sum \nolimits _{i=1}^{6} T_{i}\big |\big ). \end{aligned}$$

It is a summary measure of spatial dispersion derived from a spatial Lorenz curve. Smaller values of locational Gini coefficient indicate similarity between spatial distribution of sector \(i\) and country’s activity.

Table 7 below presents the locational Gini coefficient for the whole sample period, as well as for sub-periods, for six technological sectors.

Table 7 Concentration of technological sectors in the US (locational Gini coefficients)

We observe that the locational Gini coefficients do not vary much across technological sectors. Sectors such as \(Others\) and \(Electronics\) appear to be the least concentrated (agglomerated) as opposed to \(Computers\) and \(Drugs\), which exhibit the highest concentration across states of the US. Over time, localization slightly increases, as sub-period Gini coefficients indicate, with the exception of the \(Computers\) sector, where the opposite effect takes place.

Figure 4 below graphically depicts the locational Gini coefficient in the two sub-periods: 1995–2002 (blue boxes) and 2003–2010 (red boxes).

Fig. 4
figure 4

Locational Gini Coefficient over time (Color figure online)

Next, we proceed with examining whether there is agglomeration of innovation activity within a state. In doing so, we rely on the location quotient (LQ) index, which is another common approach to spatially delimit agglomeration. The LQ for sector \(i\) in region \(j\) is equal to the percentage of total patent production in region \(j\) devoted to sector \(i\) divided by the percentage of state’s total patenting activity devoted to sector \(i\). Formally,

$$\begin{aligned} LQ= \frac{S_{i}^{j}}{\sum \nolimits _{i=1}^{6} S_{i}^{j}}\Big /\frac{T_{i}}{\sum \nolimits _{i=1}^{6} T_{i}},\hbox { where }i\,\hbox{ is sector }(i= 1,\ldots , 6),\, j\hbox { is state }(j=1,..., 50). \end{aligned}$$

A region is said to be specialized in one sector if it has an LQ over 1. In such a case, this sector is over-represented within this region. This property is used to identify agglomerations for a particular sector, which are then defined as the areas with high LQs for that sector.

In addition to \(LQ\) index, we also employ the Hirshman-Herfindhal index (HHI), which shows whether the production is concentrated in few or many firms and is calculated by summing the squared market shares of all of the firms in the industry. In our case, we sum up the squared market shares (\(s_{i}\)) of all technological sectors for each state \(j\). Formally,

$$\begin{aligned} HHI = \sum \nolimits _{i=1}^{6} s_{i}^{2}. \end{aligned}$$

The HHI ranges between 0 (low concentration) and 1 (high concentration).

Table 9 in the ‘Appendix’ provides the location quotient for the six technological fields (sectors) and HHI per state.

According to that Table, many states show \(LQ\) index higher than one and therefore exhibit some concentration especially in \(Others\) (35 out of 50 states) and \(Mechanical\) (33 out of 50 states) sectors, while only few states (15 out of 50) show concentration in \(Computers\).Footnote 21

The top innovator states have HHI lower that the sample mean (0.249) and, therefore, exhibit less concentration with respect to the ‘average’ state. For the remaining 40 states, concentration patterns vary with respect to the sample mean, but overall, the dispersion of the HHI across states is rather small (0.044). Over time, there is not any noteworthy change neither in the HHI across states nor in its dispersion across states, as the HHIs indicate for the two sub-periods.

In sum, our evidence does not point to the emergence of significant agglomeration effect across geographic and technological space and time; so does the localization effect found earlier. The absence of variability in both patterns makes us cautious to establish a rigorous relationship between them.

4 Conclusion

Informal market channels of knowledge flows have received considerable attention in the economics of innovation. Studies have shown that they are significant components in the innovation process and that they are constrained by geographic distance; a result similar to the trade literature. In addition, some works have claimed that geographic distance plays a less significant role in recent years due to the increasingly interconnected world.

Using recently developed detailed data provided by the Office of the Chief Economist of re-assigned patents, we offer new insights on these important issues by adding a market channel of knowledge transfer, which has not yet comprehensively explored so far, the market of patents and knowledge flows based on patent transactions. We examine whether geographic and time effects shape patent transactions across states and sectors in the US and compare our findings with those of citation flows. Our results support that geographic nearness, in terms of distance and contingency, also matters for patent trade, as it has been massively documented in the trade and patent-citation literature. They further confirm that knowledge, generated from innovative ideas, which are patented and traded, are more geographically restricted and, therefore, their effective reach is less far stretched in space compared to knowledge flows based on citations. Although border and distance do not proxy transportation costs, as they do for goods trade—ideas are weightless—they could act as informational barriers, and therefore can hamper contacts and consequently patent transactions.

Our findings clearly support the presence of strong localization at a state level for patent transactions and to a lesser extent for citations. Although there is evidence of more patent trade in-state rather than across states, it is not, per se, a nuance evidence of higher importance of distance and border over time, neither for state nor for sector patent trade (and citation) flows. The information technology sectors, however consist a notable exemption. We also explore whether industry agglomeration relates to geographic localization found in the trading of ideas. We found no rigorous association. Overall, our results remain robust across sub-samples and alternative specifications.

The next research challenge is to further explore the causes of localization of patent trade. The relationship of agglomeration and localization of patent trade deserves some further investigation. An issue deserving also further inquiry is the role of social proximity in shaping the trade of ideas. Learning more about the causes of localization is important for assessing welfare consequences. Furthermore, another important research avenue is the pursue of the joint estimation of the geographic stretch of embodied and disembodied knowledge flows in one common framework and examine their individual effects on local production of innovation.