Introduction

Convergence is an important keyword that explains the propensity and nature of current technology development in the technology-innovation field. Technological innovation has occurred through convergences in various fields, including the computer, semiconductor, mobile-telecommunication, and medical-healthcare fields, and the possibilities of generating future technological innovations through convergence are infinite. Thus, firms face the challenge of having to constantly develop new technological products through unceasing innovation (Cho and Kim 2014) and making the best use of opportunities (Geum et al. 2012). In this situation, understanding the state of development of technology convergence and forecasting the promising convergence area will enable a firm to perform more effective R&D activities and supply the market with a new technology or product that reflects the needs of customers at the right moment.

Patent analysis is one of the most widely used approaches for technology convergence monitoring. Patent documents are an ample source of technical and commercial knowledge and have been used to gain insight into technology dynamics (Ernst 2003). Patents contain up to 80% of all of the technological knowledge that is currently available (Blackman 1995). In addition, to become a patent, the technology needs to meet the condition “for industrial use” so that only a potentially commercializable invention is granted patent status. Therefore, it is worth analyzing patent documents, which are valuable sources of information about technologies available in a market. Thus, patent analysis has been used not only to monitor technologies over a short period of time but also to analyze technology-innovation patterns in order to develop a long-term technology strategy (Kim and Lee 2015). In this way, taking the patent as the objective and mature indicator of technology changes, patent analysis has been used as an analytical tool for technology convergence.

The research on technology convergence using patent information has been performed from various perspectives. From the research-purpose perspective, previous studies have attempted to identify technological convergence trends (Gauch and Blind 2015; Ko et al. 2014; Rizzi et al. 2014) or classify the types of convergence technologies based on the trends (Jeong and Kwon 2014; Kim et al. 2014). These studies applied two main methods, which included patent-citation analysis (No and Park 2010) and co-classification analysis (Kim et al. 2014; Kwon and Jeong 2014; Kim and Kim 2012). From the perspective of the analysis target, some studies focused on the convergence of a single technology (You et al. 2014; Cho and Kim 2014), while others investigated the convergence between a pair of technologies (Geum et al. 2012).

However, in spite of their invaluable implications, the previous research had several limitations. First, most of the previous works emphasized the past trends of technological convergence. Of course, investigating past trends can provide meaningful insights for the future, but the ability of the quantitative approach to objectively forecast the future was limited. Little effort has been made to forecast technology convergence using quantitative data. Second, many studies have focused on the convergence of technology pairs, asking such questions as, “What is a pair of converging technologies” or “What are a set of technologies converging with technology A,” again by identifying a pair of converging technologies. Technology convergence is likely to happen not only within two technologies but also more than three technologies simultaneously, which should be traced. An analysis of the limited relationship of technologies does not provide a fair reflection of all of the changes in technology convergence.

To overcome those limitations, this paper proposes a forecasting methodology for multi-technology convergence. In this methodology, patent-citation analysis is first used to identify the convergence relationship. As the focus of this study is to forecast the future of technology convergence, we judged that it is more appropriate to adopt a citation-based approach, which shows the mechanism of convergence on the assumption that frequent knowledge that flows between different fields indicates a high probability of their convergence, rather than co-classification approach, which shows the current state of technology convergence with the promise that a large number of patents co-classified by different fields implies a great degree of their convergence. Second, neural-network (NN) analysis was applied to forecast the future data of the technology convergence. The citation relationships between technologies include large amounts of complicated data. NN is useful for extracting patterns and identifying trends that are too complex to be noticed by other techniques, thanks to its significant ability to derive meaning from complicated data. Thus, it is expected to be useful for this study. Finally, a dependency-structure matrix (DSM), which enables the modeling, visualization, and analysis of the dependencies among the entities of any systems (namely, the technologies in this study) is employed to present the multi-technology convergence. The proposed methodology is expected to help find promising converging technologies and prepare for them.

The paper is structured as follows. “Backgrounds” section outlines the theoretical and methodological backgrounds. “Methodology” section presents the proposed methodology for forecasting the state of future convergence. “Illustrative example” section gives an example to show the applicability of the proposed methodology, and finally, the study is concluded in “Conclusions” section.

Backgrounds

Theoretical background: patent-based studies of technology convergence

The previous studies on technology convergence based on patent information can generally be summarized from three different perspectives: the purpose, methodology, or object of the analysis. First, in terms of the purpose of the research, relevant studies can be divided into two categories: attempts to understand the trends of technology convergence and defining the types of technology convergence based on the trends. The studies in the first category investigated the evolution trajectory of the technology convergence, where developing a methodology for analyzing the evolution process with greater accuracy dominates the field. Rizzi et al. (2014) analyzed the worldwide convergence trends in hydrogen inventions. Ko et al. (2014) proposed a method to analyze the technology convergence trends by measuring the knowledge flows of patents. The studies in the second category analyzed the convergence patterns of technologies by measuring the degree of convergence among them. For example, Jeong and Kwon (2014) measured the degree of convergence in the green-technology field to consider which field of technology actually converged with it. Kim et al. (2014) identified the core technologies in printed electronics at particular time periods, along with the technologies that played a central role in the convergence of different core technologies with regard to printed e-technologies.

Second, the previous studies can be divided into two categories according to the methodology used for the technology convergence: The first group applied patent-citation analysis, and the other applied patent co-classification analysis. Of course, these two methodologies have often been used in conjunction with another methodology, and other methodologies, such as IO analysis (Xing et al. 2011; Choi and Hwang 2014) or co-word analysis (Wu and Leu 2014), have also been applied to technology-convergence analysis. Nevertheless, the most frequently used methodologies include patent-citation analysis and patent co-classification analysis. In studies that use patent-citation analysis, the goal is to identify the flow among different pieces of knowledge (technologies) and reveal the convergence mechanism from the knowledge flows. For example, No and Park (2010) developed a citation-based index that measures the degree of convergence of cross-disciplinary technologies. Then, this citation analysis was combined with social network analysis to classify the trajectory patterns of the technology convergence. On the other hand, the previous studies using patent co-classification analysis attempted to analyze the phenomena occurring in technology convergence. Kim et al. (2014) studied the degree of mergers and the relationships between different technology domains through the association rule mining of patent co-classification. Kwon and Jeong (2014) analyzed the associative relationship between green technologies through patent co-classification and relevance analyses. Some studies have used both patent-citation analysis and patent co-classification analysis. Kim and Kim (2012) proposed two approaches using patent data to examine technological convergence. After a patent-network analysis using patent citations, which could be considered a knowledge flow among technological fields, they carried out a co-classification analysis of technologies to measure the convergence intensity, rate, and coverage.

Third, depending on the object of the technology-convergence analysis, the existing studies can be divided into single technology studies and technology pair studies. The study type varies with the number of technologies for analysis, where most of the current emphasis lies in a single technology (You et al. 2014; Cho and Kim 2014) or a technology pair (Geum et al. 2012).

The previous studies had some limitations. First, most efforts were made to investigate and interpret the past trends of convergence, while forecasting future trends is vital to gain the preoccupancy of a promising technology. There has been relatively little interest in forecasting technology convergence. Of course, there have been several attempts to forecast the future status of technology convergence, mostly through qualitative approaches such as through expert-based forecasting (Bengisu and Nekhili 2006) or using the comprehensive results of past technological characteristics to identify promising technology fields (Lee and Yoo 2014). While these qualitative approaches to technology foresight focus on expert opinions, the incorporation of quantitative data, along with a qualitative approach, will enable greater reliability and accuracy in the analysis results. To forecast the future trends of convergence, we not only investigate the past data in an attempt to understand past trends, but we also extrapolate the data in an attempt systematically to forecast future trends. For this purpose, this study developed a neural network model, which produced a reliable future citation matrix that performed relatively well. Based on such forecasting results, more patent analyses could be conducted to develop more insights into future technology characteristics.

Second, numerous studies have focused on the convergence of a single technology or a pair of technologies to observe how a particular technology has converged with other technologies or how a particular pair of technologies shows remarkable convergence trends. However, a technology has boundless opportunities to converge with multiple other technologies simultaneously. For example, during the 90 s, convergence was mainly discussed in the context of the convergence of the IT, telecommunications, media, and entertainment industries into a giant “infocom” sector (Lind 2004). Now, more and more industries are experiencing convergence with other industries, and technological innovation no longer occurs within the previously existing industrial silos but rather between them (Hacklin et al. 2009). As a result, sticking to a pair of technologies in investigating convergence may limit the possibilities to identify the overall trends of multi-technology convergence in multiple industries. There are limitations in presenting the whole technology convergence using limited perspectives. To overcome the limitations, this paper proposes a methodology for multi-technology convergence, projecting the future trends of convergence based on past trends. In the multi-technology convergence approach, we look at pairs of technologies, but these pairs may converge at one point, which enables the relationships between multiple technologies to be investigated. Table 1 shows the difference between the existing approach to technology-convergence analysis and the approach taken in this study.

Table 1 The difference of the approach in this study and its advantages

Methodological background

Patent-citation analysis

Patent-citation analysis has been adopted for conceptual and qualitative investigation (Ernst 1998), along with quantitative and empirical analyses of technological innovation (Trajtenberg 1990). Its strategic importance becomes more apparent as the process of innovation becomes more complex, the cycle of innovation becomes shorter, and the market demand becomes more volatile (Lee et al. 2012). Accordingly, patent-citation analysis has been used with a variety of objectives. The basic concept of patent-citation analysis is that there exists a technological link between patents if one patent cites another (Lee et al. 2012). Once the link is created, new knowledge comes from combinations of existing knowledge (Érdi et al. 2013). In particular, it enables an investigation of the knowledge flows (Duguet and MacGarvie 2005; Fleming et al. 2007) and links between innovations and their technological “antecedents” and “descendants” (Jaffe et al 2000), as well as the multiple linkages among inventors, firms, regions, and so on (MacGarvie 2005; Gomes-Casseres et al. 2006). Therefore, measuring technological knowledge flows can be a good start to analyzing the trends of technology convergence (Ko et al. 2014).

There are two types of citations. The first is a backward citation, which is used to measure the inflow of knowledge from other technologies. The second is a forward citation, which is used to measure the inventive quality in terms of the technological or economic value (Henderson et al. 1998; Jaffe et al. 2000). These unique linking properties of citations provide useful information on what is vital in studying technology fusion, which is greatly influenced by the relationships among other technologies (No and Park 2010). The studies on technology convergence using patent-citation analysis take advantage of the previously mentioned citation characteristics because these provide a better understanding of the overlapping technology areas and trajectory changes with the emergence of citations (Karvonen and Kässi 2013). If one patent cites another (i.e., a new technology is created by combining existing technologies), this is regarded as a convergence. If intensive patent citations are observed in different industries, technologies in the industries are likely to be converged. In this respect, convergence can be observed from the citation information that reflects the knowledge flows at the technology level. Technologies that have higher citation counts with other technologies have a higher possibility of convergence.

As a result, patent-citation analysis has been used as a main approach in technology convergence studies to identify the convergence trends, analyze the convergence evolution trajectories, and determine the newly converged technologies. No and Lim (2009) analyzed “the degree of technological convergence and diffusion” and “the influence and potential of a technology” using backward-citation information. Lee et al. (2012) proposed a stochastic patent-citation-analysis method to assess future technological impacts. Ju and Sohn (2015) proposed a hierarchical quality function deployment (QFD) framework that makes it possible to set R&D priorities and then develop corresponding business models to meet future societal needs. In this research, we also adopted a backward-citation approach to identify converging technologies, but unlike the previous studies, we firstly developed a future-citation matrix based on past citation information and then analyzed the converging areas based on the “projected future.”

Neural-network analysis

The objective of an NN is to obtain meaningful outputs by controlling the weights on the relationships between inputs and outputs. There are various NN techniques, including a multilayer feed-forward NN, recurrent NN, and time delay NN. Among these techniques, a multilayer feed-forward NN trained by a back-propagation algorithm is most commonly used for forecasting (Gutierrez et al. 2008). One major application area of an NN is forecasting multivariate, nonlinear, nonparametric statistics (Sharda 1994). Many studies have established that an NN’s forecast is more accurate than those of forecasting methods such as the moving average (MA), autoregressive integrated moving average (ARIMA), linear regression, and exponential smoothing methods (Mitrea et al. 2009). An NN offers better forecasting performance than traditional methods.

Thus, an NN has also been applied to patent analysis, though not quite frequently or for various purposes. For example, Trappey et al. (2011) used an ontology-based NN algorithm to automatically classify and search knowledge documents. Zhang et al. (2012) used it to explore the nonlinear relationship between the patent performance and corporate performance of pharmaceutical companies. Lai and Che (2009) proposed a revolutionary valuation model for the monetary legal value of patents, and Zang and Niu (2011) suggested a forecast model for the patents granted in colleges. In this research, an NN was applied to produce expected future-citation values for a citation matrix. Considering the complexity of our data and the forecasting accuracy of NN, it would be appropriate to apply the NN to the patent data for this study. In addition, since the purpose of this study is to produce accurate results rather than to gain implications from parameter estimation, the use of the non-parametric analysis method is justified.

Dependency-structure matrix

A DSM, which is also known as a design-structure matrix or dependency-source matrix, is a network-modeling tool that is used to represent the elements comprising a system and the interactions between the elements. It is particularly well suited for analyzing complex systems and thus has previously been used actively in the areas of engineering management. This method is powerful for engineering management, especially due to its advantages in terms of compact format, visual nature, intuitive representation, powerful analytical capacity, and flexibility (Eppinger and Browning 2012). More in detail, it provides a simple and concise way to present a complex system. Second, it is amenable to a powerful analysis, such as clustering (e.g., to facilitate modularity) and sequencing (e.g., to minimize the cost and schedule risk in processes).

A DSM is an N by N matrix that has an equal number of rows and columns. It shows the relationship among the system’s elements. In the matrix, the off-diagonal cells are used to present relationships between the elements. That is, a marking of the cell indicates a link between two elements. In Fig. 1, The X symbol relating to C and D indicates that Technology D is dependent on Technology C. If intensive knowledge flows from Technology C to Technology D are observed, innovation of Technology D is likely to be affected greatly by that of Technology C. In other words, the advances in Technology D require knowledge in Technology C, while Technology C transfers knowledge to Technology D. If Technology C affects Technology D, and vice versa, it is likely that convergence between the two technologies occurs.

Fig. 1
figure 1

DSM matrix

In this research, instead of adopting a clustering algorithm, which finds subsets of DSM elements that are mutually exclusive or minimally interacting subsets, we applied the sequencing algorithm of DSM to the matrix to show the relationships between technologies. Sequencing is the reordering of the DSM rows and columns such that the new DSM arrangement contains the fewest iterations (feedback marks), thus transforming the DSM into an upper triangular form. If any iterations exist, they indicate the possibilities of technology convergence. We assume that technology convergence is realized only through bi-directional knowledge flows rather than one-directional knowledge flows. The sequencing algorithm identifying iterations is more suitable for our assumption than the clustering algorithm, considering both bi-directional and one-directional relationships between technologies.

The sequencing results are expected to provide insights into how convergence happens, highlighting knowledge flows and the roles of technologies. From the rearrangement, three types of relationships can be represented in a DSM: parallel, sequential, and coupled. Parallel means that the system elements do not interact with each other (e.g., A and B in Fig. 1). A sequential relationship shows that one element influences the behavior or decision of another element in a unidirectional fashion (e.g., C and D in Fig. 1). Coupled indicates that the flow of influence or information is intertwined (e.g., E and F in Fig. 1). Convergence happens through coupled technologies, which are the focus on this research. In addition to the identification of converging areas, the matrix visualizes the role of technologies (system elements) in the technology areas (elements). If a technology has a relatively large number of marks in its row, it is regarded as a knowledge provider (e.g., A in Fig. 1), whereas a technology with a large number of marks in its column is considered a knowledge absorber (e.g., F in Fig. 1).

The DSMs in existing studies were mainly used as tools to project scheduling, work division, design-process management, and software-architecture management. Danilovic and Browning (2007) attempted to reduce the risk arising from the product-development phase by handling the dynamics of project development using a DSM and domain-mapping matrix. Austin et al. (2000) established DSM tools to suit the building-design process based on the existing DSM techniques and tools and validated these matrixes by testing the analytical design-planning technique on historical and current building projects. Sangal et al. (2005) described an approach to managing software systems using dependencies. Now, DSM is increasingly extending its application to deal with complex issues in healthcare management, financial systems, public policy, natural sciences, and social systems (Eppinger and Browning 2012).

To summarize, the DSM concept may be applied to our study as follows. First, each row (or column) in the DSM represents a technology. Second, the interaction between system elements represents a citation between technologies (i.e., convergence). When a coupled relationship occurs, it is considered to be a convergence between technologies, and a block that occurs as a result of a conversion is called a convergence cluster. A convergence cluster becomes a newly introduced convergence technology because of the coupling between the existing technologies. This is one of the earliest but most meaningful attempts to apply the DSM concept to innovation studies.

Methodology

Proposed approach

Figure 2 shows the overall framework for the proposed methodology. For multi-technology convergence forecasting, the technology field for the forecast is first selected. The technology field under consideration may vary with the purpose of the researcher’s analysis, and two or more fields may be selected. Second, the patent data related to the target technology field are collected. The selected patent data are taken from the registered patents, and the sub-technologies belonging to the target technology field may be coded by a patent-classification coding system for technology, such as International Patent Classification (IPC) or United States Patent Classification (USPC). For example, USPC 700, which denotes “data processing: generic systems or specific applications” is considered a single technology in the information technology (IT) field. Third, the flow of knowledge among technologies is identified by analyzing the citation relationships among the collected patents and then presented in the DSM matrix to analyze existing convergence trends. Fourth, the value of a future patent-citation matrix is forecasted through the NN analysis, and the results are used to construct a DSM matrix for forecasting the status of technology convergence among future technologies in the final step. Steps 3–5 are explained in detail in “Detailed process” section.

Fig. 2
figure 2

Proposed process to forecast future technology convergence

Detailed process

Developing a patent-citation matrix

To identify the technology convergence, a patent-citation matrix is first developed. Figure 3 shows the structure of this matrix. The matrix takes an N by N structure containing the technology classes and number of citations. In this matrix, groups of technologies are classified according to technology classification codes, such as A01 (when adopting IPC) and 700 (when adopting USPC), which are selected by the researcher. The technology classes in the columns cite the technology classes in the rows. If T2 is cited by T3, cell X32 indicates the number of citations from T2 (patents on T2) to T3 (patents on T3).

Fig. 3
figure 3

Structure of a patent-citation matrix

Identifying key converging technologies

Before applying the DSM to the forecasted citation matrix, it is worth using it on past data divided into several periods, which helps understand the past trends of convergence. Thus, the DSM is constructed using the values of the patent-citation matrix. In this study, we adopted a binary DSM, which is simple to use but provides enough information for our research purpose. To convert the citation matrix into a matrix with binary values, cut-off values are applied: A value of one is assigned when at least a certain amount of citation occurs within the patent-citation matrix and zero when no or few citations occur (see Fig. 4). Because of the characteristic of the DSM, the citation relationship between the same technologies is excluded. Here, a DSM was constructed using the “Macros for Partitioning and Simulation” provided for research at www.dsmweb.org.

Fig. 4
figure 4

Transforming a citation matrix to a binary DSM

Forecasting future technology convergence

The target for analysis in this study is future convergence, and hence, a future-citation matrix is developed using an NN. The NN is frequently used as a time-series analysis method to estimate the lag dependence in the data. The first step to forecast the convergence status using the NN is defining the inputs and outputs. In general, the value of a given time, t (month, year or term), is correlated with the past values of precedent times, t − 1, t − 2, … and t − n. Therefore, instead of using only a single value, the sum of the past n time-series data was used. The number of data that shows the best performance is chosen to calculate the values for an input variable. The overall forecasting accuracy can be measured by the mean absolute percentage error (MAPE). The formula for the MAPE is

$${\text{MAPE}} = \frac{100}{n}\sum\nolimits_{t = 1}^{n} {\frac{{\left| {A_{t} - F_{t} } \right|}}{{A{}_{t}}}}$$
(1)

where A t is the actual value at period t and F t is the forecast value at period t.

Accordingly, the number of data sets was set to six. The experiment was conducted on the randomly selected variables in the 2006 citation matrix—the number of citations from 365 to 235, from 435 to 424, and from 705 to 379—and the real citation values were compared with the estimation results. The experiment results shown in Table 2 revealed that we could expect the greatest performance—minimum average and standard deviation values—by training the NN with the sum of the lasted six years’ data as an input (see the Italics).

Table 2 MAPE values under different input variable conditions

Let us assume that we forecast the value of a specific year in the future (here, 2006) using the data for the past 10 years (1996–2005). In the NN matrix, t is 1996, and the X 32 value of 1997 is X 32_t+1. The X 32 value of 2006 (t + n) that we are trying to forecast is the output value of 2005 (t + (n − 1)). If a value is to be forecast for 2016, the NN training can be performed 10 more times.

Illustrative example

Selecting target technology fields

To illustrate the analysis method, the fields of IT and biotechnology (BT) were selected. IT–BT convergence is a representative technology field for technology convergence, which is a technology field in which an existing IT is converged with a biological phenomenon to produce new IT products/services (i.e., hardware, software, and an application field) using biological principles and characteristics. We used these as an example to forecast the future status of convergence because they both have the potential to grow into a driving force for the development of a nation’s economy, and their convergence with various fields is currently being promoted. To classify the technology, this paper uses USPC, which is partially referenced from the research of Geum et al. (2012). Table 3 lists the technology fields and USPC codes used in this paper. The IT fields consist of three sub-fields: “mobile telecommunications, telematics (USPC 340, 375, 379, 701),” “electrical computing (USPC 235, 361, 365)” and “digital contents, software solutions (USPC 705, 707, 715).” The BT fields also have three sub-fields: “biomedical devices (USPC 623, 702),” “molecular bioengineering (USPC 424, 426, 514, 435, 800),” and “surgery (USPC 600, 602, 607).”

Table 3 The definition of USPCs used in this research (Geum et al. 2012)

Collecting patent data

The United States Patent and Trademark Office’s (USPTO) patent data registered from 1996 to 2005 in the IT and BT fields were collected from the National Bureau of Economic Research (NBER) database. The database has information about the number of patent citations, the patent number of the citation, the registered date of the cited patent, the USPC of the cited patent, and the USPC of the patent doing the citing. As a result, the information from 387,703 patents and 353,785 citations between them was used for this research.

Developing a patent-citation matrix

For analysis, a patent-citation matrix needs to be developed. Figure 5 shows a patent-citation matrix for 2004. The most number of citations was observed within 600 and 514. When a forecast is made with 10 years of data, 10 patent-citation matrixes are constructed, which are used as inputs for NN analysis and also to investigate the past trends of convergence.

Fig. 5
figure 5

Patent-citation matrix of 2004

Identifying key converging areas

A DSM sequencing algorithm was applied to the citation matrixes on each of the 1996 to 2005 data sets to examine the past trends of convergence. Then, for ease of visualization, the convergence trends over 10 years, split into five-year units are presented in Fig. 6. The cut-off value of the DSM was set to the average citation frequencies for all patents published over the past 10 years, which corresponded to 4134.4. If a citation frequency of a particular cell is greater than the cut-off value, a value of one was assigned to the relevant cell to develop a DSM. Figure 6a shows that DSM built one cluster consisting of USPC 424, USPC 435, and USPC 514 during the period from 1996 to 2000. This convergence cluster was formed within the same technology field, molecular bioengineering in BT. Here, it should be noted that USPC 514 is an integral part of USPC 424, and thus, it is the convergence between USPC 514 (drug, bio-affecting, and body-treating compositions) and USPC 435 (chemistry: molecular biology and microbiology). Nevertheless, the analysis results showed that not only the convergence between two technologies but also among three technologies could be identified with the suggested approach, which is one of its advantages over other approaches. On the other hand, two more convergence clusters were identified in the other unit, 2001–2005 (see Fig. 6b). One of them consists of USPC 340 (mobile telecommunications) and USPC 701 (telematics) within the IT sector, while the other consists of USPC 707 (digital contents) and USPC 715 (software solutions), again in the IT sector. The analysis results uncovered that technology convergence within the same industry field was dominant by 2005.

Fig. 6
figure 6

Key converging areas identified

Forecasting future technology convergence

It is more significant to forecast the future trends of IT–BT convergence based on the past data than to investigate the past trends for technology planning. The past 10-year citation data from 1996 to 2005 were used to project the data for 2006. SPSS 18 was used for NN training. The dependent variable was the normalized values at time t, and the independent variables were the normalized values at time t − 1. The training parameters are listed in Table 4.

Table 4 Parameters for training

To validate the use NN for the purpose of forecasting, we compared the MAPE values obtained by NN and growth curves. Table 5 shows the forecasting performance of the NN and growth curves, with the focus restricted to the three randomly selected variables. As in Table 2, a training data set included the values from the 1996 to 2005 citation matrixes, and a test data set was collected from the 2006 citation matrix. The MAPE value obtained from the NN was from 12.31 to 21.08%, which indicates that the forecasting performance is quite good, being greater than the performance of other growth curve models (see Table 5). Here, it should be noted that there exist various forecasting models (e.g., linear, exponential and logistic) but we chose the growth curve model, Y = e^(b_0 + b_1t)), for comparative analysis, which shows the greatest forecasting performance in terms of R square values among them.

Table 5 Comparative analysis of forecasting performance

Therefore, the NN was applied to forecast the citation matrix of 2020, and the projected values in the matrix (see Fig. 7) were transformed to a binary value by using the cut-off value. As we used the total number of citations made in the past 10 years as a cut-off value for the five-year-period citation matrix in Fig. 6, we divided the cut-off value by five to generate a new cut-off value for the one-year-period citation matrix, which corresponded to 826.9.

Fig. 7
figure 7

Patent-citation matrix of 2020 (projected data)

Figure 8 presents the results of applying a DSM technique to the projected patent-citation matrix of 2020, which helps identify the expected future convergence. The convergence of USPCs 340 (mobile telecommunications)—701 (telematics), USPCs 707 (digital contents)—715 (software solutions), and USPCs 424 (drug, bio-affecting, and body-treating compositions)—435 (chemistry: molecular biology and microbiology)—514 (drug, bio-affecting and body treating compositions) is expected to continue until 2020. In addition to the three convergence clusters, we could also expect that two more convergence clusters will be created in 2020 between USPCs 600 (surgery)—607 (surgery: light, thermal, and electrical application) and UPSCs 705 (data processing: financial, business practice, or management arrangement)—707 (digital contents). Until 2020, the convergence of IT and BT fields is observable within the same fields on the assumption that the convergence defined in this study is the knowledge exchanges among technologies. Moreover, in the convergence between the two fields, it is expected that USPCs 424 (drug, bio-affecting, and body-treating compositions) and 705 (data processing: financial, business practice, or management arrangement) will act as a knowledge provider in general, while USPC 340 (communications: electrical) will act as a knowledge absorber.

Fig. 8
figure 8

Expected technology convergence areas in 2020

The forecasting results suggested that the number and size of the convergence clusters would continue to increase over the past 10 years in terms of convergence trends. We can observe that the cluster whose number was 1 in 1996–2000 increases to 4 in 2020. In addition, IT and BT used for this analysis are proven to be converged with sub-technologies significantly. In terms of the convergence fields, the cluster of USPC 424, USPC 435, and USPC 514; the cluster of USPC 340 and USPC 701; the cluster of USPC 600 and USPC 607; and the cluster of USPC 707 and USPC 715 are core convergence technologies, which are noticeable for IT and BT. In terms of the continuity of the convergence, each cluster appeared continuously until 2020 from since they were built. Especially, cluster 424, 435, and 514 is believed to contain core convergence technologies for BT, regardless of time. Though our analysis was restricted to the citation matrix for 2020 using data from 1996 to 2005, the continued projection of the forecasted results will enable us to obtain the citation matrixes in the following years and identify the converging technologies in the distant future.

Types of technology convergence

From the matrix in Fig. 8, several types of technology convergence could be identified. Some of them are based on bi-directional knowledge flows. The simplest type is a coupled convergence between two technologies (e.g., USPC 600 and USPC 607, USPC 340 and USPC 701) where both the knowledge in-flow and out-flow between the two technologies are active. On the other hand, the number of converging technologies can be extended to more than three, which is called as an interrelated convergence, where all technologies in the converging cluster are closely related to each other with extensive knowledge exchanges (e.g., USPC 424, USPC 435 and USPC 514). Another interesting type is an intermediated convergence, where pairs of technologies are linked together through the mediation of a particular technology (e.g., USPC 705, USPC 707 and USPC 715, though the mediation of USPC 707).

There are also some other technology pairs worth noting. Although this study assumed that bi-directional knowledge flow was a requirement for technology convergence and thus that converging areas were identified based on this requirement, one-directional knowledge flow also needs to be considered if a substantial amount of knowledge flow is observed (e.g., from USPC 235 to USPC705; from USPC 235 to USPC 340; from USPC 424 to USPC 623; from USPC 705 to UPSC 702; and from USPC 705 to USPC 340); such one-directional knowledge flow can be an early signal of technology convergence, or it can be regarded as another type of technology convergence.

Conclusions

This paper proposed a future-oriented forecasting methodology for multi-technology convergence. First, we selected the fields for analysis and collected patent data for each field. Then, we analyzed the citation relationships among the technologies in each field, constructed a patent-citation matrix and DSM, and analyzed the status of the existing convergences. Then, using an NN, we forecasted the status of future technology convergences.

The proposed methodology provides the following contributions for studies of technology convergence. First, this methodology enables a quantitative forecast of “the status of future technology convergence.” By providing quantitative forecast values related to whether instances of convergence would increase or decrease between technologies in the future, we attempted to overcome the limitations of an expert-based forecast method or a convergence analysis focusing on the past trends. Second, the proposed methodology forecasts a future technology convergence from “a multi-technology perspective.” By considering the convergence potentials among all of the technologies in a field for a forecast, our methodology attempts to overcome the limitations of the existing studies of convergence with limited technologies. This proposed methodology generates convergence clusters through a DSM to enhance the visibility of convergence patterns among multiple technologies.

Despite these contributions, this study has some limitations. First, the elaboration of NN is required in order to determine the input variables or cut-off values. Instead of using a single input variable with the sum of several time-series values, it will be possible to adopt several input variables. Determining cut-off values is also worth considering. A smaller cut-off value will generate a greater number of converging areas and vice versa. Finding the most appropriate cut-off values should be addressed as a future research topic. Second, we defined converging areas as the technologies showing active knowledge inflows and outflows. However, convergence between Technologies T1 and T2 can happen via one-directional knowledge flows. Future research needs to define several types of convergence and suggest an approach suitable for each of the types.