Introduction

There is considerable and growing interest in emerging research topic (Mogoutov and Kahane 2007; Upham and Small 2010; Halaweh 2013; Zhao and Strotmann 2014; Kyebambe et al. 2017; Soriano et al. 2018; Calabrese et al. 2018; Kuhlmann et al. 2019, and among others). While elaborating this concept in the literature, many different phrases have been used, such as emerging research topics (Wang 2018; Xu et al. 2019), emerging topics (Ohniwa et al. 2010; Glänzel and Thijs 2012; Tu and Seng 2012; Small et al. 2014), research fronts (Persson 1994; Åström 2007; Chen et al. 2007; Aris et al. 2009; Toivanen 2014; Ma and Liu 2016), emerging trends (Glassey 2009; Xie 2015; Liu and Gui 2016; Sangam 2000; Naaman et al. 2011; Liu et al. 2018), emerging technologies (Smalheiser 2001; Adner and Levinthal 2002; Roche et al. 2010), emerging research fields (Scalise et al. 2007; Lee 2008; Bettencourt et al. 2008; Chang et al. 2010; Jang et al. 2011; Jarić et al. 2014; Rohrbeck et al. 2015; Weismayer and Pezenka 2017) and so on. In our opinion, main reason is that there is no consensus one a universal concept and accurate properties attached to it. Several examples related to the definition include: (1) the clusters of highly interactive literatures in science (Persson 1994; Morris et al. 2003; Åström 2007; Aris et al. 2009; Ma and Liu 2016), (2) the themes that appear frequently during a specified time period (Chen et al. 2007; Tu and Seng 2012), and (3) the topics that have properties of radical novelty and prominent impact (Rotolo et al. 2015; Wang 2018), fast relatively growth (Ohniwa et al. 2010), coherence (Rotolo et al. 2015; Wang 2018), persistence (Porter et al. 2018; Wang et al. 2018) and so on. On closer examination, we argue that the meanings of these terms are very close, so the emerging research topic is chosen to collectively refer to above mentioned concepts.

Last five decades witnessed significant progress in the field of emerging research topics ever since de Solla Price (1965). Several recent research activities, such as Emerging Research Areas and their Coverage (Reiss et al. 2013) supported by the European Research Council (ERC), Foresight and Understanding from Scientific Exposition (FUSE) (Small et al. 2014) funded by the Intelligence Advanced Research Projects Activity (IARPA) and 2018–2019 Contest of Measuring Tech Emergence,Footnote 1 promote further the development of emerging research topics. These studies can benefit the research foundations and policy-makers, whose main purpose is to promote and enhance the development of potentially promising research topics.

Until now, more than 1500 closely-related scholarly articles are published in the Web of Science database (see “Dataset” section for more details). To the best of our knowledge, there is no comprehensive literature review on emerging research topics until now. This makes it very difficult for beginning researchers to gain a proper understanding of the constantly updating field in the limited time. Moreover, it is very possible to miss important development in the areas beyond a researcher’s specialty. Therefore, this work devotes to systematically reviewing the main knowledge flows that reflect crucial development paths in this field and pointing out the directions of future research. It is well known that most existing literature surveys are conducted from the perspective of the resulting researchers’ experience. Thus, these reviews may be constrained by the time and the cognitive ability of the corresponding scholars on the interested field (Raghuram et al. 2010). Hence, it is very possible that some important articles in the field are excluded, which may result in several missing research lines. As an objective methodology, main path analysis (MPA) (Hummon and Doreain 1989) is a very promising alternative to review thousands of articles in this field. Furthermore, it has been successfully applied to survey the development of data envelopment analysis (Liu et al. 2013a, b), Hirsch index (Liu and Lu 2012), peer review (Batagelj et al. 2017), graphene (Yeo et al. 2014), IT outsourcing (Liang et al. 2016), and fuel cell (Verspagen 2007) fields.

It is well-documented that the citation-link network bears rich valuable information about how the knowledge diffuses (Bhupatiraju et al. 2012; Zhu et al. 2016). Therefore, it should be rational and feasible for MPA to scrutinize the development of the interested domain from the large citation-link network with efficient algorithms (Batagelj 2003) in terms of single main path (Hummon and Doreain 1989), multiple main paths (Liu and Lu 2012), key-route main path (Liu and Lu 2012) and so on. Since the most significant links (i.e., the links with the highest traversal weight) are not guaranteed to be included in the single main path and multiple main paths (Liu and Lu 2012), the key-route main path is preferred in this study. For purpose of calculating traversal weights, four methods are put forward in the literature (Hummon and Doreain 1989; Batagelj 2003): (a) search path link count (SPLC), (b) search path nodes pair (SPNP), (c) node pair projection count (NPPC), and (d) search path count (SPC), among which NPPC is not suitable for large networks due to high computation complexity (Batagelj 2003). Liu et al. (2019) recommended SPLC for tracing knowledge diffusion trajectory, since it imitates most closely the knowledge diffusion scenario in the scientific development where the intermediate nodes (i.e., individual publications) not only pass knowledge, but also are knowledge sources per se. Hence, SPLC is also adopted here. In summary, the key-route main path methodology, armed with the traversal weight of SPLC, is utilized in this work to review the literature in the emerging research topics field.

The rest of the article is organized as follows. After “Methodology” section introduces briefly the traversal weight calculation and key-route main path analysis, “Dataset” section shows how to define the emerging research topics fields as precisely as possible. Then, a knowledge diffusion trajectory with two convergence-divergence patterns is discovered in “Analysis and Results” section, from which one can identify a series of papers that play important roles in the emerging research topics field. The last section concludes this study with the directions of future research.

Methodology

In this study, the key-route main path analysis is utilized to explore the knowledge diffusion trajectories, and then to further understand the development track of emerging research topics area. From the perspective of evolution, the main path of a citation-link network can be seen as a time sequence diagram that reflects the knowledge diffusion of an interested area.

Traversal weights

In a citation-link network, a citation link’s SPC value is the total times that a citation link is traveled if one runs through all the possible citation paths from all the sources (i.e., the nodes that are cited while referring to no other nodes) to all the sinks (i.e., the nodes that are referred to while citing to no nodes). A citation link’s SPLC value is the total counts that a citation link is traveled if one runs through all the possible citation paths from all the ancestors (including themselves) of the tail nodes to all the sinks. The SPNP weight is more sophisticated, and it is the total number that a citation link is traveled if one runs through all the possible citation paths from all the ancestors (including themselves) of the tail nodes to all the descendants (including themselves) of the head nodes. The NPPC method calculates the number of times each link involved in connecting all node pairs in all subgraphs derived from the network.

Among these traversal weights, only SPC follows Kirchhoff’s node law, so a preference on SPC is suggested by Batagelj (2003). However, it is well known that in the diffusion process of the scientific knowledge, intermediate nodes not only pass knowledge, but also generate knowledge (Liu et al. 2019). In view of this, SPLC is closer in line with the actual knowledge diffusion situation, therefore this work decides to apply SPLC method for computing citation links’ weights.

Key-route main path analysis

The key-route search procedure (Liu and Lu 2012) begins with the citation-links with the top traversal weight (i.e., key-route), and then searches forward from the head node of the key-route until a sink is hit and searches backward from the tail node of the key-route until a source is hit. The resulting path-fragments are merged together to form the key-route main path. The more key-routes one selects, the more details of the main path will be revealed. Thus, prior knowledge can be readily incorporated into main path analysis, so that one can guarantee that the most significant links are included in the main path (Liu and Lu 2012). Hence, the key-route main path analysis method is preferred to in this study.

For convenient understanding, let’s take Fig. 1 as an example. It is easy to see that A and B are source nodes, and G, H, I and J are sink nodes. In all the citation links, <C, D> has the largest SPLC value. This means that this citation link plays a significant role in the knowledge diffusion process, so the link <C, D> is chosen as the key-route. When searching forward and backward from the link <C, D> from global or local viewpoint, the same key-route main path can be obtained in this case, as shown in Fig. 2. It is worth noting that the global main paths are usually not identical to the local counterparts in the real-world applications.

Fig. 1
figure 1

A citation-link network with SPLC traversal weights. The arrow indicates that the tail node is cited by the head one, and the thickness of each citation link is drawn in proportion to its SPLC values

Fig. 2
figure 2

The key-route main path from the citation-link network in Fig. 1. The arrow indicates that the tail node is cited by the head one, and the thickness of each citation link is drawn in proportion to its SPLC values

Dataset

The scientific publications in the emerging research topics field are retrieved from Web of Science (WoS) core collection on March 14, 2019 from the library of Beijing University of Technology. The data time span is set to range from 1965 to 2019. Given that the research front was first defined by de Solla Price in 1965, de Solla Price (1965) is often regarded as a pioneering work in this domain. So, the starting year of the dataset is fixed to 1965.

The following search strategy from Lu et al. (2019) is utilized in this study: TS = (“emerg* topic*” OR “emerg* research* topic*” OR “emerg* scien* topic*” OR “emerging* academic* topic*” OR “emergence of topic*” OR “emergence of scien* topic*”OR “emergence of research* topic*” OR “emergence of academic* topic*” OR “topic* emergence” OR “research* topic* emergence” OR “scien* topic* emergence” OR “academic* topic* emergence” OR “research* front*” OR “scien* front*” OR “academic* front*” OR “emerg* field*” OR “emerg* scien* field*” OR”emerg* research field*” OR “emerg* academic* field*” OR “emergence of field*”OR “emergence of scien* field*”OR “emergence of research field*” OR “emergence of academic* field*” OR “field* emergence” OR “research* field* emergence” OR “scien* field* emergence” OR “academic* field* emergence” OR “emerg* trend*” OR “emerg* research* trend*” OR “emerg* scien* trend*” OR “emerg* academic* trend*” OR “emergence of trend*” OR “emergence of research* trend*”OR “emergence of scien* trend*” OR “emergence of academic* trend*” OR “trend* emergence” OR “research* trend* emergence” OR “scien* trend* emergence” OR “academic* trend* emergence”). The document type is limited to article, review, and proceedings paper, and the research area includes science technology other topics, information science library science, and social sciences other topics. In total, the number of publications is 1567.

To make sure that all papers are pertaining to the emerging research topics, scholarly articles in the dataset is checked carefully one by one by reading the resulting titles and abstracts. In the meanwhile, several important papers from the reference list in Lu et al. (2019) but missed from the WoS core collection are added manually to the dataset. Finally, the dataset includes 1607 scientific publications. We assume that these articles will enable researchers to have a full picture on the development of emerging research topics field. The distribution of number of publications over year is illustrated in Fig. 3. From Fig. 3, one can see that the number of articles increases at a very slow rate before 2003, but after 2003, the number of publications increases rapidly, especially after 2011. This demonstrates that this field has been receiving increasing attention. In addition, a downward trend can be observed after 2018, since the latest publications are unavailable in the WoS core collection when the search is executed.

Fig. 3
figure 3

Distribution of number of publications over year for emerging research topics dataset

Analysis and results

According to our understanding of emerging research topics field, the following publications have contributed greatly to the development of the field and promoted largely the follow-up studies: de Solla Price (1965), Small (1973), Chen (2006), Rotolo et al. (2015) and so on. Intuitively, several convergence-divergence patterns should appear in the discovered main path with the above articles as the convergent nodes. Hence, the number of key-routes with top SPLC counts is chosen from 10 to 40 with a step size 5 (i.e., 10, 15, 20, …, 40), and then the exploratory main path analysis is conducted with Pajek software (Batagelj and Mrvar 1998). The expected convergence-divergence patterns emerge when the number of key-routes is 35, as shown Figs. 4 (local main path) and 5 (global main path), but becomes blurred if the number of key-routes is further increased or decreased. Hereinafter, our analysis will be limited to the case with the number of key-routes 35.

Fig. 4
figure 4

The local main paths with top 35 key-routes for the emerging research topics field. The citation links thicken by their SPLC values

Fig. 5
figure 5

The global main paths with top 35 key-routes for the emerging research topics field. The citation links thicken by their SPLC values

Before delving into more specifies, all scientific publications appearing in the key-route main paths are listed in Table 1, in which the first column (ID) corresponds to the resulting node’s label in Figs. 4 and 5. The nodes are labeled with the last name of the first author, the initial capitals of the corresponding first name, and then followed by the publication year. In the meanwhile, for the convenience of referring to specific scientific publications, scholarly articles are also given in the second column in Table 1. The last column in Table 1 highlights explicitly which main path the resulting nodes come from.

Table 1 A list of the scientific publications appearing in the key-route global or local main paths

Three stages of emerging research topics field development

From Figs. 4 and 5, the development process of emerging research topics field can be divided roughly into three stages. The emergence stage begins from de Solla Price (1965) (ID: PriceD1965) and Small (1973) (ID: SmallH1973), which lay the theoretical and methodological foundation of the field, respectively. The exploration stage is rather long, from Small and Griffith (1974) (ID: SmallH1974) to Rotolo et al. (2015) (ID: RotoloD2015), and it constantly separates, extends, occasionally weakens or disappears. Rotolo et al. (2015) (ID: RotoloD2015), which gives an elaborate overview of the emerging technology, bridges the exploration and development stages. From 2015 till now, two research lines can be observed from Figs. 4 and 5, where some citation-links gradually cross and merge.

It is noted that many triangle structures can be observed from Figs. 4 and 5, such as Shibata et al. (2008) (ID: ShibataN2008), Shibata et al. (2009) (ID: ShibataN2009) and Boyack and Klavans (2010) (ID: BoyackK2010). This phenomenon can be interpreted by a citation-based index of disruptiveness (Funk and Owen-Smith 2017, Wu et al. 2019). The intuition behind the knowledge paths containing the triangle structures is straightforward: if the papers cite not only a given article, but also a substantial portion of that article’s references, then this article can be seen as consolidating its scientific fields. If the opposite is true, that is to say, if the future citations to the article do not acknowledge the article’s own intellectual forebears, then the article can be seen as disrupting its field (Azoulay 2019; Bornmann and Tekles 2019). In this context, the articles (e.g. Shibata et al. 2009; Chen et al. 2010; Klavans and Boyack 2011; Small et al. 2014) tend to develop existing studies in emerging research topics field, whereas some other articles (e.g. Åström 2007; Takeda and Kajikawa 2009; Chen 2006) tend to disrupt this scientific field with new insights, ideas and methods. All in all, as Azoulay (2019) points out, sustained development of the given scientific fields requires both radical and incremental contributions. Consequently, both the consolidated and disruptive works are essential to emerging research topics fields.

To corroborate each other, we search forward and backward from the top 35 key-routes locally and globally, as seen in Figs. 4 and 5. Many scientific publications are shared by the global and local main paths (cf. Table 1). This is no doubt to admit that these works have great impact on emerging research topics development, and it is feasible for the key-route main path analysis to discover the knowledge diffusion trajectory.

The emergence stage: from 1965 to 1974

The groundbreaking paper (de Solla Price 1965) firstly defined the concept of research front and vividly expressed an active research front as a sort of growing tip or epidermal layer. It is easy to see that this concept emphasizes the novelty attribute. Several years later, the co-citation analysis method was proposed by Small (1973) to identify emerging research topics for future research. These two publications act as the root nodes of key-route main paths, and then converge to Small and Griffith (1974). On the basis of previous studies, Small and Griffith (1974) defined the research fronts as the clusters of highly interactive literature in science, and co-citation analysis approach was used to identify research fronts. The top articles in the exploration stage extend, modify, and explore further the ideas of these studies.

The exploration stage: from 1974 to 2015

This stage mainly deals with the research of emerging research topics from the perspective of the citation network analysis, such as co-citation, bibliographic coupling, direct citation and their variants. Furthermore, from Figs. 4 and 5, three streams can be roughly found: (1) the left stream emphasizes the performance comparison among several similar methods, (2) the middle stream mainly discusses citation-based methodological improvement, and (3) the right stream embodies citation-based and lexical-based methodological application. These three streams are described in more detail in the following paragraphs. It is worth mentioning that the Morris et al. (2003) redefined the research fronts as the clusters of documents that tend to cite a fixed and time invariant set of base documents (i.e., intellectual base).

The left stream consists of 4 articles: Jarneving (2007), Shibata et al. (2008, 2009), and Boyack and Klavans (2010). These publications are arguing which citation-link analysis approach should be preferred to detect the emerging research topics. Jarneving (2005) compared the results of research front portrayed by two different methods based on co-citation analysis and bibliographic coupling, and later on, Jarneving (2007) combined bibliographic coupling with a cluster method and found that this combined method could effectively identify coherent research themes. Shibata et al. (2008) performed a comparative study in two research domains [gallium nitride (GaN) and complex networks (CN)]. Shibata et al. (2009) and Boyack and Klavans (2010) both further investigated the performance of the various citation approaches through extensive experiments. Shibata et al. (2009) concluded that direct citation approach performs the best and the co-citation method shows the worst performance. However, Boyack and Klavans (2010) had a different view that bibliographic coupling approach slightly outperforms the co-citation method and the direct citation has the worst performance.

The papers in the middle stream discuss the citation-based methodological improvement and innovation regarding emerging research topics on the basis of the previous studies. Åström (2007) redefined the research fronts as current and influential co-cited articles on the basis of the Morris et al. (2003) and studied changes in research fronts of library and information science (LIS) domain using document co-citation analysis (DCA). The author bibliographic-coupling analysis (ABCA) was introduced by Zhao and Strotmann (2008) to map the research activities of active authors themselves for a more realistic picture of the current state of research in the information science (IS) field, and founded that research fronts can be detected based on weak signals in author bibliographic coupling analysis. Then, Chen et al. (2010) performed a comprehensive, multiple-perspective method for both author co-citation analysis (ACA) and document co-citation analysis (DCA) studies in the IS field. In addition, Klavans and Boyack (2011) recreated the document co-citation map of the IS field published by Chen et al. (2010) to compare the accuracy of local and global maps of science generated by intellectual base and research front. In the end, Boyack and Klavans (2014) absorbed and improved their previous work (Boyack and Klavans 2010; Klavans and Boyack 2011).

The right stream slightly differentiates local and global main paths, as presented in Figs. 4 and 5. More specifically, global main path includes the citation-link <ChenC2006, ChenC2010>. The topological clustering method was performed by Takeda and Kajikawa (2009) to detect emerging research domains. Chen (2006) considered that the highly bursting words identified by the burst algorithm (Kleinberg 2003) can be viewed as the indicators of emerging research front, and improved their influential visualization tool—CiteSpace and then was used in Chen et al. (2010). This tool makes substantial theoretical and methodological contributions to progressive emerging research topics domain visualization (Kim and Chen 2015; Li 2017). A mixed model, put forward by Guo et al. (2011), combines three different indicators: sudden increases in the frequency of specific words, the number and speed by which new authors are attracted to an interested area. The empirical studies on four emerging research areas show that these indicators are very indicative.

Later on, the right stream merges to the node Boyack et al. (2014), which challenges the fixed notion that the topic emerges once a certain number of articles have published on that topic. A methodology is proposed by Boyack et al. (2014) to characterize known emerging research topics and found two different patterns of emergence: one where the topic is not focused but then grows explosively, and one where the topic quickly becomes an area of focus and then grows steadily. In fact, as Cozzens et al. (2010) pointed out, most previous studies concentrate on how to measure emerging research topics, rather than how to identify them. Since Cozzens et al. (2010), several researchers begin to explore approaches of identification. For instance, Small et al. (2014) combined direct citation and co-citation analysis to recognize emerging topics in terms of the novelty (or newness) and growth indicators.

In addition, an interesting phenomenon can be observed that the citation-based methods dominate the field, and only several lexical-based studies (such as Chen 2006 and Guo et al. 2011) appear in the global or local key-route paths. Since the movement of open access (OA) to all research literature and the construction of comprehensive bibliographic databases, more and more text resources can be easily accessed. Furthermore, due to many known problems (such as time-lagging problem) with the citation-based methods, intuitively the lexical-based approaches should be able to serve as alternatives to the citation-based ones. To check if the dominance of the citation-based methods alienates the lexical-based studies, branch path procedure (Ho et al. 2014) is used here to trace the development trajectory of the lexical-based approaches. After the top 3 high-cited papers (cf. Table 2) on lexical-based studies are firstly selected from our dataset, the resulting branch paths are shown in Fig. 6 on the basis of the global main path. The branch paths for the local main path are very similar to those for the global main path, so they are not illustrated here. Table 2 describes the detailed information of these papers appearing in the branch paths. From Fig. 6, it is not difficult to find the citation links of branch paths are very thin. This indicates that our intuitive assumption does not hold. As a matter of fact, we also tried the top 4, 5, 6 and 7 high-cited papers on lexical-based studies. Though the resulting branch paths for these cases are not exactly the same, the same conclusion can be drawn. A reasonable explanation is given by Small et al. (2014) that the lexical-based methods can list emerging topics within some specific field, but they are not suitable for identifying emerging research topics across a wide swath of science.

Table 2 A list of the scientific publications appearing in the branch paths
Fig. 6
figure 6

The branch paths based on the global main path for the emerging research topics field. The citation links thicken by their SPLC values. The papers appearing in the global main path are represented by the yellow nodes, the top 3 papers are represented by the green nodes. (Color figure online)

The development stage: from 2015 to now

A step forward in 2015 triggered a new development in the emerging research topics field. Rotolo et al. (2015) reviewed the previous works and gave a clear and transparent concept of the emerging technologies that can be characterized by five attributes: (1) radical novelty, (2) relatively fast growth, (3) coherence, (4) prominent impact, and (5) uncertainty and ambiguity. This should be the first time to develop a comprehensive definition of an emerging technology, so its viewpoint is popular and widespread till now. After the node, this stage could be classified into two subgroups.

One subgroup includes three nodes (ID: JoungJ2017, LeeC2018 and BurmaogluS2019). They follow the viewpoint of Rotolo et al. (2015) and explore new approaches of identification and novel definition. Joung and Kim (2017) proposed a keyword-based patent analysis model to monitor emerging technologies, where the relatedness between the keywords was quantified by TF-IDF function. Lee et al. (2018) presented a machine learning approach by combining multiple patent indicators to identify emerging technologies at early stage of patents issued. Burmaoğlu et al. (2019) argued that the concept of emergence in technology was still ambiguous, so they traced emergence discussions to find the evolution of related concepts, and explored further usage in the technological context.

The other subgroup is mainly composed by the works from Professor Alan Porter and his research team. They have been working in the field of technology foresight and have developed several emergence indicators. Unlike Rotolo et al. (2015), four attributes of emerging research topics are taken into consideration: novelty, persistence, community, and growth. Garner et al. (2017), Carley et al. (2018), and Porter et al. (2018) developed a script to calculate emergence scores and set a family of viable technical emergence indicator based on the described emergence scoring. The primary emergence indicators identify “hot topics” and can be used to generate secondary indicators that reflect countries, organizations or authors actively engaging these hot topics. Wang et al. (2018) chose 3D printing as a case study and employed methodological steps in Porter et al. (2018) to identify emerging research topics of technological convergence using patent information.

Discussion and conclusion

Knowledge diffusion trajectory of emerging research topics field is characterized by the local and global key-route main paths. According to convergence-divergence patterns in the main paths, three different development stages can be observed: the emergence, exploration and development stages. After the research fronts are defined in the first stage, more and more attentions are paid to emerging research topics field, which result directly in hundreds of scientific publications in the second stage. It promotes in turn the rapid development of emerging research topics, including methodologies of identification and detection, analysis of characterization, description and visualization. Nevertheless, bibliographic methods are still widely used and developed by a lot of researchers, and citation analysis is considered as a useful tool for identifying emerging research topics.

Intuitively, due to many known problems (such as time-lagging problem) with citation-based methods and large-scale accessible text resources, the lexical-based approaches may be considered as alternatives or supplementations to recognize emerging research topics. For example, since burst-detection algorithm (Kleinberg 2003) can be adapted for detecting sharp increases of interest in target domains, it is general enough to be applied to a time series of multi-word terms or citations of articles (Chen 2006; Guo et al. 2011). However, as one can see from the branch paths in Fig. 6 that the citation-based method has been the mainstream approach of studying emerging research topics in the exploration stage. After 2015 (i.e., the development stage), with the development of large-scale text-processing technologies, machine learning methods are gradually utilized to detect emerging research topics due to their potential power, such as the feed-forward multilayer neural network (Lee et al. 2018).

In addition, from the key-route main paths in the emerging research topics field, several drifts can be observed as follows: (1) the identification methods are drifting from citation-based approaches to machine learning based ones, such as JoungJ2017 and leeC2018 in the Figs. 4 and 5. (2) The research direction transfers from the measurement (such as ShibataN2008 and ShibataN2009) to the identification (e.g. PorterA2018, WangZ2018 and so on). (3) Information resource is drifting from the papers to the patents (e.g. JoungJ2017, LeeC2018, PorterA2018 and WangZ2018 in Figs. 4, 5). The above drifts indicate that the scholars have being realized the complexity of the problem on detecting emerging research topics, and tried to solve this problem from different perspectives.

In the near future, there is still room to improve the identification method and to expand the types of information resources (such as scientific reports, projects and so on). What’s more, to our knowledge, there are no benchmark datasets public available with known emerging research topics until now. Currently, the judgment on the detected emerging topics still relies largely on expert-centric approaches such as Delphi and large-scale survey methods. Therefore, evaluation of emerging research topics also deserves to be explored.