Introduction

Contemporary science in general and GIScience in particular can be described as a dynamic, complex, and constantly evolving multiple-scale network of scientists, institutions, and ideas (Fortunato et al. 2018; Sun and Manson 2011). It has been recognized that one of the dominant mechanisms of facilitating scientific advances is research collaboration (Sun and Rahwan 2017; Wuchty et al. 2007). Research collaborations increase the productivity of researchers and accelerate scientific progress. Scientific publication data has been widely used to explore the patterns and trends of research collaborations (Sun and Rahwan 2017). Through joint work, individual researchers compose research networks which are amenable to scientometric social network visualization and analysis (SNVA) (Kim and Diesner 2015; Sun and Rahwan 2017; Sun and Manson 2011). SNVA has been adopted to visually and mathematically investigate how social system structures and evolutions are defined by relationships among its elements (e.g., people and organizations) which develop and grow with the intertwined systems of social networks (Andris 2016; Rogers 1987; Sun and Manson 2011).

One of the recent studies that explored the research network of scientific publications was by Chuan et al. (2018), who proposed a new metric for edge (link) prediction in research networks, i.e., predicting potential interactions among network elements, based on content similarity. In addition, Köseoglu et al. (2018) and Sun and Rahwan (2017) visualized and examined the authorship trends and explored the structures of scientific collaborations based on network centrality metrics in lodging studies and transportation research, respectively. Xie et al. (2016) proposed a geometric graph to model research networks, the connection mechanism that expresses the effects of the homophily of authors and scholarly influences, and the collaborations at the level of research teams rather than authors. Moreover, Hu et al. (2019) visualized and analyzed the structures of cited and uncited research communities in four disciplines (i.e., chemistry organic, engineering environmental, economics, and management) and three countries (i.e., the USA, the UK, and People’s Republic of China) based on co-authorship networks. One more interesting work was from Oliveira et al. (2017), where a Bayesian inferential approach was developed to measure the reliability of a research network, i.e., the probability of this network to remain connected, robust, and functionating, with emphasis on researchers (nodes).

SNVA can be considered a component of the science of science (SciSci), a field evolved from scientometrics, quantitatively examines the interactions among scientific agents (scientists, institutions, and ideas) across diverse spatial and temporal scales (Fortunato et al. 2018; Garfield 2009). For instance, some studies used large scholar datasets to explore the development of an academic field (Sun and Yin 2017), understand academic collaborations (Sun and Rahwan 2017; Sun and Manson 2011), or discover the impact of scientific work (Thelwall 2016; Wang et al. 2013). The emergence of SciSci has been driven by two main factors. The first factor is data availability (e.g., Web of Science, Scopus, and Google Scholar) pertinent to scientists from all fields and to their research output across the globe. The second factor is the collaborations among physical, social, and computational scientists, through which powerful (big) data processing, analysis, visualization tools, and models have been developed to uncover the mechanism underlying sciences and its institutions and workforce (Fortunato et al. 2018). In GIScience, there are also studies that have quantitatively analyzed certain research features such as citations, authorship, and publication patterns. For example, Biljecki (2016) analyzed 12,346 articles from 20 GIScience journals to extract patterns and trends; Duckham (2015) identified the expertise that GIScientists have in common based on keywords and citations; Wei et al. (2015) discovered and benchmarked the most important and highly cited articles published between 2003 and 2012; Sun and Manson (2011) examined the research networks and scientific collaborations. These studies have shed light on the Science of Science in the GIScience field. Moreover, there are studies using quantitative approaches to analyze research topics in GISciecne. For instance, Steiger et al. (2015) published a systematic literature review on spatiotemporal analyses of Twitter data in GIScience. Yan et al. (2020) performed a systematic review on volunteered geographic information (VGI) research topics through Latent Dirichlet allocation. An in-depth understanding of a scientific field through approaches of the SciSci can be beneficial for effective science funding allocations (Fortunato et al. 2018) as well as for high-quality education about the field.

Scientific publications in the field of VGI in particular have been booming in recent years. The term VGI was coined in 2007 by Goodchild (2007) and it has become one of the most important research topics in GIScience (Yan et al. 2020). VGI such as OpenStreetMap (OSM) and geotagged social media data can be an important source of understanding of the surface of the Earth (Goodchild 2007; Yan et al. 2017). The creators of VGI establish virtual networks to work on a common task (or subtasks) in either a synchronous or an asynchronous manner. They share their understanding of a common situation, shape contexts, and convey cognition through contextual knowledge of a place. VGI phenomenon thereby defies the traditional asymmetric power structure of geospatial information production and consumption, i.e., a minority of authorized data producers versus a majority of passive data consumers. On VGI platforms, geospatial data consumers are empowered to produce data and vice versa. The traditional division between data consumers and producers blurs (Mooney and Corcoran 2012). Indeed, the “producers” may have knowledge that is unknown to experts; local people in a sense may themselves count as experts in their own local or indigenous knowledge (Cinnamon and Schuurman 2013; Quinn and Yapa 2016).

The rapid development of VGI is attributed to Web 2.0 technologies, which favor participation and collaboration for the creation of common goods over the Internet (Goodchild 2007; Hachmann et al. 2018). The Internet in the Web 2.0 era enables the formation of a cyberspace of radical inclusion that transforms indirectly related physical communities into directly connected virtual communities. It creates platforms with techno-libertarian and egalitarian as the norms for open and pervasive collaborations of intelligence that promote digital democracy. Among the key principles of Web 2.0, utilizing collective intelligence is a key to sustaining VGI platform constructions (O’Reilly 2007). This principle encourages cyber-collectivism for the formation of Web 2.0 cyberspaces that offer opportunities for achieving higher productivity and greater innovations. In the Web 2.0 era, information technologies are more socially intertwined and new forms of social interactions within information networks are developed (Castells 2000). As such, Web 2.0 has enabled the general public to generate information and interact with one another on an unprecedented scale and in a real-time manner (Elwood et al. 2012). By contributing their collective intelligence, the general public is involved in GIS democracy in a true sense in the Web 2.0 era (Goodchild 2007).

As mentioned above, Yan et al. (2020) have recently published an article that reviews the decade-long research on VGI retrieved from 24 international refereed journals in the GIScience community. Their study has extracted 50 specific VGI research topics which have been subsequently clustered into three overarching themes including VGI contributions and contributors, main fields applying VGI, and conceptions and envisionings. The review has revealed the progress, patterns, and trends in the first decade of the VGI research. It has also proposed an agenda for future research endeavors. However, according to the review and to the best of our knowledge, no empirical research has examined the structure of the research network in the VGI research community, let alone providing insights into the collaboration mechanisms underlying the creativity and major genesis of VGI research discoveries. This study aims to build and visualize a research network of the VGI research from selected journal articles published during the first decade since the coining of VGI and investigate its patterns and structures using scientometric SNVA. This study uses four indicators including the number of citations, one local network centrality measures (i.e., degree), and three global network centrality measures (i.e., closeness centrality, betweenness centrality, and eigenvector centrality) to quantify the author importance across the research network. A new rule-based weighting method has also been developed for taking into account author sequences when computing the three global centrality measures.

Materials and Methods

VGI Research Articles

Following Yan et al. (2020), the dataset used to build the research network in this study includes the 346 articles published during the first decade of the VGI research since Goodchild (2007) coined the term (i.e., between 20 November 2007 and 20 November 2017). These articles are retrieved from 24 journals (based on the keyword “VGI”) that are indexed by the Science Citation Index (SCI), Science Citation Index Expanded (SCIE), Social Sciences Citation Index (SSCI), and Emerging Sources Citation Index (ESCI) (Table 1). The four indices are among the core collections of Web of Science according to Clarivate Analytics (http://mjl.clarivate.com/). The 346 articles involve 326 research articles and 20 review articles in which VGI is the main topic of investigation and discussion or at least is used as a source of geographic data. Figure 1 illustrates the temporal distribution of the articles for each journal.

Table 1 GIS journals included in this research
Fig. 1
figure 1

Number of articles for each journal during 2007 and 2017

Research Network and Topics

Since scientific collaborations are not unidirectional, an undirected research network CN (N, E) (N is the set of nodes (authors) and E is the set of edges (connections)) has to be adopted to build the research network for the 346 articles before performing SNVA. When building the research network, this study treats the journal articles for the 10 years as a whole rather than separates them into different temporal slots. This is in part for consistency with Yan et al. (2020) and is also because the number of articles generally kept increasing during 2007 and 2017, no other special temporal variation is observed, and the dataset (i.e., the 346 articles) is not big enough for highly meaningful data split, especially for the first 5 years (Fig. 1). To build a fully connected research network of the selected journal article, this study treats that all the articles are directly or indirectly related to Goodchild (2007) and thus the network is centered with the node that represents Michael F. Goodchild who coined the term VGI. Specifically, for all authors of the 346 articles, an edge is initiated to link each of them with Michael F. Goodchild, representing an indirect connection. An edge’s weight increases if there are actual collaborations (i.e., direct collaborations with co-authored articles) between the author and Michael F. Goodchild.

Gephi (https://gephi.org/) is used for network visualization. The nodes are colored based on the locations of the authors' affiliations. However, an author may have multiple affiliations. Therefore, this study employs a set of rules as follows.

  • If an author has more than one affiliation, then the one with which the author has published more articles is used for coloring;

  • If an author has published an equal number of articles under each of his or her affiliations, then the latest one is used for coloring.

The network built is then analyzed using one local and three global measures to reveal the node (author) importance of this network. The notions used in these measures include the following: nodes are denoted as i, j, and an edge linking node i and node j is denoted as eij; a neighbor j of node i is denoted as Nbij; the set of authors of an article p is denoted as Aup; the number of authors in an article p is denoted as |Aup|; and the sequence of author i in article p is denoted as \( {\mathrm{Au}S}_i^p\ \left({\mathrm{Au}S}_i^p\ge 1\right) \).

The local measure employed is degree Di, which is a count of how many neighbors (or co-authors) node i have:

$$ {D}_i={\sum}_{j\in N}{\mathrm{Nb}}_{ij}, $$
(1)

where Nbij equals to 1 if there is an edge directly linking i and j and 0 otherwise.

The three global measures employed, which provide the indications of author influence across the network, are closeness centrality (Bavelas 1950; Sabidussi 1966), betweenness centrality (Freeman 1977), and eigenvector centrality (Bonacich 1972).

Closeness centrality of a node is computed based on the network distance between the node and each other node in a graph. The higher the closeness centrality score is, the lower the node’s total distance to all other nodes is, meaning that the node is closer to all other nodes. It can be regarded as a measure of how long it will take to spread information from a node to all other nodes sequentially. For node i, its closeness centrality (CCi) can be calculated as the reciprocal of the sum of the length of the shortest paths between the node and all other nodes in the graph:

$$ {\mathrm{CC}}_i=\frac{1}{\sum_{i\ne j}{d}_{(ij)}}, $$
(2)

where d(i j) is the distance between node i and j.

Betweenness centrality of a node is the shortest path-based measure, which quantifies the number of times a node acting as a bridge along the shortest path between two other nodes. Therefore, a node with a higher betweenness centrality score has more control over the network because more information passes through that node. For node i, its betweenness centrality (BCi) can be calculated as:

$$ {\mathrm{BC}}_i={\sum}_{i\ne j,i\ne k,j\ne k}\frac{\delta_{jk}(i)}{\delta_{jk}}, $$
(3)

where δjk denotes the total number of shortest paths between node j and k, and δjk (i) denotes the number of shortest paths passing through node i (i ≠ j, i ≠ k). Note that there may exist multiple shortest paths between a pair of nodes (j, k).

Eigenvector centrality is a measure related to prestige, which is a more sophisticated view of centrality. The idea is that the prestige of node i is related to the prestige of its neighbors. A node with few links may have a very high eigenvector centrality if those few links were to very well-linked others. A high eigenvector score means that a node is linked to many high score nodes. For node i, its eigenvector centrality (ECi) is computed based on the assumption that its centrality is proportional to the sum of the centrality of its neighbors:

$$ Ax=\lambda x, $$
(4)

where A is the adjacency matrix of the graph CN with eigenvalue λ. Based on the Perron–Frobenius theorem, there is a positive and unique solution if λ is the greatest eigenvalue associated with the eigenvector of A (Newman 2010).

For accounting for the path lengths for computing the three global centrality measures, each connection can have a fixed weight (i.e., w = 1) regardless of the number of authors in an article and regardless of the author sequences in an article. However, this may lead to bias in the calculation results of the centralities. To address the inflation by the number of authors, an adjusted weight parameter \( \dot{W} \) introduced by Newman (2001) is adopted:

$$ {\dot{W}}_{ij}={\sum}_{p\in P}\frac{1}{\left|{Au}^p\right|-1}{\sigma}_i^p{\sigma}_j^p,\kern0.5em \left(i\in {Au}^p\ and\ j\in {Au}^p\right), $$
(5)

where P is the article set (346 articles). \( {\sigma}_i^p \) or \( {\sigma}_j^p \) equals to 1 if i or j is an author of article p and 0 otherwise. As such, \( {\dot{W}}_{ij} \) represents the strength of the collaboration (if any) between authors i and j. Each collaboration between two authors in an article contributes \( {\dot{w}}_{ij}=\frac{1}{\left|{\mathrm{Au}}^p\right|-1} \) units to the total weight \( {\dot{W}}_{ij} \).

Furthermore, to account for the author sequences in an article, we introduce a readjusted weight parameter \( \ddot{W} \) calculated based on the following rules:

  • If a collaboration is between the first author i and a non-first author j of article p, then the collaboration contributes the following readjusted units to the total weight \( {\ddot{W}}_{ij} \):

$$ \ddot{w_{ij}}=\frac{\frac{1}{\left|{Au}^p\right|-1}}{{ Au S}_j^p-1}=\frac{1}{\left(\left|{Au}^p\right|-1\right)\left({ Au S}_j^p-1\right)},\kern0.5em \left(i\in {Au}^p,j\in {Au}^p\ \mathrm{and}\ { Au S}_i^p<{ Au S}_j^p\right), $$
(6)
  • If a collaboration is between a non-first author i and a non-first author j of article p, then the collaboration contributes the following readjusted units to the total weight \( {\ddot{W}}_{ij} \):

$$ \ddot{w_{ij}}=\frac{\frac{1}{\left|{\mathrm{Au}}^p\right|-1}}{2\left(\left|{\mathrm{Au}}^p\right|-1\right)}=\frac{1}{2{\left(\left|{\mathrm{Au}}^p\right|-1\right)}^2},\kern0.5em \left(i\in {\mathrm{Au}}^p,j\in {\mathrm{Au}}^p\right), $$
(7)

According to the above rules, for articles with two authors, the w, \( {\dot{w}}_{ij} \), and \( \ddot{w_{ij}} \) all equal to 1. Additionally, the w, \( {\dot{w}}_{ij} \), and \( \ddot{w_{ij}} \) of the edge initiated (representing an indirect connection) to link an author with Michael F. Goodchild all equal to 1. Since edges with a stronger connection have a shorter distance, following Sun and Rahwan (2017), the weighted version of the global centrality measures is computed based on the reciprocal of the weight parameters as weighted edge cost. The network analysis is performed using NetworkX (https://networkx.github.io/) which is a Python package specifically for studying complex networks.

Apart from these network-based measures, the number of citations of each paper is also used to quantify author importance. Google Scholar citations as of 12 May 2019 are used in this study. The total number of citations of author i is denoted as:

$$ {C}_i={\sum}_{p\in P}\left({ct}_p\times {\gamma}_i^p\right),\left(i\in {Au}^p\right), $$
(8)

where ctp is the number of citations of paper p and \( {\gamma}_i^p \) equals to 1 if author i is an author of article p and 0 otherwise.

Overall, eight measures are selected for quantifying the authors’ importance, which are summarized in Table 2.

Table 2 The node (author) importance measures

Lastly, the spatiotemporal distributions of the 50 VGI research topics derived by Yan et al. (2020) from the 346 journal articles associated with the research network are visualized. In general, this work can be considered as an extension of the review work by Yan et al. (2020).

Results

Descriptive Statistics and the Network Visualization

A total of 1106 authors are found in the 346 articles. As an author may have published more than one article, we removed duplications, which resulted in 765 unique authors that were later used to create the nodes of the research network. Between these nodes, 2651 edges are established according to the network construction rules introduced in the “Research network and topics” section. It is observed that most of the articles have two to three authors (Fig. 2a). In addition, the unique authors are found to be affiliated with 804 institutions, with the majority of these institutions located in Europe (369 or 45.9%) and North America (258 or 32.1%) (Fig. 2b).

Fig. 2
figure 2

a The number of authors across all the articles. b The number of affiliations associated with the articles divided by region. Note: Oceania includes only Australia and New Zealand

A fully connected research network of the VGI research is shown in Fig. 3. For the six regions in Fig. 3 (i.e., Europe, North America, Asia, Oceania, South America, and Africa), it is observed that the connectedness within Europe seems stronger than the connectedness within North America. For instance, a large cluster of European connections can be seen at the bottom of the graph. In comparison, the connections within North America tend to scatter across the graph, and the size of the clusters is relatively smaller. In addition, cross-regional connections can be observed, most of which are, however, among Europe, North America, Asia, and Oceania. Authors from South America and Africa have also connected with those from the other four regions sporadically.

Fig. 3
figure 3

A fully connected research network of the first decade of the VGI research. The node sizes are proportional to the total number of citations of the individual authors. The top 20 authors ranked by their total number of citations are labeled with their names and the name sizes are proportional to the total number of citations of the individual authors. The thickness of an edge between each pair of authors is proportional to the total number of connections between the two authors. Note: Oceania includes only Australia and New Zealand

Research Network Metrics and the Related Topics

The rankings of authors based on the total number of citations Ci and degree Di are shown in Table 3. Except for Michael F. Goodchild, the top three highly cited authors are Mordechai (Muki) Haklay (University College London, the United Kingdom (UK)), Andrew T. Crooks (George Mason University, the United States of America (USA)), and Alexander Zipf (Heidelberg University, Germany). Two out of the three highly cited authors are affiliated with European Universities and one is affiliated with a North American university. The three authors with the highest degrees are Alexander Zipf, Anthony Stefanidis (George Mason University, the USA), and Andrew T. Crooks. Two out of these three are affiliated with a North American university, and one is affiliated with a European university.

Table 3 Ranking of authors based on the total number of citations and degree Di. Note: Since indirect collaboration edges between Michael F. Goodchild and all the other authors are created in this study (Research network and topics), his degree is 764 (including indirect collaboration edges) and 12 (excluding indirect collaboration edges)

The rankings of authors based on adjusted closeness centrality (normalized) \( \dot{{\mathrm{CC}}_i} \) and readjusted closeness centrality (normalized)\( \ddot{CC_i} \) are shown in Table 4. Except for Michael F. Goodchild, the three authors with the highest \( \dot{{\mathrm{CC}}_i} \) are Alexander Zipf, Peter Mooney (National University of Ireland, Maynooth, Ireland), and Jamal Jokar Arsanjani (Heidelberg University, Germany). All three authors are affiliated with European universities. Except for Michael F. Goodchild, the three authors with the highest \( \ddot{{\mathrm{CC}}_i} \) are Alexander Zipf, Peter Mooney, and Julian Hagenauer (Heidelberg University, Germany). Again, all three authors are affiliated with European universities.

Table 4 Ranking of authors based on adjusted closeness centrality (normalized) and readjusted closeness centrality (normalized)

The rankings of authors based on adjusted betweenness centrality (normalized) \( \dot{{\mathrm{BC}}_i} \) and readjusted betweenness centrality (normalized) \( \ddot{BC_i} \) are shown in Table 5. Except for Michael F. Goodchild, the three authors with the highest \( \dot{{\mathrm{BC}}_i} \) are Alexander Zipf, Peter Mooney, and Jamal Jokar Arsanjani, and the three authors with the highest \( \ddot{{\mathrm{BC}}_i} \) are Alexander Zipf, Jamal Jokar Arsanjani, and Peter Mooney. For both rankings, all three authors are affiliated with European universities.

Table 5 Ranking of authors based on adjusted betweenness centrality (normalized) and readjusted betweenness centrality (normalized)

The ranking of authors based on adjusted eigenvector centrality (normalized) \( \dot{{\mathrm{EC}}_i} \) and readjusted eigenvector centrality (normalized) \( \ddot{EC_i} \) is shown in Table 6. Except for Michael F. Goodchild, the three authors with the highest \( \dot{{\mathrm{EC}}_i} \) are Daniel Sui (The Ohio State University, the USA), Craig M. Dalton (Bloomsburg University of Pennsylvania and Hofstra University, the USA), and Mordechai (Muki) Haklay. Two of these three are affiliated with North American universities, and one is affiliated with a European university. Except for Michael F. Goodchild, the three authors with the highest \( \ddot{{\mathrm{EC}}_i} \) are Daniel Sui, Craig M. Dalton, and Sterling D. Quinn (Central Washington University and The Pennsylvania State University, the USA). All three authors are affiliated with North American universities.

Table 6 Ranking of authors based on adjusted eigenvector centrality (normalized) and readjusted eigenvector centrality (normalized)

Table 7 shows the pairwise Pearson correlation matrix for the node importance measures that indicate the importance of authors in the VGI research network. The result suggests that Di and Ci are not highly correlated with the closeness centrality measures (\( \dot{{\mathrm{CC}}_i} \) and \( \ddot{{\mathrm{CC}}_i} \)). In fact, the closeness centrality measures are not highly correlated with any other centrality measures. Furthermore, Fig. 4 shows the sum of each node importance measure for each region and the sum of each node importance measure for each region divided by the number of authors from each region, i.e., the average values of the measures. It is observed that Europe and North America occupy the top two layers of the sums, each of which accounts for a percentage of the sum of each measure that is much greater than other regions do. Regarding the average values, except for the average values of the citation and the betweenness measures which are relatively greater in Europe and North America, the average values of the other measures are more or less evenly distributed across the regions.

Table 7 Pairwise Pearson correlation table for the node (author) importance measures. Values for the highly collinear pairs (i.e., r > 0.75) are highlighted in italics
Fig. 4
figure 4

The sum of each node (author) importance measure for each region and the sum of each node (author) importance measure for each region divided by the number of authors from each region. Note: Oceania includes only Australia and New Zealand

Lastly, the spatiotemporal distributions of the VGI research topics associated with the research network are visualized in Fig. 5. It is observed that North America and Europe are associated with the highest diversity of research topics, followed by Oceania, South America, Asia, and South Africa. The USA, Canada, Germany, and the UK are among the leading countries in the VGI research that covers diverse topics such as data quality of OSM, sensor network, and disaster, crisis, emergency, and hazard management. The VGI research activities in North America and Europe also have a longer time span compared with other regions of the globe.

Fig. 5
figure 5

Three-dimensional scatter plot showing the spatiotemporal distribution of the 50 research topics derived from the 346 journal articles. Note: Each dot in the graph represents a journal article. The countries on the x-axis are associated with the first authors; the years in the legend are the publication years of the articles. Multiple articles produced in a single country are arranged horizontally, extending from the central cross point (offset plotting), rather than overlapped to each other

Discussion and Conclusions

Main Research Findings and Interpretations

This study has performed a scientometric SNVA of the first decade of the VGI research based on selected journal articles in GIScience. Recently, Yan et al. (2020) has performed a narrative review of the research articles concerning VGI published during the same period. Based on the same collection of articles, this quantitative scientometric research can be considered as a complement for the qualitative review. This study has used the number of citations, one local social network centrality measure (i.e., degree), and three global social network centrality measures (i.e., closeness centrality, betweenness centrality, and eigenvector centrality) for quantifying the node (author) importance in the network.

To take into account the number of authors in an article when computing the global centrality measures, this study has adopted an established edge weighting approach to derive an adjusted version of the global centrality measures. In addition, to appropriately consider the author sequences in an article when computing the global centrality measures, this study has developed a new rule-based edge weighting method to derive a readjusted version of the global centrality measures. This weighting method has further reduced the bias in the calculation results of the centralities.

Regarding the main research findings, firstly, although VGI was coined by a researcher from North America (Michael F. Goodchild), European institutions seem to be more actively engaged in the VGI research than the North American counterpart (Fig. 2b). One possible reason is that OSM as the most popular VGI platform was created in the UK in Europe (Yan et al. 2020). It is also found that the connectedness within Europe to be stronger than that within North America according to the research network visualization (Fig. 3). One possible reason for this network pattern is the geographic closeness of the European institutions. In addition, this study has demonstrated that the top researchers measured using the eight-node importance measures are all affiliated with North American and European universities (Tables 3, 4, 5, and 6), confirming the leading role of the two regions in the VGI research. Apart from OSM which was developed in Europe, other diverse and most influential sources of VGI, such as Twitter, Flickr, and Geo-Wiki, were mostly developed in either the USA or Europe. The diverse VGI platforms established in these two regions and the high VGI data accessibility may explain why these two regions are the most active in the VGI research (Yan et al. 2020).

According to the pairwise correlation table (Table 7), the results suggest that the closeness among researchers does not seem to highly contribute to the increase of citations. However, the degree, betweenness centrality, and eigenvector centrality are highly correlated with the citation numbers, suggesting that the number of direct connections in the network (i.e., the number of directly linked neighbors of an author without any intermedia node), the authors’ control over the network, and the quality of research connections is more important to increasing citations. Furthermore, this study discovers that the sums of the eight measures are high in Europe and North America (Fig. 4). Europe and North America also have high average values of the citation numbers and betweenness centrality measures, while the average values of the other measures are generally evenly distributed across the remaining four regions (Fig. 4). These findings suggest that European and North American authors as a whole play a leading role in the VGI research, but on average (per author influence) are only outstanding in terms of the citation numbers and have relatively more control over the network.

Lastly, this study has revealed the high diversity of the VGI research topics investigated in North America and Europe throughout the first decade of the VGI research development, highlighting the major problems that have been studied across the VGI research network (Fig. 5). The diverse VGI research topics investigated in North America and Europe and the longer time span of the research activities in North America and Europe further confirm the leading role of the two regions in the VGI research field (Fig. 5).

Research and Practical Implications

Overall, the research outcome of this work would benefit the development of policies, approaches, and tools that have the potential of accelerating the development of VGI science. This study has generated insights into the conditions and structures underlying the creativity in VGI research. It has also provided a quantitative understanding of the major genesis of scientific findings about VGI. Specifically, the authors involved in this study worked individually or collaboratively on the VGI research; therefore, this study provides indicators for person-directed funding, performance-based funding, topic-based funding, and scientist crowdfunding, contributing to the future development of the VGI research field (Fortunato et al. 2018). Additionally, the rule-based weighting method developed in this study for taking into account author sequences when computing the three global centrality measures has implications for researchers to better quantify author importance across a research network. Moreover, this study has pedagogical implications for VGI education through an in-depth understanding of the VGI science community. For example, VGI teaching can be based on the research outcomes of leading researchers from the leading regions with highly diverse VGI research topics (Figs. 3, 4, and 5 and Tables 3, 4, 5, and 6). Lastly, based on the results of this study and the relevant VGI review articles such as the one published by Yan et al. (2020), practitioners would have a direction for the enhancement of VGI platforms, e.g., seeking advice from a particular researcher or a cluster of researchers about how to motivate the user to contribute VGI and how to improve the data quality and credibility of a VGI platform.

Future Works

For future works, it will be necessary to explore the research network together with the research topics shown in Fig. 5. Yan et al. (2020) clustered the topics into three overarching themes including (1) VGI contributions and contributors, (2) main fields applying VGI, and (3) conceptions and envisionings. Further identifying these research topics that attract different degrees of regional or international collaborations and identifying the research topics that lack regional or international collaborations would be beneficial for facilitating the long-term development of this research field. Indeed, Yan et al. (2020) proposed a VGI research agenda about the identified research topics; it would be useful to discuss how researcher connectedness in the field would contribute to the fulfillment of the research agenda. Moreover, for building a fully connected network, this study assumes that every VGI article is related to Michael F. Goodchild (Research network and topics). Doing so automatically builds some relations that may not, in fact, exist. The result may be different if we do not make the assumption, and thus, an improved method is needed to build a fully connected network in order to compute the centrality measures across all the authors. Last but not least, the temporal variation of the network structure is not examined in this study. In fact, by observing Fig. 5, it can be inferred that the patterns of the research network did not vary strongly over the 10 years; i.e., basically, North America and Europe were consistently active in the VGI research throughout the 10 years. It would be interesting to keep tracking the temporal variation of the quantity of VGI articles and then investigate the structural changes of the network over the long-term run. For example, the network of the journal articles published during the first decade since the coining of VGI can be compared with that of the second decade. Indeed, there are many emerging VGI platforms that have been created in non-Western countries; VGI may attract more attention from researchers in non-Western countries, and thus, the research network structure may vary strongly over the long-term run.