Twenty Years of Network Science: A Bibliographic and Co-authorship Network Analysis

Molontay, Roland; Nagy, Marcell

doi:10.1007/978-3-030-67044-3_1

Part of the book series: Lecture Notes in Social Networks ((LNSN))

1380 Accesses
9 Citations
14 Altmetric

Abstract

Two decades ago three pioneering papers turned the attention to complex networks and initiated a new era of research, establishing an interdisciplinary field called network science. Namely, these highly-cited seminal papers were written by Watts and Strogatz, Barabási and Albert, and Girvan and Newman on small-world networks, on scale-free networks and on the community structure of complex networks, respectively. In the past 20 years – due to the multidisciplinary nature of the field – a diverse but not divided network science community has emerged. In this chapter, we investigate how this community has evolved over time with respect to speed, diversity and interdisciplinary nature as seen through the growing co-authorship network of network scientists (here the notion refers to a scholar with at least one paper citing at least one of the three aforementioned milestone papers). After providing a bibliographic analysis of 31,763 network science papers, we construct the co-authorship network of 56,646 network scientists and we analyze its topology and dynamics. We shed light on the collaboration patterns of the last 20 years of network science by investigating numerous structural properties of the co-authorship network and by using enhanced data visualization techniques. We also identify the most central authors, the largest communities, investigate the spatiotemporal changes, and compare the properties of the network to scientometric indicators.

Access provided by Autonomous University of Puebla. Download chapter PDF

Network Analysis and Indicators

A new network model for the study of scientific collaborations: Romanian computer science and mathematics co-authorship networks

Article 17 May 2016

Analysis of Significance and Evolution of Co-authorship Networks

Keywords

1 Introduction

Complex networks have been studied extensively since they efficiently describe a wide range of systems, spanning many different disciplines, such as Biology (e.g. protein interaction networks), Information Technology (e.g., WWW, Internet), Social Sciences (e.g., collaboration, communication, economic, and political networks), etc. Moreover, not only the networks originate from different domains, but the methodologies of network science as well, for instance, it heavily relies on the theories and methods of graph theory, statistical physics, computer science, statistics, and sociology.

In the last two decades, network science has become a new discipline of great importance. It can be regarded as a new academic field since 2005 when the U.S. National Research Council defined network science as a new field of basic research [11]. The most distinguished academic publishing companies announce the launch of new journals devoted to complex networks, one after another (e.g. Journal of Complex Networks by Oxford University Press, Network Science by Cambridge University Press, Applied Network Science and Social Network Analysis and Mining by Springer). Network science also has its own prestigious conferences attended by thousands of scientists. Leading universities continuously establish research centers and new departments for network science, furthermore, launch Master and Ph.D. programs in this field (such as Yale University, Duke University, Northeastern University, and Central European University).

The significance of network theory is also reflected in the large number of publications on complex networks and in the enormous number of citations of the pioneering papers by Barabási and Albert [5], Watts and Strogatz [35] and Girvan and Newman [13]. Some researchers interpret network science as a new paradigm shift [16]. However, complex networks are not only acknowledged by the research community, but innovative textbooks aimed for a wider audience have also been published [7, 23], moreover, the concepts of network science have appeared in the popular literature [4, 34] and mass media [30] as well.

In the last two decades, complex networks became in the center of research interest thanks to – among many others – the aforementioned three pioneering papers and due to the fact that the prompt evolution of information technology has opened up new approaches to the investigation of large networks. This period of 20 years can be regarded as the golden age of network science. The first challenge was to understand network topology, to this end, structural properties were put under the microscope one after the other (small-worldness, scale-free property, modularity, fractality, etc.) and various network models were proposed to understand and to mathematically describe the architecture and evolution of real-world networks [33]. In recent years, there has been a shift from the structural analysis to studying the control principles of complex networks [3]. Remarkable computing power, massive datasets, and novel computational techniques keep great potential for network scientists for yet another 20 years [33].

This work is a tribute to the achievements of the network science community in the past 20 years. We provide a bibliographic analysis of 31,763 network science papers and we also construct and investigate the co-authorship network of network scientists to identify how the network science community has been evolving over time.

The present study also extends the earlier conference version of this chapter [21] in several important directions. Namely, here we provide a more detailed literature review; we examine a longer time period; and we answer the question of how the network science community has evolved over time with respect to speed, diversity, and interdisciplinary nature by implementing novel analyses. Here we also provide an analysis of the co-occurrence network of the keywords. Moreover, we supplement our previous work with several other new methods and data visualizations that help to make insightful observations regarding the last two decades of network science.

The main contributions of this work can be summarized as follows:

We collect 31,763 network science papers and provide a bibliographic analysis investigating various characteristics of the papers and showing how the discipline has developed over time.
We construct the co-authorship network of 56,646 network scientists and undertake a scholarly network analysis study by analyzing its topology and dynamics.
We answer the following major research questions:
- What are the most important venues of network science and how have they changed over time?
- How the publication patterns vary over research areas and time?
- What are the most important topics of network science and how have they evolved through time? What relationships can we explore among the most frequent keywords of network science?
- How the network science community has evolved over time with respect to speed, diversity, and interdisciplinary nature?
- What are the most typical patterns in terms of international and interdisciplinary collaborations?
- Who are the most central authors and how do the largest communities look like? How do these network properties compare to other scientometric indicators?

2 Scholarly Networks Analysis

The present paper joins the line of research focused on scholarly network analysis that is based on big scholarly data [28, 37]. Big scholarly data refers to the rapidly growing data accessible in digital libraries containing information on millions of scientific publications and authors [36]. The easily available data sources (Web of Science, Scopus, PubMed, Google Scholar, Microsoft Academic, the U.S. Patent and Trademark Office, etc.) together with novel powerful data analysis technologies have led to the emergence of science of science [12] that gives us a better understanding of the self-organizing rules and patterns of science, e.g. how disciplines have emerged and evolved over time [19]. Various scholarly networks at many levels can be formed based on scholarly data; Pawar et al. identify the following forms of scholarly networks of great interest [28]:

1.
co-authorship networks (a link is formed between scientists by their co-authorship of at least one scientific paper),
2.
citation networks (a directed link is formed between documents referencing one another),
3.
co-citation networks (a link is formed between documents if they are cited together),
4.
bibliographic coupling (documents are linked if they share common references),
5.
co-occurrence networks (keywords/topics are linked if they occur in the same document), and
6.
heterogeneous networks (two or more coupled scholarly networks).

Among the aforementioned scholarly networks perhaps co-authorship networks have attracted the greatest deal of research interest, owing to the fact that co-authorship is one of the most important reflections of research collaboration, which is an essential mechanism that joins together distributed knowledge and expertise into novel discoveries. Furthermore, building the map of sciences is not only important for sociologists and other scholars to understand researchers’ interaction but for policymakers as well to address sharing resources [36]. Co-authorship networks have been studied extensively in various ways and from various perspectives: e.g. the collaboration network determined by the articles of a certain journal, a specific country or a research community that cites a particular influential paper or author [1, 2, 6, 17, 24, 25]. In this chapter, we investigate the co-authorship network of network scientists as defined in the following section.

Keywords co-occurrence networks have also been investigated thoroughly [15, 20, 29, 31]. Keywords of academic articles can provide a concise overview of the content and the core idea of the body of the papers. In contrast to word clouds, co-occurrence networks do not only show the frequency of the keywords but also allow us to discover the relationship between them. Li et al. investigated 6,900 articles published between 1982 and 2013 which had been indexed using the keyword ‘complex network(s)’ and provided a co-keyword network and a keyword co-occurrence network analysis [20].

3 Preliminaries and Data

In this section, we describe how the co-authorship network of network scientists was constructed, how the examined set of academic publications was chosen and collected, what data preparation steps were conducted. Moreover, we also present some useful notions and preliminaries.

3.1 Co-authorship Network of Network Scientists

To the best of our knowledge, the co-authorship network of network scientists has been analyzed only by Newman et al. [26, 27]. However, their network consists of 1,589 authors, while this study investigates a much larger network (56,646 vertices and 357,585 edges) spanning a longer time horizon (1998–2019).

We construct the co-authorship network of network scientists as follows. We consider three ground-breaking papers around the millennium that can be regarded as the roots of the rise of network science: the paper of Watts and Strogatz [35] on small-world networks, the work of Barabási and Albert [5] about scale-free networks and the paper of Girvan and Newman [13] that reveals the community structure of complex networks. We selected the aforementioned three papers since they initiated new areas of research in network science by introducing pivotal concepts two decades ago, that had a huge impact on the network science community that is also demonstrated by the large number of citations they received in the past 20 years.

In this work, we consider a paper as a network science paper if it cites at least one of the three aforementioned pivotal articles (in addition, the three originating papers are also regarded as network science papers, obviously). Similarly, we call someone a network scientist if (s)he has at least one network science paper. The previous definitions of a network science paper and a network scientist are of course quite arbitrary. It is important to note that the papers that refer to one of the three seminal papers are not necessarily about network science and there certainly exist network science articles that do not refer to any of the aforementioned pioneering papers. On the other hand, we believe that this concept is a good proxy for our purposes and it is worth studying. We construct the co-authorship network of the network scientists where two of them are connected if they have at least one joint network science paper (see Fig. 1). In other words, this network is a one-mode projection onto scientists, from the bipartite network of scientists and the network science papers they authored. The anonymized data of the constructed network and some figures in high resolution are available in the supplementary material [22].

3.2 Glossary

Definition 1 (Graph)

A simple (undirected) graph is an ordered pair G = (V, E), where V is the set of vertices or nodes and E is the set of edges or links, which are two-element subsets of V . The vertex and edge sets of G are denoted by V (G) and E(G) respectively. The size of the graph is the number of its nodes, and it is usually denoted by n.

Definition 2 (Complex network)

In network theory, the terms graph and network are used interchangeably, however, a complex network is a graph with non-trivial topological features, that characterize real-world networks.

Definition 3 (Average path length)

A path is a sequence of edges which connect a sequence of vertices. The distance d(u, v) between the vertices u and v is the length (number of edges) of the shortest path connecting them. The l_G average path length of a graph G of size n is defined as:

$$\displaystyle \begin{aligned}l_G = \frac{1}{n(n-1)}\sum_{\substack{u,v \in V(G) \\ u\neq v}} d(u,v). \end{aligned}$$

Definition 4 (Small-world property)

A network is said to be small-world, if the average path length is proportional to the logarithm of the size of the network i.e. $l_G \sim \log |V|$.

Definition 5 (Degree distribution)

The degree deg(v) of a vertex v in a simple, undirected graph is its number of incident edges. The degree distribution P is the probability distribution of the degrees over the whole network, i.e. P(k) is the probability that the degree of a randomly chosen vertex is equal to k.

Definition 6 (Scale-free property)

A scale-free network is a connected graph which P(k) degree distribution follows a power law asymptotically, i.e. P(k) ∼ k^−γ, where γ ≥ 1.

Definition 7 (Assortativity coefficient)

The assortativity coefficient is the Pearson correlation coefficient of degree between pairs of linked nodes. The assortativity coefficient is given by

$$\displaystyle \begin{aligned}r=\frac{\sum_{j,k}{j\cdot k (e_{j,k}-q_{j}q_{k})}}{\sigma_ q^2},\end{aligned}$$

where the term q_k is the mass function of the distribution of the remaining degrees (degree of the nodes minus one) and j and k indicates the remaining degrees. Furthermore, e_j,k refers to the mass function of the joint probability distribution of the remaining degrees of the two vertices. Finally, $\sigma _q^2$ denotes the variance of the remaining degree distribution with mass function q_k i.e. $\sigma _q^2 = \sum _k k^2q_k - \left (\sum _k k q_k \right )^2.$

Definition 8 (Local clustering coefficient)

The local clustering coefficient of vertex v is the fraction of pairs of neighbors of v that are connected over all pairs of neighbors of v. Formally:

$$\displaystyle \begin{aligned}C_{\mathrm{loc}}(v) = \frac{| \{(s,t)\ \text{ edges}: s,t \in N_v\ \text{ and }\ (s,t) \in E|\}}{\deg(v) (\deg(v)-1)},\end{aligned}$$

where N_v is the neighborhood of the node v i.e. the vertices adjacent to v.

The average (local) clustering coefficient of a G graph is defined as:

$$\displaystyle \begin{aligned}\bar{C}(G) = \frac{1}{n}\sum_{v\in V(G)} C_{\mathrm{loc}}(v), \end{aligned}$$

where n is the size of the graph.

Definition 9 (Global clustering coefficient)

The global clustering coefficient C of the graph G is the fraction of closed triplets (paths of length two in G that are closed) over all of the triplets (paths of length two) in G.

Definition 10 (Betweenness centrality)

Betweenness centrality of a node v is given by the expression:

$$\displaystyle \begin{aligned} g(v) = \sum_{s \neq v \neq t} \frac{\sigma_{st}(v)}{\sigma_{st}}, \end{aligned}$$

where σ_st is the total number of shortest paths from node s to node t and σ_st(v) is the number of those paths that pass through v.

Definition 11 (h-index)

The h-index is an author-level metric defined as the maximum value of h such that the given author has published h papers that have each been cited at least h times.

Definition 12 (Harmonic centrality)

Harmonic centrality of a node v is the sum of the reciprocal of the shortest path distances from all other nodes to v:

$$\displaystyle \begin{aligned} H(v) = \sum_{u\neq v}\frac{1}{d(u,v)}. \end{aligned}$$

3.3 Data Collection and Preparation

We build our analysis on data collected from the Web of Science bibliographic database, retrieved on January 2, 2020. The collected data consist of 41,245 rows with multiplicity corresponding to the citing works of the three seminal articles [5, 13, 35]. For each citing paper we have information on the document title, publication name, publication type, publisher, publication year, authors’ full name, authors’ address, research area, keywords, cited reference count, total times cited count, page count, abstract, etc.

After the data were collected, various data preparation steps were conducted, including merging the files, handling missing fields, deleting duplicates, and indicating which of the three seminal papers were cited by the given article. These preparation steps reduced the dataset to 31,763 unique rows and the citation pattern of the corresponding articles is shown in Fig. 2.

The authors are represented by the full name field of Web of Science, however, this field is unfortunately not consistent, the author called John Michael Doe may appear as Doe, John; John Doe; Doe, J.; Doe, J. M.; Doe, John Michael, and other variants. To overcome this issue, we created a dictionary that defines the name variants that correspond to the same author. Furthermore, we cannot distinguish between different scientists with the same name, this issue is mainly relevant for Asian authors. However, the error introduced by this problem is negligible, as also pointed out by Newman [24] and by Barabási et al. [6].

4 Analysis of Network Science Papers

First, we analyze the enormous number of citing works, i.e. the network science papers. Figure 3 shows the top 10 research areas where the citing works belong to, illustrating the interdisciplinary nature of network science. We can see that the first decade was dominated by physics while later computer science took over. It is also clear from the figure that neuroscience has started to use tools of network science in the last decade. The journals that publish the most network science papers are shown in Fig. 4. Considering the number of publications, Physical Review E was the leading scientific forum of network science in the first half of the examined period, while PLOS One and Scientific Reports emerged in the last decade. Currently, Physica A can be regarded as the leading journal of network science in terms of the number of published network science articles.

Figure 5 shows the number of collaborating authors per citing works, the most typical numbers of co-authors in a network science paper are 2 and 3. Almost one-tenth of the network science papers are sole-authored, M. E. J. Newman has the highest number of sole-authored network science papers, namely 27. While the figure shows only up to 15 number of authors, there are a few papers with a high number of collaborating authors e.g. the paper with the highest number of authors (387) is a paper of the Alzheimer’s Disease Neuroimaging Initiative [18]. The authors of this article emerge as a maximal clique of the co-authorship network of network scientists as it can be seen in Fig. 1.

We also investigate the distribution of network science papers written by network scientists. The authors with the highest number of network science papers together with the citation count of their network science papers are shown in Table 1. Guanrong Chen has the highest number of network science papers, his research areas are nonlinear systems and complex network dynamics and control.

Table 1 Top 12 authors with the most network science papers

Full size table

To gain some insights on how the publication patterns vary in network science depending on the research area, we show how the distribution of the number of cited references and the length of the papers differ across research areas (see Fig. 6). We can observe that in neuroscience and neurology authors typically cite a high number of articles while in computer science or engineering the typical number of cited references is much smaller. Engineering together with telecommunications and physics are also the areas with the shortest articles, while it can also be due to the fact that in those disciplines double-column publication formats are quite typical. It is important to emphasize that these observations are not necessarily representative of the disciplines in general, only for those papers that were defined as network science papers.

Figure 7 depicts separate word clouds of the most frequently used keywords of network science papers written in the four 5-year-long periods of the last two decades of network science. We can observe that in the first half of the examined period structure related (e.g., scale-free, small-world, topology) and modeling related keywords (e.g., preferential attachment, evolution, growing network, small-world model) dominated the study of complex networks, while in the last decade topics such as community detection, social network analysis, data-driven research (big data, data mining, link prediction, machine learning) have become more popular keywords. In the first decade of the examined period, the most studied real-world networks were Internet, peer-to-peer, and protein interaction networks, however, since 2010 the research has tended to focus on online social networks and brain networks. These word clouds can also be found in the supplementary material [22].

To gain a more comprehensive understanding of the keywords of network science papers, we also construct their co-occurrence network. Figures 8 and 9 together with Fig. 7 clearly shows that ‘complex network’ is the most important keyword of the examined set of papers that supports our hypothesis that our definition of network science paper was indeed a good proxy of our purposes. We can observe that the keyword ‘random graph’ often goes together with ‘network’ but less often with ‘complex network’ that suggests the term ‘complex network’ is not that widespread among mathematicians. Another observation is that community detection is rather popular in the social network domain, the bottom left side of the figures is dominated by terms associated with social network analysis and community detection. We can also observe that the keywords ‘scale-free’ and ‘small-world’ are frequently used together. The complete keyword co-occurrence network and the Figs. 8 and 9 can be found in the supplementary material [22].

Based on the address of the first author, we identify the network science hot-spots and investigate the spatiotemporal changes. Figure 10 demonstrates that China and the USA are the two leading countries of network science with a fast increase in Chinese network science papers in the last few years.

Figure 11 illustrates how the ratio of multidisciplinary and international papers varied over the years. We can observe that network science research gets both increasingly international and multidisciplinary. Here we consider a paper international if it has at least two co-authors who do not share an affiliation within the same country. The ratio of international papers in an important indicator, since it was also shown that scientific impact increases if researchers publish in international collaboration [8]. The multidisciplinary nature of papers is defined by the fact that more than one research area is attached to the document in Web of Science.

5 Analysis of the Co-authorship Network

The nodes of the co-authorship network of network scientists correspond to the authors who have at least one network science paper (i.e., a paper that cites at least one of the three seminal papers [5, 13, 35]), two of them are connected if they co-authored at least one network science paper. The network is simple, undirected, and unweighted meaning that here we ignore the strength of the connection between two scientists, i.e. the number of their joint papers. The network has 56,646 nodes and 357,585 edges with an average degree of 12.63, however, the median degree is just 4. The largest connected component consists of 35,716 nodes and it is depicted in Fig. 15.

The degree distribution of the network is illustrated in Fig. 12. There are 897 isolated nodes in the graph (nodes with zero degrees), i.e. scholars who have a single-authored network science paper but have not co-authored any network science papers. The most typical number of co-authors are between 2 and 4 and the tail of the distribution decays much slower than the number of authors per paper does (c.f. Fig. 5) since here the degree reflects all the number of co-authors who do not necessarily author the same paper. The highest degree is 546 corresponding to Roberto Bellotti, a medical physicist, who is also an author of the paper with the highest number of collaborating authors [18] and another many-authored paper [9]. While our network is unweighted by definition, a possible weight could be assigned to the edges corresponding to the number of joint papers written by the two authors at the endpoints of the edge. Table 2 shows the most ‘active links’, i.e. the edges with the highest weights in the edge-weighted version of the co-authorship network.

Table 2 The most active links between authors

Full size table

The network has a high assortativity coefficient of 0.53 that suggests that nodes tend to be connected to other nodes with similar degrees. The co-authorship network is highly clustered with a global clustering coefficient of 0.97 and an average local clustering coefficient of 0.8. The fact that the average shortest path length in the largest connected component is 6.6 also supports the small-world nature of co-authorship networks.

To identify the most central authors of the network science community as seen through the co-authorship network, we calculate centrality measures such as betweenness, harmonic and degree centralities of the nodes. The most central authors are shown in Table 3. We also compare the centrality measures of the authors with the citation count of their network science papers and with their h-indices (restricted only to their network science papers). Common characteristics of the most central authors that they are famous, well-established researchers, moreover, they are typically active in more research areas forming bridges between subdisciplines. The highest betweenness and harmonic centralities correspond to Jürgen Kurths, German physicist and mathematician whose research is mainly concerned with nonlinear physics and complex systems sciences. As we mentioned before, the highest degree corresponds to Roberto Bellotti, a medical physicist. Mark Newman English-American physicist has the highest number of citations on his network science papers (77,418), while Shlomo Havlin, Israeli physicist is ranked first with respect to h-index.

Table 3 The top 12 authors with the highest betweenness centrality. Their ranks with respect to each metric are shown in brackets

Full size table

Figure 14 illustrates the relationship between centrality measures of network scientists and the scientometric indicators of their network science papers. On the left it shows the number of citations against the vertex betweenness centrality, colored by the harmonic centrality; on the right one can see the h-index against the vertex betweenness centrality, colored by the degree. We can conclude that there is a positive correlation between the authors’ central role in the co-authorship network and their scientometric indicators. Figure 13 shows the Spearman’s rank correlation heatmap of the aforementioned measures indicating positive correlations, with the highest positive correlation between citation count and h-index. Considering centrality measures against scientometric indicators, betweenness centrality and h-index has the highest correlation (Figs. 13 and 14).

Network scientists have become more connected as time has gone by, as it is illustrated in Fig. 16, since not only the size of the largest component has increased over the years but also the ratio of the size of the giant component to the size of the entire network, indicating the emergence of a diverse but not divided network science community. The giant component consists of 35,716 nodes that is 63% of the entire network size and it is illustrated in Fig. 15.

Using Clauset-Newman-Moore greedy modularity maximization community detection algorithm [10], we identify the dense subgraphs of the network. To retrieve some important discipline and location-related characteristics of the largest communities, we assigned a research area and a country for each author as the majority of the research areas corresponding to their papers and the most frequent country of their affiliations respectively. The compositions of the ten largest communities are shown in Table 4. The largest community consists of 15,693 authors dominated by Chinese physicists and computer scientists. We can observe that the smaller the communities are, the more homogeneous they are. For example, the vast majority of the third-largest community are North American neuroscientists, moreover, there is a community with 53% EU scientists and 44% environmental scientists.

Table 4 Composition of the largest communities

Full size table

Network scientists come from 118 different countries which shows the international significance of network science. To illustrate the typical patterns of international collaborations, we created an edge-weighted network of countries where edge weights correspond to the number of network science papers that were written in the collaboration of at least one author from both countries (see Fig. 17). We can observe that while China has the highest number of network science papers (see also Fig. 10), US scientists wrote the most articles in international collaboration. It is also apparent that EU countries collaborate with each other a lot.

Similarly to the network of international collaborations, we also created a network of multidisciplinary collaborations illustrating the importance of multidisciplinary research in network science. Figure 18 shows an edge-weighted network of research areas where the edge weights correspond to the number of network science papers that were written in the collaboration of authors whose main research areas are the ones at the endpoints of the edge. The main research area of the authors is not given in the Web of Science, so for each author, we assigned the most frequent research area associated with their papers. We can observe that computer scientists and physicists dominate network science. It is also clear that the collaboration of physicists and network scientists made huge progress in network science. We can conclude that – as far as network science papers are concerned – mathematicians collaborate the most with physicists, while engineers collaborate more with computer scientists. It is not surprising that telecommunication experts usually collaborate with engineers and computer scientists, while mathematical & computational biologists work a lot with biochemists & molecular biologists and computer scientists on network science papers.

6 Conclusion

Two decades ago a new multidisciplinary scientific field was born: network science. In this chapter, we paid tribute to the network science community by investigating the past 20 years of complex network research as seen through the co-authorship network of network scientists. We studied 31,763 network science papers by extracting the distributions of research areas, journals, and keywords. We identified the most important publication venues and topics in network science and shed light on how they changed over time, we also explored the co-occurrence network of the keywords. Moreover, we constructed and extensively analyzed the co-authorship network of 56,646 network scientists, for example investigating its topological properties, namely its community structure, degree, and centrality distributions. We identified the most central authors of network science as seen through the co-authorship network. We also studied the spatiotemporal changes to provide insights on collaboration patterns. We can conclude that both international and interdisciplinary collaborations are on the increase and the network science community is getting more connected. Furthermore, we compared the centrality measures of authors with well-known scientometric indicators (e.g. citation count and h-index) and found a high correlation.

The present study also has its own limitations. Most importantly, our definitions of a network science paper and a network scientist are quite arbitrary but we believe our chosen notions are good proxies for the purpose of this study that is also supported by the distribution of the keywords of the examined papers. Additionally, the data set itself is not consistent due to different naming conventions that we aimed to resolve, furthermore, we cannot distinguish between different scientists with the same name. However, the error introduced by these problems is negligible.

After investigating the publication and collaboration patterns of network science and observing an increasing impact of complex networks, we are convinced that the next 20 years will produce at least as many fruitful scientific collaborations and outstanding discoveries in network science as the last two decades.

References

Barabás, B., Fülöp, O., & Molontay, R. (2019). The co-authorship network and scientific impact of László Lovász. Journal of Combinatorial Mathematics and Combinatorial Computing, 108, 187–192.
MathSciNet MATH Google Scholar
Barabás, B., Fülöp, O., Molontay, R., & Pályi, G. (2017). Impact of the discovery of fluorous biphasic systems on chemistry: A statistical and network analysis. ACS Sustainable Chemistry & Engineering, 5(9), 8108–8118.
Article Google Scholar
Barabási, A. (2019). Twenty years of network science: From structure to control. In APS March Meeting Abstracts (Vol. 2019, pp. S53–001).
Google Scholar
Barabási, A. L. (2003). Linked: The new science of networks. American Journal of Physics.
Google Scholar
Barabási, A. L., & Albert, R.: Emergence of scaling in random networks. Science, 286(5439), 509–512 (1999)
Article MathSciNet Google Scholar
Barabási, A. L., Jeong, H., Néda, Z., Ravasz, E., Schubert, A., & Vicsek, T. (2002). Evolution of the social network of scientific collaborations. Physica A: Statistical Mechanics and Its Applications, 311(3–4), 590–614.
Article MathSciNet Google Scholar
Barabási, A. L., et al. (2016). Network science. Cambridge: Cambridge University Press.
MATH Google Scholar
Breugelmans, J. G., Roberge, G., Tippett, C., Durning, M., Struck, D. B., & Makanga, M. M. (2018). Scientific impact increases when researchers publish in open access and international collaboration: A bibliometric analysis on poverty-related disease papers. PloS one, 13(9), e0203156.
Article Google Scholar
Choobdar, S., Ahsen, M. E., Crawford, J., Tomasoni, M., Fang, T., Lamparter, D., Lin, J., Hescott, B., Hu, X., Mercer, J., et al. (2019). Assessment of network module identification across complex diseases. Nature Methods, 16(9), 843–852.
Article Google Scholar
Clauset, A., Newman, M. E., & Moore, C. (2004). Finding community structure in very large networks. Physical Review E, 70(6), 066111.
Article Google Scholar
Council, N. R., et al. (2005). Network science committee on network science for future army applications.
Google Scholar
Fortunato, S., Bergstrom, C. T., Börner, K., Evans, J. A., Helbing, D., Milojević, S., Petersen, A. M., Radicchi, F., Sinatra, R., Uzzi, B., et al. (2018). Science of science. Science, 359(6379), eaao0185.
Google Scholar
Girvan, M., & Newman, M. E. (2002). Community structure in social and biological networks. Proceedings of the National Academy of Sciences, 99(12), 7821–7826.
Article MathSciNet Google Scholar
International Organization for Standardization. (2020). Officially assigned ISO 3166-1 alpha-3 codes. https://www.iso.org/obp/ui/.
Kastrin, A., & Hristovski, D. (2019). Disentangling the evolution of medline bibliographic database: A complex network perspective. Journal of Biomedical Informatics, 89, 101–113.
Article Google Scholar
Kocarev, L., & In, V. (2010). Network science: A new paradigm shift. IEEE Network 24(6), 6–9.
Article Google Scholar
Kumar, S. (2015). Co-authorship networks: A review of the literature. Aslib Journal of Information Management, 67(1), 55–73.
Article Google Scholar
Lella, E., Amoroso, N., Lombardi, A., Maggipinto, T., Tangaro, S., Bellotti, R., Initiative, A. D. N. (2018). Communicability disruption in Alzheimer’s disease connectivity networks. Journal of Complex Networks, 7(1), 83–100.
Article Google Scholar
Leonidou, L. C., Katsikeas, C. S., & Coudounaris, D. N. (2010). Five decades of business research into exporting: A bibliographic analysis. Journal of International Management, 16(1), 78–91.
Article Google Scholar
Li, H., An, H., Wang, Y., Huang, J., & Gao, X. (2016). Evolutionary features of academic articles co-keyword network and keywords co-occurrence network: Based on two-mode affiliation network. Physica A: Statistical Mechanics and Its Applications, 450, 657–669.
Article Google Scholar
Molontay, R., & Nagy, M. (2019). Two Decades of Network Science as seen through the co-authorship network of network scientists. In International Conference on Advances in Social Networks Analysis and Mining, ASONAM. IEEE/ACM.
Google Scholar
Nagy, M., & Molontay, R. (2020). Twenty years of network science – Supplementary material. https://github.com/marcessz/Twenty-Years-of-Network-Science.
Google Scholar
Newman, M. (2018). Networks. Oxford: Oxford University Press.
Book Google Scholar
Newman, M. E. (2001). The structure of scientific collaboration networks. Proceedings of the National Academy of Sciences, 98(2), 404–409.
Article MathSciNet Google Scholar
Newman, M. E. (2004). Coauthorship networks and patterns of scientific collaboration. Proceedings of the National Academy of Sciences, 101(suppl 1), 5200–5205.
Article Google Scholar
Newman, M. E. (2006). Finding community structure in networks using the eigenvectors of matrices. Physical Review E, 74(3), 036104.
Article MathSciNet Google Scholar
Newman, M. E., & Girvan, M. (2004). Finding and evaluating community structure in networks. Physical Review E, 69(2), 026113.
Article Google Scholar
Pawar, R. S., Sobhgol, S., Durand, G. C., Pinnecke, M., Broneske, D., & Saake, G. (2019). Codd’s world: Topics and their evolution in the database community publication graph. In Grundlagen von Datenbanken (pp. 74–81).
Google Scholar
Su, H. N., & Lee, P. C. (2010). Mapping knowledge structure by keyword co-occurrence: A first look at journal papers in Technology Foresight. Scientometrics, 85(1), 65–79.
Article Google Scholar
Tálas, A. (2008). Connected: The power of six degrees.
Google Scholar
Uddin, S., Khan, A., & Baur, L. A. (2015). A framework to explore the knowledge structure of multidisciplinary research fields. PloS one, 10(4), e0123537.
Article Google Scholar
Van Eck, N., & Waltman, L. (2009). Software survey: VOSviewer, a computer program for bibliometric mapping. Scientometrics, 84(2), 523–538.
Google Scholar
Vespignani, A. (2018). Twenty years of network science. Nature, 558, 528–529.
Article Google Scholar
Watts, D. J. (2004). Six degrees: The science of a connected age. W. W. Norton & Company is based in New York.
Google Scholar
Watts, D. J., & Strogatz, S. H. (1998). Collective dynamics of “small-world” networks. Nature, 393(6684), 440.
Article Google Scholar
Xia, F., Wang, W., Bekele, T. M., & Liu, H. (2017). Big scholarly data: A survey. IEEE Transactions on Big Data, 3(1), 18–35.
Article Google Scholar
Yan, E., & Ding, Y. (2014). Scholarly networks analysis. In Encyclopedia of Social Network Analysis and Mining (pp. 1643–1651). New York: Springer.
Chapter Google Scholar

Download references

Acknowledgements

We thank the anonymous reviewers for their observations and comment. The research reported in this chapter and carried out at the Budapest University of Technology and Economics has been supported by the National Research Development and Innovation Fund based on the charter of bolster issued by the National Research Development and Innovation Office under the auspices of the Ministry for Innovation and Technology. The research of Roland Molontay was partially supported by the NKFIH K123782 research grant.

Author information

Authors and Affiliations

Department of Stochastics, Budapest University of Technology and Economics, Budapest, Hungary
Roland Molontay & Marcell Nagy
MTA–BME Stochastics Research Group, Budapest, Hungary
Roland Molontay

Authors

Roland Molontay
View author publications
You can also search for this author in PubMed Google Scholar
Marcell Nagy
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Roland Molontay .

Editor information

Editors and Affiliations

Bilkent yerleşkesi, Turkish Ministry of Health, Çankaya, Ankara, Turkey
Mehmet Çakırtaş
Computer Engineering, Istanbul Medipol University, Istanbul, Turkey
Mehmet Kemal Ozdemir

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Molontay, R., Nagy, M. (2021). Twenty Years of Network Science: A Bibliographic and Co-authorship Network Analysis. In: Çakırtaş, M., Ozdemir, M.K. (eds) Big Data and Social Media Analytics. Lecture Notes in Social Networks. Springer, Cham. https://doi.org/10.1007/978-3-030-67044-3_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-67044-3_1
Published: 19 January 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-67043-6
Online ISBN: 978-3-030-67044-3
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Twenty Years of Network Science: A Bibliographic and Co-authorship Network Analysis

Abstract

Similar content being viewed by others

Network Analysis and Indicators

A new network model for the study of scientific collaborations: Romanian computer science and mathematics co-authorship networks

Analysis of Significance and Evolution of Co-authorship Networks

Keywords

1 Introduction

2 Scholarly Networks Analysis

3 Preliminaries and Data

3.1 Co-authorship Network of Network Scientists

3.2 Glossary

Definition 1 (Graph)

Definition 2 (Complex network)

Definition 3 (Average path length)

Definition 4 (Small-world property)

Definition 5 (Degree distribution)

Definition 6 (Scale-free property)

Definition 7 (Assortativity coefficient)

Definition 8 (Local clustering coefficient)

Definition 9 (Global clustering coefficient)

Definition 10 (Betweenness centrality)

Definition 11 (h-index)

Definition 12 (Harmonic centrality)

3.3 Data Collection and Preparation

4 Analysis of Network Science Papers

5 Analysis of the Co-authorship Network

6 Conclusion

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this chapter

Cite this chapter

Download citation

Share this chapter

Publish with us

Search

Navigation