Collaboration networks of arab biomedical researchers

Al-Ayyoub, Mahmoud; Alawneh, Esra’a; Jararweh, Yaser; Al-Smadi, Mohammad; Gupta, Brij B.

doi:10.1007/s11042-018-6557-5

Collaboration networks of arab biomedical researchers

Published: 29 August 2018

Volume 78, pages 33435–33455, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Multimedia Tools and Applications Aims and scope Submit manuscript

Collaboration networks of arab biomedical researchers

Download PDF

Mahmoud Al-Ayyoub ORCID: orcid.org/0000-0001-9372-9076¹,
Esra’a Alawneh¹,
Yaser Jararweh¹,
Mohammad Al-Smadi¹ &
…
Brij B. Gupta²

169 Accesses
4 Citations
Explore all metrics

Abstract

Social networks (SN) consist of a set of actors and connections between them. A collaboration network (ColNet) is a special type of SN, in which the actors represent researchers and the link between them indicate that they have co-authored at least one paper. ColNet analysis reveals how researchers interact and behave. A wide range of applications can be based on such studies. The current works on ColNet usually focus on a specific domain/discipline, country/geographical region or time interval. In our study, we focus on one of the understudied regions (the Arab world), and present a novel study on the ColNet of researchers in this region. The domain of interest in our study is biomedicine. We construct, analyze, and study ColNet of biomedical researchers in the Arab world. We divide the region of interest (the Arab world) into four geographical regions and look into the evolution of ColNet of each region separately over time. Our analysis reveals that there is an increase in the number of both authors and publications over time, and that authors tend to work in increasingly larger groups rather than working individually, which is consistent with what is assumed about the nature of research in this field. Our analysis also reveals that a researcher’s productivity is correlated with the amount of change in his/her circle of collaborators over time. For example, researchers working in stable or fixed groups and researchers who have completely different research group every few years are not necessarily the most productive ones.

An extended study of collaboration networks of Levantine biomedical researchers

Article 19 June 2017

Analysis of Academic Research Networks to Find Collaboration Partners

Scientific collaboration of researchers and organizations: a two-level blockmodeling approach

Article 14 September 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

A Social network (SN) consists of a set of actors (persons, organizations, etc.), and the connections between them (friendship, ownership, etc.). It describes the social structure through the interactions among the actors, defines patterns and regulations hidden in data, and reflects the relationships between groups and individuals [13, 18, 24].

A collaboration network (ColNet), also known as a co-authorship network, is a special type of SN, in which the actors are researchers, scholars or authors and the connections between them represent their collaboration in writing papers. Two authors are connected if they authored at least one paper. The study of such networks helps in understanding the formation of the network and the factors affecting them [12, 22, 24, 28, 30].

The current works on ColNet usually focus on a specific domain/discipline, country/geographical region or time interval. In our study, we focus on one of the understudied regions (the Arab world), and present a novel study on the ColNet of researchers in this region. We choose to study the biomedical field because of its importance in addition to being one of the most active research fields in the Arab world in terms of the number of publications, the number of reputable journals, and the amount of research funds invested in it. In this field, the amount of available data is relatively better (larger, more consistent, easier to access, etc.), compared with other fields. This makes it a very good representative of the scientific/ academic/research community in the Arab world.

The problem we are interested in is to build and analyze ColNet of biomedicine researchers in the Arab world. Due to the many challenges associated with it, the research in this area is very limited despite its importance. To the best of our knowledge, the only prior work in this field is [1, 5]. However, [1, 5] focus on a specific part of the Arabic world, which is the Levant region. Here, we consider the entire Arab world. The goal is to answers a lot of questions such as, how does the interaction of an author (in terms of collaborations) affect his/her productivity? What are the patterns that are relevant to the success of certain authors (in terms of the number of publications)? Do the authors prefer to work individually or in groups? Do the authors prefer to collaborate with local authors or international authors?

This paper is arranged as follows. The second section discusses the related work, while the third section discusses the methodology including the measures we use. The next section presents the analysis of the computed measures. Finally, the last section discusses the conclusion and future work.

2 Literature review

The study of ColNet is very old. However, the rapid developments over the past century gave this field a different and new flavor. Below, we focus our discussion on modern ColNet and the studies conducted on them.

Among the earliest works on modern ColNet is Newman’s [24, 26]. In [24], he constructed and studied ColNet for the fields of computer science, biomedicine, and physics. He extracted the authors, addresses, journals, and other information in order to construct the network. He considered the timeline 1995-1999 as a base for his study. He collected many statistics for his network, such as the number of collaborating authors, the number of papers written by an author, and the number of authors per paper. In [26], he expanded his work by constructing such networks in other fields, using bibliographical databases, preprint databases in physics in the same period (1995-1999). Moreover, he studied an extended period (1940-2004) for mathematicians. In his works, he answered many questions about collaboration patterns such as who are the authors who tend to be more collaborative, how many papers does an author write on average, how many authors he/she collaborates with, does the field of search affects the size of the network and the collaboration patterns, and the distance between two scientists on average). As a result, he concluded that biologists tend to be more collaborative than physicists or mathematicians.

In [33], the authors used a database of articles on tourism and hospitality in the period 1991-2010. They proposed a new method in order to evaluate the researchers. The main objectives of this research are to study the collaborative behavior and its effects on the research productivity, discover the characteristics of such networks, and most importantly, find the most important researchers in this field to evaluate them. They considered both levels: macro (network) level, and micro (individual) level. Furthermore, they found that this type of ColNet is less mature and close than other types of networks. Other works in the same field include [6, 15]. In [6], the author discussed patterns of co-authorship and analyzed it using the database of New Zealand and Australian tourism research. In [15], the authors examined the tourism and hospitality research areas. The data collected from four influential and leading hospitality research journals of the time interval of (2001-2005). Their findings showcased the knowledge diffusion patterns and collaborative nature in the hospitality research domain. Moreover, it showed that the social structure affects the acquisition of knowledge.

ColNet of evolutionary computation (EC) researchers was studied in [7]. The data was collected from the Digital Bibliography & Library Project (DBLP) bibliography server. It consisted of more than 610,000 articles and thousands of computer scientists. The authors studied how making conferences, giving grants, etc., provide good hints about central actors in the network, or even building scientific societies. In [32], the studied how the position of an author in the ColNet affects the citation count of his papers. Day et al. [8] investigated the field of intelligence and security informatics (ISI) using social network analysis and visualization to identify main clusters, actors, and main components (subsets).

In [10], the author studied the ColNet in the information retrieval field in the period of 1956-2008. He tried to answer some questions like, do productive researchers tend to collaborate with researchers with the same interest or different interests? The results showed that the productive researchers tend to collaborate with researchers in the same field and interest, and they indirectly collaborate with researchers with different interests.

In [17], the authors examined the research collaborations among the countries of the Association of Southeast Asian Nations (ASEAN) in the economics field for the time period of 1979-2010. They found that the local collaboration between the countries of ASEAN accounted for just 4% of all international collaborations.

In [14], the authors investigated ColNet structured using bibliographic database of papers published in the journals of the Science Citation Index (SCI) in the interval of 1978-2004. They applied social network cluster analysis, co-occurrences analysis, and frequency analysis to explore the center of collaboration of the ColNet, the collaborative fields of the network as whole, and the ColNet microstructure on the scientist’s aspects.

In [19], the authors analyzed the ColNet of ACM, IEEE, and joint ACM/IEEE conferences on digital libraries in the interval 1994-2004. They established some network measures and authorRank to indicate individual author in the ColNet, in addition to standard network measures.

In [4], the authors examined complex and social networks to identify and characterize scientific collaboration process. They used a database of the period 2001-2009 of papers, MS and PhD theses, etc. The results showed the influential researchers, and indicated scale-free degree distribution behavior.

As mentioned earlier, our work is unique in the setting it considers (biomedicine research in the Arab world). We expand our earlier work on a subset of the region of interest [1, 5] and consider the entire Arab world. We follow the analysis techniques and measures popular in the ColNet literature in addition to less common techniques and measures such as the ones we proposed in [1, 5]. This study is far from perfect. We can still explore new techniques such as negative sample mining [23], self-paced learning [21], the use of meta-data [2] or even the linking with other domains such as financial networks [27].

3 Methodology

In this work, we discuss the steps of building a ColNet of biomedical researchers focusing on the Arab world in the time interval of 1991-2010. We confront some challenges and difficulties throughout our work. Let us start by briefly discussing them before getting into our methodology and analysis tools.

3.1 Challenges in network construction

Data collection is the first challenging step of our study. We need find a suitable source of bibliographic data for biomedical research in the Arab world. Unfortunately, these data may contain uncontrolled errors. In our research, we use the PubMed search engine to crawl data due to its comprehensive coverage and ease of use.

The data we collected is restricted in many aspects. The first one is the period of time. We collect the publications in the period of 1991-2010. The main reason for this restriction is the lack of complete data prior to 1991. The second restriction is the geographical region on interest. We consider only the authors affiliated with institutions based in the Arab world. We use a division of the Arab world into the following regions:

Region one: Jordan, Palestine, Lebanon, Syria, and Iraq.
Region two: Bahrain, Kuwait, Oman, Saudi, Qatar, Yemen, and UAE.
Region three: Egypt, and Sudan.
Region four: Libya, Morocco, Tunisia, Mauritania, and Algeria.

The authors affiliated with an institution based in one of these countries is considered as a local author. Other authors are considered as “undetermined.” We also have to deal with problems associated with authors’ identities. E.g., an author may write his/her name in different ways. Sometimes, parts of the name (such as the middle names) are abbreviated causing us to have a “match” between a certain author name and multiple known authors. In our prior work [1, 5], we ignored such authors. However, here, we try to resolve the confusion caused in these cases by matching the authors affiliations. If this step fails, then these author names are added as separate authors for completeness purposes.

3.2 Overview of methodology

The following paragraphs summarize our methodology. We first query PubMed to collect papers published in the period 1991-2010. Then, we divide the papers based on the intervals 1991-1994, 1995-1998, 1999-2002, 2003-2006, and 2007-2010. We pick this period of time to make sure that our data is consistent and to avoid any gaps. Moreover, this period witnessed a significant growth in the scientific fields in the region of interest as evident by the data we collected. So, it is interesting to study.

In the following step, we employ the tool of [1, 5] to extract the authors, their affiliations, and the papers information in order to construct the network. The tool addresses many problem related to authors names, but it does not properly address the “ignored authors” issue resulting from conflicts in authors names. As mentioned earlier, an author may write his/her name in different formats. Moreover, the name might be abbreviated leading to some confusion/conflicts. Table 1 shows different formats of the same author name and the numbering we devised for each format in our earlier work [1, 5]. The numbering system gives larger values to more specific names which have lower possibility of creating confusion. When facing different instances of the same author name using different formats, we group the names by defining parent/child relationship where the parent is the instance with the larger number.

Table 1 Formats for the authors names typically found in PubMed

Collaboration networks of arab biomedical researchers

Abstract

Similar content being viewed by others

An extended study of collaboration networks of Levantine biomedical researchers

Analysis of Academic Research Networks to Find Collaboration Partners

Scientific collaboration of researchers and organizations: a two-level blockmodeling approach

1 Introduction

2 Literature review

3 Methodology

3.1 Challenges in network construction

3.2 Overview of methodology

3.3 Used measures

4 Discussion and analysis

4.1 Topological measures

Network Size

Main Component

Distances

Clustering Coefficient

Density

Average Degree

Degree

Betweenness

4.2 Papers measures

Number of Papers

Papers Authored by Local Authors

Papers Authored by Local or Undetermined Authors

Percentage of Local Authors

Authors per Paper

4.3 Authors measures

Total Local Authors

Papers per Author

Collaborators

Percentage of Local Collaborators

4.4 Stability measures

Collaborators of Local Authors

New Collaborators

Deleted Collaborators

Stability Rate of Collaborators

4.5 Highly productive and influential authors

Top Authors

Top 20 Local authors in Terms of Productivity

5 Conclusion and future work

Notes

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Publisher’s Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation