Introduction

According to the latest indicators referred to the European Union, although women’s entry into and progression within science is progressing towards a more equal representation, they are still under-represented in leading positions (European Commission 2013). As a consequence, numerous reports and initiatives have emerged in Europe but also in different parts of the world with the aim of analyzing the presence of women in science and higher education (WSHE), and to advocate for a more gender-balanced structure of science (LERU 2012; Deloitte Consulting 2013; European Commission 2013; the Organization for Economic Co-operation and Development [OECD] 2013).

The scientific literature contains ample evidence regarding different aspects of women’s participation and performance in science and higher education. A significant body of studies has examined gender disparities in publication productivity, addressing issues such as the number of publications, citations, impact of researchers’ output, and patterns of collaboration. In terms of the number of publications many studies have demonstrated that female academics publish less, on average, than do their male colleagues (Braisher et al. 2005; Kelly and Jennions 2006; Symonds et al. 2006; Sidhu et al. 2009; Jagsi et al. 2011; Schrager et al. 2011). However, there is also research reporting no significant differences in productivity between the two sexes (Xie and Shauman 1998; Lewison 2001; Bordons et al. 2003; Tower et al. 2007; Mauleón et al. 2008), especially when it comes to younger generations of researchers (Arensbergen et al. 2012). Some of these studies have also evaluated the impact of publications, through the number of citations or the journal impact factor. The literature in this area has shown mixed results, including no differences in the citation patterns of male and female academics (Cole and Zuckerman 1984; Lewison 2001; Ledin et al. 2007; Mauleón et al. 2008; Copenheaver et al. 2010), a higher number of citations for female-authored papers (Long 1992; Symonds et al. 2006; Borrego et al. 2010), and fewer citations of papers authored by women (Hunter and Leahey 2010; Larivière et al. 2011, 2013a, b). Recently, the global, cross-disciplinary bibliometric study undertaken by Larivière et al. (2013a, b) showed that papers with female authors in key positions (sole authorship, first- and last-authorship) are cited less than those with males in key positions.

Studies on the journal impact factor have also produced mixed findings: while some of them have highlighted the similarity of the journals in which women and men publish (Lewison 2001; Bordons et al. 2003; Mauleón and Bordons 2006; Gonzalez-Brambila and Veloso 2007; Mauleón et al. 2008), others have shown that men choose to publish in journals with a higher impact factor (Hunter and Leahey 2010), or alternatively that women are the ones who tend to publish in higher impact journals (Borrego et al. 2010). Another approach to the study of scientific activity concerns the collaborative practices of researchers. Here, there is ample evidence showing that women collaborate to a lesser extent than do men with foreign authors (Lemoine 1992; Lewison 2001; Webster 2001; Larivière et al. 2011, 2013a, b). However, the results on authorship patterns have proved inconclusive: whilst some research has obtained similar rates of co-authorship among women and men (Maske et al. 2003; McDowell et al. 2006), other studies have found significantly lower rates of co-authorship for women (Boschini and Sjögren 2007; West et al. 2013; Larivière et al. 2013a, b). Another sizeable body of research on WSHE has addressed elements that have an influence on the development of a scientific career, such as grant and manuscript reviewing, access to funding and hiring and career progression. With regard to manuscript reviewing, Budden et al. (2008) reported that the acceptance rate for female first-authored manuscripts increased after the onset of blind review, suggesting that when reviewers were aware of the authors’ sex they were less likely to accept manuscripts from women. However, further work on this issue has found no differences in the acceptance/rejection ratio for papers submitted by male and female corresponding authors (Aarssen et al. 2008; Whittaker 2008). The situation is different in relation to the participation of women in peer review processes, where many more authors have found gender differences in female recruitment as editors and reviewers (Gilbert et al. 1994; Dickersin et al. 1998; Davo et al. 2003). Concerning grant peer reviewing, one of the most frequently cited studies on gender bias, that carried out by Wennerås and Wold (1997), demonstrated that female applicants for postdoctoral fellowships at the Swedish Medical Research Council had to be 2.5 times more productive than the average male applicant in order to obtain the same peer-review rating for scientific competence. Since then, an ever-growing body of academic research has found no conclusive evidence of sex discrimination in the awarding of specific project grants (Wellcome Trust 1997; Ward and Donnelly 1998; Bornmann et al. 2007; Marsh et al. 2008). In this regard, the meta-analyses conducted by Bornmann et al. (2007) and Marsh et al. (2009), and more recently the study by Mutz et al. (2012), have all concluded that there is negligible evidence of gender bias in grant awarding programs. There is also a body of academic literature attesting to the idea that male scientists face fewer difficulties when seeking to access financial support and better facilities (Stack 2004; Larivière et al. 2011; LERU 2012), since they occupy a greater proportion of high positions from which it is possible to apply for and receive larger grants (Blake and La Valle 2000; Waisbren et al. 2008; Ranga et al. 2012). Other aspects that have been clearly highlighted in the literature are the overrepresentation of women in lower faculty ranks (D’Amico et al. 2011), due to the difficulties of progressing up the academic ladder (LERU 2012), a persistent gender pay gap (Ward 2001; Ginther 2003; Henderson et al. 2014), and gender bias in researcher recruitment (Andersen 2001), especially in certain male-dominated areas (Isaac et al. 2009). Recent research suggests that many mechanisms prevalent in recruitment and appointment practices of professors result in disadvantages for women and privileges for men (Van den Brink 2011). These mechanisms include “gate keeping”, academic networks that are male-dominated and the way scientific excellence is defined (LERU 2012). Based on an empirical study of professorial appointments in the Netherlands, Van den Brink and Benschop (2012) show how gender bias is practised in the evaluation of professorial candidates, resulting in substantial inequalities in the construction of excellence.

As the above-mentioned evidence suggests, different studies have reported diverse results. Hence, many of these studies have sought to explain the systematic disparities between men and women by incorporating different factors into their analysis. Factors surrounding family formation and childrearing have been one of the major causes of female underrepresentation in academia evidenced by the literature (Sax et al. 2002; Stack 2004; Fox 2005; Ginther and Kahn 2006; Prozesky 2008; Hunter and Leahey 2010). Another large body of literature has focused on structural factors such as the greater presence of women in less research-oriented institutions (Allison and Long 1990; Xie and Shauman 1998), the higher teaching and service load among females (Taylor et al. 2006; Snell et al. 2009; DesRoches et al. 2010), their lower degree of specialization (Leahey 2006) and of academic status and rank (Leta and Lewison 2003; Tower et al. 2007; Puuska 2010; Pashkova et al. 2013), their difficulties in accessing funding (Xie and Shauman 1998), the low percentage of women on selection committees (European Commission 2009; Zinovyeva and Bagues 2011), or the academic assessment systems that have traditionally ignored factors that especially affect women (LERU 2012). From a psychological perspective, gender differences have been explained by women’s lower levels of career orientation, ambition, and aggressiveness (Sonnert 1996). Similarly, career preferences, ability, and biological differences have been key factors proposed to explain female underrepresentation in science, technology, engineering, and mathematics (STEM) disciplines. Finally, there is also a body of academic research attesting to overt or unconscious gender bias as the main factor against retention and progression of female scientists in academia (Dewandre 2002; Moss-Racusin et al. 2012; Shen 2013).

Despite the relevance of the issue and the number of academic publications, initiatives, and reports on WSHE, no systematic analysis has yet to be carried out of the large body of research in this area. Using standard bibliometric indicators (such as the number of papers and authors, and productivity by country, among others), laws (Price’s, Lotka’s, and Bradford’s laws) and an index for measuring gender disparity [the Gender Inequality Index (GII)], this article aims to assess the development and growth of research in this field by reviewing the related scientific literature.

Methods

Data collection

The data were extracted from the Thomson Reuters Web of Science (WOS) in February 2013, with the search in the topic field covering the period from 1991 to 2012. Several strategies were used to retrieve the related scientific literature (Fig. 1). First, three different searches were carried out, taking into account the three main topics in the field: publication productivity, issues related to gender in academia and science, and factors related to gender bias.

Fig. 1
figure 1

Flowchart illustrating the process of data collection

To do so, for each topic we went through the related literature in order to identify the related key terms. A preliminary combination of key terms was used to extract the papers related to the subject. Next, after reviewing the keywords of these preliminary papers, we added more specific terms to the query in order to check whether these new terms increased the number of records retrieved; if they did, they were included in the query, and if not, they were eliminated.

By publication productivity we refer to those papers dealing with scientific productivity, citation, and collaboration patterns. Papers addressing elements other than publication productivity that have an influence on the development of a scientific career, such as interviewing and hiring, salaries, promotion and advancement, having access to funding, mentoring and networking, and being a member of an editorial board or a peer reviewer were categorized in the second group, labeled issues related to gender in academia and science. Finally, given that the performance of researchers in the two above-mentioned groups of scientific activities can be affected by several factors, another pool of papers were considered as a third group called factors related to gender bias. Among these factors we took into account family-related issues (e.g. marital status, childrearing activities, home stress, family formation, work-home balance, etc.), structural or institutional variables (e.g. type of institution, research support, academic system, etc.), professional issues (e.g. job stress, motivation, rank, research orientation, etc.), biological factors (e.g. age, race, etc.), psychological traits (e.g. self-esteem, ambition, aggressiveness, etc.), social and cultural factors (e.g. stereotypes, norms and values, etc.) and political variables (policies and legislation regarding WSHE).

As a result of this step, 2082 records were initially retrieved for the first topic, 12,770 for the second and 3545 for the third. In order to identify the papers related to these three main topics the titles and abstracts of each paper were checked. Papers which focused on the field of higher education and science, dealt with one or more of above three topics mentioned, and in which gender was one of the variables of interest, were included in the sample.

Finally, a total of 1746 papers were retrieved: 416 (23.83 %) records for the publication productivity topic, 651 (37.29 %) records for the issues related to gender in academia and science topic, and 679 (38.88 %) records for the factors related to gender bias topic. After elimination of duplicates a total of 1225 records were considered. Additionally, and in order to ensure that all the references dealing with the subject were included in the database, the references of the most recent papers were checked. If any new paper was found it was added to the database, and we continued doing this until no new references were identified. This process led to the inclusion of 190 new papers. As a result the final corpus comprised 1415 articles and reviews, all of which were coded according to the three headings mentioned above. It should be noted that any given paper may simultaneously address more than one issue, and it will therefore belong to more than one group.

Data analysis

The main bibliometric laws were applied to study scientific growth over time, the dispersion of scientific output across journals, and author productivity.

Scientific growth over time was assessed using Price’s law (Price 1963), which proposes that the growth of scientific production follows an exponential function. In order to test whether our data followed Price’s law, different regression models were fitted, including linear, exponential and logistic curves, the latter being applied to assess the hypothesis of literature growth saturation.

Bradford’s law (Bradford 1934, 1948) was applied in order to study the dispersion of the literature. Specifically, Bradford’s law describes how the articles in a specific area are scattered across journals, postulating a model of concentric productivity zones with a decreasing information density. Following the proposal of Egghe (1986, 1990), the Bradford multiplier was obtained by \(k = (1.781 \times y_{\text{m}} )^{1/P}\) where y m is the number of articles published by the most productive journal and P is the number of zones including the core. The estimated k value for each zone was calculated by the ratio between the number of journals in a given zone and the number of journals in any immediate zone. The number of Bradford zones was determined by the solution that minimized the difference between the Bradford multiplier k and each estimated value of k, and between the estimated values of k.

In addition, the predicted frequencies were fitted according to Leimkuhler’s formulation (Leimkuhler 1967), obtaining the constants as \(A = y_{0} /\log_{e} k\) and \(B = (k - 1)/r_{0}\) where y 0 is the constant number of articles in each group (\(y_{0 = a/P}\), where a is the total number of articles and both P and k are as defined above) and r 0 is the expected number of journals in the core (\(r_{0} = \frac{T(K - 1)}{{k^{p} - 1}},\) where T is the total number of journals, and k and P are as defined above). The estimated cumulative number of articles produced by the journals of rank \(1,2, \ldots ,r\) was obtained by: \(R(r) = A \times \log_{e} (1 + B \times r).\)

After standardization of authors’ names (we began with a total of 3538 authors and after standardization we ended with a total of 3064 single authors), Lotka’s law was also applied using the method proposed by Pao (1985), including both the first author and co-authors in the analysis. According to Lotka’s law the number of authors (y x ) with x number of articles is inversely proportional to x. This relationship is expressed by the formula:

$$x^{n} \times y_{x} = C,$$

where y x is the number of authors producing x number of articles in a given research field, and C and n are constants that can be calculated from the observed data set.

In our study, we used the least-squares method in order to calculate the n exponent, using the following formula:

$$N = \frac{{N\sum {XY - \sum {X\sum Y } } }}{{N\mathop {\sum {X^{2} - \left( {\sum X } \right)^{2} } }\nolimits }}$$

where N is the number of pairs considered, X is the logarithm of x and Y is the logarithm of y x . The constant C is calculated using the formula:

$$C = \frac{1}{{\sum {\left( {1/x^{n } } \right)} }}$$

In addition, and in order to verify that the observed data fitted the estimated distribution, the non-parametric Kolmogorov–Smirnov goodness-of-fit test was applied.

In order to calculate relative measures of productivity and international collaboration productivity by country, the Relative Intensity Index (RII) (Larivière et al. 2013a, b) and the International Collaboration Relative Intensity Index (IC-RII) were calculated for each country. Both measures indicate the relative proportion of publications of a given country in a domain relative to the proportion of the world in the same domain. Thus, the RII indicates the relative proportion of publications of a given country in the WSHE area relative to the proportion of the world in the same domain. The RII was calculated for each country using the following formula:

$${\text{RII}} = \frac{{\frac{{n_{{{\text{WSHE}}\,{\text{country}}}} }}{{n_{\text{country}} }}}}{{\frac{{n_{{{\text{WSHE}}\,{\text{world}}}} }}{{n_{\text{world}} }}}}$$

where \(n_{{{\text{WSHE}}\,{\text{country}}}}\) is the number of publications of a given country in the WSHE area, n country is the total number of publications of the country in all disciplines, n WSHE world is the total publications of the world dealing with WSHE, and n world is the total of publications in all disciplines in the world.

The IC-RII indicates the relative proportion of publications of a country in the WSHE area in international collaboration relative to the proportion of the world’s publications published through international collaboration in the same area. The IC-RII is expressed by the formula:

$${\text{IC}}{\hbox{-}}{\text{RII}} = \frac{{\frac{{n_{{{\text{IC}}\_{\text{WSHE}}{\kern 1pt} {\text{country}}}} }}{{n_{{{\text{IC}}\_{\text{country}}}} }}}}{{\frac{{n_{{{\text{IC}}\_{\text{WSHE}}{\kern 1pt} {\text{world}}}} }}{{n_{{{\text{IC}}\_{\text{world}}}} }}}}$$

where all terms have the same meaning as above except this time publications include only those carried out through international collaboration.

In both cases a value above 1 means that an observed country publishes more in the domain than would be expected, while an index value below 1 indicates the opposite.

It is worth saying that by “World” in our study we mean just those countries which contributed to the publication productivity in WSHE, while by the number of papers we mean the number of scientific papers by authors from a given country, as indicated in the address field. To do so, we used the full counting method. In other words, each paper is counted once for each country listed in the address field. For example if a paper is authored by two researchers from Germany and one from Spain, this paper is counted once for each country.

Finally, with the aim of exploring the existence of a possible relationship between the interest of different countries in that topic and the existence of a broad spectrum of gender inequalities in these countries, the most recent GII was considered. GII is a new index for measuring gender disparity that was introduced in the 2010 Human Development Report (20th anniversary edition) of the United Nations Development Program (UNDP). According to the UNDP (2013), this index is a composite measure which yields insights, within the same country, about gender gaps in major areas of human development. It uses three dimensions to do so: reproductive health, measured by maternal mortality ratio and adolescent birth rates; empowerment, measured by proportion of parliamentary seats occupied by females and proportion of adult females and males aged 25 years and older with at least some secondary education; and labour market participation measured by employment rates of female and male populations aged 15 years and older. The GII values range between 0 to 1, where a value close to 0 means equality between genders and a value close to 1 means inequality.

Results

Number of papers

Data showed an upward trend in the percentage of publications, 52.29 % of which appeared in the last 7 years (2006–2012). The linear, exponential, and logistic regression models were fitted in order to test whether the data followed Price’s law. Although all three regression models were statistically significant and captured a high proportion of the explained variance, the exponential model (R 2 = .834) explained a higher proportion of the variance than did either the linear (R 2 = .707) or logistic (R 2 = .578) approaches, showing a good fit to Price’s law (Fig. 2).

Fig. 2
figure 2

Growth of literature over time (1991–2012): linear, exponential and logistic regression model fit to Price’s law

An upward trend was also observed in the three main topics of the field. Table 1 shows the frequency, percentage, and regression fit for the three topics. As can be seen from the table, the research line with the highest frequency of papers was issues related to gender in academia and science. Furthermore, all the topics showed a good fit to the exponential model.

Table 1 Growth of literature over time (1991–2012) by topic: publication frequency and regression fit

Research areas

According to the classification of journal areas used in Thomson Reuters WOS, the top ten research areas in terms of frequency of papers were Education and Educational Research (n = 71, 18.3 %), Psychology (n = 237, 11.7 %), Information Science and Library Science (n = 212, 10.5 %), Business and Economics (n = 167, 8.2 %), Women’s Studies (n = 165, 8.1 %), Computer Science (n = 95, 4.7 %), Sociology (n = 93, 4.6 %), General and Internal Medicine (n = 63, 3.1 %), Health Care Sciences and Services (n = 58, 2.9 %), and, finally, Science and TechnologyOther Topics (n = 53, 2.6 %).

Figure 3 shows the contribution of these top ten research areas in the three topics. Information Science and Library Science was the research area with the highest number of publications (n = 90, 30.41 %) in topic 1 followed by Computer Science (n = 48, 20.61 %). Education and Educational Research and Psychology were the research areas with the highest number of publications in topics 2 and 3. Education and Educational Research corresponds respectively to 27.03 % (n = 276) of publications in topic 2 and 28.45 % (n = 132) in topic 3, while Psychology corresponds respectively to 14.99 % (n = 153) of publications in topic 2 and 20.26 % (n = 94) in topic 3.

Fig. 3
figure 3

Percentage of papers for the top ten research areas in the three topics. 1 Business and Economics, 2 Computer Science, 3 Education and Educational Research, 4 General and Internal Medicine, 5 Health Care and Services, 6 Information Science and Library Science, 7 Psychology, 8 Science and Technology—Other Topics, 9 Sociology, 10 Women’s Studies

Number of authors

The total number of authors who contributed to the output set was 3064, with the mean number of authors per article being 2.41 (SD = 1.60, M = 2). The data showed that 477 (33.71 %) of the papers had a single author, 398 (28.12 %) had two, 266 (18.79 %) had three, and 274 (19.36 %) had more than three authors. It can be seen in Fig. 4 that the number of papers with a single author showed a decreasing trend, whereas the number of papers with three authors and with more than three authors increased slightly over time. Over the same period there was a fluctuation in the number of papers with two authors.

Fig. 4
figure 4

Changes over time (1991–2012) in the number of authors per paper

With regard to the productivity of authors the data showed that 2750 (89.75 %) of them contributed with just one item to the field. Lotka’s law was applied using the method proposed by Pao (1985), and all the authors of publications (first authors and collaborators) were considered for the analysis. To determine whether the data fitted Lotka’s law, the n value was calculated using the least squares method (n = 3.40), yielding a C value of 0.88. The critical value obtained by the non-parametric Kolmogorov–Smirnov goodness-of-fit test was 0.029. As the maximum difference between the observed and the estimated accumulated frequencies was 0.014, which is below the critical value, we can conclude that the data fitted Lotka’s law.

Countries

Sixty-seven countries participated in publishing the set of studies, although it should be noted that the provenance of authors was not available for 204 papers (14.4 %). Only 8 % (n = 113) of papers involved international collaboration and of these, 22.12 % (n = 25) were published recently (in 2012). The top ten countries in terms of absolute contributions, RII, and IC-RII are shown in descending order in Table 2. While the United States and United Kingdom are the most productive countries in terms of absolute contributions, Nicaragua and Botswana, based on the RII, are the countries that showed the highest proportion of publications in WSHE relative to the proportion of the world in the same area. Taking into account just those papers published in international collaboration, based on the IC-RII, Sudan, Nepal and Ghana are the countries with the highest international collaboration productivity.

Table 2 Top ten countries based on absolute contributions, relative intensity index (RII), and international collaboration relative intensity index (IC-RII)

Interestingly, a significant positive correlation coefficient was obtained between the most recent GII and the RII (r = .277, p = .029) and the IC-RII (r = .497, p = .001), showing that countries that present higher gender inequalities present higher relative productivity related to the topic as well as higher levels of international collaboration productivity. As the GII was not available for five countries, only 62 countries were considered in the analysis of the above-mentioned correlations.

Regarding the topics, Issues related to gender in academia and science (topic 2) was the most frequent among the top ten countries in terms of absolute frequency. Among the papers involving international collaboration, 56.64 % (n = 64) of them dealt with the topic of issues related to gender in academia and science, 30.08 % (n = 34) addressed factors related to gender bias, and 28.31 % (n = 32) examined publication productivity. As any given paper may simultaneously address more than one topic, the sum of papers is more than the total number of papers, and the sum of percentages exceeds 100 %.

Journals

The papers included in the present study were published in a total of 595 journals, of which 366 (61.5 %) published only one paper. The distribution of papers published in the set of journals was described using Bradford’s law, which revealed that the papers were distributed in four zones. The core was composed of 13 journals. Table 3 shows the expected number of journals given the Bradford multiplier (3.17), the actual number of journals in each zone, the number of articles included in each zone, the cumulative number of articles, the estimated values of k, and, finally, the predicted cumulative number of articles R(r).

Table 3 Data fit to Bradford’s law

Table 4 shows core journals, publication frequency and the corresponding research areas of each journal based on the Thomson Reuters WOS. As can be seen from the table, six of the core journals belong to the area of Education and Educational Research, and Scientometrics is the journal with the highest number of papers related to topic 1, while the rest of the journals published more papers related to topic 2.

Table 4 Core journals, the corresponding research area, total publication frequency and publication frequency in each topic (1991–2012)

Discussion

This study has analyzed the main bibliometric indicators in relation to the literature on women’s participation in science and higher education. With regard to the number of publications, results showed a significant increase and interest in the field over the last 21 years, particularly since 2002, when a steady increase begins. This increment was supported by the fit of the data to Price’s law, which indicates that productivity in the studied field shows an exponential growth. Of the three topics considered, namely publication productivity, issues related to gender in academia and science, and factors related to gender bias, the highest number of papers corresponded to the second topic (i.e., issues related to gender in academia and science), accounting for 66.64 % of the total publications. This topic was also the most frequently addressed in papers involving international collaboration (56.64 %). This result may be due to the fact that this topic addresses a wide variety of issues such as a scientific career, having access to funding, mentoring and networking, and being a member of an editorial board or a peer reviewer, among others. Furthermore, these studies often consider the other two topics as well in an attempt to examine existing differences in academia and science by searching for explanatory factors, and they often using bibliometric indicators such as publication productivity.

Regarding the research areas showing the most interest in the studied field, it can be inferred that scholars from different disciplines, such as Education and Educational Research, Psychology, Information Science and Library Science, Computer Science, Business and Economics, and Women’s Studies, among others, have all contributed to the field, thereby indicating a high degree of multidisciplinarity. However, the interest of these areas in the three different topics is quite unbalanced. Not surprisingly, Information Science and Library Science and Computer Science were the areas with the highest contributions in the third topic, publication productivity, while Education and Educational Research and Psychology were the areas with the highest weight in the topics “issues related to gender in academia and science”, and “factors related to gender bias”.

In addition, the result of Bradford’s law identified a small set of core journals focused on the studied field, which were also from eleven different research areas. Ordered according to frequency these core journals correspond to the following areas: Computer Science; Information Science and Library Science; Psychology; Women’s Studies; Education and Educational Research; Health Care Sciences and Services; Business and Economics; Public, Environmental, and Occupational Health; General and Internal Medicine; Obstetrics and Gynecology; and Psychiatry. This means that this field is widely scattered across journals of different disciplines and is regarded as relevant in many research areas. However, it should be noted that the most productive research area is Education and Educational Research, accounting for 6 of the 13 (46.15 %) core journals identified by Bradford’s law.

With respect to the number of authors, a large number of papers (477; 33.71 %) are single author papers, suggesting that the scientific community in the field is composed of small groups and individual authors. Furthermore, author productivity was found to fit Lotka’s law, indicating that there are a few, highly productive authors and a great majority who only contribute occasionally to research on WSHE. However, analysis of how the number of authors has evolved over time showed an increasing trend in terms of the number of papers being signed by three authors and by more than three authors, a trend that was paralleled by a clear reduction in the percentage of single-author papers. On the one hand, this may reflect the constitution of research groups, which is a feature associated with the consolidation of a field. On the other hand, and as has already been mentioned in other studies, this trend may be due to academic factors such as the increasing pressure to publish, the specialization of research expertise, collaborative efforts, or the interdisciplinarity of this field (Wren et al. 2007; Lozano 2013).

Authors contributing to the articles came from 67 different countries. It should be noted, however, that the increase in the number of authors and the high number of countries that contributed to the field have not been accompanied by greater international collaboration; only 8 % of papers involved this kind of joint work, and the largest proportion of these were published recently. This indicates a field of study that has yet to become truly international and in which collaboration between countries and institutions needs to be reinforced.

The most productive countries, in absolute terms, are the United States and the United Kingdom, generating around half the total scientific production in this field (54 %). However, when RII, a relative measure to calculate country productivity, were used, Nicaragua and Botswana were the counties that showed the highest productivity in WSHE, relative to the proportion of the world in the same area. Similarly, when IC-RII were used to measure the relative proportion of publications of a country in international collaboration, Sudan, Nepal and Ghana were the countries that showed the highest international collaboration productivity.

With regard to the GII, the results showed a direct relationship between the most recent GII and the RII and IC-RII. This means that countries with higher GII values (higher inequality) are, in relative terms, the most productive in the field and are the most likely to collaborate internationally. Thus, although countries such as the United States and the United Kingdom are the most productive countries in terms of absolute contributions, countries with higher levels of inequality showed higher relative values of productivity in the field. This last finding is consistent with Glänzel’s (2001) study, which analyzed country profiles and compared domestic and internationally co-authored papers. According to his findings, some countries compensate relatively weak domestic activities through international collaboration or even intensify their own strong activities in the preferred science fields by international collaboration. However, as Glänzel (2001) points out, international scientific collaboration has proved to be a complex and heterogeneous phenomenon which cannot be sufficiently characterized by bibliometric indicators alone.

In summary, the present study provides an overview of how scientific output in the field of WSHE has evolved over time. The results show that since 1991 until the present day there has been a clear upward trend in the number of publications, with increasing interest extending to many authors, countries, and scientific journals, as well as to several research areas. These results are consistent with Zosuls et al.’s (2011) conclusion that the growth in the diversity of topics covered by researchers over the past few decades has mainly been due to the multidimensionality of gender, and that the idea that the assumptions made about one domain of gender development can predict all others has come under serious scrutiny. Moreover, leaders in the field have encouraged increasingly precise and clear terminology, more sophisticated methods and analytic techniques, and a greater diversity of topics of study.

Overall, this increasing body of evidence shows that gender inequality remains a global problem, despite substantial initiatives and policies undertaken at national and international level (Larivière et al. 2013a, b). Although progress continues to be made, change is on average slow, patchy, and subject to significant variation according to country, research field, and other factors (LERU 2012). Numerous scholars from different parts of the world continue to address this problem and provide valuable material for evidence-based policies.